For applications that require real-time and low latency voice support, such as
chatbots or agentic interactions, the **Gemini Live API** provides an optimized
way to stream both input and output for a Gemini model. By using Firebase AI
Logic, you can call the Gemini Live API directly from your Android app without
the need for a backend integration. This guide shows you how to use the Gemini
Live API in your Android app with Firebase AI Logic.
| **Note:** Using the Gemini Live API with Firebase AI Logic is in developer preview. Non-backward compatible changes are possible, and it has the following [limitations](https://firebase.google.com/docs/ai-logic/live-api#limitations-and-requirements).

## Get started

Before you begin, make sure your app targets **API level 23 or higher**.

If you haven't already, set up a Firebase project and connect your app to
Firebase. For details, see the [Firebase AI Logic documentation](https://firebase.google.com/docs/ai-logic/get-started#set-up-firebase).

## Set up your Android project

Add the Firebase AI Logic library dependency to your app-level
`build.gradle.kts` or `build.gradle` file. Use the [Firebase Android
BoM](https://firebase.google.com/docs/android/learn-more#bom) to manage library versions.  

    dependencies {
      // Import the Firebase BoM
      implementation(platform("com.google.firebase:firebase-bom:34.3.0"))
      // Add the dependency for the Firebase AI Logic library
      // When using the BoM, you don't specify versions in Firebase library dependencies
      implementation("com.google.firebase:firebase-ai")
    }

After adding the dependency, sync your Android project with Gradle.

## Integrate Firebase AI Logic and initialize a generative model

Add the `RECORD_AUDIO` permission to the `AndroidManifest.xml` file of your
application:  

    <uses-permission android:name="android.permission.RECORD_AUDIO" />

Initialize the Gemini Developer API backend service and access the `LiveModel`.
Use a model that supports the Live API, like
`gemini-live-2.5-flash-preview`. See the Firebase documentation for
[available models](https://firebase.google.com/docs/ai-logic/live-api?api=dev#models-that-support-capability).

To specify a voice, set the [voice name](https://cloud.google.com/text-to-speech/docs/chirp3-hd) within the
`speechConfig` object as part of the [model configuration](https://firebase.google.com/docs/ai-logic/model-parameters#config-gemini-live-api). If
you don't specify a voice, the default is `Puck`.  

### Kotlin

    // Initialize the `LiveModel`
    val model = Firebase.ai(backend = GenerativeBackend.googleAI()).liveModel(
           modelName = "gemini-live-2.5-flash-preview",
           generationConfig = liveGenerationConfig {
              responseModality = ResponseModality.AUDIO
              speechConfig = SpeechConfig(voice = Voice("FENRIR"))
           })

### Java

    // Initialize the `LiveModel`
    LiveGenerativeModel model = FirebaseAI
           .getInstance(GenerativeBackend.googleAI())
           .liveModel(
                  "gemini-live-2.5-flash-preview",
                  new LiveGenerationConfig.Builder()
                         .setResponseModality(ResponseModality.AUDIO)
                         .setSpeechConfig(new SpeechConfig(new Voice("FENRIR"))
                  ).build(),
            null,
            null
    );

You can optionally define a persona or role the model plays by setting a
system instruction:  

### Kotlin

    val systemInstruction = content {
                text("You are a helpful assistant, you main role is [...]")}

    val model = Firebase.ai(backend = GenerativeBackend.googleAI()).liveModel(
           modelName = "gemini-live-2.5-flash-preview",
           generationConfig = liveGenerationConfig {
              responseModality = ResponseModality.AUDIO
              speechConfig = SpeechConfig(voice= Voice("FENRIR"))
           },
           systemInstruction = systemInstruction,
    )

### Java

    Content systemInstruction = new Content.Builder()
           .addText("You are a helpful assistant, you main role is [...]")
           .build();

    LiveGenerativeModel model = FirebaseAI
           .getInstance(GenerativeBackend.googleAI())
           .liveModel(
                  "gemini-live-2.5-flash-preview",
                  new LiveGenerationConfig.Builder()
                         .setResponseModality(ResponseModality.AUDIO)
                         .setSpeechConfig(new SpeechConfig(new Voice("FENRIR"))
                  ).build(),
            tools, // null if you don't want to use function calling
            systemInstruction
    );

You can further specialize the conversation with the model by using system
instructions to provide context specific to your app (for example, user
in-app activity history).

## Initialize a Live API session

Once you create the `LiveModel` instance, call `model.connect()` to
create a `LiveSession` object and establish a persistent connection with the
model with low-latency streaming. `LiveSession` lets you to interact with
the model by starting and stopping the voice session and also sending and
receiving text.

You can then call `startAudioConversation()` to start the conversation with the
model:  

### Kotlin

    val session = model.connect()
    session.startAudioConversation()

### Java

    LiveModelFutures model = LiveModelFutures.from(liveModel);
    ListenableFuture<LiveSession> sessionFuture = model.connect();

    Futures.addCallback(sessionFuture, new FutureCallback<LiveSession>() {
        @Override
        public void onSuccess(LiveSession ses) {
            LiveSessionFutures session = LiveSessionFutures.from(ses);
            session.startAudioConversation();
        }
        @Override
        public void onFailure(Throwable t) {
            // Handle exceptions
        }
    }, executor);

Also, in your conversations with the model, note that it doesn't
handle interruptions. We intend to add this in the future.

You can also use the Gemini Live API to [generate streamed audio from
text](https://firebase.google.com/docs/ai-logic/live-api?api=dev#text-in-audio-out) and [generate text from streamed audio](https://firebase.google.com/docs/ai-logic/live-api?api=dev#audio-in-text-out).
Note that the Live API is bidirectional so you use the same connection to send
and receive content. Eventually, you will also be able to send images and a live
video stream to the model.

## Function calling: connect the Gemini Live API to your app

To go one step further, you can also enable the model to interact directly with
the logic of your app using function calling.

Function calling (or tool calling) is a feature of generative AI implementations
that allows the model to call functions at its own initiative to perform
actions. If the function has an output, the model adds it to its context and
uses it for subsequent generations.
![Diagram illustrating how the Gemini Live API allows a user prompt
to be interpreted by a model, triggering a predefined function with
relevant arguments in an Android app, which then receives a confirmation
response from the model.](https://developer.android.com/static/ai/assets/images/gemini-live-api.svg) **Figure 1:** Diagram illustrating how the Gemini Live API allows a user prompt to be interpreted by a model, triggering a predefined function with relevant arguments in an Android app, which then receives a confirmation response from the model.

To implement function calling in your app, start by creating a
`FunctionDeclaration` object for each function you want to expose to the model.

For example, to expose an `addList` function that appends a string to a list
of strings to Gemini, start by creating a `FunctionDeclaration` variable with a
name and a short description in plain English of the function and its parameter:  

### Kotlin

    val itemList = mutableListOf<String>()

    fun addList(item: String){
       itemList.add(item)
    }

    val addListFunctionDeclaration = FunctionDeclaration(
            name = "addList",
            description = "Function adding an item the list",
            parameters = mapOf("item" to Schema.string("A short string
                describing the item to add to the list"))
            )

### Java

    HashMap<String, Schema> addListParams = new HashMap<String, Schema>(1);

    addListParams.put("item", Schema.str("A short string describing the item
        to add to the list"));

    FunctionDeclaration addListFunctionDeclaration = new FunctionDeclaration(
        "addList",
        "Function adding an item the list",
        addListParams,
        Collections.emptyList()
    );

Then, pass this `FunctionDeclaration` as a `Tool` to the model when you
instantiate it:  

### Kotlin

    val addListTool = Tool.functionDeclarations(listOf(addListFunctionDeclaration))

    val model = Firebase.ai(backend = GenerativeBackend.googleAI()).liveModel(
           modelName = "gemini-live-2.5-flash-preview",
           generationConfig = liveGenerationConfig {
              responseModality = ResponseModality.AUDIO
              speechConfig = SpeechConfig(voice= Voice("FENRIR"))
           },
           systemInstruction = systemInstruction,
           tools = listOf(addListTool)
    )

### Java

    LiveGenerativeModel model = FirebaseAI.getInstance(
        GenerativeBackend.googleAI()).liveModel(
            "gemini-live-2.5-flash-preview",
      new LiveGenerationConfig.Builder()
            .setResponseModalities(ResponseModality.AUDIO)
            .setSpeechConfig(new SpeechConfig(new Voice("FENRIR")))
            .build(),
      List.of(Tool.functionDeclarations(List.of(addListFunctionDeclaration))),
                   null,
                   systemInstruction
            );

Finally, implement a handler function to handle the tool call the model makes
and pass it back the response. This handler function provided to
the `LiveSession` when you call `startAudioConversation`, takes a
`FunctionCallPart` parameter and returns `FunctionResponsePart`:  

### Kotlin

    session.startAudioConversation(::functionCallHandler)

    // ...

    fun functionCallHandler(functionCall: FunctionCallPart): FunctionResponsePart {
        return when (functionCall.name) {
            "addList" -> {
                // Extract function parameter from functionCallPart
                val itemName = functionCall.args["item"]!!.jsonPrimitive.content
                // Call function with parameter
                addList(itemName)
                // Confirm the function call to the model
                val response = JsonObject(
                    mapOf(
                        "success" to JsonPrimitive(true),
                        "message" to JsonPrimitive("Item $itemName added to the todo list")
                    )
                )
                FunctionResponsePart(functionCall.name, response)
            }
            else -> {
                val response = JsonObject(
                    mapOf(
                        "error" to JsonPrimitive("Unknown function: ${functionCall.name}")
                    )
                )
                FunctionResponsePart(functionCall.name, response)
            }
        }
    }

### Java

    Futures.addCallback(sessionFuture, new FutureCallback<LiveSessionFutures>() {

        @RequiresPermission(Manifest.permission.RECORD_AUDIO)
        @Override
        @OptIn(markerClass = PublicPreviewAPI.class)
        public void onSuccess(LiveSessionFutures ses) {
            ses.startAudioConversation(::handleFunctionCallFuture);
        }

        @Override
        public void onFailure(Throwable t) {
            // Handle exceptions
        }
    }, executor);

    // ...

    ListenableFuture<JsonObject> handleFunctionCallFuture = Futures.transform(response, result -> {
        for (FunctionCallPart functionCall : result.getFunctionCalls()) {
            if (functionCall.getName().equals("addList")) {
                Map<String, JsonElement> args = functionCall.getArgs();
                String item =
                        JsonElementKt.getContentOrNull(
                                JsonElementKt.getJsonPrimitive(
                                        locationJsonObject.get("item")));
                return addList(item);
            }
        }
        return null;
    }, Executors.newSingleThreadExecutor());

## Next steps

- Play with the Gemini Live API in the [Android AI catalog sample
  app](https://github.com/android/ai-samples/tree/main/ai-catalog/samples/gemini-live-todo).
- Read more about the Gemini Live API in the [Firebase AI Logic
  documentation](https://firebase.google.com/docs/ai-logic/live-api?api=dev).
- Learn more about the available [Gemini models](https://firebase.google.com/docs/ai-logic/models).
- Learn more about [function calling](https://firebase.google.com/docs/ai-logic/function-calling).
- Explore [prompt design strategies](https://firebase.google.com/docs/ai-logic/prompt-design).