This repository contains a react-based starter app for using the Multimodal Live API over a websocket. It provides modules for streaming audio playback, recording user media such as from a microphone, webcam or screen capture as well as a unified log view to aid in development of your application.
Watch the demo of the Multimodal Live API here.
To get started, create a free Gemini API key and add it to the .env
file. Then:
$ npm install && npm start
We have provided several example applications on other branches of this repository:
Below is an example of an entire application that will use Google Search grounding and then render graphs using vega-embed:
import { type FunctionDeclaration, SchemaType } from "@google/generative-ai";
import { useEffect, useRef, useState, memo } from "react";
import vegaEmbed from "vega-embed";
import { useLiveAPIContext } from "../../contexts/LiveAPIContext";
export const declaration: FunctionDeclaration = {
name: "render_altair",
description: "Displays an altair graph in json format.",
parameters: {
type: SchemaType.OBJECT,
properties: {
json_graph: {
type: SchemaType.STRING,
description:
"JSON STRING representation of the graph to render. Must be a string, not a json object",
},
},
required: ["json_graph"],
},
};
export function Altair() {
const [jsonString, setJSONString] = useState<string>("");
const { client, setConfig } = useLiveAPIContext();
useEffect(() => {
setConfig({
model: "models/gemini-2.0-flash-exp",
systemInstruction: {
parts: [
{
text: 'You are my helpful assistant. Any time I ask you for a graph call the "render_altair" function I have provided you. Dont ask for additional information just make your best judgement.',
},
],
},
tools: [{ googleSearch: {} }, { functionDeclarations: [declaration] }],
});
}, [setConfig]);
useEffect(() => {
const onToolCall = (toolCall: ToolCall) => {
console.log(`got toolcall`, toolCall);
const fc = toolCall.functionCalls.find(
(fc) => fc.name === declaration.name
);
if (fc) {
const str = (fc.args as any).json_graph;
setJSONString(str);
}
};
client.on("toolcall", onToolCall);
return () => {
client.off("toolcall", onToolCall);
};
}, [client]);
const embedRef = useRef<HTMLDivElement>(null);
useEffect(() => {
if (embedRef.current && jsonString) {
vegaEmbed(embedRef.current, JSON.parse(jsonString));
}
}, [embedRef, jsonString]);
return <div className="vega-embed" ref={embedRef} />;
}
This project was bootstrapped with Create React App. Project consists of:
- an Event-emitting websocket-client to ease communication between the websocket and the front-end
- communication layer for processing audio in and out
- a boilerplate view for starting to build your apps and view logs
In the project directory, you can run:
Runs the app in the development mode.
Open http://localhost:3000 to view it in the browser.
The page will reload if you make edits.
You will also see any lint errors in the console.
Builds the app for production to the build
folder.
It correctly bundles React in production mode and optimizes the build for the best performance.
The build is minified and the filenames include the hashes.
Your app is ready to be deployed!
See the section about deployment for more information.
This is an experiment showcasing the Multimodal Live API, not an official Google product. We’ll do our best to support and maintain this experiment but your mileage may vary. We encourage open sourcing projects as a way of learning from each other. Please respect our and other creators' rights, including copyright and trademark rights when present, when sharing these works and creating derivative work. If you want more info on Google's policy, you can find that here.
Copy the .env.template file to .env and add your Gemini API key.
cp .env.template .env
Run the following commands to run the electron app:
npm install
npm run electron-dev
Note: In case you face issues with node-gyp on mac, follow the instructions below to create a virtual environment and install the dependencies, and then run
export PYTHON="$(pwd)/.venv/bin/python"
npm install
Note: I had to install xcode
to get this to work, since it required the unordered_map
cpp header file. The xcode-select cli tool was not enough.
python3 -m venv .venv && source .venv/bin/activate && python3 -m pip install setuptools
npm run electron-build
-
You need to have an Apple Developer account.
-
You need to install the following certificates:
a. From Apple's Certificate Authority, download the following - Apple Root CA - G2 - Apple Worldwide Developer Relations CA - G2 - Apple Worldwide Developer Relations Certificate Authority - Developer ID Certification Authority b. A developer ID Application certificate from here. You need to generate a Certificate Signing Request (CSR) from your mac to generate the certificate.
-
Create an App Specific Password from here
-
Set the following environment variables:
export APPLE_ID="[email protected]" # Your Apple email
export APPLE_APP_SPECIFIC_PASSWORD="YOUR_APP_SPECIFIC_PASSWORD" # Generate this at appleid.apple.com
export APPLE_ID_PASSWORD="YOUR_APP_SPECIFIC_PASSWORD" # same as above
export APPLE_TEAM_ID="KRY77A2RML" # Your Apple Team ID
-
Add the following to your package.json:
a. In your mac build"mac": { "hardenedRuntime": true, "gatekeeperAssess": false, "entitlements": "electron/entitlements.mac.plist", "entitlementsInherit": "electron/entitlements.mac.plist", "identity": "G-DEV FZ-LLC (KRY77A2RML)", "forceCodeSigning": true }
b. For notarisation,
"afterSign": "electron-builder-notarize"
And add this to your dev dependencies:
npm install electron-builder-notarize --save-dev
-
Run the following command to build the app, it will sign and notarize the app as well:
source .venv/bin/activate && npm run electron-build