Gemini Fish Voice Assistant

This project implements a voice-controlled robotic fish assistant using Google's Gemini Live API. The fish can listen to voice commands, process them with Gemini, and respond with both voice and physical movements.

Features

Real-time voice interaction using Gemini Live API
Live mouth movement synchronized with audio output
Physical movements (head and tail) through tool calling
Wake word detection based on sound energy
Ambient sound effects during processing

Requirements

Python 3.8+
Google API key with access to Gemini models
ElevenLabs API key (for the action processor)
Arduino-based fish hardware connected via serial

Setup

Create a virtual environment:

python -m venv .venv
source .venv/bin/activate

Install dependencies:
```
pip install -r requirements.txt
```

Set up environment variables in .env:

GOOGLE_API_KEY=your_google_api_key
ELEVENLABS_API_KEY=your_elevenlabs_api_key
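Before starting the assistant, it can help to confirm that both keys are actually visible to the process. The following is a minimal sketch, assuming the values from .env have already been exported into the environment (for example by a dotenv loader); `missing_keys` is a hypothetical helper and not part of this project.

```python
import os

# The two variable names come from the .env example above.
REQUIRED_KEYS = ("GOOGLE_API_KEY", "ELEVENLABS_API_KEY")


def missing_keys(env=None):
    """Return the required key names that are unset or empty.

    `env` defaults to the process environment; passing a dict makes the
    check easy to exercise without touching real credentials.
    """
    env = os.environ if env is None else env
    return [key for key in REQUIRED_KEYS if not env.get(key)]


def describe_keys(env=None):
    """Build a short human-readable status line for the key check."""
    missing = missing_keys(env)
    if missing:
        return "Missing environment variables: " + ", ".join(missing)
    return "All API keys found."
```

Calling `describe_keys()` at startup gives a clear message instead of a failure deep inside an API call.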

Make sure the fish hardware is connected via USB; the Arduino is driven over a serial connection.

Usage

To start the fish assistant:

python gemini_fish.py

The fish waits for a "wake word" (in practice, any sound loud enough to cross an energy threshold) and then:

Play a beep sound and start listening
Process your spoken command using Gemini
Respond with voice output and physical movements
Return to listening mode
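The "loud noise" trigger in the steps above can be sketched as a root-mean-square (RMS) check on each incoming audio chunk. This is an illustration only: it assumes 16-bit little-endian mono PCM input, and the function names and the default threshold of 3000 are hypothetical, not values taken from the project.

```python
import math
import struct


def rms_energy(pcm_bytes):
    """Root-mean-square amplitude of 16-bit little-endian mono PCM audio."""
    count = len(pcm_bytes) // 2
    if count == 0:
        return 0.0
    # Unpack the raw bytes into signed 16-bit samples.
    samples = struct.unpack("<%dh" % count, pcm_bytes[: count * 2])
    return math.sqrt(sum(s * s for s in samples) / count)


def is_wake_event(pcm_bytes, threshold=3000.0):
    """True when a chunk is loud enough to count as the wake trigger.

    The threshold is an assumed value; it would need tuning for the
    microphone and room in use.
    """
    return rms_energy(pcm_bytes) >= threshold
```

Because this reacts to any loud sound, a clap or a slammed door triggers it just as well as speech, which is the trade-off of an energy threshold compared with a trained wake word model.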

Project Structure

gemini_fish.py - Main script with Gemini Live API integration
action_processor.py - Handles audio processing and fish movements
tooling.py - Controls the physical hardware movements
Sound files (beep.wav, microwave_ambient.wav) - Audio cues

How it Works

The system uses:

Gemini's Live API for real-time voice conversation
Function calling to trigger physical movements during speech
Energy-based audio analysis for mouth movement synchronization
Simple energy threshold for wake word detection
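As an illustration of energy-based mouth synchronization, the sketch below turns a stream of per-chunk energy values into open/closed mouth states. It is a guess at the general approach, not the project's code: the function name, both thresholds, and the use of two thresholds (hysteresis) are assumptions.

```python
def mouth_states(energies, open_above=1500.0, close_below=800.0):
    """Yield True (mouth open) or False (closed) for each chunk energy.

    Two thresholds give hysteresis, so the mouth does not flutter when
    the energy hovers around a single cutoff.
    """
    is_open = False
    for energy in energies:
        if not is_open and energy >= open_above:
            is_open = True
        elif is_open and energy <= close_below:
            is_open = False
        yield is_open


# Example: energy rises during a syllable, then falls away.
states = list(mouth_states([0, 2000, 1000, 700, 1200]))
# states == [False, True, True, False, False]
```

Each resulting state would then be sent to the hardware as an open or close command for the mouth motor.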

Extending the Project

You can:

Add more movement tools by extending the tool declarations
Improve wake word detection with a more sophisticated model
Customize the system instruction to change the fish's personality
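Adding a movement tool generally means declaring it to the model and mapping its name to a handler. The sketch below shows one possible shape, assuming the JSON-schema-style function declaration layout described in the Gemini function calling documentation. The tool name `move_tail`, its `times` parameter, and the `dispatch` helper are hypothetical, and the declarations in this project's own files may look different.

```python
# Hypothetical declaration for a new movement tool.
MOVE_TAIL = {
    "name": "move_tail",
    "description": "Flap the fish's tail a given number of times.",
    "parameters": {
        "type": "object",
        "properties": {
            "times": {"type": "integer", "description": "Number of flaps."},
        },
        "required": ["times"],
    },
}

# Declarations are grouped into a tools list for the session config.
TOOLS = [{"function_declarations": [MOVE_TAIL]}]


def move_tail(times):
    """Stand-in handler; a real one would command the hardware."""
    return f"tail flapped {times}x"


# Map each declared tool name to the function that performs it.
HANDLERS = {"move_tail": move_tail}


def dispatch(name, args):
    """Run the handler for a tool call requested by the model."""
    handler = HANDLERS.get(name)
    if handler is None:
        raise KeyError(f"Unknown tool: {name}")
    return handler(**args)
```

When the model issues a tool call, the name and arguments it supplies would be passed to `dispatch`, for example `dispatch("move_tail", {"times": 2})`.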