AI Agent Text-to-Speech Framework

A flexible Text-to-Speech agent built with PocketFlow

Overview

This project implements a Text-to-Speech (TTS) agent using the PocketFlow framework. Users can convert text to speech with a choice of voices, save the generated audio to a file, and play it back from the application.

Features

Convert text to speech with multiple voice options
Save generated audio to file
Play audio directly from the application
Extensible node-based architecture

Installation

# Clone the repository
git clone https://github.com/aixiasang/ai-agent-tts.git
cd ai-agent-tts

# Install dependencies
pip install -r requirements.txt

Usage

python main.py

Follow the interactive prompts to:

Enter the text you want to convert to speech
Select a voice option
Generate and save the audio
Play the generated audio
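
The prompt sequence above can be pictured roughly as follows. This is only a sketch of the order of the steps, not the code in main.py: run_session, VOICE_OPTIONS, the synthesize and play callables, and the default output filename are all hypothetical, and the synthesis step is passed in as a parameter so that no particular TTS engine is assumed.

```python
from pathlib import Path
from typing import Callable

# Hypothetical voice menu; the real options are defined by the project.
VOICE_OPTIONS = {"1": "voice-a", "2": "voice-b"}


def run_session(
    ask: Callable[[str], str],
    synthesize: Callable[[str, str], bytes],
    play: Callable[[Path], None],
    output_path: Path = Path("output.audio"),
) -> Path:
    """Walk through the four prompt steps in order and return the saved file."""
    # 1. Enter the text to convert.
    text = ask("Text to speak: ").strip()
    if not text:
        raise ValueError("No text entered")

    # 2. Select a voice, falling back to the first option on unknown input.
    choice = ask(f"Voice {sorted(VOICE_OPTIONS)}: ").strip()
    voice = VOICE_OPTIONS.get(choice, VOICE_OPTIONS["1"])

    # 3. Generate the audio and save it to disk.
    audio = synthesize(text, voice)
    output_path.write_bytes(audio)

    # 4. Play the saved file.
    play(output_path)
    return output_path
```

Passing input as the ask argument would make this interactive; injecting the callables keeps the sequence easy to exercise without a real audio backend.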

Extending the Framework

You can extend this framework by:

Adding new voice options in utils/tts_engine.py
Creating new nodes in nodes.py
Modifying the flow in flow.py
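
As a rough illustration of these extension points, the sketch below uses a small stand-in Node base class that mimics the prep/exec/post lifecycle used by PocketFlow nodes. It deliberately does not import PocketFlow, and VOICES, NormalizeTextNode, SelectVoiceNode, run_pipeline, and the shared-store keys are hypothetical names chosen for the example, not the contents of this repository's files.

```python
# Hypothetical sketch: none of these names are taken from the repository.


class Node:
    """Stand-in base class; PocketFlow supplies its own Node."""

    def prep(self, shared):
        # Read whatever this step needs from the shared store.
        return None

    def exec(self, prep_res):
        # Do the actual work, without touching the shared store.
        return None

    def post(self, shared, prep_res, exec_res):
        # Write results back and name the next action.
        return "default"

    def run(self, shared):
        prep_res = self.prep(shared)
        exec_res = self.exec(prep_res)
        return self.post(shared, prep_res, exec_res)


# Assumed voice registry; adding an entry here stands in for
# "adding new voice options in utils/tts_engine.py".
VOICES = {"1": "voice-a", "2": "voice-b"}


class NormalizeTextNode(Node):
    """A new node: collapse runs of whitespace before synthesis."""

    def prep(self, shared):
        return shared["text"]

    def exec(self, text):
        return " ".join(text.split())

    def post(self, shared, prep_res, exec_res):
        shared["text"] = exec_res
        return "default"


class SelectVoiceNode(Node):
    """Resolve the user's menu choice to a voice name."""

    def prep(self, shared):
        return shared.get("voice_choice", "1")

    def exec(self, choice):
        return VOICES.get(choice, VOICES["1"])

    def post(self, shared, prep_res, exec_res):
        shared["voice"] = exec_res
        return "default"


def run_pipeline(nodes, shared):
    """Stand-in for the wiring in flow.py: run the nodes in list order."""
    for node in nodes:
        node.run(shared)
    return shared
```

In this sketch, modifying the flow amounts to changing the list passed to run_pipeline; with PocketFlow itself the nodes would be connected through the framework's own flow construction instead.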

License

MIT
