Spanish Audio Transcriber

Configuration

No matter which environment you will need to setup your Open AI API Key.

Create a .env file in the project root directory with the following:

OPENAI_API_KEY=<paste your own api key here>
STREAM_KEY_NAME=hello

Or, export as environment variable. i.e. in your terminal:

export OPENAI_API_KEY=<paste your own api key here>
export STREAM_KEY_NAME=hello

Quick Proof Of Concept Setup

See the rtve live stream, translated, and displayed in a web browser.

Bring up the development stack by issuing the docker compose command:

docker compose up

This will allow you to view the translations live at: http://localhost:4567

Either point a local broadcast tool (for example OBS Studio at the endpoint: rtmp://localhost:1935/stream and set the stream key name.

Or, use ffmpeg to pull and direct an example stream (RTVe here) to livetranslation RTMP endpoint:

ffmpeg -analyzeduration 0 -i 'https://rtvelivesrc2.rtve.es/live-origin/24h-hls/bitrate_3.m3u8' -f flv rtmp://localhost:1935/stream/hello

Dependencies:

Docker (https://www.docker.com/get-started/)
Open AI API Key https://platform.openai.com/docs/guides/speech-to-text

How To Run In Codespaces

GitHub allows you to develop in a web-based IDE, that looks like VSCode. From github repo, select the code dropdown, and codespaces. You can then create a new codespace from there. The codespace allows you to develop in the cloud with other humans.

Run The Application Manually

Setup

bundle install

Usage

Start the sinatra web server with: bundle exec ruby app.rb
Start the demo stream with this script: bundle exec ruby start_rtve_translation.rb
Open a web browser at the following address: http://localhost:4567

Dependencies

Valid OPENAI_API_KEY set as an environment variable
ffmpeg installed (required for MP4 to MP3 conversion - not needed if only using MP3/ogg/wav files)

Transcribing Static Audio

Place audio files in the /audio directory
Receive Spanish transcription and English translation in /text directory

Running the static translator

ruby spanish_transcriber.rb

OBS Configuration

Stream Type: Custom Streaming Server
URL: rtmp://localhost:1935/stream
Stream Key: <stream key name>

Features

Processes MP3, OGG, and MP4 files
Converts MP4 to MP3 using ffmpeg (with error handling)
Transcribes audio and translates it to English (default)
Supports batch processing with configurable project name
Ensures already processed files are skipped
Includes logging for debugging

Troubleshooting

Check OpenAI API Key

curl https://api.openai.com/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $OPENAI_API_KEY" \
  -d '{
     "model": "gpt-4",
     "messages": [{"role": "user", "content": "Say this is a test!"}],
     "temperature": 0.7
   }'

Check Transcription

curl --request POST \
  --url https://api.openai.com/v1/audio/transcriptions \
  --header "Authorization: Bearer $OPENAI_API_KEY" \
  --header 'Content-Type: multipart/form-data' \
  --form file=@./audio/amelia.mp3 \
  --form model=whisper-1

Application Deployment

Heroku Deployment

Deploying A Simple Sinatra Helloworld to Heroku

Gemfile needs a ruby version:

ruby '3.2.2' # Other options: 3.2.8 (Heroku, 3.2.x latest), 3.3.7

Gemfile also needs:

gem 'rackup' # added for heroku error gem 'puma' # added for heroku error gem 'sinatra'

Heroku will need to have a Gemfile.lock, so ensure that is committed, after running:

bundle install

A Procfile should exist to run on heroku - it should run the sinatra app:

web: ruby app.rb -o 0.0.0.0 -p $PORT

A config.ru file should exist referencing the same app file:

require './app' run Sinatra::Application

Creating The Heroku Instance

heroku create boquercom

You now need to go into the Heroku Dashboard, and allow the server to run via the dashboard. This is PAID feature. So, you may want to also STOP the server via the dashboard later.

Pushing The Latest Code To Heroku

Do your changes. Commit with message, and push to your own (feature) branch. i.e. git commit -am "description of changes"

Automating The Deployment

It is possible to run the rspec tests either via Github or Heroku, and then deploy.

Trying Alternative Hosts.

Fly.io should be free, but was having issues before I tried Heroku. Probably worth retrying.
Obtaining a AWS mini free instance (Believe free for a year for new customers like Boquercom?) Think, with Docker finished, and a motivated maintainer this should be possible.

Name		Name	Last commit message	Last commit date
Latest commit History 96 Commits
.github/workflows		.github/workflows
.vscode		.vscode
infra		infra
lib		lib
live_audio		live_audio
live_text		live_text
nginx		nginx
spec		spec
views		views
.dockerignore		.dockerignore
.gitignore		.gitignore
.rubocop.yml		.rubocop.yml
Dockerfile		Dockerfile
Dockerfile.alpine		Dockerfile.alpine
Gemfile		Gemfile
Gemfile.lock		Gemfile.lock
LICENSE		LICENSE
Procfile		Procfile
README.md		README.md
app.rb		app.rb
config.ru		config.ru
display_translation.rb		display_translation.rb
docker-compose.yml		docker-compose.yml
live_transcriber.rb		live_transcriber.rb
new_notes.txt		new_notes.txt
notes.txt		notes.txt
spanish_transcriber.rb		spanish_transcriber.rb
start_rtve_translation.rb		start_rtve_translation.rb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Spanish Audio Transcriber

Configuration

Quick Proof Of Concept Setup

How To Run In Codespaces

Run The Application Manually

Setup

Usage

Dependencies

Transcribing Static Audio

Running the static translator

OBS Configuration

Features

Troubleshooting

Check OpenAI API Key

Check Transcription

Application Deployment

Heroku Deployment

Trying Alternative Hosts.

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 4

Uh oh!

Languages

License

alterisian/livetranslator

Folders and files

Latest commit

History

Repository files navigation

Spanish Audio Transcriber

Configuration

Quick Proof Of Concept Setup

How To Run In Codespaces

Run The Application Manually

Setup

Usage

Dependencies

Transcribing Static Audio

Running the static translator

OBS Configuration

Features

Troubleshooting

Check OpenAI API Key

Check Transcription

Application Deployment

Heroku Deployment

Trying Alternative Hosts.

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 4

Uh oh!

Languages

Packages