Stars
Simple high-throughput inference library
Fine-tuning Moshi/J-Moshi on your own spoken dialogue data
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.
Googles NotebookLM but local
python bindings for symphonia/opus - read various audio formats from python and write opus files
Proof of concept for running moshi/hibiki using webrtc
J-Moshi: A Japanese Full-duplex Spoken Dialogue System
Implementation of the paper "Lossless Compression of Vector IDs for Approximate Nearest Neighbor Search" by Severo et al.
A Fish Speech implementation in Rust, with Candle.rs
A text embedding extension for the Polars Dataframe library.
A Keras like abstraction layer on top of the Rust ML framework candle
Write tensorboard compatible log files from ocaml