Stars
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io/vallex/
A collection of resources on controllable generation with text-to-image diffusion models.
Pinecone + Vercel AI SDK Starter
Implementation of SoundStorm, Efficient Parallel Audio Generation from Google DeepMind, in PyTorch
PICARD - Parsing Incrementally for Constrained Auto-Regressive Decoding from Language Models. PICARD is a ServiceNow Research project that was started at Element AI.
Deriving Spark DataFrame schemas from case classes
A multi-voice TTS system trained with an emphasis on quality
A local-first personal finance app
Document scanner featuring live border detection, perspective correction, image filters and more! 📲📸
MIDI Library for Swift and Objective-C Mac and iOS apps.
Music Notation Library in Swift
OpenAI Gym environments for an open-source quadruped robot (SpotMicro)
A simplified Jira clone built with React/Babel (client) and Node/TypeScript (API). Auto-formatted with Prettier, tested with Cypress.
Dataset of images of trash; Torch-based CNN for garbage image classification
Firefly III: a personal finances manager
Make social simulations on real maps! Agent-based modeling for the web.
DeepMind's Tacotron-2 TensorFlow implementation
High-precision indoor positioning framework, version 3.
A TensorFlow Implementation of DC-TTS: yet another text-to-speech model
Generates cam profiles (.stl) for the mechanical laser show
Facebook AI Research's Automatic Speech Recognition Toolkit
Python programs, usually short, of considerable difficulty, to perfect particular skills.
A fast, customizable and compatible open source server for Minecraft: Java Edition