Stars
Search video frames with natural language and visualize their camera poses in a 3D point cloud
Letting Claude Code develop its own MCP tools :)
Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think!
Send push notifications to your phone or desktop using PUT/POST
Parallel subtitle videos for opera performances
[arXiv 2025] Official PyTorch implementation of "FramePainter: Endowing Interactive Image Editing with Video Diffusion Priors"
Takes audio (mp3) and text input (string) and force-aligns the text to the audio. Uses stable-ts and whisperx.
Real-time face swap and one-click video deepfake with only a single image
Generative models for conditional audio generation
A library for animating text, developed with SwiftUI. Supports iOS and macOS.
Simple Python library/structure to ablate features in LLMs which are supported by TransformerLens
Low-Rank adapter extraction for fine-tuned transformers models
OMNI: Open-endedness via Models of human Notions of Interestingness
Real-time audio to chords, lyrics, beat, and melody.
A TextTiling-based algorithm for text segmentation (aka topic segmentation) that uses neural sentence encoders, as well as extractive summarization and semantic search applications built on top of it.
Repository for training models for music source separation.
Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vectors
multiscore / SMT
Forked from antoniorv6/SMT
Official implementation of the Sheet Music Transformer
Swift community driven package for OpenAI public API
The official API server for Exllama. OAI compatible, lightweight, and fast.
EC-KitY: A scikit-learn-compatible Python toolkit for evolutionary computation.
[EG 2023] Sketch Video Synthesis
REST: Retrieval-Based Speculative Decoding, NAACL 2024
Early Alpha: Interact with OpenAI's Latest Assistant API through Natural Language.
DeepAFx-ST - Style transfer of audio effects with differentiable signal processing. Please see https://csteinmetz1.github.io/DeepAFx-ST/