Stars
A high-quality, one-stop open-source data extraction tool for converting PDF to Markdown and JSON.
Perplexica is an AI-powered search engine. It is an open-source alternative to Perplexity AI.
PyTorch implementation of WaveFit [2022, Google], one of the SOTA lightweight, fast speech vocoders.
Create automatic playlists by using Deep Learning to *listen* to the music.
Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images.
</> htmx - high power tools for HTML
PyTorch Lightning + Hydra. A very user-friendly template for ML experimentation. ⚡🔥⚡
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Port of OpenAI's Whisper model in C/C++
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
🐙 Guides, papers, lectures, notebooks and resources for prompt engineering
🎓 Sharing machine learning course / lecture notes.
LaTeX template for Oxford integrated thesis
SPEAR Challenge scripts and tools.
Tools for handling multimodal data in machine learning projects.
Robust Speech Recognition via Large-Scale Weak Supervision
Code for the paper Hybrid Spectrogram and Waveform Source Separation
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
An educational resource to help anyone learn deep reinforcement learning.
Generative Adversarial Networks implemented in PyTorch and TensorFlow
Try out deep learning models online on Google Colab