Stars
Conversion of PDF documents to structured Markdown, optimized for Retrieval Augmented Generation (RAG) and other NLP tasks. Extract text, tables, and images with preserved formatting for enhanced i…
A package for parsing PDFs and analyzing their content using LLMs.
A free and open source, self hosted Ai based live meeting note taker and minutes summary generator that can completely run in your Local device (Mac OS and windows OS Support added. Working on addi…
11 Lessons to Get Started Building AI Agents
Curated list of project-based tutorials
Convert PDF to markdown + JSON quickly with high accuracy
Multisampled Piano implementation using Salamander Grand Piano Sounds
A script that allows you to play .mid files through https://dotpiano.com
Opiniated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: …
🎤 微软语音合成工具,使用 Electron + Vue + ElementPlus + Vite 构建。
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorR…
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
so-vits-svc fork with realtime support, improved interface and more features.
AI subtiltle tool. Translate your subtitle with GPT. 使用chatGPT来翻译你的字幕
A latent text-to-image diffusion model
Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://gpt-docs.h2o.ai/
ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型
A better tensorflow implementation of deepinsight, aiming at smoothly production ready for cross-platforms. Currently only with inference, training code later.
Productive, portable, and performant GPU programming in Python.
A new one shot face swap approach for image and video domains
State-of-the-art 2D and 3D Face Analysis Project
The official Python library for the OpenAI API
WebUI extension for ControlNet
Magenta: Music and Art Generation with Machine Intelligence