llm&ai
🦜🔗 Build context-aware reasoning applications
Memory for AI Agents; SOTA in AI Agent Memory; Announcing OpenMemory MCP - local and secure memory management.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
A modular graph-based Retrieval-Augmented Generation (RAG) system
Official inference repo for FLUX.1 models
21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
12 Weeks, 24 Lessons, AI for All!
12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all
10 Weeks, 20 Lessons, Data Science for All!
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.
[AAAI 2025] EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
State-of-the-art 2D and 3D Face Analysis Project
Simple, unified interface to multiple Generative AI providers
PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/MCP/Docker/Zotero
《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程
AI Workshop Project of OceanBase 2024 Product Launch
🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!
High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.
面向开发者的 LLM 入门教程,吴恩达大模型系列课程中文版