liguiyuan / Starred · GitHub
🏠
Working from home
  • Shenzhen


Robust Speech Recognition via Large-Scale Weak Supervision

Python 84,864 10,374 Updated Jun 26, 2025

Multilingual Voice Understanding Model

Python 6,126 545 Updated Jul 4, 2025

Spark-TTS Inference Code

Python 10,030 1,058 Updated Apr 9, 2025

Officially recommended ChatTTS resource collection project, compiling related resources from across the web and frequently asked questions

1,725 101 Updated Jul 3, 2024

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 12,633 1,822 Updated Jul 14, 2025

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

JavaScript 102,540 13,692 Updated Jul 14, 2025

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Python 59,871 5,972 Updated Jul 14, 2025

💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants

Python 20,392 4,826 Updated Jul 14, 2025

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr…

C++ 6,684 768 Updated Jul 14, 2025

FlashMLA: Efficient MLA decoding kernels

Cuda 11,651 876 Updated Apr 29, 2025

Formula recognition based on LaTeX-OCR and ONNXRuntime.

Python 357 35 Updated Nov 3, 2024

Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"

Jupyter Notebook 1,040 48 Updated May 24, 2025

MixTeX: multimodal LaTeX, Chinese-English (ZhEn), and table OCR. It performs efficient CPU-based inference locally and offline on Windows.

Python 1,490 86 Updated Apr 24, 2025

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 11,544 848 Updated May 15, 2025

pix2tex: Using a ViT to convert images of equations into LaTeX code.

Python 14,973 1,206 Updated Jan 18, 2025

Effortless data labeling with AI support from Segment Anything and other awesome models.

Python 5,994 669 Updated Jul 14, 2025

A series of math-specific large language models of our Qwen2 series.

Python 965 136 Updated Jan 11, 2025

WeChat Mini Program development resource collection 💯

48,171 8,812 Updated Feb 20, 2025

Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"

Python 372 15 Updated Jan 19, 2025

A generative speech model for daily dialogue.

Python 37,135 4,021 Updated Jul 6, 2025

YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]

Python 10,846 1,121 Updated Mar 14, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 54,231 6,642 Updated Jul 14, 2025

LLM UI with advanced features, easy setup, and multiple backend support.

Python 44,328 5,698 Updated Jul 11, 2025

LLM inference in C/C++

C++ 82,989 12,344 Updated Jul 14, 2025

Open-Sora: Democratizing Efficient Video Production for All

Python 26,868 2,616 Updated Apr 30, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 22,547 1,521 Updated Jun 26, 2025

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 11,592 1,143 Updated Nov 14, 2024

This is the official code for the MobileSAM project, which makes SAM lightweight for mobile applications and beyond!

Jupyter Notebook 5,268 531 Updated Nov 20, 2024

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

57,369 6,184 Updated Jun 4, 2025