liguiyuan / Starred · GitHub

Spark-TTS Inference Code

Python 9,467 988 Updated Apr 9, 2025

Officially recommended ChatTTS resource collection project, gathering related resources from across the web and frequently asked questions.

1,667 98 Updated Jul 3, 2024

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 11,957 1,689 Updated May 20, 2025

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

JavaScript 95,458 12,294 Updated May 20, 2025

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

TypeScript 52,958 5,074 Updated May 20, 2025

💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants

Python 20,157 4,779 Updated May 12, 2025

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS…

C++ 6,035 688 Updated May 20, 2025

FlashMLA: Efficient MLA decoding kernels

Cuda 11,556 833 Updated Apr 29, 2025

Formula recognition based on LaTeX-OCR and ONNXRuntime.

Python 348 33 Updated Nov 3, 2024

Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"

Jupyter Notebook 1,013 46 Updated Aug 12, 2024

MixTeX: multimodal LaTeX, ZhEn, and table OCR. It performs efficient CPU-based inference locally and offline on Windows.

Python 1,395 77 Updated Apr 24, 2025

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 10,503 756 Updated May 15, 2025

pix2tex: Using a ViT to convert images of equations into LaTeX code.

Python 14,344 1,138 Updated Jan 18, 2025

Effortless data labeling with AI support from Segment Anything and other awesome models.

Python 5,525 612 Updated May 20, 2025

A series of math-specific large language models of our Qwen2 series.

Python 929 134 Updated Jan 11, 2025

A collection of WeChat Mini Program development resources 💯

47,584 8,777 Updated Feb 20, 2025

Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"

Python 368 14 Updated Jan 19, 2025

A generative speech model for daily dialogue.

Python 36,288 3,920 Updated May 6, 2025

YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]

Python 10,721 1,095 Updated Mar 14, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 49,276 5,999 Updated May 20, 2025

A Gradio web UI for Large Language Models with support for multiple inference backends.

Python 43,652 5,622 Updated May 20, 2025

LLM inference in C/C++

C++ 80,598 11,842 Updated May 20, 2025

Open-Sora: Democratizing Efficient Video Production for All

Python 26,461 2,557 Updated Apr 30, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 21,423 1,413 Updated May 20, 2025

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 11,527 1,141 Updated Nov 14, 2024

This is the official code for the MobileSAM project, which makes SAM lightweight for mobile applications and beyond!

Jupyter Notebook 5,178 528 Updated Nov 20, 2024

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 51,985 5,553 Updated May 12, 2025

Efficient vision foundation models for high-resolution generation and perception.

Python 2,875 224 Updated Apr 24, 2025

This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, and CAM++. More models may be supported in the future. At the …

Python 991 143 Updated May 18, 2025