8000 mzh1993 (shouhengmzh) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View mzh1993's full-sized avatar

Block or report mzh1993

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Python 23,288 3,331 Updated Mar 5, 2025

An open-source AI agent that brings the power of Gemini directly into your terminal.

TypeScript 52,146 4,489 Updated Jul 4, 2025

Voice Recognition to Text Tool / 一个离线运行的本地音视频转字幕工具,输出json、srt字幕、纯文字格式

Python 3,571 387 Updated Dec 5, 2024

A powerful tool for creating fine-tuning datasets for LLM

JavaScript 9,085 885 Updated Jul 3, 2025

Effortless data labeling with AI support from Segment Anything and other awesome models.

Python 5,905 663 Updated Jul 3, 2025

一个能讲广东话(粤语)的小程序

Vue 6 1 Updated Jul 18, 2024

🚀 One-stop solution for creating your digital avatar from chat history 💡 Fine-tune LLMs with your chat logs to capture your unique style, then bind to a chatbot to bring your digital self to life. …

Python 14,635 1,135 Updated Jul 3, 2025

Model Context Protocol Servers

TypeScript 57,646 6,663 Updated Jul 4, 2025

Arduino library to play MOD, WAV, FLAC, MIDI, RTTTL, MP3, and AAC files on I2S DACs or with a software emulated delta-sigma DAC on the ESP8266 and ESP32 and Pico

C 2,219 457 Updated Apr 18, 2025

Roo Code (prev. Roo Cline) gives you a whole dev team of AI agents in your code editor.

TypeScript 16,391 1,782 Updated Jul 4, 2025

Hi all, this a flight controller code for ESP32 written on ArduinoIDE, there are test files for each component, follow schematic.

C++ 211 56 Updated Mar 18, 2025

使用 ncmdump ,实现全自动网易云音乐ncm格式转mp3

Python 477 61 Updated Aug 12, 2020

转换网易云音乐 ncm 到 mp3 / flac. Convert Netease Cloud Music ncm files to mp3/flac files.

C++ 2,229 300 Updated Jun 8, 2025

🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation

Python 17,315 2,022 Updated Jul 4, 2025

Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal is…

Python 4,273 321 Updated May 28, 2025

百聆 是一个类似GPT-4o的语音对话机器人,通过ASR+LLM+TTS实现,集成DeepSeek R1等优秀大模型,时延低至800ms,Mac等低配置也可运行,支持打断

Python 1,309 226 Updated Jun 18, 2025

本项目为xiaozhi-esp32提供后端服务,帮助您快速搭建ESP32设备控制服务器。Backend service for xiaozhi-esp32, helps you quickly build an ESP32 device control server.

Python 5,406 1,858 Updated Jul 4, 2025

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Python 58,904 5,845 Updated Jul 4, 2025

🏠 将小爱音箱接入 ChatGPT 和豆包,改造成你的专属语音助手。

TypeScript 11,288 1,465 Updated May 21, 2025

使用小爱音箱播放音乐,音乐使用 yt-dlp 下载。

Python 4,658 508 Updated Jul 4, 2025

Build your own AI friend

JavaScript 633 244 Updated Jun 7, 2025

An MCP-based chatbot | 一个基于MCP的聊天机器人

C++ 16,100 3,088 Updated Jul 1, 2025

利用AI大模型,一键解说并剪辑视频; Using AI models to automatically provide commentary and edit videos with a single click.

Python 5,728 682 Updated Jul 2, 2025

Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.

Python 4,717 549 Updated Mar 11, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 15,705 2,260 Updated Jul 4, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 51,437 8,494 Updated Jul 4, 2025

Official inference framework for 1-bit LLMs

Python 20,420 1,527 Updated Jun 3, 2025

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Python 4,888 522 Updated Apr 11, 2025

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

Python 2,646 430 Updated Jul 3, 2025
Next
0