ZhuoRoger / Starred · GitHub

📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥

1,508 52 Updated Jun 4, 2025

SoTA open-source TTS

Python 5,757 623 Updated Jun 4, 2025

Movie Gen Bench - two media generation evaluation benchmarks released with Meta Movie Gen

392 22 Updated Mar 8, 2025

Split your restaurant bills easily

TypeScript 226 23 Updated May 21, 2025

Document to Markdown OCR library with Llama 3.2 vision

TypeScript 2,328 231 Updated Jan 20, 2025

FULL v0, Cursor, Manus, Same.dev, Lovable, Devin, Replit Agent, Windsurf Agent, VSCode Agent, Dia Browser & Trae AI (And other Open Sourced) System Prompts, Tools & AI Models.

55,310 16,915 Updated May 31, 2025

Finetune Qwen3, Llama 4, TTS, DeepSeek-R1 & Gemma 3 LLMs 2x faster with 70% less memory! 🦥

Python 40,055 3,169 Updated Jun 5, 2025

Research papers and blogs to transition to AI Engineering

1,128 144 Updated May 31, 2025
Python 1,302 55 Updated Jun 3, 2025

Code that accompanies the public release of the paper Lost in Conversation (https://arxiv.org/abs/2505.06120)

Python 125 8 Updated May 15, 2025
Python 183 18 Updated May 20, 2025
JavaScript 118 99 Updated Apr 29, 2025

Matrix-Game: Interactive World Foundation Model

Python 706 76 Updated May 13, 2025

High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.

Python 10,115 897 Updated May 30, 2025

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Python 10,150 861 Updated Jul 6, 2024

A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gemini 2 Flash.

Python 1,364 60 Updated May 28, 2025

Suna - Open Source Generalist AI Agent

TypeScript 13,958 2,040 Updated Jun 5, 2025

MAGI-1: Autoregressive Video Generation at Scale

Python 3,230 185 Updated Jun 4, 2025

The official Python library for Arklex framework

Python 242 88 Updated Jun 5, 2025

Enjoy the magic of Diffusion models!

Python 8,756 791 Updated May 19, 2025

Build resilient language agents as graphs.

Python 13,665 2,305 Updated Jun 5, 2025

🖥️ Run AI Agent in your browser.

Python 13,476 2,267 Updated Jun 1, 2025

Interface for OuteTTS models.

Python 1,296 106 Updated May 28, 2025

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.

Go 142,856 11,995 Updated Jun 5, 2025
Python 2,167 208 Updated Jun 4, 2025

Preprocess Audio for training

Python 340 60 Updated Mar 3, 2025

OCR Benchmark

TypeScript 498 38 Updated May 27, 2025

The fastest AI Chatbot for any LLM

933 21 Updated Jun 5, 2025

Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents

Python 2,498 77 Updated Jun 5, 2025