8000 yuanluw (matt) / Starred · GitHub

More Web Proxy on the site http://driver.im/

yuanluw

Follow

matt yuanluw

Follow

3 followers · 4 following

Achievements

Achievements

Starred repositories

jingyaogong / minimind-v

🚀 「大模型」1小时从0训练26M参数的视觉多模态VLM！🌏 Train a 26M-parameter VLM from scratch in just 1 hours!

Python 3,755 374 Updated Apr 27, 2025

bilibili / Index-anisora

Python 1,325 58 Updated Jun 6, 2025

microsoft / markitdown

Python tool for converting files and office documents to Markdown.

Python 58,823 3,037 Updated Jun 4, 2025

hiyouga / EasyR1

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 2,594 185 Updated Jun 6, 2025

fzp0424 / MT-R1-Zero

Code for paper "MT-R1-Zero: Advancing LLM-based Machine Translation via R1-Zero-like Reinforcement Learning"

Python 47 3 Updated Apr 15, 2025

FoundationAgents / OpenManus

No fortress, purely open ground. OpenManus is Coming.

Python 46,628 8,153 Updated Jun 9, 2025

microsoft / OmniParser

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 22,372 1,881 Updated Mar 26, 2025

ranpox / awesome-computer-use

This is a collection of resources for computer-use GUI agents, including videos, blogs, papers, and projects.

374 14 Updated Jun 4, 2025

allenai / olmocr

Toolkit for linearizing PDFs for LLM datasets/training

Python 12,806 912 Updated Jun 6, 2025

modelscope / MemoryScope

Python 483 43 Updated Feb 17, 2025

LC1332 / Zero-Haruhi

The plan which extend ChatHaruhi into Zero-shot Roleplaying model

Jupyter Notebook 108 18 Updated Apr 12, 2024

wjx-git / IllegalTextDetection

Python 79 25 Updated Jan 2, 2024

fishaudio / fish-speech

SOTA Open Source TTS

Python 21,570 1,748 Updated Jun 7, 2025

RVC-Boss / GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 47,359 5,218 Updated Jun 9, 2025

datalab-to / surya

OCR, layout analysis, reading order, table recognition in 90+ languages

Python 17,589 1,169 Updated Jun 6, 2025

SlongLiu / query2labels

Official implementation of paper "Query2Label: A Simple Transformer Way to Multi-Label Classification".

Python 441 70 Updated Mar 18, 2022

OpenBMB / MiniCPM-o

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 19,559 1,414 Updated May 27, 2025

facebookresearch / moco

PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722

Python 4,991 801 Updated Apr 30, 2025

Ucas-HaoranWei / GOT-OCR2.0

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 7,619 668 Updated Feb 10, 2025

modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 10,913 1,097 Updated May 28, 2025

tensorlakeai / indexify

A realtime serving engine for Data-Intensive Generative AI Applications

Rust 1,009 127 Updated Jun 9, 2025

yixuan730 / DetToolChain

Dettoolchain: A new prompting paradigm to unleash detection ability of MLLM

Python 37 2 Updated Oct 12, 2024

bloc97 / Anime4K

A High-Quality Real Time Upscaler for Anime Video

Jupyter Notebook 19,606 1,372 Updated Aug 17, 2024

KwaiVGI / LivePortrait

Bring portraits to life!

Python 16,083 1,678 Updated Jun 1, 2025

jiaxi-jiang / FBCNN

Official Code for ICCV 2021 paper "Towards Flexible Blind JPEG Artifacts Removal (FBCNN)"

Python 481 47 Updated Apr 19, 2024

hzwer / Practical-RIFE

More practical frame interpolation approach.

Python 759 88 Updated May 20, 2025

Kiteretsu77 / APISR

APISR: Anime Production Inspired Real-World Anime Super-Resolution (CVPR 2024)

Python 1,003 67 Updated Jun 28, 2024

bilibili / ailab

Python 5,755 554 Updated Aug 2, 2023

josdejong / jsonrepair

Repair invalid JSON documents

TypeScript 775 46 Updated Feb 14, 2025

NanmiCoder / MediaCrawler

小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频｜评论爬虫、微博帖子｜评论爬虫、百度贴吧帖子｜百度贴吧评论回复爬虫 | 知乎问答文章｜评论爬虫

Python 23,300 6,470 Updated Jun 8, 2025

Starred topics

tts

3D

forgery-detection

0