yuanluw (matt) / Starred · GitHub

Starred repositories

🚀 Train a 26M-parameter vision-language model (VLM) from scratch in just 1 hour!

Python 3,755 374 Updated Apr 27, 2025
Python 1,325 58 Updated Jun 6, 2025

Python tool for converting files and office documents to Markdown.

Python 58,823 3,037 Updated Jun 4, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 2,594 185 Updated Jun 6, 2025

Code for paper "MT-R1-Zero: Advancing LLM-based Machine Translation via R1-Zero-like Reinforcement Learning"

Python 47 3 Updated Apr 15, 2025

No fortress, purely open ground. OpenManus is Coming.

Python 46,628 8,153 Updated Jun 9, 2025

A simple screen-parsing tool for pure-vision-based GUI agents

Jupyter Notebook 22,372 1,881 Updated Mar 26, 2025

This is a collection of resources for computer-use GUI agents, including videos, blogs, papers, and projects.

374 14 Updated Jun 4, 2025

Toolkit for linearizing PDFs for LLM datasets/training

Python 12,806 912 Updated Jun 6, 2025
Python 483 43 Updated Feb 17, 2025

A plan to extend ChatHaruhi into a zero-shot role-playing model

Jupyter Notebook 108 18 Updated Apr 12, 2024

SOTA Open Source TTS

Python 21,570 1,748 Updated Jun 7, 2025

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 47,359 5,218 Updated Jun 9, 2025

OCR, layout analysis, reading order, table recognition in 90+ languages

Python 17,589 1,169 Updated Jun 6, 2025

Official implementation of paper "Query2Label: A Simple Transformer Way to Multi-Label Classification".

Python 441 70 Updated Mar 18, 2022

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 19,559 1,414 Updated May 27, 2025

PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722

Python 4,991 801 Updated Apr 30, 2025

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 7,619 668 Updated Feb 10, 2025

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 10,913 1,097 Updated May 28, 2025

A realtime serving engine for Data-Intensive Generative AI Applications

Rust 1,009 127 Updated Jun 9, 2025

Dettoolchain: A new prompting paradigm to unleash detection ability of MLLM

Python 37 2 Updated Oct 12, 2024

A High-Quality Real Time Upscaler for Anime Video

Jupyter Notebook 19,606 1,372 Updated Aug 17, 2024

Bring portraits to life!

Python 16,083 1,678 Updated Jun 1, 2025

Official Code for ICCV 2021 paper "Towards Flexible Blind JPEG Artifacts Removal (FBCNN)"

Python 481 47 Updated Apr 19, 2024

More practical frame interpolation approach.

Python 759 88 Updated May 20, 2025

APISR: Anime Production Inspired Real-World Anime Super-Resolution (CVPR 2024)

Python 1,003 67 Updated Jun 28, 2024
Python 5,755 554 Updated Aug 2, 2023

Repair invalid JSON documents

TypeScript 775 46 Updated Feb 14, 2025

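Crawlers for Xiaohongshu notes and comments, Douyin videos and comments, Kuaishou videos and comments, Bilibili videos and comments, Weibo posts and comments, Baidu Tieba posts and comment replies, and Zhihu Q&A articles and comments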

Python 23,300 6,470 Updated Jun 8, 2025