Serge-weihao

Serge-weihao

SJTU

34 followers · 33 following

SJTU
Shanghai, China

Achievements

Stars

ByteDance-Seed / Seed1.5-VL

Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.

Jupyter Notebook 1,048 33 Updated May 21, 2025

Calamari-OCR / calamari_models

Pretrained mixed models to be used with Calamari.

63 18 Updated Oct 1, 2024

CodeGoat24 / UnifiedReward

Official implementation of UnifiedReward & UnifiedReward-Think

Python 382 9 Updated May 25, 2025

VITA-MLLM / Long-VITA

✨✨Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuracy

Python 283 28 Updated May 14, 2025

allenai / olmocr

Toolkit for linearizing PDFs for LLM datasets/training

Python 12,472 873 Updated May 23, 2025

huggingface / nanoVLM

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 2,872 223 Updated May 23, 2025

ElliottYan / LUFFY

Official Repository of "Learning to Reason under Off-Policy Guidance"

Python 181 18 Updated May 22, 2025

ByteDance-Seed / SAIL

Implementation for "The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer"

Python 34 1 Updated May 20, 2025

huggingface / nanotron

Minimalistic large language model 3D-parallelism training

Python 1,883 191 Updated May 21, 2025

huggingface / picotron

Minimalistic 4D-parallelism distributed training framework for education purpose

Python 1,501 103 Updated Mar 7, 2025

opendatalab / magic-doc

Python 499 41 Updated Jul 26, 2024

qingzhenduyu / ICAL

Official implementation for ICDAR 2024 Oral paper "ICAL: Implicit Character-Aided Learning for Enhanced Handwritten Mathematical Expression Recognition"

Python 27 1 Updated Aug 16, 2024

WenmuZhou / TableGeneration

通过浏览器渲染生成表格图像

Python 217 42 Updated Apr 10, 2024

liunian-Jay / MU-GOT

GOT的vLLM加速实现并结合 MinerU 实现RAG中的pdf 解析

Python 57 5 Updated Nov 7, 2024

opendatalab / magic-html

Python 455 40 Updated Mar 13, 2025

facebookresearch / nougat

Implementation of Nougat Neural Optical Understanding for Academic Documents

Python 9,447 612 Updated Feb 21, 2025

FormalGeo / FormalGeo

Formal representation and solving for Euclidean plane geometry problems.

Python 22 Updated Jan 11, 2025

ChenghaoMou / text-dedup

All-in-one text de-duplication

Python 677 75 Updated May 24, 2025

Ucas-HaoranWei / GOT-OCR2.0

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 7,573 661 Updated Feb 10, 2025

cuppersd / table_recognition

表格线检测

Python 27 9 Updated Sep 3, 2019

gmarus777 / Printed-Latex-Data-Generation

Python and JS tools to generate Printed LaTex formulas and images

Jupyter Notebook 16 3 Updated Oct 26, 2023

yuxizhe / HTML-UI-datasets-generate

自动生成HTML常用表单元素的样本数据集。供机器学习目标检测训练使用

Jupyter Notebook 9 5 Updated Jan 6, 2023

sail-sg / understand-r1-zero

Understanding R1-Zero-Like Training: A Critical Perspective

Python 945 43 Updated May 24, 2025

google-research-datasets / ToTTo

ToTTo is an open-domain English table-to-text dataset with over 120,000 training examples that proposes a controlled generation task: given a Wikipedia table and a set of highlighted table cells, p…

448 37 Updated Sep 11, 2024

opendatalab / OmniDocBench

[CVPR 2025] A Comprehensive Benchmark for Document Parsing and Evaluation

Python 455 44 Updated May 13, 2025

omron-sinicx / scipostlayout

Python 20 1 Updated Jul 31, 2024

jsvine / pdfplumber

Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.

Python 7,757 734 Updated May 16, 2025

2U1 / Qwen2-VL-Finetune

An open-source implementaion for fine-tuning Qwen2-VL and Qwen2.5-VL series by Alibaba Cloud.

Python 751 92 Updated May 22, 2025

inclusionAI / AReaL

Distributed RL System for LLM Reasoning

Python 1,279 60 Updated May 25, 2025

agentica-project / rllm

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 3,287 304 Updated May 13, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Serge-weihao

Achievements

Achievements

Block or report Serge-weihao

Stars

ByteDance-Seed / Seed1.5-VL

Calamari-OCR / calamari_models

CodeGoat24 / UnifiedReward

VITA-MLLM / Long-VITA

allenai / olmocr

huggingface / nanoVLM

ElliottYan / LUFFY

ByteDance-Seed / SAIL

huggingface / nanotron

huggingface / picotron

opendatalab / magic-doc

qingzhenduyu / ICAL

WenmuZhou / TableGeneration

liunian-Jay / MU-GOT

opendatalab / magic-html

facebookresearch / nougat

FormalGeo / FormalGeo

ChenghaoMou / text-dedup

Ucas-HaoranWei / GOT-OCR2.0

cuppersd / table_recognition

gmarus777 / Printed-Latex-Data-Generation

yuxizhe / HTML-UI-datasets-generate

sail-sg / understand-r1-zero

google-research-datasets / ToTTo

opendatalab / OmniDocBench

omron-sinicx / scipostlayout

jsvine / pdfplumber

2U1 / Qwen2-VL-Finetune

inclusionAI / AReaL

agentica-project / rllm