8000 Serge-weihao / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View Serge-weihao's full-sized avatar
  • SJTU
  • Shanghai, China

Block or report Serge-weihao

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.

Jupyter Notebook 1,048 33 Updated May 21, 2025

Pretrained mixed models to be used with Calamari.

63 18 Updated Oct 1, 2024

Official implementation of UnifiedReward & UnifiedReward-Think

Python 382 9 Updated May 25, 2025

✨✨Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuracy

Python 283 28 Updated May 14, 2025

Toolkit for linearizing PDFs for LLM datasets/training

Python 12,472 873 Updated May 23, 2025

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 2,872 223 Updated May 23, 2025

Official Repository of "Learning to Reason under Off-Policy Guidance"

Python 181 18 Updated May 22, 2025

Implementation for "The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer"

Python 34 1 Updated May 20, 2025

Minimalistic large language model 3D-parallelism training

Python 1,883 191 Updated May 21, 2025

Minimalistic 4D-parallelism distributed training framework for education purpose

Python 1,501 103 Updated Mar 7, 2025
Python 499 41 Updated Jul 26, 2024

Official implementation for ICDAR 2024 Oral paper "ICAL: Implicit Character-Aided Learning for Enhanced Handwritten Mathematical Expression Recognition"

Python 27 1 Updated Aug 16, 2024

通过浏览器渲染生成表格图像

Python 217 42 Updated Apr 10, 2024

GOT的vLLM加速实现 并结合 MinerU 实现RAG中的pdf 解析

Python 57 5 Updated Nov 7, 2024
Python 455 40 Updated Mar 13, 2025

Implementation of Nougat Neural Optical Understanding for Academic Documents

Python 9,447 612 Updated Feb 21, 2025

Formal representation and solving for Euclidean plane geometry problems.

Python 22 Updated Jan 11, 2025

All-in-one text de-duplication

Python 677 75 Updated May 24, 2025

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 7,573 661 Updated Feb 10, 2025

表格线检测

Python 27 9 Updated Sep 3, 2019

Python and JS tools to generate Printed LaTex formulas and images

Jupyter Notebook 16 3 Updated Oct 26, 2023

自动生成HTML常用表单元素的样本数据集。供机器学习目标检测训练使用

Jupyter Notebook 9 5 Updated Jan 6, 2023

Understanding R1-Zero-Like Training: A Critical Perspective

Python 945 43 Updated May 24, 2025

ToTTo is an open-domain English table-to-text dataset with over 120,000 training examples that proposes a controlled generation task: given a Wikipedia table and a set of highlighted table cells, p…

448 37 Updated Sep 11, 2024

[CVPR 2025] A Comprehensive Benchmark for Document Parsing and Evaluation

Python 455 44 Updated May 13, 2025
Python 20 1 Updated Jul 31, 2024

Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.

Python 7,757 734 Updated May 16, 2025

An open-source implementaion for fine-tuning Qwen2-VL and Qwen2.5-VL series by Alibaba Cloud.

Python 751 92 Updated May 22, 2025

Distributed RL System for LLM Reasoning

Python 1,279 60 Updated May 25, 2025

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 3,287 304 Updated May 13, 2025
Next
0