ACL'2025: SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs. and preprint: SoftCoT++: Test-Time Scaling with Soft Chain-of-Thought Reasoning

Python 21 4 Updated May 30, 2025

HongbangYuan / OmniReward

Python 7 Updated May 16, 2025

zhaochen0110 / OpenThinkIMG

OpenThinkIMG is an end-to-end open-source framework that empowers LVLMs to think with images.

Jupyter Notebook 206 4 Updated Jun 1, 2025

weiyifan1023 / senator

Structural Entropy Guided Agent for Detecting and Repairing Knowledge Deficiencies in LLMs

Python 58 3 Updated May 28, 2025

ByteDance-Seed / VeOmni

VeOmni: Scaling any Modality Model Training to any Accelerators with PyTorch native Training Framework

Python 339 15 Updated May 12, 2025

bytedance / deer-flow

DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.

TypeScript 12,436 1,315 Updated May 31, 2025

ByteDance-Seed / Seed-Coder

Seed-Coder is a family of lightweight open-source code LLMs comprising base, instruct and reasoning models, developed by ByteDance Seed.

475 33 Updated May 15, 2025

HarryHsing / EchoInk

EchoInk-R1: Exploring Audio-Visual Reasoning in Multimodal LLMs via Reinforcement Learning [🔥The Exploration of R1 for General Audio-Visual Reasoning with Qwen2.5-Omni]

Python 31 1 Updated May 18, 2025

QwenLM / PolyMath

Evaluation Code Repo for Paper "PolyMath: Evaluating Mathematical Reasoning in Multilingual Contexts"

Python 21 Updated May 22, 2025

mukhal / ThinkPRM

Process Reward Models That Think

38 2 Updated May 29, 2025

AMAP-ML / GPG

GPG: A Simple and Strong Reinforcement Learning Baseline for Model Reasoning

Python 136 5 Updated May 21, 2025

NUS-TRAIL / NoisyRollout

NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation

Python 64 2 Updated May 20, 2025

zwhe99 / DeepMath

A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning

Python 194 9 Updated May 23, 2025

sail-sg / ActivePRM

Jupyter Notebook 15 Updated Apr 16, 2025

SkyworkAI / Skywork-OR1

Unleashing the Power of Reinforcement Learning for Math and Code Reasoners

Python 607 40 Updated May 31, 2025

liuqi6777 / llm4ranking

Large language models for document ranking.

Python 54 Updated May 13, 2025

ByteDance-Seed / Seed-Thinking-v1.5

772 14 Updated Apr 20, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ziyang Huang hzy312

Achievements

Achievements

Highlights

Block or report hzy312

Stars

hbin0701 / Pred-Sent

Zhitao-He / MMBoundary

CharlesQ9 / Alita

TsinghuaC3I / MARTI

GaryStack / MMR-V

TencentARC / Video-Holmes

soyoung97 / AcuRank

Simple-Efficient / RL-Factory

eric-ai-lab / Soft-Thinking

SkyworkAI / DeepResearchAgent

iie-ycx / DEER

yix8 / VisualPlanning

QwenLM / ParScale

xuyige / SoftCoT