hzy312 (Ziyang Huang) / Starred · GitHub
🎯
Focusing
  • Chinese Academy of Sciences
  • Beijing, China
  • 15:15 (UTC +08:00)

Highlights

  • Pro

Official Implementation of the Paper "Let's Predict Step by Step"

Python 3 1 Updated May 27, 2025

[ACL 2025 Main] MMBoundary: Advancing MLLM Knowledge Boundary Awareness through Reasoning Step Confidence Calibration

5 Updated May 30, 2025

A Framework for LLM-based Multi-Agent Reinforced Training and Inference

Python 69 1 Updated May 30, 2025

This is the code repository of the video reasoning benchmark MMR-V

Python 4 Updated May 19, 2025

Video-Holmes: Can MLLM Think Like Holmes for Complex Video Reasoning?

Python 46 Updated May 30, 2025
Python 7 Updated May 27, 2025

Train your Agent model via our easy and efficient framework

Python 817 77 Updated May 30, 2025

Official implementation of the paper "Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space"

Python 127 8 Updated May 30, 2025

This is the repository of DEER, a Dynamic Early Exit in Reasoning method for Large Reasoning Language Models.

Python 114 1 Updated May 30, 2025

Visual Planning: Let's Think Only with Images

Python 182 5 Updated May 20, 2025

Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling

Python 364 15 Updated May 17, 2025

ACL 2025: SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs, and the preprint SoftCoT++: Test-Time Scaling with Soft Chain-of-Thought Reasoning

Python 21 4 Updated May 30, 2025
Python 7 Updated May 16, 2025

OpenThinkIMG is an end-to-end open-source framework that empowers LVLMs to think with images.

Jupyter Notebook 206 4 Updated Jun 1, 2025

Structural Entropy Guided Agent for Detecting and Repairing Knowledge Deficiencies in LLMs

Python 58 3 Updated May 28, 2025

VeOmni: Scaling any Modality Model Training to any Accelerators with PyTorch native Training Framework

Python 339 15 Updated May 12, 2025

DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.

TypeScript 12,436 1,315 Updated May 31, 2025

Seed-Coder is a family of lightweight open-source code LLMs comprising base, instruct and reasoning models, developed by ByteDance Seed.

475 33 Updated May 15, 2025

EchoInk-R1: Exploring Audio-Visual Reasoning in Multimodal LLMs via Reinforcement Learning [🔥The Exploration of R1 for General Audio-Visual Reasoning with Qwen2.5-Omni]

Python 31 1 Updated May 18, 2025

Evaluation Code Repo for Paper "PolyMath: Evaluating Mathematical Reasoning in Multilingual Contexts"

Python 21 Updated May 22, 2025

Process Reward Models That Think

38 2 Updated May 29, 2025

GPG: A Simple and Strong Reinforcement Learning Baseline for Model Reasoning

Python 136 5 Updated May 21, 2025

NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation

Python 64 2 Updated May 20, 2025

A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning

Python 194 9 Updated May 23, 2025
Jupyter Notebook 15 Updated Apr 16, 2025

Unleashing the Power of Reinforcement Learning for Math and Code Reasoners

Python 607 40 Updated May 31, 2025

Large language models for document ranking.

Python 54 Updated May 13, 2025