8000 SSshuishui (Xiang Zhao) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View SSshuishui's full-sized avatar
  • Beihang University
  • Beijing
  • 09:02 (UTC +08:00)

Highlights

  • Pro

Block or report SSshuishui

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 29 5 Updated Jun 1, 2025

This is the repository of DEER, a Dynamic Early Exit in Reasoning method for Large Reasoning Language Models.

Python 126 3 Updated Jun 19, 2025

The official implementation of "Well Begun is Half Done: Low-resource Preference Alignment by Weak-to-Strong Decoding"

Python 17 1 Updated Jun 11, 2025
Python 81 1 Updated Jun 15, 2025

[ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data generation pipeline!

Python 716 62 Updated Mar 17, 2025

[ACL'25] Code for "Aligning Large Language Models to Follow Instructions and Hallucinate Less via Effective Data Filtering"

Python 20 Updated Jun 4, 2025

🚀 The fast, Pythonic way to build MCP servers and clients

Python 12,884 775 Updated Jun 19, 2025

🔥 How to efficiently and effectively compress the CoTs or directly generate concise CoTs during inference while maintaining the reasoning performance is an important topic!

49 3 Updated May 22, 2025

Simple RL training for reasoning

Python 3,634 271 Updated Apr 10, 2025

Code for the paper: "Learning to Reason without External Rewards"

Python 293 24 Updated Jun 17, 2025

Open-source Multi-agent Poster Generation from Papers

Python 2,106 114 Updated Jun 17, 2025

VeriThinker: Learning to Verify Makes Reasoning Model Efficient

Python 45 1 Updated May 29, 2025

Align Anything: Training All-modality Model with Feedback

Jupyter Notebook 4,010 496 Updated May 28, 2025

[NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct

Python 177 9 Updated Jan 16, 2025

Official implementation for "ALI-Agent: Assessing LLMs'Alignment with Human Values via Agent-based Evaluation"

Python 18 3 Updated May 7, 2025
Python 7 1 Updated Jun 11, 2025

[ICML 2025] Fourier Position Embedding: Enhancing Attention’s Periodic Extension for Length Generalization

Python 71 5 Updated Jun 2, 2025

利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.

Python 36,889 5,265 Updated Jun 11, 2025

LoRI: Reducing Cross-Task Interference in Multi-Task Low-Rank Adaptation

Python 127 7 Updated Jun 17, 2025

Tina: Tiny Reasoning Models via LoRA

Python 259 33 Updated May 29, 2025

AI-powered multi-agent builder

TypeScript 3,225 276 Updated Jun 18, 2025

AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models, ICLR 2025 (Outstanding Paper)

Python 264 15 Updated Jun 18, 2025

Implementing DeepSeek R1's GRPO algorithm from scratch

Python 1,419 63 Updated Apr 18, 2025

[NAACL 2025 Main Selected Oral] Repository for the paper: Prompt Compression for Large Language Models: A Survey

24 Updated May 18, 2025

ICLR 2025

Python 26 1 Updated May 21, 2025
Python 5 Updated May 12, 2025

Distributed Compiler Based on Triton for Parallel Systems

Python 831 63 Updated Jun 18, 2025

🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton

Python 2,747 196 Updated Jun 20, 2025
Next
0