Stars
This is the repository of DEER, a Dynamic Early Exit in Reasoning method for Large Reasoning Language Models.
The official implementation of "Well Begun is Half Done: Low-resource Preference Alignment by Weak-to-Strong Decoding"
[ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data generation pipeline!
[ACL'25] Code for "Aligning Large Language Models to Follow Instructions and Hallucinate Less via Effective Data Filtering"
🚀 The fast, Pythonic way to build MCP servers and clients
🔥 How to efficiently and effectively compress CoTs, or directly generate concise CoTs during inference while maintaining reasoning performance, is an important topic!
Code for the paper: "Learning to Reason without External Rewards"
Open-source Multi-agent Poster Generation from Papers
VeriThinker: Learning to Verify Makes Reasoning Model Efficient
Align Anything: Training All-modality Model with Feedback
[NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct
Official implementation for "ALI-Agent: Assessing LLMs' Alignment with Human Values via Agent-based Evaluation"
[ICML 2025] Fourier Position Embedding: Enhancing Attention’s Periodic Extension for Length Generalization
Generate high-definition short videos with one click using AI LLMs.
LoRI: Reducing Cross-Task Interference in Multi-Task Low-Rank Adaptation
AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models, ICLR 2025 (Outstanding Paper)
Implementing DeepSeek R1's GRPO algorithm from scratch
[NAACL 2025 Main Selected Oral] Repository for the paper: Prompt Compression for Large Language Models: A Survey
Distributed Compiler Based on Triton for Parallel Systems
🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton