8000 lzu-ShuhaoLi (Li Shuhao) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View lzu-ShuhaoLi's full-sized avatar
  • 兰州大学
  • Lanzhou, China
  • 16:51 (UTC +08:00)

Block or report lzu-ShuhaoLi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 11 1 Updated May 14, 2025

Fully open reproduction of DeepSeek-R1

Python 24,638 2,279 Updated May 28, 2025

Fully open data curation for reasoning models

Python 1,797 152 Updated May 20, 2025

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 3,308 306 Updated May 13, 2025

800,000 step-level correctness labels on LLM solutions to MATH problems

Python 2,001 118 Updated Jun 1, 2023
Python 293 18 Updated May 31, 2025

The official repo for "TheoremQA: A Theorem-driven Question Answering dataset" (EMNLP 2023)

Python 32 3 Updated May 15, 2024

The code and data for "MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark" [NeurIPS 2024]

Python 247 36 Updated Feb 28, 2025

Repository for Self-Evaluative Decoding, SED.

Python 2 Updated Sep 17, 2024

In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)

Python 59 5 Updated Mar 30, 2024

Uncertainty-guided Likelihood Tree Search

Python 8 Updated Nov 15, 2024

Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"

Python 490 58 Updated Jan 17, 2025
Python 3 Updated Dec 23, 2023
Python 10 5 Updated Feb 28, 2024

LLM-Check: Investigating Detection of Hallucinations in Large Language Models (NeurIPS 2024)

Jupyter Notebook 19 1 Updated Dec 8, 2024
Python 25 6 Updated Dec 9, 2024
Python 6 Updated Oct 21, 2023
Python 1,835 292 Updated Apr 19, 2024
Python 18 Updated Sep 18, 2024

Source code of DRAGIN, ACL 2024 main conference Long Paper (Oral)

Python 147 19 Updated Feb 21, 2025

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 13,883 1,695 Updated Jun 2, 2025
Python 69 14 Updated Jan 22, 2025
Python 30 3 Updated Oct 31, 2024

Official Code for Oᴘᴇɴ-RAG: Enhanced Retrieval Augmented Reasoning with Open-Source Large Language Models (EMNLP Findings 2024)

Python 119 12 Updated Feb 20, 2025

[ICML'2024] Can AI Assistants Know What They Don't Know?

Python 80 9 Updated Feb 5, 2024

Source code of our paper MIND, ACL 2024 Long Paper

Python 41 12 Updated May 28, 2024
JavaScript 126 6 Updated Sep 10, 2024

Paper Reproduction Google SCoRE(Training Language Models to Self-Correct via Reinforcement Learning)

Jupyter Notebook 139 23 Updated Sep 21, 2024

Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"

Python 159 15 Updated May 25, 2025

LLM inference in C/C++

C++ 81,214 11,980 Updated Jun 2, 2025
Next
0