Li Shuhao lzu-ShuhaoLi

My name is Li Shuhao, and I am at Lanzhou University.

2 followers · 2 following

Lanzhou University (兰州大学)
Lanzhou, China

Lists (2)

cannot run

2 repositories

LLM+RL

4 repositories

Stars

MaHuanAAA / logtoku

Python 11 1 Updated May 14, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 24,638 2,279 Updated May 28, 2025

open-thoughts / open-thoughts

Fully open data curation for reasoning models

Python 152 Updated May 20, 2025

agentica-project / rllm

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 3,308 306 Updated May 13, 2025

openai / prm800k

800,000 step-level correctness labels on LLM solutions to MATH problems

Python 2,001 118 Updated Jun 1, 2023

eddycmu / demystify-long-cot

Python 293 18 Updated May 31, 2025

TIGER-AI-Lab / TheoremQA

The official repo for "TheoremQA: A Theorem-driven Question Answering dataset" (EMNLP 2023)

Python 32 3 Updated May 15, 2024

TIGER-AI-Lab / MMLU-Pro

The code and data for "MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark" [NeurIPS 2024]

Python 247 36 Updated Feb 28, 2025

lzqv5 / LLMDecoding

Repository for Self-Evaluative Decoding, SED.

Python 2 Updated Sep 17, 2024

hkust-nlp / Activation_Decoding

In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)

Python 59 5 Updated Mar 30, 2024

JuliaGrosse / ults

Uncertainty-guided Likelihood Tree Search

Python 8 Updated Nov 15, 2024

voidism / DoLa

Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"

Python 490 58 Updated Jan 17, 2025

mk322 / ever_hallu

Python 3 Updated Dec 23, 2023

MBZUAI-CLeaR / IoE-Prompting

Python 10 5 Updated Feb 28, 2024

GaurangSriramanan / LLM_Check_Hallucination_Detection

LLM-Check: Investigating Detection of Hallucinations in Large Language Models (NeurIPS 2024)

Jupyter Notebook 19 1 Updated Dec 8, 2024

D2I-ai / eigenscore

Python 25 6 Updated Dec 9, 2024

Yuki-Asuuna / UMWP

Python 6 Updated Oct 21, 2023

tonyzhaozh / aloha

Python 1,835 292 Updated Apr 19, 2024

Bocchi7 / DRAGIN_simplified

Python 18 Updated Sep 18, 2024

oneal2000 / DRAGIN

Source code of DRAGIN, ACL 2024 main conference Long Paper (Oral)

Python 147 19 Updated Feb 21, 2025

huggingface / lerobot

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 13,883 1,695 Updated Jun 2, 2025

technion-cs-nlp / LLMsKnow

Python 69 14 Updated Jan 22, 2025

cmu-mind / RISE

Python 30 3 Updated Oct 31, 2024

ShayekhBinIslam / openrag

Official Code for Oᴘᴇɴ-RAG: Enhanced Retrieval Augmented Reasoning with Open-Source Large Language Models (EMNLP Findings 2024)

Python 119 12 Updated Feb 20, 2025

OpenMOSS / Say-I-Dont-Know

[ICML'2024] Can AI Assistants Know What They Don't Know?

Python 80 9 Updated Feb 5, 2024

oneal2000 / MIND

Source code of our paper MIND, ACL 2024 Long Paper

Python 41 12 Updated May 28, 2024

AmourWaltz / Reliable-LLM

JavaScript 126 6 Updated Sep 10, 2024

daje0601 / Google_SCoRe

Paper Reproduction Google SCoRE(Training Language Models to Self-Correct via Reinforcement Learning)

Jupyter Notebook 139 23 Updated Sep 21, 2024

McGill-NLP / VinePPO

Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"

Python 159 15 Updated May 25, 2025

ggml-org / llama.cpp

LLM inference in C/C++

C++ 81,214 11,980 Updated Jun 2, 2025
