8000 Li-Hyn / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View Li-Hyn's full-sized avatar
:octocat:
I may be slow to respond.
:octocat:
I may be slow to respond.

Block or report Li-Hyn

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

My learning notes/codes for ML SYS.

Python 2,765 170 Updated Jul 6, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 7,270 706 Updated Jun 19, 2025

Distributed RL System for LLM Reasoning

Python 1,944 108 Updated Jul 6, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 53,650 6,570 Updated Jul 6, 2025

This is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models.

Python 483 33 Updated Feb 12, 2024
Python 44 2 Updated Jan 7, 2024

Numbers every LLM developer should know

4,238 139 Updated Jan 16, 2024

BeHonest: Benchmarking Honesty in Large Language Models

JavaScript 34 Updated Aug 15, 2024
Jupyter Notebook 17 2 Updated Dec 21, 2023

A curated list of Human Preference Datasets for LLM fine-tuning, RLHF, and eval.

369 17 Updated Oct 4, 2023

A reading list on LLM based Synthetic Data Generation 🔥

1,325 76 Updated Jun 5, 2025

Awesome-Jailbreak-on-LLMs is a collection of state-of-the-art, novel, exciting jailbreak methods on LLMs. It contains papers, codes, datasets, evaluations, and analyses.

778 68 Updated Jun 29, 2025

A bibliography and survey of the papers surrounding o1

TeX 1,205 50 Updated Nov 16, 2024

[2025-TMLR] A Survey on the Honesty of Large Language Models

58 2 Updated Dec 8, 2024

These papers will provide unique insightful concepts that will broaden your perspective on neural networks and deep learning

48 Updated Sep 3, 2023

The Paper List on Data Contamination for Large Language Models Evaluation.

95 3 Updated Mar 31, 2025

Paper list for the survey "Combating Misinformation in the Age of LLMs: Opportunities and Challenges" and the initiative "LLMs Meet Misinformation", accepted by AI Magazine 2024

102 9 Updated Nov 9, 2024

A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc..

255 12 Updated Mar 20, 2025

A survey on harmful fine-tuning attack for large language model

189 7 Updated Jul 1, 2025
Python 3,782 379 Updated May 13, 2025

Industrial-level evaluation benchmarks for Coding LLMs in the full life-cycle of AI native software developing.企业级代码大模型评测体系,持续开放中

Python 96 15 Updated Apr 28, 2025

A paper & resource list of large language models, including course, paper, demo, figures

199 8 Updated Aug 8, 2023

NLP研究入门之道

2,025 255 Updated Nov 26, 2019

Curated list of datasets and tools for post-training.

3,236 271 Updated Jan 29, 2025

Attack to induce LLMs within hallucinations

Python 156 19 Updated May 17, 2024

DAMO-ConvAI: The official repository which contains the codebase for Alibaba DAMO Conversational AI.

Python 1,426 221 Updated May 26, 2025

This is the repository of the Ape210K dataset and baseline models.

Python 194 59 Updated Dec 10, 2019

[ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning

Python 357 30 Updated Sep 6, 2024

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

HTML 8,430 909 Updated Apr 30, 2025

A collection for math word problem (MWP) works, including datasets, algorithms and so on.

Python 44 4 Updated Jun 18, 2024
Next
0