yqy2001

🎯 RL towards the ultimate.

164 followers · 362 following

Tsinghua, AIR
yqy2001.github.io

Stars

LLM

46 repositories

srush / LLM-Training-Puzzles

What would you do with 1000 H100s...

Jupyter Notebook · 1,048 stars · 66 forks · Updated Jan 10, 2024

xai-org / grok-1

Grok open release

Python · 50,295 stars · 8,352 forks · Updated Aug 30, 2024

volcengine / veScale

A PyTorch Native LLM Training Framework

Python · 812 stars · 48 forks · Updated Dec 27, 2024

All-Hands-AI / OpenHands

🙌 OpenHands: Code Less, Make More

Python · 57,083 stars · 6,448 forks · Updated Jun 2, 2025

SWE-bench / SWE-bench

SWE-bench [Multimodal]: Can Language Models Resolve Real-world Github Issues?

Python · 3,009 stars · 521 forks · Updated Jun 2, 2025

EleutherAI / gpt-neox

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

Python · 7,206 stars · 1,056 forks · Updated May 31, 2025

EleutherAI / lm-evaluation-harness

A framework for few-shot evaluation of language models.

Python · 9,119 stars · 2,432 forks · Updated May 27, 2025

OpenBMB / MiniCPM

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Jupyter Notebook · 7,369 stars · 464 forks · Updated Nov 6, 2024

pytorch / torchtune

PyTorch native post-training library

Python · 5,233 stars · 619 forks · Updated Jun 1, 2025

ollama / ollama

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.

Go · 142,485 stars · 11,945 forks · Updated May 31, 2025

apple / corenet

CoreNet: A library for training deep neural networks

Jupyter Notebook · 7,014 stars · 544 forks · Updated May 9, 2025

huggingface / alignment-handbook

Robust recipes to align language models with human and AI preferences

Python · 5,203 stars · 445 forks · Updated Apr 30, 2025

allenai / OLMo

Modeling, training, eval, and inference code for OLMo

Python · 5,637 stars · 612 forks · Updated May 28, 2025

allenai / OLMo-Eval

Evaluation suite for LLMs

Python · 348 stars · 41 forks · Updated Mar 31, 2025

princeton-nlp / QuRating

[ICML 2024] Selecting High-Quality Data for Training Language Models

Python · 174 stars · 13 forks · Updated Jun 20, 2024

locuslab / massive-activations

Code accompanying the paper "Massive Activations in Large Language Models"

Python · 162 stars · 10 forks · Updated Mar 4, 2024

google-deepmind / language_modeling_is_compression

Python · 137 stars · 17 forks · Updated Aug 28, 2024

anthropics / ConstitutionalHarmlessnessPaper

236 stars · 23 forks · Updated Dec 21, 2022

zxytim / arithmetic-encoding-compression

Jupyter Notebook · 11 stars · Updated Apr 3, 2023

huggingface / nanotron

Minimalistic large language model 3D-parallelism training

Python · 1,899 stars · 193 forks · Updated May 31, 2025

karpathy / LLM101n

LLM101n: Let's build a Storyteller

33,537 stars · 1,829 forks · Updated Aug 1, 2024

huggingface / cosmopedia

Python · 518 stars · 45 forks · Updated Nov 20, 2024

huggingface / text-clustering

Easily embed, cluster and semantically label text datasets

Python · 541 stars · 41 forks · Updated Mar 28, 2024

uclaml / SPIN

The official implementation of Self-Play Fine-Tuning (SPIN)

Python · 1,156 stars · 101 forks · Updated May 8, 2024

QwenLM / Qwen3

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell · 21,778 stars · 1,450 forks · Updated May 29, 2025

SakanaAI / AI-Scientist

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Jupyter Notebook · 11,055 stars · 1,605 forks · Updated Apr 26, 2025

Aider-AI / aider

aider is AI pair programming in your terminal

Python · 33,764 stars · 3,079 forks · Updated Jun 1, 2025

THUDM / LongWriter

[ICLR 2025] LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs

Python · 1,659 stars · 163 forks · Updated Oct 29, 2024

NVIDIA / NeMo-Aligner

Scalable toolkit for efficient model alignment

Python · 805 stars · 100 forks · Updated May 31, 2025

NVIDIA / Megatron-LM

Ongoing research training transformer models at scale

Python · 12,477 stars · 2,803 forks · Updated May 30, 2025
