8000 yqy2001's list / LLM · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View yqy2001's full-sized avatar

Organizations

@baaivision

Block or report yqy2001

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

LLM

46 repositories

What would you do with 1000 H100s...

Jupyter Notebook 1,048 66 Updated Jan 10, 2024

Grok open release

Python 50,295 8,352 Updated Aug 30, 2024

A PyTorch Native LLM Training Framework

Python 812 48 Updated Dec 27, 2024

🙌 OpenHands: Code Less, Make More

Python 57,083 6,448 Updated Jun 2, 2025

SWE-bench [Multimodal]: Can Language Models Resolve Real-world Github Issues?

Python 3,009 521 Updated Jun 2, 2025

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

Python 7,206 1,056 Updated May 31, 2025

A framework for few-shot evaluation of language models.

Python 9,119 2,432 Updated May 27, 2025

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Jupyter Notebook 7,369 464 Updated Nov 6, 2024

PyTorch native post-training library

Python 5,233 619 Updated Jun 1, 2025

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.

Go 142,485 11,945 Updated May 31, 2025

CoreNet: A library for training deep neural networks

Jupyter Notebook 7,014 544 Updated May 9, 2025

Robust recipes to align language models with human and AI preferences

Python 5,203 445 Updated Apr 30, 2025

Modeling, training, eval, and inference code for OLMo

Python 5,637 612 Updated May 28, 2025

Evaluation suite for LLMs

Python 348 41 Updated Mar 31, 2025

[ICML 2024] Selecting High-Quality Data for Training Language Models

Python 174 13 Updated Jun 20, 2024

Code accompanying the paper "Massive Activations in Large Language Models"

Python 162 10 Updated Mar 4, 2024
Jupyter Notebook 11 Updated Apr 3, 2023

Minimalistic large language model 3D-parallelism training

Python 1,899 193 Updated May 31, 2025

LLM101n: Let's build a Storyteller

33,537 1,829 Updated Aug 1, 2024
Python 518 45 Updated Nov 20, 2024

Easily embed, cluster and semantically label text datasets

Python 541 41 Updated Mar 28, 2024

The official implementation of Self-Play Fine-Tuning (SPIN)

Python 1,156 101 Updated May 8, 2024

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 21,778 1,450 Updated May 29, 2025

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Jupyter Notebook 11,055 1,605 Updated Apr 26, 2025

aider is AI pair programming in your terminal

Python 33,764 3,079 Updated Jun 1, 2025

[ICLR 2025] LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs

Python 1,659 163 Updated Oct 29, 2024

Scalable toolkit for efficient model alignment

Python 805 100 Updated May 31, 2025

Ongoing research training transformer models at scale

Python 12,477 2,803 Updated May 30, 2025
0