University of California, San Diego
La Jolla, California
Stars
Best practices for Megatron on veRL, plus a tuning guide
A repo for open research on building large reasoning models
Official implementation for DenseMixer: Improving MoE Post-Training with Precise Router Gradient
The evaluation framework for training-free sparse attention in LLMs
An Awesome List of Reinforcement Learning-based Large Language Agent works, collected directly from the official code bases.
Implementation and datasets for "Training Language Models to Generate Quality Code with Program Analysis Feedback"
Official repository for FLAME-MoE: A Transparent End-to-End Research Platform for Mixture-of-Experts Language Models
Code for our tutorial on Discrete Variational Autoencoders
[TMLR] A curated list of language modeling research for code (and other software engineering activities), plus related datasets.
[ACL 2025 Oral] Cuckoo: A Series of IE Free Riders Using LLMs' Resources to Scale Up Themselves.
A bibliography and survey of the papers surrounding o1
The road to hacking SysML and becoming a systems expert
A framework for the evaluation of autoregressive code generation language models.
Tools for merging pretrained large language models.
Large Language Model Text Generation Inference
GPU programming related news and material links
Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We also show you how to solve end-to-end problems using Llama mode…
alibaba / Megatron-LLaMA
Forked from NVIDIA/Megatron-LM. Best practices for training LLaMA models in Megatron-LM.
MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.
A curated list of papers and applications on tool learning.
A repo listing papers related to LLM-based agents
Train transformer language models with reinforcement learning.
A curated list of awesome resources dedicated to Scaling Laws for LLMs
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming