8000 yaof20 (Feng Yao) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View yaof20's full-sized avatar
😶
I may be slow to respond
😶
I may be slow to respond
  • University of California, San Diego
  • La Jolla, California
  • 12:38 (UTC -07:00)

Highlights

  • Pro

Organizations

@thunlp

Block or report yaof20

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

(best/better) practices of megatron on veRL and tuning guide

Shell 22 1 Updated Jul 10, 2025

A repo for open research on building large reasoning models

Python 71 5 Updated Jul 12, 2025

Official implementation for DenseMixer: Improving MoE Post-Training with Precise Router Gradient

Python 33 6 Updated Jul 12, 2025

The evaluation framework for training-free sparse attention in LLMs

Python 82 4 Updated Jun 19, 2025

A version of verl to support tool use

Python 291 25 Updated Jul 13, 2025

An Awesome List of Reinforcement Learning-based Large Language Agent Works. Collect directly from official code base.

210 7 Updated Jul 11, 2025

PyTorch bindings for CUTLASS grouped GEMM.

Cuda 101 67 Updated May 29, 2025

Implementation and datasets for "Training Language Models to Generate Quality Code with Program Analysis Feedback"

Python 20 Updated Jun 10, 2025

Official repository for FLAME-MoE: A Transparent End-to-End Research Platform for Mixture-of-Experts Language Models

Jupyter Notebook 20 2 Updated Jul 8, 2025

Code for our tutorial on Discrete Variational Autoencoders

Python 3 Updated May 19, 2025

[TMLR] A curated list of language modeling researches for code (and other software engineering activities), plus related datasets.

2,718 178 Updated Jun 25, 2025

[ACL2025 Oral] Cuckoo: A Series of IE Free Riders Using LLM's Resources to Scale up Themselves.

Python 7 Updated Jul 5, 2025

A bibliography and survey of the papers surrounding o1

TeX 1,205 50 Updated Nov 16, 2024

GRadient-INformed MoE

263 14 Updated Sep 25, 2024

The road to hack SysML and become an system expert

Emacs Lisp 492 62 Updated Sep 25, 2024

A framework for the evaluation of autoregressive code generation language models.

Python 959 245 Updated Jul 1, 2025

Tools for merging pretrained large language models.

Python 6,016 579 Updated Jun 19, 2025

Large Language Model Text Generation Inference

Python 10,318 1,208 Updated Jul 8, 2025

GPU programming related news and material links

1,616 90 Updated Jan 6, 2025

CUDA Templates for Linear Algebra Subroutines

C++ 7,910 1,316 Updated Jul 6, 2025

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…

Jupyter Notebook 17,608 2,550 Updated Jul 12, 2025

Best practice for training LLaMA models in Megatron-LM

Python 657 57 Updated Jan 2, 2024

MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.

Python 2,031 186 Updated Jun 30, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 39,332 4,467 Updated Jul 13, 2025

The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.

Python 21,198 3,677 Updated Jul 4, 2024

A curated list of papers and applications on tool learning.

120 4 Updated Dec 27, 2023

A repo lists papers related to LLM based agent

Python 1,833 109 Updated Jul 12, 2025

Train transformer language models with reinforcement learning.

Python 14,582 2,036 Updated Jul 12, 2025

A curated list of awesome resources dedicated to Scaling Laws for LLMs

75 5 Updated Apr 10, 2023

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Python 57,163 6,872 Updated Jun 30, 2025
Next
0