yflyzhang's list / LLM · GitHub

yflyzhang

14 followers · 5 following

Stars

LLM

51 repositories

huggingface / text-generation-inference

Large Language Model Text Generation Inference

Python 10,190 1,194 Updated Jun 6, 2025

OpenLMLab / MOSS-RLHF

Secrets of RLHF in Large Language Models Part I: PPO

Python 1,368 101 Updated Mar 3, 2024

neo4j-labs / llm-graph-builder

Neo4j graph construction from unstructured data using LLMs

Jupyter Notebook 3,559 602 Updated Jun 5, 2025

anthropics / hh-rlhf

Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"

1,750 139 Updated Sep 19, 2023

openai / summarize-from-feedback

Code for "Learning to summarize from human feedback"

Python 1,026 148 Updated Sep 5, 2023

openai / lm-human-preferences

Code for the paper Fine-Tuning Language Models from Human Preferences

Python 1,336 170 Updated Jul 25, 2023

vwxyzjn / ppo-implementation-details

The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization

Python 786 111 Updated Mar 23, 2024

vwxyzjn / cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 7,198 764 Updated Apr 8, 2025

Cinnamon / kotaemon

An open-source RAG-based tool for chatting with your documents.

Python 22,415 1,778 Updated Jun 6, 2025

jannerm / ddpo

Code for the paper "Training Diffusion Models with Reinforcement Learning"

Python 463 29 Updated Jul 5, 2023

openai / baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Python 16,310 4,906 Updated Aug 1, 2024

hijkzzz / Awesome-LLM-Strawberry

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,744 376 Updated Jun 5, 2025

huggingface / text-embeddings-inference

A blazing fast inference solution for text embeddings models

Rust 3,657 269 Updated Jun 6, 2025

BlackPearl-Lab / KddCup-2024-OAG-Challenge-1st-Solutions

Python 175 36 Updated Jul 9, 2024

poloclub / transformer-explainer

Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization

JavaScript 4,540 451 Updated Jun 6, 2025

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 49,083 7,836 Updated Jun 7, 2025

huggingface / diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

Python 29,264 6,005 Updated Jun 7, 2025

huggingface / accelerate

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 8,810 1,121 Updated Jun 6, 2025

cybertronai / gradient-checkpointing

Make huge neural nets fit in memory

Python 2,796 275 Updated Apr 26, 2020

zyushun / Adam-mini

Code for Adam-mini: Use Fewer Learning Rates To Gain More https://arxiv.org/abs/2406.16793

Python 417 14 Updated May 13, 2025

deepspeedai / DeepSpeed-MII

MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.

Python 2,019 184 Updated Mar 26, 2025

cyzhh / MMOS

Mix of Minimal Optimal Sets (MMOS) is a dataset with two advantages for math reasoning: higher performance and lower construction cost.

Python 73 3 Updated Jul 27, 2024

ollama / ollama

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.

Go 143,027 12,013 Updated Jun 7, 2025

ysymyth / ReAct

[ICLR 2023] ReAct: Synergizing Reasoning and Acting in Language Models

Jupyter Notebook 2,729 279 Updated Feb 6, 2024

streamlit / llm-examples

Streamlit LLM app examples for getting started

Python 812 1,644 Updated Jan 30, 2025

streamlit / streamlit

Streamlit — A faster way to build and share data apps.

Python 39,763 3,491 Updated Jun 7, 2025

meta-llama / llama-models

Utilities intended for use with Llama models.

Python 7,055 1,173 Updated Jun 2, 2025

meta-llama / llama-stack-apps

Agentic components of the Llama Stack APIs

4,253 627 Updated Apr 30, 2025

mesop-dev / mesop

Rapidly build AI apps in Python

Python 6,300 316 Updated Jun 6, 2025

rasbt / LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 50,763 7,385 Updated Apr 20, 2025
