8000 yflyzhang's list / LLM Β· GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View yflyzhang's full-sized avatar

Block or report yflyzhang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

LLM

51 repositories

Large Language Model Text Generation Inference

Python 10,190 1,194 Updated Jun 6, 2025

Secrets of RLHF in Large Language Models Part I: PPO

Python 1,368 101 Updated Mar 3, 2024

Neo4j graph construction from unstructured data using LLMs

Jupyter Notebook 3,559 602 Updated Jun 5, 2025

Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"

1,750 139 Updated Sep 19, 2023

Code for "Learning to summarize from human feedback"

Python 1,026 148 Updated Sep 5, 2023

Code for the paper Fine-Tuning Language Models from Human Preferences

Python 1,336 170 Updated Jul 25, 2023

The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization

Python 786 111 Updated Mar 23, 2024

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 7,198 764 Updated Apr 8, 2025

An open-source RAG-based tool for chatting with your documents.

Python 22,415 1,778 Updated Jun 6, 2025

Code for the paper "Training Diffusion Models with Reinforcement Learning"

Python 463 29 Updated Jul 5, 2023

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Python 16,310 4,906 Updated Aug 1, 2024

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 πŸ“ and reasoning techniques.

6,744 376 Updated Jun 5, 2025

A blazing fast inference solution for text embeddings models

Rust 3,657 269 Updated Jun 6, 2025

Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization

JavaScript 4,540 451 Updated Jun 6, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 49,083 7,836 Updated Jun 7, 2025

πŸ€— Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

Python 29,264 6,005 Updated Jun 7, 2025

πŸš€ A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 8,810 1,121 Updated Jun 6, 2025

Make huge neural nets fit in memory

Python 2,796 275 Updated Apr 26, 2020

Code for Adam-mini: Use Fewer Learning Rates To Gain More https://arxiv.org/abs/2406.16793

Python 417 14 Updated May 13, 2025

MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.

Python 2,019 184 Updated Mar 26, 2025

Mix of Minimal Optimal Sets (MMOS) of dataset has two advantages for two aspects, higher performance and lower construction costs on math reasoning.

Python 73 3 Updated Jul 27, 2024

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.

Go 143,027 12,013 Updated Jun 7, 2025

[ICLR 2023] ReAct: Synergizing Reasoning and Acting in Language Models

Jupyter Notebook 2,729 279 Updated Feb 6, 2024

Streamlit LLM app examples for getting started

Python 812 1,644 Updated Jan 30, 2025

Streamlit β€” A faster way to build and share data apps.

Python 39,763 3,491 Updated Jun 7, 2025

Utilities intended for use with Llama models.

Python 7,055 1,173 Updated Jun 2, 2025

Agentic components of the Llama Stack APIs

4,253 627 Updated Apr 30, 2025

Rapidly build AI apps in Python

Python 6,300 316 Updated Jun 6, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 50,763 7,385 Updated Apr 20, 2025
0