- San Diego, CA
- https://snyhlxde1.github.io
- @Lanxiang_Hu
Highlights
- Pro
Stars
TradingAgents: Multi-Agents LLM Financial Trading Framework
🤖 RoboOS: A Universal Embodied Operating System for Cross-Embodied and Multi-Robot Collaboration
[ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples
An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities
[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
Radial Attention Official Implementation
SkyRL: A Modular Full-stack RL Library for LLMs
Genai-bench is a powerful benchmark tool designed for comprehensive token-level performance evaluation of large language model (LLM) serving systems.
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
A python interface for training Reinforcement Learning bots to battle on pokemon showdown
Official repository of the paper, PokeChamp: an Expert-level Minimax Language Agent for Competitive Pokemon.
BIRD-CRITIC 1.0: Can Large Language Models Solve USER SQL Issues in Real-World Database Applications?
Benchmarking the Spectrum of Agent Capabilities
Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"
Examples of programs built using Modal
Train your Agent model via our easy and efficient framework
A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites
Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"
A non-saturating, open-ended environment for evaluating LLMs in Factorio
[NeurIPS 2024] Official Repository of The Mamba in the Llama: Distilling and Accelerating Hybrid Models
Reinforcement Learning environments based on the 1993 game Doom