8000 snyhlxde1 (Lanxiang Hu) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View snyhlxde1's full-sized avatar
😎
hack like a chammmmpionzee
😎
hack like a chammmmpionzee

Highlights

  • Pro

Block or report snyhlxde1

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

TradingAgents: Multi-Agents LLM Financial Trading Framework

Python 10,197 1,597 Updated Jul 3, 2025

🤖 RoboOS: A Universal Embodied Operating System for Cross-Embodied and Multi-Robot Collaboration

Python 100 11 Updated Jul 3, 2025

[ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples

PDDL 99 7 Updated Jun 8, 2025

An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities

Python 3,008 440 Updated May 28, 2025

[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models

Python 2,238 116 Updated Jul 1, 2025

Radial Attention Official Implementation

Python 255 10 Updated Jul 3, 2025

SkyRL: A Modular Full-stack RL Library for LLMs

Python 532 51 Updated Jul 4, 2025

Genai-bench is a powerful benchmark tool designed for comprehensive token-level performance evaluation of large language model (LLM) serving systems.

Python 150 6 Updated Jul 3, 2025

StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation

Python 10,275 781 Updated Dec 4, 2024

A python interface for training Reinforcement Learning bots to battle on pokemon showdown

Python 374 113 Updated Jun 29, 2025

Official repository of the paper, PokeChamp: an Expert-level Minimax Language Agent for Competitive Pokemon.

Python 65 7 Updated Mar 28, 2025

Code and Data for Tau-Bench

Python 645 93 Updated Jan 22, 2025

BIRD-CRITIC 1.0: Can Large Language Models Solve USER SQL Issues in Real-World Database Applications?

Python 680 32 Updated Jun 28, 2025

mGBA Game Boy Advance Emulator

C 6,301 856 Updated Jul 1, 2025

Benchmarking the Spectrum of Agent Capabilities

Python 454 71 Updated Jan 23, 2024
C++ 3 Updated Jun 9, 2025
Python 74 3 Updated Jun 17, 2025

Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"

Python 269 14 Updated Jun 27, 2025

Dream 7B, a large diffusion language model

Python 799 39 Updated Jun 18, 2025

Examples of programs built using Modal

Python 890 225 Updated Jul 4, 2025

Train your Agent model via our easy and efficient framework

Python 1,237 109 Updated Jul 1, 2025

A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites

3,808 297 Updated May 27, 2025

Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"

Python 226 29 Updated Jul 1, 2025

A non-saturating, open-ended environment for evaluating LLMs in Factorio

Python < 533B /span> 742 46 Updated Jul 3, 2025

[NeurIPS 2024] Official Repository of The Mamba in the Llama: Distilling and Accelerating Hybrid Models

Python 221 19 Updated May 3, 2025

Reinforcement Learning environments based on the 1993 game Doom :godmode:

C++ 1,858 419 Updated Jun 29, 2025

A JAX-native LLM Post-Training Library

Python 60 11 Updated Jul 3, 2025
Next
0