8000 Gelrignard (Enlin Gu) / Starred · GitHub

More Web Proxy on the site http://driver.im/

Gelrignard

Follow

Enlin Gu Gelrignard

Follow

MS student in ROBO & CIS @ GRASP, UPenn

22 followers · 74 following

University of Pennsylvania
PA

Highlights

Pro

Lists (2)

Sort

NN/LLM

13 repositories

RL

20 repositories

Stars

priyasundaresan / blender-rope-sim

Python 19 2 Updated Feb 27, 2021

ML-GSAI / LLaDA

Official PyTorch implementation for "Large Language Diffusion Models"

Python 2,223 146 Updated Jun 2, 2025

Thinklab-SJTU / Bench2Drive-VL

Adapting VLMs to Bench2Drive.

Python 119 18 Updated Jun 6, 2025

PRIME-RL / SimpleVLA-RL

Online RL with Simple Reward Enables Training VLA Models with Only One Trajectory

Python 183 3 Updated May 30, 2025

CompVis / latent-diffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Jupyter Notebook 12,952 1,619 Updated Feb 29, 2024

riddhiman13 / predictive-multi-agent-framework

Repository for predictive dual-arm reactive motion planning

C++ 56 9 Updated Jan 5, 2025

policy-gradient / GRPO-Zero

Implementing DeepSeek R1's GRPO algorithm from scratch

Python 1,407 57 Updated Apr 18, 2025

biliticket / BHYG

B站 BW bilibiliworld 会员购抢票脚本

817 111 Updated Jun 1, 2025

unslothai / unsloth

Finetune Qwen3, Llama 4, TTS, DeepSeek-R1 & Gemma 3 LLMs 2x faster with 70% less memory! 🦥

Python 40,296 3,194 Updated Jun 9, 2025

CIS548 / example-code

This is the repository for example code from Prof. Boon Thau Loo's Operating System Course

C 37 28 Updated Jun 14, 2023

Kaixhin / imitation-learning

Imitation learning algorithms

Python 535 43 Updated Mar 22, 2025

xiahongchi / DRAWER

Jupyter Notebook 68 7 Updated Apr 22, 2025

ai-dawang / PlugNPlay-Modules

Python 3,954 310 Updated Apr 18, 2025

LeapLabTHU / Agent-Attention

Official repository of Agent Attention (ECCV2024)

Python 623 40 Updated Nov 17, 2024

QwenLM / Qwen3

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 21,949 1,464 Updated Jun 9, 2025

huangwl18 / ReKep

ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robotic Manipulation

Python 786 81 Updated Feb 20, 2025

arobey1 / robopair

Python 13 1 Updated Mar 6, 2025

eth-ait / 4d-dress

Official repository for CVPR 2024 highlight paper 4D-DRESS: A 4D Dataset of Real-world Human Clothing with Semantic Annotations.

Python 114 3 Updated May 5, 2024

huggingface / diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

Python 29,300 6,014 Updated Jun 10, 2025

ermongroup / ddim

Denoising Diffusion Implicit Models

Python 1,646 218 Updated Jul 26, 2024

zbzhu99 / madiff

Implementation of "MADiff: Offline Multi-agent Learning with Diffusion Models"

Python 74 14 Updated Jan 23, 2025

Albusgive / mujoco_learning

C++ 223 21 Updated Jun 5, 2025

kuleshov-group / bd3lms

Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Python 681 38 Updated Apr 16, 2025

kindredresearch / arp

Autoregressive policies for continuous control reinforcement learning

Python 32 4 Updated May 15, 2019

buoyancy99 / diffusion-forcing

code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"

Python 886 44 Updated Apr 1, 2025

mlii / mfrl

Mean Field Multi-Agent Reinforcement Learning

Python 395 102 Updated Mar 11, 2020

GasaiYU / PartRM

Python 128 1 Updated Mar 24, 2025

mlzxy / arp

Autoregressive Policy for Robot Learning (RA-L 2025)

Python 121 9 Updated Mar 25, 2025

rail-berkeley / softlearning

Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.

Python 1,309 244 3544 Updated Nov 29, 2023

irom-princeton / dppo

Official implementation of Diffusion Policy Policy Optimization, arxiv 2024

Python 495 48 Updated Feb 4, 2025

0