8000 LeavesLei (Shiye Lei) / Starred · GitHub

More Web Proxy on the site http://driver.im/

LeavesLei

Follow

Shiye Lei LeavesLei

Follow

AI/ML PhD Student @ USYD

12 followers · 9 following

USYD
Sydney, Australia
shiyelei.com

Stars

hijkzzz / Awesome-LLM-Strawberry

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,726 374 Updated May 13, 2025

Hannibal046 / Awesome-LLM

Awesome-LLM: a curated list of Large Language Model

23,332 1,943 Updated May 9, 2025

TsinghuaC3I / Awesome-RL-Reasoning-Recipes

Awesome RL Reasoning Recipes ("Triple R")

540 31 Updated May 8, 2025

mbzuai-oryx / Awesome-LLM-Post-training

Awesome Reasoning LLM Tutorial/Survey/Guide

Python 1,616 116 Updated Apr 11, 2025

Jiayi-Pan / TinyZero

Minimal reproduction of DeepSeek R1-Zero

Python 11,763 1,486 Updated Apr 24, 2025

LeavesLei / Caption_in_Prompt

[IJCV] Official repository for "Image Captions are Natural Prompts for Text-to-Image Models"

Python 1 Updated Mar 26, 2025

fshp971 / adv-ICL

[Preprint] Official repository for "Short-length Adversarial Training Helps LLMs Defend Long-length Jailbreak Attacks: Theoretical and Empirical Evidence"

Python 5 Updated Feb 7, 2025

deepseek-ai / DeepSeek-V3

Python 96,842 15,744 Updated Apr 9, 2025

ycjing / Awesome-Model-Merging

👫 A curated list of Model Merging methods.

92 5 Updated Sep 16, 2024

kozakdenys / qr-code-styling

Automaticly generate your styled QR code in your web app.

TypeScript 1,871 553 Updated Apr 29, 2025

LeavesLei / OBD

[NeurIPS 2024] Official repository for "Offline Behavior Distillation"

Python 3 Updated Oct 31, 2024

kamenbliznashki / normalizing_flows

Pytorch implementations of density estimation algorithms: BNAF, Glow, MAF, RealNVP, planar flows

Python 620 104 Updated Jul 12, 2021

ikostrikov / pytorch-flows

PyTorch implementations of algorithms for density estimation

Python 583 74 Updated May 13, 2021

BMPixel / moffee

moffee: Make Markdown Ready to Present

Python 1,184 53 Updated Nov 22, 2024

LeavesLei / attentive_learning

[TNNLS] Official repository for "Attentive Learning Facilitates Generalization of Neural Networks"

Python 4 Updated Jan 18, 2024

opendilab / awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)

3,941 239 Updated Apr 30, 2025

huggingface / alignment-handbook

Robust recipes to align language models with human and AI preferences

Python 5,182 443 Updated Apr 30, 2025

allenai / reward-bench

RewardBench: the first evaluation tool for reward models.

Python 569 71 Updated May 16, 2025

argilla-io / distilabel

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Python 2,700 201 Updated May 12, 2025

huggingface / lerobot

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 13,542 1,595 Updated May 17, 2025

isaac-sim / IsaacGymEnvs

Isaac Gym Reinforcement Learning Environments

Python 2,427 468 Updated Oct 26, 2024

ShangtongZhang / reinforcement-learning-an-introduction

Python Implementation of Reinforcement Learning: An Introduction

Python 14,078 4,922 Updated Aug 9, 2024

princetonvisualai / RememberThePast-DatasetDistillation

Python 39 8 Updated Nov 19, 2022

RchalYang / offlinerl

Repo for offline reinforcement learning methods

Python 9 3 Updated Jul 21, 2020

openai / spinningup

An educational resource to help anyone learn deep reinforcement learning.

Python 10,886 2,336 Updated Aug 5, 2024

hanjuku-kaso / awesome-offline-rl

An index of algorithms for offline reinforcement learning (offline-rl)

980 89 Updated May 23, 2024

CSCfi / singularity-recipes

Public recipe files for Apptainer containers used on CSC HPC environments

R 8 5 Updated May 16, 2025

rustdesk / rustdesk

An open-source remote desktop application designed for self-hosting, as an alternative to TeamViewer.

Rust 88,377 12,613 Updated May 17, 2025

timoklein / redo

ReDo: The Dormant Neuron Phenomenon in Deep Reinforcement Learning (pytorch)

Python 28 4 Updated Oct 22, 2024

hakuhodo-technologies / scope-rl

SCOPE-RL: A python library for offline reinforcement learning, off-policy evaluation, and selection

Python 122 12 Updated Mar 18, 2024

0