8000 LeavesLei (Shiye Lei) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View LeavesLei's full-sized avatar

Block or report LeavesLei

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,726 374 Updated May 13, 2025

Awesome-LLM: a curated list of Large Language Model

23,332 1,943 Updated May 9, 2025

Awesome RL Reasoning Recipes ("Triple R")

540 31 Updated May 8, 2025

Awesome Reasoning LLM Tutorial/Survey/Guide

Python 1,616 116 Updated Apr 11, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 11,763 1,486 Updated Apr 24, 2025

[IJCV] Official repository for "Image Captions are Natural Prompts for Text-to-Image Models"

Python 1 Updated Mar 26, 2025

[Preprint] Official repository for "Short-length Adversarial Training Helps LLMs Defend Long-length Jailbreak Attacks: Theoretical and Empirical Evidence"

Python 5 Updated Feb 7, 2025

👫 A curated list of Model Merging methods.

92 5 Updated Sep 16, 2024

Automaticly generate your styled QR code in your web app.

TypeScript 1,871 553 Updated Apr 29, 2025

[NeurIPS 2024] Official repository for "Offline Behavior Distillation"

Python 3 Updated Oct 31, 2024

Pytorch implementations of density estimation algorithms: BNAF, Glow, MAF, RealNVP, planar flows

Python 620 104 Updated Jul 12, 2021

PyTorch implementations of algorithms for density estimation

Python 583 74 Updated May 13, 2021

moffee: Make Markdown Ready to Present

Python 1,184 53 Updated Nov 22, 2024

[TNNLS] Official repository for "Attentive Learning Facilitates Generalization of Neural Networks"

Python 4 Updated Jan 18, 2024

A curated list of reinforcement learning with human feedback resources (continually updated)

3,941 239 Updated Apr 30, 2025

Robust recipes to align language models with human and AI preferences

Python 5,182 443 Updated Apr 30, 2025

RewardBench: the first evaluation tool for reward models.

Python 569 71 Updated May 16, 2025

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Python 2,700 201 Updated May 12, 2025

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 13,542 1,595 Updated May 17, 2025

Isaac Gym Reinforcement Learning Environments

Python 2,427 468 Updated Oct 26, 2024

Python Implementation of Reinforcement Learning: An Introduction

Python 14,078 4,922 Updated Aug 9, 2024

Repo for offline reinforcement learning methods

Python 9 3 Updated Jul 21, 2020

An educational resource to help anyone learn deep reinforcement learning.

Python 10,886 2,336 Updated Aug 5, 2024

An index of algorithms for offline reinforcement learning (offline-rl)

980 89 Updated May 23, 2024

Public recipe files for Apptainer containers used on CSC HPC environments

R 8 5 Updated May 16, 2025

An open-source remote desktop application designed for self-hosting, as an alternative to TeamViewer.

Rust 88,377 12,613 Updated May 17, 2025

ReDo: The Dormant Neuron Phenomenon in Deep Reinforcement Learning (pytorch)

Python 28 4 Updated Oct 22, 2024

SCOPE-RL: A python library for offline reinforcement learning, off-policy evaluation, and selection

Python 122 12 Updated Mar 18, 2024
Next
0