XueruiSu

Happy XueruiSu

Achievements

Trust-Region-Preference-Approximation Trust-Region-Preference-Approximation Public

Forked from volcengine/verl

Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning

Python 10
Reproduce-DeepSeek-R1-Survey Reproduce-DeepSeek-R1-Survey Public

This repository collects various works that reproduce DeepSeek R1, as well as works related to DeepSeek R1 and the DeepSeek series.

16
EnResLSTM EnResLSTM Public

基于OneFlow的高速公路车流量预测算法

Python 4 1
Diffusion-Demo Diffusion-Demo Public

Modified version from Google Colab(https://colab.research.google.com/drive/1sjy9odlSSy0RBVgMTgP7s99NXsqglsUL?usp=sharing#scrollTo=BIc33L9-uK4q)

Python
HRRR_NWP_baseline HRRR_NWP_baseline Public

Shell 1
jsikyoon/dreamer-torch jsikyoon/dreamer-torch Public

Pytorch version of Dreamer, which follows the original TF v2 codes.

Python 126 23