[ICLR 2025] IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation
Dataset collection and preprocessing framework for NLP extreme multitask learning
Efficient LLM inference on Slurm clusters using vLLM.
Official implementation of the ICLR 2025 paper "Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and Alternatives"
[CVPR 2025] Science-T2I: Addressing Scientific Illusions in Image Synthesis
Learning to route instances for Human vs AI Feedback
[ACL 2024 Findings] DMoERM: Recipes of Mixture-of-Experts for Effective Reward Modeling
The code used in the paper "DogeRM: Equipping Reward Models with Domain Knowledge through Model Merging"
Building an LLM with RLHF: fine-tuning on human-labeled preferences. Following "Learning to Summarize from Human Feedback", it combines supervised fine-tuning, reward modeling, and PPO to improve response quality and alignment (see the reward-model training sketch after this list).
Source code of our paper "Transferring Textual Preferences to Vision-Language Understanding through Model Merging"
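The reward-modeling step referenced above is commonly trained with a pairwise Bradley-Terry objective over preference pairs. The following is a minimal PyTorch sketch of that idea, not code from any repository listed here; the RewardModel class and its sizes are hypothetical stand-ins for a real scalar-head scorer over prompt+response tokens.

# Minimal sketch of pairwise (Bradley-Terry) reward-model training.
# RewardModel is a hypothetical toy scorer, not from the repos above.
import torch
import torch.nn as nn

class RewardModel(nn.Module):
    """Toy scorer: embeds token ids and maps the mean embedding to a scalar reward."""
    def __init__(self, vocab_size: int = 32000, hidden: int = 128):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, hidden)
        self.head = nn.Linear(hidden, 1)

    def forward(self, token_ids: torch.Tensor) -> torch.Tensor:
        # token_ids: (batch, seq_len) -> one scalar reward per sequence
        return self.head(self.embed(token_ids).mean(dim=1)).squeeze(-1)

def bradley_terry_loss(r_chosen: torch.Tensor, r_rejected: torch.Tensor) -> torch.Tensor:
    # Maximize P(chosen preferred over rejected) = sigmoid(r_chosen - r_rejected)
    return -torch.nn.functional.logsigmoid(r_chosen - r_rejected).mean()

if __name__ == "__main__":
    model = RewardModel()
    opt = torch.optim.AdamW(model.parameters(), lr=1e-4)
    # Dummy preference pair: token ids for (prompt + chosen) and (prompt + rejected)
    chosen = torch.randint(0, 32000, (4, 64))
    rejected = torch.randint(0, 32000, (4, 64))
    loss = bradley_terry_loss(model(chosen), model(rejected))
    loss.backward()
    opt.step()
    print(f"pairwise loss: {loss.item():.4f}")

In a full RLHF pipeline this trained reward model then scores rollouts from the policy during the PPO stage.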