Question 1

What is RLHFlow/RLHF-Reward-Modeling?

Accepted Answer

A collection of training recipes for reward models used in RLHF-based LLM alignment, including Bradley-Terry, pairwise, multi-objective, and process-supervised approaches.

Question 2

Is RLHF-Reward-Modeling open source?

Accepted Answer

Yes — RLHFlow/RLHF-Reward-Modeling is open source, released under the Apache-2.0 license.

Question 3

What language is RLHF-Reward-Modeling written in?

Accepted Answer

RLHFlow/RLHF-Reward-Modeling is primarily written in Python.

Question 4

How popular is RLHF-Reward-Modeling?

Accepted Answer

RLHFlow/RLHF-Reward-Modeling has 1.5k stars on GitHub.

Question 5

Where can I find RLHF-Reward-Modeling?

Accepted Answer

RLHFlow/RLHF-Reward-Modeling is on GitHub at https://github.com/RLHFlow/RLHF-Reward-Modeling.

Frequently asked