Question 1

What is PKU-Alignment/safe-rlhf?

Accepted Answer

An RLHF framework for training value-aligned LLMs with safety constraints, developed by Peking University's alignment team.

Question 2

Is safe-rlhf open source?

Accepted Answer

Yes — PKU-Alignment/safe-rlhf is open source, released under the Apache-2.0 license.

Question 3

What language is safe-rlhf written in?

Accepted Answer

PKU-Alignment/safe-rlhf is primarily written in Python.

Question 4

How popular is safe-rlhf?

Accepted Answer

PKU-Alignment/safe-rlhf has 1.6k stars on GitHub.

Question 5

Where can I find safe-rlhf?

Accepted Answer

PKU-Alignment/safe-rlhf is on GitHub at https://github.com/PKU-Alignment/safe-rlhf.

Frequently asked