Question 1

What is uclaml/SPPO?

Accepted Answer

Self-Play Preference Optimization is a self-play framework for language model alignment with a new learning objective, released with trained model weights.

Question 2

Is SPPO open source?

Accepted Answer

Yes — uclaml/SPPO is open source, released under the Apache-2.0 license.

Question 3

What language is SPPO written in?

Accepted Answer

uclaml/SPPO is primarily written in Python.

Question 4

How popular is SPPO?

Accepted Answer

uclaml/SPPO has 590 stars on GitHub.

Question 5

Where can I find SPPO?

Accepted Answer

uclaml/SPPO is on GitHub at https://github.com/uclaml/SPPO.

Frequently asked