
A lot of other models have a liberal leaning, but with R1 there is zero nuance on certain questions.

It's better to wait for a similar model trained with different safeguards. This project is trying to do that (github.com/huggingface/open-r1), but there will be many more.
