As long as you don't ask it about the Chinese government or anything historical relating to China, it works well. Otherwise it will start militantly defending anything China-related.
A lot of other models have a liberal leaning, but with R1 there is zero nuance on certain questions.
It's better to wait for a similar model that is trained with different safeguards. This project is trying to do that (github.com/huggingface/open-r1), but there will be many more.