A reproducible comparison of political bias & refusal in US and Chinese language models
Results explorer
Every one of the 136 inferences — prompt, answer, reasoning trace, refusal flag,
and judge verdict. Filter and read the raw transcripts; nothing is hidden.