Hear me out: What if the Chinese translations of mathematical problems present in English test sets (e.g. MATH) were not filtered from the pre-training corpora of Qwen and DeepSeek? this means the knowledge is there, just translated. This would also explain the language switching when RL-ing CoT ๐
1 year ago
7
2
1
0