[Paper] Self-Improving Multilingual Long Reasoning via Translation-Reasoning Integrated Training
Long reasoning models often struggle in multilingual settings: they tend to reason in English for non-English questions; when constrained to reasoning in the qu...