[Paper] Thinking by Doing: Building Efficient World Model Reasoning in LLMs via Multi-turn Interaction
Developing robust world model reasoning is crucial for large language model (LLM) agents to plan and interact in complex environments. While multi-turn interact...