EUNO.NEWS EUNO.NEWS
  • All (2571) +248
  • AI (578) +19
  • DevOps (150) +2
  • Software (1091) +156
  • IT (746) +70
  • Education (6) +1
  • Notice
  • All (2571) +248
    • AI (578) +19
    • DevOps (150) +2
    • Software (1091) +156
    • IT (746) +70
    • Education (6) +1
  • Notice
  • All (2571) +248
  • AI (578) +19
  • DevOps (150) +2
  • Software (1091) +156
  • IT (746) +70
  • Education (6) +1
  • Notice
Sources Tags Search
한국어 English 中文
  • 1 week ago · ai

    [Paper] Escaping the Verifier: Learning to Reason via Demonstrations

    Training Large Language Models (LLMs) to reason often relies on Reinforcement Learning (RL) with task-specific verifiers. However, many real-world reasoning-int...

    #LLM #reinforcement learning #reasoning #research paper
  • 1 week ago · ai

    [Paper] Aligning LLMs Toward Multi-Turn Conversational Outcomes Using Iterative PPO

    Optimizing large language models (LLMs) for multi-turn conversational outcomes remains a significant challenge, especially in goal-oriented settings like AI mar...

    #LLM #reinforcement learning #PPO #RLHF #goal-oriented dialogue
  • 1 week ago · ai

    [Paper] BAMAS: Structuring Budget-Aware Multi-Agent Systems

    Large language model (LLM)-based multi-agent systems have emerged as a powerful paradigm for enabling autonomous agents to solve complex tasks. As these systems...

    #budget-aware AI #multi-agent systems #LLM cost optimization #integer linear programming #reinforcement learning
EUNO.NEWS
RSS GitHub © 2025