EUNO.NEWS EUNO.NEWS
  • All (2571) +248
  • AI (578) +19
  • DevOps (150) +2
  • Software (1091) +156
  • IT (746) +70
  • Education (6) +1
  • Notice
  • All (2571) +248
    • AI (578) +19
    • DevOps (150) +2
    • Software (1091) +156
    • IT (746) +70
    • Education (6) +1
  • Notice
  • All (2571) +248
  • AI (578) +19
  • DevOps (150) +2
  • Software (1091) +156
  • IT (746) +70
  • Education (6) +1
  • Notice
Sources Tags Search
한국어 English 中文
  • 4 days ago · ai

    Why AI Alignment Starts With Better Evaluation

    You can’t align what you don’t evaluate The post Why AI Alignment Starts With Better Evaluation appeared first on Towards Data Science....

    #AI alignment #evaluation #AI safety #machine learning #LLM
  • 1 week ago · ai

    [Paper] Escaping the Verifier: Learning to Reason via Demonstrations

    Training Large Language Models (LLMs) to reason often relies on Reinforcement Learning (RL) with task-specific verifiers. However, many real-world reasoning-int...

    #LLM #reinforcement learning #reasoning #research paper
  • 1 week ago · ai

    [Paper] Aligning LLMs Toward Multi-Turn Conversational Outcomes Using Iterative PPO

    Optimizing large language models (LLMs) for multi-turn conversational outcomes remains a significant challenge, especially in goal-oriented settings like AI mar...

    #LLM #reinforcement learning #PPO #RLHF #goal-oriented dialogue
  • 1 week ago · ai

    [Paper] Can LLMs extract human-like fine-grained evidence for evidence-based fact-checking?

    Misinformation frequently spreads in user comments under online news articles, highlighting the need for effective methods to detect factually incorrect informa...

    #LLM #evidence extraction #fact-checking #multilingual dataset #benchmark
  • 1 week ago · ai

    [Paper] Even with AI, Bijection Discovery is Still Hard: The Opportunities and Challenges of OpenEvolve for Novel Bijection Construction

    Evolutionary program synthesis systems such as AlphaEvolve, OpenEvolve, and ShinkaEvolve offer a new approach to AI-assisted mathematical discovery. These syste...

    #LLM #evolutionary algorithms #bijection discovery #combinatorial mathematics #OpenEvolve
  • 1 week ago · ai

    [Paper] Beluga: A CXL-Based Memory Architecture for Scalable and Efficient LLM KVCache Management

    The rapid increase in LLM model sizes and the growing demand for long-context inference have made memory a critical bottleneck in GPU-accelerated serving system...

    #CXL #LLM #KVCache #memory architecture #inference acceleration
  • 1 week ago · ai

    [Paper] CodeFuse-CommitEval: Towards Benchmarking LLM's Power on Commit Message and Code Change Inconsistency Detection

    Version control relies on commit messages to convey the rationale for code changes, but these messages are often low quality and, more critically, inconsistent ...

    #LLM #benchmark #commit-message inconsistency #software engineering #code review
  • 1 week ago · ai

    [Paper] Can LLMs Recover Program Semantics? A Systematic Evaluation with Symbolic Execution

    Obfuscation poses a persistent challenge for software engineering tasks such as program comprehension, maintenance, testing, and vulnerability detection. While ...

    #LLM #symbolic execution #code deobfuscation #program semantics #research

Newer posts

Older posts
EUNO.NEWS
RSS GitHub © 2025