EUNO.NEWS EUNO.NEWS
  • All (21023) +2
  • AI (3157)
  • DevOps (933) +1
  • Software (11078)
  • IT (5806)
  • Education (48)
  • Notice
  • All (21023) +2
    • AI (3157)
    • DevOps (933) +1
    • Software (11078)
    • IT (5806)
    • Education (48)
  • Notice
  • All (21023) +2
  • AI (3157)
  • DevOps (933) +1
  • Software (11078)
  • IT (5806)
  • Education (48)
  • Notice
Sources Tags Search
한국어 English 中文
  • 1 week ago · ai

    [Paper] Reference Games as a Testbed for the Alignment of Model Uncertainty and Clarification Requests

    In human conversation, both interlocutors play an active role in maintaining mutual understanding. When addressees are uncertain about what speakers mean, for e...

    #research #paper #ai #nlp
  • 1 week ago · ai

    [Paper] The Confidence Trap: Gender Bias and Predictive Certainty in LLMs

    The increased use of Large Language Models (LLMs) in sensitive domains leads to growing interest in how their confidence scores correspond to fairness and bias....

    #research #paper #ai #machine-learning #nlp
  • 1 week ago · ai

    [Paper] Learning Through Dialogue: Unpacking the Dynamics of Human-LLM Conversations on Political Issues

    Large language models (LLMs) are increasingly used as conversational partners for learning, yet the interactional dynamics supporting users' learning and engage...

    #research #paper #ai #nlp
  • 1 week ago · ai

    [Paper] Kinship Data Benchmark for Multi-hop Reasoning

    Large language models (LLMs) are increasingly evaluated on their ability to perform multi-hop reasoning, i.e., to combine multiple pieces of information into a ...

    #research #paper #ai #machine-learning #nlp
  • 1 week ago · ai

    [Paper] Beyond Single-Shot: Multi-step Tool Retrieval via Query Planning

    LLM agents operating over massive, dynamic tool libraries rely on effective retrieval, yet standard single-shot dense retrievers struggle with complex requests....

    #research #paper #ai #machine-learning #nlp
  • 1 week ago · ai

    [Paper] Enhancing Self-Correction in Large Language Models through Multi-Perspective Reflection

    While Chain-of-Thought (CoT) prompting advances LLM reasoning, challenges persist in consistency, accuracy, and self-correction, especially for complex or ethic...

    #research #paper #ai #nlp
  • 1 week ago · ai

    [Paper] OS-Symphony: A Holistic Framework for Robust and Generalist Computer-Using Agent

    While Vision-Language Models (VLMs) have significantly advanced Computer-Using Agents (CUAs), current frameworks struggle with robustness in long-horizon workfl...

    #research #paper #ai #machine-learning #nlp #computer-vision
  • 1 week ago · ai

    [Paper] Are LLM Decisions Faithful to Verbal Confidence?

    Large Language Models (LLMs) can produce surprisingly sophisticated estimates of their own uncertainty. However, it remains unclear to what extent this expresse...

    #research #paper #ai #machine-learning #nlp
  • 1 week ago · ai

    [Paper] Contrastive Learning with Narrative Twins for Modeling Story Salience

    Understanding narratives requires identifying which events are most salient for a story's progression. We present a contrastive learning framework for modeling ...

    #research #paper #ai #nlp
  • 1 week ago · ai

    [Paper] Structure First, Reason Next: Enhancing a Large Language Model using Knowledge Graph for Numerical Reasoning in Financial Documents

    Numerical reasoning is an important task in the analysis of financial documents. It helps in understanding and performing numerical predictions with logical con...

    #research #paper #ai #nlp
  • 1 week ago · ai

    How AI Can Become Your Personal Language Tutor

    How I used n8n to build AI study partners for learning Mandarin: vocabulary, listening, and pronunciation correction. The post How AI Can Become Your Personal L...

    #AI tutoring #language learning #Mandarin #n8n #vocabulary #pronunciation correction #NLP
  • 1 week ago · ai

    When Does Adding Fancy RAG Features Work?

    Looking at the performance of different pipelines The post When Does Adding Fancy RAG Features Work? appeared first on Towards Data Science....

    #retrieval-augmented-generation #RAG #LLM #prompt-engineering #pipeline-performance #NLP #AI-tools

Newer posts

Older posts
EUNO.NEWS
RSS GitHub © 2026