NLP | EUNO.NEWS

Sort:

2 days ago · ai · - · -

Beyond Vector Search: Why GraphRAG is the Next Frontier for LLMs

Beyond Vector Search: Why GraphRAG is the Next Frontier for LLMs For the past year, the industry standard for augmenting LLMs has been Retrieval-Augmented Gene...

#LLM #Retrieval-Augmented Generation #GraphRAG #vector search #knowledge graphs #AI research #NLP
3 days ago · ai · - · -

[Paper] LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling

Test-time scaling (TTS) has become an effective approach for improving large language model performance by allocating additional computation during inference. H...

#research #paper #ai #nlp
3 days ago · ai · - · -

[Paper] Conformal Path Reasoning: Trustworthy Knowledge Graph Question Answering via Path-Level Calibration

Knowledge Graph Question Answering (KGQA) has shown promise for grounded and interpretable reasoning, yet existing approaches often fail to provide reliable cov...

#research #paper #ai #nlp
3 days ago · ai · - · -

[Paper] The Memory Curse: How Expanded Recall Erodes Cooperative Intent in LLM Agents

Context window expansion is often treated as a straightforward capability upgrade for LLMs, but we find it systematically fails in multi-agent social dilemmas. ...

#research #paper #ai #machine-learning #nlp
3 days ago · ai · - · -

[Paper] CA-SQL: Complexity-Aware Inference Time Reasoning for Text-to-SQL via Exploration and Compute Budget Allocation

While recent advancements in inference-time learning have improved LLM reasoning on Text-to-SQL tasks, current solutions still struggle to perform well on the m...

#research #paper #ai #machine-learning #nlp
3 days ago · ai · - · -

[Paper] Accurate and Efficient Statistical Testing for Word Semantic Breadth

Measuring the breadth of a word's meaning, or its spread across contexts, has become feasible with contextualized token embeddings. A word type can be represent...

#research #paper #ai #nlp
3 days ago · ai · - · -

[Paper] Uncertainty-Aware Structured Data Extraction from Full CMR Reports via Distilled LLMs

Converting free-text cardiac magnetic resonance (CMR) reports into auditable structured data remains a bottleneck for cohort assembly, longitudinal curation, an...

#research #paper #ai #nlp
3 days ago · ai · - · -

[Paper] Fast Byte Latent Transformer

Recent byte-level language models (LMs) match the performance of token-level models without relying on subword vocabularies, yet their utility is limited by slo...

#research #paper #ai #machine-learning #nlp
3 days ago · ai · - · -

[Paper] Position: Mechanistic Interpretability Must Disclose Identification Assumptions for Causal Claims

Mechanistic interpretability papers increasingly use causal vocabulary: circuits, mediators, causal abstraction, monosemanticity. Such claims require explicit i...

#research #paper #ai #machine-learning #nlp
3 days ago · ai · - · -

[Paper] Tool Calling is Linearly Readable and Steerable in Language Models

When a tool-calling agent picks the wrong tool, the failure is invisible until execution: the email gets sent, the meeting gets missed. Probing 12 instruction-t...

#research #paper #ai #machine-learning #nlp
3 days ago · ai · - · -

[Paper] GLiGuard: Schema-Conditioned Classification for LLM Safeguard

Ensuring safe, policy-compliant outputs from large language models requires real-time content moderation that can scale across multiple safety dimensions. Howev...

#research #paper #ai #nlp
4 days ago · ai · - · -

[Paper] EMO: Pretraining Mixture of Experts for Emergent Modularity

Large language models are typically deployed as monolithic systems, requiring the full model even when applications need only a narrow subset of capabilities, e...

#research #paper #ai #nlp
4 days ago · ai · - · -

[Paper] Verifier-Backed Hard Problem Generation for Mathematical Reasoning

Large Language Models (LLMs) demonstrate strong capabilities for solving scientific and mathematical problems, yet they struggle to produce valid, challenging, ...

#research #paper #ai #machine-learning #nlp
4 days ago · ai · - · -

[Paper] When No Benchmark Exists: Validating Comparative LLM Safety Scoring Without Ground-Truth Labels

Many deployments must compare candidate language models for safety before a labeled benchmark exists for the relevant language, sector, or regulatory regime. We...

#research #paper #ai #machine-learning #nlp
4 days ago · ai · - · -

[Paper] Beyond Negative Rollouts: Positive-Only Policy Optimization with Implicit Negative Gradients

Reinforcement learning with verifiable rewards (RLVR), due to the deterministic verification, becomes a dominant paradigm for enhancing the reasoning ability of...

#research #paper #ai #nlp
4 days ago · ai · - · -

[Paper] StraTA: Incentivizing Agentic Reinforcement Learning with Strategic Trajectory Abstraction

Large language models (LLMs) are increasingly used as interactive agents, but optimizing them for long-horizon decision making remains difficult because current...

#research #paper #ai #machine-learning #nlp
4 days ago · ai · - · -

[Paper] Recursive Agent Optimization

We introduce Recursive Agent Optimization (RAO), a reinforcement learning approach for training recursive agents: agents that can spawn and delegate sub-tasks t...

#research #paper #ai #machine-learning #nlp
4 days ago · ai · - · -

[Paper] Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key

Reinforcement learning (RL) has been applied to improve large language model (LLM) reasoning, yet the systematic study of how training scales with task difficul...

#research #paper #ai #machine-learning #nlp
4 days ago · ai · - · -

[Paper] Cited but Not Verified: Parsing and Evaluating Source Attribution in LLM Deep Research Agents

Large language models (LLMs) power deep research agents that synthesize information from hundreds of web sources into cited reports, yet these citations cannot ...

#research #paper #ai #nlp
4 days ago · ai · - · -

[Paper] Parser agreement and disagreement in L2 Korean UD: Implications for human-in-the-loop annotation

We propose a simplified human-in-the-loop workflow for second language (L2) Korean morphosyntactic annotation by leveraging agreement between two domain-adapted...

#research #paper #ai #nlp
4 days ago · ai · - · -

[Paper] MASPO: Joint Prompt Optimization for LLM-based Multi-Agent Systems

Large language model (LLM)-based Multi-agent systems (MAS) have shown promise in tackling complex collaborative tasks, where agents are typically orchestrated v...

#research #paper #ai #machine-learning #nlp
4 days ago · ai · - · -

[Paper] Continuous Latent Diffusion Language Model

Large language models have achieved remarkable success under the autoregressive paradigm, yet high-quality text generation need not be tied to a fixed left-to-r...

#research #paper #ai #machine-learning #nlp #computer-vision
5 days ago · ai · - · -

[Paper] Implicit Representations of Grammaticality in Language Models

Grammaticality and likelihood are distinct notions in human language. Pretrained language models (LMs), which are probabilistic models of language fitted to max...

#research #paper #ai #nlp
5 days ago · ai · - · -

[Paper] MRI-Eval: A Tiered Benchmark for Evaluating LLM Performance on MRI Physics and GE Scanner Operations Knowledge

Background: Existing MRI LLM benchmarks rely mainly on review-book multiple-choice questions, where top proprietary models already score highly, limiting discri...

#research #paper #ai #nlp
5 days ago · ai · - · -

[Paper] The First Token Knows: Single-Decode Confidence for Hallucination Detection

Self-consistency detects hallucinations by generating multiple sampled answers to a question and measuring agreement, but this requires repeated decoding and ca...

#research #paper #ai #machine-learning #nlp
5 days ago · ai · - · -

[Paper] PSK at SemEval-2026 Task 9: Multilingual Polarization Detection Using Ensemble Gemma Models with Synthetic Data Augmentation

We present our system for SemEval-2026 Task 9: Multilingual Polarization Detection, a binary classification task spanning 22 languages. Our approach fine-tunes ...

#research #paper #ai #machine-learning #nlp
5 days ago · ai · - · -

[Paper] Beyond Semantics: An Evidential Reasoning-Aware Multi-View Learning Framework for Trustworthy Mental Health Prediction

Automated mental health prediction using textual data has shown promising results with deep learning and large language models. However, deploying these models ...

#research #paper #ai #nlp
5 days ago · ai · - · -

[Paper] Text Corpora as Concept Fields: Black-Box Hallucination and Novelty Measurement

We introduce the **Concept Field** of a text corpus: a local drift field with pointwise uncertainty, estimated in sentence-embedding space from the deltas betwe...

#research #paper #ai #machine-learning #nlp
5 days ago · ai · - · -

[Paper] Continual Knowledge Updating in LLM Systems: Learning Through Multi-Timescale Memory Dynamics

LLMs are trained once, then deployed into a world that never stops changing. External memory compensates for this, but most systems manage it explicitly rather ...

#research #paper #ai #machine-learning #nlp
5 days ago · ai · - · -

[Paper] Automatically Finding and Validating Unexpected Side-Effects of Interventions on Language Models

We present an automated, contrastive evaluation pipeline for auditing the behavioral impact of interventions on large language models. Given a base model M_1 an...

#research #paper #ai #machine-learning #nlp
5 days ago · ai · - · -

[Paper] The Pinocchio Dimension: Phenomenality of Experience as the Primary Axis of LLM Psychometric Differences

We administer 45 validated psychometric questionnaires to 50 large language models (LLMs) to identify the dimensions along which LLMs differ psychometrically. U...

#research #paper #ai #nlp
5 days ago · ai · - · -

[Paper] The Impossibility Triangle of Long-Context Modeling

We identify and prove a fundamental trade-off governing long-sequence models: no model can simultaneously achieve (i) per-step computation independent of sequen...

#research #paper #ai #machine-learning #nlp
5 days ago · ai · - · -

[Paper] Coral: Cost-Efficient Multi-LLM Serving over Heterogeneous Cloud GPUs

The usage of large language models (LLMs) has grown increasingly fragmented, with no single model dominating. Meanwhile, cloud providers offer a wide range of m...

#research #paper #ai #machine-learning #nlp
6 days ago · ai · - · -

[Paper] Safety and accuracy follow different scaling laws in clinical large language models

Clinical LLMs are often scaled by increasing model size, context length, retrieval complexity, or inference-time compute, with the implicit expectation that hig...

#research #paper #ai #machine-learning #nlp
6 days ago · ai · - · -

[Paper] OpenSeeker-v2: Pushing the Limits of Search Agents with Informative and High-Difficulty Trajectories

Deep search capabilities have become an indispensable competency for frontier Large Language Model (LLM) agents, yet their development remains dominated by indu...

#research #paper #ai #machine-learning #nlp
6 days ago · ai · - · -

[Paper] Rethinking Reasoning-Intensive Retrieval: Evaluating and Advancing Retrievers in Agentic Search Systems

Reasoning-intensive retrieval aims to surface evidence that supports downstream reasoning rather than merely matching topical similarity. This capability is inc...

#research #paper #ai #nlp
6 days ago · ai · - · -

[Paper] EQUITRIAGE: A Fairness Audit of Gender Bias in LLM-Based Emergency Department Triage

Emergency department triage assigns patients an acuity score that determines treatment priority, and clinical evidence documents persistent gender disparities i...

#research #paper #ai #nlp
6 days ago · ai · - · -

[Paper] Logical Consistency as a Bridge: Improving LLM Hallucination Detection via Label Constraint Modeling between Responses and Self-Judgments

Large Language Models (LLMs) are prone to factual hallucinations, risking their reliability in real-world applications. Existing hallucination detectors mainly ...

#research #paper #ai #nlp
6 days ago · ai · - · -

[Paper] Feature-Augmented Transformers for Robust AI-Text Detection Across Domains and Generators

AI-generated text is nowadays produced at scale across domains and heterogeneous generation pipelines, making robustness to distribution shift a central require...

#research #paper #ai #machine-learning #nlp
6 days ago · ai · - · -

[Paper] Transformers with Selective Access to Early Representations

Several recent Transformer architectures expose later layers to representations computed in the earliest layers, motivated by the observation that low-level fea...

#research #paper #ai #machine-learning #nlp
6 days ago · ai · - · -

[Paper] The Counterexample Game: Iterated Conceptual Analysis and Repair in Language Models

Conceptual analysis -- proposing definitions and refining them through counterexamples -- is central to philosophical methodology. We study whether language mod...

#research #paper #ai #machine-learning #nlp
6 days ago · ai · - · -

[Paper] Atomic Fact-Checking Increases Clinician Trust in Large Language Model Recommendations for Oncology Decision Support: A Randomized Controlled Trial

Question: Does atomic fact-checking, which decomposes AI treatment recommendations into individually verifiable claims linked to source guideline documents, inc...

#research #paper #ai #machine-learning #nlp
6 days ago · ai · - · -

[Paper] Steer Like the LLM: Activation Steering that Mimics Prompting

Large language models can be steered at inference time through prompting or activation interventions, but activation steering methods often underperform compare...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] SpecKV: Adaptive Speculative Decoding with Compression-Aware Gamma Selection

Speculative decoding accelerates large language model (LLM) inference by using a small draft model to propose candidate tokens that a larger target model verifi...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] SpecKV: Adaptive Speculative Decoding with Compression-Aware Gamma Selection

Speculative decoding accelerates large language model (LLM) inference by using a small draft model to propose candidate tokens that a larger target model verifi...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

Understanding Transformers Part 18: Completing the Decoding Process

Continuing the Decoding Process In the previous article we generated the first output word from the transformer. The translation was correct, but the decoder c...

#transformers #decoder #sequence-to-sequence #attention #machine-translation #deep-learning #NLP
1 week ago · ai · - · -

[Paper] FlexSQL: Flexible Exploration and Execution Make Better Text-to-SQL Agents

Text-to-SQL over large analytical databases requires navigating complex schemas, resolving ambiguous queries, and grounding decisions in actual data. Most curre...

#research #paper #ai #nlp
1 week ago · ai · - · -

[Paper] Reinforcement Learning for LLM-based Multi-Agent Systems through Orchestration Traces

As large language model (LLM) agents evolve from isolated tool users into coordinated teams, reinforcement learning (RL) must optimize not only individual actio...

#research #paper #ai #nlp

Newer posts

Older posts