NLP — Page 2 | EUNO.NEWS

1 week ago · ai · - · -

[Paper] FunFuzz: An LLM-Powered Evolutionary Fuzzing Framework

Modern fuzzers increasingly use Large Language Models (LLMs) to generate structured inputs, but LLM-driven fuzzing is sensitive to prompt initialization and sam...

#research #paper #ai #nlp
1 week ago · ai · - · -

[Paper] When Audio-Language Models Fail to Leverage Multimodal Context for Dysarthric Speech Recognition

Automatic speech recognition (ASR) systems remain brittle on dysarthric and other atypical speech. Recent audio-language models raise the possibility of improvi...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] Mitigating Misalignment Contagion by Steering with Implicit Traits

Language models (LMs) are increasingly used in high-stakes, multi-agent settings, where following instructions and maintaining value alignment are critical. Mos...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] Foundation Models to Unlock Real-World Evidence from Nationwide Medical Claims

Evidence derived from large-scale real-world data (RWD) is increasingly informing regulatory evaluation and healthcare decision-making. Administrative claims pr...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] PubMed-Ophtha: An open resource for training ophthalmology vision-language models on scientific literature

Vision-language models hold considerable promise for ophthalmology, but their development depends on large-scale, high-quality image-text datasets that remain s...

#research #paper #ai #nlp #computer-vision
1 week ago · ai · - · -

[Paper] mdok-style at SemEval-2026 Task 10: Finetuning LLMs for Conspiracy Detection

SemEval-2026 Task 10 is focused on conspiracy detection. Specifically, the goal is to detect whether a Reddit comment expresses a conspiracy belief. Our submitt...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] mdok-style at SemEval-2026 Task 9: Finetuning LLMs for Multilingual Polarization Detection

SemEval-2026 Task 9 is focused on multilingual polarization detection. Specifically, it covers the identification of multilingual, multicultural and multievent ...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] When LLMs Stop Following Steps: A Diagnostic Study of Procedural Execution in Language Models

Large language models (LLMs) often achieve strong performance on reasoning benchmarks, but final-answer accuracy alone does not show whether they faithfully exe...

#research #paper #ai #nlp
1 week ago · ai · - · -

[Paper] Can Coding Agents Reproduce Findings in Computational Materials Science?

Large language models are increasingly deployed as autonomous coding agents and have achieved remarkably strong performance on software engineering benchmarks. ...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] RunAgent: Interpreting Natural-Language Plans with Constraint-Guided Execution

Humans solve problems by executing targeted plans, yet large language models (LLMs) remain unreliable for structured workflow execution. We propose RunAgent, a ...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] When RAG Chatbots Expose Their Backend: An Anonymized Case Study of Privacy and Security Risks in Patient-Facing Medical AI

Background: Patient-facing medical chatbots based on retrieval-augmented generation (RAG) are increasingly promoted to deliver accessible, grounded health infor...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] LASE: Language-Adversarial Speaker Encoding for Indic Cross-Script Identity Preservation

A speaker encoder used in multilingual voice cloning should treat the same speaker identically regardless of which script the audio was uttered in. Off-the-shel...

#research #paper #ai #nlp
1 week ago · ai · - · -

[Paper] Directed Social Regard: Surfacing Targeted Advocacy, Opposition, Aid, Harms, and Victimization in Online Media

The language in online platforms, influence operations, and political rhetoric frequently directs a mix of pro-social sentiment (e.g., advocacy, helpfulness, co...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] Characterizing the Expressivity of Local Attention in Transformers

The transformer is the most popular neural architecture for language modeling. The cornerstone of the transformer is its global attention mechanism, which lets ...

#research #paper #ai #nlp
1 week ago · ai · - · -

[Paper] FinSafetyBench: Evaluating LLM Safety in Real-World Financial Scenarios

Large language models (LLMs) are increasingly applied in financial scenarios. However, they may produce harmful outputs, including facilitating illegal activiti...

#research #paper #ai #nlp
1 week ago · ai · - · -

[Paper] Learning How and What to Memorize: Cognition-Inspired Two-Stage Optimization for Evolving Memory

Large language model (LLM) agents require long-term user memory for consistent personalization, but limited context windows hinder tracking evolving preferences...

#research #paper #ai #nlp
1 week ago · ai · - · -

[Paper] Adaptive Querying with AI Persona Priors

We study adaptive querying for learning user-dependent quantities of interest, such as responses to held-out items and psychometric indicators, within tight que...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] AGoQ: Activation and Gradient Quantization for Memory-Efficient Distributed Training of LLMs

Quantization is a key method for reducing the GPU memory requirement of training large language models (LLMs). Yet, current approaches are ineffective for 4-bit...

#research #paper #ai #nlp
1 week ago · ai · - · -

[Paper] Exploration Hacking: Can LLMs Learn to Resist RL Training?

Reinforcement learning (RL) has become essential to the post-training of large language models (LLMs) for reasoning, agentic capabilities and alignment. Success...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] Synthetic Computers at Scale for Long-Horizon Productivity Simulation

Realistic long-horizon productivity work is strongly conditioned on user-specific computer environments, where much of the work context is stored and organized ...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] On the Proper Treatment of Units in Surprisal Theory

Surprisal theory links human processing effort to the predictability of an upcoming linguistic unit, but empirical work often leaves the notion of a unit unders...

#research #paper #ai #nlp
1 week ago · ai · - · -

[Paper] PRISM: Pre-alignment via Black-box On-policy Distillation for Multimodal Reinforcement Learning

The standard post-training recipe for large multimodal models (LMMs) applies supervised fine-tuning (SFT) on curated demonstrations followed by reinforcement le...

#research #paper #ai #machine-learning #nlp #computer-vision
1 week ago · ai · - · -

[Paper] Mapping the Methodological Space of Classroom Interaction Research: Scale, Duration, and Modality in an Age of AI

Research on classroom interaction has long been divided between large-scale observation and in-depth ethnographic work. We propose a framework mapping this meth...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] TopBench: A Benchmark for Implicit Prediction and Reasoning over Tabular Question Answering

Large Language Models (LLMs) have advanced Table Question Answering, where most queries can be answered by extracting information or simple aggregation. However...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] Repetition over Diversity: High-Signal Data Filtering for Sample-Efficient German Language Modeling

Recent research has shown that filtering massive English web corpora into high-quality subsets significantly improves training efficiency. However, for high-res...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] Measuring research data reuse in scholarly publications using generative artificial intelligence: Open Science Indicator development and preliminary results

Numerous metascience studies and other initiatives have begun to monitor the prevalence of open science practices when it is more important to understand the 'd...

#research #paper #ai #nlp
1 week ago · ai · - · -

[Paper] Stable Behavior, Limited Variation: Persona Validity in LLM Agents for Urban Sentiment Perception

Large Language Models (LLMs) are increasingly used as proxies for human perception in urban analysis, yet it remains unclear whether persona prompting produces ...

#research #paper #ai #nlp
1 week ago · ai · - · -

[Paper] Ease of dependency distance minimization in star-like structures

The syntactic structure of a sentence can be represented as a tree where edges indicate syntactic dependencies between words. When that structure is a star, it ...

#research #paper #ai #nlp
1 week ago · ai · - · -

[Paper] Turning the TIDE: Cross-Architecture Distillation for Diffusion Large Language Models

Diffusion large language models (dLLMs) offer parallel decoding and bidirectional context, but state-of-the-art dLLMs require billions of parameters for competi...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] Select to Think: Unlocking SLM Potential with Local Sufficiency

Small language models (SLMs) offer computational efficiency for scalable deployment, yet they often fall short of the reasoning power exhibited by their larger ...

#research #paper #ai #nlp
1 week ago · ai · - · -

[Paper] ClassEval-Pro: A Cross-Domain Benchmark for Class-Level Code Generation

LLMs have achieved strong results on both function-level code synthesis and repository-level code modification, yet a capability that falls between these two ex...

#research #paper #ai #nlp
1 week ago · ai · - · -

[Paper] ClawGym: A Scalable Framework for Building Effective Claw Agents

Claw-style environments support multi-step workflows over local files, tools, and persistent workspace states. However, scalable development around these enviro...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] HealthNLP_Retrievers at ArchEHR-QA 2026: Cascaded LLM Pipeline for Grounded Clinical Question Answering

Patient portals now give individuals direct access to their electronic health records (EHRs), yet access alone does not ensure patients understand or act on the...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] MoRFI: Monotonic Sparse Autoencoder Feature Identification

Large language models (LLMs) acquire most of their factual knowledge during the pre-training stage, through next token prediction. Subsequent stages of post-tra...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] What Kind of Language is Easy to Language-Model Under Curriculum Learning?

Many of the thousands of attested languages share common configurations of features, creating a spectrum from typologically very rare (e.g., object-verb-subject...

#research #paper #ai #nlp
1 week ago · ai · - · -

[Paper] Language Diffusion Models are Associative Memories Capable of Retrieving Unseen Data

When do language diffusion models memorize their training data, and how to quantitatively assess their true generative regime? We address these questions by sho...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] HalluCiteChecker: A Lightweight Toolkit for Hallucinated Citation Detection and Verification in the Era of AI Scientists

We introduce HalluCiteChecker, a toolkit for detecting and verifying hallucinated citations in scientific papers. While AI assistant technologies have transform...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] Accelerating RL Post-Training Rollouts via System-Integrated Speculative Decoding

RL post-training of frontier language models is increasingly bottlenecked by autoregressive rollout generation, making rollout acceleration a central systems ch...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] Text-Utilization for Encoder-dominated Speech Recognition Models

This paper investigates efficient methods for utilizing text-only data to improve speech recognition, focusing on encoder-dominated models that facilitate faste...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] Recursive Multi-Agent Systems

Recursive or looped language models have recently emerged as a new scaling axis by iteratively refining the same model computation over latent states to deepen ...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] DV-World: Benchmarking Data Visualization Agents in Real-World Scenarios

Real-world data visualization (DV) requires native environmental grounding, cross-platform evolution, and proactive intent alignment. Yet, existing benchmarks o...

#research #paper #ai #nlp
1 week ago · ai · - · -

[Paper] A paradox of AI fluency

How much does a user's skill with AI shape what AI actually delivers for them? This question is critical for users, AI product builders, and society at large, b...

#research #paper #ai #nlp
1 week ago · ai · - · -

[Paper] Toward a Functional Geometric Algebra for Natural Language Semantics

Distributional and neural approaches to natural language semantics have been built almost exclusively on conventional linear algebra: vectors, matrices, tensors...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] Three Models of RLHF Annotation: Extension, Evidence, and Authority

Preference-based alignment methods, most prominently Reinforcement Learning with Human Feedback (RLHF), use the judgments of human annotators to shape large lan...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] From Syntax to Emotion: A Mechanistic Analysis of Emotion Inference in LLMs

Large language models (LLMs) are increasingly used in emotionally sensitive human-AI applications, yet little is known about how emotion recognition is internal...

#research #paper #ai #nlp
1 week ago · ai · - · -

[Paper] Luminol-AIDetect: Fast Zero-shot Machine-Generated Text Detection based on Perplexity under Text Shuffling

Machine-generated text (MGT) detection requires identifying structurally invariant signals across generation models, rather than relying on model-specific finge...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] G-Loss: Graph-Guided Fine-Tuning of Language Models

Traditional loss functions, including cross-entropy, contrastive, triplet, and supervised contrastive losses, used for fine-tuning pre-trained language models ...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] Agentic Harness Engineering: Observability-Driven Automatic Evolution of Coding-Agent Harnesses

Harnesses have become a central determinant of coding-agent performance, shaping how models interact with repositories, tools, and execution environments. Yet a...

#research #paper #ai #nlp
