[Paper] Olmix: A Framework for Data Mixing Throughout LM Development
Data mixing -- determining the ratios of data from different domains -- is a first-order concern for training language models (LMs). While existing mixing metho...
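The abstract above defines data mixing as choosing the ratios of data drawn from different domains. As a minimal, hypothetical sketch (not the paper's method), the idea can be illustrated by sampling training examples from domains in proportion to a chosen mixture; the domain names and ratios below are assumed for illustration only:

```python
import random

def sample_domains(ratios, n, seed=0):
    """Draw n domain labels with probability proportional to each ratio."""
    rng = random.Random(seed)
    domains, weights = zip(*ratios.items())
    return rng.choices(domains, weights=weights, k=n)

# Assumed example mixture; in practice these ratios are what a data
# mixing method like the one described would determine.
ratios = {"web": 0.6, "code": 0.25, "books": 0.15}
draws = sample_domains(ratios, 10_000)
freq = {d: draws.count(d) / len(draws) for d in ratios}
# empirical frequencies approximate the target mixture ratios
```

The empirical frequencies converge to the target ratios as the number of draws grows; a real mixing framework concerns itself with how to pick those ratios well, not with the sampling mechanics.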
Efficient long-context processing remains a crucial challenge for contemporary large language models (LLMs), especially in resource-constrained environments. So...
AI models have achieved state-of-the-art results in textual reasoning; however, their ability to reason over spatial and relational structures remains a critica...
The prevailing paradigm in large language model (LLM) development is to pretrain a base model, then perform further training to improve performance and model be...
Diffusion language models generate text through iterative refinement, a process that is often computationally inefficient because many tokens reach stability lo...
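The abstract above observes that in iterative refinement, many tokens stabilize long before the process ends, wasting compute on positions that no longer change. A toy sketch of one possible response, freezing a token once it has gone unchanged for a few consecutive steps, is shown below; the stability criterion and all names here are assumptions for illustration, not the paper's actual technique:

```python
def refine(tokens, update_fn, steps, patience=2):
    """Toy iterative refinement that freezes a position once its value
    has been unchanged for `patience` consecutive steps (an assumed
    stability heuristic, not the method from the abstract)."""
    stable = [0] * len(tokens)       # consecutive unchanged-step counts
    frozen = [False] * len(tokens)
    for _ in range(steps):
        for i, tok in enumerate(tokens):
            if frozen[i]:
                continue             # skip recomputation for stable tokens
            new = update_fn(i, tok)
            stable[i] = stable[i] + 1 if new == tok else 0
            tokens[i] = new
            if stable[i] >= patience:
                frozen[i] = True
    return tokens

# Toy update rule: each position climbs toward a per-position target,
# so early-converging positions get frozen while others keep refining.
targets = [3, 1, 2]
result = refine([0, 0, 0], lambda i, t: min(t + 1, targets[i]), steps=10)
```

Positions that hit their target early stop being updated after `patience` idle steps, which is the kind of saving the abstract's observation motivates.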
Misinformation detection is a critical task that can benefit significantly from the integration of external knowledge, much like manual fact-checking. In this w...
Reinforcement learning (RL) based post-training for explicit chain-of-thought (e.g., GRPO) improves the reasoning ability of multimodal large-scale reasoning mo...
Misalignment in Large Language Models (LLMs) refers to the failure to simultaneously satisfy safety, value, and cultural alignment dimensions, leading to behaviors that d...
Large language models (LLMs) demonstrate strong general reasoning and language understanding, yet their performance degrades in domains governed by strict forma...
Large Language Model (LLM) agents have shown promising potential in automating Instructional Systems Design (ISD), a systematic approach to developing education...
Language models have become practical tools for quantum computing education and research, from summarizing technical papers to explaining theoretical concepts a...
The principal goal of the RAG TREC Instrument for Multilingual Evaluation (RAGTIME) track at TREC is to study report generation from multilingual source documen...