nlp — Page 18 | EUNO.NEWS

Sort:

0 month ago · ai · - · -

[Paper] Self-Improving Multilingual Long Reasoning via Translation-Reasoning Integrated Training

Long reasoning models often struggle in multilingual settings: they tend to reason in English for non-English questions; when constrained to reasoning in the qu...

#research #paper #ai #nlp
0 month ago · ai · - · -

[Paper] Polyglots or Multitudes? Multilingual LLM Answers to Value-laden Multiple-Choice Questions

Multiple-Choice Questions (MCQs) are often used to assess knowledge, reasoning abilities, and even values encoded in large language models (LLMs). While the eff...

#research #paper #ai #nlp
0 month ago · ai · - · -

[Paper] DARWIN: Dynamic Agentically Rewriting Self-Improving Network

DARWIN is an evolutionary GPT model, utilizing a genetic-algorithm like optimization structure with several independent GPT agents being trained individually us...

#research #paper #ai #machine-learning #nlp
1 month ago · ai · - · -

[Paper] ArkTS-CodeSearch: A Open-Source ArkTS Dataset for Code Retrieval

ArkTS is a core programming language in the OpenHarmony ecosystem, yet research on ArkTS code intelligence is hindered by the lack of public datasets and evalua...

#research #paper #ai #nlp
1 month ago · ai · - · -

[Paper] Reinforced Attention Learning

Post-training with Reinforcement Learning (RL) has substantially improved reasoning in Large Language Models (LLMs) via test-time scaling. However, extending th...

#research #paper #ai #machine-learning #nlp #computer-vision
1 month ago · ai · - · -

[Paper] Rethinking the Trust Region in LLM Reinforcement Learning

Reinforcement learning (RL) has become a cornerstone for fine-tuning Large Language Models (LLMs), with Proximal Policy Optimization (PPO) serving as the de fac...

#research #paper #ai #machine-learning #nlp
1 month ago · ai · - · -

[Paper] Subliminal Effects in Your Data: A General Mechanism via Log-Linearity

Training modern large language models (LLMs) has become a veritable smorgasbord of algorithms and datasets designed to elicit particular behaviors, making it cr...

#research #paper #ai #machine-learning #nlp
1 month ago · ai · - · -

[Paper] CoT is Not the Chain of Truth: An Empirical Internal Analysis of Reasoning LLMs for Fake News Generation

From generating headlines to fabricating news, the Large Language Models (LLMs) are typically assessed by their final outputs, under the safety assumption that ...

#research #paper #ai #nlp
1 month ago · ai · - · -

[Paper] Decomposed Prompting Does Not Fix Knowledge Gaps, But Helps Models Say 'I Don't Know'

Large language models often struggle to recognize their knowledge limits in closed-book question answering, leading to confident hallucinations. While decompose...

#research #paper #ai #nlp
1 month ago · ai · - · -

[Paper] Horizon-LM: A RAM-Centric Architecture for LLM Training

The rapid growth of large language models (LLMs) has outpaced the evolution of single-GPU hardware, making model scale increasingly constrained by memory capaci...

#research #paper #ai #nlp
1 month ago · ai · - · -

[Paper] SE-Bench: Benchmarking Self-Evolution with Knowledge Internalization

True self-evolution requires agents to act as lifelong learners that internalize novel experiences to solve future problems. However, rigorously measuring this ...

#research #paper #ai #machine-learning #nlp
1 month ago · ai · - · -

[Paper] OmniSIFT: Modality-Asymmetric Token Compression for Efficient Omni-modal Large Language Models

Omni-modal Large Language Models (Omni-LLMs) have demonstrated strong capabilities in audio-video understanding tasks. However, their reliance on long multimoda...

#research #paper #ai #nlp

Newer posts

Older posts