Source

arXiv

1364 posts from this source

3 days ago · ai · - · -

[Paper] EvoArena: Tracking Memory Evolution for Robust LLM Agents in Dynamic Environments

Large language model (LLM) agents have achieved strong performance on a wide range of benchmarks, yet most evaluations assume static environments. In contrast, ...

#research #paper #ai #nlp
3 days ago · ai · - · -

[Paper] Learning to Reason by Analogy via Retrieval-Augmented Reinforcement Fine-Tuning

Retrieval-augmented generation (RAG) has become a standard mechanism for grounding language models in external knowledge, yet conventional retrieval based on le...

#research #paper #ai #machine-learning #nlp
3 days ago · ai · - · -

[Paper] InterleaveThinker: Reinforcing Agentic Interleaved Generation

Recent image generators have demonstrated impressive photorealism and instruction-following capabilities in single-image generation and editing. However, constr...

#research #paper #ai #computer-vision
3 days ago · ai · - · -

[Paper] Mana: Dexterous Manipulation of Articulated Tools

Articulated tool manipulation remains a major challenge in dexterous robotics due to the need to coordinate internal degrees of freedom and contact-rich interac...

#research #paper #ai #machine-learning #computer-vision
3 days ago · ai · - · -

[Paper] Modality Forcing for Scalable Spatial Generation

Text-to-image (T2I) models contain rich spatial priors. Synthesizing photorealistic, cluttered scenes requires an understanding of geometry, including perspecti...

#research #paper #ai #computer-vision
3 days ago · ai · - · -

[Paper] RepWAM: World Action Modeling with Representation Visual-Action Tokenizers

This work presents RepWAM, a representation-centric world action model (WAM) built on representation visual-action tokenizers. Existing WAMs typically inherit r...

#research #paper #ai #computer-vision
3 days ago · ai · - · -

[Paper] SpatialClaw: Rethinking Action Interface for Agentic Spatial Reasoning

Spatial reasoning, the ability to determine where objects are, how they relate, and how they move in 3D, remains a fundamental challenge for vision-language mod...

#research #paper #ai #machine-learning #computer-vision
3 days ago · ai · - · -

[Paper] Understanding Truncated Positional Encodings for Graph Neural Networks

Positional encodings (PEs) enhance the power of graph neural networks (GNNs), both theoretically and empirically. Two of the most popular families of PEs - spec...

#research #paper #ai #machine-learning
3 days ago · ai · - · -

[Paper] Automated reproducibility assessments in the social and behavioral sciences using large language models

Reproducibility in the social and behavioral sciences is typically evaluated by independent researchers who reanalyze the original data to assess whether the pu...

#research #paper #ai #machine-learning
3 days ago · ai · - · -

[Paper] Agents-K1: Towards Agent-native Knowledge Orchestration

Current LLM-based research agents have advanced through agent orchestration, yet largely overlook scientific knowledge orchestration. Existing works often reduc...

#research #paper #ai #machine-learning
3 days ago · ai · - · -

[Paper] Influcoder: Distilling Decoders' Gradient Influence Rankings into an Encoder for Data Attribution

With the growth of LLMs' (Large Language Models) capabilities, there has been an increasing push to curate high quality datasets by filtering samples in the tra...

#research #paper #ai #nlp
3 days ago · ai · - · -

[Paper] HyperTool: Beyond Step-Wise Tool Calls for Tool-Augmented Agents

Tool-augmented LLM agents commonly rely on step-wise atomic tool calls, where each invocation, observation, and value transfer is exposed in the main reasoning ...

#research #paper #ai #nlp
3 days ago · ai · - · -

[Paper] EurekAgent: Agent Environment Engineering is All You Need For Autonomous Scientific Discovery

LLM-based agents have shown increasing potential in automating scientific discovery. Given an optimizable metric and an execution environment, they can propose,...

#research #paper #ai #machine-learning #nlp
3 days ago · ai · - · -

[Paper] Before You Think: System 0, AI-Mediated Cognition and Cognitive Colonization

This paper examines three recent frameworks for understanding the cognitive and epistemic consequences of artificial intelligence: Tri-System Theory, Thinkframe...

#research #paper #ai #machine-learning
3 days ago · ai · - · -

[Paper] Dense Supervision, Sparse Updates: On the Sparsity and Geometry of On-Policy Distillation

On-policy distillation (OPD) has recently become a prominent post-training recipe as it combines two desirable ingredients: on-policy student trajectori...

#research #paper #ai #machine-learning
3 days ago · ai · - · -

[Paper] Flex4DHuman: Flexible Multi-view Video Diffusion for 4D Human Reconstruction

We present Flex4DHuman, a multi-view video diffusion model that transforms a monocular or sparse multi-view video of a dynamic subject into synchronized dense m...

#research #paper #ai #computer-vision
3 days ago · ai · - · -

[Paper] World Tracing: Generative Pixel-Aligned Geometry Beyond the Visible

Image-to-3D methods often trade off faithfulness and completeness: depth estimators are anchored to input pixels but stop at the visible surface, while image-to...

#research #paper #ai #computer-vision
3 days ago · ai · - · -

[Paper] Operadic consistency: a label-free signal for compositional reasoning failures in LLMs

Detecting LLM reasoning failures at inference time without ground-truth labels has motivated a wide range of confidence baselines, including self-consistency, s...

#research #paper #ai #machine-learning #nlp
3 days ago · ai · - · -

[Paper] SkMTEB: Slovak Massive Text Embedding Benchmark and Model Adaptation

We introduce SkMTEB, the first comprehensive MTEB-style text embedding benchmark for Slovak, a low-resource West Slavic language, comprising 31 datasets across ...

#research #paper #ai #machine-learning #nlp
3 days ago · ai · - · -

[Paper] Surflo: Consistent 3D Surface Flow Model with Global State

Geometry is invariant to viewpoint, which makes any collection of images a redundant encoding of a single 3D state. Existing feed-forward reconstruction models ...

#research #paper #ai #computer-vision
3 days ago · ai · - · -

[Paper] Recursive Agent Harnesses

Recursive language models (RLMs) showed that recursion over model calls is an effective strategy for long-context reasoning, and production coding agents have b...

#research #paper #ai #nlp
3 days ago · ai · - · -

[Paper] The Stable Recovery Manifold: Geometric Principles Governing Recoverability in Continual Learning

Catastrophic forgetting is often viewed as the destruction of previously learned knowledge during sequential learning. Building on the Accessibility Collapse fr...

#research #paper #ai #machine-learning
3 days ago · ai · - · -

[Paper] Operads for compositional reasoning in LLMs

Question decomposition, i.e. breaking a complex query into simpler sub-queries whose answers are composed to produce a final answer, is a widely used strategy f...

#research #paper #ai #nlp
3 days ago · ai · - · -

[Paper] Aerial Wildfire Suppression Planning with a Hybrid CNN-Cellular Automata Fire Model

Aerial wildfire suppression requires not only predicting fire spread, but also designing effective intervention strategies under operational and environmental u...

#research #paper #ai #machine-learning
3 days ago · ai · - · -

[Paper] From Tokens to Faces: Investigating Discrete Speech Representations for 3D Facial Animation

The choice of speech representation is critical in speech-driven 3D facial animation. Representations differ in what they encode: SSL features emphasize segment...

#research #paper #ai #nlp
3 days ago · ai · - · -

[Paper] Valid Inference with Synthetic Data via Task Exchangeability

There is a proliferation of work arguing for the use of synthetic data in scientific research. For example, social scientists are arguing for the use of LLM-gen...

#research #paper #ai #machine-learning
3 days ago · ai · - · -

[Paper] Generative Modeling of Bach-Style Symbolic Music: A Comparative Study of Autoregressive, Latent-Variable, and Adversarial Approaches

We study generative modeling of Bach-style symbolic piano music using a shared MIDI corpus and three model families: autoregressive LSTMs with attention, latent...

#research #paper #ai #machine-learning
3 days ago · ai · - · -

[Paper] Revisiting Vehicle Color Recognition in Long-Tailed Surveillance Scenarios

Vehicle color recognition is an important cue for vehicle identification in surveillance systems, especially when license plates are illegible due to low resolu...

#research #paper #ai #computer-vision
3 days ago · ai · - · -

[Paper] Beyond Uniform Tokens: Adaptive Compression for Time Series Language Models

Large language models (LLMs) have enabled time series (TS) analysis by jointly modeling numerical observations and textual context through a shared token interf...

#research #paper #ai #nlp
3 days ago · devops · - · -

[Paper] Finding Conservation Laws of Large Dynamical Systems with Tasks and Futures: A Case Study in Utilizing Dynamic Data Dependencies

As parallel workloads grow in complexity, managing fine-grained data dependencies becomes a critical challenge. Futures offer a promising model for handling the...

#research #paper #devops
3 days ago · ai · - · -

[Paper] Beyond Runtime Enforcement: Shield Synthesis as Defensibility Analysis for Adversarial Networks

Shielded reinforcement learning is typically presented as a runtime safety mechanism that compiles temporal-logic specifications into automata restricting an ag...

#research #paper #ai #machine-learning
3 days ago · ai · - · -

[Paper] Majority-of-Three is Optimal

We give a short proof that the majority vote of three independent consistent classifiers is an optimal learner in the realizable PAC setting. This proves optima...

#research #paper #ai #machine-learning
3 days ago · ai · - · -

[Paper] One Polluted Page Is Enough: Evaluating Web Content Pollution in Generative Recommenders

Search-augmented LLMs increasingly mediate everyday consumer recommendations by retrieving live web content. This creates a new risk: generative recommenders ma...

#research #paper #ai #machine-learning #nlp
3 days ago · ai · - · -

[Paper] AgentBeats: Agentifying Agent Assessment for Openness, Standardization, and Reproducibility

Agent systems are advancing quickly across domains, but their evaluation remains fragmented. Most benchmarks rely on fixed, LLM-centric harnesses that require h...

#research #paper #ai #machine-learning
3 days ago · ai · - · -

[Paper] Reasoning as Pattern Matching: Shared Mechanisms in Human and LLM Everyday Reasoning

When large language models (LLMs) fail to generalize or make haphazard errors in reasoning, it is often taken as evidence that LLMs are not truly reasoning, but...

#research #paper #ai #machine-learning
3 days ago · ai · - · -

[Paper] Distribution-Agnostic Robust Trajectory Optimization via Chance-Constrained Reinforcement Learning

This paper presents a distribution-agnostic robust trajectory-optimization framework based on chance-constrained reinforcement learning. The uncertainty is repr...

#research #paper #ai #machine-learning
3 days ago · ai · - · -

[Paper] Multi-Agent Reinforcement Learning from Delayed Marketplace Feedback for Objective-Weight Adaptation in Three-Sided Dispatch

Dispatch in three-sided marketplaces provides a natural setting for reinforcement learning from world feedback: decisions are evaluated by delayed operational o...

#research #paper #ai #machine-learning
3 days ago · ai · - · -

[Paper] Beyond the Commitment Boundary: Probing Epiphenomenal Chain-of-Thought in Large Reasoning Models

Chain-of-thought (CoT) reasoning is the dominant paradigm for inference-time scaling in language models, yet the causal influence of individual steps on the fin...

#research #paper #ai #machine-learning #nlp
3 days ago · ai · - · -

[Paper] EpiBench: Verifiable Evaluation of AI Agents on Epigenomics Analysis

We introduce EpiBench, a verifiable benchmark for short-horizon epigenomics analysis. EpiBench evaluates whether agents can make well-defined analysis decisions...

#research #paper #ai #machine-learning
3 days ago · ai · - · -

[Paper] Reward Modeling for Multi-Agent Orchestration

Multi-Agent Systems (MAS) built on Large Language Models (LLMs) require effective orchestration to coordinate specialized agents, yet training such orchestrator...

#research #paper #ai #machine-learning #nlp
3 days ago · ai · - · -

[Paper] Multiagent Protocols with Aggregated Confidence Signals

Confidence is used for reliability, oversight, and a range of downstream decision tasks in Natural Language Processing (NLP), yet no existing method produces or...

#research #paper #ai #machine-learning
3 days ago · ai · - · -

[Paper] Simplex-Constrained Sparse Bagging: Transitioning from Uniform Priors to Sparse Posteriors in Ensemble Learning

We present Simplex-Constrained Sparse Bagging (SCSB), a mathematically rigorous framework for post-training compression and probability calibration of bootstrap...

#research #paper #ai #machine-learning
3 days ago · ai · - · -

[Paper] Towards Effective Waste Segmentation for Automated Waste Recycling in Cluttered Background

Rapid expansion of urban areas and population growth is causing an immense increase in waste production, which demands the need for efficient and automated wast...

#research #paper #ai #computer-vision
3 days ago · ai · - · -

[Paper] The Tone of Awareness: Topic, Sentiment, and Toxicity Maps During Mental Health Month on TikTok

Despite raising concerns about the mental health effects associated with the usage of TikTok, little is known about how related content is framed by creators an...

#research #paper #ai #nlp
3 days ago · ai · - · -

[Paper] EvTexture++: Event-Driven Texture Enhancement for Video Super-Resolution

Event-based vision has drawn increasing attention owing to its distinctive properties, including ultra-high temporal resolution and extreme dynamic range. Recen...

#research #paper #ai #machine-learning #computer-vision
3 days ago · ai · - · -

[Paper] LabVLA: Grounding Vision-Language-Action Models in Scientific Laboratories

Scientific laboratories increasingly rely on AI systems to reason about experiments, but the physical act of doing science remains largely outside their reach. ...

#research #paper #ai #machine-learning #nlp
3 days ago · ai · - · -

[Paper] Learning with Simulators: No Regret in a Computationally Bounded World

Understanding the minimal assumptions necessary for generalization is the fundamental question in learning theory. Unfortunately, most results rely heavily on i...

#research #paper #ai #machine-learning
3 days ago · ai · - · -

[Paper] ArogyaSutra: A Multi-Agent Framework for Multimodal Medical Reasoning in Indic Languages

Multimodal Large Language Models (MLLMs) have shown promising reasoning capabilities in general domains, yet their performance remains limited in specialized se...

#research #paper #ai #machine-learning #nlp
