Source

arXiv

1621 posts from this source

Sort:

1 week ago · ai · - · -

[Paper] AHA-WAM:Asynchronous Horizon-Adaptive World-Action Modeling with Observation-Guided Context Routing

World-action models have emerged as a promising paradigm for robot manipulation, jointly modeling visual scene dynamics and actions to inject physical priors in...

#research #paper #ai #machine-learning #computer-vision
1 week ago · ai · - · -

[Paper] Evaluation Cards: An Interpretive Layer for AI Evaluation Reporting

AI evaluation results are produced at scale but reported inconsistently across leaderboards, model cards, benchmark papers, and company blogs. The cost is inter...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Topological Neural Operators

We introduce Topological Neural Operators (TNOs), a principled framework for operator learning on cell complexes that lifts neural operators (NOs) from function...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Echo-Memory: A Controlled Study of Memory in Action World Models

We present Echo-Memory, a controlled study of memory mechanisms in action-conditioned world models. These models generate multi-segment videos from a first fram...

#research #paper #ai #machine-learning #computer-vision
1 week ago · ai · - · -

[Paper] Bandits for Efficient Experimentation: Adapting to Control Group, Preferences, and Context Drifts

We consider a variant of the linear contextual stochastic multi-armed bandits, where the learner must provide recommendations to a group of users, each having i...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] FASE: Fast Adaptive Semantic Entropy for Code Quality

Multi-agent code generation offers a promising paradigm for autonomous software development by simulating the human software engineering lifecycle. However, sys...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Beyond Spherical Harmonics: Rethinking Appearance Models for Radiance Reconstruction

View-dependent appearance modeling remains a challenging problem in novel-view synthesis and reconstruction. Accurately representing complex angular effects oft...

#research #paper #ai #computer-vision
1 week ago · ai · - · -

[Paper] End-to-End Optimization of Incoherent Imaging for Classification Under Detector-Limited Readout

End-to-end co-optimization of optical front-ends (e.g. metasurfaces) and neural network back-ends has been widely applied to imaging tasks, yet a formalism char...

#research #paper #ai #computer-vision
1 week ago · ai · - · -

[Paper] POTATR: A Lightweight Image-to-Graph Model for Page-Level Table Extraction

Large-scale document processing requires contextually aware table extraction (TE) that is both accurate and efficient. Yet current approaches require billions o...

#research #paper #ai #computer-vision
1 week ago · ai · - · -

[Paper] Zero Touch Predictive Orchestration: Automating Time-Series Models for the Cloud-Edge Continuum

The Cloud-Edge Continuum (CEC) enables latency-critical applications by distributing resources to the far edge, but its extreme volatility makes proactive Zero ...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Quality-Diversity Search in Sound Generation: Investigating Innovation Engines for Audio Exploration

This study addresses the challenges composers and sound designers face in creating and refining tools to achieve their musical goals. Using evolutionary process...

#research #paper #ai
1 week ago · ai · - · -

[Paper] Quality-Diversity Search in Sound Generation: Investigating Innovation Engines for Audio Exploration

This study addresses the challenges composers and sound designers face in creating and refining tools to achieve their musical goals. Using evolutionary process...

#research #paper #ai
1 week ago · ai · - · -

[Paper] Who Earns the Safety? Intervention-Aware Quantum Predictive Control with Safety Attribution

Hard safety filters are increasingly placed downstream of learned controllers to guarantee constraint satisfaction at run time. Yet a filtered controller that n...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] SIGA: Self-Evolving Coding-Agent Adapters for Scientific Simulation

Advanced scientific simulators expose specialized input languages that turn simulation goals into executable configurations, but learning them can cost domain s...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] SemDINO: A DINOv3-Driven Network for Cross-Temporal Semantic Alignment in Change Detection

Semantic change detection (SCD) aims to simultaneously locate land-cover changes and identify semantic categories before and after transition. However, existing...

#research #paper #ai #computer-vision
1 week ago · ai · - · -

[Paper] Discovering Functionally Selective Brain Regions with a Deep Topographic Multimodal Model

Nearby neurons in cortex share similar response profiles, producing systematic spatial organization across sensory and cognitive systems. Recent topographic mod...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Data Synthesis and Parameter-Efficient Fine-Tuning for Low-Resource NMT: A Case Study on Q'eqchi' Mayan

Neural machine translation for digitally low-resource Indigenous languages is often hindered by extreme data scarcity, prompting reliance on extractive web-scra...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] iOSWorld: A Benchmark for Personally Intelligent Phone Agents

A useful phone agent needs to be personally intelligent. It should reason over a user's identity, history, and preferences as they exist on the device, not just...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] Preserving Plasticity in Continual Learning via Dynamical Isometry

Continual training of deep neural networks under non-stationarity often leads to a progressive loss of plasticity, eventually limiting further learning. We rela...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Difference-Aware Retrieval Policies for Imitation Learning

Parametric imitation learning via behavior cloning can suffer from poor generalization to out-of-distribution states due to compounding errors during deployment...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Perturbative Contrastive Physical Learning

Responses to perturbations are key to understanding physical systems. The ability to contrast such responses by comparing how a system reacts under slightly dif...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Collaborative Human-Agent Protocol (CHAP)

Foundation models are moving from response generation into operational roles. They plan across steps, call tools, request human input, coordinate with other age...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] Your Model Already Knows: Attention-Guided Safety Filter for Vision-Language-Action Models

Vision-Language-Action (VLA) models have demonstrated impressive end-to-end performance across a variety of robotic manipulation tasks. However, these policies ...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Multi-Turn Evaluation of Deep Research Agents Under Process-Level Feedback

Existing benchmarks for deep research agents (DRAs) assess only single-shot outputs, ignoring a key question: can DRAs improve their reports when guided by feed...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] Hybrid Robustness Verification for Spatio-Temporal Neural Networks

With AI increasingly deployed in safety-critical systems, providing formal robustness guarantees for the underlying models is essential. Existing verification m...

#research #paper #ai #machine-learning #computer-vision
1 week ago · ai · - · -

[Paper] Learning Dynamics Reveal a Hierarchy of Weight-Induced Layerwise Gram Metrics

We study feed-forward ReLU networks with fixed readout and quadratic loss. The aim is to rewrite gradient descent not primarily as a dynamics in weight space, b...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] HDSL: A Hierarchical Domain-Specific Language for Structured 3D Indoor Scene Generation and Localized Editing with LLM Agents

Text-driven indoor scene generation and editing require an intermediate representation that language models can both produce and revise. Existing LLM-based syst...

#research #paper #ai #computer-vision
1 week ago · ai · - · -

[Paper] The Neutral Mask: How RLHF Provides Shallow Alignment while Leaving Partisan Structure Intact in a Large Language Model

The ambition behind alignment training is to make large language models safe and useful. The primary mechanism, reinforcement learning from human feedback (RLHF...

#research #paper #ai #nlp
1 week ago · ai · - · -

[Paper] Adaptive directional gradients for parameterised quantum circuits

Training parameterised quantum circuits (PQCs) on quantum hardware is bottlenecked by the measurement cost of gradient estimation, which under the parameter-shi...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Tight Sample Complexity of Transformers

We tightly characterize the VC dimension of depth-L Transformers with a total of W parameters, mapping an input sequence of length T to a single output, establi...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] SearchSwarm: Towards Delegation Intelligence in Agentic LLMs for Long-Horizon Deep Research

Large language models are increasingly expected to handle complex, long-horizon real-world tasks whose context demands can grow without bound, yet model context...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Disentanglement with Holographic Reduced Representations

Disentanglement, the separation of factors of variation in data using neural networks, remains a long-standing challenge in machine learning. Prior work has add...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Beyond Probabilistic Similarity: Structural, Temporal, and Causal Limitations of Retrieval-Augmented Generation in the Legal Domain

Retrieval-Augmented Generation (RAG) has become a standard architectural response to unreliability in legal AI, yet high-profile failures, including fabricated ...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Evaluating the Representation Space of Diffusion Models via Self-Supervised Principles

Diffusion models have demonstrated remarkable generative capabilities and have also emerged as powerful self-supervised representation learners, yet the connect...

#research #paper #ai #machine-learning #computer-vision
1 week ago · ai · - · -

[Paper] Proxy Reward Internalization and Mechanistic Exploitation: A Learned Precursor to Reward Hacking and Its Generalization

Reward hacking is usually studied after it becomes visible, once a model earns high proxy reward while failing the intended task. We instead study what proxy RL...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] IS-CoT: Breaking the Long-form Generation Collapse via Interleaved Structural Thinking

Generating coherent and controllable long-form content remains a persistent challenge for Large Language Models (LLMs). While reasoning-enhanced models have dem...

#research #paper #ai #nlp
1 week ago · ai · - · -

[Paper] BrainSurgery: Reproducible and Reliable Declarative Weight Manipulations for Model Editing and Upcycling

As deep learning models scale, managing, inspecting, and modifying large checkpoints has become increasingly challenging. Researchers often need to alter model ...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] Learning to Attack and Defend: Adaptive Red Teaming of Language Models via GRPO

AI red teaming must continually adapt to evolving attackers and defenders. Reinforcement learning offers a promising approach to discovering novel attacks, and ...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] Cranio-Diff: Diffusion-based Cross-domain Craniofacial Reconstruction with 2D X-ray Skull Guidance and Structural Identity Constraints

The state-of-the-art generative models, such as CycleGAN, Pix2Pix, and diffusion models have demonstrated remarkable performance in the face generation task. Ho...

#research #paper #ai #computer-vision
1 week ago · ai · - · -

[Paper] PsychoSafe: Eliciting Psychologically-Informed Refusals in Large Language Models

Large language models (LLMs) routinely face requests that should be refused, creating a trade-off between helpfulness and harm prevention. However, refusals the...

#research #paper #ai #nlp
1 week ago · ai · - · -

[Paper] AutoMegaKernel: A Statically-Checked Agent Harness for Self-Retargeting Megakernel Synthesis

AutoMegaKernel (AMK) compiles a HuggingFace Llama-family model into a single persistent cooperative CUDA kernel that runs the whole forward pass in one launch, ...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] AutoMegaKernel: A Statically-Checked Agent Harness for Self-Retargeting Megakernel Synthesis

AutoMegaKernel (AMK) compiles a HuggingFace Llama-family model into a single persistent cooperative CUDA kernel that runs the whole forward pass in one launch, ...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] GenEyePose: Patient-Free, Knowledge-Based Saccadic Eye Movement Modeling for Digital Neurophysiologic Biomarker Development

Eye movements, including saccades, are widely regarded as highly sensitive and objective biomarkers of neurophysiologic states. Detecting saccadic signatures in...

#research #paper #ai #computer-vision
1 week ago · ai · - · -

[Paper] SoccerNet 2026 Player-Centric Ball-Action Spotting:Retraining and Post-Processing Extensions to the FOOTPASS Baselines

We describe our system for the SoccerNet 2026 Player-Centric Ball-Action Spotting Challenge, which requires predicting who performs which action and when, acros...

#research #paper #ai #computer-vision
1 week ago · ai · - · -

[Paper] Correlation Is Not Enough: Embedding Human Metadata for Individual Causal Discovery

Ask a pretrained biomedical language model whether 'cortisol 28 ug/dL' and 'stock-market volatility' are related, and it returns a cosine similarity of 0.83 on ...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] Visual Prompting Meets Feature Reconstruction-Based Anomaly Detection with Dual-Teacher Supervision

Recent Anomaly Detection methods achieve perfect detection and segmentation scores on well-established datasets, such as MVTec. However, many of these methods f...

#research #paper #ai #machine-learning #computer-vision
1 week ago · ai · - · -

[Paper] SpatialWorld: Benchmarking Interactive Spatial Reasoning of Multimodal Agents in Real-World Tasks

Spatial reasoning is a foundational capability for multimodal large language models (MLLMs) to perceive and operate within the physical world. However, existing...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] Cross-Modal Masking for Robust Silent Speech Synthesis Using sEMG and Lipreading

Speech restoration through silent speech interfaces (SSIs) has emerged as a promising assistive technology for individuals with impaired or absent laryngeal voi...

#research #paper #ai #nlp

Newer posts

Older posts