Source

arXiv

1603 posts from this source

Sort:

3 weeks ago · ai · - · -

[Paper] AwareVLN: Reasoning with Self-awareness for Vision-Language Navigation

Vision-and-Language Navigation (VLN) requires an agent to ground language instructions to its own movement within a visual environment. While state-of-the-art m...

#research #paper #ai #computer-vision
3 weeks ago · ai · - · -

[Paper] Remember to be Curious: Episodic Context and Persistent Worlds for 3D Exploration

Exploration is a prerequisite for learning useful behaviors in sparse-reward, long-horizon tasks, particularly within 3D environments. Curiosity-driven reinforc...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] GesVLA: Gesture-Aware Vision-Language-Action Model Embedded Representations

Vision-Language-Action (VLA) models have shown strong potential for general-purpose robot manipulation by unifying perception and action. However, existing VLA ...

#research #paper #ai #computer-vision
3 weeks ago · ai · - · -

[Paper] Sensor2Sensor: Cross-Embodiment Sensor Conversion for Autonomous Driving

Robust training and validation of Autonomous Driving Systems (ADS) require massive, diverse datasets. Proprietary data collected by Autonomous Vehicle (AV) flee...

#research #paper #ai #computer-vision
3 weeks ago · ai · - · -

[Paper] The Matching Principle: A Geometric Theory of Loss Functions for Nuisance-Robust Representation Learning

Robustness, domain adaptation, photometric and occlusion invariance, compositional generalisation, temporal robustness, alignment safety, and classical anisotro...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Finite-Particle Convergence Rates for Conservative and Non-Conservative Drifting Models

We propose and analyze a conservative drifting method for one-step generative modeling. The method replaces the original displacement-based drifting velocity by...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] MOSS: Self-Evolution through Source-Level Rewriting in Autonomous Agent Systems

Autonomous agentic systems are largely static after deployment: they do not learn from user interactions, and recurring failures persist until the next human-dr...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Gated DeltaNet-2: Decoupling Erase and Write in Linear Attention

Linear attention replaces the unbounded cache of softmax attention with a fixed-size recurrent state, reducing sequence mixing to linear time and decoding to co...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] LCGuard: Latent Communication Guard for Safe KV Sharing in Multi-Agent Systems

Large language model (LLM)-based multi-agent systems increasingly rely on intermediate communication to coordinate complex tasks. While most existing systems co...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Evaluating Commercial AI Chatbots as News Intermediaries

AI chatbots are rapidly shaping how people encounter the news, yet no prior study has systematically measured how accurately these systems, with their proprieta...

#research #paper #ai #nlp
3 weeks ago · ai · - · -

[Paper] DeltaBox: Scaling Stateful AI Agents with Millisecond-Level Sandbox Checkpoint/Rollback

LLM-powered AI agents require high-frequency state exploration (e.g., test-time tree search and reinforcement learning), relying on rapid checkpoint and rollbac...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] FAME: Failure-Aware Mixture-of-Experts for Message-Level Log Anomaly Detection

Production systems generate millions of log lines daily, yet most anomaly detectors operate at the session or window-level, flagging groups of lines rather than...

#research #paper #ai #machine-learning
3 weeks ago · devops · - · -

[Paper] AI-Driven Multi-Region Provisioning for Cloud Services Using Spot Fleets

Cloud service platforms increasingly rely on elastic infrastructures to support dynamic workloads. Spot instances provide discounted computing resources but int...

#research #paper #devops
3 weeks ago · ai · - · -

[Paper] DecQ: Detail-Condensing Queries for Enhanced Reconstruction and Generation in Representation Autoencoders

Representation Autoencoders (RAEs) leverage frozen vision foundation models (VFMs) as tokenizer encoders, providing robust high-level representations that facil...

#research #paper #ai #computer-vision
3 weeks ago · ai · - · -

[Paper] SDPM: Survival Diffusion Probabilistic Model for Continuous-Time Survival Analysis

Survival analysis aims to estimate a time-to-event distribution from data with censored observations. Many existing methods either impose structural assumptions...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] MambaGaze: Bidirectional Mamba with Explicit Missing Data Modeling for Cognitive Load Assessment from Eye-Gaze Tracking Data

Real-time cognitive load assessment from eye-tracking signals could potentially enable adaptive human-centered-AI such as safety-critical applications such as d...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] CogAdapt: Transferring Clinical ECG Foundation Models to Wearable Cognitive Load Assessment via Lead Adaptation

Real-time cognitive load assessment is essential for adaptive human-computer interaction but remains challenging due to limited labeled data and poor cross-subj...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Reducing Political Manipulation with Consistency Training

Large language models (LLMs) exhibit systematic political bias across a variety of sensitive contexts. We find that LLMs handle counterpart topics from opposing...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] Understanding Data Temporality Impact on Large Language Models Pre-training

Large language models (LLMs) are typically trained on shuffled corpora, yielding models whose knowledge is frozen at train time and whose temporal grounding rem...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] Synthetic Data Alone is Enough? Rethinking Data Scarcity in Pediatric Rare Disease Recognition

Children with rare genetic diseases often exhibit distinctive facial phenotypes, yet developing computer vision systems for early diagnosis remains challenging ...

#research #paper #ai #computer-vision
3 weeks ago · ai · - · -

[Paper] Spectral Tail Auxiliary Learning for AI-Generated Image Detection

As generative image models evolve rapidly, the perceptual gap between generated and real images continues to narrow, making AI-generated image detection increas...

#research #paper #ai #computer-vision
3 weeks ago · ai · - · -

[Paper] ChronoMedKG: A Temporally-Grounded Biomedical Knowledge Graph and Benchmark for Clinical Reasoning

Biomedical knowledge graphs (KGs) treat disease associations as static facts, but temporal information is crucial for clinical reasoning, e.g., a symptom diagno...

#research #paper #ai #nlp
3 weeks ago · ai · - · -

[Paper] HarnessAPI: A Skill-First Framework for Unified Streaming APIs and MCP Tools

Every Python function deployed as an LLM tool must today exist in two forms: an HTTP endpoint for human-facing clients and CI pipelines, and an MCP tool registr...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Beyond Acoustic Emotion Recognition: Multimodal Pathos Analysis in Political Speech Using LLM-Based and Acoustic Emotion Models

We investigate whether acoustic emotion recognition models can serve as proxies for the Pathos dimension in political speech analysis, as operationalised by the...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] WorldKV: Efficient World Memory with World Retrieval and Compression

Autoregressive video diffusion models have enabled real-time, action-conditioned world generation. However, sustaining a persistent world, where revisiting a pr...

#research #paper #ai #computer-vision
3 weeks ago · ai · - · -

[Paper] AnyMo: Geometry-Aware Setup-Agnostic Modeling of Human Motion in the Wild

As wearable and mobile devices become increasingly embedded in daily life, they offer a practical way to continuously sense human motion in the wild. But inerti...

#research #paper #ai #machine-learning #nlp #computer-vision
3 weeks ago · ai · - · -

[Paper] AMEL: Accumulated Message Effects on LLM Judgments

Large language models are routinely used as automated evaluators: to review code, moderate content, or score outputs, often with many items passing through one ...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] Tokenization with Split Trees

We introduce Tokenization with Split Trees (ToaST), a subword tokenization method that directly optimizes compression under a new recursive inference procedure....

#research #paper #ai #nlp
3 weeks ago · devops · - · -

[Paper] A Generalized Nash Equilibrium-Seeking Scheme for Trauma Resuscitation

Trauma resuscitation is a clinical process for treating life-threatening physiological disorders in safety-critical environments, driven by the experience of he...

#research #paper #devops
3 weeks ago · ai · - · -

[Paper] Contractual Skills: A GovernSpec Design Framework for Enterprise AI Agents

Skills are increasingly used to package agent instructions, workflows, scripts, and reference materials. In enterprise settings, however, skills often need to e...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Innovations in Cardless Artificial Intelligence Banking: A Comprehensive Framework for Cyber Secure and Fraud Mitigation using Machine Learning Algorithms

The advent of cardless artificial intelligence (AI) banking heralds a paradigm shift in the financial landscape, offering users unprecedented security and conve...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] SynAE: A Framework for Measuring the Quality of Synthetic Data for Tool-Calling Agent Evaluations

Today, tool-calling agents are commonly evaluated or tested on static datasets of execution traces, including input commands, agent responses, and associated to...

#research #paper #ai #machine-learning #nlp
3 weeks ago · software · - · -

[Paper] Why Are Agentic Pull Requests Merged or Rejected? An Empirical Study

AI coding agents increasingly submit pull requests (Agentic-PRs) to open-source repositories, yet their performance is commonly assessed using merge and rejecti...

#research #paper #software
3 weeks ago · ai · - · -

[Paper] Quantum Genetic Optimization for Negative Selection Algorithms in Anomaly Detection

Negative Selection Algorithms (NSAs), inspired by the self/non-self discrimination mechanism of the human immune system, have been widely employed in anomaly de...

#research #paper #ai
3 weeks ago · software · - · -

[Paper] 'Refactoring Runaway': Understanding and Mitigating Tangled Refactorings in Coding Agents for Issue Resolution

Recent advances in coding agents have shown remarkable progress in software issue resolution. In practice, real-world issues are typically bug fixes or feature ...

#research #paper #software
3 weeks ago · devops · - · -

[Paper] Relay-Based Synchronization of Replicated Data Types in Opportunistic Networks

In Opportunistic Networks (OppNets), the dissemination of information can only rely on transient pairwise radio contacts between mobile devices (peers). Designi...

#research #paper #devops
3 weeks ago · devops · - · -

[Paper] Exploiting Multicast for Accelerating Collective Communication

Reducing collective communication latency is a critical goal for large model training and inference in both academia and industry. Many-to-many communications, ...

#research #paper #devops
3 weeks ago · devops · - · -

[Paper] Monotone Erasure Codes

Erasure codes are a critical component in reliable storage systems today, and many blockchain systems use consensus protocols that involve erasure codes to redu...

#research #paper #devops
3 weeks ago · ai · - · -

[Paper] The Neglected Baseline in Model Interpretation

We observe that existing model interpretation methods generally ignore the baseline, and such neglect often results in imprecise or even incorrect interpretatio...

#research #paper #ai #computer-vision
3 weeks ago · ai · - · -

[Paper] Asymmetric Virtual Memory Paging for Hybrid Mamba-Transformer Inference

Hybrid language models like Jamba mix attention layers with State Space Models (SSMs), creating two memory cache types with opposite profiles: Key-Value (KV) ca...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Cross-Species RSA Reveals Conserved Early Visual Alignment but Divergent Higher-Area Rankings Across Human fMRI and Macaque Electrophysiology

Does the relationship between learning rules and brain alignment generalize across species? We extend our prior finding that untrained CNNs match backpropagatio...

#research #paper #ai #machine-learning
3 weeks ago · devops · - · -

[Paper] Nf-PEAK: Process-Based Energy Attribution for Nextflow Workflows on Kubernetes Clusters

Scientific workflows are pipelines of interdependent tasks. They are increasingly executed on shared Kubernetes clusters via workflow engines such as Nextflow. ...

#research #paper #devops
3 weeks ago · ai · - · -

[Paper] Guiding Multi-Objective Genetic Programming with Description Length Improves Symbolic Regression Solutions

Symbolic regression with genetic programming (GPSR) may suffer from overfitting and structural bloat, especially when noise is present. In this paper we evaluat...

#research #paper #ai
3 weeks ago · ai · - · -

[Paper] VeriScale: Adversarial Test-Suite Scaling for Verifiable Code Generation

As large language models (LLMs) are increasingly deployed for software engineering, constructing high-quality benchmarks is crucial for evaluating not just the ...

#research #paper #ai #machine-learning
3 weeks ago · software · - · -

[Paper] At What Cost? Software Developers' Well-Being in the Age of GenAI

Generative Artificial Intelligence (GenAI) is rapidly reshaping software development, with growing emphasis on accelerating productivity and optimizing performa...

#research #paper #software
3 weeks ago · ai · - · -

[Paper] SepsisAI Orchestrator: A Containerized and Scalable Platform for Deploying AI Models and Real-Time Monitoring in Early Sepsis Detection

Despite strong predictive results in the clinical machine learning literature, the translation of these models into bedside use remains limited by systems-level...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Temporal Coding as a Substrate for Sensorimotor Object Inference: A Spiking Reinterpretation of Thousand Brains Architecture

The Thousand Brains Theory (TBT) and its open-source Monty framework model object recognition through sensorimotor inference -- identifying objects by actively ...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Secure and Parallel Determinant Computation for Large-Scale Matrices in Edge Environments

The advent of edge computing has enabled resource-constrained clients to delegate intensive computational tasks to distributed edge servers, especially within I...

#research #paper #ai #machine-learning

Newer posts

Older posts