Source

arXiv

5785 posts from this source

Sort:

3 months ago · ai · - · -

[Paper] MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents

Most Large Language Model (LLM) agent memory systems rely on a small set of static, hand-designed operations for extracting memory. These fixed procedures hard-...

#research #paper #ai #machine-learning #nlp
3 months ago · ai · - · -

[Paper] HumanX: Toward Agile and Generalizable Humanoid Interaction Skills from Human Videos

Enabling humanoid robots to perform agile and adaptive interactive tasks has long been a core challenge in robotics. Current approaches are bottlenecked by eith...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] SPARKLING: Balancing Signal Preservation and Symmetry Breaking for Width-Progressive Learning

Progressive Learning (PL) reduces pre-training computational overhead by gradually increasing model scale. While prior work has extensively explored depth expan...

#research #paper #ai #machine-learning #nlp
3 months ago · ai · - · -

[Paper] Multi-head automated segmentation by incorporating detection head into the contextual layer neural network

Deep learning based auto segmentation is increasingly used in radiotherapy, but conventional models often produce anatomically implausible false positives, or h...

#research #paper #ai #machine-learning #computer-vision
3 months ago · ai · - · -

[Paper] Breaking the Reversal Curse in Autoregressive Language Models via Identity Bridge

Autoregressive large language models (LLMs) have achieved remarkable success in many complex tasks, yet they can still fail in very simple logical reasoning suc...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Age-Aware Edge-Blind Federated Learning via Over-the-Air Aggregation

We study federated learning (FL) over wireless fading channels where multiple devices simultaneously send their model updates. We propose an efficient age-aware...

#federated learning #over-the-air aggregation #edge computing #wireless communications #AI
3 months ago · ai · - · -

[Paper] Avenir-Web: Human-Experience-Imitating Multimodal Web Agents with Mixture of Grounding Experts

Despite advances in multimodal large language models, autonomous web agents still struggle to reliably execute long-horizon tasks on complex and dynamic web int...

#multimodal agents #web automation #grounding experts #experience imitation planning #open-source AI
3 months ago · ai · - · -

[Paper] Indications of Belief-Guided Agency and Meta-Cognitive Monitoring in Large Language Models

Rapid advancements in large language models (LLMs) have sparked the question whether these models possess some form of consciousness. To tackle this challenge, ...

#research #paper #ai #nlp
3 months ago · ai · - · -

[Paper] MentisOculi: Revealing the Limits of Reasoning with Mental Imagery

Frontier models are transitioning from multimodal large language models (MLLMs) that merely ingest visual information to unified multimodal models (UMMs) capabl...

#research #paper #ai #machine-learning #computer-vision
3 months ago · ai · - · -

[Paper] From Directions to Regions: Decomposing Activations in Language Models via Local Geometry

Activation decomposition methods in language models are tightly coupled to geometric assumptions on how concepts are realized in activation space. Existing appr...

#research #paper #ai #nlp
3 months ago · ai · - · -

[Paper] Abstract Activation Spaces for Content-Invariant Reasoning in Large Language Models

Large Language Models (LLMs) often struggle with deductive judgment in syllogistic reasoning, systematically conflating semantic plausibility with formal validi...

#large-language-models #logical-reasoning #activation-space #neural-network-intervention #research-paper
3 months ago · ai · - · -

[Paper] Conflict-Aware Client Selection for Multi-Server Federated Learning

Federated learning (FL) has emerged as a promising distributed machine learning (ML) that enables collaborative model training across clients without exposing r...

#federated learning #reinforcement learning #client selection #edge computing #communication efficiency
3 months ago · ai · - · -

[Paper] Drift-Bench: Diagnosing Cooperative Breakdowns in LLM Agents under Input Faults via Multi-Turn Interaction

As Large Language Models transition to autonomous agents, user inputs frequently violate cooperative assumptions (e.g., implicit intent, missing parameters, fal...

#research #paper #ai #machine-learning #nlp
3 months ago · ai · - · -

[Paper] RANKVIDEO: Reasoning Reranking for Text-to-Video Retrieval

Reranking is a critical component of modern retrieval systems, which typically pair an efficient first-stage retriever with a more expressive model to refine re...

#research #paper #ai #computer-vision
3 months ago · ai · - · -

[Paper] Energy-Efficient Neuromorphic Computing for Edge AI: A Framework with Adaptive Spiking Neural Networks and Hardware-Aware Optimization

Edge AI applications increasingly require ultra-low-power, low-latency inference. Neuromorphic computing based on event-driven spiking neural networks (SNNs) of...

#spiking neural networks #neuromorphic computing #edge AI #energy efficiency #hardware-aware optimization
3 months ago · devops · - · -

[Paper] sVIRGO: A Scalable Virtual Tree Hierarchical Framework for Distributed Systems

We propose sVIRGO, a scalable virtual tree hierarchical framework for large-scale distributed systems. sVIRGO constructs virtual hierarchical trees directly on ...

#research #paper #devops
3 months ago · ai · - · -

[Paper] UniReason 1.0: A Unified Reasoning Framework for World Knowledge Aligned Image Generation and Editing

Unified multimodal models often struggle with complex synthesis tasks that demand deep reasoning, and typically treat text-to-image generation and image editing...

#research #paper #ai #machine-learning #computer-vision
3 months ago · ai · - · -

[Paper] SelvaMask: Segmenting Trees in Tropical Forests and Beyond

Tropical forests harbor most of the planet's tree biodiversity and are critical to global ecological balance. Canopy trees in particular play a disproportionate...

#research #paper #ai #computer-vision
3 months ago · ai · - · -

[Paper] SafeGround: Know When to Trust GUI Grounding Models via Uncertainty Calibration

Graphical User Interface (GUI) grounding aims to translate natural language instructions into executable screen coordinates, enabling automated GUI interaction....

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Catalyst: Out-of-Distribution Detection via Elastic Scaling

Out-of-distribution (OOD) detection is critical for the safe deployment of deep neural networks. State-of-the-art post-hoc methods typically derive OOD scores f...

#research #paper #ai #computer-vision
3 months ago · ai · - · -

[Paper] ReasonEdit: Editing Vision-Language Models using Human Reasoning

Model editing aims to correct errors in large, pretrained models without altering unrelated behaviors. While some recent works have edited vision-language model...

#research #paper #ai #machine-learning #computer-vision
3 months ago · ai · - · -

[Paper] SoMA: A Real-to-Sim Neural Simulator for Robotic Soft-body Manipulation

Simulating deformable objects under rich interactions remains a fundamental challenge for real-to-sim robot manipulation, with dynamics jointly driven by enviro...

#research #paper #ai #machine-learning #computer-vision
3 months ago · ai · - · -

[Paper] Superman: Unifying Skeleton and Vision for Human Motion Perception and Generation

Human motion analysis tasks, such as temporal 3D pose estimation, motion prediction, and motion in-betweening, play an essential role in computer vision. Howeve...

#research #paper #ai #computer-vision
3 months ago · ai · - · -

[Paper] SWE-Universe: Scale Real-World Verifiable Environments to Millions

We propose SWE-Universe, a scalable and efficient framework for automatically constructing real-world software engineering (SWE) verifiable environments from Gi...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Hierarchical Federated Learning with SignSGD: A Highly Communication-Efficient Approach

Hierarchical federated learning (HFL) has emerged as a key architecture for large-scale wireless and Internet of Things systems, where devices communicate with ...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] A Task-Level Evaluation of AI Agents in Open-Source Projects

In this paper, we present a comparative study of five autonomous coding agents using AIDev-pop, which is a public dataset containing thousands of AI-generated p...

#AI coding assistants #pull‑request evaluation #AIDev‑pop dataset #software engineering AI #open‑source AI tools
3 months ago · devops · - · -

[Paper] LCLs Beyond Bounded Degrees

The study of Locally Checkable Labelings (LCLs) has led to a remarkably precise characterization of the distributed time complexities that can occur on bounded-...

#research #paper #devops
3 months ago · software · - · -

[Paper] Building a Correct-by-Design Lakehouse. Data Contracts, Versioning, and Transactional Pipelines for Humans and Agents

Lakehouses are the default cloud platform for analytics and AI, but they become unsafe when untrusted actors concurrently operate on production data: upstream-d...

#lakehouse #data contracts #versioning #transactional pipelines #data engineering
3 months ago · ai · - · -

[Paper] Introns and Templates Matter: Rethinking Linkage in GP-GOMEA

GP-GOMEA is among the state-of-the-art for symbolic regression, especially when it comes to finding small and potentially interpretable solutions. A key mechani...

#research #paper #ai
3 months ago · software · - · -

[Paper] Understanding and Detecting Flaky Builds in GitHub Actions

Continuous Integration (CI) is widely used to provide rapid feedback on code changes; however, CI build outcomes are not always reliable. Builds may fail interm...

#research #paper #software
3 months ago · ai · - · -

[Paper] Spark: Modular Spiking Neural Networks

Nowadays, neural networks act as a synonym for artificial intelligence. Present neural network models, although remarkably powerful, are inefficient both in ter...

#research #paper #ai #machine-learning
3 months ago · software · - · -

[Paper] Before Autonomy Takes Control: Software Testing in Robotics

Robotic systems are complex and safety-critical software systems. As such, they need to be tested thoroughly. Unfortunately, robot software is intrinsically har...

#software testing #robotics #research mapping #automation #test methodology
3 months ago · ai · - · -

[Paper] Backpropagation as Physical Relaxation: Exact Gradients in Finite Time

Backpropagation, the foundational algorithm for training neural networks, is typically understood as a symbolic computation that recursively applies the chain r...

#backpropagation #neuromorphic computing #continuous-time dynamics #gradient methods #research paper
3 months ago · ai · - · -

[Paper] RACA: Representation-Aware Coverage Criteria for LLM Safety Testing

Recent advancements in LLMs have led to significant breakthroughs in various AI applications. However, their sophisticated capabilities also introduce severe sa...

#research #paper #ai #machine-learning #nlp
3 months ago · software · - · -

[Paper] Bridging the Sim-to-Real Gap with multipanda ros2: A Real-Time ROS2 Framework for Multimanual Systems

We present multipanda_ros2, a novel open-source ROS2 architecture for multi-robot control of Franka Robotics robots. Leveraging ros2 control, this framework pro...

#ROS2 #robotics #simulation #real-time control #MuJoCo
3 months ago · ai · - · -

[Paper] OmniCode: A Benchmark for Evaluating Software Engineering Agents

LLM-powered coding agents are redefining how real-world software is developed. To drive the research towards better coding agents, we require challenging benchm...

#research #paper #ai #machine-learning #nlp
3 months ago · ai · - · -

[Paper] Online Fine-Tuning of Pretrained Controllers for Autonomous Driving via Real-Time Recurrent RL

Deploying pretrained policies in real-world applications presents substantial challenges that fundamentally limit the practical applicability of learning-based ...

#research #paper #ai #machine-learning
3 months ago · software · - · -

[Paper] Agent-Based Software Artifact Evaluation

Artifact evaluation has been adopted in the Software Engineering (SE) research community for 15 years, substantially improving research reproducibility across m...

#artifact evaluation #LLM agents #automation #software reproducibility #software engineering tools
3 months ago · ai · - · -

[Paper] Enabling AI Deep Potentials for Ab Initio-quality Molecular Dynamics Simulations in GROMACS

State-of-the-art AI deep potentials provide ab initio-quality results, but at a fraction of the computational cost of first-principles quantum mechanical calcul...

#deep learning potentials #GROMACS integration #DeePMD-kit #molecular dynamics #GPU acceleration
3 months ago · devops · - · -

[Paper] vLLM-Omni: Fully Disaggregated Serving for Any-to-Any Multimodal Models

Any-to-any multimodal models that jointly handle text, images, video, and audio represent a significant advance in multimodal AI. However, their complex archite...

#research #paper #devops
3 months ago · ai · - · -

[Paper] ECHO-2: A Large Scale Distributed Rollout Framework for Cost-efficient Reinforcement Learning

Reinforcement learning (RL) is a critical stage in post-training large language models (LLMs), involving repeated interaction between rollout generation, reward...

#reinforcement-learning #distributed-rollouts #large-language-models #cost-optimization #staleness-aware
3 months ago · devops · - · -

[Paper] TriCloudEdge: A multi-layer Cloud Continuum

TriCloudEdge is a scalable three-tier cloud continuum that integrates far-edge devices, intermediate edge nodes, and central cloud services, working in parallel...

#research #paper #devops
3 months ago · ai · - · -

[Paper] Multi-View Stenosis Classification Leveraging Transformer-Based Multiple-Instance Learning Using Real-World Clinical Data

Coronary artery stenosis is a leading cause of cardiovascular disease, diagnosed by analyzing the coronary arteries from multiple angiography views. Although nu...

#research #paper #ai #machine-learning #computer-vision
3 months ago · ai · - · -

[Paper] See2Refine: Vision-Language Feedback Improves LLM-Based eHMI Action Designers

Automated vehicles lack natural communication channels with other road users, making external Human-Machine Interfaces (eHMIs) essential for conveying intent an...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Learning to Route and Schedule LLMs from User Retrials via Contextual Queueing Bandits

Explosive demands for LLMs often cause user queries to accumulate in server queues, requiring efficient routing (query-LLM matching) and scheduling (query prior...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] FiLoRA: Focus-and-Ignore LoRA for Controllable Feature Reliance

Multimodal foundation models integrate heterogeneous signals across modalities, yet it remains poorly understood how their predictions depend on specific intern...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Ultrafast On-chip Online Learning via Spline Locality in Kolmogorov-Arnold Networks

Ultrafast online learning is essential for high-frequency systems, such as controls for quantum computing and nuclear fusion, where adaptation must occur on sub...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] FORLER: Federated Offline Reinforcement Learning with Q-Ensemble and Actor Rectification

In Internet-of-Things systems, federated learning has advanced online reinforcement learning (RL) by enabling parallel policy training without sharing raw data....

#research #paper #ai #machine-learning

Newer posts

Older posts