Source

arXiv

5752 posts from this source

Sort:

2 months ago · ai · - · -

[Paper] FR-GESTURE: An RGBD Dataset For Gesture-based Human-Robot Interaction In First Responder Operations

The ever increasing intensity and number of disasters make even more difficult the work of First Responders (FRs). Artificial intelligence and robotics solution...

#gesture recognition #RGB-D dataset #human‑robot interaction #computer vision #first responder robotics
2 months ago · ai · - · -

[Paper] RetouchIQ: MLLM Agents for Instruction-Based Image Retouching with Generalist Reward

Recent advances in multimodal large language models (MLLMs) have shown great potential for extending vision-language reasoning to professional tool-based image ...

#research #paper #ai #computer-vision
2 months ago · devops · - · -

[Paper] TopoSZp: Lightweight Topology-Aware Error-controlled Compression for Scientific Data

Error-bounded lossy compression is essential for managing the massive data volumes produced by large-scale HPC simulations. While state-of-the-art compressors s...

#research #paper #devops
2 months ago · ai · - · -

[Paper] KLong: Training LLM Agent for Extremely Long-horizon Tasks

This paper introduces KLong, an open-source LLM agent trained to solve extremely long-horizon tasks. The principle is to first cold-start the model via trajecto...

#research #paper #ai #machine-learning #nlp
2 months ago · ai · - · -

[Paper] Learning to Stay Safe: Adaptive Regularization Against Safety Degradation during Fine-Tuning

Instruction-following language models are trained to be helpful and safe, yet their safety behavior can deteriorate under benign fine-tuning and worsen under ad...

#research #paper #ai #machine-learning #nlp
2 months ago · devops · - · -

[Paper] Informative Trains: A Memory-Efficient Journey to a Self-Stabilizing Leader Election Algorithm in Anonymous Graphs

We study the self-stabilizing leader election problem in anonymous n-nodes networks. Achieving self-stabilization with low space memory complexity is particular...

#research #paper #devops
2 months ago · software · - · -

[Paper] Towards a Software Reference Architecture for Natural Language Processing Tools in Requirements Engineering

Natural Language Processing (NLP) tools support requirements engineering (RE) tasks like requirements elicitation, classification, and validation. However, they...

#research #paper #software
2 months ago · software · - · -

[Paper] The Runtime Dimension of Ethics in Self-Adaptive Systems

Self-adaptive systems increasingly operate in close interaction with humans, often sharing the same physical or virtual environments and making decisions with e...

#research #paper #software
2 months ago · ai · - · -

[Paper] Computer-Using World Model

Agents operating in complex software environments benefit from reasoning about the consequences of their actions, as even a single incorrect user interface (UI)...

#world model #UI automation #reinforcement learning #synthetic UI generation #transformer
2 months ago · devops · - · -

[Paper] Do GPUs Really Need New Tabular File Formats?

Parquet is the de facto columnar file format in modern analytical systems, yet its configuration guidelines have largely been shaped by CPU-centric execution mo...

#parquet #gpu-acceleration #data pipelines #performance tuning
2 months ago · software · - · -

[Paper] Socio-Technical Well-Being of Quantum Software Communities: An Overview on Community Smells

Quantum computing has gained significant attention due to its potential to solve computational problems beyond the capabilities of classical computers. With maj...

#research #paper #software
2 months ago · devops · - · -

[Paper] Evaluating Malleable Job Scheduling in HPC Clusters using Real-World Workloads

Optimizing resource utilization in high-performance computing (HPC) clusters is essential for maximizing both system efficiency and user satisfaction. However, ...

#HPC #job scheduling #malleable jobs #resource utilization #simulation
2 months ago · devops · - · -

[Paper] Visual Insights into Agentic Optimization of Pervasive Stream Processing Services

Processing sensory data close to the data source, often involving Edge devices, promises low latency for pervasive applications, like smart cities. This commonl...

#research #paper #devops
2 months ago · devops · - · -

[Paper] Trivance: Latency-Optimal AllReduce by Shortcutting Multiport Networks

AllReduce is a fundamental collective operation in distributed computing and a key performance bottleneck for large-scale training and inference. Its completion...

#research #paper #devops
2 months ago · software · - · -

[Paper] Disjunction Composition of BDD Transition Systems for Model-Based Testing

We introduce a compositional approach to model-based test generation in Behavior-Driven Development (BDD). BDD is an agile methodology in which system behavior ...

#model-based testing #behavior-driven development #transition systems #formal methods #test automation
2 months ago · software · - · -

[Paper] The Case for HTML First Web Development

Since its introduction in the early 90s, the web has become the largest application platform available globally. HyperText Markup Language (HTML) has been an es...

#research #paper #software
2 months ago · ai · - · -

[Paper] Robustness and Reasoning Fidelity of Large Language Models in Long-Context Code Question Answering

Large language models (LLMs) increasingly assist software engineering tasks that require reasoning over long code contexts, yet their robustness under varying i...

#research #paper #ai #machine-learning
2 months ago · software · - · -

[Paper] Quantifying Competitive Relationships Among Open-Source Software Projects

Throughout the history of software, evolution has occurred in cycles of rise and fall driven by competition, and open-source software (OSS) is no exception. Thi...

#research #paper #software
2 months ago · ai · - · -

[Paper] Heterogeneous Federated Fine-Tuning with Parallel One-Rank Adaptation

Large Language Models (LLMs) have demonstrated remarkable effectiveness in adapting to downstream tasks through fine-tuning. Federated Learning (FL) extends thi...

#federated learning #LoRA #LLM fine-tuning #heterogeneous devices #Fed-PLoRA
2 months ago · ai · - · -

[Paper] Learning under noisy supervision is governed by a feedback-truth gap

When feedback is absorbed faster than task structure can be evaluated, the learner will favor feedback over truth. A two-timescale model shows this feedback-tru...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] TeCoNeRV: Leveraging Temporal Coherence for Compressible Neural Representations for Videos

Implicit Neural Representations (INRs) have recently demonstrated impressive performance for video compression. However, since a separate INR must be overfit fo...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] Knowledge-Embedded Latent Projection for Robust Representation Learning

Latent space models are widely used for analyzing high-dimensional discrete data matrices, such as patient-feature matrices in electronic health records (EHRs),...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Policy Compiler for Secure Agentic Systems

LLM-based agents are increasingly being deployed in contexts requiring complex authorization policies: customer service protocols, approval workflows, data acce...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Learning Humanoid End-Effector Control for Open-Vocabulary Visual Loco-Manipulation

Visual loco-manipulation of arbitrary objects in the wild with humanoid robots requires accurate end-effector (EE) control and a generalizable understanding of ...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] Reinforced Fast Weights with Next-Sequence Prediction

Fast weight architectures offer a promising alternative to attention-based transformers for long-context modeling by maintaining constant memory overhead regard...

#research #paper #ai #nlp
2 months ago · ai · - · -

[Paper] Measuring Mid-2025 LLM-Assistance on Novice Performance in Biology

Large language models (LLMs) perform strongly on biological benchmarks, raising concerns that they may help novice actors acquire dual-use laboratory skills. Ye...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Saliency-Aware Multi-Route Thinking: Revisiting Vision-Language Reasoning

Vision-language models (VLMs) aim to reason by jointly leveraging visual and textual modalities. While allocating additional inference-time computation has prov...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] Calibrate-Then-Act: Cost-Aware Exploration in LLM Agents

LLMs are increasingly being used for complex problems which are not necessarily resolved in a single response, but require interacting with an environment to ac...

#LLM agents #cost-aware exploration #prompt engineering #reinforcement learning #research paper
2 months ago · ai · - · -

[Paper] Causality is Key for Interpretability Claims to Generalise

Interpretability research on large language models (LLMs) has yielded important insights into model behaviour, yet recurring pitfalls persist: findings that do ...

#interpretability #causal inference #large language models #research paper
2 months ago · ai · - · -

[Paper] Protecting the Undeleted in Machine Unlearning

Machine unlearning aims to remove specific data points from a trained model, often striving to emulate 'perfect retraining', i.e., producing the model that woul...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Parameter-free representations outperform single-cell foundation models on downstream benchmarks

Single-cell RNA sequencing (scRNA-seq) data exhibit strong and reproducible statistical structure. This has motivated the development of large-scale foundation ...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Synthetic-Powered Multiple Testing with FDR Control

Multiple hypothesis testing with false discovery rate (FDR) control is a fundamental problem in statistical inference, with broad applications in genomics, drug...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Are Object-Centric Representations Better At Compositional Generalization?

Compositional generalization, the ability to reason about novel combinations of familiar concepts, is fundamental to human cognition and a critical challenge fo...

#object-centric representations #compositional generalization #visual question answering #benchmark #representation learning
2 months ago · ai · - · -

[Paper] On the Hardness of Approximation of the Fair k-Center Problem

In this work, we study the hardness of approximation of the fair k-center problem. Here the data points are partitioned into groups and the task is to choose a ...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Scaling Open Discrete Audio Foundation Models with Interleaved Semantic, Acoustic, and Text Tokens

Current audio language models are predominantly text-first, either extending pre-trained text LLM backbones or relying on semantic-only audio tokens, limiting g...

#research #paper #ai #nlp
2 months ago · ai · - · -

[Paper] Retrieval-Augmented Foundation Models for Matched Molecular Pair Transformations to Recapitulate Medicinal Chemistry Intuition

Matched molecular pairs (MMPs) capture the local chemical edits that medicinal chemists routinely use to design analogs, but existing ML approaches either opera...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Learning Situated Awareness in the Real World

A core aspect of human perception is situated awareness, the ability to relate ourselves to the surrounding physical environment and reason over possible action...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] VETime: Vision Enhanced Zero-Shot Time Series Anomaly Detection

Time-series anomaly detection (TSAD) requires identifying both immediate Point Anomalies and long-range Context Anomalies. However, existing foundation models f...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] Neighborhood Stability as a Measure of Nearest Neighbor Searchability

Clustering-based Approximate Nearest Neighbor Search (ANNS) organizes a set of points into partitions, and searches only a few of them to find the nearest neigh...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] SPARC: Scenario Planning and Reasoning for Automated C Unit Test Generation

Automated unit test generation for C remains a formidable challenge due to the semantic gap between high-level program intent and the rigid syntactic constraint...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] PredMapNet: Future and Historical Reasoning for Consistent Online HD Vectorized Map Construction

High-definition (HD) maps are crucial to autonomous driving, providing structured representations of road elements to support navigation and planning. However, ...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] Towards a Science of AI Agent Reliability

AI agents are increasingly deployed to execute important tasks. While rising accuracy scores on standard benchmarks suggest rapid progress, many agents still co...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Unpaired Image-to-Image Translation via a Self-Supervised Semantic Bridge

Adversarial diffusion and diffusion-inversion methods have advanced unpaired image-to-image translation, but each faces key limitations. Adversarial approaches ...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] Align Once, Benefit Multilingually: Enforcing Multilingual Consistency for LLM Safety Alignment

The widespread deployment of large language models (LLMs) across linguistic communities necessitates reliable multilingual safety alignment. However, recent eff...

#multilingual LLM alignment #safety alignment #MLC loss #research paper #LLM safety
2 months ago · ai · - · -

[Paper] Agent Skill Framework: Perspectives on the Potential of Small Language Models in Industrial Environments

Agent Skill framework, now widely and officially supported by major players such as GitHub Copilot, LangChain, and OpenAI, performs especially well with proprie...

#agent-skill-framework #small-language-models #benchmarking #industrial-automation
2 months ago · ai · - · -

[Paper] Retrieval Augmented Generation of Literature-derived Polymer Knowledge: The Example of a Biodegradable Polymer Expert System

Polymer literature contains a large and growing body of experimental knowledge, yet much of it is buried in unstructured text and inconsistent terminology, maki...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Quecto-V1: Empirical Analysis of 8-bit Quantized Small Language Models for On-Device Legal Retrieval

The rapid proliferation of Large Language Models (LLMs) has revolutionized Natural Language Processing (NLP) but has simultaneously created a 'resource divide.'...

#research #paper #ai #nlp
2 months ago · ai · - · -

[Paper] AREG: Adversarial Resource Extraction Game for Evaluating Persuasion and Resistance in Large Language Models

Evaluating the social intelligence of Large Language Models (LLMs) increasingly requires moving beyond static text generation toward dynamic, adversarial intera...

#large-language-models #adversarial-benchmark #persuasion-resistance #LLM-evaluation #nlp

Newer posts

Older posts