Source

arXiv

1659 posts from this source

Sort:

2 weeks ago · ai · - · -

[Paper] REPOT: Recoverable Program-of-Thought via Checkpoint Repair

One-shot Program-of-Thought (PoT) emits a Python program that prints a primitive-action plan; a single invalid action silently invalidates the trajectory. We in...

#research #paper #ai #machine-learning #nlp
2 weeks ago · software · - · -

[Paper] The Rise of the Software-Defined Vehicle: Architectures, Enabling Technologies, and Future Opportunities

The transition toward Software-Defined Vehicles (SDVs) represents a major paradigm shift in vehicle design, transforming traditional hardware-centric systems in...

#research #paper #software
2 weeks ago · devops · - · -

[Paper] Effective MPI: User-defined Datatypes and Cartesian Communicators for Zero-copy All-to-all Communication in Multidimensional Tori

We present and show how to implement a non-trivial all-to-all communication algorithm for arbitrary d-dimensional tori effectively in MPI. Given a factorization...

#research #paper #devops
2 weeks ago · ai · - · -

[Paper] Selection Hyper-heuristics Can Automatically Adjust the Learning Period to Optimally Solve Pseudo-Boolean Problems

The Random Gradient hyper-heuristic was recently shown to be able to learn the optimal neighbourhood size when optimizing the LeadingOnes benchmark via the Rand...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Agora: Toward Autonomous Bug Detection in Production-Level Consensus Protocols with LLM Agents

Consensus protocols form the backbone of distributed systems and blockchains, where implementation bugs can cause data corruption and financial losses. While LL...

#research #paper #ai #machine-learning
2 weeks ago · software · - · -

[Paper] Claim against Measurement: Statistical Artefacts in Quantum Error Mitigation Benchmarks

QEM is widely regarded as a plausible bridge from NISQ devices to FTQC. Yet the empirical studies used to assess the effectiveness of QEM techniques on concrete...

#research #paper #software
2 weeks ago · software · - · -

[Paper] TagDebt: A Bot to Support Technical Debt Management

Context: Technical debt (TD) is a widely studied metaphor that helps to explain how sub-optimal decisions that can harm software maintainability over time. Alth...

#research #paper #software
2 weeks ago · ai · - · -

[Paper] Ciphera: A Decentralised Biometric Identity Framework

Centralised biometric identity systems expose users to single points of failure, opaque verification processes, and irreversible biometric compromise. Decentral...

#research #paper #ai #computer-vision
2 weeks ago · ai · - · -

[Paper] Inferring Code Correctness from Specification

Large language models (LLMs) have become integral to modern software development, enabling automated code generation at scale. However, validating the correctne...

#research #paper #ai #machine-learning
2 weeks ago · devops · - · -

[Paper] CARM Tool: Cache-Aware Roofline Model Automatic Benchmarking and Application Analysis

In recent years, HPC systems and CPU architectures as their central components, have become increasingly complex, making application development and optimizatio...

#research #paper #devops
2 weeks ago · devops · - · -

[Paper] PRISM: Processing-In-Memory Sparse MTTKRP for Tensor Decomposition Acceleration

Sparse tensors are the most used representation of sparse multidimensional data. Operations that decompose them, selecting their most important features while r...

#research #paper #devops
2 weeks ago · ai · - · -

[Paper] AMDP: Asynchronous Multi-Directional Pipeline Parallelism for Large-Scale Models Training

Pipeline parallelism is essential for large-scale model training, but existing asynchronous approaches often degrade convergence due to parameter mismatch betwe...

#research #paper #ai #machine-learning
2 weeks ago · devops · - · -

[Paper] TC-MIS: Maximal Independent Set on Tensor-cores

Maximal Independent Set (MIS) in a graph is a fundamental problem with applications in resource allocation, scheduling, and network optimization. Although graph...

#research #paper #devops
2 weeks ago · devops · - · -

[Paper] Design and Implementation of a Serverless MapReduce Framework for Scalable Data Pipelines

Modern logistics systems tend to generate continuous streams of data from sources such as GPS, IoT sensors, and logistics management systems. The aggregation, p...

#research #paper #devops
2 weeks ago · devops · - · -

[Paper] Silent Data Corruption Protection through Efficient Task Replication

The trend of increasing cluster sizes of supercomputers leads to a growing susceptibility to Silent Data Corruption (SDC) that can invalidate program results. A...

#research #paper #devops
2 weeks ago · ai · - · -

[Paper] Evolutionary Rule Extraction from Corporate Default Prediction Models

Small and medium-sized enterprises (SMEs) represent the majority of firms in most economies and often face financial constraints and higher vulnerability to fin...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Runtime Analysis of a Compact Genetic Algorithm on a Truly Multi-valued OneMax Function

Recently, the runtime analysis of multi-valued estimation-of-distribution algorithms in the framework of Ben Jedidia et al. (TCS 2024) has made significant adva...

#research #paper #ai
2 weeks ago · ai · - · -

[Paper] EvoGM: Learning to Merge LLMs via Evolutionary Generative Optimization

Evolutionary model merging provides a powerful framework for the automated, training-free composition of LLMs through parameter-space search. However, existing ...

#research #paper #ai
2 weeks ago · ai · - · -

[Paper] Compute Allocation in Evolutionary Search: From Depth-Breadth to Multi-Armed Bandits

LLM-guided evolutionary search (Evolve systems) has reached state-of-the-art results on mathematical and combinatorial tasks, yet most existing systems report o...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] Real-rootedness of the Poincaré polynomials of $overline{mathcal M}_{0,n}$: an AI-assisted proof

We prove real-rootedness for the Poincaré polynomial [ P_n(t)=sum_{i=0}^{n-3} dim H^{2i}(overline{mathcal M}_{0,n};mathbb{Q})t^i ] of the Deligne--Mumford modul...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] From Pixels to Words -- Towards Native One-Vision Models at Scale

Current vision-language models (VLMs) typically stitch together separate image encoders and language decoders via multi-stage alignment, a modular framework tha...

#research #paper #ai #computer-vision
3 weeks ago · ai · - · -

[Paper] PEFT-Arena: Understanding Parameter-Efficient Finetuning from a Stability-Plasticity Perspective

Parameter-efficient finetuning (PEFT) has become the standard approach for adapting large language models, yet evaluations largely emphasize downstream accuracy...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] VLMs May Not Globally Enhance Human Alignment over LLMs During Natural Reading

Large language models (LLMs) have become increasingly useful computational models of human language processing, but it remains unclear whether vision-language l...

#research #paper #ai #nlp
3 weeks ago · ai · - · -

[Paper] Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players

World models for interactive video generation have largely focused on single-agent settings, where future observations are generated from a single control signa...

#research #paper #ai #computer-vision
3 weeks ago · ai · - · -

[Paper] Self-Improving Language Models with Bidirectional Evolutionary Search

Search has been proposed as an effective method for self-improving language models and agentic systems, both for post-training sample generation and for inferen...

#research #paper #ai #nlp
3 weeks ago · ai · - · -

[Paper] Beyond Binary: Sim-to-Real Dexterous Manipulation with Physics-Grounded Contact Representation

A primary bottleneck in contact-rich manipulation is the difficulty of collecting real-world data. Sim-to-real reinforcement learning offers a scalable alternat...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] HarmoVid: Relightful Video Portrait Harmonization

We present a method for harmonizing the lighting of a foreground video to match a target background scene, adjusting shadows, color tone, and illumination inten...

#research #paper #ai #computer-vision
3 weeks ago · ai · - · -

[Paper] Affective Music Recommendation: A Rollout-Based World Model for Offline Preference Optimization

Functional music applications, from consumer focus and sleep aids to clinical interventions, share a distinctive recommendation problem: success is defined by t...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] AREA: Attribute Extraction and Aggregation for CLIP-Based Class-Incremental Learning

Class-Incremental Learning (CIL) is important in building real-world learning systems. In CLIP-based CIL, the model performs classification by comparing similar...

#research #paper #ai #machine-learning #computer-vision
3 weeks ago · ai · - · -

[Paper] Calibrating Conservatism for Scalable Oversight

Agentic AI systems capable of autonomous planning and extended environmental interaction pose a fundamental control problem: how can humans maintain meaningful ...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Personal Visual Memory from Explicit and Implicit Evidence

Long-term memory is increasingly important for personalized AI agents, yet existing benchmarks and methods remain largely text-centric. Even when images are inc...

#research #paper #ai #nlp #computer-vision
3 weeks ago · ai · - · -

[Paper] OmniVerifier-M1: Multimodal Meta-Verifier with Explicit Structured Recalibration

Visual outcomes are increasingly central to multimodal large language models, making reliable and fine-grained verification essential for scaling generalist fou...

#research #paper #ai #machine-learning #nlp #computer-vision
3 weeks ago · ai · - · -

[Paper] Ω-QVLA: Robust Quantization for Vision-Language-Action Models via Composite Rotation and Per-step Scaling

Vision-Language-Action (VLA) models unify perception, reasoning, and control within a single policy, yet their multi-billion-parameter backbones and diffusion-b...

#research #paper #ai #machine-learning #computer-vision
3 weeks ago · ai · - · -

[Paper] Human Label Variation as Stable Signal: Learning Annotator-Specific Explanation Behavior via Cross-Annotator Preference Optimization

Free-text explanations extend human label variation (HLV) beyond label disagreement by revealing the reasoning and preferences behind annotators' decisions. We ...

#research #paper #ai #nlp
3 weeks ago · ai · - · -

[Paper] CaMBRAIN: Real-time, Continuous EEG Inference with Causal State Space Models

Electroencephalography (EEG) is a critical, non-invasive method to monitor electrical brain activity. EEGs can span anywhere from a couple seconds to multiple h...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Skill-Conditioned Gated Self-Distillation for LLM Reasoning

On-policy self-distillation (SD) improves LLM reasoning by using teacher-side privileged information (PI) to turn sparse verifier outcomes into dense token-leve...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] Do Agents Need Semantic Metadata? A Comparative Study in Agentic Data Retrieval

In the era of autonomous agents, machine-actionable data is critical for data-driven workflows. For more than a decade, semantic metadata like schema.org has an...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Can Large Language Models Handle Discourse Particles? A Case Study of Colloquial Malay

Discourse particles, such as well and kind of, are crucial components that enable LLMs to ``speak'' more like humans. They are used to convey emotions, intentio...

#research #paper #ai #nlp
3 weeks ago · ai · - · -

[Paper] Bias Leaves a Gradient Trail: Label-Free Bias Identification via Gradient Probes on Concept Decompositions

Vision classifiers can exploit spurious correlations, achieving high in-distribution accuracy yet failing under distribution shift. Existing approaches to bias ...

#research #paper #ai #machine-learning #computer-vision
3 weeks ago · ai · - · -

[Paper] The Abstraction Gap in Vision-Language Causal Reasoning

Vision-language models (VLMs) generate fluent causal explanations, but current evaluations cannot distinguish linguistic plausibility from faithful causal reaso...

#research #paper #ai #nlp #computer-vision
3 weeks ago · ai · - · -

[Paper] Can LLMs Use Linguistic Uncertainty Markers to Reliably Reflect Intrinsic Confidence?

LLMs' linguistically expressed confidence should faithfully reflect their intrinsic uncertainty. While recent work shows LLMs struggle to use epistemic markers ...

#research #paper #ai #nlp
3 weeks ago · ai · - · -

[Paper] Learn from Weaknesses: Automated Domain Specialization for Small Computer-Use Agents

Computer-use agents (CUAs) have recently made substantial progress, but deploying a separate large expert for each software domain remains expensive. Small open...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] Rethinking Memory as Continuously Evolving Connectivity

Existing memory-augmented LLM agents often treat memory as a static repository with pre-defined representations and fixed retrieval pipelines, which is brittle ...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] SwarmHarness: Skill-Based Task Routing via Decentralized Incentive-Aligned AI Agent Networks

Vast quantities of compute (GPU cycles on personal workstations, idle inference servers, and edge devices between jobs) go unused because no incentive-aligned p...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] CubePart: An Open-Vocabulary Part-Controllable 3D Generator

Interactive 3D assets used in games and simulation are typically decomposed into specific semantic parts to support animation, physics, and scripted behaviors, ...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Preference-Shaped Expected Hypervolume and R2 Improvement: Exact Computation and Monotonicity

This paper studies preference-shaped expected improvement criteria for Bayesian multiobjective optimization. We consider two indicator families which are often ...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Preference-Shaped Expected Hypervolume and R2 Improvement: Exact Computation and Monotonicity

This paper studies preference-shaped expected improvement criteria for Bayesian multiobjective optimization. We consider two indicator families which are often ...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Self-Prophetic Decoding to Unlock Visual Search in LVLMs

Large Vision-Language Models (LVLMs) are rapidly evolving toward true multimodal reasoning, with visual search representing a concrete instantiation of the thin...

#research #paper #ai #computer-vision

Newer posts

Older posts