Source

arXiv

1659 posts from this source

Sort:

3 weeks ago · ai · - · -

[Paper] Probabilistic Smoothing with Ratio-Monotone Transforms for Global Optimization

Probabilistic smoothing is a standard tool for global optimization, but existing methods rely on Gaussian kernels and specific transforms, often resulting in st...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Real Images, Worse Judgments: Evaluating Vision-Language Models on Concreteness and Imagery

Visual inputs are often assumed to improve language understanding in multimodal models. We examine this assumption by asking whether vision-language models (VLM...

#research #paper #ai #nlp
3 weeks ago · ai · - · -

[Paper] When Does Demographic Information Help? Data and Modeling Regimes for Perspective-Aware Hate Speech Detection

Demographic information is often used to model annotator perspectives in subjective tasks such as hate speech detection, but its benefit is inconsistent: it imp...

#research #paper #ai #nlp
3 weeks ago · ai · - · -

[Paper] Chartographer: Counterfactual Chart Generation for Evaluating Vision-Language Models

Chart question-answering (QA) benchmarks aim to pose questions that require visual reasoning to correctly answer, but models can often reach solutions through s...

#research #paper #ai #nlp #computer-vision
3 weeks ago · ai · - · -

[Paper] Greening AI Inference with Accuracy and Latency-aware User Incentives

The widespread use of AI services has raised concerns for its environmental sustainability, towards which recent studies have identified carbon emissions of AI ...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Normal Guidance is what Attention Needs

We consider training classifiers for 3D medical images using only one binary label for the entire volume rather than a label for each 2D slice. In such weakly s...

#research #paper #ai #machine-learning
3 weeks ago · software · - · -

[Paper] EviACT: An Evidence-to-Action Framework for Agentic Program Repair

LLM-based agents have moved automated program repair (APR) from fixed-context patch generation to interactive repository-level repair. However, existing agentic...

#research #paper #software
3 weeks ago · software · - · -

[Paper] ProDebug: An Automated Debugging System for Prolog

Prolog is a well-known declarative programming language commonly used in introductory courses on logic and reasoning. However, many students find Prolog challen...

#research #paper #software
3 weeks ago · devops · - · -

[Paper] Autonomic Federated-Market Orchestration for the Edge-Cloud Continuum

The edge-cloud computing continuum demands self-management mechanisms that scale across autonomous administrative domains while honouring tenant- and operator-s...

#research #paper #devops
3 weeks ago · ai · - · -

[Paper] ReMoE: Boosting Expert Reuse through Router Fine-Tuning in Memory-Constrained MoE LLM Inference

Fine-grained Mixture-of-Experts (MoE) models sparsely activate only a subset of experts per token, reducing activated computation while maintaining high model c...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] ConVer: Using Contracts and Loop Invariant Synthesis for Scalable Formal Software Verification

Formal verification of large C programs is impeded by state-space explosion: Bounded Model Checking (BMC) tools must encode the entire state space up to the pre...

#research #paper #ai #machine-learning
3 weeks ago · devops · - · -

[Paper] Nonlinear spectral clustering with C++ GraphBLAS

Nonlinear reformulations of the spectral clustering method have gained a lot of recent attention due to their increased numerical benefits and their solid mathe...

#research #paper #devops
3 weeks ago · ai · - · -

[Paper] Signal-to-Noise Ratio and Sample Size Govern Representational Alignment in Neural Networks

Neural networks are known to develop latent representations that are aligned, namely structurally similar across networks trained with different architectures, ...

#research #paper #ai #machine-learning
3 weeks ago · devops · - · -

[Paper] Extreme-Scale Interconnection Networks

Extreme-scale data centers are the backbone of next-generation computing, enabling breakthroughs in science, artificial intelligence, and global innovation thro...

#research #paper #devops
3 weeks ago · ai · - · -

[Paper] Neuro-Symbolic Verification of LLM Outputs for Data-Sensitive Domains (extended preprint)

LLMs deployed in high-stakes domains face fundamental reliability challenges: hallucinations, inconsistencies, and privacy vulnerabilities introduce unacceptabl...

#research #paper #ai #machine-learning
3 weeks ago · devops · - · -

[Paper] Revisiting Bruck: Phase-Efficient All-to-All Communication in Reconfigurable Networks

All-to-All communication is a key performance bottleneck for distributed machine learning (ML) and high-performance computing (HPC) workloads, where dense traff...

#research #paper #devops
3 weeks ago · ai · - · -

[Paper] Strategies for Guiding LLMs to Use Software Design Patterns: A Case of Singleton

Large Language Models (LLMs) can generate functional source code from natural-language prompts, but often fail to consistently follow higher-level architectural...

#research #paper #ai #machine-learning
3 weeks ago · software · - · -

[Paper] LLM-based Mockless Unit Test Generation for Java

Large language models (LLMs) have shown strong potential for automated test generation, yet most approaches to generating Java unit tests still rely on mocking ...

#research #paper #software
3 weeks ago · software · - · -

[Paper] On the GitHub Actions Language: Usage, Evolution, and Workflow Reliability

Developers often struggle with maintaining GitHub Actions workflow configurations in GitHub-hosted repositories, with recent studies showing frequent execution ...

#research #paper #software
3 weeks ago · ai · - · -

[Paper] HTMLCure: Turning Browser Experience into State Guided Repair for Interactive HTML

LLMs can now produce full HTML pages, but many of those pages are only superficially correct: they render once, then fail under scroll, hover, click, resize, or...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Evolutionary Data Theory: On the Similarities between Data Problems and Evolutionary Games

Applying the concepts and formalisms from Evolutionary Game Theory to the data regime, the fundamental paradigms of Evolutionary Data Theory are introduced. Int...

#research #paper #ai
3 weeks ago · devops · - · -

[Paper] RT-RkNN: Reverse k Nearest Neighbor Queries as a Graphics Ray Casting Problem

Reverse k nearest neighbor (RkNN) queries are fundamental in spatial databases, location-based analytics, and recommendation systems. Existing state-of-the-art ...

#research #paper #devops
3 weeks ago · ai · - · -

[Paper] Why Prompt Optimization Works, and Why It Sometimes Doesn't: A Causal-Inspired Edit-Level Analysis

Automated prompt optimization methods (e.g., DSpy, TextGrad) can substantially improve the performance of large language model (LLM), however, their generalizat...

#research #paper #ai #machine-learning #nlp
3 weeks ago · devops · - · -

[Paper] Credibility Trilemma in Polymatroidal Service Markets

Mechanism-mediated service markets with polymatroidal feasibility admit efficient, dominant-strategy incentive-compatible (DSIC) allocation, but these guarantee...

#research #paper #devops
3 weeks ago · devops · - · -

[Paper] Reducing Internal State in Eigenvalue-Only Divide-and-Conquer Tridiagonal Eigensolvers

Divide and Conquer (D&C) is a widely used algorithmic strategy for symmetric eigenvalue decomposition. Its natural parallelism makes D&C attractive on m...

#research #paper #devops
3 weeks ago · devops · - · -

[Paper] A Formal Semantics of C with OpenMP Parallelism

OpenMP is a popular parallelization framework that lets users transform sequential code into parallel code with a few simple annotations. Unfortunately, it is a...

#research #paper #devops
3 weeks ago · ai · - · -

[Paper] StreamSplit: Continuous Audio Representation Learning via Uncertainty-Guided Adaptive Splitting

Large-batch Contrastive Learning (CL), the foundation of modern representation learning, is fundamentally incompatible with the volatile resource constraints of...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Constitutional Arms Races in the Public Goods Game: Co-Evolving LLM Constitutions Under Cooperation-Defection Pressure

Frontier LLM agents engage in blackmail, sabotage, and document leaks under goal conflicts in agentic settings, exposing limitations of alignment methods built ...

#research #paper #ai
3 weeks ago · ai · - · -

[Paper] Unified Neural Scaling Laws

We present a functional form (that we refer to as a Unified Neural Scaling Law (UNSL)) that accurately models and extrapolates the scaling behaviors of deep neu...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] TriSplat: Simulation-Ready Feed-Forward 3D Scene Reconstruction

Sparse-view 3D reconstruction is increasingly addressed with feed-forward splatting networks that predict explicit primitives directly from images. Yet most exi...

#research #paper #ai #computer-vision
3 weeks ago · ai · - · -

[Paper] MobileGym: A Verifiable and Highly Parallel Simulation Platform for Mobile GUI Agent Research

We present MobileGym, a browser-hosted, lightweight, fully controllable environment for everyday mobile use, targeting interaction fidelity without replicating ...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] AnyScene: Towards Highly Controllable Driving Scene Generation at Anywhere and Beyond

Generating high-fidelity and controllable synthetic data is critical for advancing end-to-end autonomous driving, particularly for addressing the long tail of r...

#research #paper #ai #computer-vision
3 weeks ago · ai · - · -

[Paper] From Model Scaling to System Scaling: Scaling the Harness in Agentic AI

This paper studies the next major bottleneck in agentic AI as system scaling, not only model scaling: the design of auditable, persistent, modular, and verifiab...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Squeezing Capacity from Multimodal Large Language Models for Subject-driven Generation

Subject-driven image generation aims to synthesize new images that preserve the identity of the given subject while following textual instructions. Existing app...

#research #paper #ai #machine-learning #computer-vision
3 weeks ago · ai · - · -

[Paper] Prism: A Plug-in Reproducible Infrastructure for Scalable Multimodal Continual Instruction Tuning

Multimodal Large Language Models (MLLMs) achieve versatility by reformulating diverse tasks into a unified instruction-following framework via instruction tunin...

#research #paper #ai #machine-learning #nlp #computer-vision
3 weeks ago · ai · - · -

[Paper] Helix4D: Complex 4D Mesh Generation

Current video-to-4D methods struggle with complex topology changes, transparent materials, thin structures, and inner surfaces. We present Helix4D, a dynamic me...

#research #paper #ai #computer-vision
3 weeks ago · ai · - · -

[Paper] Reinforcing Few-step Generators via Reward-Tilted Distribution Matching

Recent advances in few-step diffusion distillation have enabled efficient image generation, yet aligning these models with human preferences remains challenging...

#research #paper #ai #computer-vision
3 weeks ago · ai · - · -

[Paper] Looped Diffusion Language Models

Masked diffusion models (MDMs) have emerged as a promising alternative to autoregressive models for language modeling, yet the effective design of transformer a...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] On-Policy Adversarial Flow Distillation for Autoregressive Video Generation

Autoregressive video generators are attractive for streaming, long-horizon, and interactive applications, but distilling strong black-box teachers into causal s...

#research #paper #ai #computer-vision
3 weeks ago · ai · - · -

[Paper] EVIDENT: Routing MLLM Adaptation through Entity-Grounded Visual Evidence for Cross-Domain Video Temporal Grounding

Fine-tuning MLLMs for Video Temporal Grounding (VTG) often improves in-domain performance but degrades sharply under domain shift. In this work, we find that th...

#research #paper #ai #computer-vision
3 weeks ago · ai · - · -

[Paper] Global Structure-from-Motion Meets Feedforward Reconstruction

Structure-from-Motion -- the process of simultaneously estimating camera poses and 3D scene structure from a collection of images -- remains a central challenge...

#research #paper #ai #computer-vision
3 weeks ago · ai · - · -

[Paper] InstructSAM: Segment Any Instance with Any Instructions

In this paper, we introduce InstructSAM, a unified and streamlined framework designed for multi-instance segmentation under arbitrary instructions. We formulate...

#research #paper #ai #computer-vision
3 weeks ago · ai · - · -

[Paper] Beyond Summaries: Structure-Aware Labeling of Code Changes with Large Language Models

Code review is a critical practice in software engineering, yet the growing scale and frequency of code patches in modern projects, together with the widespread...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Language Models Need Sleep

Transformer-based large language models are increasingly used for long-horizon tasks; however, their attention mechanism scales poorly with context length. To h...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] Forgetting in Language Models: Capacity, Optimization, and Self-Generated Replay

Models trained on a new task typically degrade on prior tasks, a phenomenon known as forgetting. Traditionally, mitigating forgetting has required replaying sto...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Goal-driven Bayesian Optimal Experimental Design for Robust Decision-Making Under Model Uncertainty

Bayesian optimal experimental design (BOED) selects experiments to maximize information gain about model parameters. However, in decision-critical settings, red...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] OrpQuant: Geometric Orthogonal Residual Projection for Multiplier-Free Power-of-Two Transformer Quantization

The deployment of Large Language Models (LLMs) and Vision Transformers (ViTs) on edge devices is significantly constrained by memory limitations and the critica...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Channel-wise Vector Quantization

We present Channel-wise Vector Quantization (CVQ), a novel image tokenization paradigm that replaces patch-wise tokens with channel-wise tokens. Unlike conventi...

#research #paper #ai #machine-learning #computer-vision

Newer posts

Older posts