Source

arXiv

1659 posts from this source

Sort:

2 weeks ago · ai · - · -

[Paper] Linear Ordering Problem: Time for a Change

The Linear Ordering Problem (LOP) is a fundamental combinatorial optimization problem with important applications in areas such as economics, social choice, and...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] GP-GOMEA with GPU-Based Fitness Evaluations: Design and Performance Analysis

GP-GOMEA is a state-of-the-art evolutionary algorithm for symbolic regression, known for discovering small and interpretable models. However, its computational ...

#research #paper #ai
2 weeks ago · ai · - · -

[Paper] Automating Formal Verification with Reinforcement Learning and Recursive Inference

Automated formal verification remains challenging for large language models because data for proof assistants and verification-aware languages is scarce, and co...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] BlueFin: Benchmarking LLM Agents on Financial Spreadsheets

We present BlueFin, a benchmark that tasks large language model (LLM) agents with synthesis, manipulation, and comprehension tasks over spreadsheet workbooks in...

#research #paper #ai #machine-learning #nlp
2 weeks ago · ai · - · -

[Paper] Federated Variational Preference Alignment with Gumbel-Softmax Prior for Personalized User Preferences

Federated Learning (FL) offers a privacy-preserving pathway for aligning Large Language Models (LLMs); however, existing frameworks typically enforce a monolith...

#research #paper #ai #machine-learning
2 weeks ago · software · - · -

[Paper] What Breaks When LLMs Code? Characterizing Operational Safety Failures of Agentic Code Assistants

Autonomous coding agents built on large language models (LLMs) are rapidly being integrated into development workflows, yet their operational safety properties ...

#research #paper #software
2 weeks ago · ai · - · -

[Paper] Reducing the GPU Memory Bottleneck with Lossless Compression for ML -- Extended

Machine learning (ML) training and inference often process data sets far exceeding GPU memory capacity, forcing them to rely on PCIe for on-demand tensor transf...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Agnosiophobia in a virtual agent: behavioral and dynamical architecture in Lenia

All embodied agents are fundamentally patterns in physiological or other excitable media, blurring the distinction between objects and processes. Emergent patte...

#research #paper #ai
2 weeks ago · software · - · -

[Paper] FASR: Automated Identification of Unsafe Control Actions in STPA

The System-Theoretic Process Analysis (STPA) is a well-established hazard analysis technique that has been applied to a wide range of safety-critical systems. D...

#research #paper #software
2 weeks ago · ai · - · -

[Paper] Memory-Bound but Not Bandwidth-Limited: The Physical AI Inference Gap in Batch-1 LLM Decode

Physical AI systems, including robots, autonomous vehicles, embodied agents and edge copilots, often run a different inference workload from cloud LLM serving: ...

#research #paper #ai #machine-learning
2 weeks ago · devops · - · -

[Paper] Energy-Efficient Aggregation and Minimum-Degree Spanning Trees in Radio Networks

We study the aggregation problem in synchronous multi-hop radio networks with O(log n)-bit messages and no collision detection. Each node initially holds a valu...

#research #paper #devops
2 weeks ago · devops · - · -

[Paper] Scheduling Mechanisms in Wireless Sensor-Actuator Networks for Multi-rate Periodic Control in Industry 4.0

This paper investigates scheduling strategies for wireless sensor-actuator networks (WSANs) in Industry 4.0 scenarios. In particular, we address the problem of ...

#research #paper #devops
2 weeks ago · devops · - · -

[Paper] A Virtual Processor brings back the Free Lunch

This work introduces a self-optimizing virtual processor (VP) for numerical array programs that shifts parallelization from a manual developer task to a coopera...

#research #paper #devops
2 weeks ago · ai · - · -

[Paper] Physics Is All You Need? A Case Study in Physicist-Supervised AI Development of Scientific Software

Are AI agents tools, co-authors, or researchers? We present a quantified case study (N=1): a physicist supervising an AI coding agent (Claude Code, Sonnet and O...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] GMOS: Grounding Moving Object Segmentation in 3D Space and Time

Moving Object Segmentation (MOS) aims to discover, segment, and track objects that move independently of the camera. Current MOS methods, however, exhibit two f...

#research #paper #ai #computer-vision
2 weeks ago · ai · - · -

[Paper] VideoMLA: Low-Rank Latent KV Cache for Minute-Scale Autoregressive Video Diffusion

Long-rollout causal video diffusion has converged on a fixed-size sliding-window KV cache, with recent progress innovating within this layout by changing which ...

#research #paper #ai #machine-learning #computer-vision
2 weeks ago · ai · - · -

[Paper] AdaState: Self-Evolving Anchors for Streaming Video Generation

Autoregressive video diffusion models generate streaming video by producing frames sequentially, conditioning each chunk on previously generated content. These ...

#research #paper #ai #computer-vision
2 weeks ago · ai · - · -

[Paper] DynaFLIP: Rethinking Robotics Perception via Tri-Modal-Dynamics Guided Representation

Robot manipulation critically depends on perception that preserves the action-relevant aspects of a scene. Yet most robot learning pipelines are built upon visu...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] LLMSurgeon: Diagnosing Data Mixture of Large Language Models

The pretraining data mixture of Large Language Models (LLMs) constitutes their 'digital DNA', shaping model behaviors, capabilities, and failure modes. Yet this...

#research #paper #ai #machine-learning #nlp
2 weeks ago · ai · - · -

[Paper] NeuROK: Generative 4D Neural Object Kinematics

Data-driven approaches have revolutionized 3D vision, enabling transformers to effectively reconstruct and generate static 3D objects. However, generating simul...

#research #paper #ai #computer-vision
2 weeks ago · ai · - · -

[Paper] YoCausal: How Far is Video Generation from World Model? A Causality Perspective

As video diffusion models (VDMs) advance toward world models, a key question arises: do they truly understand causality, or merely overfit to statistical tempor...

#research #paper #ai #computer-vision
2 weeks ago · ai · - · -

[Paper] SchGen: PCB Schematic Generation with Semantic-Grounded Code Representations

Printed circuit board (PCB) schematic design defines nearly all electronic hardware, but it remains manual and expertise-intensive. While generative AI has adva...

#research #paper #ai #machine-learning #nlp
2 weeks ago · ai · - · -

[Paper] Tiny but Trusted: Efficient Vision-Language Reasoning for Time-Series Anomaly Detection

Recent advances in Vision-Language Models (VLMs) have achieved impressive performance across many tasks, yet prior studies report unsatisfactory performance whe...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Unlocking the Working Memory of Large Language Models for Latent Reasoning

To improve the reasoning capabilities of large language models, test-time compute is typically scaled by generating intermediate tokens before the final answer....

#research #paper #ai #machine-learning #nlp
2 weeks ago · ai · - · -

[Paper] Uncertainty-driven 3D Gaussian Splatting Active Mapping via Anisotropic Visibility Field

We present Gaussian Splatting Anisotropic Visibility Field (GAVIS), a novel framework for uncertainty quantification and active mapping in 3DGS. Our key insight...

#research #paper #ai #computer-vision
2 weeks ago · ai · - · -

[Paper] GPIC: A Giant Permissive Image Corpus for Visual Generation

Studying scalable methods for visual generative modeling requires large, accessible, and stable datasets. We introduce GPIC, a Giant Permissive Image Corpus of ...

#research #paper #ai #machine-learning #computer-vision
2 weeks ago · ai · - · -

[Paper] Benchmarking Single-Factor Physical Video-to-Audio Generation

Generative video-to-audio (V2A) models produce highly plausible soundtracks, but it remains unclear whether they capture the underlying physical processes. Exis...

#research #paper #ai #computer-vision
2 weeks ago · ai · - · -

[Paper] Efficient Test-Time Finetuning of LLMs via Convex Reconstruction and Gradient Caching

Test-time finetuning (TTFT) is a rapidly evolving paradigm that adapts a language model to each prompt by retrieving related sequences, updating the model on th...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] REST3D: Reconstructing Physically Stable 3D Scenes from a Single Image

Reconstructing physically stable 3D scenes from a single RGB image enables casual images to be converted into simulation-ready digital assets for applications s...

#research #paper #ai #computer-vision
2 weeks ago · ai · - · -

[Paper] Fairness-Aware Federated Learning with Trajectory Shapley Value

Federated learning is an emerging distributed paradigm that addresses the challenges posed by heterogeneous, privacy-sensitive data. It enables multiple clients...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Locally Coherent, Globally Incoherent: Bounding Compositional Incoherence in Multi-Component LLM Agents

Multi-component LLM agents assemble probabilistic claims from components that each see only part of a joint problem; the composition can violate basic probabili...

#research #paper #ai #machine-learning #nlp
2 weeks ago · ai · - · -

[Paper] Demystifying Data Organization for Enhanced LLM Training

Large Language Models (LLMs) have revolutionized various fields, yet their training efficiency is heavily reliant on effective data curation. While data selecti...

#research #paper #ai #machine-learning #nlp
2 weeks ago · ai · - · -

[Paper] COMPOSE: Composing Future Theorems from Citations and Formal Structure

A plausible future mathematical claim must satisfy two constraints: it should follow the direction of prior work and respect the formal dependencies that constr...

#research #paper #ai #nlp
2 weeks ago · ai · - · -

[Paper] Colored Noise Diffusion Sampling

Diffusion models achieve state-of-the-art image synthesis, with their generative trajectories fundamentally exhibiting a spectral bias, resolving low-frequency ...

#research #paper #ai #computer-vision
2 weeks ago · ai · - · -

[Paper] When, why, and how do diffusion posterior samplers fail? A finite-sample lens

Diffusion models have excellent capacity to model complex distributions of natural data, which has made them a popular and effective choice for posterior sampli...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] SoundnessBench: Can Your AI Scientist Really Tell Good Research Ideas from Bad Ones?

Autonomous AI research agents aim to accelerate scientific discovery by automating the research pipeline, from hypothesis generation to peer review. However, ex...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Reasoning with Sampling: Cutting at Decision Points

Frontier reasoning models are produced by posttraining base language models with reinforcement learning. Recent work has challenged this by showing that samplin...

#research #paper #ai #machine-learning #nlp
2 weeks ago · ai · - · -

[Paper] On Language Generation in the Limit with Bounded Memory

We study language generation in the limit under bounded memory. In this task, a learner observes examples from an unknown target language one at a time and must...

#research #paper #ai #machine-learning #nlp
2 weeks ago · ai · - · -

[Paper] In-Context Reward Adaptation for Robust Preference Modeling

Reinforcement Learning from Human Feedback (RLHF) typically relies on static reward models to align Large Language Models with human preferences. However, human...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Resolution Diagnostics for Paired LLM Evaluation

Across two public LLM leaderboards, many displayed pairwise rankings do not meet a conventional paired-test resolution target under the actual paired evaluation...

#research #paper #ai #machine-learning #nlp
2 weeks ago · ai · - · -

[Paper] MedCase-Structured: A Text-to-FHIR Dataset for Benchmarking Diagnostic Reasoning in Clinically Realistic EHR Settings

Large language models (LLMs) show promise for clinical reasoning and decision support, but evaluation in realistic, electronic health record-congruent settings ...

#research #paper #ai #machine-learning #nlp
2 weeks ago · devops · - · -

[Paper] RAFI -- A Ray/Work Forwarding Infrastructure for Data Parallel Multi-Node/Multi-GPU Computing

We present RaFI, a CUDA and MPI based software framework that simplifies the task of building GPU-enabled data-parallel software where rays or similar work item...

#research #paper #devops
2 weeks ago · ai · - · -

[Paper] Automating Low-Risk Code Review at Meta: RADAR, Risk Calibration, and Review Efficiency

AI-assisted coding tools have altered software production. At Meta, significant lines of code per human-landed diff grew by 105.9% year over year and per-develo...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Deep Binarized Photonic Reservoir Computing for Ultrafast Multimedia Signal Processing

We present a deep photonic neural network architecture based on ultrafast binary optical modulation from a digital micro-mirror device (DMD), optical scattering...

#research #paper #ai
2 weeks ago · ai · - · -

[Paper] Evolving Features vs Evolving Entire Trees with GP for Interpretable Survival Analysis

Survival analysis concerns the task of predicting the time until an event occurs. Often used in the medical field, survival analysis deals with incomplete (i.e....

#research #paper #ai #machine-learning
2 weeks ago · software · - · -

[Paper] EvoRepair: Enhancing Vulnerability Repair Agents Through Experience-Based Self-Evolution

Large Language Models (LLMs) have shown promise for automated vulnerability repair (AVR), but they still face several limitations, including the lack of intra-v...

#research #paper #software
2 weeks ago · ai · - · -

[Paper] Q-ANCHOR: Federated Quantum Learning with ZNE-guided Correction

Quantum Federated Learning (QFL) offers a promising framework to train quantum models across distributed clients while keeping data strictly local. Due to its s...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Projectional Decoding: Towards Semantic-Aware LLM Generation

Large language models (LLMs) are increasingly used to generate software artifacts across many software engineering (SE) tasks, yet ensuring the semantic validit...

#research #paper #ai #machine-learning

Newer posts

Older posts