Source

arXiv

5752 posts from this source

3 months ago · devops · - · -

[Paper] Pending Conflicts Make Progress Impossible

In this work, we study progress conditions for commutativity-aware, linearizable implementations of shared objects. Motivated by the observation that commuting ...

#research #paper #devops
3 months ago · ai · - · -

[Paper] Statistical Guarantees for Reasoning Probes on Looped Boolean Circuits

We study the statistical behaviour of reasoning probes in a stylized model of looped reasoning, given by Boolean circuits whose computational graph is a perfect...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Non-linear PCA via Evolution Strategies: a Novel Objective Function

Principal Component Analysis (PCA) is a powerful and popular dimensionality reduction technique. However, due to its linear nature, it often fails to capture th...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] EventNeuS: 3D Mesh Reconstruction from a Single Event Camera

Event cameras offer a considerable alternative to RGB cameras in many scenarios. While there are recent works on event-based novel-view synthesis, dense 3D mesh...

#research #paper #ai #computer-vision
3 months ago · ai · - · -

[Paper] PLATE: Plasticity-Tunable Efficient Adapters for Geometry-Aware Continual Learning

We develop a continual learning method for pretrained models that requires no access to old-task data, addressing a practical barrier in foundation model adapta...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Parallel-Probe: Towards Efficient Parallel Thinking via 2D Probing

Parallel thinking has emerged as a promising paradigm for reasoning, yet it imposes significant computational burdens. Existing efficiency methods primarily rel...

#research #paper #ai #nlp
3 months ago · ai · - · -

[Paper] Investigating Quantum Circuit Designs Using Neuro-Evolution

Designing effective quantum circuits remains a central challenge in quantum computing, as circuit structure strongly influences expressivity, trainability, and ...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Understanding and Exploiting Weight Update Sparsity for Communication-Efficient Distributed RL

Reinforcement learning (RL) is a critical component for post-training large language models (LLMs). However, in bandwidth-constrained distributed RL, scalabilit...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] PrevizWhiz: Combining Rough 3D Scenes and 2D Video to Guide Generative Video Previsualization

In pre-production, filmmakers and 3D animation experts must rapidly prototype ideas to explore a film's possibilities before full-scale production, yet conventio...

#research #paper #ai #machine-learning #computer-vision
3 months ago · ai · - · -

[Paper] Accelerating Scientific Research with Gemini: Case Studies and Common Techniques

Recent advances in large language models (LLMs) have opened new avenues for accelerating scientific research. While models are increasingly capable of assisting...

#large-language-models #Gemini #prompt-engineering #neuro-symbolic #scientific-research
3 months ago · ai · - · -

[Paper] AutoFigure: Generating and Refining Publication-Ready Scientific Illustrations

High-quality scientific illustrations are crucial for effectively communicating complex scientific and technical concepts, yet their manual creation remains a w...

#research #paper #ai #machine-learning #nlp #computer-vision
3 months ago · ai · - · -

[Paper] Continuous Control of Editing Models via Adaptive-Origin Guidance

Diffusion-based editing models have emerged as a powerful tool for semantic image and video manipulation. However, existing models lack a mechanism for smoothly...

#research #paper #ai #computer-vision
3 months ago · ai · - · -

[Paper] Robust Intervention Learning from Emergency Stop Interventions

Human interventions are a common source of data in autonomous systems during testing. These interventions provide an important signal about where the current po...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Deep-learning-based pan-phenomic data reveals the explosive evolution of avian visual disparity

The evolution of biological morphology is critical for understanding the diversity of the natural world, yet traditional analyses often involve subjective biase...

#research #paper #ai #computer-vision
3 months ago · ai · - · -

[Paper] Preference-based Conditional Treatment Effects and Policy Learning

We introduce a new preference-based framework for conditional treatment effect estimation and policy learning, built on the Conditional Preference-based Treatme...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] They Said Memes Were Harmless-We Found the Ones That Hurt: Decoding Jokes, Symbols, and Cultural References

Meme-based social abuse detection is challenging because harmful intent often relies on implicit cultural symbolism and subtle cross-modal incongruence. Prior a...

#research #paper #ai #nlp
3 months ago · ai · - · -

[Paper] Adaptive Evidence Weighting for Audio-Spatiotemporal Fusion

Many machine learning systems have access to multiple sources of evidence for the same prediction target, yet these sources often differ in reliability and info...

#audio classification #multimodal fusion #gating network #bioacoustics #adaptive weighting
3 months ago · ai · - · -

[Paper] SymPlex: A Structure-Aware Transformer for Symbolic PDE Solving

We propose SymPlex, a reinforcement learning framework for discovering analytical symbolic solutions to partial differential equations (PDEs) without access to ...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Fast-Slow Efficient Training for Multimodal Large Language Models via Visual Token Pruning

Multimodal Large Language Models (MLLMs) suffer from severe training inefficiency, which is associated with their massive model sizes and visual token num...

#research #paper #ai #machine-learning #computer-vision
3 months ago · ai · - · -

[Paper] Conformal Thinking: Risk Control for Reasoning on a Compute Budget

Reasoning Large Language Models (LLMs) enable test-time scaling, with dataset-level accuracy improving as the token budget increases, motivating adaptive reason...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Antidistillation Fingerprinting

Model distillation enables efficient emulation of frontier large language models (LLMs), creating a need for robust mechanisms to detect when a third-party stud...

#LLM fingerprinting #model watermarking #distillation security #machine learning research
3 months ago · ai · - · -

[Paper] Progressive Checkerboards for Autoregressive Multiscale Image Generation

A key challenge in autoregressive image generation is to efficiently sample independent locations in parallel, while still modeling mutual dependencies with ser...

#research #paper #ai #computer-vision
3 months ago · ai · - · -

[Paper] Enhancing Imbalanced Node Classification via Curriculum-Guided Feature Learning and Three-Stage Attention Network

Imbalanced node classification in graph neural networks (GNNs) happens when some labels are much more common than others, which causes the model to learn unfair...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Bridging Online and Offline RL: Contextual Bandit Learning for Multi-Turn Code Generation

Recently, there has been significant research interest in training large language models (LLMs) with reinforcement learning (RL) on real-world tasks, such as ...

#research #paper #ai #machine-learning #nlp
3 months ago · ai · - · -

[Paper] Do We Need Asynchronous SGD? On the Near-Optimality of Synchronous Methods

Modern distributed optimization methods mostly rely on traditional synchronous approaches, despite substantial recent progress in asynchronous optimization. We ...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] FullStack-Agent: Enhancing Agentic Full-Stack Web Coding via Development-Oriented Testing and Repository Back-Translation

Assisting non-expert users to develop complex interactive websites has become a popular task for LLM-powered code agents. However, existing code agents tend to ...

#research #paper #ai #nlp #computer-vision
3 months ago · ai · - · -

[Paper] 3D-Aware Implicit Motion Control for View-Adaptive Human Video Generation

Existing methods for human motion control in video generation typically rely on either 2D poses or explicit 3D parametric models (e.g., SMPL) as control signals...

#video generation #motion representation #computer vision #3D modeling #implicit neural representations
3 months ago · ai · - · -

[Paper] BridgeV2W: Bridging Video Generation Models to Embodied World Models via Embodiment Masks

Embodied world models have emerged as a promising paradigm in robotics, most of which leverage large-scale Internet videos or pretrained video generation models...

#video generation #embodied AI #robotics #ControlNet #diffusion models
3 months ago · ai · - · -

[Paper] WebSentinel: Detecting and Localizing Prompt Injection Attacks for Web Agents

Prompt injection attacks manipulate webpage content to cause web agents to execute attacker-specified tasks instead of the user's intended ones. Existing method...

#research #paper #ai #machine-learning #nlp
3 months ago · ai · - · -

[Paper] AOrchestra: Automating Sub-Agent Creation for Agentic Orchestration

Language agents have shown strong promise for task automation. Realizing this promise for increasingly complex, long-horizon tasks has driven the rise of a sub-...

#agentic orchestration #sub-agent automation #LLM multi-agent systems #AI research #dynamic tool selection
3 months ago · ai · - · -

[Paper] Context Compression via Explicit Information Transmission

Long-context inference with Large Language Models (LLMs) is costly due to quadratic attention and growing key-value caches, motivating context compression. In t...

#research #paper #ai #nlp
3 months ago · software · - · -

[Paper] From Separate Compilation to Sound Language Composition

The development of programming languages involves complex theoretical and practical challenges, particularly when addressing modularity and reusability through ...

#research #paper #software
3 months ago · ai · - · -

[Paper] FOVI: A biologically-inspired foveated interface for deep vision models

Human vision is foveated, with variable resolution peaking at the center of a large field of view; this reflects an efficient trade-off for active sensing, allo...

#research #paper #ai #computer-vision
3 months ago · software · - · -

[Paper] Improving Deep Learning Library Testing with Machine Learning

Deep Learning (DL) libraries like TensorFlow and PyTorch simplify machine learning (ML) model development but are prone to bugs due to their complex design. Bug...

#research #paper #software
3 months ago · software · - · -

[Paper] SWE-Refactor: A Repository-Level Benchmark for Real-World LLM-Based Code Refactoring

Large Language Models (LLMs) have recently attracted wide interest for tackling software engineering tasks. In contrast to code generation, refactoring demands ...

#research #paper #software
3 months ago · ai · - · -

[Paper] Improved Analysis of the Accelerated Noisy Power Method with Applications to Decentralized PCA

We analyze the Accelerated Noisy Power Method, an algorithm for Principal Component Analysis in the setting where only inexact matrix-vector products are availa...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Equilibrium Propagation for Non-Conservative Systems

Equilibrium Propagation (EP) is a physics-inspired learning algorithm that uses stationary states of a dynamical system both for inference and learning. In its ...

#research #paper #ai #machine-learning
3 months ago · software · - · -

[Paper] CALM: A Self-Adaptive Orchestration Approach for QoS-Aware Routing in Small Language Model based Systems

AI-enabled systems are subject to various runtime uncertainties, including dynamic workloads, resource requirements, and model drift. These uncer...

#research #paper #software
3 months ago · ai · - · -

[Paper] Beyond the Commit: Developer Perspectives on Productivity with AI Coding Assistants

Measuring developer productivity is a topic that has attracted attention from both academic research and industrial practice. In the age of AI coding assistants...

#AI coding assistants #developer productivity #empirical study #GitHub Copilot #software engineering
3 months ago · software · - · -

[Paper] Causal Inference for the Effect of Code Coverage on Bug Introduction

Context: Code coverage is widely used as a software quality assurance measure. However, its effect, and specifically the advisable dose, are disputed in both th...

#research #paper #software
3 months ago · software · - · -

[Paper] Scaling Test-Driven Code Generation from Functions to Classes: An Empirical Study

Test-driven development (TDD) has been adopted to improve Large Language Model (LLM)-based code generation by using tests as executable specifications. However,...

#research #paper #software
3 months ago · software · - · -

[Paper] Flaky Tests in a Large Industrial Database Management System: An Empirical Study of Fixed Issue Reports for SAP HANA

Flaky tests yield different results when executed multiple times for the same version of the source code. Thus, they provide an ambiguous signal about the quali...

#research #paper #software
3 months ago · ai · - · -

[Paper] Mitigating Staleness in Asynchronous Pipeline Parallelism via Basis Rotation

Asynchronous pipeline parallelism maximizes hardware utilization by eliminating the pipeline bubbles inherent in synchronous execution, offering a path toward e...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] DALI: A Workload-Aware Offloading Framework for Efficient MoE Inference on Local PCs

Mixture of Experts (MoE) architectures significantly enhance the capacity of LLMs without proportional increases in computation, but at the cost of a vast param...

#research #paper #ai #machine-learning
3 months ago · devops · - · -

[Paper] Recursive Energy Efficient Agreement

Agreement is a foundational problem in distributed computing that has been studied extensively for over four decades. Recently, Meir, Mirault, Peleg and Robins...

#research #paper #devops
3 months ago · devops · - · -

[Paper] Exploiting Multi-Core Parallelism in Blockchain Validation and Construction

Blockchain validators can reduce block processing time by exploiting multi-core CPUs, but deterministic execution must preserve a given total order while respec...

#research #paper #devops
3 months ago · ai · - · -

[Paper] Dynamic Topology Optimization for Non-IID Data in Decentralized Learning

Decentralized learning (DL) enables a set of nodes to train a model collaboratively without central coordination, offering benefits for privacy and scalability....

#research #paper #ai #machine-learning
3 months ago · devops · - · -

[Paper] Joint Network-and-Server Congestion in Multi-Source Traffic Allocation: A Convex Formulation and Price-Based Decentralization

This paper studies an important rate allocation problem that arises in many networked and distributed systems: steady-state traffic rate allocation from multipl...

#research #paper #devops
