Source

arXiv

5804 posts from this source

Sort:

3 months ago · ai · - · -

[Paper] UEval: A Benchmark for Unified Multimodal Generation

We introduce UEval, a benchmark to evaluate unified models, i.e., models capable of generating both images and text. UEval comprises 1,000 expert-curated questi...

#research #paper #ai #nlp #computer-vision
3 months ago · ai · - · -

[Paper] DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation

Manipulating dynamic objects remains an open challenge for Vision-Language-Action (VLA) models, which, despite strong generalization in static manipulation, str...

#research #paper #ai #computer-vision
3 months ago · ai · - · -

[Paper] Late Breaking Results: Conversion of Neural Networks into Logic Flows for Edge Computing

Neural networks have been successfully applied in various resource-constrained edge devices, where usually central processing units (CPUs) instead of graphics p...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Do VLMs Perceive or Recall? Probing Visual Perception vs. Memory with Classic Visual Illusions

Large Vision-Language Models (VLMs) often answer classic visual illusions 'correctly' on original images, yet persist with the same responses when illusion fact...

#research #paper #ai #computer-vision
3 months ago · ai · - · -

[Paper] DynaWeb: Model-Based Reinforcement Learning of Web Agents

The development of autonomous web agents, powered by Large Language Models (LLMs) and reinforcement learning (RL), represents a significant step towards general...

#research #paper #ai #machine-learning #nlp
3 months ago · ai · - · -

[Paper] FineInstructions: Scaling Synthetic Instructions to Pre-Training Scale

Due to limited supervised training data, large language models (LLMs) are typically pre-trained via a self-supervised 'predict the next word' objective on a vas...

#research #paper #ai #machine-learning #nlp
3 months ago · ai · - · -

[Paper] JUST-DUB-IT: Video Dubbing via Joint Audio-Visual Diffusion

Audio-Visual Foundation Models, which are pretrained to jointly generate sound and visual content, have recently shown an unprecedented ability to model multi-m...

#research #paper #ai #computer-vision
3 months ago · ai · - · -

[Paper] Routing the Lottery: Adaptive Subnetworks for Heterogeneous Data

In pruning, the Lottery Ticket Hypothesis posits that large networks contain sparse subnetworks, or winning tickets, that can be trained in isolation to match t...

#research #paper #ai #machine-learning #computer-vision
3 months ago · ai · - · -

[Paper] Reasoning While Asking: Transforming Reasoning Large Language Models from Passive Solvers to Proactive Inquirers

Reasoning-oriented Large Language Models (LLMs) have achieved remarkable progress with Chain-of-Thought (CoT) prompting, yet they remain fundamentally limited b...

#research #paper #ai #machine-learning #nlp
3 months ago · ai · - · -

[Paper] PRISM: Distribution-free Adaptive Computation of Matrix Functions for Accelerating Neural Network Training

Matrix functions such as square root, inverse roots, and orthogonalization play a central role in preconditioned gradient methods for neural network training. T...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] StepShield: When, Not Whether to Intervene on Rogue Agents

Existing agent safety benchmarks report binary accuracy, conflating early intervention with post-mortem analysis. A detector that flags a violation at step 8 en...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] PI-Light: Physics-Inspired Diffusion for Full-Image Relighting

Full-image relighting remains a challenging problem due to the difficulty of collecting large-scale structured paired data, the difficulty of maintaining physic...

#research #paper #ai #computer-vision
3 months ago · ai · - · -

[Paper] Early and Prediagnostic Detection of Pancreatic Cancer from Computed Tomography

Pancreatic ductal adenocarcinoma (PDAC), one of the deadliest solid malignancies, is often detected at a late and inoperable stage. Retrospective reviews of pre...

#research #paper #ai #computer-vision
3 months ago · ai · - · -

[Paper] Pay for Hints, Not Answers: LLM Shepherding for Cost-Efficient Inference

Large Language Models (LLMs) deliver state-of-the-art performance on complex reasoning tasks, but their inference costs limit deployment at scale. Small Languag...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] SMOG: Scalable Meta-Learning for Multi-Objective Bayesian Optimization

Multi-objective optimization aims to solve problems with competing objectives, often with only black-box access to a problem and a limited budget of measurement...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] World of Workflows: a Benchmark for Bringing World Models to Enterprise Systems

Frontier large language models (LLMs) excel as autonomous agents in many domains, yet they remain untested in complex enterprise systems where hidden workflows ...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] SWE-Replay: Efficient Test-Time Scaling for Software Engineering Agents

Test-time scaling has been widely adopted to enhance the capabilities of Large Language Model (LLM) agents in software engineering (SWE) tasks. However, the sta...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] EditYourself: Audio-Driven Generation and Manipulation of Talking Head Videos with Diffusion Transformers

Current generative video models excel at producing novel content from text and image prompts, but leave a critical gap in editing existing pre-recorded videos, ...

#research #paper #ai #machine-learning #computer-vision
3 months ago · ai · - · -

[Paper] Creative Image Generation with Diffusion Model

Creative image generation has emerged as a compelling area of research, driven by the need to produce novel and high-quality images that expand the boundaries o...

#research #paper #ai #computer-vision
3 months ago · ai · - · -

[Paper] A Federated and Parameter-Efficient Framework for Large Language Model Training in Medicine

Large language models (LLMs) have demonstrated strong performance on medical benchmarks, including question answering and diagnosis. To enable their use in clin...

#research #paper #ai #nlp
3 months ago · ai · - · -

[Paper] ECO: Quantized Training without Full-Precision Master Weights

Quantization has significantly improved the compute and memory efficiency of Large Language Model (LLM) training. However, existing approaches still rely on acc...

#research #paper #ai #machine-learning #nlp
3 months ago · ai · - · -

[Paper] Where Do the Joules Go? Diagnosing Inference Energy Consumption

Energy is now a critical ML computing resource. While measuring energy consumption and observing trends is a valuable first step, accurately understanding and d...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Lens-descriptor guided evolutionary algorithm for optimization of complex optical systems with glass choice

Designing high-performance optical lenses entails exploring a high-dimensional, tightly constrained space of surface curvatures, glass choices, element thicknes...

#research #paper #ai
3 months ago · ai · - · -

[Paper] When 'Better' Prompts Hurt: Evaluation-Driven Iteration for LLM Applications

Evaluating Large Language Model (LLM) applications differs from traditional software testing because outputs are stochastic, high-dimensional, and sensitive to ...

#research #paper #ai #machine-learning #nlp
3 months ago · ai · - · -

[Paper] Heterogeneous Computing: The Key to Powering the Future of AI Agent Inference

AI agent inference is driving an inference heavy datacenter future and exposes bottlenecks beyond compute - especially memory capacity, memory bandwidth and hig...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Liquid Interfaces: A Dynamic Ontology for the Interoperability of Autonomous Systems

Contemporary software architectures struggle to support autonomous agents whose reasoning is adaptive, probabilistic, and context-dependent, while system integr...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Learning Decentralized LLM Collaboration with Multi-Agent Actor Critic

Recent work has explored optimizing LLM collaboration through Multi-Agent Reinforcement Learning (MARL). However, most MARL fine-tuning approaches rely on prede...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] The Energy Impact of Domain Model Design in Classical Planning

AI research has traditionally prioritised algorithmic performance, such as optimising accuracy in machine learning or runtime in automated planning. The emergin...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Dependence of Equilibrium Propagation Training Success on Network Architecture

The rapid rise of artificial intelligence has led to an unsustainable growth in energy consumption. This has motivated progress in neuromorphic computing and ph...

#research #paper #ai #machine-learning
3 months ago · devops · - · -

[Paper] Belief Propagation Converges to Gaussian Distributions in Sparsely-Connected Factor Graphs

Belief Propagation (BP) is a powerful algorithm for distributed inference in probabilistic graphical models, however it quickly becomes infeasible for practical...

#research #paper #devops
3 months ago · ai · - · -

[Paper] Adaptive Surrogate-Based Strategy for Accelerating Convergence Speed when Solving Expensive Unconstrained Multi-Objective Optimisation Problems

Multi-Objective Evolutionary Algorithms (MOEAs) have proven effective at solving Multi-Objective Optimisation Problems (MOOPs). However, their performance can b...

#research #paper #ai
3 months ago · ai · - · -

[Paper] Evolution of Benchmark: Black-Box Optimization Benchmark Design through Large Language Model

Benchmark Design in Black-Box Optimization (BBO) is a fundamental yet open-ended topic. Early BBO benchmarks are predominantly human-crafted, introducing expert...

#research #paper #ai
3 months ago · devops · - · -

[Paper] Self-Adaptive Probabilistic Skyline Query Processing in Distributed Edge Computing via Deep Reinforcement Learning

In the era of the Internet of Everything (IoE), the exponential growth of sensor-generated data at the network edge renders efficient Probabilistic Skyline Quer...

#research #paper #devops
3 months ago · ai · - · -

[Paper] READY: Reward Discovery for Meta-Black-Box Optimization

Meta-Black-Box Optimization (MetaBBO) is an emerging avenue within Optimization community, where algorithm design policy could be meta-learned by reinforcement ...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Bridging Forecast Accuracy and Inventory KPIs: A Simulation-Based Software Framework

Efficient management of spare parts inventory is crucial in the automotive aftermarket, where demand is highly intermittent and uncertainty drives substantial c...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] DASH: Deterministic Attention Scheduling for High-throughput Reproducible LLM Training

Determinism is indispensable for reproducibility in large language model (LLM) training, yet it often exacts a steep performance cost. In widely used attention ...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] General Self-Prediction Enhancement for Spiking Neurons

Spiking Neural Networks (SNNs) are highly energy-efficient due to event-driven, sparse computation, but their training is challenged by spike non-differentiabil...

#research #paper #ai
3 months ago · software · - · -

[Paper] Folklore in Software Engineering: A Definition and Conceptual Foundations

We explore the concept of folklore within software engineering, drawing from folklore studies to define and characterize narratives, myths, rituals, humor, and ...

#research #paper #software
3 months ago · ai · - · -

[Paper] Assessing the Business Process Modeling Competences of Large Language Models

The creation of Business Process Model and Notation (BPMN) models is a complex and time-consuming task requiring both domain knowledge and proficiency in modeli...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Error Amplification Limits ANN-to-SNN Conversion in Continuous Control

Spiking Neural Networks (SNNs) can achieve competitive performance by converting already existing well-trained Artificial Neural Networks (ANNs), avoiding furth...

#research #paper #ai #machine-learning
3 months ago · software · - · -

[Paper] Towards A Sustainable Future for Peer Review in Software Engineering

Peer review is the main mechanism by which the software engineering community assesses the quality of scientific results. However, the rapid growth of paper sub...

#research #paper #software
3 months ago · ai · - · -

[Paper] EWSJF: An Adaptive Scheduler with Hybrid Partitioning for Mixed-Workload LLM Inference

Serving Large Language Models (LLMs) under mixed workloads--short, latency-sensitive interactive queries alongside long, throughput-oriented batch requests--pos...

#research #paper #ai #machine-learning
3 months ago · devops · - · -

[Paper] bigMICE: Multiple Imputation of Big Data

Missing data is a prevalent issue in many applications, including large medical registries such as the Swedish Healthcare Quality Registries, potentially leadin...

#research #paper #devops
3 months ago · ai · - · -

[Paper] Meta Context Engineering via Agentic Skill Evolution

The operational efficacy of large language models relies heavily on their inference-time context. This has established Context Engineering (CE) as a formal disc...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] LLaMEA-SAGE: Guiding Automated Algorithm Design with Structural Feedback from Explainable AI

Large language models have enabled automated algorithm design (AAD) by generating optimization algorithms directly from natural-language prompts. While evolutio...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] MAR: Efficient Large Language Models via Module-aware Architecture Refinement

Large Language Models (LLMs) excel across diverse domains but suffer from high energy costs due to quadratic attention and dense Feed-Forward Network (FFN) oper...

#research #paper #ai #machine-learning #nlp
3 months ago · ai · - · -

[Paper] ScaleSim: Serving Large-Scale Multi-Agent Simulation with Invocation Distance-Based Memory Management

LLM-based multi-agent simulations are increasingly adopted across application domains, but remain difficult to scale due to GPU memory pressure. Each agent main...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] AlignCoder: Aligning Retrieval with Target Intent for Repository-Level Code Completion

Repository-level code completion remains a challenging task for existing code large language models (code LLMs) due to their limited understanding of repository...

#code completion #retrieval-augmented generation #reinforcement learning #code LLM #repository context

Newer posts

Older posts