Source

arXiv

5752 posts from this source

Sort:

2 months ago · ai · - · -

[Paper] From Core to Detail: Unsupervised Disentanglement with Entropy-Ordered Flows

Learning unsupervised representations that are both semantically meaningful and stable across runs remains a central challenge in modern representation learning...

#normalizing flows #unsupervised disentanglement #entropy-ordered latent space #representation learning #generative models
2 months ago · ai · - · -

[Paper] Cochain Perspectives on Temporal-Difference Signals for Learning Beyond Markov Dynamics

Non-Markovian dynamics are commonly found in real-world environments due to long-range dependencies, partial observability, and memory effects. The Bellman equa...

#reinforcement learning #temporal-difference #non-Markovian dynamics #topological methods
2 months ago · ai · - · -

[Paper] Reliable Mislabel Detection for Video Capsule Endoscopy Data

The classification performance of deep neural networks relies strongly on access to large, accurately annotated datasets. In medical imaging, however, obtaining...

#research #paper #ai #machine-learning #computer-vision
2 months ago · ai · - · -

[Paper] Reciprocal Latent Fields for Precomputed Sound Propagation

Realistic sound propagation is essential for immersion in a virtual scene, yet physically accurate wave-based simulations remain computationally prohibitive for...

#research #paper #ai #machine-learning
2 months ago · software · - · -

[Paper] Implementing Grassroots Logic Programs with Multiagent Transition Systems and AI

Grassroots Logic Programs (GLP) is a concurrent logic programming language with variables partitioned into paired readers and writers, conjuring both linear log...

#concurrent programming #logic programming #multi-agent systems #AI code generation #formal semantics
2 months ago · ai · - · -

[Paper] From Kepler to Newton: Inductive Biases Guide Learned World Models in Transformers

Can general-purpose AI architectures go beyond prediction to discover the physical laws governing the universe? True intelligence relies on 'world models' -- ca...

#transformers #inductive bias #physics discovery #machine learning
2 months ago · ai · - · -

[Paper] Halluverse-M^3: A multitask multilingual benchmark for hallucination in LLMs

Hallucinations in large language models remain a persistent challenge, particularly in multilingual and generative settings where factual consistency is difficu...

#LLM hallucination #multilingual benchmark #NLP evaluation #dataset release #AI research
2 months ago · ai · - · -

[Paper] Seeing Beyond Redundancy: Task Complexity's Role in Vision Token Specialization in VLLMs

Vision capabilities in vision large language models (VLLMs) have consistently lagged behind their linguistic capabilities. In particular, numerous benchmark stu...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] PANC: Prior-Aware Normalized Cut for Object Segmentation

Fully unsupervised segmentation pipelines naively seek the most salient object, should this be present. As a result, most of the methods reported in the literat...

#research #paper #ai #machine-learning #computer-vision
2 months ago · ai · - · -

[Paper] Supercharging Simulation-Based Inference for Bayesian Optimal Experimental Design

Bayesian optimal experimental design (BOED) seeks to maximize the expected information gain (EIG) of experiments. This requires a likelihood estimate, which in ...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Prompt Reinjection: Alleviating Prompt Forgetting in Multimodal Diffusion Transformers

Multimodal Diffusion Transformers (MMDiTs) for text-to-image generation maintain separate text and image branches, with bidirectional information flow between t...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] Vision Transformer Finetuning Benefits from Non-Smooth Components

The smoothness of the transformer architecture has been extensively studied in the context of generalization, training stability, and adversarial robustness. Ho...

#research #paper #ai #machine-learning #computer-vision
2 months ago · ai · - · -

[Paper] NanoFLUX: Distillation-Driven Compression of Large Text-to-Image Generation Models for Mobile Devices

While large-scale text-to-image diffusion models continue to improve in visual quality, their increasing scale has widened the gap between state-of-the-art mode...

#research #paper #ai #machine-learning #computer-vision
2 months ago · ai · - · -

[Paper] TraceCoder: A Trace-Driven Multi-Agent Framework for Automated Debugging of LLM-Generated Code

Large Language Models (LLMs) often generate code with subtle but critical bugs, especially for complex tasks. Existing automated repair methods typically rely o...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] RFDM: Residual Flow Diffusion Model for Efficient Causal Video Editing

Instructional video editing applies edits to an input video using only text prompts, enabling intuitive natural-language control. Despite rapid progress, most m...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] Uncovering Cross-Objective Interference in Multi-Objective Alignment

We study a persistent failure mode in multi-objective alignment for large language models (LLMs): training improves performance on only a subset of objectives w...

#research #paper #ai #machine-learning #nlp
2 months ago · ai · - · -

[Paper] SEMA: Simple yet Effective Learning for Multi-Turn Jailbreak Attacks

Multi-turn jailbreaks capture the real threat model for safety-aligned chatbots, where single-turn attacks are merely a special case. Yet existing approaches br...

#research #paper #ai #nlp
2 months ago · ai · - · -

[Paper] The Representational Geometry of Number

A central question in cognitive science is whether conceptual representations converge onto a shared manifold to support generalization, or diverge into orthogo...

#research #paper #ai #machine-learning #nlp
2 months ago · software · - · -

[Paper] Statistical-Based Metric Threshold Setting Method for Software Fault Prediction in Firmware Projects: An Industrial Experience

Ensuring software quality in embedded firmware is critical, especially in safety-critical domains where compliance with functional safety standards (ISO 26262) ...

#research #paper #software
2 months ago · ai · - · -

[Paper] Visual Word Sense Disambiguation with CLIP through Dual-Channel Text Prompting and Image Augmentations

Ambiguity poses persistent challenges in natural language understanding for large language models (LLMs). To better understand how lexical ambiguity can be reso...

#research #paper #ai #nlp
2 months ago · ai · - · -

[Paper] Sparse Spike Encoding of Channel Responses for Energy Efficient Human Activity Recognition

ISAC enables pervasive monitoring, but modern sensing algorithms are often too complex for energy-constrained edge devices. This motivates the development of le...

#research #paper #ai
2 months ago · software · - · -

[Paper] Beyond Function-Level Analysis: Context-Aware Reasoning for Inter-Procedural Vulnerability Detection

Recent progress in ML and LLMs has improved vulnerability detection, and recent datasets have reduced label noise and unrelated code changes. However, most exis...

#vulnerability detection #inter-procedural analysis #large language models #code property graph #software security
2 months ago · ai · - · -

[Paper] Structural bias in multi-objective optimisation

Structural bias (SB) refers to systematic preferences of an optimisation algorithm for particular regions of the search space that arise independently of the ob...

#research #paper #ai
2 months ago · software · - · -

[Paper] Using Large Language Models to Support Automation of Failure Management in CI/CD Pipelines: A Case Study in SAP HANA

CI/CD pipeline failure management is time-consuming when performed manually. Automating this process is non-trivial because the information required for effecti...

#research #paper #software
2 months ago · devops · - · -

[Paper] Same Engine, Multiple Gears: Parallelizing Fixpoint Iteration at Different Granularities (Extended Version)

Fixpoint iteration constitutes the algorithmic core of static analyzers. Parallelizing the fixpoint engine can significantly reduce analysis times. Previous app...

#research #paper #devops
2 months ago · ai · - · -

[Paper] Code vs Serialized AST Inputs for LLM-Based Code Summarization: An Empirical Study

Summarizing source code into natural language descriptions (code summarization) helps developers better understand program functionality and reduce the burden o...

#code summarization #AST serialization #LLM fine‑tuning #CodeXGLUE #Python
2 months ago · devops · - · -

[Paper] Wonderboom -- Efficient, and Censorship-Resilient Signature Aggregation for Million Scale Consensus

Over the last years, Ethereum has evolved into a public platform that safeguards the savings of hundreds of millions of people and secures more than $650 billio...

#research #paper #devops
2 months ago · ai · - · -

[Paper] Green Optimization: Energy-aware Design of Metaheuristics by Using Machine Learning Surrogates to Cope with Real Problems

Addressing real-world optimization challenges requires not only advanced metaheuristics but also continuous refinement of their internal mechanisms. This paper ...

#energy-efficient computing #metaheuristics #machine learning surrogates #optimization #green AI
2 months ago · ai · - · -

[Paper] Energy-Aware Metaheuristics

This paper presents a principled framework for designing energy-aware metaheuristics that operate under fixed energy budgets. We introduce a unified operator-le...

#metaheuristics #energy-aware computing #optimization algorithms #edge AI #expected improvement per joule
2 months ago · ai · - · -

[Paper] AgentStepper: Interactive Debugging of Software Development Agents

Software development agents powered by large language models (LLMs) have shown great promise in automating tasks like environment setup, issue solving, and prog...

#LLM #interactive debugging #software development agents #research paper #agent tooling
2 months ago · ai · - · -

[Paper] Degradation of Feature Space in Continual Learning

Centralized training is the standard paradigm in deep learning, enabling models to learn from a unified dataset in a single location. In such setup, isotropic f...

#continual learning #feature space isotropy #contrastive regularization #representation geometry
2 months ago · ai · - · -

[Paper] Reinforcement Learning-Based Dynamic Management of Structured Parallel Farm Skeletons on Serverless Platforms

We present a framework for dynamic management of structured parallel processing skeletons on serverless platforms. Our goal is to bring HPC-like performance and...

#research #paper #ai #machine-learning
2 months ago · devops · - · -

[Paper] DualMap: Enabling Both Cache Affinity and Load Balancing for Distributed LLM Serving

In LLM serving, reusing the KV cache of prompts across requests is critical for reducing TTFT and serving costs. Cache-affinity scheduling, which co-locates req...

#LLM serving #cache affinity #load balancing #distributed scheduling #cluster management
2 months ago · ai · - · -

[Paper] FCDP: Fully Cached Data Parallel for Communication-Avoiding Large-Scale Training

Training billion-parameter models requires distributing model states across GPUs using fully sharded data parallel (i.e., ZeRO-3). While ZeRO-3 succeeds on clus...

#distributed training #ZeRO-3 #communication-avoiding #host-memory caching #large-scale model training
2 months ago · ai · - · -

[Paper] BouquetFL: Emulating diverse participant hardware in Federated Learning

In Federated Learning (FL), multiple parties collaboratively train a shared Machine Learning model to encapsulate all private knowledge without exchange of info...

#federated learning #hardware emulation #heterogeneous devices #resource constraints #open-source framework
2 months ago · ai · - · -

[Paper] A neuromorphic model of the insect visual system for natural image processing

Insect vision supports complex behaviors including associative learning, navigation, and object detection, and has long motivated computational models for under...

#neuromorphic computing #spiking neural networks #self-supervised learning #computer vision #bio-inspired AI
2 months ago · ai · - · -

[Paper] AdFL: In-Browser Federated Learning for Online Advertisement

Since most countries are coming up with online privacy regulations, such as GDPR in the EU, online publishers need to find a balance between revenue from target...

#federated-learning #browser-ml #privacy #advertising #tensorflow.js
2 months ago · ai · - · -

[Paper] Identifying Adversary Tactics and Techniques in Malware Binaries with an LLM Agent

Understanding TTPs (Tactics, Techniques, and Procedures) in malware binaries is essential for security analysis and threat intelligence, yet remains challenging...

#LLM #malware analysis #ATT&CK #threat intelligence #code retrieval
2 months ago · software · - · -

[Paper] Trustworthy AI Software Engineers

With the rapid rise of AI coding agents, the fundamental premise of what it means to be a software engineer is in question. In this vision paper, we re-examine ...

#research #paper #software
2 months ago · software · - · -

[Paper] Scaling Mobile Chaos Testing with AI-Driven Test Execution

Mobile applications in large-scale distributed systems are susceptible to backend service failures, yet traditional chaos engineering approaches cannot scale mo...

#research #paper #software
2 months ago · ai · - · -

[Paper] Pseudo-Invertible Neural Networks

The Moore-Penrose Pseudo-inverse (PInv) serves as the fundamental solution for linear systems. In this paper, we propose a natural generalization of PInv to the...

#research #paper #ai #machine-learning #computer-vision
2 months ago · ai · - · -

[Paper] Shared LoRA Subspaces for almost Strict Continual Learning

Adapting large pretrained models to new tasks efficiently and continually is crucial for real-world deployment but remains challenging due to catastrophic forge...

#research #paper #ai #machine-learning #computer-vision
2 months ago · ai · - · -

[Paper] Predicting Camera Pose from Perspective Descriptions for Spatial Reasoning

Multi-image spatial reasoning remains challenging for current multimodal large language models (MLLMs). While single-view perception is inherently 2D, reasoning...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] DyTopo: Dynamic Topology Routing for Multi-Agent Reasoning via Semantic Matching

Multi-agent systems built from prompted large language models can improve multi-round reasoning, yet most existing pipelines rely on fixed, trajectory-wide comm...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] SwimBird: Eliciting Switchable Reasoning Mode in Hybrid Autoregressive MLLMs

Multimodal Large Language Models (MLLMs) have made remarkable progress in multimodal perception and reasoning by bridging vision and language. However, most exi...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] CommCP: Efficient Multi-Agent Coordination via LLM-Based Communication with Conformal Prediction

To complete assignments provided by humans in natural language, robots must interpret commands, generate and answer relevant questions for scene understanding, ...

#research #paper #ai #machine-learning #computer-vision
2 months ago · ai · - · -

[Paper] Thinking with Geometry: Active Geometry Integration for Spatial Reasoning

Recent progress in spatial reasoning with Multimodal Large Language Models (MLLMs) increasingly leverages geometric priors from 3D encoders. However, most exist...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] DFlash: Block Diffusion for Flash Speculative Decoding

Autoregressive large language models (LLMs) deliver strong performance but require inherently sequential decoding, leading to high inference latency and poor GP...

#research #paper #ai #nlp

Newer posts

Older posts