Source

arXiv

1364 posts from this source

Sort:

4 days ago · ai · - · -

[Paper] The Standard Interpretable Model: A general theory of interpretable machine learning to deductively design interpretable methods using Lagrangian mechanics

As Artificial Intelligence models grow in complexity, interpretability has become an indispensable tool for understanding, debugging, and controlling their comp...

#research #paper #ai #machine-learning
4 days ago · ai · - · -

[Paper] SpikeDecoder: Realizing the GPT Architecture with Spiking Neural Networks

The Transformer architecture is widely regarded as the most powerful tool for natural language processing, but due to a high number of complex operations, it in...

#research #paper #ai #machine-learning
4 days ago · ai · - · -

[Paper] CellNet -- Localizing Cells using Sparse and Noisy Point Annotations

Counting living cells is an important step in many biological research workflows. Our collaborators at the Wellcome Sanger Institute study vital genes in humans...

#research #paper #ai #computer-vision
4 days ago · ai · - · -

[Paper] PianoKontext: Expressive Performance Rendering from Deadpan Context

Expressive performance rendering (EPR) aims to generate realistic performances constrained on sequences of notes. However, flow matching audio editing models ma...

#research #paper #ai #machine-learning
4 days ago · ai · - · -

[Paper] Mathematical perspective on genetic algorithms with optimization guided operators

Recent work in ML applies genetic algorithms at inference time to iteratively improve solutions to optimization problems. The basic mutation and recombination o...

#research #paper #ai #machine-learning
4 days ago · ai · - · -

[Paper] Finding Sparse Subnetworks in One Training Cycle via Progressive Magnitude-Based Pruning

Neural network pruning reduces model size by removing less important parameters while aiming to preserve predictive performance. Although the Lottery Ticket Hyp...

#research #paper #ai #machine-learning #computer-vision
4 days ago · ai · - · -

[Paper] Beyond Fully Random Masking: Attention-Guided Denoising and Optimization for Diffusion Language Models

Diffusion large language models (dLLMs) offer an efficient alternative to autoregressive models through parallel decoding, yet existing post-training methods la...

#research #paper #ai #nlp
4 days ago · ai · - · -

[Paper] VOID: Defeating Unauthorized Mimicry in Latent Diffusion Models

While Latent Diffusion Models (LDMs) have revolutionized visual synthesis, they are increasingly exploited for unauthorized mimicry of individuals. Existing def...

#research #paper #ai #computer-vision
4 days ago · ai · - · -

[Paper] Bridging Day and Night: Unsupervised Cross-Domain Re-Identification with Synergistic Prompt and Prototype Learning

Cross-domain day-night re-identification (ReID) is fundamentally challenged by the substantial visual appearance discrepancies between daytime and nighttime sce...

#research #paper #ai #computer-vision
4 days ago · ai · - · -

[Paper] Reassessing High-Performing LLMs on Polish Medical Exams: True Competence or Bias-Driven Performance?

Large language models (LLMs) in medicine are mainly evaluated using multiple-choice question answering (MCQA), which can overestimate real clinical ability due ...

#research #paper #ai #nlp
4 days ago · ai · - · -

[Paper] Beyond Third-Person Audits: Situated Interaction Auditing for User-Centered LLM Bias Research

Research on bias in large language models (LLMs) has predominantly focused on third-person audits, which study how models represent or evaluate demographic grou...

#research #paper #ai #nlp
4 days ago · devops · - · -

[Paper] Efficient and Robust Online Learning to Rank in Decentralized Systems

In Online Learning to Rank (OLTR), ranking models are trained directly from live user interactions, but existing systems rely on a trusted central server to col...

#research #paper #devops
4 days ago · ai · - · -

[Paper] VIA-SD: Verification via Intra-Model Routing for Speculative Decoding

Speculative decoding (SD) addresses the high inference costs of LLMs by having lightweight drafters generate candidates for large verifiers to validate in paral...

#research #paper #ai #machine-learning #nlp
4 days ago · ai · - · -

[Paper] On The Effectiveness-Fluency Trade-Off In LLM Conditioning: A Systematic Study

Controlling the output of Large Language Models (LLMs) is a central challenge for their reliable deployment, yet a clear understanding of the involved trade-off...

#research #paper #ai #nlp
4 days ago · ai · - · -

[Paper] Rule Taxonomy and Evolution in AI IDEs: A Mining and Survey Study

The adoption of AI-powered Integrated Development Environments (AI IDEs) has introduced 'Rules' as a novel software artifact, allowing developers to persistentl...

#research #paper #ai #machine-learning
4 days ago · software · - · -

[Paper] Mind your key: An Empirical Study of LLM API Credential Leakage in iOS Apps

The rapid integration of large language models (LLMs) into mobile applications has introduced a new class of credential security risk: leaked credentials that g...

#research #paper #software
4 days ago · ai · - · -

[Paper] Can News Predict the Market? Limits of Zero-Shot Financial NLP and the Role of Explainable AI

Can financial news reliably predict short-term stock movements? Despite advances in large language models, this question remains unresolved. We revisit this pro...

#research #paper #ai #nlp
4 days ago · ai · - · -

[Paper] Adaptive Multi-Resolution Procedural Knowledge Compression for Large Language Models

Large language models (LLMs) are widely used to tackle complex tasks with autonomous workflows. Recently, reusable natural language skills have emerged as a pop...

#research #paper #ai #nlp
4 days ago · ai · - · -

[Paper] Which Speech Representation Better Matches Text-Native Reasoning? A Study of Speech-Text Alignment on Frame Rate and Representation

Spoken dialogue models typically start from text LLM backbones, yet reasoning often degrades when conditioning on speech instead of text. We attribute part of t...

#research #paper #ai #nlp
4 days ago · ai · - · -

[Paper] nD-RoPE: A Generalized RoPE for n-Dimensional Position Embedding

Rotary Position Embedding (RoPE) is widely adopted in Transformer models, yet its extension to high-dimensional domains lacks a unified theoretical formulation....

#research #paper #ai #machine-learning
4 days ago · ai · - · -

[Paper] PCA-Enhanced Adaptive NVAR Framework for High-Resolution Sea Surface Temperature Forecasting in the East Sea

Accurate forecasting of sea surface temperature (SST) in regional seas such as the East Sea is crucial for monitoring marine ecosystems, assessing climate risks...

#research #paper #ai #machine-learning
4 days ago · ai · - · -

[Paper] Unstable Features, Reproducible Subspaces: Understanding Seed Dependence in Sparse Autoencoders

Sparse autoencoders (SAEs) are widely used to interpret neural network representations, but their utility depends on whether the learned features are reproducib...

#research #paper #ai #machine-learning #nlp
4 days ago · ai · - · -

[Paper] A Riemannian Approach to Low-Rank Optimal Transport

Low-rank optimal transport (OT) mitigates the quadratic scaling of classical solvers, yet existing approaches rely heavily on first-order mirror-descent updates...

#research #paper #ai #machine-learning
4 days ago · ai · - · -

[Paper] DAM-VLA: Decoupled Asynchronous Multimodal Vision Language Action model

Vision-language-action (VLA) models inherit a shared synchronous clock from vision-language pretraining, processing every input at one rate. This is misaligned ...

#research #paper #ai #machine-learning #computer-vision
4 days ago · devops · - · -

[Paper] The PM-EdgeMap: Towards Real-Time Process Mining on the Edge-Cloud Continuum

Smart factories are evolving into Cyber-Physical Systems (CPS), demanding increased autonomy. This necessitates real-time decision making, facilitated by insigh...

#research #paper #devops
4 days ago · ai · - · -

[Paper] IntElicit: Eliciting and Assessing Contextualized Creativity via Dialogue Policy Optimization

Contextualized assessment offers high ecological validity for evaluating creativity but introduces a critical challenge: observed performance may be confounded ...

#research #paper #ai #machine-learning
4 days ago · ai · - · -

[Paper] Efficient Time Series Clustering from Multiscale Reservoir Dynamics with Granular-Ball Anchoring Graph Optimization

Time-series clustering remains challenging due to the inherent trade-off between clustering effectiveness and computational efficiency. Similarity-based methods...

#research #paper #ai #machine-learning
4 days ago · ai · - · -

[Paper] Categorical Robustness Assessment for Machine Learning based Network Intrusion Detection Systems

Network Intrusion Detection Systems (NIDS) heavily utlize Machine Learning (ML) but ML models can be manipulated via adversarial attacks. These attacks add care...

#research #paper #ai #machine-learning
4 days ago · software · - · -

[Paper] Undefined Behavior in C and C++: An Experiment With Desktop Use Cases

Undefined behavior is idiomatic to C and C++ programming; such behavior is a use of an erroneous program construct for which the languages impose no requirement...

#research #paper #software
4 days ago · ai · - · -

[Paper] Attention by Synchronization in Coupled Oscillator Networks

We address transformer attention on energy-constrained physical substrates. Softmax attention requires exponentiation and global reduction, operations with high...

#research #paper #ai #machine-learning
4 days ago · ai · - · -

[Paper] Phase Transitions in Attention: A Bayesian Theory of Copy Head Emergence

Attention is the key mechanism underlying in-context learning in transformers, and attention patterns have been observed empirically to emerge abruptly during t...

#research #paper #ai #machine-learning
4 days ago · ai · - · -

[Paper] Exploration Structure in LLM Agents for Multi-File Change Localization

Software engineering tools increasingly rely on LLM based agents to localize files to change to resolve a software issue. Most AI agents explore repositories li...

#research #paper #ai #machine-learning
4 days ago · devops · - · -

[Paper] Near-Optimal Distributed 2-Ruling Sets on Graphs with Low Arboricity

Given a graph G=(V,E), a β-ruling set is a subset of nodes Ssubseteq V that is independent, and each node in V is at distance at most β from some node in S. In ...

#research #paper #devops
4 days ago · devops · - · -

[Paper] From Fork-Join to Asynchronous Tasks: Parallelizing Tiled Cholesky Decomposition with OpenMP and HPX

Fork-join parallelism, popularized by OpenMP, remains the dominant model for shared-memory parallel programming, but its implicit synchronization barriers can p...

#research #paper #devops
4 days ago · ai · - · -

[Paper] Characterizing Software Aging in GPU-Based LLM Serving Systems

This paper proposes an empirical methodology to study software aging in GPU-based LLM serving systems. Traditional aging studies focus on CPU-centric software w...

#research #paper #ai #machine-learning
4 days ago · ai · - · -

[Paper] Agents All the Way Down; A Methodology for Building Custom AI Agents from Substrate to Production

Custom AI agents areagents that live inside their own application, talk to their own data and tools, enforce their own security boundaries, and carry their own ...

#research #paper #ai #machine-learning
4 days ago · devops · - · -

[Paper] Harnessing Routing Foresight for Micro-step-level MoE load balancing in RL Post-training

Mixture-of-Experts (MoE) and reinforcement learning (RL) post-training now dominate large language model (LLM) development, yet expert load imbalance remains a ...

#research #paper #devops
4 days ago · software · - · -

[Paper] Enhancing LLM-Based Code Translation with Verified Multi-Semantic Representations

Large language models (LLMs) have shown great promise for automated code translation, yet existing approaches often rely on token-level statistical patterns rat...

#research #paper #software
4 days ago · software · - · -

[Paper] How Requirements Quality Makes (or Breaks) Traceability Link Recovery

Traceability information between requirements and source code greatly benefits the maintenance of a software system. Since manually establishing trace links is ...

#research #paper #software
4 days ago · devops · - · -

[Paper] Optimizing Cloud Deployment: Blending of IaaS and FaaS for Microservice Architecture

The rapid evolution of cloud computing has resulted in the adoption of hybrid deployments that blend Infrastructure-as-a-Service (IaaS) and Function-as-a-Servic...

#research #paper #devops
4 days ago · ai · - · -

[Paper] Grammar-Constrained Decoding Can Jailbreak LLMs into Generating Malicious Code

Large Language Models (LLMs) are increasingly used for code generation, raising concerns that they may be misused to produce malicious code. Meanwhile, Grammar-...

#research #paper #ai #machine-learning #nlp
4 days ago · software · - · -

[Paper] Understanding and Detecting Scalability Faults in Large-Scale Distributed Systems

Scalable distributed systems form the backbone of modern computing infrastructure. However, as scale grows, system complexity may lead to scalability faults. Sc...

#research #paper #software
4 days ago · devops · - · -

[Paper] Consensus Time in 3-Majority and 2-Choices Is Determined by the Maximum Initial Opinion Density

We establish the correct parameter governing the convergence time of the 3-Majority and 2-Choices dynamics on the complete graph in the synchronous model. Recen...

#research #paper #devops
4 days ago · software · - · -

[Paper] Acoda: Adversarial Code Obfuscation for Defending against LLM-based Analysis

With the widespread adoption of Large Language Models (LLMs) in software engineering (SE) tasks such as code understanding, debugging, and vulnerability detecti...

#research #paper #software
4 days ago · devops · - · -

[Paper] MHOT: Height-Optimized Authenticated Data Structure for Blockchain State Commitment

State root computation dominates (78%) blockchain block processing time. Ethereum's canonical authenticated data structure, i.e., Merkle Patricia Trie (MPT), su...

#research #paper #devops
4 days ago · devops · - · -

[Paper] Beyond Per-Token Pricing: A Concurrency-Aware Methodology for LLM Infrastructure Cost Estimation

Every public LLM cost calculator we surveyed treats GPU utilization as a fixed input -- entered by the user, baked in as a preset, or silently assumed at 100% -...

#research #paper #devops
4 days ago · ai · - · -

[Paper] Sovereign Assurance Boundary: Certificate-Bound Admission for Agentic Infrastructure

Agentic infrastructure introduces a critical control-plane authorization problem: non-deterministic reasoning systems can propose high-stakes mutations to produ...

#research #paper #ai #machine-learning
4 days ago · devops · - · -

[Paper] Tensor-Network-Based Distributed Quantum Dynamics on Independent Quantum Computers

We present an approach based on tensor networks for distributed quantum computing simulation of chemical wavepacket dynamics in a continuous variable representa...

#research #paper #devops

Newer posts

Older posts