Source

arXiv

5644 posts from this source

Sort:

2 months ago · devops · - · -

[Paper] Subcubic Coin Tossing in Asynchrony without Setup

We consider an asynchronous network of n parties connected to each other via secure channels, up to t of which are byzantine. We study common coin tossing, a ta...

#research #paper #devops
2 months ago · devops · - · -

[Paper] Beyond Microservices: Testing Web-Scale RCA Methods on GPU-Driven LLM Workloads

Large language model (LLM) services have become an integral part of search, assistance, and decision-making applications. However, unlike traditional web or mic...

#root-cause analysis #LLM inference #GPU observability #fault injection #RCA tools
2 months ago · software · - · -

[Paper] MetaRCA: A Generalizable Root Cause Analysis Framework for Cloud-Native Systems Powered by Meta Causal Knowledge

The dynamics and complexity of cloud-native systems present significant challenges for Root Cause Analysis (RCA). While causality-based RCA methods have shown s...

#research #paper #software
2 months ago · ai · - · -

[Paper] Real Money, Fake Models: Deceptive Model Claims in Shadow APIs

Access to frontier large language models (LLMs), such as GPT-5 and Gemini-2.5, is often hindered by high pricing, payment barriers, and regional restrictions. T...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Agentic Code Reasoning

Can LLM agents explore codebases and reason about code semantics without executing the code? We study this capability, which we call agentic code reasoning, and...

#large-language-models #code reasoning #prompt engineering #software engineering AI
2 months ago · ai · - · -

[Paper] Uniform-in-time concentration in two-layer neural networks via transportation inequalities

We quantify, uniformly over time and with high probability, the discrepancy between the predictions of a two-layer neural network trained by stochastic gradient...

#research #paper #ai
2 months ago · ai · - · -

[Paper] Architecture-Aware Multi-Design Generation for Repository-Level Feature Addition

Implementing new features across an entire codebase presents a formidable challenge for Large Language Models (LLMs). This proactive task requires a deep unders...

#large-language-models #code-generation #software-architecture #automated-testing #research-paper
2 months ago · ai · - · -

[Paper] CA-AFP: Cluster-Aware Adaptive Federated Pruning

Federated Learning (FL) faces major challenges in real-world deployments due to statistical heterogeneity across clients and system heterogeneity arising from r...

#federated learning #model pruning #client clustering #communication efficiency #heterogeneous devices
2 months ago · ai · - · -

[Paper] Bootstrapping Embeddings for Low Resource Languages

Embedding models are crucial to modern NLP. However, the creation of the most effective models relies on carefully constructed supervised finetuning data. For h...

#multilingual embeddings #low-resource languages #synthetic data generation #adapter composition #LoRA
2 months ago · ai · - · -

[Paper] TopoCurate:Modeling Interaction Topology for Tool-Use Agent Training

Training tool-use agents typically relies on outcome-based filtering: Supervised Fine-Tuning (SFT) on successful trajectories and Reinforcement Learning (RL) on...

#research #paper #ai #machine-learning #nlp
2 months ago · ai · - · -

[Paper] Legal RAG Bench: an end-to-end benchmark for legal RAG

We introduce Legal RAG Bench, a benchmark and evaluation methodology for assessing the end-to-end performance of legal RAG systems. As a benchmark, Legal RAG Be...

#research #paper #ai #machine-learning #nlp
2 months ago · ai · - · -

[Paper] Building a Strong Instruction Language Model for a Less-Resourced Language

Large language models (LLMs) have become an essential tool for natural language processing and artificial intelligence in general. Current open-source models ar...

#language models #multilingual NLP #instruction tuning #open-source AI #low-resource languages
2 months ago · ai · - · -

[Paper] QIME: Constructing Interpretable Medical Text Embeddings via Ontology-Grounded Questions

While dense biomedical embeddings achieve strong performance, their black-box nature limits their utility in clinical decision-making. Recent question-based int...

#research #paper #ai #machine-learning #nlp
2 months ago · ai · - · -

[Paper] HeRo: Adaptive Orchestration of Agentic RAG on Heterogeneous Mobile SoC

With the increasing computational capability of mobile devices, deploying agentic retrieval-augmented generation (RAG) locally on heterogeneous System-on-Chips ...

#RAG #on-device LLM #heterogeneous scheduling #mobile AI #performance optimization
2 months ago · devops · - · -

[Paper] TeraPool: A Physical Design Aware, 1024 RISC-V Cores Shared-L1-Memory Scaled-up Cluster Design with High Bandwidth Main Memory Link

Shared L1-memory clusters of streamlined instruction processors (processing elements - PEs) are commonly used as building blocks in modern, massively parallel c...

#research #paper #devops
2 months ago · software · - · -

[Paper] MigMate: A VS Code Extension for LLM-based Library Migration of Python Projects

Modern software relies heavily on third-party software libraries to streamline the development process. The act of switching one library for a similar counterpa...

#research #paper #software
2 months ago · ai · - · -

[Paper] Graph-Based Self-Healing Tool Routing for Cost-Efficient LLM Agents

Tool-using LLM agents face a reliability-cost tradeoff: routing every decision through the LLM improves correctness but incurs high latency and inference cost, ...

#LLM agents #self-healing routing #graph-based orchestration #cost optimization #tool reliability
2 months ago · software · - · -

[Paper] ICSE 2022 Sustainability Report

The carbon footprint of academic conferences becomes a topic of increasing debate. It is important to consider whether the benefits derived from attending confe...

#research #paper #software
2 months ago · devops · - · -

[Paper] Compliance as Code: A Study of Linux Distributions and Beyond

Compliance as code is an emerging idea about automating compliance through programmed compliance controls and checks. Given scant existing research thus far, th...

#compliance-as-code #linux-distributions #cybersecurity-regulations #infrastructure-automation
2 months ago · ai · - · -

[Paper] A Cascaded Graph Neural Network for Joint Root Cause Localization and Analysis in Edge Computing Environments

Edge computing environments host increasingly complex microservice-based IoT applications that are prone to performance anomalies propagating across dependent s...

#graph neural networks #root cause localization #edge computing #AIOps #microservice diagnostics
2 months ago · devops · - · -

[Paper] The Semantic Arrow of Time, Part I: From Eddington to Ethernet

This is the first of five papers comprising The Semantic Arrow of Time. The argument begins with a claim: computing's arrow of time is semantic, not thermodynam...

#research #paper #devops
2 months ago · devops · - · -

[Paper] Message Passing Without Temporal Direction: Constraint Semantics and the FITO Category Mistake

Message passing is widely assumed to be a fundamental primitive of distributed systems. This paper argues that conventional message systems embed a category mis...

#research #paper #devops
2 months ago · ai · - · -

[Paper] Quasar: Quantized Self-Speculative Acceleration for Rapid Inference via Memory-Efficient Verification

Speculative Decoding (SD) has emerged as a premier technique for accelerating Large Language Model (LLM) inference by decoupling token generation into rapid dra...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] PARWiS: Winner determination under shoestring budgets using active pairwise comparisons

Determining a winner among a set of items using active pairwise comparisons under a limited budget is a challenging problem in preference-based learning. The go...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] A Gauge Theory of Superposition: Toward a Sheaf-Theoretic Atlas of Neural Representations

We develop a discrete gauge-theoretic framework for superposition in large language models (LLMs) that replaces the single-global-dictionary premise with a shea...

#large-language-models #gauge-theory #sheaf-theory #LLM-interpretability #machine-learning-research
2 months ago · ai · - · -

[Paper] Reward-Modulated Local Learning in Spiking Encoders: Controlled Benchmarks with STDP and Hybrid Rate Readouts

This paper presents a controlled empirical study of biologically motivated local learning for handwritten digit recognition. We evaluate an STDP-inspired compet...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] UFO-4D: Unposed Feedforward 4D Reconstruction from Two Images

Dense 4D reconstruction from unposed images remains a critical challenge, with current methods relying on slow test-time optimization or fragmented, task-specif...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] Mode Seeking meets Mean Seeking for Fast Long Video Generation

Scaling video generation from seconds to minutes faces a critical bottleneck: while short-video data is abundant and high-fidelity, coherent long-form data is s...

#research #paper #ai #machine-learning #computer-vision
2 months ago · ai · - · -

[Paper] DARE-bench: Evaluating Modeling and Instruction Fidelity of LLMs in Data Science

The fast-growing demands in using Large Language Models (LLMs) to tackle complex multi-step data science tasks create an emergent need for accurate benchmarking...

#LLM benchmark #data science #model evaluation #machine learning #research
2 months ago · ai · - · -

[Paper] Do LLMs Benefit From Their Own Words?

Multi-turn interactions with large language models typically retain the assistant's own past responses in the conversation history. In this work, we revisit thi...

#research #paper #ai #machine-learning #nlp
2 months ago · ai · - · -

[Paper] CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation

GPU kernel optimization is fundamental to modern deep learning but remains a highly specialized task requiring deep hardware expertise. Despite strong performan...

#cuda #reinforcement-learning #large-language-model #kernel-optimization #synthetic-data
2 months ago · ai · - · -

[Paper] Taming Momentum: Rethinking Optimizer States Through Low-Rank Approximation

Modern optimizers like Adam and Muon are central to training large language models, but their reliance on first- and second-order momenta introduces significant...

#research #paper #ai #machine-learning #nlp
2 months ago · ai · - · -

[Paper] Memory Caching: RNNs with Growing Memory

Transformers have been established as the de-facto backbones for most recent advances in sequence modeling, mainly due to their growing memory capacity that sca...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Who Guards the Guardians? The Challenges of Evaluating Identifiability of Learned Representations

Identifiability in representation learning is commonly evaluated using standard metrics (e.g., MCC, DCI, R^2) on synthetic benchmarks with known ground-truth fa...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Resources for Automated Evaluation of Assistive RAG Systems that Help Readers with News Trustworthiness Assessment

Many readers today struggle to assess the trustworthiness of online news because reliable reporting coexists with misinformation. The TREC 2025 DRAGUN (Detectio...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Hierarchical Action Learning for Weakly-Supervised Action Segmentation

Humans perceive actions through key transitions that structure actions across multiple abstraction levels, whereas machines, relying on visual features, tend to...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] A Minimal Agent for Automated Theorem Proving

We propose a minimal agentic baseline that enables systematic comparison across different AI-based theorem prover architectures. This design implements the core...

#automated theorem proving #minimal AI agent #neural theorem provers #benchmarking #large language models
2 months ago · ai · - · -

[Paper] Efficient Discovery of Approximate Causal Abstractions via Neural Mechanism Sparsification

Neural networks are hypothesized to implement interpretable causal mechanisms, yet verifying this requires finding a causal abstraction -- a simpler, high-level...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Compositional Generalization Requires Linear, Orthogonal Representations in Vision Embedding Models

Compositional generalization, the ability to recognize familiar parts in novel contexts, is a defining property of intelligent systems. Although modern models a...

#compositional generalization #vision embeddings #representation learning #CLIP #machine learning theory
2 months ago · ai · - · -

[Paper] Active Bipartite Ranking with Smooth Posterior Distributions

In this article, bipartite ranking, a statistical learning problem involved in many applications and widely studied in the passive context, is approached in a m...

#active learning #bipartite ranking #smooth posterior distributions #PAC guarantees #ROC optimization
2 months ago · ai · - · -

[Paper] Coverage-Aware Web Crawling for Domain-Specific Supplier Discovery via a Web--Knowledge--Web Pipeline

Identifying the full landscape of small and medium-sized enterprises (SMEs) in specialized industry sectors is critical for supply-chain resilience, yet existin...

#web crawling #knowledge graph #supplier discovery #coverage estimation #machine learning
2 months ago · ai · - · -

[Paper] FaultXformer: A Transformer-Encoder Based Fault Classification and Location Identification model in PMU-Integrated Active Electrical Distribution System

Accurate fault detection and localization in electrical distribution systems is crucial, especially with the increasing integration of distributed energy resour...

#transformer #fault detection #power systems #PMU #machine learning
2 months ago · ai · - · -

[Paper] Histopathology Image Normalization via Latent Manifold Compaction

Batch effects arising from technical variations in histopathology staining protocols, scanners, and acquisition pipelines pose a persistent challenge for comput...

#histopathology #image-normalization #latent-manifold-compaction #domain-adaptation #computer-vision
2 months ago · ai · - · -

[Paper] Joint Geometric and Trajectory Consistency Learning for One-Step Real-World Super-Resolution

Diffusion-based Real-World Image Super-Resolution (Real-ISR) achieves impressive perceptual quality but suffers from high computational costs due to iterative s...

#research #paper #ai #computer-vision
2 months ago · devops · - · -

[Paper] nvidia-pcm: A D-Bus-Driven Platform Configuration Manager for OpenBMC Environments

GPU-accelerated server platforms that share most of their hardware architecture often require separate firmware images due to minor hardware differences--differ...

#research #paper #devops
2 months ago · ai · - · -

[Paper] SafeGen-LLM: Enhancing Safety Generalization in Task Planning for Robotic Systems

Safety-critical task planning in robotic systems remains challenging: classical planners suffer from poor scalability, Reinforcement Learning (RL)-based methods...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Enhancing Spatial Understanding in Image Generation via Reward Modeling

Recent progress in text-to-image generation has greatly advanced visual fidelity and creativity, but it has also imposed higher demands on prompt complexity-par...

#image generation #diffusion models #reward modeling #spatial understanding #reinforcement learning
2 months ago · ai · - · -

[Paper] MuViT: Multi-Resolution Vision Transformers for Learning Across Scales in Microscopy

Modern microscopy routinely produces gigapixel images that contain structures across multiple spatial scales, from fine cellular morphology to broader tissue or...

#research #paper #ai #machine-learning #computer-vision

Newer posts

Older posts