[Paper] Subcubic Coin Tossing in Asynchrony without Setup
We consider an asynchronous network of n parties connected to each other via secure channels, up to t of which are byzantine. We study common coin tossing, a ta...
5644 posts from this source
We consider an asynchronous network of n parties connected to each other via secure channels, up to t of which are byzantine. We study common coin tossing, a ta...
Large language model (LLM) services have become an integral part of search, assistance, and decision-making applications. However, unlike traditional web or mic...
The dynamics and complexity of cloud-native systems present significant challenges for Root Cause Analysis (RCA). While causality-based RCA methods have shown s...
Access to frontier large language models (LLMs), such as GPT-5 and Gemini-2.5, is often hindered by high pricing, payment barriers, and regional restrictions. T...
Can LLM agents explore codebases and reason about code semantics without executing the code? We study this capability, which we call agentic code reasoning, and...
We quantify, uniformly over time and with high probability, the discrepancy between the predictions of a two-layer neural network trained by stochastic gradient...
Implementing new features across an entire codebase presents a formidable challenge for Large Language Models (LLMs). This proactive task requires a deep unders...
Federated Learning (FL) faces major challenges in real-world deployments due to statistical heterogeneity across clients and system heterogeneity arising from r...
Embedding models are crucial to modern NLP. However, the creation of the most effective models relies on carefully constructed supervised finetuning data. For h...
Training tool-use agents typically relies on outcome-based filtering: Supervised Fine-Tuning (SFT) on successful trajectories and Reinforcement Learning (RL) on...
We introduce Legal RAG Bench, a benchmark and evaluation methodology for assessing the end-to-end performance of legal RAG systems. As a benchmark, Legal RAG Be...
Large language models (LLMs) have become an essential tool for natural language processing and artificial intelligence in general. Current open-source models ar...
While dense biomedical embeddings achieve strong performance, their black-box nature limits their utility in clinical decision-making. Recent question-based int...
With the increasing computational capability of mobile devices, deploying agentic retrieval-augmented generation (RAG) locally on heterogeneous System-on-Chips ...
Shared L1-memory clusters of streamlined instruction processors (processing elements - PEs) are commonly used as building blocks in modern, massively parallel c...
Modern software relies heavily on third-party software libraries to streamline the development process. The act of switching one library for a similar counterpa...
Tool-using LLM agents face a reliability-cost tradeoff: routing every decision through the LLM improves correctness but incurs high latency and inference cost, ...
The carbon footprint of academic conferences becomes a topic of increasing debate. It is important to consider whether the benefits derived from attending confe...
Compliance as code is an emerging idea about automating compliance through programmed compliance controls and checks. Given scant existing research thus far, th...
Edge computing environments host increasingly complex microservice-based IoT applications that are prone to performance anomalies propagating across dependent s...
This is the first of five papers comprising The Semantic Arrow of Time. The argument begins with a claim: computing's arrow of time is semantic, not thermodynam...
Message passing is widely assumed to be a fundamental primitive of distributed systems. This paper argues that conventional message systems embed a category mis...
Speculative Decoding (SD) has emerged as a premier technique for accelerating Large Language Model (LLM) inference by decoupling token generation into rapid dra...
Determining a winner among a set of items using active pairwise comparisons under a limited budget is a challenging problem in preference-based learning. The go...
We develop a discrete gauge-theoretic framework for superposition in large language models (LLMs) that replaces the single-global-dictionary premise with a shea...
This paper presents a controlled empirical study of biologically motivated local learning for handwritten digit recognition. We evaluate an STDP-inspired compet...
Dense 4D reconstruction from unposed images remains a critical challenge, with current methods relying on slow test-time optimization or fragmented, task-specif...
Scaling video generation from seconds to minutes faces a critical bottleneck: while short-video data is abundant and high-fidelity, coherent long-form data is s...
The fast-growing demands in using Large Language Models (LLMs) to tackle complex multi-step data science tasks create an emergent need for accurate benchmarking...
Multi-turn interactions with large language models typically retain the assistant's own past responses in the conversation history. In this work, we revisit thi...
GPU kernel optimization is fundamental to modern deep learning but remains a highly specialized task requiring deep hardware expertise. Despite strong performan...
Modern optimizers like Adam and Muon are central to training large language models, but their reliance on first- and second-order momenta introduces significant...
Transformers have been established as the de-facto backbones for most recent advances in sequence modeling, mainly due to their growing memory capacity that sca...
Identifiability in representation learning is commonly evaluated using standard metrics (e.g., MCC, DCI, R^2) on synthetic benchmarks with known ground-truth fa...
Many readers today struggle to assess the trustworthiness of online news because reliable reporting coexists with misinformation. The TREC 2025 DRAGUN (Detectio...
Humans perceive actions through key transitions that structure actions across multiple abstraction levels, whereas machines, relying on visual features, tend to...
We propose a minimal agentic baseline that enables systematic comparison across different AI-based theorem prover architectures. This design implements the core...
Neural networks are hypothesized to implement interpretable causal mechanisms, yet verifying this requires finding a causal abstraction -- a simpler, high-level...
Compositional generalization, the ability to recognize familiar parts in novel contexts, is a defining property of intelligent systems. Although modern models a...
In this article, bipartite ranking, a statistical learning problem involved in many applications and widely studied in the passive context, is approached in a m...
Identifying the full landscape of small and medium-sized enterprises (SMEs) in specialized industry sectors is critical for supply-chain resilience, yet existin...
Accurate fault detection and localization in electrical distribution systems is crucial, especially with the increasing integration of distributed energy resour...
Batch effects arising from technical variations in histopathology staining protocols, scanners, and acquisition pipelines pose a persistent challenge for comput...
Diffusion-based Real-World Image Super-Resolution (Real-ISR) achieves impressive perceptual quality but suffers from high computational costs due to iterative s...
GPU-accelerated server platforms that share most of their hardware architecture often require separate firmware images due to minor hardware differences--differ...
Safety-critical task planning in robotic systems remains challenging: classical planners suffer from poor scalability, Reinforcement Learning (RL)-based methods...
Recent progress in text-to-image generation has greatly advanced visual fidelity and creativity, but it has also imposed higher demands on prompt complexity-par...
Modern microscopy routinely produces gigapixel images that contain structures across multiple spatial scales, from fine cellular morphology to broader tissue or...