paper

Sort:

3 days ago · ai · - · -

[Paper] 123D: Unifying Multi-Modal Autonomous Driving Data at Scale

The pursuit of autonomous driving has produced one of the richest sensor data collections in all of robotics. However, its scale and diversity remain largely un...

#research #paper #ai #computer-vision
3 days ago · ai · - · -

[Paper] LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling

Test-time scaling (TTS) has become an effective approach for improving large language model performance by allocating additional computation during inference. H...

#research #paper #ai #nlp
3 days ago · ai · - · -

[Paper] Normalizing Trajectory Models

Diffusion-based models decompose sampling into many small Gaussian denoising steps -- an assumption that breaks down when generation is compressed to a few coar...

#research #paper #ai #machine-learning #computer-vision
3 days ago · ai · - · -

[Paper] Conformal Path Reasoning: Trustworthy Knowledge Graph Question Answering via Path-Level Calibration

Knowledge Graph Question Answering (KGQA) has shown promise for grounded and interpretable reasoning, yet existing approaches often fail to provide reliable cov...

#research #paper #ai #nlp
3 days ago · ai · - · -

[Paper] Zero-Shot Imagined Speech Decoding via Imagined-to-Listened MEG Mapping

Decoding imagined speech from non-invasive brain recordings is challenging because imagined datasets are scarce and difficult to align temporally across subject...

#research #paper #ai #machine-learning
3 days ago · ai · - · -

[Paper] GRAPHLCP: Structure-Aware Localized Conformal Prediction on Graphs

Conformal prediction (CP) provides a distribution-free approach to uncertainty quantification with finite-sample guarantees. However, applying CP to graph neura...

#research #paper #ai #machine-learning
3 days ago · ai · - · -

[Paper] EmambaIR: Efficient Visual State Space Model for Event-guided Image Reconstruction

Recent event-based image reconstruction methods predominantly rely on Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs) to process complementa...

#research #paper #ai #machine-learning #computer-vision
3 days ago · ai · - · -

[Paper] A Note on Non-Negative $L_1$-Approximating Polynomials

L_1-Approximating polynomials, i.e., polynomials that approximate indicator functions in L_1-norm under certain distributions, are widely used in computational ...

#research #paper #ai #machine-learning
3 days ago · ai · - · -

[Paper] VecCISC: Improving Confidence-Informed Self-Consistency with Reasoning Trace Clustering and Candidate Answer Selection

A standard technique for scaling inference-time reasoning is Self-Consistency, whereby multiple candidate answers are sampled from an LLM and the most common an...

#research #paper #ai #machine-learning
3 days ago · ai · - · -

[Paper] Proxy3D: Efficient 3D Representations for Vision-Language Models via Semantic Clustering and Alignment

Spatial intelligence in vision-language models (VLMs) attracts research interest with the practical demand to reason in the 3D world.Despite promising results, ...

#research #paper #ai #computer-vision
3 days ago · ai · - · -

[Paper] Flow-OPD: On-Policy Distillation for Flow Matching Models

Existing Flow Matching (FM) text-to-image models suffer from two critical bottlenecks under multi-task alignment: the reward sparsity induced by scalar-valued r...

#research #paper #ai #machine-learning #computer-vision
3 days ago · ai · - · -

[Paper] Rubric-Grounded RL: Structured Judge Rewards for Generalizable Reasoning

We argue that decomposing reward into weighted, verifiable criteria and using an LLM judge to score them provides a partial-credit optimization signal: instead ...

#research #paper #ai #machine-learning
3 days ago · ai · - · -

[Paper] The Memory Curse: How Expanded Recall Erodes Cooperative Intent in LLM Agents

Context window expansion is often treated as a straightforward capability upgrade for LLMs, but we find it systematically fails in multi-agent social dilemmas. ...

#research #paper #ai #machine-learning #nlp
3 days ago · ai · - · -

[Paper] 6D Pose Estimation via Keypoint Heatmap Regression with RGB-D Residual Neural Networks

In this paper, we propose a modular framework for 6D pose estimation based on keypoint heatmap regression. Our approach combines YOLOv10m for object detection w...

#research #paper #ai #computer-vision
3 days ago · ai · - · -

[Paper] CA-SQL: Complexity-Aware Inference Time Reasoning for Text-to-SQL via Exploration and Compute Budget Allocation

While recent advancements in inference-time learning have improved LLM reasoning on Text-to-SQL tasks, current solutions still struggle to perform well on the m...

#research #paper #ai #machine-learning #nlp
3 days ago · ai · - · -

[Paper] Towards Highly-Constrained Human Motion Generation with Retrieval-Guided Diffusion Noise Optimization

Generating human motion that satisfies customized zero-shot goal functions, enabling applications such as controllable character animation and behavior synthesi...

#research #paper #ai #computer-vision
3 days ago · ai · - · -

[Paper] Reinforcement Learning for Exponential Utility: Algorithms and Convergence in Discounted MDPs

Reinforcement learning (RL) for exponential-utility optimization in discounted Markov decision processes (MDPs) lacks principled value-based algorithms. We addr...

#research #paper #ai #machine-learning
3 days ago · ai · - · -

[Paper] MoCoTalk: Multi-Conditional Diffusion with Adaptive Router for Controllable Talking Head Generation

Talking-head generation requires joint modeling of identity, head pose, facial expression, and mouth dynamics. Existing methods typically address only a subset ...

#research #paper #ai #computer-vision
3 days ago · ai · - · -

[Paper] Accurate and Efficient Statistical Testing for Word Semantic Breadth

Measuring the breadth of a word's meaning, or its spread across contexts, has become feasible with contextualized token embeddings. A word type can be represent...

#research #paper #ai #nlp
3 days ago · ai · - · -

[Paper] Uncertainty-Aware Structured Data Extraction from Full CMR Reports via Distilled LLMs

Converting free-text cardiac magnetic resonance (CMR) reports into auditable structured data remains a bottleneck for cohort assembly, longitudinal curation, an...

#research #paper #ai #nlp
3 days ago · ai · - · -

[Paper] Fast Byte Latent Transformer

Recent byte-level language models (LMs) match the performance of token-level models without relying on subword vocabularies, yet their utility is limited by slo...

#research #paper #ai #machine-learning #nlp
3 days ago · ai · - · -

[Paper] SCOPE: Structured Decomposition and Conditional Skill Orchestration for Complex Image Generation

While text-to-image models have made strong progress in visual fidelity, faithfully realizing complex visual intents remains challenging because many requiremen...

#research #paper #ai #machine-learning #computer-vision
3 days ago · ai · - · -

[Paper] Beyond Pairs: Your Language Model is Secretly Optimizing a Preference Graph

Direct Preference Optimization (DPO) aligns language models using pairwise preference comparisons, offering a simple and effective alternative to Reinforcement ...

#research #paper #ai #machine-learning
3 days ago · ai · - · -

[Paper] Don't Get Your Kroneckers in a Twist: Gaussian Processes on High-Dimensional Incomplete Grids

We introduce CUTS-GPR, a new method for performing numerically exact Gaussian process regression (GPR) in high-dimensional settings. The key component of CUTS-G...

#research #paper #ai #machine-learning
3 days ago · ai · - · -

[Paper] PropSplat: Map-Free RF Field Reconstruction via 3D Gaussian Propagation Splatting

Building a site-specific propagation model typically requires either ray-tracing over detailed 3D maps or dense measurement campaigns. Both approaches are expen...

#research #paper #ai #machine-learning
3 days ago · ai · - · -

[Paper] Semiparametric Efficient Test for Interpretable Distributional Treatment Effects

Distributional treatment effects can be invisible to means: a treatment may preserve average outcomes while changing tails, modes, dispersion, or rare-event pro...

#research #paper #ai #machine-learning
3 days ago · ai · - · -

[Paper] Object Hallucination-Free Reinforcement Unlearning for Vision-Language Models

Vision-language models (VLMs) raise growing concerns about privacy, copyright, and bias, motivating machine unlearning to remove sensitive knowledge. However, e...

#research #paper #ai #computer-vision
3 days ago · ai · - · -

[Paper] MPD$^2$-Router: Mask-aware Multi-expert Prior-regularized Dual-head Deferral Router in Glaucoma Screening and Diagnosis

Learning-to-defer (L2D) can make glaucoma screening safer by routing difficult/uncertain cases to humans, yet standard formulations overlook expert availability...

#research #paper #ai #machine-learning
3 days ago · ai · - · -

[Paper] Globally Optimal Training of Spiking Neural Networks via Parameter Reconstruction

Spiking Neural Networks (SNNs) have been proposed as biologically plausible and energy-efficient alternatives to conventional Artificial Neural Networks (ANNs)....

#research #paper #ai #machine-learning
3 days ago · software · - · -

[Paper] Collaborator or Assistnat? How AI Coding Agents Partition Work Across Pull Request Lifecycles

When AI coding agents open branches and submit pull requests (PRs), two questions co-determine oversight design: who starts the work (operational agency) and wh...

#research #paper #software
3 days ago · ai · - · -

[Paper] Position: Mechanistic Interpretability Must Disclose Identification Assumptions for Causal Claims

Mechanistic interpretability papers increasingly use causal vocabulary: circuits, mediators, causal abstraction, monosemanticity. Such claims require explicit i...

#research #paper #ai #machine-learning #nlp
3 days ago · ai · - · -

[Paper] Tool Calling is Linearly Readable and Steerable in Language Models

When a tool-calling agent picks the wrong tool, the failure is invisible until execution: the email gets sent, the meeting gets missed. Probing 12 instruction-t...

#research #paper #ai #machine-learning #nlp
3 days ago · ai · - · -

[Paper] Dooly: Configuration-Agnostic, Redundancy-Aware Profiling for LLM Inference Simulation

Selecting the optimal LLM inference configuration requires evaluation across hardware, serving engines, attention backends, and model architectures, since no si...

#research #paper #ai #machine-learning
3 days ago · ai · - · -

[Paper] GLiGuard: Schema-Conditioned Classification for LLM Safeguard

Ensuring safe, policy-compliant outputs from large language models requires real-time content moderation that can scale across multiple safety dimensions. Howev...

#research #paper #ai #nlp
3 days ago · ai · - · -

[Paper] FLAM: Evaluating Model Performance with Aggregatable Measures in Federated Learning

Performance evaluation is essential for assessing the quality of machine learning (ML) models and guiding deployment decisions. In federated learning (FL), asse...

#research #paper #ai #machine-learning
3 days ago · software · - · -

[Paper] Similar Pattern Annotation via Retrieval Knowledge for LLM-Based Test Code Fault Localization

Software failures remain a major challenge in modern software development, and identifying the code elements responsible for failures is a time-consuming debugg...

#research #paper #software
3 days ago · devops · - · -

[Paper] Stencil Computations on Cerebras Wafer-Scale Engine

Stencil computations are a fundamental kernel in scientific computing, critical for simulations in domains such as fluid dynamics and climate modeling. However,...

#research #paper #devops
3 days ago · software · - · -

[Paper] Evaluating Design Conformance Through Trace Comparison

The design of a system and its implementation are two tasks often carried out by different individuals on a development team, and can occur weeks or months apar...

#research #paper #software
3 days ago · ai · - · -

[Paper] mathsf{VISTA}: Decentralized Machine Learning in Adversary Dominated Environments

Decentralized machine learning often relies on outsourcing computations, such as gradient evaluations, to untrusted worker nodes. Existing robust aggregation me...

#research #paper #ai #machine-learning
3 days ago · software · - · -

[Paper] Unsafe by Flow: Uncovering Bidirectional Data-Flow Risks in MCP Ecosystem

Model Context Protocol (MCP) have quickly become the interface layer between LLM agents and external tools, yet they also introduce unsafe data flows that exist...

#research #paper #software
3 days ago · software · - · -

[Paper] Can I Check What I Designed? Mapping Security Design DSLs to Code Analyzers

When assessing the potential impact of code-level vulnerabilities, e.g., discovered by automated analyzers, it is essential to consider them in the context of t...

#research #paper #software
3 days ago · software · - · -

[Paper] Bridging the Programming Language Gap: Constructing a Multilingual Shared Semantic Space through AST Unification and Graph Matching

The lexical and syntactic disparities among different programming languages (e.g., Java and Python) pose significant challenges for multi-language software engi...

#research #paper #software
3 days ago · software · - · -

[Paper] Coding Agents Don't Know When to Act

Coding agents are increasingly deployed to autonomously maintain software, including to resolve user-reported issues: a bug report comes in and the agent create...

#research #paper #software
3 days ago · devops · - · -

[Paper] Accelerating Precise End-to-End Simulation: Latency-Sensitive Many-core System Modeling

Modern large language model workloads put increasing demands on parallel compute capability and on-chip memory capacity, while also stressing fine-grained data ...

#research #paper #devops
3 days ago · software · - · -

[Paper] Securing the Dark Matter: A Semantic-Enhanced Neuro-Symbolic Framework for Supply Chain Analysis of Opaque Industrial Software

Automated vulnerability detection in critical-infrastructure software confronts a fundamental barrier: industrial software is routinely deployed as stripped, sy...

#research #paper #software
3 days ago · software · - · -

[Paper] SARC: A Governance-by-Architecture Framework for Agentic AI Systems

Agentic AI systems increasingly act through tools, sub-agents, and external services, but governance controls are still commonly attached to prompts, dashboards...

#research #paper #software
3 days ago · devops · - · -

[Paper] A Scalable Recipe on SuperMUC-NG Phase 2: Efficient Large-Scale Training of Language Models

Large Language Models (LLMs) continue to demonstrate superior performance with increasing scale, yet training models with billions to trillions of parameters re...

#research #paper #devops
3 days ago · devops · - · -

[Paper] Stencil Computations on Tenstorrent Wormhole

As investment in AI-focused accelerators grows and their deployment in supercomputing facilities expands, understanding whether these architectures can efficien...

#research #paper #devops

Newer posts

Older posts