paper — Page 19 | EUNO.NEWS

3 weeks ago · ai · - · -

[Paper] Equivariant Reinforcement Learning for Clifford Quantum Circuit Synthesis

We consider the problem of synthesizing Clifford quantum circuits for devices with all-to-all qubit connectivity. We approach this task as a reinforcement learn...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Revisiting Policy Gradients for Restricted Policy Classes: Escaping Myopic Local Optima with $k$-step Policy Gradients

This work revisits standard policy gradient methods used on restricted policy classes, which are known to get stuck in suboptimal critical points. We identify a...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Engineering Robustness into Personal Agents with the AI Workflow Store

The dominant paradigm for AI agents is an 'on-the-fly' loop in which agents synthesize plans and execute actions within seconds or minutes in response to user p...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] DataMaster: Towards Autonomous Data Engineering for Machine Learning

As model families, training recipes, and compute budgets become increasingly standardized, further gains in machine learning systems depend increasingly on data...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] CapVector: Learning Transferable Capability Vectors in Parametric Space for Vision-Language-Action Models

This paper proposes a novel approach to address the challenge that pretrained VLA models often fail to effectively improve performance and reduce adaptation cos...

#research #paper #ai #computer-vision
3 weeks ago · ai · - · -

[Paper] Beyond Red-Teaming: Formal Guarantees of LLM Guardrail Classifiers

Guardrail Classifiers defend production language models against harmful behavior, but although results seem promising in testing, they provide no formal guarant...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable Rewards

Training deep research agents, namely systems that plan, search, evaluate evidence, and synthesize long-form reports, pushes reinforcement learning beyond the r...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] Counterfactual Stress Testing for Image Classification Models

Deep learning models in medical imaging often fail when deployed in new clinical environments due to distribution shifts in demographics, scanner hardware, or a...

#research #paper #ai #computer-vision
3 weeks ago · ai · - · -

[Paper] Grounded or Guessing? LVLM Confidence Estimation via Blind-Image Contrastive Ranking

Large vision-language models suffer from visual ungroundedness: they can produce a fluent, confident, and even correct response driven entirely by language prio...

#research #paper #ai #nlp
3 weeks ago · software · - · -

[Paper] CppPerf: An Automated Pipeline and Dataset for Performance-Improving C++ Commits

Recent progress in automated repair of performance bugs demands realistic, executable benchmarks. However, existing C++ performance benchmarks are largely built...

#research #paper #software
3 weeks ago · ai · - · -

[Paper] Unmasking On-Policy Distillation: Where It Helps, Where It Hurts, and Why

On-policy distillation offers dense, per-token supervision for training reasoning models; however, it remains unclear under which conditions this signal is bene...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Shields to Guarantee Probabilistic Safety in MDPs

Shielding is a prominent model-based technique to ensure safety of autonomous agents. Classical shielding aims to ensure that nothing bad ever happens and comes...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Count Anything at Any Granularity

Open-world object counting remains brittle: despite rapid advances in vision-language models (VLMs), reliably counting the objects a user intends is far from so...

#research #paper #ai #computer-vision
3 weeks ago · ai · - · -

[Paper] LoKA: Low-precision Kernel Applications for Recommendation Models At Scale

Recent GPU generations deliver significantly higher FLOPs using lower-precision arithmetic, such as FP8. While successfully applied to large language models (LL...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Geometry-aware Prototype Learning for Cross-domain Few-shot Medical Image Segmentation

Cross-domain few-shot medical image segmentation (CD-FSMIS) requires a model to generalise simultaneously to novel anatomical categories and unseen imaging doma...

#research #paper #ai #computer-vision
3 weeks ago · ai · - · -

[Paper] Neural at ArchEHR-QA 2026: One Method Fits All: Unified Prompt Optimization for Clinical QA over EHRs

Automated question answering (QA) over electronic health records (EHRs) demands precise evidence retrieval, faithful answer generation, and explicit grounding o...

#research #paper #ai #nlp
3 weeks ago · ai · - · -

[Paper] AssayBench: An Assay-Level Virtual Cell Benchmark for LLMs and Agents

Recent advances in machine learning and large-scale biological data collections have revived the prospect of building a virtual cell, a computational model of c...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Compute Where it Counts: Self Optimizing Language Models

Efficient LLM inference research has largely focused on reducing the cost of each decoding step (e.g., using quantization, pruning, or sparse attention), typica...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] CADBench: A Multimodal Benchmark for AI-Assisted CAD Program Generation

Recovering editable CAD programs from images or 3D observations is central to AI-assisted design, but progress is difficult to measure because existing evaluati...

#research #paper #ai #machine-learning #computer-vision
3 weeks ago · ai · - · -

[Paper] BenchCAD: A Comprehensive, Industry-Standard Benchmark for Programmatic CAD

Industrial Computer-Aided Design (CAD) code generation requires models to produce executable parametric programs from visual or textual inputs. Beyond recognizi...

#research #paper #ai #machine-learning #computer-vision
3 weeks ago · ai · - · -

[Paper] DGPO: Beyond Pairwise Preferences with Directional Consistent Groupwise Optimization

Although Large Language Models (LLMs) have made remarkable progress, current preference optimization methods still struggle to align directional consistency whi...

#research #paper #ai #nlp
3 weeks ago · ai · - · -

[Paper] RUBEN: Rule-Based Explanations for Retrieval-Augmented LLM Systems

This paper demonstrates RUBEN, an interactive tool for discovering minimal rules to explain the outputs of retrieval-augmented large language models (LLMs) in d...

#research #paper #ai #nlp
3 weeks ago · devops · - · -

[Paper] Closer in the Gap: Towards Portable Performance on RISC-V Vector Processors

The RISC-V Vector Extension (RVV) is a cornerstone for supporting compute throughput in scientific and machine learning workloads. Yet compiler support and perf...

#research #paper #devops
3 weeks ago · software · - · -

[Paper] StartFlow: From Method Conception to Multi-Perspective Evaluation in UX Prototyping for Software Startups

Context. Software startups face significant challenges in building minimum viable products, particularly in the early stages, when resources are limited and exp...

#research #paper #software
3 weeks ago · ai · - · -

[Paper] ComplexMCP: Evaluation of LLM Agents in Dynamic, Interdependent, and Large-Scale Tool Sandbox

Current LLM agents are proficient at calling isolated APIs but struggle with the 'last mile' of commercial software automation. In real-world scenarios, tools a...

#research #paper #ai #machine-learning
3 weeks ago · software · - · -

[Paper] Unitaria: Quantum Linear Algebra via Block Encodings

We introduce Unitaria, a Python library that brings the simplicity of classical linear algebra toolkits such as NumPy and SciPy to the implementation of quantum...

#research #paper #software
3 weeks ago · ai · - · -

[Paper] An Uncertainty-Aware Resilience Micro-Agent for Causal Observability in the Computing Continuum

Grey failures in the computing continuum produce ambiguous overlapping symptoms that existing approaches fail to diagnose reliably, either due to a lack of caus...

#research #paper #ai #machine-learning
3 weeks ago · software · - · -

[Paper] AutoSOUP: Safety-Oriented Unit Proof Generation for Component-level Memory-Safety Verification

Memory-safety errors remain a persistent source of zero-day vulnerabilities in low-level software. The problem is especially acute in embedded systems, where ha...

#research #paper #software
3 weeks ago · software · - · -

[Paper] ChatGPT: Friend or Foe When Comprehending and Changing Unfamiliar Code

A rapidly growing body of research is examining how LLMs influence developers when they code. To date, this research has tended to focus on productivity and cod...

#research #paper #software
3 weeks ago · ai · - · -

[Paper] Energy-Efficient Implementation of Spiking Recurrent Cells on FPGA

Spiking Neural Networks (SNNs) can reduce energy consumption compared to conventional Artificial Neural Networks (ANNs) when spiking activity is sparse and the ...

#research #paper #ai
3 weeks ago · ai · - · -

[Paper] Step Rejection Fine-Tuning: A Practical Distillation Recipe

Rejection Fine-Tuning (RFT) is a standard method for training LLM agents, where unsuccessful trajectories are discarded from the training set. In the context of...

#research #paper #ai #machine-learning #nlp
3 weeks ago · devops · - · -

[Paper] Surviving Partial Rank Failures in Wide Expert-Parallel MoE Inference

Mixture-of-Experts (MoE) serving relies on wide expert parallelism (EP) to aggregate the memory capacity and bandwidth of many GPUs within one inference instanc...

#research #paper #devops
3 weeks ago · ai · - · -

[Paper] SoK: A Systematic Bidirectional Literature Review of AI & DLT Convergence

The integration of Artificial Intelligence (AI) with Distributed Ledger Technology (DLT) has become a growing research area, yet contributions tend to cluster a...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] A Theory of Multilevel Interactive Equilibrium in NeuroAI

We propose a game-theoretic framework for adaptive multi-agent intelligent systems. Unlike classical game theory, which often treats strategies as primitive obj...

#research #paper #ai
3 weeks ago · devops · - · -

[Paper] Accelerating Compound LLM Training Workloads with Maestro

Compound LLM training workloads, such as knowledge distillation and multimodal LLM (MLLM) training, are gaining prominence. These typically comprise heterogeneous...

#research #paper #devops
3 weeks ago · devops · - · -

[Paper] Privacy-preserving Chunk Scheduling in a BitTorrent Implementation of Federated Learning

Traditional federated learning (FL) relies on a central aggregator server, which can create performance bottlenecks and privacy risks. Decentralized mix-and-for...

#research #paper #devops
3 weeks ago · devops · - · -

[Paper] HiRL: Hierarchical Reinforcement Learning for Coordinated Resource Management in Heterogeneous Edge Computing

Edge computing faces unprecedented resource orchestration challenges from multi-dimensional heterogeneity across device architectures, diverse task requirements...

#research #paper #devops
3 weeks ago · ai · - · -

[Paper] Causal Explanations from the Geometric Properties of ReLU Neural Networks

Neural networks have proved an effective means of learning control policies for autonomous systems, but these learned policies are difficult to understand due t...

#research #paper #ai #machine-learning
3 weeks ago · devops · - · -

[Paper] FractalSortCPU: Bandwidth-Efficient Compressed Radix Sort on CPU

Cloud database systems, particularly their middleware and query execution layers, use sorting as a core operation in query processing, indexing and join executi...

#research #paper #devops
3 weeks ago · ai · - · -

[Paper] Agentic Performance at the Edge: Insights from Benchmarking

Agentic artificial intelligence (AI) is a natural fit for Internet of Things (IoT) and edge systems, but edge deployments are often constrained to models around...

#research #paper #ai #machine-learning
3 weeks ago · devops · - · -

[Paper] Amortized Asynchronous Byzantine Reliable Broadcast with Optimal Resilience

Byzantine Reliable Broadcast (BRB) is a fundamental primitive in distributed computing and cryptographic systems. Reducing the communication complexity of BRB p...

#research #paper #devops
3 weeks ago · ai · - · -

[Paper] Meta-Black-Box Optimization Can Do Search Guidance for Expensive Constrained Multi-Objective Optimization

Existing Meta-Black-Box Optimization (MetaBBO) methods focus on how to search when controlling optimizers, but largely overlook where to search. We propose Meta...

#research #paper #ai
3 weeks ago · ai · - · -

[Paper] Joint sparse coding and temporal dynamics support context reconfiguration

Adaptive behavior requires the brain to transition between distinct contexts while maintaining representations of prior experience. The ability to reconfigure n...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Prospective Compression in Human Abstraction Learning

A core challenge in program synthesis is online library learning: the incremental acquisition of reusable abstractions under uncertainty about future task deman...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Frequency Matching in Spiking Neural Networks for mmWave Sensing

Millimeter-wave (mmWave) sensing enables privacy-preserving, always-on edge perception, but its measurements are often sparse, temporally irregular, and corrupt...

#research #paper #ai
3 weeks ago · ai · - · -

[Paper] Parameter-Efficient Neuroevolution for Diverse LLM Generation: Quality-Diversity Optimization via Prompt Embedding Evolution

Large Language Models exhibit mode collapse, producing homogeneous outputs that fail to explore valid solution spaces. We present QD-LLM, a framework for parame...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] EvoPref: Multi-Objective Evolutionary Optimization Discovers Diverse LLM Alignments Beyond Gradient Descent

Gradient-based preference optimization methods for large language model (LLM) alignment suffer from preference collapse, converging to narrow behavioral modes w...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] Encoding and Decoding Temporal Signals with Spiking Bandpass Wavelets

Spike-based encodings are sparse and energy-efficient, but have largely been formulated probabilistically, disconnected from most signal processing literature. ...

#research #paper #ai
