paper — Page 23 | EUNO.NEWS

Sort:

3 weeks ago · ai · - · -

[Paper] GLiGuard: Schema-Conditioned Classification for LLM Safeguard

Ensuring safe, policy-compliant outputs from large language models requires real-time content moderation that can scale across multiple safety dimensions. Howev...

#research #paper #ai #nlp
3 weeks ago · ai · - · -

[Paper] FLAM: Evaluating Model Performance with Aggregatable Measures in Federated Learning

Performance evaluation is essential for assessing the quality of machine learning (ML) models and guiding deployment decisions. In federated learning (FL), asse...

#research #paper #ai #machine-learning
3 weeks ago · software · - · -

[Paper] Similar Pattern Annotation via Retrieval Knowledge for LLM-Based Test Code Fault Localization

Software failures remain a major challenge in modern software development, and identifying the code elements responsible for failures is a time-consuming debugg...

#research #paper #software
3 weeks ago · devops · - · -

[Paper] Stencil Computations on Cerebras Wafer-Scale Engine

Stencil computations are a fundamental kernel in scientific computing, critical for simulations in domains such as fluid dynamics and climate modeling. However,...

#research #paper #devops
3 weeks ago · software · - · -

[Paper] Evaluating Design Conformance Through Trace Comparison

The design of a system and its implementation are two tasks often carried out by different individuals on a development team, and can occur weeks or months apar...

#research #paper #software
3 weeks ago · ai · - · -

[Paper] mathsf{VISTA}: Decentralized Machine Learning in Adversary Dominated Environments

Decentralized machine learning often relies on outsourcing computations, such as gradient evaluations, to untrusted worker nodes. Existing robust aggregation me...

#research #paper #ai #machine-learning
3 weeks ago · software · - · -

[Paper] Unsafe by Flow: Uncovering Bidirectional Data-Flow Risks in MCP Ecosystem

Model Context Protocol (MCP) have quickly become the interface layer between LLM agents and external tools, yet they also introduce unsafe data flows that exist...

#research #paper #software
3 weeks ago · software · - · -

[Paper] Can I Check What I Designed? Mapping Security Design DSLs to Code Analyzers

When assessing the potential impact of code-level vulnerabilities, e.g., discovered by automated analyzers, it is essential to consider them in the context of t...

#research #paper #software
3 weeks ago · software · - · -

[Paper] Bridging the Programming Language Gap: Constructing a Multilingual Shared Semantic Space through AST Unification and Graph Matching

The lexical and syntactic disparities among different programming languages (e.g., Java and Python) pose significant challenges for multi-language software engi...

#research #paper #software
3 weeks ago · software · - · -

[Paper] Coding Agents Don't Know When to Act

Coding agents are increasingly deployed to autonomously maintain software, including to resolve user-reported issues: a bug report comes in and the agent create...

#research #paper #software
3 weeks ago · devops · - · -

[Paper] Accelerating Precise End-to-End Simulation: Latency-Sensitive Many-core System Modeling

Modern large language model workloads put increasing demands on parallel compute capability and on-chip memory capacity, while also stressing fine-grained data ...

#research #paper #devops
3 weeks ago · software · - · -

[Paper] Securing the Dark Matter: A Semantic-Enhanced Neuro-Symbolic Framework for Supply Chain Analysis of Opaque Industrial Software

Automated vulnerability detection in critical-infrastructure software confronts a fundamental barrier: industrial software is routinely deployed as stripped, sy...

#research #paper #software
3 weeks ago · software · - · -

[Paper] SARC: A Governance-by-Architecture Framework for Agentic AI Systems

Agentic AI systems increasingly act through tools, sub-agents, and external services, but governance controls are still commonly attached to prompts, dashboards...

#research #paper #software
3 weeks ago · devops · - · -

[Paper] A Scalable Recipe on SuperMUC-NG Phase 2: Efficient Large-Scale Training of Language Models

Large Language Models (LLMs) continue to demonstrate superior performance with increasing scale, yet training models with billions to trillions of parameters re...

#research #paper #devops
3 weeks ago · devops · - · -

[Paper] Stencil Computations on Tenstorrent Wormhole

As investment in AI-focused accelerators grows and their deployment in supercomputing facilities expands, understanding whether these architectures can efficien...

#research #paper #devops
3 weeks ago · devops · - · -

[Paper] HexiSeq: Accommodating Long Context Training of LLMs over Heterogeneous Hardware

Long-context training of large language models (LLMs) is commonly distributed with Context Parallelism (CP) and Head Parallelism (HP), but existing training sys...

#research #paper #devops
3 weeks ago · devops · - · -

[Paper] Deadline-Driven Hierarchical Agentic Resource Sharing for AI Services and RAN Functions in AI-RAN

AI-RAN consolidates AI services and Radio Access Network (RAN) functions onto a unified, GPU-accelerated infrastructure at the network edge. However, compute sh...

#research #paper #devops
3 weeks ago · ai · - · -

[Paper] Broken-symmetry shape discrimination on a driven Duffing ring

Distributed computational substrates rely on two elementary operations: bundling, the act of populating a shared physical medium with independently retrievable ...

#research #paper #ai
3 weeks ago · devops · - · -

[Paper] RcLLM: Accelerating Generative Recommendation via Beyond-Prefix KV Caching

Large Language Models (LLMs) are transforming recommendation from ranking into a generative task, but industrial deployment remains limited by the high latency ...

#research #paper #devops
3 weeks ago · ai · - · -

[Paper] Discovering Ordinary Differential Equations with LLM-Based Qualitative and Quantitative Evaluation

Discovering governing differential equations from observational data is a fundamental challenge in scientific machine learning. Existing symbolic regression app...

#research #paper #ai #machine-learning
0 month ago · ai · - · -

[Paper] Same Brain, Different Prediction: How Preprocessing Choices Undermine EEG Decoding Reliability

Electroencephalography (EEG) is a cornerstone of brain-computer interfaces and clinical neuroscience, yet deep learning models are typically trained and evaluat...

#research #paper #ai #machine-learning
0 month ago · ai · - · -

[Paper] Direct-to-Event Spiking Neural Network Transfer

Spiking Neural Networks (SNNs) have gained increasing attention due to their potential for low-power computation on neuromorphic hardware. A widely adopted trai...

#research #paper #ai
0 month ago · ai · - · -

[Paper] Every Feedforward Neural Network Definable in an o-Minimal Structure Has Finite Sample Complexity

We show that, in a precise sense, a broad class of feedforward neural networks learn (have finite sample complexity) in the PAC model: every fixed finite feedfo...

#research #paper #ai #machine-learning
0 month ago · ai · - · -

[Paper] A Unified Measure-Theoretic View of Diffusion, Score-Based, and Flow Matching Generative Models

We survey continuous-time generative modeling methods based on transporting a simple reference distribution to a data distribution via stochastic or determinist...

#research #paper #ai #machine-learning #computer-vision
0 month ago · ai · - · -

[Paper] ActCam: Zero-Shot Joint Camera and 3D Motion Control for Video Generation

For artistic applications, video generation requires fine-grained control over both performance and cinematography, i.e., the actor's motion and the camera traj...

#research #paper #ai #machine-learning #computer-vision
0 month ago · ai · - · -

[Paper] UniPool: A Globally Shared Expert Pool for Mixture-of-Experts

Modern Mixture-of-Experts (MoE) architectures allocate expert capacity through a rigid per-layer rule: each transformer layer owns a separate expert set. This c...

#research #paper #ai #machine-learning
0 month ago · ai · - · -

[Paper] BAMI: Training-Free Bias Mitigation in GUI Grounding

GUI grounding is a critical capability for enabling GUI agents to execute tasks such as clicking and dragging. However, in complex scenarios like the ScreenSpot...

#research #paper #ai #machine-learning #computer-vision
0 month ago · ai · - · -

[Paper] EMO: Pretraining Mixture of Experts for Emergent Modularity

Large language models are typically deployed as monolithic systems, requiring the full model even when applications need only a narrow subset of capabilities, e...

#research #paper #ai #nlp
0 month ago · ai · - · -

[Paper] Verifier-Backed Hard Problem Generation for Mathematical Reasoning

Large Language Models (LLMs) demonstrate strong capabilities for solving scientific and mathematical problems, yet they struggle to produce valid, challenging, ...

#research #paper #ai #machine-learning #nlp
0 month ago · ai · - · -

[Paper] Relit-LiVE: Relight Video by Jointly Learning Environment Video

Recent advances have shown that large-scale video diffusion models can be repurposed as neural renderers by first decomposing videos into intrinsic scene repres...

#research #paper #ai #computer-vision
0 month ago · ai · - · -

[Paper] Why Global LLM Leaderboards Are Misleading: Small Portfolios for Heterogeneous Supervised ML

Ranking LLMs via pairwise human feedback underpins current leaderboards for open-ended tasks, such as creative writing and problem-solving. We analyze ~89K comp...

#research #paper #ai #machine-learning
0 month ago · ai · - · -

[Paper] Optimizer-Model Consistency: Full Finetuning with the Same Optimizer as Pretraining Forgets Less

Optimizers play an important role in both pretraining and finetuning stages when training large language models (LLMs). In this paper, we present an observation...

#research #paper #ai #machine-learning
0 month ago · ai · - · -

[Paper] When No Benchmark Exists: Validating Comparative LLM Safety Scoring Without Ground-Truth Labels

Many deployments must compare candidate language models for safety before a labeled benchmark exists for the relevant language, sector, or regulatory regime. We...

#research #paper #ai #machine-learning #nlp
0 month ago · ai · - · -

[Paper] AI Co-Mathematician: Accelerating Mathematicians with Agentic AI

We introduce the AI co-mathematician, a workbench for mathematicians to interactively leverage AI agents to pursue open-ended research. The AI co-mathematician ...

#research #paper #ai #machine-learning
0 month ago · ai · - · -

[Paper] Beyond Negative Rollouts: Positive-Only Policy Optimization with Implicit Negative Gradients

Reinforcement learning with verifiable rewards (RLVR), due to the deterministic verification, becomes a dominant paradigm for enhancing the reasoning ability of...

#research #paper #ai #nlp
0 month ago · ai · - · -

[Paper] Superintelligent Retrieval Agent: The Next Frontier of Information Retrieval

Retrieval-augmented agents are increasingly the interface to large organizational knowledge bases, yet most still treat retrieval as a black box: they issue exp...

#research #paper #ai #machine-learning
0 month ago · ai · - · -

[Paper] Inductive Venn-Abers and related regressors

Venn-Abers predictors are probabilistic predictors that enjoy appealing properties of validity, but their major limitation is that they are applicable only to t...

#research #paper #ai #machine-learning
0 month ago · ai · - · -

[Paper] Edge-specific signal propagation on mature chromophore-region 3D mechanism graphs for fluorescent protein quantum-yield prediction

Fluorescent protein quantum yield (QY) is governed by the mature chromophore and its three-dimensional microenvironment rather than sequence identity alone. Pro...

#research #paper #ai #machine-learning
0 month ago · ai · - · -

[Paper] Are We Making Progress in Multimodal Domain Generalization? A Comprehensive Benchmark Study

Despite the growing popularity of Multimodal Domain Generalization (MMDG) for enhancing model robustness, it remains unclear whether reported performance gains ...

#research #paper #ai #machine-learning #computer-vision
0 month ago · ai · - · -

[Paper] StraTA: Incentivizing Agentic Reinforcement Learning with Strategic Trajectory Abstraction

Large language models (LLMs) are increasingly used as interactive agents, but optimizing them for long-horizon decision making remains difficult because current...

#research #paper #ai #machine-learning #nlp
0 month ago · ai · - · -

[Paper] GlazyBench: A Benchmark for Ceramic Glaze Property Prediction and Image Generation

Developing ceramic glazes is a costly, time-consuming process of trial and error due to complex chemistry, placing a significant burden on independent artists. ...

#research #paper #ai #machine-learning #computer-vision
0 month ago · ai · - · -

[Paper] Recursive Agent Optimization

We introduce Recursive Agent Optimization (RAO), a reinforcement learning approach for training recursive agents: agents that can spawn and delegate sub-tasks t...

#research #paper #ai #machine-learning #nlp
0 month ago · ai · - · -

[Paper] Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key

Reinforcement learning (RL) has been applied to improve large language model (LLM) reasoning, yet the systematic study of how training scales with task difficul...

#research #paper #ai #machine-learning #nlp
0 month ago · ai · - · -

[Paper] DPM++: Dynamic Masked Metric Learning for Occluded Person Re-identification

Although person re-identification has made impressive progress, occlusion caused by obstacles remains an unsettled issue in real applications. The difficulty li...

#research #paper #ai #computer-vision
0 month ago · ai · - · -

[Paper] Cited but Not Verified: Parsing and Evaluating Source Attribution in LLM Deep Research Agents

Large language models (LLMs) power deep research agents that synthesize information from hundreds of web sources into cited reports, yet these citations cannot ...

#research #paper #ai #nlp
0 month ago · ai · - · -

[Paper] Parser agreement and disagreement in L2 Korean UD: Implications for human-in-the-loop annotation

We propose a simplified human-in-the-loop workflow for second language (L2) Korean morphosyntactic annotation by leveraging agreement between two domain-adapted...

#research #paper #ai #nlp
0 month ago · ai · - · -

[Paper] MASPO: Joint Prompt Optimization for LLM-based Multi-Agent Systems

Large language model (LLM)-based Multi-agent systems (MAS) have shown promise in tackling complex collaborative tasks, where agents are typically orchestrated v...

#research #paper #ai #machine-learning #nlp
0 month ago · ai · - · -

[Paper] SoftSAE: Dynamic Top-K Selection for Adaptive Sparse Autoencoders

Sparse Autoencoders (SAEs) have become an important tool in mechanistic interpretability, helping to analyze internal representations in both Large Language Mod...

#research #paper #ai #machine-learning #computer-vision

Newer posts

Older posts