paper — Page 7 | EUNO.NEWS

1 week ago · ai · - · -

[Paper] From Model Scaling to System Scaling: Scaling the Harness in Agentic AI

This paper studies the next major bottleneck in agentic AI as system scaling, not only model scaling: the design of auditable, persistent, modular, and verifiab...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Squeezing Capacity from Multimodal Large Language Models for Subject-driven Generation

Subject-driven image generation aims to synthesize new images that preserve the identity of the given subject while following textual instructions. Existing app...

#research #paper #ai #machine-learning #computer-vision
1 week ago · ai · - · -

[Paper] Prism: A Plug-in Reproducible Infrastructure for Scalable Multimodal Continual Instruction Tuning

Multimodal Large Language Models (MLLMs) achieve versatility by reformulating diverse tasks into a unified instruction-following framework via instruction tunin...

#research #paper #ai #machine-learning #nlp #computer-vision
1 week ago · ai · - · -

[Paper] Helix4D: Complex 4D Mesh Generation

Current video-to-4D methods struggle with complex topology changes, transparent materials, thin structures, and inner surfaces. We present Helix4D, a dynamic me...

#research #paper #ai #computer-vision
1 week ago · ai · - · -

[Paper] Reinforcing Few-step Generators via Reward-Tilted Distribution Matching

Recent advances in few-step diffusion distillation have enabled efficient image generation, yet aligning these models with human preferences remains challenging...

#research #paper #ai #computer-vision
1 week ago · ai · - · -

[Paper] Looped Diffusion Language Models

Masked diffusion models (MDMs) have emerged as a promising alternative to autoregressive models for language modeling, yet the effective design of transformer a...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] On-Policy Adversarial Flow Distillation for Autoregressive Video Generation

Autoregressive video generators are attractive for streaming, long-horizon, and interactive applications, but distilling strong black-box teachers into causal s...

#research #paper #ai #computer-vision
1 week ago · ai · - · -

[Paper] EVIDENT: Routing MLLM Adaptation through Entity-Grounded Visual Evidence for Cross-Domain Video Temporal Grounding

Fine-tuning MLLMs for Video Temporal Grounding (VTG) often improves in-domain performance but degrades sharply under domain shift. In this work, we find that th...

#research #paper #ai #computer-vision
1 week ago · ai · - · -

[Paper] Global Structure-from-Motion Meets Feedforward Reconstruction

Structure-from-Motion -- the process of simultaneously estimating camera poses and 3D scene structure from a collection of images -- remains a central challenge...

#research #paper #ai #computer-vision
1 week ago · ai · - · -

[Paper] InstructSAM: Segment Any Instance with Any Instructions

In this paper, we introduce InstructSAM, a unified and streamlined framework designed for multi-instance segmentation under arbitrary instructions. We formulate...

#research #paper #ai #computer-vision
1 week ago · ai · - · -

[Paper] Beyond Summaries: Structure-Aware Labeling of Code Changes with Large Language Models

Code review is a critical practice in software engineering, yet the growing scale and frequency of code patches in modern projects, together with the widespread...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Language Models Need Sleep

Transformer-based large language models are increasingly used for long-horizon tasks; however, their attention mechanism scales poorly with context length. To h...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] Forgetting in Language Models: Capacity, Optimization, and Self-Generated Replay

Models trained on a new task typically degrade on prior tasks, a phenomenon known as forgetting. Traditionally, mitigating forgetting has required replaying sto...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Goal-driven Bayesian Optimal Experimental Design for Robust Decision-Making Under Model Uncertainty

Bayesian optimal experimental design (BOED) selects experiments to maximize information gain about model parameters. However, in decision-critical settings, red...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] OrpQuant: Geometric Orthogonal Residual Projection for Multiplier-Free Power-of-Two Transformer Quantization

The deployment of Large Language Models (LLMs) and Vision Transformers (ViTs) on edge devices is significantly constrained by memory limitations and the critica...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Channel-wise Vector Quantization

We present Channel-wise Vector Quantization (CVQ), a novel image tokenization paradigm that replaces patch-wise tokens with channel-wise tokens. Unlike conventi...

#research #paper #ai #machine-learning #computer-vision
1 week ago · ai · - · -

[Paper] DiscoverPhysics: Benchmarking LLMs for Out-of-the-Box Scientific Thinking

Frontier LLMs now perform strongly across a wide range of physics evaluations, but it is hard to disentangle genuine reasoning from recall of established scienc...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Claw-Anything: Benchmarking Always-On Personal Assistants with Broader Access to User's Digital World

Large language model agents are increasingly envisioned as always-on personal assistants with access to anything relevant in the user's digital world. Yet curre...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] VeriTrace: Evolving Mental Models for Deep Research Agents

Deep research agents face vast, interdependent, and pervasively uncertain information. Existing systems explore what evolving intermediate representations shoul...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Automated Benchmark Auditing for AI Agents and Large Language Models

Modern AI benchmarks operate at a complexity that outpaces traditional verification methods. Tasks authored by domain experts often contain implicit assumptions...

#research #paper #ai #nlp
1 week ago · ai · - · -

[Paper] Global Convergence of Wasserstein Policy Gradient for Entropy-Regularized Reinforcement Learning

Wasserstein policy gradient (WPG) is a policy optimization method for reinforcement learning (RL) that exploits the optimal-transport geometry of action distrib...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] StakeBench: Evaluating Language Understanding Grounded in Market Commitment

Existing financial NLP benchmarks often rely on labels supplied by outside observers, measuring how language is perceived rather than what speakers have committ...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] Active Query Synthesis for Preference Learning

Efficient learning of user preferences is crucial for many modern decision making systems but typically requires costly labeled data. Active learning reduces th...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] WhoSaidIt: Human-LLM Collaborative Annotation for Text-Based Multilingual Speaker-Attribute Classification

Annotating speaker attributes from text is inherently ambiguous, particularly in multilingual settings where demographic and social cues are implicit and cultur...

#research #paper #ai #nlp
1 week ago · ai · - · -

[Paper] When Gradients Collide: Failure Modes of Multi-Objective Prompt Optimization for LLM Judges

Customizing an LLM judge to a specific task or domain often involves optimizing its prompt across multiple evaluation criteria simultaneously. Textual gradient ...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] Confidence and Calibration of Activation Oracles for Reliable Interpretation of Language Model Internals

Activation oracles aim to make the activations of other models legible to humans and yield promising results compared to white-box interpretability techniques. ...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] Peak-Then-Collapse and the Four Interface Channels of Knowledge-Graph Tool Use

We test the standard RLVR tool-use recipe -- GRPO on Qwen2.5-7B-Instruct -- on a deliberately minimal knowledge-graph tool API: four Freebase navigation verbs o...

#research #paper #ai #nlp
1 week ago · ai · - · -

[Paper] CausaLab: A Scalable Environment for Interactive Causal Discovery Toward AI Scientists

We introduce CausaLab, a scalable environment for evaluating interactive causal discovery by LLM agents. Unlike prior evaluations, CausaLab evaluates both wheth...

#research #paper #ai #machine-learning #nlp
1 week ago · software · - · -

[Paper] Trustworthy Software Project Generation: a Case Study with an Interactive Theorem Prover

Generating code from natural-language requirements has become a primary route for LLM-assisted software development. Although LLMs can successfully complete sma...

#research #paper #software
1 week ago · software · - · -

[Paper] Uncovering multi-channel magnetic hopfion annihilation via a single-node, billion-spin-scale atomistic framework

Modern atomistic spin simulations combine long stochastic trajectories, thermodynamic sampling, static optimization and multi-image transition-path workflows, a...

#research #paper #software
1 week ago · software · - · -

[Paper] CelerLog: Fast Log Parsing via Dynamic Routing

Log parsing is a fundamental step for automated log analysis, which transforms raw log messages into structured formats. Existing syntax-based parsers struggle ...

#research #paper #software
1 week ago · software · - · -

[Paper] From Early Adoption to Sustained Use: Understanding GenAI Usage Among Software Developers in Italian SMEs

Generative AI tools are rapidly transforming software development practice, prompting unprecedented research interest. However, existing studies have predominan...

#research #paper #software
1 week ago · ai · - · -

[Paper] Joint Optimization of Training and Inference in Federated Edge Learning via Constrained Multi-Objective Deep Reinforcement Learning

Federated edge learning (FEEL) has recently emerged as a promising paradigm for achieving edge intelligence (EI) via enabling collaborative model training acros...

#research #paper #ai #machine-learning
1 week ago · software · - · -

[Paper] How Agentic AI Coding Assistants Become the Attacker's Shell

Agentic AI coding assistants can edit files, run commands, and access the internet on behalf of developers. However, their reliance on unvetted external artifac...

#research #paper #software
1 week ago · devops · - · -

[Paper] Proof of Useful Attestation: A Consensus Primitive for Attestation-Native Chains

Validators on generic Proof of Stake chains earn the same fees whether they handle attestation work correctly or selectively censor it. For chains whose main ac...

#research #paper #devops
1 week ago · ai · - · -

[Paper] A Scalable Benchmark Test Suite for Dynamic Multi-Objective Optimization with a Changing Number of Objectives

Dynamic multi-objective optimization with a changing number of objectives has recently attracted increasing attention due to its relevance to real-world problem...

#research #paper #ai
1 week ago · devops · - · -

[Paper] An Efficient and Privacy-Preserving Architecture for Cross-Institutional Collaborative RAG

Retrieval-Augmented Generation (RAG) empowers LLMs with external knowledge, making cross-institutional domain-specific knowledge base integration a highly promi...

#research #paper #devops
1 week ago · ai · - · -

[Paper] Neural Router: Semantic Content Matching for Agentic AI

Large language models (LLMs) can serve as the semantic-matching engine of a content-based publish/subscribe broker for agentic AI across the edge-cloud computin...

#research #paper #ai #nlp
1 week ago · ai · - · -

[Paper] Profiling-Driven Adaptive Distributed Transformer Inference on Embedded Edge Deployment

Distributing Transformer inference across embedded edge devices can alleviate individual memory and compute constraints, yet practical benefits on real hardware...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Meta-Engineering Harnesses for AI-Native Software Production: A Contract-Driven Adversarial Verification Architecture with Early Deployment Report

AI-native software development is often evaluated at the level of individual models, prompts, or generated artifacts. This framing is insufficient for productio...

#research #paper #ai #machine-learning
1 week ago · devops · - · -

[Paper] Bandwidth-Aware LLM Inference on Heterogeneous Many-Core Supercomputers

Large language model (LLM) inference is limited by high computational cost and memory bandwidth demands, making deployment on heterogeneous many-core processors...

#research #paper #devops
1 week ago · devops · - · -

[Paper] When Agents Control Robots: A Zero Trust Policy Model for Agentic Cyber-Physical Systems

Multi-agent systems powered by large foundation models (LFMs) are increasingly deployed to control industrial robots through natural language, creating deployme...

#research #paper #devops
1 week ago · ai · - · -

[Paper] Fine-Tuning and Serving Gemma 4 31B on Google Cloud TPU: A Technical Comparison with GPU Baselines

We present the first end-to-end demonstration of fine-tuning and serving Google's Gemma 4 31B model on TPU hardware, providing an empirical comparison of TPU an...

#research #paper #ai #machine-learning
1 week ago · devops · - · -

[Paper] DisagFusion: Asynchronous Pipeline Parallelism and Elastic Scheduling for Disaggregated Diffusion Serving

Diffusion-based generation is increasingly powering production content pipelines; however, deploying these models at scale remains a significant challenge. Mode...

#research #paper #devops
1 week ago · ai · - · -

[Paper] A Tertiary Review of Large Language Model-Based Code Generating Tasks: Trends, Challenges, and Future Directions

Context. Large language models (LLMs) are increasingly applied to code-generating tasks (CGTs) in software engineering. While reported results are promising, th...

#research #paper #ai #machine-learning
1 week ago · software · - · -

[Paper] A Heuristic Approach to Localize CSS Properties for Responsive Layout Failures

Responsive Layout Failures (RLFs) typically arise from CSS properties that hinder proper layout behavior in different screen sizes. To find an accurate and effe...

#research #paper #software
1 week ago · devops · - · -

[Paper] Bandwidth-Aware and Cost-Efficient Pipeline Parallel Scheduling in Geo-Distributed LLM Training

The rapid evolution of large language models (LLMs) has made geographically distributed training necessary due to GPU scarcity within a single cloud region. In ...

#research #paper #devops
1 week ago · ai · - · -

[Paper] Positivity in classical enumerative geometry: a case study in synchronized AI-assisted mathematics

We study the symmetric polynomial $\prod_{\alpha\in A_{n,d}}\bigl(1+\alpha_1 x_1+\cdots+\alpha_n x_n\bigr)$ where $A_{n,d}:=\{\alpha\in\mathbb{Z}_{\ge 0}^n:|\alpha|=d\}$, which is the total Chern cla...

#research #paper #ai #machine-learning
