research — Page 2

3 days ago · devops · - · -

[Paper] HexiSeq: Accommodating Long Context Training of LLMs over Heterogeneous Hardware

Long-context training of large language models (LLMs) is commonly distributed with Context Parallelism (CP) and Head Parallelism (HP), but existing training sys...

#research #paper #devops
3 days ago · devops · - · -

[Paper] Deadline-Driven Hierarchical Agentic Resource Sharing for AI Services and RAN Functions in AI-RAN

AI-RAN consolidates AI services and Radio Access Network (RAN) functions onto a unified, GPU-accelerated infrastructure at the network edge. However, compute sh...

#research #paper #devops
3 days ago · ai · - · -

[Paper] Broken-symmetry shape discrimination on a driven Duffing ring

Distributed computational substrates rely on two elementary operations: bundling, the act of populating a shared physical medium with independently retrievable ...

#research #paper #ai
3 days ago · devops · - · -

[Paper] RcLLM: Accelerating Generative Recommendation via Beyond-Prefix KV Caching

Large Language Models (LLMs) are transforming recommendation from ranking into a generative task, but industrial deployment remains limited by the high latency ...

#research #paper #devops
3 days ago · ai · - · -

[Paper] Discovering Ordinary Differential Equations with LLM-Based Qualitative and Quantitative Evaluation

Discovering governing differential equations from observational data is a fundamental challenge in scientific machine learning. Existing symbolic regression app...

#research #paper #ai #machine-learning
3 days ago · ai · - · -

[Paper] Same Brain, Different Prediction: How Preprocessing Choices Undermine EEG Decoding Reliability

Electroencephalography (EEG) is a cornerstone of brain-computer interfaces and clinical neuroscience, yet deep learning models are typically trained and evaluat...

#research #paper #ai #machine-learning
3 days ago · ai · - · -

[Paper] Direct-to-Event Spiking Neural Network Transfer

Spiking Neural Networks (SNNs) have gained increasing attention due to their potential for low-power computation on neuromorphic hardware. A widely adopted trai...

#research #paper #ai
3 days ago · ai · - · -

[Paper] Every Feedforward Neural Network Definable in an o-Minimal Structure Has Finite Sample Complexity

We show that, in a precise sense, a broad class of feedforward neural networks learn (have finite sample complexity) in the PAC model: every fixed finite feedfo...

#research #paper #ai #machine-learning
4 days ago · ai · - · -

[Paper] A Unified Measure-Theoretic View of Diffusion, Score-Based, and Flow Matching Generative Models

We survey continuous-time generative modeling methods based on transporting a simple reference distribution to a data distribution via stochastic or determinist...

#research #paper #ai #machine-learning #computer-vision
4 days ago · ai · - · -

[Paper] ActCam: Zero-Shot Joint Camera and 3D Motion Control for Video Generation

For artistic applications, video generation requires fine-grained control over both performance and cinematography, i.e., the actor's motion and the camera traj...

#research #paper #ai #machine-learning #computer-vision
4 days ago · ai · - · -

[Paper] UniPool: A Globally Shared Expert Pool for Mixture-of-Experts

Modern Mixture-of-Experts (MoE) architectures allocate expert capacity through a rigid per-layer rule: each transformer layer owns a separate expert set. This c...

#research #paper #ai #machine-learning
4 days ago · ai · - · -

[Paper] BAMI: Training-Free Bias Mitigation in GUI Grounding

GUI grounding is a critical capability for enabling GUI agents to execute tasks such as clicking and dragging. However, in complex scenarios like the ScreenSpot...

#research #paper #ai #machine-learning #computer-vision
4 days ago · ai · - · -

[Paper] EMO: Pretraining Mixture of Experts for Emergent Modularity

Large language models are typically deployed as monolithic systems, requiring the full model even when applications need only a narrow subset of capabilities, e...

#research #paper #ai #nlp
4 days ago · ai · - · -

[Paper] Verifier-Backed Hard Problem Generation for Mathematical Reasoning

Large Language Models (LLMs) demonstrate strong capabilities for solving scientific and mathematical problems, yet they struggle to produce valid, challenging, ...

#research #paper #ai #machine-learning #nlp
4 days ago · ai · - · -

[Paper] Relit-LiVE: Relight Video by Jointly Learning Environment Video

Recent advances have shown that large-scale video diffusion models can be repurposed as neural renderers by first decomposing videos into intrinsic scene repres...

#research #paper #ai #computer-vision
4 days ago · ai · - · -

[Paper] Why Global LLM Leaderboards Are Misleading: Small Portfolios for Heterogeneous Supervised ML

Ranking LLMs via pairwise human feedback underpins current leaderboards for open-ended tasks, such as creative writing and problem-solving. We analyze ~89K comp...

#research #paper #ai #machine-learning
4 days ago · ai · - · -

[Paper] Optimizer-Model Consistency: Full Finetuning with the Same Optimizer as Pretraining Forgets Less

Optimizers play an important role in both pretraining and finetuning stages when training large language models (LLMs). In this paper, we present an observation...

#research #paper #ai #machine-learning
4 days ago · ai · - · -

[Paper] When No Benchmark Exists: Validating Comparative LLM Safety Scoring Without Ground-Truth Labels

Many deployments must compare candidate language models for safety before a labeled benchmark exists for the relevant language, sector, or regulatory regime. We...

#research #paper #ai #machine-learning #nlp
4 days ago · ai · - · -

[Paper] AI Co-Mathematician: Accelerating Mathematicians with Agentic AI

We introduce the AI co-mathematician, a workbench for mathematicians to interactively leverage AI agents to pursue open-ended research. The AI co-mathematician ...

#research #paper #ai #machine-learning
4 days ago · ai · - · -

[Paper] Beyond Negative Rollouts: Positive-Only Policy Optimization with Implicit Negative Gradients

Reinforcement learning with verifiable rewards (RLVR), due to the deterministic verification, becomes a dominant paradigm for enhancing the reasoning ability of...

#research #paper #ai #nlp
4 days ago · ai · - · -

[Paper] Superintelligent Retrieval Agent: The Next Frontier of Information Retrieval

Retrieval-augmented agents are increasingly the interface to large organizational knowledge bases, yet most still treat retrieval as a black box: they issue exp...

#research #paper #ai #machine-learning
4 days ago · ai · - · -

[Paper] Inductive Venn-Abers and related regressors

Venn-Abers predictors are probabilistic predictors that enjoy appealing properties of validity, but their major limitation is that they are applicable only to t...

#research #paper #ai #machine-learning
4 days ago · ai · - · -

[Paper] Edge-specific signal propagation on mature chromophore-region 3D mechanism graphs for fluorescent protein quantum-yield prediction

Fluorescent protein quantum yield (QY) is governed by the mature chromophore and its three-dimensional microenvironment rather than sequence identity alone. Pro...

#research #paper #ai #machine-learning
4 days ago · ai · - · -

[Paper] Are We Making Progress in Multimodal Domain Generalization? A Comprehensive Benchmark Study

Despite the growing popularity of Multimodal Domain Generalization (MMDG) for enhancing model robustness, it remains unclear whether reported performance gains ...

#research #paper #ai #machine-learning #computer-vision
4 days ago · ai · - · -

[Paper] StraTA: Incentivizing Agentic Reinforcement Learning with Strategic Trajectory Abstraction

Large language models (LLMs) are increasingly used as interactive agents, but optimizing them for long-horizon decision making remains difficult because current...

#research #paper #ai #machine-learning #nlp
4 days ago · ai · - · -

[Paper] GlazyBench: A Benchmark for Ceramic Glaze Property Prediction and Image Generation

Developing ceramic glazes is a costly, time-consuming process of trial and error due to complex chemistry, placing a significant burden on independent artists. ...

#research #paper #ai #machine-learning #computer-vision
4 days ago · ai · - · -

[Paper] Recursive Agent Optimization

We introduce Recursive Agent Optimization (RAO), a reinforcement learning approach for training recursive agents: agents that can spawn and delegate sub-tasks t...

#research #paper #ai #machine-learning #nlp
4 days ago · ai · - · -

[Paper] Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key

Reinforcement learning (RL) has been applied to improve large language model (LLM) reasoning, yet the systematic study of how training scales with task difficul...

#research #paper #ai #machine-learning #nlp
4 days ago · ai · - · -

[Paper] DPM++: Dynamic Masked Metric Learning for Occluded Person Re-identification

Although person re-identification has made impressive progress, occlusion caused by obstacles remains an unsettled issue in real applications. The difficulty li...

#research #paper #ai #computer-vision
4 days ago · ai · - · -

[Paper] Cited but Not Verified: Parsing and Evaluating Source Attribution in LLM Deep Research Agents

Large language models (LLMs) power deep research agents that synthesize information from hundreds of web sources into cited reports, yet these citations cannot ...

#research #paper #ai #nlp
4 days ago · ai · - · -

[Paper] Parser agreement and disagreement in L2 Korean UD: Implications for human-in-the-loop annotation

We propose a simplified human-in-the-loop workflow for second language (L2) Korean morphosyntactic annotation by leveraging agreement between two domain-adapted...

#research #paper #ai #nlp
4 days ago · ai · - · -

[Paper] MASPO: Joint Prompt Optimization for LLM-based Multi-Agent Systems

Large language model (LLM)-based Multi-agent systems (MAS) have shown promise in tackling complex collaborative tasks, where agents are typically orchestrated v...

#research #paper #ai #machine-learning #nlp
4 days ago · ai · - · -

[Paper] SoftSAE: Dynamic Top-K Selection for Adaptive Sparse Autoencoders

Sparse Autoencoders (SAEs) have become an important tool in mechanistic interpretability, helping to analyze internal representations in both Large Language Mod...

#research #paper #ai #machine-learning #computer-vision
4 days ago · ai · - · -

[Paper] DINORANKCLIP: DINOv3 Distillation and Injection for Vision-Language Pretraining with High-Order Ranking Consistency

Contrastive language-image pretraining (CLIP) suffers from two structural weaknesses: the symmetric InfoNCE loss discards the relative ordering among unmatched ...

#research #paper #ai #machine-learning #computer-vision
4 days ago · ai · - · -

[Paper] Solving Minimal Problems Without Matrix Inversion Using FFT-Based Interpolation

Estimating camera geometry typically involves solving minimal problems formulated as systems of multivariate polynomial equations, which often pose computationa...

#research #paper #ai #computer-vision
4 days ago · ai · - · -

[Paper] CLAD: A Clustered Label-Agnostic Federated Learning Framework for Joint Anomaly Detection and Attack Classification

The rapid expansion of the Internet of Things (IoT) and Industrial IoT (IIoT) has created a massive, heterogeneous attack surface that challenges traditional ne...

#research #paper #ai #machine-learning
4 days ago · ai · - · -

[Paper] Continuous Latent Diffusion Language Model

Large language models have achieved remarkable success under the autoregressive paradigm, yet high-quality text generation need not be tied to a fixed left-to-r...

#research #paper #ai #machine-learning #nlp #computer-vision
4 days ago · devops · - · -

[Paper] CCL-Bench 1.0: A Trace-Based Benchmark for LLM Infrastructure

Evaluative claims about LLM infrastructure -- "workload X is fastest on hardware Y with software Z" -- depend on a complex configuration space spanning hardwa...

#research #paper #devops
4 days ago · devops · - · -

[Paper] ROSE: Rollout On Serving GPUs via Cooperative Elasticity for Agentic RL

Agentic reinforcement learning (RL) has emerged as a key driver for improving the multi-step reasoning and tool-use capabilities of LLMs. However, its efficienc...

#research #paper #devops
4 days ago · software · - · -

[Paper] To What Extent Does Agent-generated Code Require Maintenance? An Empirical Study

LLM-based autonomous coding agents have reshaped software development. While these agents excel at code generation, open questions persist about the long-term m...

#research #paper #software
4 days ago · ai · - · -

[Paper] Constraint Decay: The Fragility of LLM Agents in Backend Code Generation

Large Language Model (LLM) agents demonstrate strong performance in autonomous code generation under loose specifications. However, production-grade software re...

#research #paper #ai #machine-learning
4 days ago · devops · - · -

[Paper] ADELIA: Automatic Differentiation for Efficient Laplace Inference Approximations

Spatio-temporal Bayesian inference drives environmental and health sciences using latent Gaussian models. Integrated Nested Laplace Approximations (INLA) enable...

#research #paper #devops
4 days ago · ai · - · -

[Paper] The Causally Emergent Alignment Hypothesis: Causal Emergence Aligns with and Predicts Final Reward in Reinforcement Learning Agents

A hallmark of life on Earth is the ability of agents to exert causal power and be drivers of subsequent events. This is key to cognition at all scales. Causal e...

#research #paper #ai
4 days ago · devops · - · -

[Paper] ResiHP: Taming LLM Training Failures with Dynamic Hybrid

Hybrid parallelism underpins large-scale LLM training across tens of thousands of GPUs. At such scale, hardware failures on individual devices lead to performan...

#research #paper #devops
4 days ago · ai · - · -

[Paper] From Agent Loops to Deterministic Graphs: Execution Lineage for Reproducible AI-Native Work

Large language model systems are increasingly deployed as agentic workflows that interleave reasoning, tool use, memory, and iterative refinement. These systems...

#research #paper #ai #machine-learning
4 days ago · ai · - · -

[Paper] CoupleEvo: Evolving Heuristics for Coupled Optimization Problems Using Large Language Models

Many real-world optimization problems consist of multiple tightly coupled subproblems whose solutions must be coordinated to achieve high overall performance. H...

#research #paper #ai #machine-learning
4 days ago · ai · - · -

[Paper] Correct Code, Vulnerable Dependencies: A Large Scale Measurement Study of LLM-Specified Library Versions

Large language models (LLMs) are now largely involved in software development workflows, and the code they generate routinely includes third-party library (TPL)...

#research #paper #ai #machine-learning
4 days ago · ai · - · -

[Paper] Safactory: A Scalable Agent Factory for Trustworthy Autonomous Intelligence

As large models evolve from conversational assistants into autonomous agents, challenges increasingly arise from long-horizon decision making, tool use, and rea...

#research #paper #ai #machine-learning
