research — Page 54

Sort:

1 month ago · ai · - · -

[Paper] StepShield: When, Not Whether to Intervene on Rogue Agents

Existing agent safety benchmarks report binary accuracy, conflating early intervention with post-mortem analysis. A detector that flags a violation at step 8 en...

#research #paper #ai #machine-learning
1 month ago · ai · - · -

[Paper] PI-Light: Physics-Inspired Diffusion for Full-Image Relighting

Full-image relighting remains a challenging problem due to the difficulty of collecting large-scale structured paired data, the difficulty of maintaining physic...

#research #paper #ai #computer-vision
1 month ago · ai · - · -

[Paper] Early and Prediagnostic Detection of Pancreatic Cancer from Computed Tomography

Pancreatic ductal adenocarcinoma (PDAC), one of the deadliest solid malignancies, is often detected at a late and inoperable stage. Retrospective reviews of pre...

#research #paper #ai #computer-vision
1 month ago · ai · - · -

[Paper] Pay for Hints, Not Answers: LLM Shepherding for Cost-Efficient Inference

Large Language Models (LLMs) deliver state-of-the-art performance on complex reasoning tasks, but their inference costs limit deployment at scale. Small Languag...

#research #paper #ai #machine-learning
1 month ago · ai · - · -

[Paper] SMOG: Scalable Meta-Learning for Multi-Objective Bayesian Optimization

Multi-objective optimization aims to solve problems with competing objectives, often with only black-box access to a problem and a limited budget of measurement...

#research #paper #ai #machine-learning
1 month ago · ai · - · -

[Paper] World of Workflows: a Benchmark for Bringing World Models to Enterprise Systems

Frontier large language models (LLMs) excel as autonomous agents in many domains, yet they remain untested in complex enterprise systems where hidden workflows ...

#research #paper #ai #machine-learning
1 month ago · ai · - · -

[Paper] SWE-Replay: Efficient Test-Time Scaling for Software Engineering Agents

Test-time scaling has been widely adopted to enhance the capabilities of Large Language Model (LLM) agents in software engineering (SWE) tasks. However, the sta...

#research #paper #ai #machine-learning
1 month ago · ai · - · -

[Paper] EditYourself: Audio-Driven Generation and Manipulation of Talking Head Videos with Diffusion Transformers

Current generative video models excel at producing novel content from text and image prompts, but leave a critical gap in editing existing pre-recorded videos, ...

#research #paper #ai #machine-learning #computer-vision
1 month ago · ai · - · -

[Paper] Creative Image Generation with Diffusion Model

Creative image generation has emerged as a compelling area of research, driven by the need to produce novel and high-quality images that expand the boundaries o...

#research #paper #ai #computer-vision
1 month ago · ai · - · -

[Paper] A Federated and Parameter-Efficient Framework for Large Language Model Training in Medicine

Large language models (LLMs) have demonstrated strong performance on medical benchmarks, including question answering and diagnosis. To enable their use in clin...

#research #paper #ai #nlp
1 month ago · ai · - · -

[Paper] ECO: Quantized Training without Full-Precision Master Weights

Quantization has significantly improved the compute and memory efficiency of Large Language Model (LLM) training. However, existing approaches still rely on acc...

#research #paper #ai #machine-learning #nlp
1 month ago · ai · - · -

[Paper] Where Do the Joules Go? Diagnosing Inference Energy Consumption

Energy is now a critical ML computing resource. While measuring energy consumption and observing trends is a valuable first step, accurately understanding and d...

#research #paper #ai #machine-learning
1 month ago · ai · - · -

[Paper] Lens-descriptor guided evolutionary algorithm for optimization of complex optical systems with glass choice

Designing high-performance optical lenses entails exploring a high-dimensional, tightly constrained space of surface curvatures, glass choices, element thicknes...

#research #paper #ai
1 month ago · ai · - · -

[Paper] When 'Better' Prompts Hurt: Evaluation-Driven Iteration for LLM Applications

Evaluating Large Language Model (LLM) applications differs from traditional software testing because outputs are stochastic, high-dimensional, and sensitive to ...

#research #paper #ai #machine-learning #nlp
1 month ago · ai · - · -

[Paper] Heterogeneous Computing: The Key to Powering the Future of AI Agent Inference

AI agent inference is driving an inference heavy datacenter future and exposes bottlenecks beyond compute - especially memory capacity, memory bandwidth and hig...

#research #paper #ai #machine-learning
1 month ago · ai · - · -

[Paper] Liquid Interfaces: A Dynamic Ontology for the Interoperability of Autonomous Systems

Contemporary software architectures struggle to support autonomous agents whose reasoning is adaptive, probabilistic, and context-dependent, while system integr...

#research #paper #ai #machine-learning
1 month ago · ai · - · -

[Paper] Learning Decentralized LLM Collaboration with Multi-Agent Actor Critic

Recent work has explored optimizing LLM collaboration through Multi-Agent Reinforcement Learning (MARL). However, most MARL fine-tuning approaches rely on prede...

#research #paper #ai #machine-learning
1 month ago · ai · - · -

[Paper] The Energy Impact of Domain Model Design in Classical Planning

AI research has traditionally prioritised algorithmic performance, such as optimising accuracy in machine learning or runtime in automated planning. The emergin...

#research #paper #ai #machine-learning
1 month ago · ai · - · -

[Paper] Dependence of Equilibrium Propagation Training Success on Network Architecture

The rapid rise of artificial intelligence has led to an unsustainable growth in energy consumption. This has motivated progress in neuromorphic computing and ph...

#research #paper #ai #machine-learning
1 month ago · devops · - · -

[Paper] Belief Propagation Converges to Gaussian Distributions in Sparsely-Connected Factor Graphs

Belief Propagation (BP) is a powerful algorithm for distributed inference in probabilistic graphical models, however it quickly becomes infeasible for practical...

#research #paper #devops
1 month ago · ai · - · -

[Paper] Adaptive Surrogate-Based Strategy for Accelerating Convergence Speed when Solving Expensive Unconstrained Multi-Objective Optimisation Problems

Multi-Objective Evolutionary Algorithms (MOEAs) have proven effective at solving Multi-Objective Optimisation Problems (MOOPs). However, their performance can b...

#research #paper #ai
1 month ago · ai · - · -

[Paper] Evolution of Benchmark: Black-Box Optimization Benchmark Design through Large Language Model

Benchmark Design in Black-Box Optimization (BBO) is a fundamental yet open-ended topic. Early BBO benchmarks are predominantly human-crafted, introducing expert...

#research #paper #ai
1 month ago · devops · - · -

[Paper] Self-Adaptive Probabilistic Skyline Query Processing in Distributed Edge Computing via Deep Reinforcement Learning

In the era of the Internet of Everything (IoE), the exponential growth of sensor-generated data at the network edge renders efficient Probabilistic Skyline Quer...

#research #paper #devops
1 month ago · ai · - · -

[Paper] READY: Reward Discovery for Meta-Black-Box Optimization

Meta-Black-Box Optimization (MetaBBO) is an emerging avenue within Optimization community, where algorithm design policy could be meta-learned by reinforcement ...

#research #paper #ai #machine-learning

Newer posts

Older posts