[Paper] Mem-$π$: Adaptive Memory through Learning When and What to Generate
We present Mem-π, a framework for adaptive memory in large language model (LLM) agents, where useful guidance is generated on demand rather than retrieved from ...
We present Mem-π, a framework for adaptive memory in large language model (LLM) agents, where useful guidance is generated on demand rather than retrieved from ...
Autonomous manipulation systems have achieved remarkable capabilities, yet the integration of human expertise with diffusion-based policies in shared control re...
Suppose a planner has a pre-trained simulator of a sequential decision problem and the option to run real experiments in the field. The simulator is cheap to qu...
We introduce ProtoPathway, an interpretable-by-design multimodal framework for cancer survival prediction that unifies whole slide imaging and transcriptomics t...
As AI agents increasingly contribute to code development and maintenance, there is still limited empirical evidence on the quality and risk characteristics of t...
Universal approximation theorems provide a mathematical explanation for the expressive power of neural networks. They assert that, under mild conditions on the ...
Vision-language models (VLMs) are increasingly being explored for video game quality assurance, especially gameplay glitch detection. Most existing evaluations,...
Atmospheric turbulence severely degrades video quality by introducing distortions such as geometric warping, blur, and temporal flickering, posing significant c...
Overview AI agents can quickly become expensive without a clear strategy for planning, skill coverage, and budgets. This article shows how to use operations re...
Large language model (LLM) inference has become a dominant workload in modern data centers, driving significant GPU utilization and energy consumption. While pr...
Third-party Python libraries introduce dependency management overhead, supply chain risk, and deployment friction in constrained environments. A natural questio...
Agreement attraction errors, in which a verb erroneously agrees with an intervening noun rather than its grammatical head, are amplified by morphological syncre...
Metaphor requires a language model to resolve a token whose contextual meaning diverges from its basic literal sense. Understanding how transformer models organ...
As long-horizon coding agents produce more code than any developer can review, oversight collapses onto a single surface: the automated test suite. Reward hacki...
In The Algebraic Mind, Gary Marcus identified three components essential for any adequate cognitive architecture: operations over variables, recursively structu...
In The Algebraic Mind, Gary Marcus identified three components essential for any adequate cognitive architecture: operations over variables, recursively structu...
This paper describes the fifth edition of the Shared Task on Multilingual Coreference Resolution, held in conjunction with the CODI-CRAC 2026 workshop. Building...
As large language models (LLMs) increasingly shape how users form, refine, and extend their goals, attributing contributions in human-AI collaboration becomes c...
Diagnosing failures in LLM agents remains largely manual. Practitioners inspect a small subset of execution traces, form ad-hoc hypotheses, and iterate. This pr...
Image: Google Some ads will have chatbots built in. Google's AI‑powered Search era now extends to its ads. When you search for a product, Google's Gemini AI cha...
Modern LLM serving is no longer homogeneous or monolithic. Production systems now combine disaggregated execution, complex parallelism, runtime optimizations, a...
!https://9to5mac.com/wp-content/uploads/sites/6/2026/05/iphone-17-pro-three-up-iOS-26.jpg?quality=82&strip=all& We’re less than three weeks away from WWDChttps:...
NanoClaw & NanoCo AI – From Open‑Source Project to Enterprise‑Ready Platform The Vision The creators of NanoClaw – the hit open‑source, enterprise‑friendly var...
Federated learning (FL) has emerged as a promising paradigm for managing electric vehicle (EV) battery data in intelligent transportation systems (ITS), enablin...
A recent trend is to leverage machine learning models to improve the evolutionary design and optimization process. We propose a novel transformer-based mutation...
Mixed-integer extensions of evolution strategies (ES) that discretize selected coordinates of sampled continuous vectors often impose a lower bound on the stand...
AlltoAll dispatch is the dominant bottleneck of MoE expert parallelism, and the interconnect community has responded with four families of mitigations: predicti...
Corti Launches Symphony for Speech‑to‑Text Copenhagen‑based healthcare AI Corti is launching Symphony for Speech‑to‑Text, a new generation of clinical‑grade sp...
In distributed optimization, multiple parties collaborate to find an optimal solution to a problem. Privacy-preserving distributed optimization uses techniques,...
Communication is a major bottleneck in distributed learning, especially in large-scale settings and in federated learning environments with slow links. Three st...
This work presents E-ReCON, a 16 Kb energy and resource-efficient digital compute-in-memory (DCIM) macro based on a compact 3T1R ReRAM bitcell for edge-AI infer...
Among all of the possible chemical compounds, it’s estimated that between 10²⁰ and 10⁶⁰ may hold potential as small‑molecule drugs. Evaluating each of those com...
Generative AI’s Rapid Transition Generative AI’s rapid transition from text‑based chatbots to high‑fidelity media—spanning images, video, spatial 3D, and audio...
At Ramp, engineers are using Codex with GPT‑5.5 to accelerate code review and develop internal agentic tooling, helping teams get substantive pull‑request feedb...
Appointment Justin Solomon, associate professor in the MIT Department of Electrical Engineering and Computer Science EECS, has been appointed associate dean of...
Transformers trained on modular arithmetic exhibit sharp transitions between memorization, generalization, and collapse. We show that weight decay acts as a sca...
Enterprises have been slow to connect AI agents to internal APIs and databases—not because of the models, but because of credential handling. In most production...
Existing Gaussian avatar methods typically parameterize geometry on a body-template surface, which entangles the avatar's representation space with the template...
Video generation is rapidly evolving from single-shot synthesis to complex multi-shot audio-video (MSAV) narratives to meet real-world demands. However, evaluat...
Learning universal representations from electroencephalogram (EEG) signals is a cutting-edge approach in the field of neuroinformatics and brain-computer interf...
Diffusion Large Language Models (dLLMs) have emerged as a competitive alternative to autoregressive (AR) models, offering better hardware utilization and bidire...
Recent advances in vision-language models (VLMs) emphasize long chain-of-thought reasoning; yet, we find that their performance on visual tasks is primarily lim...
Large language models (LLMs) and agentic systems have shown promise for clinical decision support, but existing works largely assume that evidence has already b...
Advanced image editing software enables easy creation of highly convincing image manipulations, which has been made even more accessible in recent years due to ...
Production LLM agents combine stochastic model outputs with deterministic software systems, yet the boundary between the two is rarely treated as a first-class ...
The Power grid is a critical infrastructure underpinning all aspects of modern society and its services. Maintaining its effectiveness requires continuous adapt...
Modern Large Language Models (LLMs) have shown impressive performances in user-facing tasks such as question answering, as well as consistent improvements in re...
Flash floods in Bangladesh's haor wetlands show up with almost no warning. They wreck the annual boro rice harvest. Current setups, built for riverine floods, m...