machine learning — Page 5

Sort:

1 week ago · ai · - · -

[Paper] Confidence and Calibration of Activation Oracles for Reliable Interpretation of Language Model Internals

Activation oracles aim to make the activations of other models legible to humans and yield promising results compared to white-box interpretability techniques. ...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] CausaLab: A Scalable Environment for Interactive Causal Discovery Toward AI Scientists

We introduce CausaLab, a scalable environment for evaluating interactive causal discovery by LLM agents. Unlike prior evaluations, CausaLab evaluates both wheth...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] Joint Optimization of Training and Inference in Federated Edge Learning via Constrained Multi-Objective Deep Reinforcement Learning

Federated edge learning (FEEL) has recently emerged as a promising paradigm for achieving edge intelligence (EI) via enabling collaborative model training acros...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Profiling-Driven Adaptive Distributed Transformer Inference on Embedded Edge Deployment

Distributing Transformer inference across embedded edge devices can alleviate individual memory and compute constraints, yet practical benefits on real hardware...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Meta-Engineering Harnesses for AI-Native Software Production: A Contract-Driven Adversarial Verification Architecture with Early Deployment Report

AI-native software development is often evaluated at the level of individual models, prompts, or generated artifacts. This framing is insufficient for productio...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Fine-Tuning and Serving Gemma 4 31B on Google Cloud TPU: A Technical Comparison with GPU Baselines

We present the first end-to-end demonstration of fine-tuning and serving Google's Gemma 4 31B model on TPU hardware, providing an empirical comparison of TPU an...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] A Tertiary Review of Large Language Model-Based Code Generating Tasks: Trends, Challenges, and Future Directions

Context. Large language models (LLMs) are increasingly applied to code-generating tasks (CGTs) in software engineering. While reported results are promising, th...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Positivity in classical enumerative geometry: a case study in synchronized AI-assisted mathematics

We study the symmetric polynomial prod_{αin A_{n,d}}bigl(1+α_1 x_1+cdots+α_n x_nbigr) where A_{n,d}:={αinmathbb{Z}_{ge 0}^n:|α|=d}, which is the total Chern cla...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Growing a Neural Network in Breadth, Depth, and Time

Spatial and temporal resource constraints are critical for both biological and artificial intelligent systems. Here we define differentiable cost terms for brea...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Cultivating Machine Intelligence: The OMEGA Shift from Top-Down Optimization to Autopoietic Cognitive Ecologies

The dominant artificial intelligence paradigm trains neural architectures via gradient descent against proxy objectives and reinforcement learning from human fe...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Convex-Neural RRT*: Fast and Reliable Learning-Guided Sampling for High-Quality Robot Path Planning

Sampling-based algorithms for robot path planning offer probabilistic completeness and strong empirical convergence properties across environments with diverse ...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] SkillOpt: Executive Strategy for Self-Evolving Agent Skills

Agent skills today are hand-crafted, generated one-shot, or evolved through loosely controlled self-revision, none of which behaves like a deep-learning optimiz...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] LLMs as Noisy Channels: A Shannon Perspective on Model Capacity and Scaling Laws

Existing scaling laws for Large Language Models (LLMs), predominantly monotonic power laws, fail to explain emerging non-monotonic phenomena such as catastrophi...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] From Raw Experience to Skill Consumption: A Systematic Study of Model-Generated Agent Skills

Language agents increasingly improve by reusing skills -- structured procedural artifacts distilled from past experience. In particular, domain-level and model-...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] SPACENUM: Revisiting Spatial Numerical Understanding in VLMs

Vision-Language Models (VLMs) are increasingly deployed in embodied environments, where they need produce numerical outputs such as action magnitudes and spatia...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] ETCHR: Editing To Clarify and Harness Reasoning

Multimodal Large Language Models have advanced visual reasoning, yet a purely textual chain of thought remains a bottleneck for questions that require fine-grai...

#research #paper #ai #machine-learning #nlp #computer-vision
1 week ago · ai · - · -

[Paper] Complete-muE: Optimal Hyperparameter Transfer and Scaling for MoE Models

We propose Complete-muE, a framework which targets hyperparameter transfer across dense FFN and any Mixture-of-Experts (MoE) setups in transformer blocks. Exist...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Good Token Hunting: A Hitchhiker's Guide to Token Selection for Visual Geometry Transformers

Visual geometry transformers have become powerful architectures for multi-view 3D reconstruction, enabling joint prediction of multiple 3D attributes in a feed-...

#research #paper #ai #machine-learning #computer-vision
1 week ago · ai · - · -

[Paper] CHRONOS: Temporally-Aware Multi-Agent Coordination for Evolving Data Marketplaces

Temporal knowledge-graph data marketplaces face three coupled failures in static designs: stale hybrid index shortcuts reduce recall as edges evolve, stationary...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] PGT: Procedurally Generated Tasks for improving visual grounding in MLLMs

Despite remarkable progress in Multimodal Large Language Models (MLLMs), these models still struggle with fine-grained understanding tasks. In this work, we pro...

#research #paper #ai #machine-learning #computer-vision
1 week ago · ai · - · -

[Paper] On the Stability of Spherical Hellinger-Kantorovich Flows and Their Implications for Differential Privacy

Gradient-flow sampling interprets a Gibbs distribution as the minimizer of an energy functional over probability measures and generates dynamics converging to t...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Training-Free Looped Transformers

We introduce training-free looped transformers, in which a lightweight inference-time wrapper loops a contiguous mid-stack block of layers of a frozen checkpoin...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Move on Muon : A Hamiltonian probability gradient flow perspective of Muon optimizer

We develop a gradient flow on the space of probability measures defined on matrix-valued parameters induced by regularized Muon, an analytically smoothed versio...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Human Decision-Making with Persuasive and Narrative LLM Explanations

Large language models (LLMs) have the potential to aid and improve human decision-making in classification tasks, not only by providing fairly accurate predicti...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Leveraging Foundation Models for Causal Generative Modeling

Causal generative modeling is essential for developing reliable and transparent AI systems capable of counterfactual reasoning. While existing approaches focus ...

#research #paper #ai #machine-learning #computer-vision
1 week ago · ai · - · -

[Paper] Strong Teacher Not Needed? On Distillation in LLM Pretraining

Knowledge distillation generally assumes a strong-to-weak relationship where stronger teachers yield better students. In this work, we examine this assumption a...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] Entrywise Error Bounds for Spectral Ranking with Semi-Random Adversaries

Bradley-Terry-Luce (BTL) model estimation is a well-established strategy to rank a collection of items given a dataset of pairwise comparisons. Although the the...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Hierarchical Concept Geometry in Language Models Emerges from Word Co-occurrence

We propose a distributional theory of how hypernymy -- the ``is-a'' relation between general and specific concepts -- is encoded geometrically in language repre...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] Agentic Proving for Program Verification

Agentic systems have recently emerged as state-of-the-art approaches for automated theorem proving in formal mathematics. To assess how far these capabilities e...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Preisach Attention: A Hysteretic Model of Sequential Memory

We introduce the Preisach Attention Layer (PAL), a novel sequence modelling architecture grounded in the classical Preisach hysteresis operator from mathematica...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Push Your Agent: Measuring and Enforcing Quantitative Goal Persistence in Long-Horizon LLM Agents

Long-horizon language agents can make many plausible local tool calls yet fail to persist until a requested count is actually complete. We study this gap as Qua...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] AI Assurance: A Comprehensive Testing Strategy for Enterprise AI Systems

Enterprise AI systems, built on large language models, retrieval pipelines and autonomous agents, introduce a class of risks that traditional software quality a...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Philosophical Dispositions as Behavioral Constraints for AI-Assisted Code Review: An Empirical Study

AI-assisted code review tools typically operate as generic 'expert reviewer' agents, producing homogeneous findings regardless of the analysis type needed. We p...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Security of LLM-generated Code: A Comparative Analysis

The majority of software developers use or are planning to use Artificial Intelligence (AI) tools in their development processes. Their top reasons include impr...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Tokenisation via Convex Relaxations

Tokenisation is an integral part of the current NLP pipeline. Current tokenisation algorithms such as BPE and Unigram are greedy algorithms -- they make locally...

#research #paper #ai #machine-learning #nlp
2 weeks ago · ai · - · -

[Paper] Integrable Elasticity via Neural Demand Potentials

We propose the Integrable Context-Dependent Demand Network (ICDN), a demand-first neural model for multiproduct retail demand. The model learns log-demand as a ...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Vector Policy Optimization: Training for Diversity Improves Test-Time Search

Language models must now generalize out of the box to novel environments and work inside inference-scaling search procedures, such as AlphaEvolve, that select r...

#research #paper #ai #machine-learning #nlp
2 weeks ago · ai · - · -

[Paper] Remember to be Curious: Episodic Context and Persistent Worlds for 3D Exploration

Exploration is a prerequisite for learning useful behaviors in sparse-reward, long-horizon tasks, particularly within 3D environments. Curiosity-driven reinforc...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] The Matching Principle: A Geometric Theory of Loss Functions for Nuisance-Robust Representation Learning

Robustness, domain adaptation, photometric and occlusion invariance, compositional generalisation, temporal robustness, alignment safety, and classical anisotro...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Finite-Particle Convergence Rates for Conservative and Non-Conservative Drifting Models

We propose and analyze a conservative drifting method for one-step generative modeling. The method replaces the original displacement-based drifting velocity by...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] MOSS: Self-Evolution through Source-Level Rewriting in Autonomous Agent Systems

Autonomous agentic systems are largely static after deployment: they do not learn from user interactions, and recurring failures persist until the next human-dr...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Gated DeltaNet-2: Decoupling Erase and Write in Linear Attention

Linear attention replaces the unbounded cache of softmax attention with a fixed-size recurrent state, reducing sequence mixing to linear time and decoding to co...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] LCGuard: Latent Communication Guard for Safe KV Sharing in Multi-Agent Systems

Large language model (LLM)-based multi-agent systems increasingly rely on intermediate communication to coordinate complex tasks. While most existing systems co...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] DeltaBox: Scaling Stateful AI Agents with Millisecond-Level Sandbox Checkpoint/Rollback

LLM-powered AI agents require high-frequency state exploration (e.g., test-time tree search and reinforcement learning), relying on rapid checkpoint and rollbac...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] FAME: Failure-Aware Mixture-of-Experts for Message-Level Log Anomaly Detection

Production systems generate millions of log lines daily, yet most anomaly detectors operate at the session or window-level, flagging groups of lines rather than...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] SDPM: Survival Diffusion Probabilistic Model for Continuous-Time Survival Analysis

Survival analysis aims to estimate a time-to-event distribution from data with censored observations. Many existing methods either impose structural assumptions...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] MambaGaze: Bidirectional Mamba with Explicit Missing Data Modeling for Cognitive Load Assessment from Eye-Gaze Tracking Data

Real-time cognitive load assessment from eye-tracking signals could potentially enable adaptive human-centered-AI such as safety-critical applications such as d...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] CogAdapt: Transferring Clinical ECG Foundation Models to Wearable Cognitive Load Assessment via Lead Adaptation

Real-time cognitive load assessment is essential for adaptive human-computer interaction but remains challenging due to limited labeled data and poor cross-subj...

#research #paper #ai #machine-learning

Newer posts

Older posts