machine learning — Page 13

Sort:

3 weeks ago · ai · - · -

RAG Is Blind to Time — I Built a Temporal Layer to Fix It in Production

Overview Three weeks into testing, a learner told me my AI tutor gave her the wrong answer. Not obviously wrong — just outdated enough to mislead. That was the...

#retrieval-augmented generation #RAG #temporal layer #time-aware AI #knowledge base freshness #AI tutoring #LLM #machine learning #production systems
3 weeks ago · ai · - · -

[Paper] Normalizing Trajectory Models

Diffusion-based models decompose sampling into many small Gaussian denoising steps -- an assumption that breaks down when generation is compressed to a few coar...

#research #paper #ai #machine-learning #computer-vision
3 weeks ago · ai · - · -

[Paper] Zero-Shot Imagined Speech Decoding via Imagined-to-Listened MEG Mapping

Decoding imagined speech from non-invasive brain recordings is challenging because imagined datasets are scarce and difficult to align temporally across subject...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] GRAPHLCP: Structure-Aware Localized Conformal Prediction on Graphs

Conformal prediction (CP) provides a distribution-free approach to uncertainty quantification with finite-sample guarantees. However, applying CP to graph neura...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] EmambaIR: Efficient Visual State Space Model for Event-guided Image Reconstruction

Recent event-based image reconstruction methods predominantly rely on Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs) to process complementa...

#research #paper #ai #machine-learning #computer-vision
3 weeks ago · ai · - · -

[Paper] A Note on Non-Negative $L_1$-Approximating Polynomials

L_1-Approximating polynomials, i.e., polynomials that approximate indicator functions in L_1-norm under certain distributions, are widely used in computational ...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] VecCISC: Improving Confidence-Informed Self-Consistency with Reasoning Trace Clustering and Candidate Answer Selection

A standard technique for scaling inference-time reasoning is Self-Consistency, whereby multiple candidate answers are sampled from an LLM and the most common an...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Flow-OPD: On-Policy Distillation for Flow Matching Models

Existing Flow Matching (FM) text-to-image models suffer from two critical bottlenecks under multi-task alignment: the reward sparsity induced by scalar-valued r...

#research #paper #ai #machine-learning #computer-vision
3 weeks ago · ai · - · -

[Paper] Rubric-Grounded RL: Structured Judge Rewards for Generalizable Reasoning

We argue that decomposing reward into weighted, verifiable criteria and using an LLM judge to score them provides a partial-credit optimization signal: instead ...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] The Memory Curse: How Expanded Recall Erodes Cooperative Intent in LLM Agents

Context window expansion is often treated as a straightforward capability upgrade for LLMs, but we find it systematically fails in multi-agent social dilemmas. ...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] CA-SQL: Complexity-Aware Inference Time Reasoning for Text-to-SQL via Exploration and Compute Budget Allocation

While recent advancements in inference-time learning have improved LLM reasoning on Text-to-SQL tasks, current solutions still struggle to perform well on the m...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] Reinforcement Learning for Exponential Utility: Algorithms and Convergence in Discounted MDPs

Reinforcement learning (RL) for exponential-utility optimization in discounted Markov decision processes (MDPs) lacks principled value-based algorithms. We addr...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Fast Byte Latent Transformer

Recent byte-level language models (LMs) match the performance of token-level models without relying on subword vocabularies, yet their utility is limited by slo...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] SCOPE: Structured Decomposition and Conditional Skill Orchestration for Complex Image Generation

While text-to-image models have made strong progress in visual fidelity, faithfully realizing complex visual intents remains challenging because many requiremen...

#research #paper #ai #machine-learning #computer-vision
3 weeks ago · ai · - · -

[Paper] Beyond Pairs: Your Language Model is Secretly Optimizing a Preference Graph

Direct Preference Optimization (DPO) aligns language models using pairwise preference comparisons, offering a simple and effective alternative to Reinforcement ...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Don't Get Your Kroneckers in a Twist: Gaussian Processes on High-Dimensional Incomplete Grids

We introduce CUTS-GPR, a new method for performing numerically exact Gaussian process regression (GPR) in high-dimensional settings. The key component of CUTS-G...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] PropSplat: Map-Free RF Field Reconstruction via 3D Gaussian Propagation Splatting

Building a site-specific propagation model typically requires either ray-tracing over detailed 3D maps or dense measurement campaigns. Both approaches are expen...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Semiparametric Efficient Test for Interpretable Distributional Treatment Effects

Distributional treatment effects can be invisible to means: a treatment may preserve average outcomes while changing tails, modes, dispersion, or rare-event pro...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] MPD$^2$-Router: Mask-aware Multi-expert Prior-regularized Dual-head Deferral Router in Glaucoma Screening and Diagnosis

Learning-to-defer (L2D) can make glaucoma screening safer by routing difficult/uncertain cases to humans, yet standard formulations overlook expert availability...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Globally Optimal Training of Spiking Neural Networks via Parameter Reconstruction

Spiking Neural Networks (SNNs) have been proposed as biologically plausible and energy-efficient alternatives to conventional Artificial Neural Networks (ANNs)....

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Position: Mechanistic Interpretability Must Disclose Identification Assumptions for Causal Claims

Mechanistic interpretability papers increasingly use causal vocabulary: circuits, mediators, causal abstraction, monosemanticity. Such claims require explicit i...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

Learning AI Out Loud — Full Series Index

!Cover image for Learning AI Out Loud — Full Series Indexhttps://media2.dev.to/dynamic/image/width=1000,height=420,fit=cover,gravity=auto,format=auto/https%3A%2...

#AI tutorials #machine learning #learning series #dev.to #educational resources
3 weeks ago · ai · - · -

AI Ascent 2026

AI Ascent 2026 AI Ascent IV was our biggest and best year yet. By Team Sequoia Published May 8, 2026 On April 20, we hosted our fourth annual AI Ascent in San...

#AI conference #AI agents #founders #Sequoia #2026 #machine learning #generative AI #venture capital
3 weeks ago · ai · - · -

[Paper] Tool Calling is Linearly Readable and Steerable in Language Models

When a tool-calling agent picks the wrong tool, the failure is invisible until execution: the email gets sent, the meeting gets missed. Probing 12 instruction-t...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] Dooly: Configuration-Agnostic, Redundancy-Aware Profiling for LLM Inference Simulation

Selecting the optimal LLM inference configuration requires evaluation across hardware, serving engines, attention backends, and model architectures, since no si...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] FLAM: Evaluating Model Performance with Aggregatable Measures in Federated Learning

Performance evaluation is essential for assessing the quality of machine learning (ML) models and guiding deployment decisions. In federated learning (FL), asse...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] mathsf{VISTA}: Decentralized Machine Learning in Adversary Dominated Environments

Decentralized machine learning often relies on outsourcing computations, such as gradient evaluations, to untrusted worker nodes. Existing robust aggregation me...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Discovering Ordinary Differential Equations with LLM-Based Qualitative and Quantitative Evaluation

Discovering governing differential equations from observational data is a fundamental challenge in scientific machine learning. Existing symbolic regression app...

#research #paper #ai #machine-learning
0 month ago · ai · - · -

[Paper] Same Brain, Different Prediction: How Preprocessing Choices Undermine EEG Decoding Reliability

Electroencephalography (EEG) is a cornerstone of brain-computer interfaces and clinical neuroscience, yet deep learning models are typically trained and evaluat...

#research #paper #ai #machine-learning
0 month ago · ai · - · -

[Paper] Every Feedforward Neural Network Definable in an o-Minimal Structure Has Finite Sample Complexity

We show that, in a precise sense, a broad class of feedforward neural networks learn (have finite sample complexity) in the PAC model: every fixed finite feedfo...

#research #paper #ai #machine-learning
0 month ago · ai · - · -

[Paper] A Unified Measure-Theoretic View of Diffusion, Score-Based, and Flow Matching Generative Models

We survey continuous-time generative modeling methods based on transporting a simple reference distribution to a data distribution via stochastic or determinist...

#research #paper #ai #machine-learning #computer-vision
0 month ago · ai · - · -

[Paper] ActCam: Zero-Shot Joint Camera and 3D Motion Control for Video Generation

For artistic applications, video generation requires fine-grained control over both performance and cinematography, i.e., the actor's motion and the camera traj...

#research #paper #ai #machine-learning #computer-vision
0 month ago · ai · - · -

[Paper] UniPool: A Globally Shared Expert Pool for Mixture-of-Experts

Modern Mixture-of-Experts (MoE) architectures allocate expert capacity through a rigid per-layer rule: each transformer layer owns a separate expert set. This c...

#research #paper #ai #machine-learning
0 month ago · ai · - · -

[Paper] BAMI: Training-Free Bias Mitigation in GUI Grounding

GUI grounding is a critical capability for enabling GUI agents to execute tasks such as clicking and dragging. However, in complex scenarios like the ScreenSpot...

#research #paper #ai #machine-learning #computer-vision
0 month ago · ai · - · -

[Paper] Verifier-Backed Hard Problem Generation for Mathematical Reasoning

Large Language Models (LLMs) demonstrate strong capabilities for solving scientific and mathematical problems, yet they struggle to produce valid, challenging, ...

#research #paper #ai #machine-learning #nlp
0 month ago · ai · - · -

[Paper] Why Global LLM Leaderboards Are Misleading: Small Portfolios for Heterogeneous Supervised ML

Ranking LLMs via pairwise human feedback underpins current leaderboards for open-ended tasks, such as creative writing and problem-solving. We analyze ~89K comp...

#research #paper #ai #machine-learning
0 month ago · ai · - · -

[Paper] Optimizer-Model Consistency: Full Finetuning with the Same Optimizer as Pretraining Forgets Less

Optimizers play an important role in both pretraining and finetuning stages when training large language models (LLMs). In this paper, we present an observation...

#research #paper #ai #machine-learning
0 month ago · ai · - · -

[Paper] When No Benchmark Exists: Validating Comparative LLM Safety Scoring Without Ground-Truth Labels

Many deployments must compare candidate language models for safety before a labeled benchmark exists for the relevant language, sector, or regulatory regime. We...

#research #paper #ai #machine-learning #nlp
0 month ago · ai · - · -

[Paper] AI Co-Mathematician: Accelerating Mathematicians with Agentic AI

We introduce the AI co-mathematician, a workbench for mathematicians to interactively leverage AI agents to pursue open-ended research. The AI co-mathematician ...

#research #paper #ai #machine-learning
0 month ago · ai · - · -

[Paper] Superintelligent Retrieval Agent: The Next Frontier of Information Retrieval

Retrieval-augmented agents are increasingly the interface to large organizational knowledge bases, yet most still treat retrieval as a black box: they issue exp...

#research #paper #ai #machine-learning
0 month ago · ai · - · -

[Paper] Inductive Venn-Abers and related regressors

Venn-Abers predictors are probabilistic predictors that enjoy appealing properties of validity, but their major limitation is that they are applicable only to t...

#research #paper #ai #machine-learning
0 month ago · ai · - · -

[Paper] Edge-specific signal propagation on mature chromophore-region 3D mechanism graphs for fluorescent protein quantum-yield prediction

Fluorescent protein quantum yield (QY) is governed by the mature chromophore and its three-dimensional microenvironment rather than sequence identity alone. Pro...

#research #paper #ai #machine-learning
0 month ago · ai · - · -

[Paper] Are We Making Progress in Multimodal Domain Generalization? A Comprehensive Benchmark Study

Despite the growing popularity of Multimodal Domain Generalization (MMDG) for enhancing model robustness, it remains unclear whether reported performance gains ...

#research #paper #ai #machine-learning #computer-vision
0 month ago · ai · - · -

[Paper] StraTA: Incentivizing Agentic Reinforcement Learning with Strategic Trajectory Abstraction

Large language models (LLMs) are increasingly used as interactive agents, but optimizing them for long-horizon decision making remains difficult because current...

#research #paper #ai #machine-learning #nlp
0 month ago · ai · - · -

[Paper] GlazyBench: A Benchmark for Ceramic Glaze Property Prediction and Image Generation

Developing ceramic glazes is a costly, time-consuming process of trial and error due to complex chemistry, placing a significant burden on independent artists. ...

#research #paper #ai #machine-learning #computer-vision
0 month ago · ai · - · -

[Paper] Recursive Agent Optimization

We introduce Recursive Agent Optimization (RAO), a reinforcement learning approach for training recursive agents: agents that can spawn and delegate sub-tasks t...

#research #paper #ai #machine-learning #nlp
0 month ago · ai · - · -

[Paper] Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key

Reinforcement learning (RL) has been applied to improve large language model (LLM) reasoning, yet the systematic study of how training scales with task difficul...

#research #paper #ai #machine-learning #nlp
0 month ago · ai · - · -

[Paper] MASPO: Joint Prompt Optimization for LLM-based Multi-Agent Systems

Large language model (LLM)-based Multi-agent systems (MAS) have shown promise in tackling complex collaborative tasks, where agents are typically orchestrated v...

#research #paper #ai #machine-learning #nlp

Newer posts

Older posts