machine learning — Page 14

Sort:

0 month ago · ai · - · -

[Paper] SoftSAE: Dynamic Top-K Selection for Adaptive Sparse Autoencoders

Sparse Autoencoders (SAEs) have become an important tool in mechanistic interpretability, helping to analyze internal representations in both Large Language Mod...

#research #paper #ai #machine-learning #computer-vision
0 month ago · ai · - · -

[Paper] DINORANKCLIP: DINOv3 Distillation and Injection for Vision-Language Pretraining with High-Order Ranking Consistency

Contrastive language-image pretraining (CLIP) suffers from two structural weaknesses: the symmetric InfoNCE loss discards the relative ordering among unmatched ...

#research #paper #ai #machine-learning #computer-vision
0 month ago · ai · - · -

[Paper] CLAD: A Clustered Label-Agnostic Federated Learning Framework for Joint Anomaly Detection and Attack Classification

The rapid expansion of the Internet of Things (IoT) and Industrial IoT (IIoT) has created a massive, heterogeneous attack surface that challenges traditional ne...

#research #paper #ai #machine-learning
0 month ago · ai · - · -

[Paper] Continuous Latent Diffusion Language Model

Large language models have achieved remarkable success under the autoregressive paradigm, yet high-quality text generation need not be tied to a fixed left-to-r...

#research #paper #ai #machine-learning #nlp #computer-vision
0 month ago · ai · - · -

[Paper] Constraint Decay: The Fragility of LLM Agents in Backend Code Generation

Large Language Model (LLM) agents demonstrate strong performance in autonomous code generation under loose specifications. However, production-grade software re...

#research #paper #ai #machine-learning
0 month ago · ai · - · -

[Paper] From Agent Loops to Deterministic Graphs: Execution Lineage for Reproducible AI-Native Work

Large language model systems are increasingly deployed as agentic workflows that interleave reasoning, tool use, memory, and iterative refinement. These systems...

#research #paper #ai #machine-learning
0 month ago · ai · - · -

[Paper] CoupleEvo: Evolving Heuristics for Coupled Optimization Problems Using Large Language Models

Many real-world optimization problems consist of multiple tightly coupled subproblems whose solutions must be coordinated to achieve high overall performance. H...

#research #paper #ai #machine-learning
0 month ago · ai · - · -

[Paper] Correct Code, Vulnerable Dependencies: A Large Scale Measurement Study of LLM-Specified Library Versions

Large language models (LLMs) are now largely involved in software development workflows, and the code they generate routinely includes third-party library (TPL)...

#research #paper #ai #machine-learning
0 month ago · ai · - · -

[Paper] Safactory: A Scalable Agent Factory for Trustworthy Autonomous Intelligence

As large models evolve from conversational assistants into autonomous agents, challenges increasingly arise from long-horizon decision making, tool use, and rea...

#research #paper #ai #machine-learning
0 month ago · ai · - · -

[Paper] Teaching LLMs Program Semantics via Symbolic Execution Traces

We introduce an evaluation framework of 500 C verification tasks across five property types (memory safety, overflow, termination, reachability, data races) bui...

#research #paper #ai #machine-learning
0 month ago · ai · - · -

[Paper] Beyond Accuracy: Policy Invariance as a Reliability Test for LLM Safety Judges

LLM-as-a-Judge pipelines have become the de facto evaluator for agent safety, yet existing benchmarks treat their verdicts as ground-truth proxies without check...

#research #paper #ai #machine-learning
0 month ago · ai · - · -

[Paper] BUILD-AND-FIND: An Effort-Aware Protocol for Evaluating Agent-Managed Codebases

Most coding-agent benchmarks ask whether generated code behaves correctly. That remains essential, but repository-level engineering is increasingly agent-manage...

#research #paper #ai #machine-learning
0 month ago · ai · - · -

[Paper] VibeServe: Can AI Agents Build Bespoke LLM Serving Systems?

For years, we have built LLM serving systems like any other critical infrastructure: a single general-purpose stack, hand-tuned over many engineer-years, meant ...

#research #paper #ai #machine-learning
0 month ago · ai · - · -

[Paper] MDN: Parallelizing Stepwise Momentum for Delta Linear Attention

Linear Attention (LA) offers a promising paradigm for scaling large language models (LLMs) to long sequences by avoiding the quadratic complexity of self-attent...

#research #paper #ai #machine-learning
0 month ago · ai · - · -

How Bayesian Networks Work — Graphs, Probability, and Inference

Core Idea A Bayesian Network represents relationships between variables using a directed graph. - Each node is a variable. - Each edge shows a dependency. - Ea...

#bayesian networks #probabilistic graphical models #directed acyclic graph #conditional probability table #inference #machine learning
0 month ago · ai · - · -

[Paper] Graph Normalization: Fast Binarizing Dynamics for Differentiable MWIS

We introduce Graph Normalization (GN), a principled dynamical system on graphs that serves as a differentiable approximation engine for the NP-hard Maximum Weig...

#research #paper #ai #machine-learning
0 month ago · ai · - · -

[Paper] Taming Outlier Tokens in Diffusion Transformers

We study outlier tokens in Diffusion Transformers (DiTs) for image generation. Prior work has shown that Vision Transformers (ViTs) can produce a small number o...

#research #paper #ai #machine-learning #computer-vision
0 month ago · ai · - · -

[Paper] Grokability in five inequalities

In this note, we report five mathematical discoveries made in collaboration with Grok, all of which have been subsequently verified by the authors. These includ...

#research #paper #ai #machine-learning
0 month ago · ai · - · -

[Paper] Almost-Orthogonality in Lp Spaces: A Case Study with Grok

Carbery proposed the following sharpened form of triangle inequality for many functions: for any pge 2 and any finite sequence (f_j)_jsubset L^p we have [ Big|s...

#research #paper #ai #machine-learning
0 month ago · ai · - · -

[Paper] LongSeeker: Elastic Context Orchestration for Long-Horizon Search Agents

Long-horizon search agents must manage a rapidly growing working context as they reason, call tools, and observe information. Naively accumulating all intermedi...

#research #paper #ai #machine-learning
0 month ago · ai · - · -

[Paper] Sharp Capacity Thresholds in Linear Associative Memory: From Winner-Take-All to Listwise Retrieval

How many key-value associations can a dtimes d linear memory store? We show that the answer depends not only on the d^2 degrees of freedom in the memory matrix,...

#research #paper #ai #machine-learning
0 month ago · ai · - · -

[Paper] Estimating the expected output of wide random MLPs more efficiently than sampling

By far the most common way to estimate an expected loss in machine learning is to draw samples, compute the loss on each one, and take the empirical average. Ho...

#research #paper #ai #machine-learning
0 month ago · ai · - · -

[Paper] Understanding In-Context Learning for Nonlinear Regression with Transformers: Attention as Featurizer

Pre-trained transformers are able to learn from examples provided as part of the prompt without any weight updates, a remarkable ability known as in-context lea...

#research #paper #ai #machine-learning
0 month ago · ai · - · -

[Paper] When Life Gives You BC, Make Q-functions: Extracting Q-values from Behavior Cloning for On-Robot Reinforcement Learning

Behavior Cloning (BC) has emerged as a highly effective paradigm for robot learning. However, BC lacks a self-guided mechanism for online improvement after demo...

#research #paper #ai #machine-learning
0 month ago · ai · - · -

[Paper] Design Conductor 2.0: An agent builds a TurboQuant inference accelerator in 80 hours

Driven by a rapid co-evolution of both harness and underlying models, LLM agents are improving at a dizzying pace. In our prior work (performed in Dec. 2025), w...

#research #paper #ai #machine-learning
0 month ago · ai · - · -

[Paper] The First Token Knows: Single-Decode Confidence for Hallucination Detection

Self-consistency detects hallucinations by generating multiple sampled answers to a question and measuring agreement, but this requires repeated decoding and ca...

#research #paper #ai #machine-learning #nlp
0 month ago · ai · - · -

[Paper] Direct From Darwin: Deriving Advanced Optimizers From Evolutionary First Principles

Evolutionary computation has long promised to deliver both high-performance optimization tools as well as rigorous scientific simulations of Darwinian evolution...

#research #paper #ai #machine-learning
0 month ago · ai · - · -

[Paper] Geometry-Aware State Space Model: A New Paradigm for Whole-Slide Image Representation

Accurate analysis of histopathological images is critical for disease diagnosis and treatment planning. Whole-slide images (WSIs), which digitize tissue specime...

#research #paper #ai #machine-learning #computer-vision
0 month ago · ai · - · -

[Paper] PSK at SemEval-2026 Task 9: Multilingual Polarization Detection Using Ensemble Gemma Models with Synthetic Data Augmentation

We present our system for SemEval-2026 Task 9: Multilingual Polarization Detection, a binary classification task spanning 22 languages. Our approach fine-tunes ...

#research #paper #ai #machine-learning #nlp
0 month ago · ai · - · -

[Paper] Aes3D: Aesthetic Assessment in 3D Gaussian Splatting

As 3D Gaussian Splatting (3DGS) gains attention in immersive media and digital content creation, assessing the aesthetics of 3D scenes becomes important in help...

#research #paper #ai #machine-learning #computer-vision
0 month ago · ai · - · -

[Paper] Superposition Is Not Necessary: A Mechanistic Interpretability Analysis of Transformer Representations for Time Series Forecasting

Transformer architectures have been widely adopted for time series forecasting, yet whether the representational mechanisms that make them powerful in NLP actua...

#research #paper #ai #machine-learning
0 month ago · ai · - · -

[Paper] What Matters in Practical Learned Image Compression

One of the major differentiators unlocked by learned codecs relative to their hard-coded traditional counterparts is their ability to be optimized directly to a...

#research #paper #ai #machine-learning #computer-vision
0 month ago · ai · - · -

[Paper] Human-AI Co-Mentorship in Project-Based Learning: A Case Study in Financial Forecasting

This paper reflects on a AI research project carried out by a team of high-school and early-undergraduate students under the mentorship of graduate researchers ...

#research #paper #ai #machine-learning
0 month ago · ai · - · -

[Paper] Low-Cost Black-Box Detection of LLM Hallucinations via Dynamical System Prediction

Large Language Models (LLMs) frequently generate plausible but non-factual content, a phenomenon known as hallucination. While existing detection methods typica...

#research #paper #ai #machine-learning
0 month ago · ai · - · -

[Paper] Transformed Latent Variable Multi-Output Gaussian Processes

Multi-Output Gaussian Processes (MOGPs) provide a principled probabilistic framework for modelling correlated outputs but face scalability bottlenecks when appl...

#research #paper #ai #machine-learning
0 month ago · ai · - · -

[Paper] Text Corpora as Concept Fields: Black-Box Hallucination and Novelty Measurement

We introduce the **Concept Field** of a text corpus: a local drift field with pointwise uncertainty, estimated in sentence-embedding space from the deltas betwe...

#research #paper #ai #machine-learning #nlp
0 month ago · ai · - · -

[Paper] Continual Knowledge Updating in LLM Systems: Learning Through Multi-Timescale Memory Dynamics

LLMs are trained once, then deployed into a world that never stops changing. External memory compensates for this, but most systems manage it explicitly rather ...

#research #paper #ai #machine-learning #nlp
0 month ago · ai · - · -

[Paper] Automatically Finding and Validating Unexpected Side-Effects of Interventions on Language Models

We present an automated, contrastive evaluation pipeline for auditing the behavioral impact of interventions on large language models. Given a base model M_1 an...

#research #paper #ai #machine-learning #nlp
0 month ago · ai · - · -

[Paper] The Impossibility Triangle of Long-Context Modeling

We identify and prove a fundamental trade-off governing long-sequence models: no model can simultaneously achieve (i) per-step computation independent of sequen...

#research #paper #ai #machine-learning #nlp
0 month ago · ai · - · -

[Paper] Piper: Efficient Large-Scale MoE Training via Resource Modeling and Pipelined Hybrid Parallelism

Frontier models increasingly adopt Mixture-of-Experts (MoE) architectures to achieve large-model performance at reduced cost. However, training MoE models on HP...

#research #paper #ai #machine-learning
0 month ago · ai · - · -

[Paper] Architectural Constraints Alignment in AI-assisted, Platform-based Service Development

AI-assisted development tools enable rapid prototyping of services but often lack awareness of architectural constraints, infrastructure dependencies, and organ...

#research #paper #ai #machine-learning
0 month ago · ai · - · -

[Paper] On the Influence of the Feature Computation Budget on Per-Instance Algorithm Selection for Black-Box Optimization

Per-instance algorithm selection (PIAS) takes advantage of complementarity between a set of algorithms by deciding which algorithm to run on a given instance. T...

#research #paper #ai #machine-learning
0 month ago · ai · - · -

[Paper] DALight-3D: A Lightweight 3D U-Net for Brain Tumor Segmentation from Multi-Modal MRI

Automatic brain tumor segmentation from multi-modal MRI remains challenging because volumetric models often incur substantial computational cost. This paper pre...

#research #paper #ai #machine-learning #computer-vision
1 month ago · ai · - · -

[Paper] CCL-D: A High-Precision Diagnostic System for Slow and Hang Anomalies in Large-Scale Model Training

As training scales grow, collective communication libraries (CCL) increasingly face anomalies arising from complex interactions among hardware, software, and en...

#research #paper #ai #machine-learning
1 month ago · ai · - · -

[Paper] One Pool, Two Caches: Adaptive HBM Partitioning for Accelerating Generative Recommender Serving

Generative Recommender (GR) inference places embedding hot caches (EMB) and KV caches in direct competition for limited GPU HBM: allocating more memory to one i...

#research #paper #ai #machine-learning

Newer posts

Older posts