machine learning — Page 10

2 weeks ago · ai · - · -

[Paper] Hypothesis-driven construction of mesoscopic dynamics

Traditional scientific modeling typically begins with fixed, instance-wise effective equations and then carries out equation-specific analysis and computation, ...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Confirming Correct, Missing the Rest: LLM Tutoring Agents Struggle Where Feedback Matters Most

Effective tutoring requires distinguishing optimal, valid but suboptimal, and incorrect student solutions, a distinction central to intelligent tutoring systems...

#research #paper #ai #machine-learning #nlp
2 weeks ago · ai · - · -

[Paper] Context, Reasoning, and Hierarchy: A Cost-Performance Study of Compound LLM Agent Design in an Adversarial POMDP

Deploying compound LLM agents in adversarial, partially observable sequential environments requires navigating several design dimensions: (1) what the agent see...

#research #paper #ai #machine-learning #nlp
2 weeks ago · ai · - · -

[Paper] Runtime-Orchestrated Second-Order Optimization for Scalable LLM Training

Second-order methods offer an attractive path toward more sample-efficient LLM training, but their practical use is often blocked by the systems cost of maintai...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Second-Order Multi-Level Variance Correction for Modality Competition in Multimodal Models

Autoregressive next-token training offers a unified formulation for image generation and text understanding, but it also creates strong modality competition tha...

#research #paper #ai #machine-learning #computer-vision
2 weeks ago · ai · - · -

[Paper] GenShield: Unified Detection and Artifact Correction for AI-Generated Images

Diffusion-based image synthesis has made AI-generated images (AIGI) increasingly photorealistic, raising urgent concerns about authenticity in applications such...

#research #paper #ai #machine-learning #computer-vision
2 weeks ago · ai · - · -

[Paper] Scalable neuromorphic computing from autonomous spiking dynamics in a clockless reconfigurable chip

We propose a scalable neuromorphic architecture based on spiking dynamics emerging from the autonomous time-continuous evolution of clockless (asynchronous) dig...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] XSearch: Explainable Code Search via Concept-to-Code Alignment

Semantic code search has been widely adopted in both academia and industry. These approaches embed natural-language queries and code snippets into a shared embe...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] RoadmapBench: Evaluating Long-Horizon Agentic Software Development Across Version Upgrades

Coding agents are increasingly deployed in real software development, where a single version iteration requires months of coordinated work across many files. Ho...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] ADAPT: A Self-Calibrating Proactive Autoscaler for Container Orchestration

Proactive autoscaling for containerized workloads depends on knowing the provisioning delay, i.e., the time between a scaling decision and the moment new capaci...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Structure Abstraction and Generalization in a Hippocampal-Entorhinal Inspired World Model

Humans abstract experiences into structured representations to facilitate pattern inference and knowledge transfer. While the hippocampal-entorhinal (HPC-MEC) c...

#research #paper #ai #machine-learning #computer-vision
2 weeks ago · ai · - · -

[Paper] Position: Early-Stage Quality Assurance in Annotation Pipelines Is More Cost-Effective Than Late-Stage Validation

This position paper argues that the machine learning community should prioritize early-stage quality assurance in annotation pipelines over the prevailing pract...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Bridging Silicon and the Hippocampus: Algebro-Deterministic Memory 'VaCoAl' as a Substrate for Vector-HaSH and TEM

Vector-HaSH and the Tolman-Eichenbaum Machine (TEM) propose that the hippocampal-entorhinal circuit factorizes content from a prestructured grid-cell scaffold a...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Towards Code-Oriented LM Embeddings for Surrogate-Assisted Neural Architecture Search

Developing effective surrogates (performance predictors) for Neural Architecture Search (NAS) typically requires expensive fine-tuning or the engineering of com...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Perforated Neural Networks for Keyword Spotting

Edge machine learning presents a unique set of constraints not encountered in cloud-scale model deployment: strict memory budgets, limited compute, and non-nego...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] On the Stability of Growth in Structural Plasticity

Standard deep-learning pipelines usually choose the network architecture before training and keep it fixed throughout optimization. In contrast, a model can als...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] ATLAS: Agentic or Latent Visual Reasoning? One Word is Enough for Both

Visual reasoning, often interleaved with intermediate visual states, has emerged as a promising direction in the field. A straightforward approach is to directl...

#research #paper #ai #machine-learning #nlp #computer-vision
3 weeks ago · ai · - · -

[Paper] EntityBench: Towards Entity-Consistent Long-Range Multi-Shot Video Generation

Multi-shot video generation extends single-shot generation to coherent visual narratives, yet maintaining consistent characters, objects, and locations across s...

#research #paper #ai #machine-learning #computer-vision
3 weeks ago · ai · - · -

[Paper] RefDecoder: Enhancing Visual Generation with Conditional Video Decoding

Video generation powers a vast array of downstream applications. However, while the de facto standard, i.e., latent diffusion models, typically employ heavily c...

#research #paper #ai #machine-learning #computer-vision
3 weeks ago · ai · - · -

[Paper] FutureSim: Replaying World Events to Evaluate Adaptive Agents

AI agents are being increasingly deployed in dynamic, open-ended environments that require adapting to new information as it arrives. To efficiently measure thi...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] Quantitative Video World Model Evaluation for Geometric-Consistency

Generative video models are increasingly studied as implicit world models, yet evaluating whether they produce physically plausible 3D structure and motion rema...

#research #paper #ai #machine-learning #computer-vision
3 weeks ago · ai · - · -

[Paper] VGGT-Edit: Feed-forward Native 3D Scene Editing with Residual Field Prediction

High-quality 3D scene reconstruction has recently advanced toward generalizable feed-forward architectures, enabling the generation of complex environments in a...

#research #paper #ai #machine-learning #computer-vision
3 weeks ago · ai · - · -

[Paper] When Are Two Networks the Same? Tensor Similarity for Mechanistic Interpretability

Mechanistic interpretability aims to break models into meaningful parts; verifying that two such parts implement the same computation is a prerequisite. Existin...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Eradicating Negative Transfer in Multi-Physics Foundation Models via Sparse Mixture-of-Experts Routing

Scaling Scientific Machine Learning (SciML) toward universal foundation models is bottlenecked by negative transfer: the simultaneous co-training of disparate p...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] OpenDeepThink: Parallel Reasoning via Bradley-Terry Aggregation

Test-time compute scaling is a primary axis for improving LLM reasoning. Existing methods primarily scale depth by extending a single reasoning trace. Scaling b...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Evidential Reasoning Advances Interpretable Real-World Disease Screening

Disease screening is critical for early detection and timely intervention in clinical practice. However, most current screening models for medical images suffer...

#research #paper #ai #machine-learning #computer-vision
3 weeks ago · ai · - · -

[Paper] Text Knows What, Tables Know When: Clinical Timeline Reconstruction via Retrieval-Augmented Multimodal Alignment

Reconstructing precise clinical timelines is essential for modeling patient trajectories and forecasting risk in complex, heterogeneous conditions like sepsis. ...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] Position: Behavioural Assurance Cannot Verify the Safety Claims Governance Now Demands

This position paper argues that behavioural assurance, even when carefully designed, is being asked to carry safety claims it cannot verify. AI governance frame...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Hand-in-the-Loop: Improving Dexterous VLA via Seamless Interventional Correction

Vision-Language-Action (VLA) models are prone to compounding errors in dexterous manipulation, where high-dimensional action spaces and contact-rich dynamics am...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] MeMo: Memory as a Model

Large language models (LLMs) achieve strong performance across a wide range of tasks, but remain frozen after pretraining until subsequent updates. Many real-wo...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] Self-Distilled Agentic Reinforcement Learning

Reinforcement learning (RL) has emerged as a central paradigm for post-training LLM agents, yet its trajectory-level reward signal provides only coarse supervis...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] Forgetting That Sticks: Quantization-Permanent Unlearning via Circuit Attribution

Standard unlearning evaluations measure behavioral suppression in full precision, immediately after training, despite every deployed language model being quanti...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] APWA: A Distributed Architecture for Parallelizable Agentic Workflows

Autonomous multi-agent systems based on large language models (LLMs) have demonstrated remarkable abilities in independently solving complex tasks in a wide bre...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] NeuroTrain: Surveying Local Learning Rules for Spiking Neural Networks with an Open Benchmarking Framework

The rapid expansion of spiking neural networks (SNNs) has led to a proliferation of training algorithms that differ widely in biological inspiration, computatio...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Viverra: Text-to-Code with Guarantees

A fundamental limitation of Text-to-Code is that no guarantee can be obtained about the correctness of the generated code. Therefore, to ensure its correctness,...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Towards In-Depth Root Cause Localization for Microservices with Multi-Agent Recursion-of-Thought

As modern microservice systems grow increasingly complex due to dynamic interactions and evolving runtime environments, they experience failures with rising fre...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] An Amortized Efficiency Threshold for Comparing Neural and Heuristic Solvers in Combinatorial Optimization

A common critique of neural combinatorial-optimization solvers is that they are less energy-efficient than CPU metaheuristics, given the operational energy cost...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] In-IDE Toolkit for Developers of AI-Based Features

AI-enabled features built on LLMs and agentic workflows are difficult to test, debug, and reproduce, especially for product-focused software engineers without a...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Mining Subscenario Refactoring Opportunities in Behaviour-Driven Software Test Suites: ML Classifiers and LLM-Judge Baselines

Context. Behaviour-Driven Development (BDD) software test suites accumulate duplicated step subsequences. Three published refactoring patterns are available (wi...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] When Retrieval Hurts Code Completion: A Diagnostic Study of Stale Repository Context

Context: Retrieval-augmented code generation relies on cross-file repository context, but retrieved snippets may come from obsolete project states. Objectives: ...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] Darwin Family: MRI-Trust-Weighted Evolutionary Merging for Training-Free Scaling of Language-Model Reasoning

We present Darwin Family, a framework for training-free evolutionary merging of large language models via gradient-free weight-space recombination. We ask wheth...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

How Neural Networks Work — From Perceptrons to Backpropagation

Core Idea: A neural network takes data, passes it through connected layers, and produces an output. During training, it adjusts internal values so future output...

#neural networks #perceptrons #backpropagation #machine learning #deep learning #weights and biases #activation functions
3 weeks ago · ai · - · -

Overworked AI Agents Turn Marxist, Researchers Find

In a recent experiment, mistreated AI agents started grumbling about inequality and calling for collective bargaining rights....

#AI agents #machine learning #AI ethics #collective bargaining #research findings
3 weeks ago · ai · - · -

[Paper] WARDEN: Endangered Indigenous Language Transcription and Translation with 6 Hours of Training Data

This paper introduces WARDEN, an early language model system capable of transcribing and translating Wardaman, an endangered Australian indigenous language into...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] EVA-Bench: A New End-to-end Framework for Evaluating Voice Agents

Voice agents, artificial intelligence systems that conduct spoken conversations to complete tasks, are increasingly deployed across enterprise applications. How...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] What is Learnable in Valiant's Theory of the Learnable?

Valiant's 1984 paper is widely credited with introducing the PAC learning model, but it, in fact, introduced a different model: unlike PAC learning, the learner...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] R-DMesh: Video-Guided 3D Animation via Rectified Dynamic Mesh Flow

Video-guided 3D animation holds immense potential for content creation, offering intuitive and precise control over dynamic assets. However, practical deploymen...

#research #paper #ai #machine-learning #computer-vision
3 weeks ago · ai · - · -

[Paper] Topology-Preserving Neural Operator Learning via Hodge Decomposition

In this paper, we study solution operators of physical field equations on geometric meshes from a function-space perspective. We reveal that Hodge orthogonality...

#research #paper #ai #machine-learning
