machine learning — Page 12

Sort:

3 weeks ago · ai · - · -

[Paper] ORCE: Order-Aware Alignment of Verbalized Confidence in Large Language Models

Large language models (LLMs) often produce answers with high certainty even when they are incorrect, making reliable confidence estimation essential for deploym...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] A Family of Quaternion-Valued Differential Evolution Algorithms for Numerical Function Optimization

The numerical optimization of continuous functions is a fundamental task in many scientific and engineering domains, ranging from mechanical design to training ...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Iterative Audit Convergence in LLM-Managed Multi-Agent Systems: A Case Study in Prompt Engineering Quality Assurance

Prompt specifications for multi-agent large language model (LLM) systems carry data contracts and integration logic across many interdependent files but are rar...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Uncertainty Quantification for LLM-based Code Generation

Prediction sets provide a theoretically grounded framework for quantifying uncertainty in machine learning models. Adapting them to structured generation tasks,...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] CIDR: A Large-Scale Industrial Source Code Dataset for Software Engineering Research

We present Curated Industrial Developer Repository (CIDR), a large-scale dataset of real-world software repositories collected through direct collaboration with...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] It's Not the Size: Harness Design Determines Operational Stability in Small Language Models

This paper experimentally analyzes how the level of harness engineering affects the operational performance of small language models (SLMs, 2-3B parameters). Th...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Property-Level Reconstructability of Agent Decisions: An Anchor-Level Pilot Across Vendor SDK Adapter Regimes

Agentic AI failures need post-hoc reconstruction: what the agent did, on whose authority, against which policy, and from what reasoning. Cross-regime feasibilit...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Scaling Laws and Tradeoffs in Recurrent Networks of Expressive Neurons

Cortical neurons are complex, multi-timescale processors wired into recurrent circuits, shaped by long evolutionary pressure under stringent biological constrai...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] The Illusion of Power Capping in LLM Decode: A Phase-Aware Energy Characterisation Across Attention Architectures

Power capping is the standard GPU energy lever in LLM serving, and it appears to work: throughput drops, power readings fall, and energy budgets are met. We sho...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Trade-offs in Decentralized Agentic AI Discovery Across the Compute Continuum

Agentic systems deployed across the compute continuum need discovery mechanisms that remain effective across cloud, edge, and intermittently connected domains. ...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Multi-Timescale Conductance Spiking Networks: A Sparse, Gradient-Trainable Framework with Rich Firing Dynamics for Enhanced Temporal Processing

Spiking neural networks (SNNs) promise low-power event-driven computation for temporally rich tasks, but commonly used neuron models often trade off gradient-ba...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Self-organized MT Direction Maps Emerge from Spatiotemporal Contrastive Optimization

The spatial and functional organization of the primate visual cortex is a fundamental problem in neuroscience. While recent computational frameworks like the To...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

AutoScout24 scales engineering with AI-powered workflows

Rebuilding engineering for speed, scale, and complexity AutoScout24 Group⁠opens in a new windowhttps://www.autoscout24.com/ is the largest pan‑European and Can...

#AutoScout24 #AI-powered workflows #engineering scaling #machine learning #automotive marketplace #software automation
3 weeks ago · ai · - · -

[Paper] Decomposing Evolutionary Mixture-of-LoRA Architectures: The Routing Lever, the Lifecycle Penalty, and a Substrate-Conditional Boundary

We decompose an evolutionary mixture-of-LoRA system on a from-scratch ~150M-parameter widened-D substrate (D=1536, V=32000; D/V approx 0.048; the 'widened-1536'...

#research #paper #ai #machine-learning #nlp
3 weeks ago · it · - · -

Google announces its first-ever discovery of a zero-day exploit made with AI

!Google's logo in front of its headquarters.https://www.engadget.com/img/gallery/google-announces-its-first-ever-discovery-of-a-zero-day-exploit-made-with-ai/in...

#AI #zero-day #exploit #cybersecurity #Google #threat intelligence #vulnerability #machine learning
3 weeks ago · ai · - · -

[Paper] ELF: Embedded Language Flows

Diffusion and flow-based models have become the de facto approaches for generating continuous data, e.g., in domains such as images and videos. Their success ha...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] Variational Inference for Lévy Process-Driven SDEs via Neural Tilting

Modelling extreme events and heavy-tailed phenomena is central to building reliable predictive systems in domains such as finance, climate science, and safety-c...

#research #paper #ai #machine-learning #computer-vision
3 weeks ago · ai · - · -

[Paper] DECO: Sparse Mixture-of-Experts with Dense-Comparable Performance on End-Side Devices

While Mixture-of-Experts (MoE) scales model capacity without proportionally increasing computation, its massive total parameter footprint creates significant st...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] Quantifying Concentration Phenomena of Mean-Field Transformers in the Low-Temperature Regime

Transformers with self-attention modules as their core components have become an integral architecture in modern large language and foundation models. In this p...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Dynamic Skill Lifecycle Management for Agentic Reinforcement Learning

Large language model agents increasingly rely on external skills to solve complex tasks, where skills act as modular units that extend their capabilities beyond...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] Optimal and Scalable MAPF via Multi-Marginal Optimal Transport and Schrödinger Bridges

We consider anonymous multi-agent path finding (MAPF) where a set of robots is tasked to travel to a set of targets on a finite, connected graph. We show that M...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Confidence-Guided Diffusion Augmentation for Enhanced Bangla Compound Character Recognition

Recognition of handwritten Bangla compound characters remains a challenging problem due to complex character structures, large intra-class variation, and limite...

#research #paper #ai #machine-learning #computer-vision
3 weeks ago · ai · - · -

[Paper] Shepherd: A Runtime Substrate Empowering Meta-Agents with a Formalized Execution Trace

We introduce Shepherd, a functional programming model that formalizes meta-agent operations on target agents as functions, with core operations mechanized in Le...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Equivariant Reinforcement Learning for Clifford Quantum Circuit Synthesis

We consider the problem of synthesizing Clifford quantum circuits for devices with all-to-all qubit connectivity. We approach this task as a reinforcement learn...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Revisiting Policy Gradients for Restricted Policy Classes: Escaping Myopic Local Optima with $k$-step Policy Gradients

This work revisits standard policy gradient methods used on restricted policy classes, which are known to get stuck in suboptimal critical points. We identify a...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Engineering Robustness into Personal Agents with the AI Workflow Store

The dominant paradigm for AI agents is an 'on-the-fly' loop in which agents synthesize plans and execute actions within seconds or minutes in response to user p...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] DataMaster: Towards Autonomous Data Engineering for Machine Learning

As model families, training recipes, and compute budgets become increasingly standardized, further gains in machine learning systems depend increasingly on data...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Beyond Red-Teaming: Formal Guarantees of LLM Guardrail Classifiers

Guardrail Classifiers defend production language models against harmful behavior, but although results seem promising in testing, they provide no formal guarant...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable Rewards

Training deep research agents, namely systems that plan, search, evaluate evidence, and synthesize long-form reports, pushes reinforcement learning beyond the r...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] Unmasking On-Policy Distillation: Where It Helps, Where It Hurts, and Why

On-policy distillation offers dense, per-token supervision for training reasoning models; however, it remains unclear under which conditions this signal is bene...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Shields to Guarantee Probabilistic Safety in MDPs

Shielding is a prominent model-based technique to ensure safety of autonomous agents. Classical shielding aims to ensure that nothing bad ever happens and comes...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] LoKA: Low-precision Kernel Applications for Recommendation Models At Scale

Recent GPU generations deliver significantly higher FLOPs using lower-precision arithmetic, such as FP8. While successfully applied to large language models (LL...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] AssayBench: An Assay-Level Virtual Cell Benchmark for LLMs and Agents

Recent advances in machine learning and large-scale biological data collections have revived the prospect of building a virtual cell, a computational model of c...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Compute Where it Counts: Self Optimizing Language Models

Efficient LLM inference research has largely focused on reducing the cost of each decoding step (e.g., using quantization, pruning, or sparse attention), typica...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] CADBench: A Multimodal Benchmark for AI-Assisted CAD Program Generation

Recovering editable CAD programs from images or 3D observations is central to AI-assisted design, but progress is difficult to measure because existing evaluati...

#research #paper #ai #machine-learning #computer-vision
3 weeks ago · ai · - · -

[Paper] BenchCAD: A Comprehensive, Industry-Standard Benchmark for Programmatic CAD

Industrial Computer-Aided Design (CAD) code generation requires models to produce executable parametric programs from visual or textual inputs. Beyond recognizi...

#research #paper #ai #machine-learning #computer-vision
3 weeks ago · ai · - · -

[Paper] ComplexMCP: Evaluation of LLM Agents in Dynamic, Interdependent, and Large-Scale Tool Sandbox

Current LLM agents are proficient at calling isolated APIs but struggle with the 'last mile' of commercial software automation. In real-world scenarios, tools a...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] An Uncertainty-Aware Resilience Micro-Agent for Causal Observability in the Computing Continuum

Grey failures in the computing continuum produce ambiguous overlapping symptoms that existing approaches fail to diagnose reliably, either due to a lack of caus...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Step Rejection Fine-Tuning: A Practical Distillation Recipe

Rejection Fine-Tuning (RFT) is a standard method for training LLM agents, where unsuccessful trajectories are discarded from the training set. In the context of...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] SoK: A Systematic Bidirectional Literature Review of AI & DLT Convergence

The integration of Artificial Intelligence (AI) with Distributed Ledger Technology (DLT) has become a growing research area, yet contributions tend to cluster a...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Causal Explanations from the Geometric Properties of ReLU Neural Networks

Neural networks have proved an effective means of learning control policies for autonomous systems, but these learned policies are difficult to understand due t...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Agentic Performance at the Edge: Insights from Benchmarking

Agentic artificial intelligence (AI) is a natural fit for Internet of Things (IoT) and edge systems, but edge deployments are often constrained to models around...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

LLMs and Text-in-Text Steganography

Comments Privacy – May 11, 2026 8:07 AM To hide text, try white text on a white background. The human eye won’t see it but the computer will. If you want to te...

#LLM #steganography #text hiding #adversarial NLP #security #machine learning
3 weeks ago · ai · - · -

[Paper] Joint sparse coding and temporal dynamics support context reconfiguration

Adaptive behavior requires the brain to transition between distinct contexts while maintaining representations of prior experience. The ability to reconfigure n...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Prospective Compression in Human Abstraction Learning

A core challenge in program synthesis is online library learning: the incremental acquisition of reusable abstractions under uncertainty about future task deman...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Parameter-Efficient Neuroevolution for Diverse LLM Generation: Quality-Diversity Optimization via Prompt Embedding Evolution

Large Language Models exhibit mode collapse, producing homogeneous outputs that fail to explore valid solution spaces. We present QD-LLM, a framework for parame...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] EvoPref: Multi-Objective Evolutionary Optimization Discovers Diverse LLM Alignments Beyond Gradient Descent

Gradient-based preference optimization methods for large language model (LLM) alignment suffer from preference collapse, converging to narrow behavioral modes w...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

Exploring AI on AWS Made Simpler 🚀

!pichttps://media2.dev.to/dynamic/image/width=256,height=,fit=scale-down,gravity=auto,format=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farti...

#AWS #Artificial Intelligence #Machine Learning #Cloud Services #AI on AWS

Newer posts

Older posts