machine learning — Page 6

2 weeks ago · ai · - · -

[Paper] Reducing Political Manipulation with Consistency Training

Large language models (LLMs) exhibit systematic political bias across a variety of sensitive contexts. We find that LLMs handle counterpart topics from opposing...

#research #paper #ai #machine-learning #nlp
2 weeks ago · ai · - · -

[Paper] Understanding Data Temporality Impact on Large Language Models Pre-training

Large language models (LLMs) are typically trained on shuffled corpora, yielding models whose knowledge is frozen at train time and whose temporal grounding rem...

#research #paper #ai #machine-learning #nlp
2 weeks ago · ai · - · -

[Paper] HarnessAPI: A Skill-First Framework for Unified Streaming APIs and MCP Tools

Every Python function deployed as an LLM tool must today exist in two forms: an HTTP endpoint for human-facing clients and CI pipelines, and an MCP tool registr...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Beyond Acoustic Emotion Recognition: Multimodal Pathos Analysis in Political Speech Using LLM-Based and Acoustic Emotion Models

We investigate whether acoustic emotion recognition models can serve as proxies for the Pathos dimension in political speech analysis, as operationalised by the...

#research #paper #ai #machine-learning #nlp
2 weeks ago · ai · - · -

[Paper] AnyMo: Geometry-Aware Setup-Agnostic Modeling of Human Motion in the Wild

As wearable and mobile devices become increasingly embedded in daily life, they offer a practical way to continuously sense human motion in the wild. But inerti...

#research #paper #ai #machine-learning #nlp #computer-vision
2 weeks ago · ai · - · -

[Paper] AMEL: Accumulated Message Effects on LLM Judgments

Large language models are routinely used as automated evaluators: to review code, moderate content, or score outputs, often with many items passing through one ...

#research #paper #ai #machine-learning #nlp
2 weeks ago · ai · - · -

[Paper] Contractual Skills: A GovernSpec Design Framework for Enterprise AI Agents

Skills are increasingly used to package agent instructions, workflows, scripts, and reference materials. In enterprise settings, however, skills often need to e...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Innovations in Cardless Artificial Intelligence Banking: A Comprehensive Framework for Cyber Secure and Fraud Mitigation using Machine Learning Algorithms

The advent of cardless artificial intelligence (AI) banking heralds a paradigm shift in the financial landscape, offering users unprecedented security and conve...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] SynAE: A Framework for Measuring the Quality of Synthetic Data for Tool-Calling Agent Evaluations

Today, tool-calling agents are commonly evaluated or tested on static datasets of execution traces, including input commands, agent responses, and associated to...

#research #paper #ai #machine-learning #nlp
2 weeks ago · ai · - · -

[Paper] Asymmetric Virtual Memory Paging for Hybrid Mamba-Transformer Inference

Hybrid language models like Jamba mix attention layers with State Space Models (SSMs), creating two memory cache types with opposite profiles: Key-Value (KV) ca...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Cross-Species RSA Reveals Conserved Early Visual Alignment but Divergent Higher-Area Rankings Across Human fMRI and Macaque Electrophysiology

Does the relationship between learning rules and brain alignment generalize across species? We extend our prior finding that untrained CNNs match backpropagatio...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] VeriScale: Adversarial Test-Suite Scaling for Verifiable Code Generation

As large language models (LLMs) are increasingly deployed for software engineering, constructing high-quality benchmarks is crucial for evaluating not just the ...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] SepsisAI Orchestrator: A Containerized and Scalable Platform for Deploying AI Models and Real-Time Monitoring in Early Sepsis Detection

Despite strong predictive results in the clinical machine learning literature, the translation of these models into bedside use remains limited by systems-level...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Temporal Coding as a Substrate for Sensorimotor Object Inference: A Spiking Reinterpretation of Thousand Brains Architecture

The Thousand Brains Theory (TBT) and its open-source Monty framework model object recognition through sensorimotor inference -- identifying objects by actively ...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Secure and Parallel Determinant Computation for Large-Scale Matrices in Edge Environments

The advent of edge computing has enabled resource-constrained clients to delegate intensive computational tasks to distributed edge servers, especially within I...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Engineering Hybrid Physics-Informed Neural Networks for Next-Generation Electricity Systems: A State-of-the-Art Review

The integration of machine learning with domain-specific physics is transforming the design, monitoring, and control of electricity systems, where data scarcity...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Dropout Universality: Scaling Laws and Optimal Scheduling at the Edge-of-Chaos

We develop a mean-field theory of dropout as a perturbation of critical signal propagation at the edge of chaos. Dropout shifts the perfect-alignment fixed poin...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Variance Reduction for Expectations with Diffusion Teachers

Pretrained diffusion models serve as frozen teachers feeding downstream pipelines such as text-to-3D, single-step distillation, and data attribution. The teache...

#research #paper #ai #machine-learning #computer-vision
2 weeks ago · ai · - · -

[Paper] Equilibrium Reasoners: Learning Attractors Enables Scalable Reasoning

Scaling test-time compute by iteratively updating a latent state has emerged as a powerful paradigm for reasoning. Yet the internal mechanisms that enable these...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Quantifying Hyperparameter Transfer and the Importance of Embedding Layer Learning Rate

Hyperparameter transfer allows extrapolating optimal optimization hyperparameters from small to large scales, making it critical for training large language mod...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] EvoStruct: Bridging Evolutionary and Structural Priors for Antibody CDR Design via Protein Language Model Adaptation

Equivariant graph neural network (GNN) methods for antibody complementarity-determining region (CDR) design achieve the highest sequence recovery but suffer fro...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Velocityformer: Broken-Symmetry-Matched Equivariant Graph Transformers for Cosmological Velocity Reconstruction

Precise measurement of the kinematic Sunyaev-Zel'dovich (kSZ) effect - a probe of the large-scale distribution of baryonic matter, a key observable for cosmolog...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] AiraXiv: An AI-Driven Open-Access Platform for Human and AI Scientists

Recent advances in artificial intelligence (AI) have accelerated the growth of both human-authored and AI-generated research outputs, placing increasing strain ...

#research #paper #ai #machine-learning #nlp
2 weeks ago · ai · - · -

[Paper] DeepWeb-Bench: A Deep Research Benchmark Demanding Massive Cross-Source Evidence and Long-Horizon Derivation

Deep research, in which an agent searches the open web, collects evidence, and derives an answer through extended reasoning, is a prominent use case for frontie...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] WikiVQABench: A Knowledge-Grounded Visual Question Answering Benchmark from Wikipedia and Wikidata

Visual Question Answering (VQA) benchmarks have largely emphasized perception-based tasks that can be solved from visual content alone. In contrast, many real-w...

#research #paper #ai #machine-learning #computer-vision
2 weeks ago · ai · - · -

[Paper] Is Fixing Schema Graphs Necessary? Full-Resolution Graph Structure Learning for Relational Deep Learning

Relational prediction tasks are fundamental in many real-world applications, where data are naturally stored in relational databases (RDBs). Relational Deep Lea...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Agent JIT Compilation for Latency-Optimizing Web Agent Planning and Scheduling

Computer-use agents (CUA) automate tasks specified with natural language such as 'order the cheapest item from Taco Bell' by generating sequences of calls to to...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] You Only Need Minimal RLVR Training: Extrapolating LLMs via Rank-1 Trajectories

Reinforcement learning with verifiable rewards (RLVR) has become a dominant paradigm for improving reasoning in large language models (LLMs), yet the underlying...

#research #paper #ai #machine-learning #nlp
2 weeks ago · ai · - · -

[Paper] DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards

Reinforcement learning from verifiable rewards (RLVR) has emerged as a central technique for improving the reasoning capabilities of large language models. Desp...

#research #paper #ai #machine-learning #nlp
2 weeks ago · ai · - · -

[Paper] Mem-π: Adaptive Memory through Learning When and What to Generate

We present Mem-π, a framework for adaptive memory in large language model (LLM) agents, where useful guidance is generated on demand rather than retrieved from ...

#research #paper #ai #machine-learning #nlp
2 weeks ago · ai · - · -

[Paper] HITL-D: Human In The Loop Diffusion Assisted Shared Control

Autonomous manipulation systems have achieved remarkable capabilities, yet the integration of human expertise with diffusion-based policies in shared control re...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Mind the Sim-to-Real Gap & Think Like a Scientist

Suppose a planner has a pre-trained simulator of a sequential decision problem and the option to run real experiments in the field. The simulator is cheap to qu...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Quality and Security Signals in AI-Generated Python Refactoring Pull Requests

As AI agents increasingly contribute to code development and maintenance, there is still limited empirical evidence on the quality and risk characteristics of t...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Approximation Theory for Neural Networks: Old and New

Universal approximation theorems provide a mathematical explanation for the expressive power of neural networks. They assert that, under mild conditions on the ...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] TempGlitch: Evaluating Vision-Language Models for Temporal Glitch Detection in Gameplay Videos

Vision-language models (VLMs) are increasingly being explored for video game quality assurance, especially gameplay glitch detection. Most existing evaluations,...

#research #paper #ai #machine-learning #computer-vision
2 weeks ago · ai · - · -

[Paper] PALS: Power-Aware LLM Serving for Mixture-of-Experts Models

Large language model (LLM) inference has become a dominant workload in modern data centers, driving significant GPU utilization and energy consumption. While pr...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Stdlib or Third-Party? Empirical Performance and Correctness of LLM-Assisted Zero-Dependency Python Libraries

Third-party Python libraries introduce dependency management overhead, supply chain risk, and deployment friction in constrained environments. A natural questio...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] SpecBench: Measuring Reward Hacking in Long-Horizon Coding Agents

As long-horizon coding agents produce more code than any developer can review, oversight collapses onto a single surface: the automated test suite. Reward hacki...

#research #paper #ai #machine-learning #nlp
2 weeks ago · ai · - · -

[Paper] How to Build Marcus's Algebraic Mind: Algebro-Deterministic Substrate over Galois Fields

In The Algebraic Mind, Gary Marcus identified three components essential for any adequate cognitive architecture: operations over variables, recursively structu...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Insights Generator: Systematic Corpus-Level Trace Diagnostics for LLM Agents

Diagnosing failures in LLM agents remains largely manual. Practitioners inspect a small subset of execution traces, form ad-hoc hypotheses, and iterate. This pr...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Frontier: Towards Comprehensive and Accurate LLM Inference Simulation

Modern LLM serving is no longer homogeneous or monolithic. Production systems now combine disaggregated execution, complex parallelism, runtime optimizations, a...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Automated Byzantine-Resilient Clustered Decentralized Federated Learning for Battery Intelligence in Connected EVs

Federated learning (FL) has emerged as a promising paradigm for managing electric vehicle (EV) battery data in intelligent transportation systems (ITS), enablin...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Genetic Programming with Transformer-Based Mutation for Approximate Circuit Design

A recent trend is to leverage machine learning models to improve the evolutionary design and optimization process. We propose a novel transformer-based mutation...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Diagnosing Overhead in Dispatch Operations: Cross-architecture Observatory

AlltoAll dispatch is the dominant bottleneck of MoE expert parallelism, and the interconnect community has responded with four families of mitigations: predicti...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] LOSCAR-SGD: Local SGD with Communication-Computation Overlap and Delay-Corrected Sparse Model Averaging

Communication is a major bottleneck in distributed learning, especially in large-scale settings and in federated learning environments with slow links. Three st...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

Building AI models that understand chemical principles

Among all of the possible chemical compounds, it’s estimated that between 10²⁰ and 10⁶⁰ may hold potential as small‑molecule drugs. Evaluating each of those com...

#AI #drug discovery #computational chemistry #machine learning #molecular design #MIT #chemical engineering
2 weeks ago · ai · - · -

[Paper] Weight Decay Regimes in Grokking Transformers: Cheap Online Diagnostics

Transformers trained on modular arithmetic exhibit sharp transitions between memorization, generalization, and collapse. We show that weight decay acts as a sca...

#research #paper #ai #machine-learning
