machine learning — Page 3

1 week ago · ai · - · -

[Paper] On Language Generation in the Limit with Bounded Memory

We study language generation in the limit under bounded memory. In this task, a learner observes examples from an unknown target language one at a time and must...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] In-Context Reward Adaptation for Robust Preference Modeling

Reinforcement Learning from Human Feedback (RLHF) typically relies on static reward models to align Large Language Models with human preferences. However, human...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Resolution Diagnostics for Paired LLM Evaluation

Across two public LLM leaderboards, many displayed pairwise rankings do not meet a conventional paired-test resolution target under the actual paired evaluation...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] MedCase-Structured: A Text-to-FHIR Dataset for Benchmarking Diagnostic Reasoning in Clinically Realistic EHR Settings

Large language models (LLMs) show promise for clinical reasoning and decision support, but evaluation in realistic, electronic health record-congruent settings ...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] Automating Low-Risk Code Review at Meta: RADAR, Risk Calibration, and Review Efficiency

AI-assisted coding tools have altered software production. At Meta, significant lines of code per human-landed diff grew by 105.9% year over year and per-develo...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Evolving Features vs Evolving Entire Trees with GP for Interpretable Survival Analysis

Survival analysis concerns the task of predicting the time until an event occurs. Often used in the medical field, survival analysis deals with incomplete (i.e....

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Q-ANCHOR: Federated Quantum Learning with ZNE-guided Correction

Quantum Federated Learning (QFL) offers a promising framework to train quantum models across distributed clients while keeping data strictly local. Due to its s...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Projectional Decoding: Towards Semantic-Aware LLM Generation

Large language models (LLMs) are increasingly used to generate software artifacts across many software engineering (SE) tasks, yet ensuring the semantic validit...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] REPOT: Recoverable Program-of-Thought via Checkpoint Repair

One-shot Program-of-Thought (PoT) emits a Python program that prints a primitive-action plan; a single invalid action silently invalidates the trajectory. We in...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] Selection Hyper-heuristics Can Automatically Adjust the Learning Period to Optimally Solve Pseudo-Boolean Problems

The Random Gradient hyper-heuristic was recently shown to be able to learn the optimal neighbourhood size when optimizing the LeadingOnes benchmark via the Rand...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Agora: Toward Autonomous Bug Detection in Production-Level Consensus Protocols with LLM Agents

Consensus protocols form the backbone of distributed systems and blockchains, where implementation bugs can cause data corruption and financial losses. While LL...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Inferring Code Correctness from Specification

Large language models (LLMs) have become integral to modern software development, enabling automated code generation at scale. However, validating the correctne...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] AMDP: Asynchronous Multi-Directional Pipeline Parallelism for Large-Scale Models Training

Pipeline parallelism is essential for large-scale model training, but existing asynchronous approaches often degrade convergence due to parameter mismatch betwe...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Evolutionary Rule Extraction from Corporate Default Prediction Models

Small and medium-sized enterprises (SMEs) represent the majority of firms in most economies and often face financial constraints and higher vulnerability to fin...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

Day 9 - Sparse embedding continued - RAG

Inverse Document Frequency (IDF): It measures how infrequently a word occurs across the input documents. A rare word receives a high IDF score, while a common w...

#sparse embeddings #TF-IDF #inverse document frequency #retrieval-augmented generation #embeddings #natural language processing #machine learning
1 week ago · ai · - · -

[Paper] Compute Allocation in Evolutionary Search: From Depth-Breadth to Multi-Armed Bandits

LLM-guided evolutionary search (Evolve systems) has reached state-of-the-art results on mathematical and combinatorial tasks, yet most existing systems report o...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

RamAIn (YC W26) Is Hiring

Founders: RamAIn was founded by Shourya Vir Jain (CEO) and Vansh Ramani (CTO), who met at IIT Delhi and dropped out to build AI‑native automation for enterprise wor...

#AI agents #enterprise automation #workflow automation #YC #machine learning #legacy systems #vector search #AI-native automation
1 week ago · ai · - · -

[Paper] Real-rootedness of the Poincaré polynomials of $\overline{\mathcal M}_{0,n}$: an AI-assisted proof

We prove real-rootedness for the Poincaré polynomial \[ P_n(t)=\sum_{i=0}^{n-3} \dim H^{2i}(\overline{\mathcal M}_{0,n};\mathbb{Q})t^i \] of the Deligne--Mumford modul...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

Tensors Explained Part 1: How AI Systems Represent Data

Introduction: In this article, we explore the concept of tensors in the context of machine learning. From the perspective of someone building a neural network, t...

#tensors #machine learning #neural networks #data representation #deep learning
1 week ago · ai · - · -

[Paper] PEFT-Arena: Understanding Parameter-Efficient Finetuning from a Stability-Plasticity Perspective

Parameter-efficient finetuning (PEFT) has become the standard approach for adapting large language models, yet evaluations largely emphasize downstream accuracy...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] Beyond Binary: Sim-to-Real Dexterous Manipulation with Physics-Grounded Contact Representation

A primary bottleneck in contact-rich manipulation is the difficulty of collecting real-world data. Sim-to-real reinforcement learning offers a scalable alternat...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Affective Music Recommendation: A Rollout-Based World Model for Offline Preference Optimization

Functional music applications, from consumer focus and sleep aids to clinical interventions, share a distinctive recommendation problem: success is defined by t...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] AREA: Attribute Extraction and Aggregation for CLIP-Based Class-Incremental Learning

Class-Incremental Learning (CIL) is important in building real-world learning systems. In CLIP-based CIL, the model performs classification by comparing similar...

#research #paper #ai #machine-learning #computer-vision
1 week ago · ai · - · -

[Paper] Calibrating Conservatism for Scalable Oversight

Agentic AI systems capable of autonomous planning and extended environmental interaction pose a fundamental control problem: how can humans maintain meaningful ...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] OmniVerifier-M1: Multimodal Meta-Verifier with Explicit Structured Recalibration

Visual outcomes are increasingly central to multimodal large language models, making reliable and fine-grained verification essential for scaling generalist fou...

#research #paper #ai #machine-learning #nlp #computer-vision
1 week ago · ai · - · -

[Paper] Ω-QVLA: Robust Quantization for Vision-Language-Action Models via Composite Rotation and Per-step Scaling

Vision-Language-Action (VLA) models unify perception, reasoning, and control within a single policy, yet their multi-billion-parameter backbones and diffusion-b...

#research #paper #ai #machine-learning #computer-vision
1 week ago · ai · - · -

[Paper] CaMBRAIN: Real-time, Continuous EEG Inference with Causal State Space Models

Electroencephalography (EEG) is a critical, non-invasive method to monitor electrical brain activity. EEGs can span anywhere from a couple seconds to multiple h...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Skill-Conditioned Gated Self-Distillation for LLM Reasoning

On-policy self-distillation (SD) improves LLM reasoning by using teacher-side privileged information (PI) to turn sparse verifier outcomes into dense token-leve...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] Do Agents Need Semantic Metadata? A Comparative Study in Agentic Data Retrieval

In the era of autonomous agents, machine-actionable data is critical for data-driven workflows. For more than a decade, semantic metadata like schema.org has an...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Bias Leaves a Gradient Trail: Label-Free Bias Identification via Gradient Probes on Concept Decompositions

Vision classifiers can exploit spurious correlations, achieving high in-distribution accuracy yet failing under distribution shift. Existing approaches to bias ...

#research #paper #ai #machine-learning #computer-vision
1 week ago · ai · - · -

[Paper] Learn from Weaknesses: Automated Domain Specialization for Small Computer-Use Agents

Computer-use agents (CUAs) have recently made substantial progress, but deploying a separate large expert for each software domain remains expensive. Small open...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] Rethinking Memory as Continuously Evolving Connectivity

Existing memory-augmented LLM agents often treat memory as a static repository with pre-defined representations and fixed retrieval pipelines, which is brittle ...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] SwarmHarness: Skill-Based Task Routing via Decentralized Incentive-Aligned AI Agent Networks

Vast quantities of compute (GPU cycles on personal workstations, idle inference servers, and edge devices between jobs) go unused because no incentive-aligned p...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] CubePart: An Open-Vocabulary Part-Controllable 3D Generator

Interactive 3D assets used in games and simulation are typically decomposed into specific semantic parts to support animation, physics, and scripted behaviors, ...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Preference-Shaped Expected Hypervolume and R2 Improvement: Exact Computation and Monotonicity

This paper studies preference-shaped expected improvement criteria for Bayesian multiobjective optimization. We consider two indicator families which are often ...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] BIRDNet: Mining and Encoding Boolean Implication Knowledge Graphs as Interpretable Deep Neural Networks

Tabular data in knowledge-rich domains often carries a latent prior in the form of Boolean implication relationships (BIRs) between pairs of features. We mine s...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] A Fresh Look at Lamarckian Evolution and the Baldwin Effect

Baldwinian and Lamarckian evolution have existed for a long time in evolutionary algorithms (EAs) without ever dominating the academic literature or practical a...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Do LLMs Favor Their Providers? Measuring Vertical Integration Bias in Code Generation

Large Language Models (LLMs) have become an integral part of software development, especially with the advent of agentic capabilities. Yet, many frontier LLMs a...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Efficient and Scalable Provenance Tracking for LLM-Generated Code Snippets

Large language models (LLMs) for code completion and generation are increasingly used in software development, yet they may reproduce training examples verbatim...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] CLANE: Continual Learning of Actions on Neuromorphic Hardware from Event Cameras

Recognizing and continuously learning novel human actions without forgetting prior classes is a requirement for emerging AR/VR and robotics applications. For th...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] From paper to benchmark: agentic, framework-based reproduction of under-specified methods in machine health intelligence

Industrial Prognostics and Health Management (PHM) provides a representative case study for a broader challenge in applied machine learning: translating publish...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Improving Evaluation of Recombination-based Cartesian Genetic Programming

Cartesian Genetic Programming has traditionally been using mutation as its main and often sole genetic operator to drive evolutionary search. Despite advancemen...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Multi-Agent LLM-based Metamorphic Testing for REST APIs

As REST APIs become an increasingly significant part of software systems, their validation is becoming more critical. Hence, testing and uncovering underlying i...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Learning to Assess the Reliability of Number-of-Runs Estimation in Stochastic Optimization

In large-scale benchmarking of stochastic optimization algorithms, the key challenge is no longer whether repeated runs are needed for reliability, but how to d...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] How Far Can Disaggregation Go? A Design-Space Exploration of Attention-FFN Disaggregation for Efficient MoE LLM Serving

Modern large language model (LLM) inference has progressively disaggregated to keep pace with growing model sizes and tight TTFT and TPOT service-level objectiv...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] GUI Agents for Continual Game Generation

Generating a game is not the same as making one that can be played. Despite advances in code generation, existing approaches treat game generation as one-shot t...

#research #paper #ai #machine-learning #computer-vision
1 week ago · ai · - · -

Karpathy Joined Anthropic to Train Claude Using Claude

#andrej karpathy #anthropic #claude #large language models #LLM pretraining #AI research #self‑training #machine learning
