machine learning — Page 9

2 weeks ago · ai · - · -

[Paper] Actionable World Representation

Inspired by the emergent behaviors in large language models that generalized human intelligence, the research community is pursuing similar emergent capabilitie...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Vision-OPD: Learning to See Fine Details for Multimodal LLMs via On-Policy Self-Distillation

Multimodal Large Language Models (MLLMs) still struggle with fine-grained visual understanding, where answers often depend on small but decisive evidence in the...

#research #paper #ai #machine-learning #nlp #computer-vision
2 weeks ago · ai · - · -

[Paper] What Does the AI Doctor Value? Auditing Pluralism in the Clinical Ethics of Language Models

Medicine is inherently pluralistic. Principles such as autonomy, beneficence, nonmaleficence, and justice routinely conflict, and such ethical dilemmas often sh...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] PIXLRelight: Controllable Relighting via Intrinsic Conditioning

We present PIXLRelight, a feed-forward approach for physically controllable single-image relighting. Existing methods either provide limited lighting control (e...

#research #paper #ai #machine-learning #computer-vision
2 weeks ago · ai · - · -

[Paper] Predictable Confabulations: Factual Recall by LLMs Scales with Model Size and Topic Frequency

While scaling laws govern aggregate large language model performance, no scaling law has linked factual recall to both model size and training-data composition....

#research #paper #ai #machine-learning #nlp
2 weeks ago · ai · - · -

[Paper] DexHoldem: Playing Texas Hold'em with Dexterous Embodied System

Evaluating embodied systems on real dexterous hardware requires more than isolated primitive skills: an agent must perceive a changing tabletop scene, choose a ...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] General Preference Reinforcement Learning

Post-training has split large language model (LLM) alignment into two largely disconnected tracks. Online reinforcement learning (RL) with verifiable rewards dr...

#research #paper #ai #machine-learning #nlp
2 weeks ago · ai · - · -

[Paper] Semantic Generative Tuning for Unified Multimodal Models

Unified multimodal models (UMMs) strive to consolidate visual understanding and visual generation within a single architecture. However, prevailing training par...

#research #paper #ai #machine-learning #computer-vision
2 weeks ago · ai · - · -

[Paper] Learned Memory Attenuation in Sage-Husa Kalman Filters for Robust UAV State Estimation

Unmanned Aerial Vehicles in dynamic environments face telemetry outages, structural vibrations, and regime-dependent noise that invalidate the stationary covari...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL

Equipping LLMs with tool-use capabilities via Agentic Reinforcement Learning (Agentic RL) is bottlenecked by two challenges: the lack of scalable, robust execut...

#research #paper #ai #machine-learning #nlp
2 weeks ago · ai · - · -

[Paper] Distilling Tabular Foundation Models for Structured Health Data

Tabular foundation models (TFMs) achieve strong performance on health datasets, but their inference cost and infrastructure requirements limit practical use. We...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] PopPy: Opportunistically Exploiting Parallelism in Python Compound AI Applications

Compound AI applications, which compose calls to ML models using a general-purpose programming language like Python, are widely used for a variety of user-facin...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Reversa: A Reverse Documentation Engineering Framework for Converting Legacy Software into Operational Specifications for AI Agents

Legacy systems concentrate business rules, architectural decisions, and operational exceptions that often remain implicit in code, data, configuration, and main...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] GIM: Evaluating models via tasks that integrate multiple cognitive domains

As LLM benchmarks saturate, the evaluation community has pursued two strategies to increase difficulty: escalating knowledge demands (GPQA, HLE) or removing kno...

#research #paper #ai #machine-learning #nlp
2 weeks ago · ai · - · -

[Paper] An Assessment of Human vs. Model Uncertainty in Soft-Label Learning and Calibration

Central to human-aligned AI is understanding the benefits of human-elicited labels over synthetic alternatives. While human soft-labels improve calibration by c...

#research #paper #ai #machine-learning #nlp
2 weeks ago · ai · - · -

[Paper] Overeager Coding Agents: Measuring Out-of-Scope Actions on Benign Tasks

Coding agents now run autonomously with shell, file, and network privileges. When a user issues a benign request, the agent sometimes does more than asked: it d...

#research #paper #ai #machine-learning #nlp
2 weeks ago · ai · - · -

[Paper] Improving BM25 Code Retrieval Under Fixed Generic Tokenization: Adaptive q-Log Odds as a Drop-In BM25 Fix

In retrieval-augmented coding, failures often begin when the relevant file is absent from the retrieved context. Under frozen generic tokenization, where a BM25...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Self-supervised local learning rules learn the hidden hierarchical structure of high-dimensional data

The brain learns abstract representations of high-dimensional sensory input, but the plasticity rules that enable such learning are unknown. We study biological...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] When Fireflies Cluster; Enhancing Automatic Clustering via Centroid-Guided Firefly Optimization

This work presents a novel variant of the Firefly Algorithm (FA) for data clustering, addressing limitations of traditional methods like K-Means that struggle w...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Heterogeneous Tasks Offloading in Vehicular Edge Computing: A Federated Meta Deep Reinforcement Learning Approach

Vehicular edge computing (VEC) enables latency-sensitive vehicular applications by offloading computation-intensive tasks to nearby edge servers. However, real-...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Same Signal, Different Semantics: A Cross-Framework Behavioral Analysis of Software Engineering Agents

Behavioral studies of LLM-based software engineering agents extract operational rules about which trajectory shapes correlate with higher resolution rates: that...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] CommitDistill: A Lightweight Knowledge-Centric Memory Layer for Software Repositories

Software repositories accumulate large amounts of unstructured knowledge in commit messages, pull-request discussions, and issue threads, but developers and AI ...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] SIREM: Speech-Informed MRI Reconstruction with Learned Sampling

Real-time magnetic resonance imaging (rtMRI) of speech production enables non-invasive visualization of dynamic vocal-tract motion and is valuable for speech sc...

#research #paper #ai #machine-learning #nlp #computer-vision
2 weeks ago · ai · - · -

[Paper] SPATIOROUTE: Dynamic Prompt Routing for Zero-Shot Spatial Reasoning

Spatial question answering over egocentric video is a challenging task that requires Vision-Language Models (VLMs) to reason about 3D object positions, scene af...

#research #paper #ai #machine-learning #computer-vision
2 weeks ago · ai · - · -

[Paper] RGB-only Active 3D Scene Graph Generation for Indoor Mobile Robots

Current approaches to 3D scene graph generation rely on dedicated depth sensors, such as LiDAR or RGB-D cameras, for metric 3D reconstruction. This limits deplo...

#research #paper #ai #machine-learning #computer-vision
2 weeks ago · ai · - · -

[Paper] Beyond the Cartesian Illusion: Testing Two-Stage Multi-Modal Theory of Mind under Perceptual Bottlenecks

While Multi-Modal Large Language Models (MLLMs) demonstrate impressive capabilities in general reasoning, their embodied spatial intelligence remains hampered b...

#research #paper #ai #machine-learning #computer-vision
2 weeks ago · ai · - · -

[Paper] A-ProS: Towards Reliable Autonomous Programming Through Multi-Model Feedback

Large Language Models (LLMs) demonstrate strong potential for automated code generation, yet their ability to iteratively refine solutions using execution feedb...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] PROTEA: Offline Evaluation and Iterative Refinement for Multi-Agent LLM Workflows

Multi-agent LLM workflows -- systems composed of multiple role-specific LLM calls -- often outperform single-prompt baselines, but they remain difficult to debu...

#research #paper #ai #machine-learning #nlp
2 weeks ago · ai · - · -

[Paper] LogRouter: Adaptive Two-Level LLM Routing for Log Question Answering in Big Data Systems

Production log analytics in self-hosted, resource-constrained environments requires natural-language access to massive log streams without the cost of routing e...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Spiker-LL: An Energy-Efficient FPGA Accelerator Enabling Adaptive Local Learning in Spiking Neural Networks

Deploying adaptive intelligence at the edge remains challenging due to the high computational and energy cost of training neural models. Spiking Neural Networks...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Deep Reinforcement Learning Framework for Diversified Portfolio Management Across Global Equity Markets

This study develops and evaluates a deep reinforcement learning framework for dynamic portfolio allocation across global equity markets. The Soft Actor-Critic a...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Evolutionary Extreme Learning Machine of ab-initio Energy Landscapes for Crystal Structure Prediction using Manta Ray Optimization with Levy Flight

The Manta Ray Foraging Optimization algorithm (MRFO) has proven to be a powerful heuristic strategy in the optimal solution of a large number of engineering pro...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] IVGT: Implicit Visual Geometry Transformer for Neural Scene Representation

Reconstructing coherent 3D geometry and appearance from unposed multi-view images is a fundamental yet challenging problem in computer vision. Most existing vis...

#research #paper #ai #machine-learning #computer-vision
2 weeks ago · ai · - · -

[Paper] Designing Datacenter Power Delivery Hierarchies for the AI Era

Demand for AI accelerators is rapidly increasing rack power density, with projections approaching 1MW per deployment by 2027. This poses a major challenge for d...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] A Generative AI Framework for Intelligent Utility Billing CO2 Analytics and Sustainable Resource Optimisation

Distribution utilities are now expected to deliver bills that customers can actually read, attach a defensible carbon number to every kWh sold, and schedule load ...

#research #paper #ai #machine-learning #nlp
2 weeks ago · ai · - · -

[Paper] AI-Mediated Communication Can Steer Collective Opinion

Generative artificial intelligence (AI) is increasingly integrated into the online platforms where humans exchange opinions; large language models (LLMs) now po...

#research #paper #ai #machine-learning #nlp
2 weeks ago · ai · - · -

[Paper] Offline Semantic Guidance for Efficient Vision-Language-Action Policy Distillation

Billion-parameter Vision-Language-Action (VLA) policies have recently shown impressive performance in robotic manipulation, yet their size and inference cost re...

#research #paper #ai #machine-learning #computer-vision
2 weeks ago · ai · - · -

[Paper] Dynamics-Level Watermarking of Flow Matching Models with Random Codes

We introduce a dynamics-level approach to watermarking generative models. Rather than embedding signals into model weights or outputs, we embed the watermark di...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Prospective multi-pathogen disease forecasting using autonomous LLM-guided tree search

Probabilistic forecasting of infectious diseases is crucial for public health but relies on labor-intensive manual model curation by expert modeling teams. This...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Layer Equivalence Is Not a Property of Layers Alone: How You Test Redundancy Changes What You Find

When researchers ask whether two transformer layers are 'equivalent' for compression, they often conflate distinct tests. Replacement asks whether one layer's m...

#research #paper #ai #machine-learning #nlp
2 weeks ago · ai · - · -

[Paper] FORGE: Self-Evolving Agent Memory With No Weight Updates via Population Broadcast

Can LLM agents improve decision-making through self-generated memory without gradient updates? We propose FORGE (Failure-Optimized Reflective Graduation and Evo...

#research #paper #ai #machine-learning #nlp
2 weeks ago · ai · - · -

[Paper] A Unified Generative-AI Framework for Smart Energy Infrastructure: Intelligent Gas Distribution, Utility Billing, Carbon Analytics, and Quantum-Inspired Optimisation

The accelerating convergence of smart metering, generative artificial intelligence, and quantum-inspired combinatorial optimisation is reshaping how energy util...

#research #paper #ai #machine-learning #nlp
2 weeks ago · ai · - · -

[Paper] Universal Magnetic Structure Prediction from Atomic Coordinates with Near-Experimental Accuracy

Magnetic order is a fundamental property of materials, governing collective behavior and enabling a broad range of functionalities. Yet magnetic structure remai...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Evaluating Design Video Generation: Metrics for Compositional Fidelity

Generative video models are increasingly used in design animation tasks, yet no standardized evaluation framework exists for this domain. Unlike natural video g...

#research #paper #ai #machine-learning #computer-vision
2 weeks ago · ai · - · -

[Paper] Artificial Aphasias in Lesioned Language Models

Aphasias, selective language impairments which can arise from brain damage, reveal the functional organization of human language by providing causal links betwe...

#research #paper #ai #machine-learning #nlp
2 weeks ago · ai · - · -

[Paper] The Privacy Price of Tail-Risk Learning: Effective Tail Sample Size in Differentially Private CVaR Optimization

Differential privacy changes the effective sample size governing CVaR learning. For tail mass τ, the privacy-relevant sample size is not n, but nτ; equivalently...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Argus: Evidence Assembly for Scalable Deep Research Agents

Deep research agents have achieved remarkable progress on complex information seeking tasks. Even long ReAct style rollouts explore only a single trajectory, wh...

#research #paper #ai #machine-learning #nlp
2 weeks ago · ai · - · -

[Paper] Fully Open Meditron: An Auditable Pipeline for Clinical LLMs

Clinical decision support systems (CDSS) require scrutable, auditable pipelines that enable rigorous, reproducible validation. Yet current LLM-based CDSS remain...

#research #paper #ai #machine-learning #nlp
