Source

arXiv

5752 posts from this source

2 months ago · ai · - · -

[Paper] InterPrior: Scaling Generative Control for Physics-Based Human-Object Interactions

Humans rarely plan whole-body interactions with objects at the level of explicit whole-body movements. High-level intentions, such as affordance, define the goa...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] V-Retrver: Evidence-Driven Agentic Reasoning for Universal Multimodal Retrieval

Multimodal Large Language Models (MLLMs) have recently been applied to universal multimodal retrieval, where Chain-of-Thought (CoT) reasoning improves candidate...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] Can vision language models learn intuitive physics from interaction?

Pre-trained vision language models do not have good intuitions about the physical world. Recent work has shown that supervised fine-tuning can improve model per...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Splat and Distill: Augmenting Teachers with Feed-Forward 3D Reconstruction For 3D-Aware Distillation

Vision Foundation Models (VFMs) have achieved remarkable success when applied to various downstream 2D tasks. Despite their effectiveness, they often exhibit a ...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] AP-OOD: Attention Pooling for Out-of-Distribution Detection

Out-of-distribution (OOD) detection, which maps high-dimensional data into a scalar OOD score, is critical for the reliable deployment of machine learning model...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] PhysicsAgentABM: Physics-Guided Generative Agent-Based Modeling

Large language model (LLM)-based multi-agent systems enable expressive agent reasoning but are expensive to scale and poorly calibrated for timestep-aligned sta...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Curiosity is Knowledge: Self-Consistent Learning and No-Regret Optimization with Active Inference

Active inference (AIF) unifies exploration and exploitation by minimizing the Expected Free Energy (EFE), balancing epistemic value (information gain) and pragm...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Context Forcing: Consistent Autoregressive Video Generation with Long Context

Recent approaches to real-time long video generation typically employ streaming tuning strategies, attempting to train a long-context student using a short-cont...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] Learning Query-Aware Budget-Tier Routing for Runtime Agent Memory

Memory is increasingly central to Large Language Model (LLM) agents operating beyond a single context window, yet most existing systems rely on offline, query-a...

#research #paper #ai #machine-learning #nlp
2 months ago · ai · - · -

[Paper] Learning Event-Based Shooter Models from Virtual Reality Experiments

Virtual reality (VR) has emerged as a powerful tool for evaluating school security measures in high-risk scenarios such as school shootings, offering experiment...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Correctness-Optimized Residual Activation Lens (CORAL): Transferrable and Calibration-Aware Inference-Time Steering

Large language models (LLMs) exhibit persistent miscalibration, especially after instruction tuning and preference alignment. Modified training objectives can i...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Diffusion Model's Generalization Can Be Characterized by Inductive Biases toward a Data-Dependent Ridge Manifold

When a diffusion model is not memorizing the training data set, how does it generalize exactly? A quantitative understanding of the distribution it generates wo...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Multi-Token Prediction via Self-Distillation

Existing techniques for accelerating language model inference, such as speculative decoding, require training auxiliary speculator models and building and deplo...

#research #paper #ai #machine-learning #nlp
2 months ago · ai · - · -

[Paper] A Systematic Evaluation of Large Language Models for PTSD Severity Estimation: The Role of Contextual Knowledge and Modeling Strategies

Large language models (LLMs) are increasingly being used in a zero-shot fashion to assess mental health conditions, yet we have limited knowledge on what factor...

#research #paper #ai #nlp
2 months ago · ai · - · -

[Paper] Optimism Stabilizes Thompson Sampling for Adaptive Inference

Thompson sampling (TS) is widely used for stochastic multi-armed bandits, yet its inferential properties under adaptive data collection are subtle. Classical as...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] GenArena: How Can We Achieve Human-Aligned Evaluation for Visual Generation Tasks?

The rapid advancement of visual generation models has outpaced traditional evaluation approaches, necessitating the adoption of Vision-Language Models as surrog...

#research #paper #ai #machine-learning #computer-vision
2 months ago · software · - · -

[Paper] Characterizing and Modeling the GitHub Security Advisories Review Pipeline

GitHub Security Advisories (GHSA) have become a central component of open-source vulnerability disclosure and are widely used by developers and security tools. ...

#research #paper #software
2 months ago · ai · - · -

[Paper] AgenticPay: A Multi-Agent LLM Negotiation System for Buyer-Seller Transactions

Large language model (LLM)-based agents are increasingly expected to negotiate, coordinate, and transact autonomously, yet existing benchmarks lack principled s...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Speech Emotion Recognition Leveraging OpenAI's Whisper Representations and Attentive Pooling Methods

Speech Emotion Recognition (SER) research has faced limitations due to the lack of standard and sufficiently large datasets. Recent studies have leveraged pre-t...

#research #paper #ai #machine-learning #nlp
2 months ago · ai · - · -

[Paper] DSB: Dynamic Sliding Block Scheduling for Diffusion LLMs

Diffusion large language models (dLLMs) have emerged as a promising alternative for text generation, distinguished by their native support for parallel decoding...

#research #paper #ai #nlp
2 months ago · ai · - · -

[Paper] SAGE: Benchmarking and Improving Retrieval for Deep Research Agents

Deep research agents have emerged as powerful systems for addressing complex queries. Meanwhile, LLM-based retrievers have demonstrated strong capability in fol...

#research #paper #ai #nlp
2 months ago · ai · - · -

[Paper] Characterizing Human Semantic Navigation in Concept Production as Trajectories in Embedding Space

Semantic representations can be framed as a structured, dynamic knowledge space through which humans navigate to retrieve and manipulate meaning. To investigate...

#research #paper #ai #machine-learning #nlp
2 months ago · devops · - · -

[Paper] Location-Aware Dispersion on Anonymous Graphs

The well-studied DISPERSION problem is a fundamental coordination problem in distributed robotics, where a set of mobile robots must relocate so that each occup...

#research #paper #devops
2 months ago · ai · - · -

[Paper] Self-Improving Multilingual Long Reasoning via Translation-Reasoning Integrated Training

Long reasoning models often struggle in multilingual settings: they tend to reason in English for non-English questions; when constrained to reasoning in the qu...

#research #paper #ai #nlp
2 months ago · ai · - · -

[Paper] Polyglots or Multitudes? Multilingual LLM Answers to Value-laden Multiple-Choice Questions

Multiple-Choice Questions (MCQs) are often used to assess knowledge, reasoning abilities, and even values encoded in large language models (LLMs). While the eff...

#research #paper #ai #nlp
2 months ago · software · - · -

[Paper] When Elo Lies: Hidden Biases in Codeforces-Based Evaluation of Large Language Models

As Large Language Models (LLMs) achieve breakthroughs in complex reasoning, Codeforces-based Elo ratings have emerged as a prominent metric for evaluating compe...

#research #paper #software
2 months ago · ai · - · -

[Paper] DARWIN: Dynamic Agentically Rewriting Self-Improving Network

DARWIN is an evolutionary GPT model, utilizing a genetic-algorithm like optimization structure with several independent GPT agents being trained individually us...

#research #paper #ai #machine-learning #nlp
2 months ago · devops · - · -

[Paper] The Quantum Message Complexity of Distributed Wake-Up with Advice

We consider the distributed wake-up problem with advice, where nodes are equipped with initial knowledge about the network at large. After the adversary awakens...

#research #paper #devops
2 months ago · ai · - · -

[Paper] Automated Customization of LLMs for Enterprise Code Repositories Using Semantic Scopes

Code completion (CC) is a task frequently used by developers when working in collaboration with LLM-based programming assistants. Despite the increased performa...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] RocqSmith: Can Automatic Optimization Forge Better Proof Agents?

This work studies the applicability of automatic AI agent optimization methods to real-world agents in formal verification settings, focusing on automated theor...

#research #paper #ai #machine-learning
2 months ago · software · - · -

[Paper] Toward Quantum-Safe Software Engineering: A Vision for Post-Quantum Cryptography Migration

The quantum threat to cybersecurity has accelerated the standardization of Post-Quantum Cryptography (PQC). Migrating legacy software to these quantum-safe algo...

#research #paper #software
2 months ago · ai · - · -

[Paper] TimelyFreeze: Adaptive Parameter Freezing Mechanism for Pipeline Parallelism

Pipeline parallelism enables training models that exceed single-device memory, but practical throughput remains limited by pipeline bubbles. Although parameter ...

#research #paper #ai #machine-learning
2 months ago · software · - · -

[Paper] A Bayesian Optimization-Based AutoML Framework for Non-Intrusive Load Monitoring

Non-Intrusive Load Monitoring (NILM), commonly known as energy disaggregation, aims to estimate the power consumption of individual appliances by analyzing a ho...

#research #paper #software
2 months ago · ai · - · -

[Paper] Neuro-Inspired Visual Pattern Recognition via Biological Reservoir Computing

In this paper, we present a neuro-inspired approach to reservoir computing (RC) in which a network of in vitro cultured cortical neurons serves as the physical ...

#research #paper #ai #computer-vision
2 months ago · software · - · -

[Paper] A Dual-Loop Agent Framework for Automated Vulnerability Reproduction

Automated vulnerability reproduction from CVE descriptions requires generating executable Proof-of-Concept (PoC) exploits and validating them in target environm...

#research #paper #software
2 months ago · ai · - · -

[Paper] Towards Green AI: Decoding the Energy of LLM Inference in Software Development

Context: AI-assisted tools are increasingly integrated into software development workflows, but their reliance on large language models (LLMs) introduces substa...

#research #paper #ai #machine-learning
2 months ago · software · - · -

[Paper] SEAL: Symbolic Execution with Separation Logic (Competition Contribution)

SEAL is a static analyser for the verification of programs that manipulate unbounded linked data structures. It is based on separation logic to represent abstra...

#research #paper #software
2 months ago · ai · - · -

[Paper] FedRandom: Sampling Consistent and Accurate Contribution Values in Federated Learning

Federated Learning is a privacy-preserving decentralized approach for Machine Learning tasks. In industry deployments characterized by a limited number of entit...

#research #paper #ai #machine-learning
2 months ago · devops · - · -

[Paper] Smoothed aggregation algebraic multigrid for problems with heterogeneous and anisotropic materials

This paper introduces a material-aware strength-of-connection measure for smoothed aggregation algebraic multigrid methods, aimed at improving robustness for sc...

#research #paper #devops
2 months ago · ai · - · -

[Paper] Variable Search Stepsize for Randomized Local Search in Multi-Objective Combinatorial Optimization

Over the past two decades, research in evolutionary multi-objective optimization has predominantly focused on continuous domains, with comparatively limited att...

#research #paper #ai
2 months ago · ai · - · -

[Paper] ArkTS-CodeSearch: A Open-Source ArkTS Dataset for Code Retrieval

ArkTS is a core programming language in the OpenHarmony ecosystem, yet research on ArkTS code intelligence is hindered by the lack of public datasets and evalua...

#research #paper #ai #nlp
2 months ago · ai · - · -

[Paper] Sovereign-by-Design A Reference Architecture for AI and Blockchain Enabled Systems

Digital sovereignty has emerged as a central concern for modern software-intensive systems, driven by the dominance of non-sovereign cloud infrastructures, the ...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Optimization is Not Enough: Why Problem Formulation Deserves Equal Attention

Black-box optimization is increasingly used in engineering design problems where simulation-based evaluations are costly and gradients are unavailable. In this ...

#research #paper #ai
2 months ago · devops · - · -

[Paper] Emergence-as-Code for Self-Governing Reliable Systems

SLO-as-code has made per-service reliability declarative, but user experience is defined by journeys whose reliability is an emergent property of microservice ...

#research #paper #devops
2 months ago · devops · - · -

[Paper] Reaching Univalency with Subquadratic Communication

The Dolev-Reischuk lower bound establishes that any deterministic Byzantine Agreement (BA) protocol for n processors tolerating f faults requires Ω(f^2+n) messa...

#research #paper #devops
2 months ago · devops · - · -

[Paper] Proteus: Append-Only Ledgers for (Mostly) Trusted Execution Environments

Distributed ledgers are increasingly relied upon by industry to provide trustworthy accountability, strong integrity protection, and high availability for criti...

#research #paper #devops
2 months ago · devops · - · -

[Paper] ORACL: Optimized Reasoning for Autoscaling via Chain of Thought with LLMs for Microservices

Applications are moving away from monolithic designs to microservice and serverless architectures, where fleets of lightweight and independently deployable comp...

#research #paper #devops
3 months ago · ai · - · -

[Paper] Reinforced Attention Learning

Post-training with Reinforcement Learning (RL) has substantially improved reasoning in Large Language Models (LLMs) via test-time scaling. However, extending th...

#research #paper #ai #machine-learning #nlp #computer-vision
