Source

arXiv

5752 posts from this source

2 months ago · ai · - · -

[Paper] PDE foundation models are skillful AI weather emulators for the Martian atmosphere

We show that AI foundation models that are pretrained on numerical solutions to a diverse corpus of partial differential equations can be adapted and fine-tuned...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Spectral Convolution on Orbifolds for Geometric Deep Learning

Geometric deep learning (GDL) deals with supervised learning on data domains that go beyond Euclidean structure, such as data with graph or manifold structure. ...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] On the Semantics of Primary Cause in Hybrid Dynamic Domains

Reasoning about actual causes of observed effects is fundamental to the study of rationality. This important problem has been studied since the time of Aristotl...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] ThermEval: A Structured Benchmark for Evaluation of Vision-Language Models on Thermal Imagery

Vision language models (VLMs) achieve strong performance on RGB imagery, but they do not generalize to thermal images. Thermal sensing plays a critical role in ...

#research #paper #ai #machine-learning #computer-vision
2 months ago · ai · - · -

[Paper] Counterfactual Fairness Evaluation of LLM-Based Contact Center Agent Quality Assurance System

Large Language Models (LLMs) are increasingly deployed in contact-center Quality Assurance (QA) to automate agent performance evaluation and coaching feedback. ...

#LLM fairness #bias evaluation #contact center AI #counterfactual testing #prompt mitigation
2 months ago · ai · - · -

[Paper] PhyScensis: Physics-Augmented LLM Agents for Complex Physical Scene Arrangement

Automatically generating interactive 3D environments is crucial for scaling up robotic data collection in simulation. While prior work has primarily focused on ...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] PAct: Part-Decomposed Single-View Articulated Object Generation

Articulated objects are central to interactive 3D applications, including embodied AI, robotics, and VR/AR, where functional part decomposition and kinematic mo...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] Tool-Aware Planning in Contact Center AI: Evaluating LLMs through Lineage-Guided Query Decomposition

We present a domain-grounded framework and benchmark for tool-aware plan generation in contact centers, where answering a query for business insights, our targe...

#research #paper #ai #nlp
2 months ago · ai · - · -

[Paper] AnchorWeave: World-Consistent Video Generation with Retrieved Local Spatial Memories

Maintaining spatial world consistency over long horizons remains a central challenge for camera-controllable video generation. Existing memory-based approaches ...

#video generation #spatial memory #computer vision #deep learning #transformer
2 months ago · ai · - · -

[Paper] Wrivinder: Towards Spatial Intelligence for Geo-locating Ground Images onto Satellite Imagery

Aligning ground-level imagery with geo-registered satellite maps is crucial for mapping, navigation, and situational awareness, yet remains challenging under la...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] MAC-AMP: A Closed-Loop Multi-Agent Collaboration System for Multi-Objective Antimicrobial Peptide Design

To address the global health threat of antimicrobial resistance, antimicrobial peptides (AMP) are being explored for their potent and promising ability to fight...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] ReusStdFlow: A Standardized Reusability Framework for Dynamic Workflow Construction in Agentic AI

To address the "reusability dilemma" and structural hallucinations in enterprise Agentic AI, this paper proposes ReusStdFlow, a framework centered on a novel `...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] BFS-PO: Best-First Search for Large Reasoning Models

Large Reasoning Models (LRMs) such as OpenAI o1 and DeepSeek-R1 have shown excellent performance in reasoning tasks using long reasoning chains. However, this h...

#large-reasoning-models #reinforcement-learning #best-first-search #token-efficiency #chain-of-thought
2 months ago · ai · - · -

[Paper] Picking the Right Specialist: Attentive Neural Process-based Selection of Task-Specialized Models as Tools for Agentic Healthcare Systems

Task-specialized models form the backbone of agentic healthcare systems, enabling the agents to answer clinical queries across tasks such as disease diagnosis, ...

#research #paper #ai #machine-learning #computer-vision
2 months ago · ai · - · -

[Paper] Web-Scale Multimodal Summarization using CLIP-Based Semantic Alignment

We introduce Web-Scale Multimodal Summarization, a lightweight framework for generating summaries by combining retrieved text and image data from web sources. G...

#research #paper #ai #machine-learning #computer-vision
2 months ago · ai · - · -

[Paper] CT-Bench: A Benchmark for Multimodal Lesion Understanding in Computed Tomography

Artificial intelligence (AI) can automatically delineate lesions on computed tomography (CT) and generate radiology report content, yet progress is limited by t...

#medical imaging #multimodal AI #benchmark #CT #vision-language
2 months ago · software · - · -

[Paper] Model Context Protocol (MCP) Tool Descriptions Are Smelly! Towards Improving AI Agent Efficiency with Augmented MCP Tool Descriptions

The Model Context Protocol (MCP) standardizes how Foundation Model (FM)-based agents interact with external systems by invoking tools. However, to understand a ...

#research #paper #software
2 months ago · ai · - · -

[Paper] EmbeWebAgent: Embedding Web Agents into Any Customized UI

Most web agents operate at the human interface level, observing screenshots or raw DOM trees without application-level access, which limits robustness and actio...

#web-agents #UI-automation #LLM-integration #frontend-hooks #websocket
2 months ago · ai · - · -

[Paper] Atomix: Timely, Transactional Tool Use for Reliable Agentic Workflows

LLM agents increasingly act on external systems, yet tool effects are immediate. Under failures, speculation, or contention, losing branches can leak unintended...

#LLM agents #transactional runtime #tool integration #agentic workflows #fault tolerance
2 months ago · ai · - · -

[Paper] Testimole-Conversational: A 30-Billion-Word Italian Discussion Board Corpus (1996-2024) for Language Modeling and Sociolinguistic Research

We present 'Testimole-conversational', a massive collection of discussion board messages in the Italian language. The large size of the corpus, more than 30B wo...

#research #paper #ai #nlp
2 months ago · ai · - · -

[Paper] Learning State-Tracking from Code Using Linear RNNs

In recent years, state-tracking tasks, particularly permutation composition, have become a testbed for understanding the limits of sequence model architectures...

#research #paper #ai #machine-learning #nlp
2 months ago · ai · - · -

[Paper] GOT-JEPA: Generic Object Tracking with Model Adaptation and Occlusion Handling using Joint-Embedding Predictive Architecture

The human visual system tracks objects by integrating current observations with previously observed information, adapting to target and scene changes, and reaso...

#research #paper #ai #machine-learning #computer-vision
2 months ago · devops · - · -

[Paper] Evaluation of Dynamic Vector Bin Packing for Virtual Machine Placement

Virtual machine placement is a crucial challenge in cloud computing for efficiently utilizing physical machine resources in data centers. Virtual machine placem...

#research #paper #devops
2 months ago · software · - · -

[Paper] Configuring Agentic AI Coding Tools: An Exploratory Study

Agentic AI coding tools with autonomous capabilities beyond conversational content generation increasingly automate repetitive and time-consuming software devel...

#agentic AI #coding assistants #configuration #empirical study #GitHub
2 months ago · software · - · -

[Paper] The Value of Effective Pull Request Description

In the pull-based development model, code contributions are submitted as pull requests (PRs) to undergo reviews and approval by other developers with the goal o...

#research #paper #software
2 months ago · software · - · -

[Paper] Consistent or Sensitive? Automated Code Revision Tools Against Semantics-Preserving Perturbations

Automated Code Revision (ACR) tools aim to reduce manual effort by automatically generating code revisions based on reviewer feedback. While ACR tools have show...

#research #paper #software
2 months ago · software · - · -

[Paper] Automated Classification of Source Code Changes Based on Metrics Clustering in the Software Development Process

This paper presents an automated method for classifying source code changes during the software development process based on clustering of change metrics. The m...

#code review automation #commit classification #clustering #software metrics #machine learning
2 months ago · devops · - · -

[Paper] An Empirical Study of the Evolution of GitHub Actions Workflows

CI/CD practices play a significant role during collaborative software development by automating time-consuming and repetitive tasks such as testing, building, q...

#GitHub Actions #CI/CD #workflow evolution #empirical study #software engineering
2 months ago · software · - · -

[Paper] When Security Meets Usability: An Empirical Investigation of Post-Quantum Cryptography APIs

Advances in quantum computing increasingly threaten the security and privacy of data protected by current cryptosystems, particularly those relying on public-ke...

#research #paper #software
2 months ago · devops · - · -

[Paper] Efficient Multi-round LLM Inference over Disaggregated Serving

With the rapid evolution of Large Language Models (LLMs), multi-round workflows, such as autonomous agents and iterative retrieval, have become increasingly pre...

#LLM serving #dynamic scheduling #disaggregated architecture #inference latency #AMPD
2 months ago · ai · - · -

[Paper] Parameter-Efficient Fine-Tuning of LLMs with Mixture of Space Experts

Large Language Models (LLMs) have achieved remarkable progress, with Parameter-Efficient Fine-Tuning (PEFT) emerging as a key technique for downstream task adap...

#research #paper #ai #machine-learning #nlp
2 months ago · ai · - · -

[Paper] Revisiting the Platonic Representation Hypothesis: An Aristotelian View

The Platonic Representation Hypothesis suggests that representations from neural networks are converging to a common statistical model of reality. We show that ...

#research #paper #ai #machine-learning #computer-vision
2 months ago · ai · - · -

[Paper] Selective Synchronization Attention

The Transformer architecture has become the foundation of modern deep learning, yet its core self-attention mechanism suffers from quadratic computational compl...

#research #paper #ai #machine-learning #nlp
2 months ago · ai · - · -

[Paper] Boule or Baguette? A Study on Task Topology, Length Generalization, and the Benefit of Reasoning Traces

Recent years have witnessed meteoric progress in reasoning models: neural networks that generate intermediate reasoning traces (RTs) before producing a final ou...

#reasoning-traces #logic-dataset #length-generalization #transformer-models #propositional-logic
2 months ago · ai · - · -

[Paper] Floe: Federated Specialization for Real-Time LLM-SLM Inference

Deploying large language models (LLMs) in real-time systems remains challenging due to their substantial computational demands and privacy concerns. We propose ...

#research #paper #ai #machine-learning
2 months ago · devops · - · -

[Paper] Parallel Sparse and Data-Sparse Factorization-based Linear Solvers

Efficient solutions of large-scale, ill-conditioned and indefinite algebraic equations are ubiquitously needed in numerous computational fields, including multi...

#research #paper #devops
2 months ago · devops · - · -

[Paper] ML-ECS: A Collaborative Multimodal Learning Framework for Edge-Cloud Synergies

Edge-cloud synergies provide a promising paradigm for privacy-preserving deployment of foundation models, where lightweight on-device models adapt to domain-spe...

#research #paper #devops
2 months ago · ai · - · -

[Paper] An effective Genetic Programming Hyper-Heuristic for Uncertain Agile Satellite Scheduling

This paper investigates a novel problem, namely the Uncertain Agile Earth Observation Satellite Scheduling Problem (UAEOSSP). Unlike the static AEOSSP, it takes...

#genetic programming #hyper-heuristic #satellite scheduling #uncertainty modeling #evolutionary algorithms
2 months ago · ai · - · -

[Paper] Evolving Multi-Channel Confidence-Aware Activation Functions for Missing Data with Channel Propagation

Learning in the presence of missing data can result in biased predictions and poor generalizability, among other difficulties, which data imputation methods onl...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] A Unified Physics-Informed Neural Network for Modeling Coupled Electro- and Elastodynamic Wave Propagation Using Three-Stage Loss Optimization

Physics-Informed Neural Networks present a novel approach in SciML that integrates physical laws in the form of partial differential equations directly into the...

#physics-informed neural networks #PINN #multi-physics simulation #piezoelectric wave propagation #deep learning
2 months ago · devops · - · -

[Paper] TEG: Exascale Cluster Governance via Non-Equilibrium Thermodynamics and Langevin Dynamics

As cloud computing scales toward the Exascale regime (10^5+ nodes), the prevailing 'Newtonian' orchestration paradigm -- exemplified by Kubernetes -- approaches...

#research #paper #devops
2 months ago · ai · - · -

[Paper] OR-Agent: Bridging Evolutionary Search and Structured Research for Automated Algorithm Discovery

Automating scientific discovery in complex, experiment-driven domains requires more than iterative mutation of programs; it demands structured hypothesis manage...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Discrete Gene Crossover Accelerates Solution Discovery in Quality-Diversity Algorithms

Quality-Diversity (QD) algorithms aim to discover diverse, high-performing solutions across behavioral niches. However, QD search often stagnates as incremental...

#research #paper #ai
2 months ago · devops · - · -

[Paper] SIDSense: Database-Free TV White Space Sensing for Disaster-Resilient Connectivity

Small Island Developing States (SIDS) are disproportionately exposed to climate-driven disasters, yet often rely on fragile terrestrial networks that fail when ...

#research #paper #devops
2 months ago · ai · - · -

[Paper] Preventing Rank Collapse in Federated Low-Rank Adaptation with Client Heterogeneity

Federated low-rank adaptation (FedLoRA) has facilitated communication-efficient and privacy-preserving fine-tuning of foundation models for downstream tasks. In...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Imitating What Works: Simulation-Filtered Modular Policy Learning from Human Videos

The ability to learn manipulation skills by watching videos of humans has the potential to unlock a new source of highly scalable data for robot learning. Here,...

#research #paper #ai #machine-learning #computer-vision
2 months ago · ai · - · -

[Paper] Conversational Image Segmentation: Grounding Abstract Concepts with Scalable Supervision

Conversational image segmentation grounds abstract, intent-driven concepts into pixel-accurate masks. Prior work on referring image grounding focuses on categor...

#image segmentation #computer vision #large language models #dataset #research paper
2 months ago · ai · - · -

[Paper] Semantic Chunking and the Entropy of Natural Language

The entropy rate of printed English is famously estimated to be about one bit per character, a benchmark that modern large language models (LLMs) have only rece...

#research #paper #ai #machine-learning #nlp
