Source

arXiv

5861 posts from this source

Sort:

5 months ago · ai · - · -

[Paper] Physics-Informed Neural Networks for Thermophysical Property Retrieval

Inverse heat problems refer to the estimation of material thermophysical properties given observed or known heat diffusion behaviour. Inverse heat problems have...

#research #paper #ai #machine-learning #computer-vision
5 months ago · ai · - · -

[Paper] Provable Benefits of Sinusoidal Activation for Modular Addition

This paper studies the role of activation functions in learning modular addition with two-layer neural networks. We first establish a sharp expressivity gap: si...

#research #paper #ai #machine-learning
5 months ago · ai · - · -

[Paper] ASTRO: Adaptive Stitching via Dynamics-Guided Trajectory Rollouts

Offline reinforcement learning (RL) enables agents to learn optimal policies from pre-collected datasets. However, datasets containing suboptimal and fragmented...

#research #paper #ai #machine-learning
5 months ago · ai · - · -

[Paper] Accelerated Execution of Bayesian Neural Networks using a Single Probabilistic Forward Pass and Code Generation

Machine learning models perform well across domains such as diagnostics, weather forecasting, NLP, and autonomous driving, but their limited uncertainty handlin...

#research #paper #ai #machine-learning
5 months ago · ai · - · -

[Paper] Towards Continuous Intelligence Growth: Self-Training, Continual Learning, and Dual-Scale Memory in SuperIntelliAgent

We introduce SuperIntelliAgent, an agentic learning framework that couples a trainable small diffusion model (the learner) with a frozen large language model (t...

#research #paper #ai #machine-learning
5 months ago · ai · - · -

[Paper] Hunyuan-GameCraft-2: Instruction-following Interactive Game World Model

Recent advances in generative world models have enabled remarkable progress in creating open-ended game environments, evolving from static scene synthesis towar...

#research #paper #ai #computer-vision
5 months ago · ai · - · -

[Paper] DisMo: Disentangled Motion Representations for Open-World Motion Transfer

Recent advances in text-to-video (T2V) and image-to-video (I2V) models, have enabled the creation of visually compelling and dynamic videos from simple textual ...

#research #paper #ai #computer-vision
5 months ago · ai · - · -

[Paper] Evaluating LLMs for One-Shot Patching of Real and Artificial Vulnerabilities

Automated vulnerability patching is crucial for software security, and recent advancements in Large Language Models (LLMs) present promising capabilities for au...

#research #paper #ai #machine-learning
5 months ago · ai · - · -

[Paper] MANTA: Physics-Informed Generalized Underwater Object Tracking

Underwater object tracking is challenging due to wavelength dependent attenuation and scattering, which severely distort appearance across depths and water cond...

#research #paper #ai #computer-vision
5 months ago · ai · - · -

[Paper] LFM2 Technical Report

We present LFM2, a family of Liquid Foundation Models designed for efficient on-device deployment and strong task capabilities. Using hardware-in-the-loop archi...

#research #paper #ai #machine-learning
5 months ago · ai · - · -

[Paper] Quantized-Tinyllava: a new multimodal foundation model enables efficient split learning

Split learning is well known as a method for resolving data privacy concerns by training a model on distributed devices, thereby avoiding data sharing that rais...

#research #paper #ai #machine-learning
5 months ago · ai · - · -

[Paper] MegaChat: A Synthetic Persian Q&A Dataset for High-Quality Sales Chatbot Evaluation

Small and medium-sized enterprises (SMEs) in Iran increasingly leverage Telegram for sales, where real-time engagement is essential for conversion. However, dev...

#research #paper #ai #machine-learning #nlp
5 months ago · ai · - · -

[Paper] Ambiguity Awareness Optimization: Towards Semantic Disambiguation for Direct Preference Optimization

Direct Preference Optimization (DPO) is a widely used reinforcement learning from human feedback (RLHF) method across various domains. Recent research has incre...

#research #paper #ai #nlp
5 months ago · ai · - · -

[Paper] Learning-Augmented Online Bipartite Matching in the Random Arrival Order Model

We study the online unweighted bipartite matching problem in the random arrival order model, with $n$ offline and $n$ online vertices, in the learning-augmented...

#research #paper #ai #machine-learning
5 months ago · ai · - · -

[Paper] Hierarchical AI-Meteorologist: LLM-Agent System for Multi-Scale and Explainable Weather Forecast Reporting

We present the Hierarchical AI-Meteorologist, an LLM-agent system that generates explainable weather reports using a hierarchical forecast reasoning and weather...

#research #paper #ai #machine-learning
5 months ago · ai · - · -

[Paper] VQRAE: Representation Quantization Autoencoders for Multimodal Understanding, Generation and Reconstruction

Unifying multimodal understanding, generation and reconstruction representation in a single tokenizer remains a key challenge in building unified models. Previo...

#research #paper #ai #computer-vision
5 months ago · ai · - · -

[Paper] Is Passive Expertise-Based Personalization Enough? A Case Study in AI-Assisted Test-Taking

Novice and expert users have different systematic preferences in task-oriented dialogues. However, whether catering to these preferences actually improves user ...

#research #paper #ai #nlp
5 months ago · ai · - · -

[Paper] Optimizing Multimodal Language Models through Attention-based Interpretability

Modern large language models become multimodal, analyzing various data formats like text and images. While fine-tuning is effective for adapting these multimoda...

#research #paper #ai #nlp #computer-vision
5 months ago · ai · - · -

[Paper] Scaling HuBERT for African Languages: From Base to Large and XL

Despite recent progress in multilingual speech processing, African languages remain under-represented in both research and deployed systems, particularly when i...

#research #paper #ai #nlp
5 months ago · ai · - · -

[Paper] Agentic AI Framework for Smart Inventory Replenishment

In contemporary retail, the variety of products available (e.g. clothing, groceries, cosmetics, frozen goods) make it difficult to predict the demand, prevent s...

#research #paper #ai #machine-learning
5 months ago · ai · - · -

[Paper] Functional Program Synthesis with Higher-Order Functions and Recursion Schemes

Program synthesis is the process of generating a computer program following a set of specifications, such as a set of input-output examples. It can be modeled a...

#research #paper #ai
5 months ago · ai · - · -

[Paper] Towards Improving Interpretability of Language Model Generation through a Structured Knowledge Discovery Approach

Knowledge-enhanced text generation aims to enhance the quality of generated text by utilizing internal or external knowledge sources. While language models have...

#research #paper #ai #machine-learning #nlp
5 months ago · ai · - · -

[Paper] Tackling a Challenging Corpus for Early Detection of Gambling Disorder: UNSL at MentalRiskES 2025

Gambling disorder is a complex behavioral addiction that is challenging to understand and address, with severe physical, psychological, and social consequences....

#research #paper #ai #nlp
5 months ago · software · - · -

[Paper] Chart2Code-MoLA: Efficient Multi-Modal Code Generation via Adaptive Expert Routing

Chart-to-code generation is a critical task in automated data visualization, translating complex chart structures into executable programs. While recent Multi-m...

#research #paper #software
5 months ago · ai · - · -

[Paper] Every Token Counts: Generalizing 16M Ultra-Long Context in Large Language Models

This work explores the challenge of building ``Machines that Can Remember'', framing long-term memory as the problem of efficient ultra-long context modeling. W...

#research #paper #ai #machine-learning #nlp
5 months ago · ai · - · -

[Paper] Toward Automatic Safe Driving Instruction: A Large-Scale Vision Language Model Approach

Large-scale Vision Language Models (LVLMs) exhibit advanced capabilities in tasks that require visual information, including object detection. These capabilitie...

#research #paper #ai #machine-learning #nlp #computer-vision
5 months ago · software · - · -

[Paper] FLIMs: Fault Localization Interference Mutants, Definition, Recognition and Mitigation

Mutation-based Fault Localization (MBFL) has been widely explored for automated software debugging, leveraging artificial mutants to identify faulty code entiti...

#research #paper #software
5 months ago · devops · - · -

[Paper] Beyond 2-Edge-Connectivity: Algorithms and Impossibility for Content-Oblivious Leader Election

The content-oblivious model, introduced by Censor-Hillel, Cohen, Gelles, and Sel (PODC 2022; Distributed Computing 2023), captures an extremely weak form of com...

#research #paper #devops
5 months ago · ai · - · -

[Paper] Closing the Generalization Gap in Parameter-efficient Federated Edge Learning

Federated edge learning (FEEL) provides a promising foundation for edge artificial intelligence (AI) by enabling collaborative model training while preserving d...

#research #paper #ai #machine-learning
5 months ago · devops · - · -

[Paper] RetryGuard: Preventing Self-Inflicted Retry Storms in Cloud Microservices Applications

Modern cloud applications are built on independent, diverse microservices, offering scalability, flexibility, and usage-based billing. However, the structural d...

#research #paper #devops
5 months ago · software · - · -

[Paper] GAPS: Guiding Dynamic Android Analysis with Static Path Synthesis

Dynamically resolving method reachability in Android applications remains a critical and largely unsolved problem. Despite notable advancements in GUI testing a...

#research #paper #software
5 months ago · devops · - · -

[Paper] Communication-Computation Pipeline Parallel Split Learning over Wireless Edge Networks

Split learning (SL) offloads main computing tasks from multiple resource-constrained user equippments (UEs) to the base station (BS), while preserving local dat...

#research #paper #devops
5 months ago · ai · - · -

[Paper] AI for software engineering: from probable to provable

Vibe coding, the much-touted use of AI techniques for programming, faces two overwhelming obstacles: the difficulty of specifying goals ('prompt engineering' is...

#research #paper #ai #machine-learning
5 months ago · software · - · -

[Paper] Amplifiers or Equalizers? A Longitudinal Study of LLM Evolution in Software Engineering Project-Based Learning

As LLMs reshape software development, integrating LLM-augmented practices into SE education has become imperative. While existing studies explore LLMs' educatio...

#research #paper #software
5 months ago · ai · - · -

[Paper] Spectral Concentration at the Edge of Stability: Information Geometry of Kernel Associative Memory

High-capacity kernel Hopfield networks exhibit a 'Ridge of Optimization' characterized by extreme stability. While previously linked to 'Spectral Concentration,...

#research #paper #ai #machine-learning
5 months ago · devops · - · -

[Paper] Areon: Latency-Friendly and Resilient Multi-Proposer Consensus

We present Areon, a family of latency-friendly, stake-weighted, multi-proposer proof-of-stake consensus protocols. By allowing multiple proposers per slot and o...

#research #paper #devops
5 months ago · ai · - · -

[Paper] Intelligent Neural Networks: From Layered Architectures to Graph-Organized Intelligence

Biological neurons exhibit remarkable intelligence: they maintain internal states, communicate selectively with other neurons, and self-organize into complex gr...

#research #paper #ai #machine-learning #nlp
5 months ago · ai · - · -

[Paper] Privacy-preserving fall detection at the edge using Sony IMX636 event-based vision sensor and Intel Loihi 2 neuromorphic processor

Fall detection for elderly care using non-invasive vision-based systems remains an important yet unsolved problem. Driven by strict privacy requirements, infere...

#research #paper #ai
5 months ago · ai · - · -

[Paper] Prediction performance of random reservoirs with different topology for nonlinear dynamical systems with different number of degrees of freedom

Reservoir computing (RC) is a powerful framework for predicting nonlinear dynamical systems, yet the role of reservoir topology$-$particularly symmetry in conne...

#research #paper #ai
5 months ago · ai · - · -

[Paper] Equilibrium Propagation Without Limits

We liberate Equilibrium Propagation (EP) from the limit of infinitesimal perturbations by establishing a finite-nudge foundation for local credit assignment. By...

#research #paper #ai #machine-learning
5 months ago · ai · - · -

[Paper] Revisiting Generalization Across Difficulty Levels: It's Not So Easy

We investigate how well large language models (LLMs) generalize across different task difficulties, a key question for effective data curation and evaluation. E...

#research #paper #ai #machine-learning #nlp
5 months ago · ai · - · -

[Paper] Canvas-to-Image: Compositional Image Generation with Multimodal Controls

While modern diffusion models excel at generating high-quality and diverse images, they still struggle with high-fidelity compositional and multimodal control, ...

#image generation #diffusion models #multimodal control #computer vision #research
5 months ago · ai · - · -

[Paper] TraceGen: World Modeling in 3D Trace Space Enables Learning from Cross-Embodiment Videos

Learning new robot tasks on new platforms and in new scenes from only a handful of demonstrations remains challenging. While videos of other embodiments - human...

#research #paper #ai #machine-learning #computer-vision
5 months ago · ai · - · -

[Paper] ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

Large language models are powerful generalists, yet solving deep and complex problems such as those of the Humanity's Last Exam (HLE) remains both conceptually ...

#research #paper #ai #machine-learning #nlp
5 months ago · ai · - · -

[Paper] G$^2$VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning

Vision-Language Models (VLMs) still lack robustness in spatial intelligence, demonstrating poor performance on spatial understanding and reasoning tasks. We att...

#research #paper #ai #machine-learning #nlp #computer-vision
5 months ago · ai · - · -

[Paper] Matrix: Peer-to-Peer Multi-Agent Synthetic Data Generation Framework

Synthetic data has become increasingly important for training large language models, especially when real data is scarce, expensive, or privacy-sensitive. Many ...

#research #paper #ai #machine-learning #nlp
5 months ago · ai · - · -

[Paper] Seeing without Pixels: Perception from Camera Trajectories

Can one perceive a video's content without seeing its pixels, just from the camera trajectory-the path it carves through space? This paper is the first to syste...

#research #paper #ai #computer-vision
5 months ago · ai · - · -

[Paper] Agentic Learner with Grow-and-Refine Multimodal Semantic Memory

MLLMs exhibit strong reasoning on isolated queries, yet they operate de novo -- solving each problem independently and often repeating the same mistakes. Existi...

#multimodal memory #lifelong learning #large multimodal models #semantic memory #AI reasoning

Newer posts

Older posts