Source

arXiv

5861 posts from this source

Sort:

5 months ago · devops · - · -

[Paper] MAD-DAG: Protecting Blockchain Consensus from MEV

Blockchain security is threatened by selfish mining, where a miner (operator) deviates from the protocol to increase their revenue. Selfish mining is exacerbate...

#research #paper #devops
5 months ago · ai · - · -

[Paper] MMA: A Momentum Mamba Architecture for Human Activity Recognition with Inertial Sensors

Human activity recognition (HAR) from inertial sensors is essential for ubiquitous computing, mobile health, and ambient intelligence. Conventional deep models ...

#research #paper #ai #machine-learning
5 months ago · ai · - · -

[Paper] Video Generation Models Are Good Latent Reward Models

Reward feedback learning (ReFL) has proven effective for aligning image generation with human preferences. However, its extension to video generation faces sign...

#research #paper #ai #computer-vision
5 months ago · ai · - · -

[Paper] Context-Specific Causal Graph Discovery with Unobserved Contexts: Non-Stationarity, Regimes and Spatio-Temporal Patterns

Real-world data, for example in climate applications, often consists of spatially gridded time series data or data with comparable structure. While the underlyi...

#causal discovery #non‑stationary data #context‑specific graphs #machine learning
5 months ago · devops · - · -

[Paper] Modeling the Effect of Data Redundancy on Speedup in MLFMA Near-Field Computation

The near-field (P2P) operator in the Multilevel Fast Multipole Algorithm (MLFMA) is a performance bottleneck on GPUs due to poor memory locality. This work intr...

#research #paper #devops
5 months ago · ai · - · -

[Paper] Bangla Sign Language Translation: Dataset Creation Challenges, Benchmarking and Prospects

Bangla Sign Language Translation (BdSLT) has been severely constrained so far as the language itself is very low resource. Standard sentence level dataset creat...

#sign-language #dataset #translation #computer-vision #benchmark
5 months ago · ai · - · -

[Paper] Predictive Safety Shield for Dyna-Q Reinforcement Learning

Obtaining safety guarantees for reinforcement learning is a major challenge to achieve applicability for real-world tasks. Safety shields extend standard reinfo...

#research #paper #ai #machine-learning
5 months ago · ai · - · -

[Paper] The Age-specific Alzheimer 's Disease Prediction with Characteristic Constraints in Nonuniform Time Span

Alzheimer's disease is a debilitating disorder marked by a decline in cognitive function. Timely identification of the disease is essential for the development ...

#research #paper #ai #computer-vision
5 months ago · ai · - · -

[Paper] Phase Transition for Stochastic Block Model with more than sqrt(n) Communities (II)

A fundamental theoretical question in network analysis is to determine under which conditions community recovery is possible in polynomial time in the Stochasti...

#research #paper #ai #machine-learning
5 months ago · ai · - · -

[Paper] EoS-FM: Can an Ensemble of Specialist Models act as a Generalist Feature Extractor?

Recent advances in foundation models have shown great promise in domains such as natural language processing and computer vision, and similar efforts are now em...

#ensemble learning #remote sensing #foundation models #computer vision #sustainability
5 months ago · ai · - · -

[Paper] Pessimistic Verification for Open Ended Math Questions

The key limitation of the verification performance lies in the ability of error detection. With this intuition we designed several variants of pessimistic verif...

#research #paper #ai #machine-learning
5 months ago · ai · - · -

[Paper] Self-Paced Learning for Images of Antinuclear Antibodies

Antinuclear antibody (ANA) testing is a crucial method for diagnosing autoimmune disorders, including lupus, Sjögren's syndrome, and scleroderma. Despite its im...

#research #paper #ai #computer-vision
5 months ago · ai · - · -

[Paper] Voice, Bias, and Coreference: An Interpretability Study of Gender in Speech Translation

Unlike text, speech conveys information about the speaker, such as gender, through acoustic cues like pitch. This gives rise to modality-specific bias concerns....

#research #paper #ai #machine-learning #nlp
5 months ago · ai · - · -

[Paper] Mechanistic Interpretability for Transformer-based Time Series Classification

Transformer-based models have become state-of-the-art tools in various machine learning tasks, including time series classification, yet their complexity makes ...

#research #paper #ai #machine-learning
5 months ago · ai · - · -

[Paper] IntAttention: A Fully Integer Attention Pipeline for Efficient Edge Inference

Deploying Transformer models on edge devices is limited by latency and energy budgets. While INT8 quantization effectively accelerates the primary matrix multip...

#research #paper #ai #machine-learning
5 months ago · ai · - · -

[Paper] Tool-RoCo: An Agent-as-Tool Self-organization Large Language Model Benchmark in Multi-robot Cooperation

This study proposes Tool-RoCo, a novel benchmark for evaluating large language models (LLMs) in long-term multi-agent cooperation based on RoCo, a multi-robot c...

#research #paper #ai #machine-learning
5 months ago · software · - · -

[Paper] SV-LIB 1.0: A Standard Exchange Format for Software-Verification Tasks

In the past two decades, significant research and development effort went into the development of verification tools for individual languages, such asC, C++, an...

#research #paper #software
5 months ago · ai · - · -

[Paper] Generalized Design Choices for Deepfake Detectors

The effectiveness of deepfake detection methods often depends less on their core design and more on implementation details such as data preprocessing, augmentat...

#deepfake detection #computer vision #benchmarking #model optimization
5 months ago · ai · - · -

[Paper] CanKD: Cross-Attention-based Non-local operation for Feature-based Knowledge Distillation

We propose Cross-Attention-based Non-local Knowledge Distillation (CanKD), a novel feature-based knowledge distillation framework that leverages cross-attention...

#knowledge distillation #cross-attention #computer vision #model compression #deep learning
5 months ago · ai · - · -

[Paper] Lost in Time? A Meta-Learning Framework for Time-Shift-Tolerant Physiological Signal Transformation

Translating non-invasive signals such as photoplethysmography (PPG) and ballistocardiography (BCG) into clinically meaningful signals like arterial blood pressu...

#research #paper #ai #machine-learning
5 months ago · ai · - · -

[Paper] Merge and Bound: Direct Manipulations on Weights for Class Incremental Learning

We present a novel training approach, named Merge-and-Bound (M&B) for Class Incremental Learning (CIL), which directly manipulates model weights in the para...

#research #paper #ai #machine-learning #computer-vision
5 months ago · ai · - · -

[Paper] Frequency-Aware Token Reduction for Efficient Vision Transformer

Vision Transformers have demonstrated exceptional performance across various computer vision tasks, yet their quadratic computational complexity concerning toke...

#vision transformers #token reduction #frequency-aware pruning #computer vision #model efficiency
5 months ago · ai · - · -

[Paper] MobileI2V: Fast and High-Resolution Image-to-Video on Mobile Devices

Recently, video generation has witnessed rapid advancements, drawing increasing attention to image-to-video (I2V) synthesis on mobile devices. However, the subs...

#research #paper #ai #computer-vision
5 months ago · ai · - · -

[Paper] Going with the Speed of Sound: Pushing Neural Surrogates into Highly-turbulent Transonic Regimes

The widespread use of neural surrogates in automotive aerodynamics, enabled by datasets such as DrivAerML and DrivAerNet++, has primarily focused on bluff-body ...

#neural surrogates #transonic aerodynamics #CFD dataset #machine learning for fluid dynamics #AB‑UPT
5 months ago · ai · - · -

[Paper] Hierarchical Ranking Neural Network for Long Document Readability Assessment

Readability assessment aims to evaluate the reading difficulty of a text. In recent years, while deep learning technology has been gradually applied to readabil...

#research #paper #ai #machine-learning #nlp
5 months ago · ai · - · -

[Paper] SpatialBench: Benchmarking Multimodal Large Language Models for Spatial Cognition

Spatial cognition is fundamental to real-world multimodal intelligence, allowing models to effectively interact with the physical environment. While multimodal ...

#research #paper #ai #machine-learning
5 months ago · ai · - · -

[Paper] Mean-Field Limits for Two-Layer Neural Networks Trained with Consensus-Based Optimization

We study two-layer neural networks and train these with a particle-based method called consensus-based optimization (CBO). We compare the performance of CBO aga...

#research #paper #ai #machine-learning
5 months ago · ai · - · -

[Paper] Ensemble Performance Through the Lens of Linear Independence of Classifier Votes in Data Streams

Ensemble learning improves classification performance by combining multiple base classifiers. While increasing the number of classifiers generally enhances accu...

#ensemble learning #data streams #linear independence #machine learning research #model sizing
5 months ago · ai · - · -

[Paper] MADRA: Multi-Agent Debate for Risk-Aware Embodied Planning

Ensuring the safety of embodied AI agents during task planning is critical for real-world deployment, especially in household environments where dangerous instr...

#research #paper #ai #machine-learning
5 months ago · ai · - · -

[Paper] EvRainDrop: HyperGraph-guided Completion for Effective Frame and Event Stream Aggregation

Event cameras produce asynchronous event streams that are spatially sparse yet temporally dense. Mainstream event representation learning algorithms typically u...

#event cameras #hypergraph neural network #multimodal fusion #computer vision #deep learning
5 months ago · ai · - · -

[Paper] A Systematic Study of Model Merging Techniques in Large Language Models

Model merging combines multiple fine-tuned checkpoints into a single model without additional training, offering an attractive approach to reusing models and ef...

#model merging #large language models #task arithmetic #LLM research #benchmarking
5 months ago · devops · - · -

[Paper] MemFine: Memory-Aware Fine-Grained Scheduling for MoE Training

The training of large-scale Mixture of Experts (MoE) models faces a critical memory bottleneck due to severe load imbalance caused by dynamic token routing. Thi...

#research #paper #devops
5 months ago · ai · - · -

[Paper] From Observation to Action: Latent Action-based Primitive Segmentation for VLA Pre-training in Industrial Settings

We present a novel unsupervised framework to unlock vast unlabeled human demonstration data from continuous industrial video streams for Vision-Language-Action ...

#unsupervised video segmentation #action primitives #vision-language-action #industrial AI #latent action tokenization
5 months ago · ai · - · -

[Paper] E-M3RF: An Equivariant Multimodal 3D Re-assembly Framework

3D reassembly is a fundamental geometric problem, and in recent years it has increasingly been challenged by deep learning methods rather than classical optimiz...

#equivariant neural networks #multimodal 3D reconstruction #point cloud processing #computer vision
5 months ago · ai · - · -

[Paper] SAM Guided Semantic and Motion Changed Region Mining for Remote Sensing Change Captioning

Remote sensing change captioning is an emerging and popular research task that aims to describe, in natural language, the content of interest that has changed b...

#research #paper #ai #machine-learning #computer-vision
5 months ago · ai · - · -

[Paper] Odin: Oriented Dual-module Integration for Text-rich Network Representation Learning

Text-attributed graphs require models to effectively combine strong textual understanding with structurally informed reasoning. Existing approaches either rely ...

#research #paper #ai #machine-learning #nlp
5 months ago · ai · - · -

[Paper] DiverseVAR: Balancing Diversity and Quality of Next-Scale Visual Autoregressive Models

We introduce DiverseVAR, a framework that enhances the diversity of text-conditioned visual autoregressive models (VAR) at test time without requiring retrainin...

#visual-autoregressive #image generation #diversity #text-to-image #AI research
5 months ago · ai · - · -

[Paper] SUPN: Shallow Universal Polynomial Networks

Deep neural networks (DNNs) and Kolmogorov-Arnold networks (KANs) are popular methods for function approximation due to their flexibility and expressivity. Howe...

#research #paper #ai #machine-learning
5 months ago · ai · - · -

[Paper] Automated Dynamic AI Inference Scaling on HPC-Infrastructure: Integrating Kubernetes, Slurm and vLLM

Due to rising demands for Artificial Inteligence (AI) inference, especially in higher education, novel solutions utilising existing infrastructure are emerging....

#LLM inference #Kubernetes #Slurm #vLLM #HPC
5 months ago · ai · - · -

[Paper] Subjective Depth and Timescale Transformers: Learning Where and When to Compute

The rigid, uniform allocation of computation in standard Transformer (TF) architectures can limit their efficiency and scalability, particularly for large-scale...

#research #paper #ai #machine-learning #nlp
5 months ago · ai · - · -

[Paper] Text-to-SQL as Dual-State Reasoning: Integrating Adaptive Context and Progressive Generation

Recent divide-and-conquer reasoning approaches, particularly those based on Chain-of-Thought (CoT), have substantially improved the Text-to-SQL capabilities of ...

#research #paper #ai #nlp
5 months ago · ai · - · -

[Paper] Can LLMs extract human-like fine-grained evidence for evidence-based fact-checking?

Misinformation frequently spreads in user comments under online news articles, highlighting the need for effective methods to detect factually incorrect informa...

#LLM #evidence extraction #fact-checking #multilingual dataset #benchmark
5 months ago · ai · - · -

[Paper] Training Introspective Behavior: Fine-Tuning Induces Reliable Internal State Detection in a 7B Model

Lindsey (2025) investigates introspective awareness in language models through four experiments, finding that models can sometimes detect and identify injected ...

#research #paper #ai #machine-learning #nlp
5 months ago · ai · - · -

[Paper] Prune4Web: DOM Tree Pruning Programming for Web Agent

Web automation employs intelligent agents to execute high-level tasks by mimicking human interactions with web interfaces. Despite the capabilities of recent La...

#research #paper #ai #machine-learning #nlp
5 months ago · ai · - · -

[Paper] Do Reasoning Vision-Language Models Inversely Scale in Test-Time Compute? A Distractor-centric Empirical Analysis

How does irrelevant information (i.e., distractors) affect test-time scaling in vision-language models (VLMs)? Prior studies on language models have reported an...

#vision-language models #distractor analysis #inverse scaling #prompt engineering #multimodal reasoning
5 months ago · ai · - · -

[Paper] Monet: Reasoning in Latent Visual Space Beyond Images and Language

'Thinking with images' has emerged as an effective paradigm for advancing visual reasoning, extending beyond text-only chains of thought by injecting visual evi...

#research #paper #ai #machine-learning #computer-vision
5 months ago · software · - · -

[Paper] Large Language Models for Unit Test Generation: Achievements, Challenges, and the Road Ahead

Unit testing is an essential yet laborious technique for verifying software and mitigating regression risks. Although classic automated methods effectively expl...

#research #paper #software
5 months ago · ai · - · -

[Paper] BanglaASTE: A Novel Framework for Aspect-Sentiment-Opinion Extraction in Bangla E-commerce Reviews Using Ensemble Deep Learning

Aspect-Based Sentiment Analysis (ABSA) has emerged as a critical tool for extracting fine-grained sentiment insights from user-generated content, particularly i...

#aspect-based sentiment analysis #Bangla NLP #ensemble deep learning #low-resource languages #dataset release

Newer posts

Older posts