Source

arXiv

5804 posts from this source

Sort:

4 months ago · ai · - · -

[Paper] Splitwise: Collaborative Edge-Cloud Inference for LLMs via Lyapunov-Assisted DRL

Deploying large language models (LLMs) on edge devices is challenging due to their limited memory and power resources. Cloud-only inference reduces device burde...

#research #paper #ai #machine-learning
4 months ago · ai · - · -

[Paper] MedGemma vs GPT-4: Open-Source and Proprietary Zero-shot Medical Disease Classification from Images

Multimodal Large Language Models (LLMs) introduce an emerging paradigm for medical imaging by interpreting scans through the lens of extensive clinical knowledg...

#research #paper #ai #machine-learning #computer-vision
4 months ago · ai · - · -

[Paper] AI4Reading: Chinese Audiobook Interpretation System Based on Multi-Agent Collaboration

Audiobook interpretations are attracting increasing attention, as they provide accessible and in-depth analyses of books that offer readers practical insights a...

#research #paper #ai #nlp
4 months ago · ai · - · -

[Paper] Chinese Morph Resolution in E-commerce Live Streaming Scenarios

E-commerce live streaming in China, particularly on platforms like Douyin, has become a major sales channel, but hosts often use morphs to evade scrutiny and en...

#research #paper #ai #nlp
4 months ago · ai · - · -

[Paper] Interpretable Safety Alignment via SAE-Constructed Low-Rank Subspace Adaptation

Parameter-efficient fine-tuning has become the dominant paradigm for adapting large language models to downstream tasks. Low-rank adaptation methods such as LoR...

#research #paper #ai #machine-learning #nlp
4 months ago · ai · - · -

[Paper] FairGFL: Privacy-Preserving Fairness-Aware Federated Learning with Overlapping Subgraphs

Graph federated learning enables the collaborative extraction of high-order information from distributed subgraphs while preserving the privacy of raw data. How...

#research #paper #ai #machine-learning
4 months ago · ai · - · -

[Paper] Anka: A Domain-Specific Language for Reliable LLM Code Generation

Large Language Models (LLMs) have demonstrated remarkable capabilities in code generation, yet they exhibit systematic errors on complex, multi-step programming...

#research #paper #ai #machine-learning #nlp
4 months ago · ai · - · -

[Paper] Scoring, Reasoning, and Selecting the Best! Ensembling Large Language Models via a Peer-Review Process

We propose LLM-PeerReview, an unsupervised LLM Ensemble method that selects the most ideal response from multiple LLM-generated candidates for each query, harne...

#research #paper #ai #machine-learning #nlp
4 months ago · ai · - · -

[Paper] Osmotic Learning: A Self-Supervised Paradigm for Decentralized Contextual Data Representation

Data within a specific context gains deeper significance beyond its isolated interpretation. In distributed systems, interdependent data sources reveal hidden r...

#research #paper #ai #machine-learning
4 months ago · ai · - · -

[Paper] See Less, See Right: Bi-directional Perceptual Shaping For Multimodal Reasoning

Large vision-language models (VLMs) often benefit from intermediate visual cues, either injected via external tools or generated as latent visual tokens during ...

#research #paper #ai #computer-vision
4 months ago · ai · - · -

[Paper] ProEdit: Inversion-based Editing From Prompts Done Right

Inversion-based visual editing provides an effective and training-free way to edit an image or a video based on user instructions. Existing methods typically in...

#research #paper #ai #computer-vision
4 months ago · ai · - · -

[Paper] Agentic Structured Graph Traversal for Root Cause Analysis of Code-related Incidents in Cloud Applications

Cloud incidents pose major operational challenges in production, with unresolved production cloud incidents cost on average over $2M per hour. Prior research id...

#research #paper #ai #machine-learning
4 months ago · ai · - · -

[Paper] Pruning as a Game: Equilibrium-Driven Sparsification of Neural Networks

Neural network pruning is widely used to reduce model size and computational cost. Yet, most existing methods treat sparsity as an externally imposed constraint...

#research #paper #ai #machine-learning
4 months ago · ai · - · -

[Paper] Learning Association via Track-Detection Matching for Multi-Object Tracking

Multi-object tracking aims to maintain object identities over time by associating detections across video frames. Two dominant paradigms exist in literature: tr...

#research #paper #ai #computer-vision
4 months ago · ai · - · -

[Paper] Explainable Multimodal Regression via Information Decomposition

Multimodal regression aims to predict a continuous target from heterogeneous input sources and typically relies on fusion strategies such as early or late fusio...

#research #paper #ai #machine-learning
4 months ago · ai · - · -

[Paper] A2P-Vis: an Analyzer-to-Presenter Agentic Pipeline for Visual Insights Generation and Reporting

Automating end-to-end data science pipeline with AI agents still stalls on two gaps: generating insightful, diverse visual evidence and assembling it into a coh...

#research #paper #ai #machine-learning #nlp
4 months ago · ai · - · -

[Paper] Introducing TrGLUE and SentiTurca: A Comprehensive Benchmark for Turkish General Language Understanding and Sentiment Analysis

Evaluating the performance of various model architectures, such as transformers, large language models (LLMs), and other NLP systems, requires comprehensive ben...

#research #paper #ai #machine-learning #nlp
4 months ago · ai · - · -

[Paper] Yume-1.5: A Text-Controlled Interactive World Generation Model

Recent approaches have demonstrated the promise of using diffusion models to generate interactive and explorable worlds. However, most of these methods face cri...

#research #paper #ai #computer-vision
4 months ago · ai · - · -

[Paper] Unifying Learning Dynamics and Generalization in Transformers Scaling Law

The scaling law, a cornerstone of Large Language Model (LLM) development, predicts improvements in model performance with increasing computational resources. Ye...

#research #paper #ai #machine-learning #nlp
4 months ago · ai · - · -

[Paper] Context as a Tool: Context Management for Long-Horizon SWE-Agents

Agents based on large language models have recently shown strong potential on real-world software engineering (SWE) tasks that require long-horizon interaction ...

#research #paper #ai #nlp
4 months ago · ai · - · -

[Paper] A Frobenius-Optimal Projection for Enforcing Linear Conservation in Learned Dynamical Models

We consider the problem of restoring linear conservation laws in data-driven linear dynamical models. Given a learned operator widehat{A} and a full-rank constr...

#research #paper #ai #machine-learning
4 months ago · ai · - · -

[Paper] Scaling Adversarial Training via Data Selection

Projected Gradient Descent (PGD) is a strong and widely used first-order adversarial attack, yet its computational cost scales poorly, as all training samples u...

#research #paper #ai #machine-learning
4 months ago · ai · - · -

[Paper] Prefill vs. Decode Bottlenecks: SRAM-Frequency Tradeoffs and the Memory-Bandwidth Ceiling

Energy consumption dictates the cost and environmental impact of deploying Large Language Models. This paper investigates the impact of on-chip SRAM size and op...

#research #paper #ai #machine-learning
4 months ago · ai · - · -

[Paper] StreamAvatar: Streaming Diffusion Models for Real-Time Interactive Human Avatars

Real-time, streaming interactive avatars represent a critical yet challenging goal in digital human research. Although diffusion-based human avatar generation m...

#research #paper #ai #machine-learning #computer-vision
4 months ago · ai · - · -

[Paper] Toward Secure and Compliant AI: Organizational Standards and Protocols for NLP Model Lifecycle Management

Natural Language Processing (NLP) systems are increasingly used in sensitive domains such as healthcare, finance, and government, where they handle large volume...

#research #paper #ai #nlp
4 months ago · ai · - · -

[Paper] Why Smooth Stability Assumptions Fail for ReLU Learning

Stability analyses of modern learning systems are frequently derived under smoothness assumptions that are violated by ReLU-type nonlinearities. In this note, w...

#research #paper #ai #machine-learning
4 months ago · devops · - · -

[Paper] Proceedings First Workshop on Adaptable Cloud Architectures

This volume contains the post-proceedings of the Workshop on Adaptable Cloud Architectures (WACA 2025), held on June 20, 2025, in Lille, France, co-located with...

#research #paper #devops
4 months ago · ai · - · -

[Paper] MAI-UI Technical Report: Real-World Centric Foundation GUI Agents

The development of GUI agents could revolutionize the next generation of human-computer interaction. Motivated by this vision, we present MAI-UI, a family of fo...

#research #paper #ai #computer-vision
4 months ago · ai · - · -

[Paper] Backdoor Attacks on Prompt-Driven Video Segmentation Foundation Models

Prompt-driven Video Segmentation Foundation Models (VSFMs) such as SAM2 are increasingly deployed in applications like autonomous driving and digital pathology,...

#research #paper #ai #computer-vision
4 months ago · software · - · -

[Paper] HALF: Process Hollowing Analysis Framework for Binary Programs with the Assistance of Kernel Modules

Binary program analysis is still very important in system security. There are many practical achievements in binary code analysis, but fine-grained analysis suc...

#research #paper #software
4 months ago · devops · - · -

[Paper] FUSCO: High-Performance Distributed Data Shuffling via Transformation-Communication Fusion

Large-scale Mixture-of-Experts (MoE) models rely on expert parallelism for efficient training and inference, which splits experts across devices and necessitate...

#research #paper #devops
4 months ago · devops · - · -

[Paper] Robust Federated Fine-Tuning in Heterogeneous Networks with Unreliable Connections: An Aggregation View

Federated Fine-Tuning (FFT) has attracted growing interest as it leverages both server- and client-side data to enhance global model generalization while preser...

#research #paper #devops
4 months ago · ai · - · -

[Paper] From In Silico to In Vitro: Evaluating Molecule Generative Models for Hit Generation

Hit identification is a critical yet resource-intensive step in the drug discovery pipeline, traditionally relying on high-throughput screening of large compoun...

#research #paper #ai #machine-learning
4 months ago · ai · - · -

[Paper] LibContinual: A Comprehensive Library towards Realistic Continual Learning

A fundamental challenge in Continual Learning (CL) is catastrophic forgetting, where adapting to new tasks degrades the performance on previous ones. While the ...

#research #paper #ai #machine-learning
4 months ago · ai · - · -

[Paper] Patch-Discontinuity Mining for Generalized Deepfake Detection

The rapid advancement of generative artificial intelligence has enabled the creation of highly realistic fake facial images, posing serious threats to personal ...

#research #paper #ai #computer-vision
4 months ago · ai · - · -

[Paper] Direction Finding with Sparse Arrays Based on Variable Window Size Spatial Smoothing

In this work, we introduce a variable window size (VWS) spatial smoothing framework that enhances coarray-based direction of arrival (DOA) estimation for sparse...

#research #paper #ai #machine-learning
4 months ago · ai · - · -

[Paper] Meta-Learning-Based Handover Management in NextG O-RAN

While traditional handovers (THOs) have served as a backbone for mobile connectivity, they increasingly suffer from failures and delays, especially in dense dep...

#research #paper #ai #machine-learning
4 months ago · ai · - · -

[Paper] SketchPlay: Intuitive Creation of Physically Realistic VR Content with Gesture-Driven Sketching

Creating physically realistic content in VR often requires complex modeling tools or predefined 3D models, textures, and animations, which present significant b...

#research #paper #ai #computer-vision
4 months ago · ai · - · -

[Paper] LongFly: Long-Horizon UAV Vision-and-Language Navigation with Spatiotemporal Context Integration

Unmanned aerial vehicles (UAVs) are crucial tools for post-disaster search and rescue, facing challenges such as high information density, rapid changes in view...

#research #paper #ai #machine-learning #computer-vision
4 months ago · devops · - · -

[Paper] BLEST: Blazingly Efficient BFS using Tensor Cores

Breadth-First Search (BFS) is a fundamental graph kernel that underpins a wide range of applications. While modern GPUs provide specialised Matrix-Multiply-Accu...

#research #paper #devops
4 months ago · ai · - · -

[Paper] Self-attention vector output similarities reveal how machines pay attention

The self-attention mechanism has significantly advanced the field of natural language processing, facilitating the development of advanced language-learning mac...

#research #paper #ai #nlp
4 months ago · ai · - · -

[Paper] Broken Words, Broken Performance: Effect of Tokenization on Performance of LLMs

Tokenization is the first step in training any Large Language Model (LLM), where the text is split into a sequence of tokens as per the model's fixed vocabulary...

#research #paper #ai #nlp
4 months ago · ai · - · -

[Paper] SWE-RM: Execution-free Feedback For Software Engineering Agents

Execution-based feedback like unit testing is widely used in the development of coding agents through test-time scaling (TTS) and reinforcement learning (RL). T...

#research #paper #ai #nlp
4 months ago · ai · - · -

[Paper] Accelerate Speculative Decoding with Sparse Computation in Verification

Speculative decoding accelerates autoregressive language model inference by verifying multiple draft tokens in parallel. However, the verification stage often b...

#research #paper #ai #nlp
4 months ago · ai · - · -

[Paper] Explainable Statute Prediction via Attention-based Model and LLM Prompting

In this paper, we explore the problem of automatic statute prediction where for a given case description, a subset of relevant statutes are to be predicted. Her...

#research #paper #ai #nlp
4 months ago · ai · - · -

[Paper] Optimizing Resource Allocation for Geographically-Distributed Inference by Large Language Models

Large language models have demonstrated extraordinary performance in many AI tasks but are expensive to use, even after training, due to their requirement of hi...

#research #paper #ai #machine-learning
4 months ago · devops · - · -

[Paper] LIME:Accelerating Collaborative Lossless LLM Inference on Memory-Constrained Edge Devices

Large language models (LLMs) have emerged as a powerful foundation for intelligent reasoning and decision-making, demonstrating substantial impact across a wide...

#research #paper #devops
4 months ago · ai · - · -

[Paper] Conserved active information

We introduce conserved active information I^oplus, a symmetric extension of active information that quantifies net information gain/loss across the entire searc...

#research #paper #ai

Newer posts

Older posts