Source

arXiv

5856 posts from this source

Sort:

4 months ago · ai · - · -

[Paper] Unsupervised Learning of Density Estimates with Topological Optimization

Kernel density estimation is a key component of a wide variety of algorithms in machine learning, Bayesian inference, stochastic dynamics and signal processing....

#research #paper #ai #machine-learning
4 months ago · ai · - · -

[Paper] Revisiting the Scaling Properties of Downstream Metrics in Large Language Model Training

While scaling laws for Large Language Models (LLMs) traditionally focus on proxy metrics like pretraining loss, predicting downstream task performance has been ...

#research #paper #ai #machine-learning #nlp
4 months ago · ai · - · -

[Paper] Toward Faithful Retrieval-Augmented Generation with Sparse Autoencoders

Retrieval-Augmented Generation (RAG) improves the factuality of large language models (LLMs) by grounding outputs in retrieved evidence, but faithfulness failur...

#research #paper #ai #machine-learning #nlp
4 months ago · ai · - · -

[Paper] No Labels, No Problem: Training Visual Reasoners with Multimodal Verifiers

Visual reasoning is challenging, requiring both precise object grounding and understanding complex spatial relationships. Existing methods fall into two camps: ...

#research #paper #ai #machine-learning #computer-vision
4 months ago · ai · - · -

[Paper] Accelerated Rotation-Invariant Convolution for UAV Image Segmentation

Rotation invariance is essential for precise, object-level segmentation in UAV aerial imagery, where targets can have arbitrary orientations and exhibit fine-sc...

#research #paper #ai #computer-vision
4 months ago · ai · - · -

[Paper] Explainable Anomaly Detection for Industrial IoT Data Streams

Industrial maintenance is being transformed by the Internet of Things and edge computing, generating continuous data streams that demand real-time, adaptive dec...

#research #paper #ai #machine-learning
4 months ago · ai · - · -

[Paper] Decentralized Trust for Space AI: Blockchain-Based Federated Learning Across Multi-Vendor LEO Satellite Networks

The rise of space AI is reshaping government and industry through applications such as disaster detection, border surveillance, and climate monitoring, powered ...

#research #paper #ai #machine-learning
4 months ago · ai · - · -

[Paper] SATGround: A Spatially-Aware Approach for Visual Grounding in Remote Sensing

Vision-language models (VLMs) are emerging as powerful generalist tools for remote sensing, capable of integrating information across diverse tasks and enabling...

#research #paper #ai #computer-vision
4 months ago · ai · - · -

[Paper] DAO-GP Drift Aware Online Non-Linear Regression Gaussian-Process

Real-world datasets often exhibit temporal dynamics characterized by evolving data distributions. Disregarding this phenomenon, commonly referred to as concept ...

#research #paper #ai #machine-learning
4 months ago · ai · - · -

[Paper] When Tables Leak: Attacking String Memorization in LLM-Based Tabular Data Generation

Large Language Models (LLMs) have recently demonstrated remarkable performance in generating high-quality tabular synthetic data. In practice, two primary appro...

#research #paper #ai #machine-learning
4 months ago · ai · - · -

[Paper] Siamese-Driven Optimization for Low-Resolution Image Latent Embedding in Image Captioning

Image captioning is essential in many fields including assisting visually impaired individuals, improving content management systems, and enhancing human-comput...

#research #paper #ai #machine-learning #computer-vision
4 months ago · ai · - · -

[Paper] Fed-SE: Federated Self-Evolution for Privacy-Constrained Multi-Environment LLM Agents

LLM agents are widely deployed in complex interactive tasks, yet privacy constraints often preclude centralized optimization and co-evolution across dynamic env...

#research #paper #ai #machine-learning
4 months ago · software · - · -

[Paper] SimpleDevQA: Benchmarking Large Language Models on Development Knowledge QA

The Development Knowledge Question Answering (Dev Knowledge QA) task aims to provide natural language answers to knowledge-seeking questions during software dev...

#research #paper #software
4 months ago · ai · - · -

[Paper] Do Depth-Grown Models Overcome the Curse of Depth? An In-Depth Analysis

Gradually growing the depth of Transformers during training can not only reduce training cost but also lead to improved reasoning performance, as shown by MIDAS...

#research #paper #ai #machine-learning #nlp
4 months ago · ai · - · -

[Paper] Ask, Answer, and Detect: Role-Playing LLMs for Personality Detection with Question-Conditioned Mixture-of-Experts

Understanding human personality is crucial for web applications such as personalized recommendation and mental health assessment. Existing studies on personalit...

#research #paper #ai #nlp
4 months ago · ai · - · -

[Paper] Multicalibration for LLM-based Code Generation

As AI-based code generation becomes widespread, researchers are investigating the calibration of code LLMs - ensuring their confidence scores faithfully represe...

#research #paper #ai #machine-learning
4 months ago · ai · - · -

[Paper] Democratizing ML for Enterprise Security: A Self-Sustained Attack Detection Framework

Despite advancements in machine learning for security, rule-based detection remains prevalent in Security Operations Centers due to the resource intensiveness a...

#research #paper #ai #machine-learning
4 months ago · ai · - · -

[Paper] Can TabPFN Compete with GNNs for Node Classification via Graph Tabularization?

Foundation models pretrained on large data have demonstrated remarkable zero-shot generalization capabilities across domains. Building on the success of TabPFN ...

#research #paper #ai #machine-learning
4 months ago · ai · - · -

[Paper] MatteViT: High-Frequency-Aware Document Shadow Removal with Shadow Matte Guidance

Document shadow removal is essential for enhancing the clarity of digitized documents. Preserving high-frequency details (e.g., text edges and lines) is critica...

#research #paper #ai #machine-learning #computer-vision
4 months ago · ai · - · -

[Paper] A Systematic Evaluation of Preference Aggregation in Federated RLHF for Pluralistic Alignment of LLMs

This paper addresses the challenge of aligning large language models (LLMs) with diverse human preferences within federated learning (FL) environments, where st...

#research #paper #ai #machine-learning #nlp
4 months ago · ai · - · -

[Paper] Fluent Alignment with Disfluent Judges: Post-training for Lower-resource Languages

We propose a post-training method for lower-resource languages that preserves fluency of language models even when aligned by disfluent reward models. Preferenc...

#research #paper #ai #machine-learning #nlp
4 months ago · ai · - · -

[Paper] Skewness-Guided Pruning of Multimodal Swin Transformers for Federated Skin Lesion Classification on Edge Devices

In recent years, high-performance computer vision models have achieved remarkable success in medical imaging, with some skin lesion classification systems even ...

#research #paper #ai #computer-vision
4 months ago · ai · - · -

[Paper] Pose-Based Sign Language Spotting via an End-to-End Encoder Architecture

Automatic Sign Language Recognition (ASLR) has emerged as a vital field for bridging the gap between deaf and hearing communities. However, the problem of sign-...

#research #paper #ai #nlp #computer-vision
4 months ago · devops · - · -

[Paper] A Task Parallel Orthonormalization Multigrid Method For Multiphase Elliptic Problems

Multigrid methods have been a popular approach for solving linear systems arising from the discretization of partial differential equations (PDEs) for several d...

#research #paper #devops
4 months ago · devops · - · -

[Paper] Spatio-Temporal Shifting to Reduce Carbon, Water, and Land-Use Footprints of Cloud Workloads

In this paper, we investigate the potential of spatial and temporal cloud workload shifting to reduce carbon, water, and land-use footprints. Specifically, we p...

#research #paper #devops
4 months ago · ai · - · -

[Paper] Automatic Essay Scoring and Feedback Generation in Basque Language Learning

This paper introduces the first publicly available dataset for Automatic Essay Scoring (AES) and feedback generation in Basque, targeting the CEFR C1 proficienc...

#research #paper #ai #machine-learning #nlp
4 months ago · software · - · -

[Paper] RESTifAI: LLM-Based Workflow for Reusable REST API Testing

With this paper, we introduce RESTifAI, an LLM-driven approach for generating reusable, CI/CD ready REST API tests, following the happy-path approach. Unlike ex...

#research #paper #software
4 months ago · devops · - · -

[Paper] Model-based Testing of Practical Distributed Systems in Actor Model

Designing and implementing distributed systems correctly can be quite challenging. Although these systems are often accompanied by formal specifications that ar...

#research #paper #devops
4 months ago · ai · - · -

[Paper] An Agentic AI System for Multi-Framework Communication Coding

Clinical communication is central to patient outcomes, yet large-scale human annotation of patient-provider conversation remains labor-intensive, inconsistent, ...

#research #paper #ai #machine-learning #nlp
4 months ago · ai · - · -

[Paper] Reusability in MLOps: Leveraging Ports and Adapters to Build a Microservices Architecture for the Maritime Domain

ML-Enabled Systems (MLES) are inherently complex since they require multiple components to achieve their business goal. This experience report showcases the sof...

#research #paper #ai #machine-learning
4 months ago · ai · - · -

[Paper] QSTN: A Modular Framework for Robust Questionnaire Inference with Large Language Models

We introduce QSTN, an open-source Python framework for systematically generating responses from questionnaire-style prompts to support in-silico surveys and ann...

#research #paper #ai #nlp
4 months ago · software · - · -

[Paper] Inferring Causal Relationships to Improve Caching for Clients with Correlated Requests: Applications to VR

Efficient edge caching reduces latency and alleviates backhaul congestion in modern networks. Traditional caching policies, such as Least Recently Used (LRU) an...

#research #paper #software
4 months ago · ai · - · -

[Paper] Long-Sequence LSTM Modeling for NBA Game Outcome Prediction Using a Novel Multi-Season Dataset

Predicting the outcomes of professional basketball games, particularly in the National Basketball Association (NBA), has become increasingly important for coach...

#research #paper #ai #machine-learning
4 months ago · devops · - · -

[Paper] Basic Lock Algorithms in Lightweight Thread Environments

Traditionally, multithreaded data structures have been designed for access by the threads of Operating Systems (OS). However, implementations for access by prog...

#research #paper #devops
4 months ago · devops · - · -

[Paper] A scalable high-order multigrid-FFT Poisson solver for unbounded domains on adaptive multiresolution grids

Multigrid solvers are among the most efficient methods for solving the Poisson equation, which is ubiquitous in computational physics. For example, in the conte...

#research #paper #devops
4 months ago · software · - · -

[Paper] Gamification with Purpose: What Learners Prefer to Motivate Their Learning

This study investigates learners' preferences for game design elements (GDEs) in educational contexts to inform the development of purpose-driven gamification s...

#research #paper #software
4 months ago · software · - · -

[Paper] Measuring Computer Science Enthusiasm: A Questionnaire-Based Analysis of Age and Gender Effects on Students' Interest

This study offers new insights into students' interest in computer science (CS) education by disentangling the distinct effects of age and gender across a diver...

#research #paper #software
4 months ago · software · - · -

[Paper] Measuring Agile Agreement: Development and Validation of the Manifesto and Principle Scales

While the importance of human factors in agile software development is widely acknowledged, the measurement of an individual's 'agile agreement' remains an ill-...

#research #paper #software
4 months ago · ai · - · -

[Paper] Magneton: Optimizing Energy Efficiency of ML Systems via Differential Energy Debugging

The training and deployment of machine learning (ML) models have become extremely energy-intensive. While existing optimization efforts focus primarily on hardw...

#research #paper #ai #machine-learning
4 months ago · ai · - · -

[Paper] Conditional Morphogenesis: Emergent Generation of Structural Digits via Neural Cellular Automata

Biological systems exhibit remarkable morphogenetic plasticity, where a single genome can encode various specialized cellular structures triggered by local chem...

#research #paper #ai #machine-learning #computer-vision
4 months ago · devops · - · -

[Paper] Emulation of Complex Matrix Multiplication based on the Chinese Remainder Theorem

Modern computing architectures feature low-precision matrix multiplication units that achieve substantially higher throughput than their high-precision counterp...

#research #paper #devops
4 months ago · software · - · -

[Paper] Exposing and Defending Membership Leakage in Vulnerability Prediction Models

Neural models for vulnerability prediction (VP) have achieved impressive performance by learning from large-scale code repositories. However, their susceptibili...

#research #paper #software
4 months ago · devops · - · -

[Paper] Synergizing Monetization, Orchestration, and Semantics in Computing Continuum

Industry demands are growing for hyper-distributed applications that span from the cloud to the edge in domains such as smart manufacturing, transportation, and...

#research #paper #devops
4 months ago · ai · - · -

[Paper] Long-only cryptocurrency portfolio management by ranking the assets: a neural network approach

This paper will propose a novel machine learning based portfolio management method in the context of the cryptocurrency market. Previous researchers mainly focu...

#research #paper #ai #machine-learning
4 months ago · ai · - · -

[Paper] Resonant and Stochastic Vibration in Neurorehabilitation

Neurological injuries and age-related decline can impair sensory processing and disrupt motor coordination, gait, and balance. As mechanisms of neuroplasticity ...

#research #paper #ai
4 months ago · ai · - · -

[Paper] Towards symbolic regression for interpretable clinical decision scores

Medical decision-making makes frequent use of algorithms that combine risk equations with rules, providing clear and standardized treatment pathways. Symbolic r...

#research #paper #ai #machine-learning
4 months ago · ai · - · -

[Paper] Voxify3D: Pixel Art Meets Volumetric Rendering

Voxel art is a distinctive stylization widely used in games and digital media, yet automated generation from 3D meshes remains challenging due to conflicting re...

#research #paper #ai #computer-vision
4 months ago · ai · - · -

[Paper] Relational Visual Similarity

Humans do not just see attribute similarity -- we also see relational similarity. An apple is like a peach because both are reddish fruit, but the Earth is also...

#research #paper #ai #machine-learning #computer-vision

Newer posts

Older posts