Source

arXiv

5804 posts from this source

Sort:

3 months ago · software · - · -

[Paper] Using LLMs to Evaluate Architecture Documents: Results from a Digital Marketplace Environment

Generative AI plays an increasing role during software engineering activities to make them, e.g., more efficient or provide better quality. However, it is often...

#research #paper #software
3 months ago · ai · - · -

[Paper] ProToken: Token-Level Attribution for Federated Large Language Models

Federated Learning (FL) enables collaborative training of Large Language Models (LLMs) across distributed data sources while preserving privacy. However, when f...

#research #paper #ai #machine-learning
3 months ago · it · - · -

[Paper] Convex Hull 3D Filtering with GPU Ray Tracing and Tensor Cores

In recent years, applications such as real-time simulations, autonomous systems, and video games increasingly demand the processing of complex geometric models ...

#GPU acceleration #convex hull #ray tracing #tensor cores #high-performance computing
3 months ago · software · - · -

[Paper] Who Said CVE? How Vulnerability Identifiers Are Mentioned by Humans, Bots, and Agents in Pull Requests

Vulnerability identifiers such as CVE, CWE, and GHSA are standardised references to known software security issues, yet their use in practice is not well unders...

#research #paper #software
3 months ago · it · - · -

[Paper] DynQ: A Dynamic Topology-Agnostic Quantum Virtual Machine via Quality-Weighted Community Detection

Quantum cloud platforms remain fundamentally non-virtualised: despite rapid hardware scaling, each user program still monopolises an entire quantum processor, p...

#quantum computing #quantum virtual machine #cloud quantum services #hardware virtualization #quality-weighted community detection
3 months ago · devops · - · -

[Paper] Modular Foundation Model Inference at the Edge: Network-Aware Microservice Optimization

Foundation models (FMs) unlock unprecedented multimodal and multitask intelligence, yet their cloud-centric deployment precludes real-time responsiveness and co...

#research #paper #devops
3 months ago · ai · - · -

[Paper] Tournament Informed Adversarial Quality Diversity

Quality diversity (QD) is a branch of evolutionary computation that seeks high-quality and behaviorally diverse solutions to a problem. While adversarial proble...

#research #paper #ai
3 months ago · ai · - · -

[Paper] Rethinking Intelligence: Brain-like Neuron Network

Since their inception, artificial neural networks have relied on manually designed architectures and inductive biases to better adapt to data and tasks. With th...

#research #paper #ai
3 months ago · ai · - · -

[Paper] Posterior Distribution-assisted Evolutionary Dynamic Optimization as an Online Calibrator for Complex Social Simulations

The calibration of simulators for complex social systems aims to identify the optimal parameter that drives the output of the simulator best matching the target...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] ROIDS: Robust Outlier-Aware Informed Down-Sampling

Informed down-sampling (IDS) is known to improve performance in symbolic regression when combined with various selection strategies, especially tournament selec...

#research #paper #ai
3 months ago · devops · - · -

[Paper] Decentralized Nonsmooth Nonconvex Optimization with Client Sampling

This paper considers decentralized nonsmooth nonconvex optimization problem with Lipschitz continuous local functions. We propose an efficient stochastic first-...

#research #paper #devops
3 months ago · ai · - · -

[Paper] Revisiting Parameter Server in LLM Post-Training

Modern data parallel (DP) training favors collective communication over parameter servers (PS) for its simplicity and efficiency under balanced workloads. Howev...

#research #paper #ai #machine-learning
3 months ago · devops · - · -

[Paper] KUBEDIRECT: Unleashing the Full Power of the Cluster Manager for Serverless Computing

FaaS platforms rely on cluster managers like Kubernetes for resource management. Kubernetes is popular due to its state-centric APIs that decouple the control p...

#research #paper #devops
3 months ago · ai · - · -

[Paper] HEATACO: Heatmap-Guided Ant Colony Decoding for Large-Scale Travelling Salesman Problems

Heatmap-based non-autoregressive solvers for large-scale Travelling Salesman Problems output dense edge-probability scores, yet final performance largely hinges...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] ctELM: Decoding and Manipulating Embeddings of Clinical Trials with Embedding Language Models

Text embeddings have become an essential part of a variety of language applications. However, methods for interpreting, exploring and reversing embedding spaces...

#embedding language models #clinical trials #biomedical NLP #synthetic data #large language models
3 months ago · ai · - · -

[Paper] Reuse your FLOPs: Scaling RL on Hard Problems by Conditioning on Very Off-Policy Prefixes

Typical reinforcement learning (RL) methods for LLM reasoning waste compute on hard problems, where correct on-policy traces are rare, policy gradients vanish, ...

#reinforcement learning #large language models #off-policy learning #sample efficiency #prefix conditioning
3 months ago · ai · - · -

[Paper] MEGnifying Emotion: Sentiment Analysis from Annotated Brain Data

Decoding emotion from brain activity could unlock a deeper understanding of the human experience. While a number of existing datasets align brain data with spee...

#sentiment analysis #brain-computer interface #MEG #machine learning #neuroscience
3 months ago · ai · - · -

[Paper] Subword-Based Comparative Linguistics across 242 Languages Using Wikipedia Glottosets

We present a large-scale comparative study of 242 Latin and Cyrillic-script languages using subword-based methodologies. By constructing 'glottosets' from Wikip...

#subword segmentation #BPE #multilingual NLP #comparative linguistics #language similarity
3 months ago · ai · - · -

[Paper] MortalMATH: Evaluating the Conflict Between Reasoning Objectives and Emergency Contexts

Large Language Models are increasingly optimized for deep reasoning, prioritizing the correct execution of complex tasks over general conversation. We investiga...

#large language models #AI safety #reasoning benchmarks #emergency response #natural language processing
3 months ago · ai · - · -

[Paper] Unsupervised Text Segmentation via Kernel Change-Point Detection on Sentence Embeddings

Unsupervised text segmentation is crucial because boundary labels are expensive, subjective, and often fail to transfer across domains and granularity choices. ...

#text segmentation #unsupervised learning #kernel change-point detection #sentence embeddings #nlp
3 months ago · ai · - · -

[Paper] Design Techniques for LLM-Powered Interactive Storytelling: A Case Study of the Dramamancer System

The rise of Large Language Models (LLMs) has enabled a new paradigm for bridging authorial intent and player agency in interactive narrative. We consider this p...

#large language models #interactive storytelling #natural language processing #AI research
3 months ago · ai · - · -

[Paper] Multi-Objective Reinforcement Learning for Efficient Tactical Decision Making for Trucks in Highway Traffic

Balancing safety, efficiency, and operational costs in highway driving poses a challenging decision-making problem for heavy-duty vehicles. A central difficulty...

#reinforcement learning #multi-objective optimization #autonomous trucking #Pareto frontier #policy optimization
3 months ago · ai · - · -

[Paper] POPE: Learning to Reason on Hard Problems via Privileged On-Policy Exploration

Reinforcement learning (RL) has improved the reasoning abilities of large language models (LLMs), yet state-of-the-art methods still fail to learn on many train...

#reinforcement learning #large language models #reasoning #privileged exploration #machine learning
3 months ago · ai · - · -

[Paper] Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability

Can a model learn to escape its own learning plateau? Reinforcement learning methods for finetuning large reasoning models stall on datasets with low initial su...

#self-improvement #meta-reinforcement-learning #large-language-models #curriculum-generation #machine-learning-research
3 months ago · ai · - · -

[Paper] PRECISE: Reducing the Bias of LLM Evaluations Using Prediction-Powered Ranking Estimation

Evaluating the quality of search, ranking and RAG systems traditionally requires a significant number of human relevance annotations. In recent times, several d...

#LLM evaluation #bias mitigation #information retrieval #prediction-powered inference #precision@k
3 months ago · ai · - · -

[Paper] Dep-Search: Learning Dependency-Aware Reasoning Traces with Persistent Memory

Large Language Models (LLMs) have demonstrated remarkable capabilities in complex reasoning tasks, particularly when augmented with search mechanisms that enabl...

#dependency-aware reasoning #large language models #retrieval-augmented generation #multi-hop question answering #persistent memory
3 months ago · ai · - · -

[Paper] Learning to Discover: A Generalized Framework for Raga Identification without Forgetting

Raga identification in Indian Art Music (IAM) remains challenging due to the presence of numerous rarely performed Ragas that are not represented in available t...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] $α^3$-SecBench: A Large-Scale Evaluation Suite of Security, Resilience, and Trust for LLM-based UAV Agents over 6G Networks

Autonomous unmanned aerial vehicle (UAV) systems are increasingly deployed in safety-critical, networked environments where they must operate reliably in the pr...

#LLM security #UAV autonomy #6G networks #adversarial benchmarking #AI safety
3 months ago · ai · - · -

[Paper] HalluGuard: Demystifying Data-Driven and Reasoning-Driven Hallucinations in LLMs

The reliability of Large Language Models (LLMs) in high-stakes domains such as healthcare, law, and scientific discovery is often compromised by hallucinations....

#research #paper #ai #machine-learning
3 months ago · software · - · -

[Paper] Let's Make Every Pull Request Meaningful: An Empirical Analysis of Developer and Agentic Pull Requests

The automatic generation of pull requests (PRs) using AI agents has become increasingly common. Although AI-generated PRs are fast and easy to create, their mer...

#research #paper #software
3 months ago · ai · - · -

[Paper] SeNeDiF-OOD: Semantic Nested Dichotomy Fusion for Out-of-Distribution Detection Methodology in Open-World Classification. A Case Study on Monument Style Classification

Out-of-distribution (OOD) detection is a fundamental requirement for the reliable deployment of artificial intelligence applications in open-world environments....

#out-of-distribution detection #semantic nested dichotomy #computer vision #architectural style classification
3 months ago · ai · - · -

[Paper] Advances and Innovations in the Multi-Agent Robotic System (MARS) Challenge

Recent advancements in multimodal large language models and vision-languageaction models have significantly driven progress in Embodied AI. As the field transit...

#multi-agent systems #embodied AI #vision-language models #robotics #NeurIPS challenge
3 months ago · ai · - · -

[Paper] Low Cost, High Efficiency: LiDAR Place Recognition in Vineyards with Matryoshka Representation Learning

Localization in agricultural environments is challenging due to their unstructured nature and lack of distinctive landmarks. Although agricultural settings have...

#LiDAR #place recognition #computer vision #representation learning #agricultural robotics
3 months ago · ai · - · -

[Paper] SMART: Scalable Mesh-free Aerodynamic Simulations from Raw Geometries using a Transformer-based Surrogate Model

Machine learning-based surrogate models have emerged as more efficient alternatives to numerical solvers for physical simulations over complex geometries, such ...

#surrogate modeling #transformer #aerodynamic simulation #mesh-free #point cloud
3 months ago · ai · - · -

[Paper] Are Video Generation Models Geographically Fair? An Attraction-Centric Evaluation of Global Visual Knowledge

Recent advances in text-to-video generation have produced visually compelling results, yet it remains unclear whether these models encode geographically equitab...

#text-to-video #geographic bias #computer vision #benchmark #evaluation
3 months ago · ai · - · -

[Paper] A Pragmatic VLA Foundation Model

Offering great potential in robotic manipulation, a capable Vision-Language-Action (VLA) foundation model is expected to faithfully generalize across tasks and ...

#vision-language-action #robotics #foundation-model #computer-vision #machine-learning
3 months ago · ai · - · -

[Paper] Counterfactual Explanations on Robust Perceptual Geodesics

Latent-space optimization methods for counterfactual explanations - framed as minimal semantic perturbations that change model predictions - inherit the ambigui...

#counterfactual explanations #perceptual geodesics #computer vision #machine learning #robustness
3 months ago · ai · - · -

[Paper] Splat-Portrait: Generalizing Talking Heads with Gaussian Splatting

Talking Head Generation aims at synthesizing natural-looking talking videos from speech and a single portrait image. Previous 3D talking head generation methods...

#research #paper #ai #computer-vision
3 months ago · ai · - · -

[Paper] AdaReasoner: Dynamic Tool Orchestration for Iterative Visual Reasoning

When humans face problems beyond their immediate capabilities, they rely on tools, providing a promising paradigm for improving visual reasoning in multimodal l...

#multimodal-llm #visual-reasoning #tool-orchestration #reinforcement-learning #research-paper
3 months ago · ai · - · -

[Paper] CONQUER: Context-Aware Representation with Query Enhancement for Text-Based Person Search

Text-Based Person Search (TBPS) aims to retrieve pedestrian images from large galleries using natural language descriptions. This task, essential for public saf...

#text-based person search #cross-modal retrieval #computer vision #query enhancement #optimal transport
3 months ago · ai · - · -

[Paper] Global Optimization of Atomic Clusters via Physically-Constrained Tensor Train Decomposition

The global optimization of atomic clusters represents a fundamental challenge in computational chemistry and materials science due to the exponential growth of ...

#research #paper #ai
3 months ago · software · - · -

[Paper] How are MLOps Frameworks Used in Open Source Projects? An Empirical Characterization

Machine Learning (ML) Operations (MLOps) frameworks have been conceived to support developers and AI engineers in managing the lifecycle of their ML models. Whi...

#research #paper #software
3 months ago · software · - · -

[Paper] On the Abolition of the 'ICSE Paper' and the Adoption of the 'Registered Proposal' and the 'Results Report'

To address the 'novelty-vicious cycle' and the 'replicability crisis' of the field (both discussed in the survey) we propose abolishing the 'ICSE paper' as we k...

#research #paper #software
3 months ago · software · - · -

[Paper] An Audit of Machine Learning Experiments on Software Defect Prediction

Background: Machine learning algorithms are widely used to predict defect prone software components. In this literature, computational experiments are the main ...

#machine learning #software defect prediction #empirical study #reproducibility #software engineering
3 months ago · ai · - · -

[Paper] Scaling Behaviors of Evolutionary Algorithms on GPUs: When Does Parallelism Pay Off?

Evolutionary algorithms (EAs) are increasingly implemented on graphics processing units (GPUs) to leverage parallel processing capabilities for enhanced efficie...

#research #paper #ai
3 months ago · ai · - · -

[Paper] daVinci-Dev: Agent-native Mid-training for Software Engineering

Recently, the frontier of Large Language Model (LLM) capabilities has shifted from single-turn code generation to agentic software engineering-a paradigm where ...

#agentic AI #mid-training #software engineering #LLM #code generation
3 months ago · devops · - · -

[Paper] On the Bandwidth Consumption of Blockchains

With the advent of blockchain technology, the number of proposals has boomed. The network traffic imposed by these blockchain proposals increases the cost of ho...

#research #paper #devops
3 months ago · it · - · -

[Paper] An Adaptive Purification Controller for Quantum Networks: Dynamic Protocol Selection and Multipartite Distillation

Efficient entanglement distribution is the cornerstone of the Quantum Internet. However, physical link parameters such as photon loss, memory coherence time, an...

#quantum networking #entanglement purification #adaptive protocol selection #quantum internet #dynamic resource allocation

Newer posts

Older posts