Source

arXiv

5644 posts from this source

Sort:

1 month ago · ai · - · -

[Paper] High-Dimensional Gaussian Mean Estimation under Realizable Contamination

We study mean estimation for a Gaussian distribution with identity covariance in mathbb{R}^d under a missing data scheme termed realizable ε-contamination model...

#research #paper #ai #machine-learning
1 month ago · software · - · -

[Paper] Improving Code Comprehension through Cognitive-Load Aware Automated Refactoring for Novice Programmers

Novice programmers often struggle to comprehend code due to vague naming, deep nesting, and poor structural organization. While explanations may offer partial s...

#research #paper #software
1 month ago · ai · - · -

[Paper] InCoder-32B: Code Foundation Model for Industrial Scenarios

Recent code large language models have achieved remarkable progress on general programming tasks. Nevertheless, their performance degrades significantly in indu...

#research #paper #ai #machine-learning
1 month ago · ai · - · -

[Paper] SpokenUS: A Spoken User Simulator for Task-Oriented Dialogue

Robust task-oriented spoken dialogue agents require exposure to the full diversity of how people interact through speech. Building spoken user simulators that a...

#research #paper #ai #nlp
1 month ago · ai · - · -

[Paper] SOMP: Scalable Gradient Inversion for Large Language Models via Subspace-Guided Orthogonal Matching Pursuit

Gradient inversion attacks reveal that private training text can be reconstructed from shared gradients, posing a privacy risk to large language models (LLMs). ...

#research #paper #ai #machine-learning #nlp
1 month ago · ai · - · -

[Paper] TurnWise: The Gap between Single- and Multi-turn Language Model Capabilities

Multi-turn conversations are a common and critical mode of language model interaction. However, current open training and evaluation data focus on single-turn s...

#research #paper #ai #machine-learning #nlp
1 month ago · ai · - · -

[Paper] Probing Cultural Signals in Large Language Models through Author Profiling

Large language models (LLMs) are increasingly deployed in applications with societal impact, raising concerns about the cultural biases they encode. We probe th...

#research #paper #ai #machine-learning #nlp
1 month ago · ai · - · -

[Paper] IQuest-Coder-V1 Technical Report

In this report, we introduce the IQuest-Coder-V1 series-(7B/14B/40B/40B-Loop), a new family of code large language models (LLMs). Moving beyond static code repr...

#research #paper #ai #machine-learning #nlp
1 month ago · devops · - · -

[Paper] Looking for (Genomic) Needles in a Haystack: Sparsity-Driven Search for Identifying Correlated Genetic Mutations in Cancer

Cancer typically arises not from a single genetic mutation (i.e., hit) but from multi-hit combinations that accumulate within cells. However, enumerating multi-...

#research #paper #devops
1 month ago · devops · - · -

[Paper] Dataflow-Oriented Classification and Performance Analysis of GPU-Accelerated Homomorphic Encryption

Fully Homomorphic Encryption (FHE) enables secure computation over encrypted data, but its computational cost remains a major obstacle to practical deployment. ...

#research #paper #devops
1 month ago · devops · - · -

[Paper] Accelerating the Particle-In-Cell code ECsim with OpenACC

The Particle-In-Cell (PIC) method is a computational technique widely used in plasma physics to model plasmas at the kinetic level. In this work, we present our...

#research #paper #devops
1 month ago · software · - · -

[Paper] Reasoning About Variability Models Through Network Analysis

Feature models are widely used to capture the configuration space of software systems. Although automated reasoning has been studied for detecting problematic f...

#research #paper #software
1 month ago · devops · - · -

[Paper] FleetOpt: Analytical Fleet Provisioning for LLM Inference with Compress-and-Route as Implementation Mechanism

Modern LLM GPU fleets are provisioned for worst-case context lengths that the vast majority of requests never approach, wasting GPU capacity on idle KV-cache sl...

#research #paper #devops
1 month ago · software · - · -

[Paper] TRACE: Evaluating Execution Efficiency of LLM-Based Code Translation

While Large Language Models (LLMs) have substantially improved the functional correctness of code translation, the critical dimension of execution efficiency re...

#research #paper #software
1 month ago · ai · - · -

[Paper] Linearized Bregman Iterations for Sparse Spiking Neural Networks

Spiking Neural Networks (SNNs) offer an energy efficient alternative to conventional Artificial Neural Networks (ANNs) but typically still require a large numbe...

#research #paper #ai
1 month ago · ai · - · -

[Paper] An Efficient Heterogeneous Co-Design for Fine-Tuning on a Single GPU

Fine-tuning Large Language Models (LLMs) has become essential for domain adaptation, but its memory-intensive property exceeds the capabilities of most GPUs. To...

#research #paper #ai #machine-learning
1 month ago · ai · - · -

[Paper] Deep Reinforcement Learning-Assisted Automated Operator Portfolio for Constrained Multi-objective Optimization

Constrained multi-objective optimization problems (CMOPs) are of great significance in the context of practical applications, ranging from scientific to enginee...

#research #paper #ai
1 month ago · software · - · -

[Paper] Beyond Grading Accuracy: Exploring Alignment of TAs and LLMs

In this paper, we investigate the potential of open-source Large Language Models (LLMs) for grading Unified Modeling Language (UML) class diagrams. In contrast ...

#research #paper #software
1 month ago · devops · - · -

[Paper] Biased Compression in Gradient Coding for Distributed Learning

Communication bottlenecks and the presence of stragglers pose significant challenges in distributed learning (DL). To deal with these challenges, recent advance...

#research #paper #devops
1 month ago · software · - · -

[Paper] SseRex: Practical Symbolic Execution of Solana Smart Contracts

Solana is rapidly gaining traction among smart contract developers and users. However, its growing adoption has been accompanied by a series of major security i...

#research #paper #software
1 month ago · software · - · -

[Paper] Prompts Blend Requirements and Solutions: From Intent to Implementation

AI coding assistants are reshaping software development by shifting focus from writing code to formulating prompts. In chat-focused approaches such as vibe codi...

#research #paper #software
1 month ago · ai · - · -

[Paper] A Human-Centred Architecture for Large Language Models-Cognitive Assistants in Manufacturing within Quality Management Systems

Large Language Models-Cognitive Assistants (LLM-CAs) can enhance Quality Management Systems (QMS) in manufacturing, fostering continuous process improvement and...

#research #paper #ai #machine-learning
1 month ago · ai · - · -

[Paper] Surrogate-Assisted Genetic Programming with Rank-Based Phenotypic Characterisation for Dynamic Multi-Mode Project Scheduling

The dynamic multi-mode resource-constrained project scheduling problem (DMRCPSP) is of practical importance, as it requires making real-time decisions under cha...

#research #paper #ai #machine-learning
1 month ago · devops · - · -

[Paper] inference-fleet-sim: A Queueing-Theory-Grounded Fleet Capacity Planner for LLM Inference

Sizing a GPU fleet for LLM inference is harder than it looks. The obvious questions -- how many GPUs, which type, where to split a two-pool fleet -- have no clo...

#research #paper #devops
1 month ago · ai · - · -

[Paper] EvoIQA - Explaining Image Distortions with Evolved White-Box Logic

Traditional Image Quality Assessment (IQA) metrics typically fall into one of two extremes: rigid, hand-crafted mathematical models or 'black-box' deep learning...

#research #paper #ai #computer-vision
1 month ago · ai · - · -

[Paper] Towards Generalizable Robotic Manipulation in Dynamic Environments

Vision-Language-Action (VLA) models excel in static manipulation but struggle in dynamic environments with moving targets. This performance gap primarily stems ...

#research #paper #ai #computer-vision
1 month ago · ai · - · -

[Paper] Mixture-of-Depths Attention

Scaling depth is a key driver for large language models (LLMs). Yet, as LLMs become deeper, they often suffer from signal degradation: informative features form...

#research #paper #ai #machine-learning #nlp
1 month ago · ai · - · -

[Paper] Look Before Acting: Enhancing Vision Foundation Representations for Vision-Language-Action Models

Vision-Language-Action (VLA) models have recently emerged as a promising paradigm for robotic manipulation, in which reliable action prediction critically depen...

#research #paper #ai #computer-vision
1 month ago · ai · - · -

[Paper] HorizonMath: Measuring AI Progress Toward Mathematical Discovery with Automatic Verification

Can AI make progress on important, unsolved mathematical problems? Large language models are now capable of sophisticated mathematical and scientific reasoning,...

#research #paper #ai #machine-learning
1 month ago · ai · - · -

[Paper] GlyphPrinter: Region-Grouped Direct Preference Optimization for Glyph-Accurate Visual Text Rendering

Generating accurate glyphs for visual text rendering is essential yet challenging. Existing methods typically enhance text rendering by training on a large amou...

#research #paper #ai #computer-vision
1 month ago · ai · - · -

[Paper] Mechanistic Origin of Moral Indifference in Language Models

Existing behavioral alignment techniques for Large Language Models (LLMs) often neglect the discrepancy between surface compliance and internal unaligned repres...

#research #paper #ai #machine-learning #nlp
1 month ago · ai · - · -

[Paper] Tri-Prompting: Video Diffusion with Unified Control over Scene, Subject, and Motion

Recent video diffusion models have made remarkable strides in visual quality, yet precise, fine-grained control remains a key bottleneck that limits practical c...

#research #paper #ai #computer-vision
1 month ago · ai · - · -

[Paper] HSImul3R: Physics-in-the-Loop Reconstruction of Simulation-Ready Human-Scene Interactions

We present HSImul3R, a unified framework for simulation-ready 3D reconstruction of human-scene interactions (HSI) from casual captures, including sparse-view im...

#research #paper #ai #computer-vision
1 month ago · ai · - · -

[Paper] Code-A1: Adversarial Evolving of Code LLM and Test LLM via Reinforcement Learning

Reinforcement learning for code generation relies on verifiable rewards from unit test pass rates. Yet high-quality test suites are scarce, existing datasets of...

#research #paper #ai #nlp
1 month ago · ai · - · -

[Paper] Do Metrics for Counterfactual Explanations Align with User Perception?

Explainability is widely regarded as essential for trustworthy artificial intelligence systems. However, the metrics commonly used to evaluate counterfactual ex...

#research #paper #ai #machine-learning
1 month ago · ai · - · -

[Paper] Fast SAM 3D Body: Accelerating SAM 3D Body for Real-Time Full-Body Human Mesh Recovery

SAM 3D Body (3DB) achieves state-of-the-art accuracy in monocular 3D human mesh recovery, yet its inference latency of several seconds per image precludes real-...

#research #paper #ai #computer-vision
1 month ago · ai · - · -

[Paper] From Passive Observer to Active Critic: Reinforcement Learning Elicits Process Reasoning for Robotic Manipulation

Accurate process supervision remains a critical challenge for long-horizon robotic manipulation. A primary bottleneck is that current video MLLMs, trained prima...

#research #paper #ai #machine-learning #nlp #computer-vision
1 month ago · ai · - · -

[Paper] SmartSearch: How Ranking Beats Structure for Conversational Memory Retrieval

Recent conversational memory systems invest heavily in LLM-based structuring at ingestion time and learned retrieval policies at query time. We show that neithe...

#research #paper #ai #machine-learning
1 month ago · ai · - · -

[Paper] AC-Foley: Reference-Audio-Guided Video-to-Audio Synthesis with Acoustic Transfer

Existing video-to-audio (V2A) generation methods predominantly rely on text prompts alongside visual information to synthesize audio. However, two critical bott...

#research #paper #ai #machine-learning #computer-vision
1 month ago · ai · - · -

[Paper] Robust and Computationally Efficient Linear Contextual Bandits under Adversarial Corruption and Heavy-Tailed Noise

We study linear contextual bandits under adversarial corruption and heavy-tailed noise with finite (1+ε)-th moments for some εin (0,1]. Existing work that addre...

#research #paper #ai #machine-learning
1 month ago · ai · - · -

[Paper] OpenSeeker: Democratizing Frontier Search Agents by Fully Open-Sourcing Training Data

Deep search capabilities have become an indispensable competency for frontier Large Language Model (LLM) agents, yet the development of high-performance search ...

#research #paper #ai #machine-learning #nlp
1 month ago · ai · - · -

[Paper] Effective Distillation to Hybrid xLSTM Architectures

There have been numerous attempts to distill quadratic attention-based large language models (LLMs) into sub-quadratic linearized architectures. However, despit...

#research #paper #ai #machine-learning
1 month ago · ai · - · -

[Paper] Computational Concept of the Psyche

This article presents an overview of approaches to modeling the human psyche in the context of constructing an artificial one. Based on this overview, a concept...

#research #paper #ai #machine-learning
1 month ago · ai · - · -

[Paper] Physics-Informed Neural Systems for the Simulation of EUV Electromagnetic Wave Diffraction from a Lithography Mask

Physics-informed neural networks (PINNs) and neural operators (NOs) for solving the problem of diffraction of Extreme Ultraviolet (EUV) electromagnetic waves fr...

#research #paper #ai #machine-learning
1 month ago · ai · - · -

[Paper] Grounding World Simulation Models in a Real-World Metropolis

What if a world simulation model could render not an imagined environment but a city that actually exists? Prior generative world models synthesize visually pla...

#research #paper #ai #computer-vision
1 month ago · ai · - · -

[Paper] Benchmarking Machine Learning Approaches for Polarization Mapping in Ferroelectrics Using 4D-STEM

Four-dimensional scanning transmission electron microscopy (4D-STEM) provides rich, atomic-scale insights into materials structures. However, extracting specifi...

#research #paper #ai #computer-vision
1 month ago · ai · - · -

[Paper] Unbiased and Biased Variance-Reduced Forward-Reflected-Backward Splitting Methods for Stochastic Composite Inclusions

This paper develops new variance-reduction techniques for the forward-reflected-backward splitting (FRBS) method to solve a class of possibly nonmonotone stocha...

#research #paper #ai #machine-learning
1 month ago · ai · - · -

[Paper] Co-Design of Memory-Storage Systems for Workload Awareness with Interpretable Models

Solid-state storage architectures based on NAND or emerging memory devices (SSD), are fundamentally architected and optimized for both reliability and performan...

#research #paper #ai #machine-learning

Newer posts

Older posts