Source

arXiv

5644 posts from this source

2 months ago · ai · - · -

[Paper] DySCO: Dynamic Attention-Scaling Decoding for Long-Context LMs

Understanding and reasoning over long contexts is a crucial capability for language models (LMs). Although recent models support increasingly long context windo...

#long-context #attention-scaling #LLM-decoding #retrieval-heads #inference-optimization
2 months ago · ai · - · -

[Paper] Applying a Random-Key Optimizer on Mixed Integer Programs

Mixed-Integer Programs (MIPs) are NP-hard optimization models that arise in a broad range of decision-making applications, including finance, logistics, energy ...

#random-key optimizer #mixed-integer programming #metaheuristic #optimization algorithms #operations research
2 months ago · ai · - · -

[Paper] CASR: A Robust Cyclic Framework for Arbitrary Large-Scale Super-Resolution with Distribution Alignment and Self-Similarity Awareness

Arbitrary-Scale SR (ASISR) remains fundamentally limited by cross-scale distribution shift: once the inference scale leaves the training range, noise, blur, and...

#super-resolution #cyclic upscaling #distribution alignment #self-similarity #computer vision
2 months ago · devops · - · -

[Paper] LLMTailor: A Layer-wise Tailoring Tool for Efficient Checkpointing of Large Language Models

Checkpointing is essential for fault tolerance in training large language models (LLMs). However, existing methods, regardless of their I/O strategies, periodic...

#research #paper #devops
2 months ago · ai · - · -

[Paper] Dynamic Personality Adaptation in Large Language Models via State Machines

The inability of Large Language Models (LLMs) to modulate their personality expression in response to evolving dialogue dynamics hinders their performance in co...

#large language models #personality modeling #state machines #prompt engineering #dialogue systems
2 months ago · ai · - · -

[Paper] Stream Neural Networks: Epoch-Free Learning with Persistent Temporal State

Most contemporary neural learning systems rely on epoch-based optimization and repeated access to historical data, implicitly assuming reversible computation. I...

#research #paper #ai
2 months ago · ai · - · -

[Paper] CoLoGen: Progressive Learning of Concept-Localization Duality for Unified Image Generation

Unified conditional image generation remains difficult because different tasks depend on fundamentally different internal representations. Some require conceptu...

#diffusion models #image generation #concept-localization duality #computer vision #machine learning
2 months ago · ai · - · -

[Paper] Enhancing Framingham Cardiovascular Risk Score Transparency through Logic-Based XAI

Cardiovascular disease (CVD) remains one of the leading global health challenges, accounting for more than 19 million deaths worldwide. To address this, several...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Provable Last-Iterate Convergence for Multi-Objective Safe LLM Alignment via Optimistic Primal-Dual

Reinforcement Learning from Human Feedback (RLHF) plays a significant role in aligning Large Language Models (LLMs) with human preferences. While RLHF with expe...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] When AI Writes, Whose Voice Remains? Quantifying Cultural Marker Erasure Across World English Varieties in Large Language Models

Large Language Models (LLMs) are increasingly used to "professionalize" workplace communication, often at the cost of linguistic identity. We introduce 'Cultu...

#research #paper #ai #machine-learning #nlp
2 months ago · ai · - · -

[Paper] NoLan: Mitigating Object Hallucinations in Large Vision-Language Models via Dynamic Suppression of Language Priors

Object hallucination is a critical issue in Large Vision-Language Models (LVLMs), where outputs include objects that do not appear in the input image. A natural...

#research #paper #ai #machine-learning #nlp #computer-vision
2 months ago · ai · - · -

[Paper] MedTri: A Platform for Structured Medical Report Normalization to Enhance Vision-Language Pretraining

Medical vision-language pretraining increasingly relies on medical reports as large-scale supervisory signals; however, raw reports often exhibit substantial st...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] WeaveTime: Stream from Earlier Frames into Emergent Memory in VideoLLMs

Recent advances in Multimodal Large Language Models have greatly improved visual understanding and reasoning, yet their quadratic attention and offline training...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] SigmaQuant: Hardware-Aware Heterogeneous Quantization Method for Edge DNN Inference

Deep neural networks (DNNs) are essential for performing advanced tasks on edge or mobile devices, yet their deployment is often hindered by severe resource con...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Sample Complexity Bounds for Robust Mean Estimation with Mean-Shift Contamination

We study the basic task of mean estimation in the presence of mean-shift contamination. In the mean-shift contamination model, an adversary is allowed to replac...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] IndicIFEval: A Benchmark for Verifiable Instruction-Following Evaluation in 14 Indic Languages

Instruction-following benchmarks remain predominantly English-centric, leaving a critical evaluation gap for the hundreds of millions of Indic language speakers...

#research #paper #ai #nlp
2 months ago · ai · - · -

[Paper] SWE-Protégé: Learning to Selectively Collaborate With an Expert Unlocks Small Language Models as Software Engineering Agents

Small language models (SLMs) offer compelling advantages in cost, latency, and adaptability, but have so far lagged behind larger models on long-horizon softwar...

#research #paper #ai #machine-learning #nlp
2 months ago · ai · - · -

[Paper] Probing the Geometry of Diffusion Models with the String Method

Understanding the geometry of learned distributions is fundamental to improving and interpreting diffusion models, yet systematic tools for exploring their land...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Don't stop me now: Rethinking Validation Criteria for Model Parameter Selection

Despite the extensive literature on training loss functions, the evaluation of generalization on the validation set remains underexplored. In this work, we cond...

#model-selection #validation-metrics #early-stopping #loss-functions #neural-networks
2 months ago · devops · - · -

[Paper] PASTA: A Modular Program Analysis Tool Framework for Accelerators

The increasing complexity and diversity of hardware accelerators in modern computing systems demand flexible, low-overhead program analysis tools. We present PA...

#research #paper #devops
2 months ago · software · - · -

[Paper] Visual Milestone Planning in a Hybrid Development Context

This paper explains the Visual Milestone Planning (VMP) method using an agile vocabulary to facilitate its adoption by agile practitioners as a front end for a ...

#research #paper #software
2 months ago · software · - · -

[Paper] Detecting UX smells in Visual Studio Code using LLMs

Integrated Development Environments shape developers' daily experience, yet the empirical study of their usability and user experience (UX) remains limited. Thi...

#LLM #UX smells #Visual Studio Code #issue mining #software usability
2 months ago · devops · - · -

[Paper] IOAgent: Democratizing Trustworthy HPC I/O Performance Diagnosis Capability via LLMs

As the complexity of the HPC storage stack rapidly grows, domain scientists face increasing challenges in effectively utilizing HPC storage systems to achieve t...

#research #paper #devops
2 months ago · ai · - · -

[Paper] Enhancing LLM-Based Test Generation by Eliminating Covered Code

Automated test generation is essential for software quality assurance, with coverage rate serving as a key metric to ensure thorough testing. Recent advancement...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Outpatient Appointment Scheduling Optimization with a Genetic Algorithm Approach

The optimization of complex medical appointment scheduling remains a significant operational challenge in multi-center healthcare environments, where clinical s...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Energy Efficient Federated Learning with Hyperdimensional Computing over Wireless Communication Networks

In this paper, we investigate a problem of minimizing total energy consumption for secure federated learning (FL) over wireless edge networks. To address the hi...

#federated learning #hyperdimensional computing #energy-efficient AI #wireless edge devices #differential privacy
2 months ago · software · - · -

[Paper] A task-based data-flow methodology for programming heterogeneous systems with multiple accelerator APIs

Heterogeneous nodes that combine multi-core CPUs with diverse accelerators are rapidly becoming the norm in both high-performance computing (HPC) and AI infrast...

#heterogeneous computing #task-based runtime #accelerator APIs #OpenMP/OmpSs-2 #CUDA SYCL PoCL
2 months ago · ai · - · -

[Paper] JSAM: Privacy Straggler-Resilient Joint Client Selection and Incentive Mechanism Design in Differentially Private Federated Learning

Differentially private federated learning faces a fundamental tension: privacy protection mechanisms that safeguard client data simultaneously create quantifiab...

#research #paper #ai #machine-learning
2 months ago · software · - · -

[Paper] From Restructuring to Stabilization: A Large-Scale Experiment on Iterative Code Readability Refactoring with Large Language Models

Large language models (LLMs) are increasingly used for automated code refactoring tasks. Although these models can quickly refactor code, the quality may exhibi...

#research #paper #software
2 months ago · ai · - · -

[Paper] An Empirical Study of Bugs in Modern LLM Agent Frameworks

LLM agents have been widely adopted in real-world applications, relying on agent frameworks for workflow execution and multi-agent coordination. As these system...

#LLM agents #bug taxonomy #framework reliability #LangChain #CrewAI
2 months ago · ai · - · -

[Paper] An Evaluation of Context Length Extrapolation in Long Code via Positional Embeddings and Efficient Attention

The rapid advancement of large language models (LLMs) has led to a significant increase in automated tools in software engineering, capable of performing va...

#code-LLM #context-window extrapolation #positional embeddings #efficient attention #long-code completion
2 months ago · ai · - · -

[Paper] DHP: Efficient Scaling of MLLM Training with Dynamic Hybrid Parallelism

Scaling long-context capabilities is crucial for Multimodal Large Language Models (MLLMs). However, real-world multimodal datasets are extremely heterogeneous. ...

#multimodal large language models #dynamic hybrid parallelism #distributed training #GPU/NPUs scaling #deep learning research
2 months ago · ai · - · -

[Paper] Survey on Neural Routing Solvers

Neural routing solvers (NRSs) that leverage deep learning to tackle vehicle routing problems have demonstrated notable potential for practical applications. By ...

#research #paper #ai #machine-learning
2 months ago · software · - · -

[Paper] Proto-ML: An IDE for ML Solution Prototyping

Prototyping plays a critical role in the development of machine learning (ML) solutions, yet existing tools often provide limited support for effective collabor...

#research #paper #software
2 months ago · devops · - · -

[Paper] Lamport's Arrow of Time: The Category Mistake in Logical Clocks

Lamport's 1978 paper introduced the happens-before relation and logical clocks, freeing distributed systems from dependence on synchronized physical clocks. Thi...

#research #paper #devops
2 months ago · software · - · -

[Paper] EditFlow: Benchmarking and Optimizing Code Edit Recommendation Systems via Reconstruction of Developer Flows

Large language models (LLMs) for code editing have achieved remarkable progress, yet recent empirical studies reveal a fundamental disconnect between technical ...

#code-edit recommendation #developer flow modeling #LLM benchmarking #EditFlow dataset #software engineering AI
2 months ago · software · - · -

[Paper] AkiraRust: Re-thinking LLM-aided Rust Repair Using a Feedback-guided Thinking Switch

Eliminating undefined behaviors (UBs) in Rust programs requires a deep semantic understanding to enable accurate and reliable repair. While existing studies hav...

#rust #large-language-models #automated-bug-fixing #finite-state-machine #software-engineering
2 months ago · devops · - · -

[Paper] Type-Based Enforcement of Non-Interference for Choreographic Programming

Choreographies describe distributed protocols from a global viewpoint, enabling correct-by-construction synthesis of local behaviours. We develop a policy-param...

#research #paper #devops
2 months ago · ai · - · -

[Paper] Code World Models for Parameter Control in Evolutionary Algorithms

Can an LLM learn how an optimizer behaves -- and use that knowledge to control it? We extend Code World Models (CWMs), LLM-synthesized Python programs that pred...

#large language models #evolutionary algorithms #parameter control #code world models #reinforcement learning
2 months ago · ai · - · -

[Paper] Test-Time Training with KV Binding Is Secretly Linear Attention

Test-time training (TTT) with KV binding as a sequence modeling layer is commonly interpreted as a form of online meta-learning that memorizes a key-value mapping...

#research #paper #ai #machine-learning #computer-vision
2 months ago · ai · - · -

[Paper] Squint: Fast Visual Reinforcement Learning for Sim-to-Real Robotics

Visual reinforcement learning is appealing for robotics but expensive -- off-policy methods are sample-efficient yet slow; on-policy methods parallelize well bu...

#research #paper #ai #machine-learning #computer-vision
2 months ago · ai · - · -

[Paper] Multi-Vector Index Compression in Any Modality

We study efficient multi-vector retrieval for late interaction in any modality. Late interaction has emerged as a dominant paradigm for information retrieval in...

#research #paper #ai #nlp #computer-vision
2 months ago · ai · - · -

[Paper] Aletheia tackles FirstProof autonomously

We report the performance of Aletheia (Feng et al., 2026b), a mathematics research agent powered by Gemini 3 Deep Think, on the inaugural FirstProof challenge. ...

#autonomous AI #mathematical reasoning #large language models #proof generation #Gemini 3
2 months ago · ai · - · -

[Paper] Learning from Trials and Errors: Reflective Test-Time Planning for Embodied LLMs

Embodied LLMs endow robots with high-level task reasoning, but they cannot reflect on what went wrong or why, turning deployment into a sequence of independent ...

#research #paper #ai #machine-learning #nlp #computer-vision
2 months ago · ai · - · -

[Paper] Untied Ulysses: Memory-Efficient Context Parallelism via Headwise Chunking

Efficiently processing long sequences with Transformer models usually requires splitting the computations across accelerators via context parallelism. The domin...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Region of Interest Segmentation and Morphological Analysis for Membranes in Cryo-Electron Tomography

Cryo-electron tomography (cryo-ET) enables high resolution, three-dimensional reconstruction of biological structures, including membranes and membrane proteins...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] On Data Engineering for Scaling LLM Terminal Capabilities

Despite rapid recent progress in the terminal capabilities of large language models, the training data strategies behind state-of-the-art terminal agents remain...

#research #paper #ai #nlp
2 months ago · ai · - · -

[Paper] Statistical Query Lower Bounds for Smoothed Agnostic Learning

We study the complexity of smoothed agnostic learning, recently introduced by [CKKMS24], in which the learner competes with the best classifier in a target ...

#research #paper #ai #machine-learning
