Source

arXiv

5752 posts from this source

Sort:

2 months ago · software · - · -

[Paper] ArkEval: Benchmarking and Evaluating Automated CodeRepair for ArkTS

Large language models have transformed code generation, enabling unprecedented automation in software development. As mobile ecosystems evolve, HarmonyOS has em...

#code-repair #benchmark #ArkTS #LLM #HarmonyOS
2 months ago · ai · - · -

[Paper] A Methodology for Effective Surrogate Learning in Complex Optimization

Solving complex problems requires continuous effort in developing theory and practice to cope with larger, more difficult scenarios. Working with surrogates is ...

#surrogate modeling #graph neural networks #optimization #energy efficiency #traffic simulation
2 months ago · ai · - · -

[Paper] Permissive-Washing in the Open AI Supply Chain: A Large-Scale Audit of License Integrity

Permissive licenses like MIT, Apache-2.0, and BSD-3-Clause dominate open-source AI, signaling that artifacts like models, datasets, and code can be freely used,...

#research #paper #ai #machine-learning
2 months ago · software · - · -

[Paper] Verifying DNN-based Semantic Communication Against Generative Adversarial Noise

Safety-critical applications like autonomous vehicles and industrial IoT are adopting semantic communication (SemCom) systems using deep neural networks to redu...

#research #paper #software
2 months ago · devops · - · -

[Paper] Equilibria: Fair Multi-Tenant CXL Memory Tiering At Scale

Memory dominates datacenter system cost and power. Memory expansion via Compute Express Link (CXL) is an effective way to provide additional memory at lower cos...

#CXL #memory tiering #container fairness #Linux kernel #cloud infrastructure
2 months ago · ai · - · -

[Paper] Taming Scylla: Understanding the multi-headed agentic daemon of the coding seas

LLM-based tools are automating more software development tasks at a rapid pace, but there is no rigorous way to evaluate how different architectural choices -- ...

#research #paper #ai #machine-learning
2 months ago · software · - · -

[Paper] DyMA-Fuzz: Dynamic Direct Memory Access Abstraction for Re-hosted Monolithic Firmware Fuzzing

The rise of smart devices in critical domains--including automotive, medical, industrial--demands robust firmware testing. Fuzzing firmware in re-hosted environ...

#firmware fuzzing #DMA abstraction #re-hosted emulation #security research #coverage‑guided fuzzing
2 months ago · devops · - · -

[Paper] PARD: Enhancing Goodput for Inference Pipeline via Proactive Request Dropping

Modern deep neural network (DNN) applications integrate multiple DNN models into inference pipelines with stringent latency requirements for customized tasks. T...

#research #paper #devops
2 months ago · ai · - · -

[Paper] Enhancing Genetic Algorithms with Graph Neural Networks: A Timetabling Case Study

This paper investigates the impact of hybridizing a multi-modal Genetic Algorithm with a Graph Neural Network for timetabling optimization. The Graph Neural Net...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Automating Computational Reproducibility in Social Science: Comparing Prompt-Based and Agent-Based Approaches

Reproducing computational research is often assumed to be as simple as rerunning the original code with provided data. In practice, missing packages, fragile fi...

#reproducibility #large-language-models #AI-agents #prompt-engineering #docker
2 months ago · ai · - · -

[Paper] TreeTensor: Boost AI System on Nested Data with Constrained Tree-Like Tensor

Tensor is the most basic and essential data structure of nowadays artificial intelligence (AI) system. The natural properties of Tensor, especially the memory-c...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Do physics-informed neural networks (PINNs) need to be deep? Shallow PINNs using the Levenberg-Marquardt algorithm

This work investigates the use of shallow physics-informed neural networks (PINNs) for solving forward and inverse problems of nonlinear partial differential eq...

#physics-informed neural networks #Levenberg-Marquardt #shallow networks #PDE solving #machine learning research
2 months ago · ai · - · -

[Paper] Do physics-informed neural networks (PINNs) need to be deep? Shallow PINNs using the Levenberg-Marquardt algorithm

This work investigates the use of shallow physics-informed neural networks (PINNs) for solving forward and inverse problems of nonlinear partial differential eq...

#physics-informed neural networks #Levenberg-Marquardt optimizer #shallow PINNs #PDE solving with deep learning #second-order optimization
2 months ago · ai · - · -

[Paper] A Multi-objective Evolutionary Algorithm Based on Bi-population with Uniform Sampling for Neural Architecture Search

Neural architecture search (NAS) automates neural network design, improving efficiency over manual approaches. However, efficiently discovering high-performance...

#neural architecture search #evolutionary algorithms #multi-objective optimization #NAS research #machine learning
2 months ago · ai · - · -

[Paper] RIFLE: Robust Distillation-based FL for Deep Model Deployment on Resource-Constrained IoT Networks

Federated learning (FL) is a decentralized learning paradigm widely adopted in resource-constrained Internet of Things (IoT) environments. These devices, typica...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Modalities, a PyTorch-native Framework For Large-scale LLM Training and Research

Today's LLM (pre-) training and research workflows typically allocate a significant amount of compute to large-scale ablation studies. Despite the substantial c...

#research #paper #ai #machine-learning
2 months ago · devops · - · -

[Paper] Towards CXL Resilience to CPU Failures

Compute Express Link (CXL) 3.0 and beyond allows the compute nodes of a cluster to share data with hardware cache coherence and at the granularity of a cache li...

#research #paper #devops
2 months ago · devops · - · -

[Paper] HEAL: Online Incremental Recovery for Leaderless Distributed Systems Across Persistency Models

Ensuring resilience in distributed systems has become an acute concern. In today's environment, it is crucial to develop light-weight mechanisms that recover a ...

#distributed systems #leaderless recovery #incremental recovery #persistent memory #cloud infrastructure
2 months ago · devops · - · -

[Paper] Fork, Explore, Commit: OS Primitives for Agentic Exploration

AI agents increasingly perform agentic exploration: pursuing multiple solution paths in parallel and committing only the successful one. Because each exploratio...

#operating-system #branchfs #copy-on-write #system-call #agentic-exploration
2 months ago · devops · - · -

[Paper] ZipFlow: a Compiler-based Framework to Unleash Compressed Data Movement for Modern GPUs

In GPU-accelerated data analytics, the overhead of data transfer from CPU to GPU becomes a performance bottleneck when the data scales beyond GPU memory capacit...

#research #paper #devops
2 months ago · ai · - · -

[Paper] The CAPSARII Approach to Cyber-Secure Wearable, Ultra-Low-Power Networked Sensors for Soldier Health Monitoring

The European Defence Agency's revised Capability Development Plan (CDP) identifies as a priority improving ground combat capabilities by enhancing soldiers' equ...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Rethinking Latency Denial-of-Service: Attacking the LLM Serving Framework, Not the Model

Large Language Models face an emerging and critical threat known as latency attacks. Because LLM inference is inherently expensive, even modest slowdowns can tr...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Direct Soft-Policy Sampling via Langevin Dynamics

Soft policies in reinforcement learning define policies as Boltzmann distributions over state-action value functions, providing a principled mechanism for balan...

#reinforcement-learning #langevin-dynamics #soft-policy #q-learning #continuous-control
2 months ago · ai · - · -

[Paper] Orchestrating Attention: Bringing Harmony to the 'Chaos' of Neurodivergent Learning States

Adaptive learning systems optimize content delivery based on performance metrics but ignore the dynamic attention fluctuations that characterize neurodivergent ...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Emergent Misalignment is Easy, Narrow Misalignment is Hard

Finetuning large language models on narrowly harmful datasets can cause them to become emergently misaligned, giving stereotypically `evil' responses across div...

#LLM alignment #emergent misalignment #fine‑tuning #linear probe #AI safety research
2 months ago · ai · - · -

[Paper] LQA: A Lightweight Quantized-Adaptive Framework for Vision-Language Models on the Edge

Deploying Vision-Language Models (VLMs) on edge devices is challenged by resource constraints and performance degradation under distribution shifts. While test-...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Evaluating and Calibrating LLM Confidence on Questions with Multiple Correct Answers

Confidence calibration is essential for making large language models (LLMs) reliable, yet existing training-free methods have been primarily studied under singl...

#LLM confidence calibration #multiple-answer evaluation #MACE benchmark #semantic confidence aggregation #AI research
2 months ago · ai · - · -

[Paper] TodoEvolve: Learning to Architect Agent Planning Systems

Planning has become a central capability for contemporary agent systems in navigating complex, long-horizon tasks, yet existing approaches predominantly rely on...

#research #paper #ai #machine-learning #nlp
2 months ago · ai · - · -

[Paper] SPD-Faith Bench: Diagnosing and Improving Faithfulness in Chain-of-Thought for Multimodal Large Language Models

Chain-of-Thought reasoning is widely used to improve the interpretability of multimodal large language models (MLLMs), yet the faithfulness of the generated rea...

#research #paper #ai #machine-learning #nlp #computer-vision
2 months ago · ai · - · -

[Paper] Data Darwinism Part I: Unlocking the Value of Scientific Data for Pre-training

Data quality determines foundation model performance, yet systematic processing frameworks are lacking. We introduce Data Darwinism, a ten-level taxonomy (L0-L9...

#large-language-models #data-curation #pretraining #scientific-data #taxonomy
2 months ago · ai · - · -

[Paper] LLMs Know More About Numbers than They Can Say

Although state-of-the-art LLMs can solve math problems, we find that they make errors on numerical comparisons with mixed notation: 'Which is larger, 5.7 times ...

#research #paper #ai #nlp
2 months ago · ai · - · -

[Paper] Approximating Matrix Functions with Deep Neural Networks and Transformers

Transformers have revolutionized natural language processing, but their use for numerical computation has received less attention. We study the approximation of...

#matrix functions #transformers #neural network approximation #numerical linear algebra
2 months ago · ai · - · -

[Paper] Generative structural elucidation from mass spectra as an iterative optimization problem

Liquid chromatography tandem mass spectrometry (LC-MS/MS) is a critical analytical technique for molecular identification across metabolomics, environmental che...

#mass spectrometry #metabolite identification #genetic algorithm #spectral simulation #machine learning
2 months ago · ai · - · -

[Paper] On the Infinite Width and Depth Limits of Predictive Coding Networks

Predictive coding (PC) is a biologically plausible alternative to standard backpropagation (BP) that minimises an energy function with respect to network activi...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Optimizing Chlorination in Water Distribution Systems via Surrogate-assisted Neuroevolution

Ensuring the microbiological safety of large, heterogeneous water distribution systems (WDS) typically requires managing appropriate levels of disinfectant resi...

#research #paper #ai
2 months ago · ai · - · -

[Paper] Evolving LLM-Derived Control Policies for Residential EV Charging and Vehicle-to-Grid Energy Optimization

This research presents a novel application of Evolutionary Computation to the domain of residential electric vehicle (EV) energy management. While reinforcement...

#large-language-models #evolutionary-computation #electric-vehicle-charging #vehicle-to-grid #policy-synthesis
2 months ago · ai · - · -

[Paper] MedMO: Grounding and Understanding Multimodal Large Language Model for Medical Images

Multimodal large language models (MLLMs) have rapidly advanced, yet their adoption in medicine remains limited by gaps in domain coverage, modality alignment, a...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] Learning a Generative Meta-Model of LLM Activations

Existing approaches for analyzing neural network activations, such as PCA and sparse autoencoders, rely on strong structural assumptions. Generative models offe...

#LLM interpretability #diffusion models #activation steering #meta‑model research
2 months ago · ai · - · -

[Paper] InftyThink+: Effective and Efficient Infinite-Horizon Reasoning via Reinforcement Learning

Large reasoning models achieve strong performance by scaling inference-time chain-of-thought, but this paradigm suffers from quadratic cost, context length limi...

#research #paper #ai #machine-learning #nlp
2 months ago · ai · - · -

[Paper] CineScene: Implicit 3D as Effective Scene Representation for Cinematic Video Generation

Cinematic video production requires control over scene-subject composition and camera movement, but live-action shooting remains costly due to the need for cons...

#video generation #diffusion models #implicit 3D representation #computer vision #scene encoding
2 months ago · ai · - · -

[Paper] Improving Credit Card Fraud Detection with an Optimized Explainable Boosting Machine

Addressing class imbalance is a central challenge in credit card fraud detection, as it directly impacts predictive reliability in real-world financial systems....

#fraud detection #explainable AI #boosting machine #hyperparameter optimization #credit‑card fraud
2 months ago · ai · - · -

[Paper] DAWN: Dependency-Aware Fast Inference for Diffusion LLMs

Diffusion large language models (dLLMs) have shown advantages in text generation, particularly due to their inherent ability for parallel decoding. However, con...

#diffusion-LLM #fast inference #dependency-aware decoding #parallel generation #research paper
2 months ago · ai · - · -

[Paper] DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos

Being able to simulate the outcomes of actions in varied environments will revolutionize the development of generalist agents at scale. However, modeling these ...

#robotics #self-supervised learning #world model #video pretraining #computer vision
2 months ago · ai · - · -

[Paper] Agentic Uncertainty Reveals Agentic Overconfidence

Can AI agents predict whether they will succeed at a task? We study agentic uncertainty by eliciting success probability estimates before, during, and after tas...

#agentic uncertainty #model calibration #confidence estimation #AI agents #benchmark
2 months ago · it · - · -

[Paper] Distributed Knowledge in Simplicial Models

The usual semantics of multi-agent epistemic logic is based on Kripke models, defined in terms of binary relations on a set of possible worlds. Recently, there ...

#distributed computing #epistemic logic #simplicial complexes #consensus algorithms #theoretical computer science
2 months ago · ai · - · -

[Paper] Optimal Derivative Feedback Control for an Active Magnetic Levitation System: An Experimental Study on Data-Driven Approaches

This paper presents the design and implementation of data-driven optimal derivative feedback controllers for an active magnetic levitation system. A direct, mod...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Optimal Turkish Subword Strategies at Scale: Systematic Evaluation of Data, Vocabulary, Morphology Interplay

Tokenization is a pivotal design choice for neural language modeling in morphologically rich languages (MRLs) such as Turkish, where productive agglutination ch...

#research #paper #ai #machine-learning #nlp
2 months ago · ai · - · -

[Paper] Endogenous Resistance to Activation Steering in Language Models

Large language models can resist task-misaligned activation steering during inference, sometimes recovering mid-generation to produce improved responses even wh...

#large-language-models #activation-steering #self-correction #sparse-autoencoders #LLM-research

Newer posts

Older posts