Source

arXiv

5644 posts from this source

Sort:

1 month ago · ai · - · -

[Paper] Mamba-3: Improved Sequence Modeling using State Space Principles

Scaling inference-time compute has emerged as an important driver of LLM performance, making inference efficiency a central focus of model design alongside mode...

#research #paper #ai #machine-learning
1 month ago · ai · - · -

[Paper] Estimating Staged Event Tree Models via Hierarchical Clustering on the Simplex

Staged tree models enhance Bayesian networks by incorporating context-specific dependencies through a stage-based structure. In this study, we present a new fra...

#research #paper #ai #machine-learning
1 month ago · ai · - · -

[Paper] Lore: Repurposing Git Commit Messages as a Structured Knowledge Protocol for AI Coding Agents

As AI coding agents become both primary producers and consumers of source code, the software industry faces an accelerating loss of institutional knowledge. Eac...

#research #paper #ai #machine-learning
1 month ago · ai · - · -

[Paper] The PokeAgent Challenge: Competitive and Long-Context Learning at Scale

We present the PokeAgent Challenge, a large-scale benchmark for decision-making research built on Pokemon's multi-agent battle system and expansive role-playing...

#research #paper #ai #machine-learning
1 month ago · software · - · -

[Paper] Probabilistic Model Checking Taken by Storm

This tutorial paper presents a hands-on perspective on probabilistic model checking with the Storm model checker. Storm is a decade-old model checker that excel...

#research #paper #software
1 month ago · ai · - · -

[Paper] Can LLMs Model Incorrect Student Reasoning? A Case Study on Distractor Generation

Modeling plausible student misconceptions is critical for AI in education. In this work, we examine how large language models (LLMs) reason about misconceptions...

#research #paper #ai #machine-learning #nlp
1 month ago · devops · - · -

[Paper] DUET: Disaggregated Hybrid Mamba-Transformer LLMs with Prefill and Decode-Specific Packages

Large language models operate in distinct compute-bound prefill followed by memory bandwidth-bound decode phases. Hybrid Mamba-Transformer models inherit this a...

#research #paper #devops
1 month ago · ai · - · -

[Paper] SlovKE: A Large-Scale Dataset and LLM Evaluation for Slovak Keyphrase Extraction

Keyphrase extraction for morphologically rich, low-resource languages remains understudied, largely due to the scarcity of suitable evaluation datasets. We addr...

#research #paper #ai #machine-learning #nlp
1 month ago · ai · - · -

[Paper] Beyond the Covariance Trap: Unlocking Generalization in Same-Subject Knowledge Editing for Large Language Models

While locate-then-edit knowledge editing efficiently updates knowledge encoded within Large Language Models (LLMs), a critical generalization failure mode emerg...

#research #paper #ai #nlp
1 month ago · ai · - · -

[Paper] ViX-Ray: A Vietnamese Chest X-Ray Dataset for Vision-Language Models

Vietnamese medical research has become an increasingly vital domain, particularly with the rise of intelligent technologies aimed at reducing time and resource ...

#research #paper #ai #nlp
1 month ago · devops · - · -

[Paper] Cuckoo-GPU: Accelerating Cuckoo Filters on Modern GPUs

Approximate Membership Query (AMQ) structures are essential for high-throughput systems in databases, networking, and bioinformatics. While Bloom filters offer ...

#research #paper #devops
1 month ago · software · - · -

[Paper] Formalisms for Robotic Mission Specification and Execution: A Comparative Analysis

Robots are increasingly deployed across diverse domains and designed for multi-purpose operation. As robotic systems grow in complexity and operate in dynamic e...

#research #paper #software
1 month ago · ai · - · -

[Paper] Invisible failures in human-AI interactions

AI systems fail silently far more often than they fail visibly. In a large-scale quantitative analysis of human-AI interactions from the WildChat dataset, we fi...

#research #paper #ai #nlp
1 month ago · ai · - · -

[Paper] SWE-Skills-Bench: Do Agent Skills Actually Help in Real-World Software Engineering?

Agent skills, structured procedural knowledge packages injected at inference time, are increasingly used to augment LLM agents on software engineering tasks. Ho...

#research #paper #ai #machine-learning
1 month ago · devops · - · -

[Paper] Multi-Objective Load Balancing for Heterogeneous Edge-Based Object Detection Systems

The rapid proliferation of the Internet of Things (IoT) and smart applications has led to a surge in data generated by distributed sensing devices. Edge computi...

#research #paper #devops
1 month ago · software · - · -

[Paper] Formalizing and validating properties in Asmeta with Large Language Models (Extended Abstract)

Writing temporal logic properties is often a challenging task for users of model-based development frameworks, particularly when translating informal requiremen...

#research #paper #software
1 month ago · ai · - · -

[Paper] SKILLS: Structured Knowledge Injection for LLM-Driven Telecommunications Operations

As telecommunications operators accelerate adoption of AI-enabled automation, a practical question remains unresolved: can general-purpose large language model ...

#research #paper #ai #machine-learning
1 month ago · software · - · -

[Paper] To be FAIR or RIGHT? Methodological [R]esearch [I]ntegrity [G]iven [H]uman-facing [T]echnologies using the example of Learning Technologies

Quality assessment of Research Software Engineering (RSE) plays an important role in all scientific fields. From the canonical three criteria (reliability, vali...

#research #paper #software
1 month ago · software · - · -

[Paper] The Impact of AI-Assisted Development on Software Security: A Study of Gemini and Developer Experience

The ongoing shortage of skilled developers, particularly in security-critical software development, has led organizations to increasingly adopt AI-powered devel...

#research #paper #software
1 month ago · ai · - · -

[Paper] Towards Foundation Models for Consensus Rank Aggregation

Aggregating a consensus ranking from multiple input rankings is a fundamental problem with applications in recommendation systems, search engines, job recruitme...

#research #paper #ai #machine-learning
1 month ago · devops · - · -

[Paper] LMetric: Simple is Better - Multiplication May Be All You Need for LLM Request Scheduling

High-quality LLM request scheduling requires achieving two key objectives: whether the routed instance has KV to accelerate the request execution and whether th...

#research #paper #devops
1 month ago · ai · - · -

[Paper] CATFormer: When Continual Learning Meets Spiking Transformers With Dynamic Thresholds

Although deep neural networks perform extremely well in controlled environments, they fail in real-world scenarios where data isn't available all at once, and t...

#research #paper #ai #machine-learning
1 month ago · ai · - · -

[Paper] Token Coherence: Adapting MESI Cache Protocols to Minimize Synchronization Overhead in Multi-Agent LLM Systems

Multi-agent LLM orchestration incurs synchronization costs scaling as O(n x S x |D|) in agents, steps, and artifact size under naive broadcast -- a regime I ter...

#research #paper #ai #machine-learning
1 month ago · ai · - · -

[Paper] To See is Not to Master: Teaching LLMs to Use Private Libraries for Code Generation

Large Language Models (LLMs) have shown strong potential for code generation, yet they remain limited in private-library-oriented code generation, where the goa...

#research #paper #ai #machine-learning #nlp
1 month ago · devops · - · -

[Paper] Guaranteeing Semantic and Performance Determinism in Flexible GPU Sharing

GPU sharing is critical for maximizing hardware utilization in modern data centers. However, existing approaches present a stark trade-off: coarse-grained tempo...

#research #paper #devops
1 month ago · ai · - · -

[Paper] PCodeTrans: Translate Decompiled Pseudocode to Compilable and Executable Equivalent

Decompilation is foundational to binary analysis, yet conventional tools prioritize human readability over strict recompilability and verifiable runtime correct...

#research #paper #ai #machine-learning
1 month ago · ai · - · -

[Paper] Real-Time Driver Safety Scoring Through Inverse Crash Probability Modeling

Road crashes remain a leading cause of preventable fatalities. Existing prediction models predominantly produce binary outcomes, which offer limited actionable ...

#research #paper #ai #machine-learning
1 month ago · devops · - · -

[Paper] Protecting Distributed Blockchain with Twin-Field Quantum Key Distribution: A Quantum Resistant Approach

Quantum computing provides the feasible multi-layered security challenges to classical blockchain systems. Whereas, quantum-secured blockchains relied on quantu...

#research #paper #devops
1 month ago · software · - · -

[Paper] Counterexample Guided Branching via Directional Relaxation Analysis in Complete Neural Network Verification

Deep Neural Networks demonstrate exceptional performance but remain vulnerable to adversarial perturbations, necessitating formal verification for safety-critic...

#research #paper #software
1 month ago · ai · - · -

[Paper] SimCert: Probabilistic Certification for Behavioral Similarity in Deep Neural Network Compression

Deploying Deep Neural Networks (DNNs) on resource-constrained embedded systems requires aggressive model compression techniques like quantization and pruning. H...

#research #paper #ai #machine-learning
1 month ago · ai · - · -

[Paper] Fold-CP: A Context Parallelism Framework for Biomolecular Modeling

Understanding cellular machinery requires atomic-scale reconstruction of large biomolecular assemblies. However, predicting the structures of these systems has ...

#research #paper #ai #machine-learning
1 month ago · ai · - · -

[Paper] Knowledge Activation: AI Skills as the Institutional Knowledge Primitive for Agentic Software Development

Enterprise software organizations accumulate critical institutional knowledge - architectural decisions, deployment procedures, compliance policies, incident pl...

#research #paper #ai #machine-learning
1 month ago · ai · - · -

[Paper] DeFRiS: Silo-Cooperative IoT Applications Scheduling via Decentralized Federated Reinforcement Learning

Next-generation IoT applications increasingly span across autonomous administrative entities, necessitating silo-cooperative scheduling to leverage diverse comp...

#research #paper #ai #machine-learning
1 month ago · ai · - · -

[Paper] Beyond Local Code Optimization: Multi-Agent Reasoning for Software System Optimization

Large language models and AI agents have recently shown promise in automating software performance optimization, but existing approaches predominantly rely on l...

#research #paper #ai #machine-learning
1 month ago · devops · - · -

[Paper] Can you keep a secret? A new protocol for sender-side enforcement of causal message delivery

Protocols for causal message delivery are widely used in distributed systems. Traditionally, causal delivery can be enforced either on the message sender's side...

#research #paper #devops
1 month ago · ai · - · -

[Paper] MorphSNN: Adaptive Graph Diffusion and Structural Plasticity for Spiking Neural Networks

Spiking Neural Networks (SNNs) currently face a critical bottleneck: while individual neurons exhibit dynamic biological properties, their macro-scopic architec...

#research #paper #ai
1 month ago · ai · - · -

[Paper] Multifidelity Surrogate Modeling of Depressurized Loss of Forced Cooling in High-temperature Gas Reactors

High-fidelity computational fluid dynamics (CFD) simulations are widely used to analyze nuclear reactor transients, but are computationally expensive when explo...

#research #paper #ai #machine-learning
1 month ago · ai · - · -

[Paper] Solving physics-constrained inverse problems with conditional flow matching

This study presents a conditional flow matching framework for solving physics-constrained Bayesian inverse problems. In this setting, samples from the joint dis...

#research #paper #ai #machine-learning
1 month ago · ai · - · -

[Paper] DualSwinFusionSeg: Multimodal Martian Landslide Segmentation via Dual Swin Transformer with Multi-Scale Fusion and UNet++

Automated segmentation of Martian landslides, particularly in tectonically active regions such as Valles Marineris,is important for planetary geology, hazard as...

#research #paper #ai #machine-learning #computer-vision
1 month ago · ai · - · -

[Paper] Is the reconstruction loss culprit? An attempt to outperform JEPA

We evaluate JEPA-style predictive representation learning versus reconstruction-based autoencoders on a controlled 'TV-series' linear dynamical system with know...

#research #paper #ai #machine-learning
1 month ago · ai · - · -

[Paper] The GELATO Dataset for Legislative NER

This paper introduces GELATO (Government, Executive, Legislative, and Treaty Ontology), a dataset of U.S. House and Senate bills from the 118th Congress annotat...

#research #paper #ai #nlp
1 month ago · ai · - · -

[Paper] Diffusion Reinforcement Learning via Centered Reward Distillation

Diffusion and flow models achieve State-Of-The-Art (SOTA) generative performance, yet many practically important behaviors such as fine-grained prompt fidelity,...

#research #paper #ai #machine-learning #computer-vision
1 month ago · ai · - · -

[Paper] Implementation and discussion of the Pith Estimation on Rough Log End Images using Local Fourier Spectrum Analysis method

In this article, we analyze and propose a Python implementation of the method 'Pith Estimation on Rough Log End images using Local Fourier Spectrum Analysis', b...

#research #paper #ai #computer-vision
1 month ago · ai · - · -

[Paper] The Institutional Scaling Law: Non-Monotonic Fitness, Capability-Trust Divergence, and Symbiogenetic Scaling in Generative AI

Classical scaling laws model AI performance as monotonically improving with model size. We challenge this assumption by deriving the Institutional Scaling Law, ...

#research #paper #ai #machine-learning
1 month ago · ai · - · -

[Paper] Low-Field Magnetic Resonance Image Enhancement using Undersampled k-Space

Low-field magnetic resonance imaging (MRI) offers a cost-effective alternative for medical imaging in resource-limited settings. However, its widespread adoptio...

#research #paper #ai #computer-vision
1 month ago · ai · - · -

[Paper] Towards Agentic Honeynet Configuration

Honeypots are deception systems that emulate vulnerable services to collect threat intelligence. While deploying many honeypots increases the opportunity to obs...

#research #paper #ai #machine-learning
1 month ago · ai · - · -

[Paper] Low-Field Magnetic Resonance Image Quality Enhancement using Undersampled k-Space and Out-of-Distribution Generalisation

Low-field magnetic resonance imaging (MRI) offers affordable access to diagnostic imaging but faces challenges such as prolonged acquisition times and reduced i...

#research #paper #ai #computer-vision
1 month ago · ai · - · -

[Paper] Improving Visual Reasoning with Iterative Evidence Refinement

Vision language models (VLMs) are increasingly capable of reasoning over images, but robust visual reasoning often requires re-grounding intermediate steps in t...

#research #paper #ai #computer-vision

Newer posts

Older posts