Source

arXiv

5856 posts from this source

4 months ago · ai

[Paper] Do Generalisation Results Generalise?

A large language model's (LLM's) out-of-distribution (OOD) generalisation ability is crucial to its deployment. Previous work assessing LLMs' generalisation per...

#research #paper #ai #machine-learning #nlp
4 months ago · ai

[Paper] UnityVideo: Unified Multi-Modal Multi-Task Learning for Enhancing World-Aware Video Generation

Recent video generation models demonstrate impressive synthesis capabilities but remain limited by single-modality conditioning, constraining their holistic wor...

#research #paper #ai #computer-vision
4 months ago · ai

[Paper] One Layer Is Enough: Adapting Pretrained Visual Encoders for Image Generation

Visual generative models (e.g., diffusion models) typically operate in compressed latent spaces to balance training efficiency and sample quality. In parallel, ...

#research #paper #ai #machine-learning #computer-vision
4 months ago · ai

[Paper] The Adoption and Usage of AI Agents: Early Evidence from Perplexity

This paper presents the first large-scale field study of the adoption, usage intensity, and use cases of general-purpose AI agents operating in open-world web e...

#research #paper #ai #machine-learning
4 months ago · ai

[Paper] An Adaptive Multi-Layered Honeynet Architecture for Threat Behavior Analysis via Deep Learning

The escalating sophistication and variety of cyber threats have rendered static honeypots inadequate, necessitating adaptive, intelligence-driven deception. In ...

#research #paper #ai #machine-learning
4 months ago · ai

[Paper] OpenVE-3M: A Large-Scale High-Quality Dataset for Instruction-Guided Video Editing

The quality and diversity of instruction-based image editing datasets are continuously increasing, yet large-scale, high-quality datasets for instruction-based ...

#research #paper #ai #computer-vision
4 months ago · software

[Paper] Studying the Role of Reusing Crowdsourcing Knowledge in Software Development

Crowdsourcing platforms, such as Stack Overflow, have changed and impacted the software development practice. In these platforms, developers share and reuse the...

#research #paper #software
4 months ago · ai

[Paper] WorldReel: 4D Video Generation with Consistent Geometry and Motion Modeling

Recent video generators achieve striking photorealism, yet remain fundamentally inconsistent in 3D. We present WorldReel, a 4D video generator that is natively ...

#research #paper #ai #machine-learning #computer-vision
4 months ago · ai

[Paper] Graph-Based Learning of Spectro-Topographical EEG Representations with Gradient Alignment for Brain-Computer Interfaces

We present a novel graph-based learning of EEG representations with gradient alignment (GEEGA) that leverages multi-domain information to learn EEG representati...

#research #paper #ai #machine-learning
4 months ago · ai

[Paper] Provable Long-Range Benefits of Next-Token Prediction

Why do modern language models, trained to do well on next-word prediction, appear to generate coherent documents and capture long-range structure? Here we show ...

#research #paper #ai #machine-learning
4 months ago · ai

[Paper] Understanding Privacy Risks in Code Models Through Training Dynamics: A Causal Approach

Large language models for code (LLM4Code) have greatly improved developer productivity but also raise privacy concerns due to their reliance on open-source repo...

#research #paper #ai #machine-learning
4 months ago · ai

[Paper] Auditing Games for Sandbagging

Future AI systems could conceal their capabilities ('sandbagging') during evaluations, potentially misleading developers and auditors. We stress-tested sandbagg...

#research #paper #ai #machine-learning
4 months ago · ai

[Paper] LUNA: LUT-Based Neural Architecture for Fast and Low-Cost Qubit Readout

Qubit readout is a critical operation in quantum computing systems, which maps the analog response of qubits into discrete classical states. Deep neural network...

#research #paper #ai #machine-learning
4 months ago · ai

[Paper] Lang3D-XL: Language Embedded 3D Gaussians for Large-scale Scenes

Embedding a language field in a 3D representation enables richer semantic understanding of spatial environments by linking geometry with descriptive meaning. Th...

#research #paper #ai #computer-vision
4 months ago · ai

[Paper] Multi-view Pyramid Transformer: Look Coarser to See Broader

We propose Multi-view Pyramid Transformer (MVP), a scalable multi-view transformer architecture that directly reconstructs large 3D scenes from tens to hundreds...

#research #paper #ai #computer-vision
4 months ago · ai

[Paper] Group Representational Position Encoding

We present GRAPE (Group RepresentAtional Position Encoding), a unified framework for positional encoding based on group actions. GRAPE brings together two famil...

#research #paper #ai #machine-learning #nlp
4 months ago · ai

[Paper] OneStory: Coherent Multi-Shot Video Generation with Adaptive Memory

Storytelling in real-world videos often unfolds through multiple shots -- discontinuous yet semantically connected clips that together convey a coherent narrati...

#research #paper #ai #computer-vision
4 months ago · ai

[Paper] Collaborative Causal Sensemaking: Closing the Complementarity Gap in Human-AI Decision Support

LLM-based agents are rapidly being plugged into expert decision-support, yet in messy, high-stakes settings they rarely make the team smarter: human-AI teams of...

#research #paper #ai #machine-learning #nlp
4 months ago · devops

[Paper] Quantifying the Carbon Reduction of DAG Workloads: A Job Shop Scheduling Perspective

Carbon-aware schedulers aim to reduce the operational carbon footprint of data centers by running flexible workloads during periods of low carbon intensity. Mos...

#research #paper #devops
4 months ago · ai

[Paper] Large Causal Models from Large Language Models

We introduce a new paradigm for building large causal models (LCMs) that exploits the enormous potential latent in today's large language models (LLMs). We desc...

#research #paper #ai #machine-learning
4 months ago · ai

[Paper] ReasonBENCH: Benchmarking the (In)Stability of LLM Reasoning

Large language models (LLMs) are increasingly deployed in settings where reasoning, such as multi-step problem solving and chain-of-thought, is essential. Yet, ...

#research #paper #ai #machine-learning #nlp
4 months ago · devops

[Paper] Designing Co-operation in Systems of Hierarchical, Multi-objective Schedulers for Stream Processing

Stream processing is a computing paradigm that supports real-time data processing for a wide variety of applications. At Meta, it's used across the company for ...

#research #paper #devops
4 months ago · ai

[Paper] On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models

Recent reinforcement learning (RL) techniques have yielded impressive reasoning improvements in language models, yet it remains unclear whether post-training tr...

#research #paper #ai #nlp
4 months ago · ai

[Paper] Distribution Matching Variational AutoEncoder

Most visual generative models compress images into a latent space before applying diffusion or autoregressive modelling. Yet, existing approaches such as VAEs a...

#research #paper #ai #computer-vision
4 months ago · ai

[Paper] Mary, the Cheeseburger-Eating Vegetarian: Do LLMs Recognize Incoherence in Narratives?

Leveraging a dataset of paired narratives, we investigate the extent to which large language models (LLMs) can reliably separate incoherent and coherent stories...

#research #paper #ai #nlp
4 months ago · devops

[Paper] A Performance Analyzer for a Public Cloud's ML-Augmented VM Allocator

Many operational cloud systems use one or more machine learning models that help them achieve better efficiency and performance. But operators do not have tools...

#research #paper #devops
4 months ago · ai

[Paper] Automated Generation of Custom MedDRA Queries Using SafeTerm Medical Map

In pre-market drug safety review, grouping related adverse event terms into standardised MedDRA queries or the FDA Office of New Drugs Custom Medical Queries (O...

#research #paper #ai #nlp
5 months ago · ai

[Paper] HalluShift++: Bridging Language and Vision through Internal Representation Shifts for Hierarchical Hallucinations in MLLMs

Multimodal Large Language Models (MLLMs) have demonstrated remarkable capabilities in vision-language understanding tasks. While these models often produce ling...

#research #paper #ai #nlp #computer-vision
5 months ago · ai

[Paper] When Large Language Models Do Not Work: Online Incivility Prediction through Graph Neural Networks

Online incivility has emerged as a widespread and persistent problem in digital communities, imposing substantial social and psychological burdens on users. Alt...

#research #paper #ai #machine-learning #nlp
5 months ago · ai

[Paper] Bridging Code Graphs and Large Language Models for Better Code Understanding

Large Language Models (LLMs) have demonstrated remarkable performance in code intelligence tasks such as code generation, summarization, and translation. Howeve...

#research #paper #ai #nlp
5 months ago · software

[Paper] Reliable agent engineering should integrate machine-compatible organizational principles

As AI agents built on large language models (LLMs) become increasingly embedded in society, issues of coordination, control, delegation, and accountability are ...

#research #paper #software
5 months ago · ai

[Paper] Algorithm-hardware co-design of neuromorphic networks with dual memory pathways

Spiking neural networks excel at event-driven sensing yet maintaining task-relevant context over long timescales. However, building these networks in hardware re...

#research #paper #ai
5 months ago · devops

[Paper] Bandwidth-Aware Network Topology Optimization for Decentralized Learning

Network topology is critical for efficient parameter synchronization in distributed learning over networks. However, most existing studies do not account for ba...

#research #paper #devops
5 months ago · software

[Paper] VP-AutoTest: A Virtual-Physical Fusion Autonomous Driving Testing Platform

The rapid development of autonomous vehicles has led to a surge in testing demand. Traditional testing methods, such as virtual simulation, closed-course, and p...

#research #paper #software
5 months ago · ai

[Paper] AutoICE: Automatically Synthesizing Verifiable C Code via LLM-driven Evolution

Automatically synthesizing verifiable code from natural language requirements ensures software correctness and reliability while significantly lowering the barr...

#research #paper #ai #machine-learning
5 months ago · ai

[Paper] How Do LLMs Fail In Agentic Scenarios? A Qualitative Analysis of Success and Failure Scenarios of Various LLMs in Agentic Simulations

We investigate how large language models (LLMs) fail when operating as autonomous agents with tool-use capabilities. Using the Kamiwaza Agentic Merit Index (KAM...

#research #paper #ai #machine-learning
5 months ago · ai

[Paper] KAN-Dreamer: Benchmarking Kolmogorov-Arnold Networks as Function Approximators in World Models

DreamerV3 is a state-of-the-art online model-based reinforcement learning (MBRL) algorithm known for remarkable sample efficiency. Concurrently, Kolmogorov-Arno...

#research #paper #ai #machine-learning #computer-vision
5 months ago · software

[Paper] Systematic Evaluation of Black-Box Checking for Fast Bug Detection

Combinations of active automata learning, model-based testing and model checking have been successfully used in numerous applications, e.g., for spotting bugs i...

#research #paper #software
5 months ago · ai

[Paper] Do LLMs Trust the Code They Write?

Despite the effectiveness of large language models (LLMs) for code generation, they often output incorrect code. One reason is that model output probabilities a...

#research #paper #ai #machine-learning
5 months ago · devops

[Paper] Otus Supercomputer

Otus is a high-performance computing cluster that was launched in 2025 and is operated by the Paderborn Center for Parallel Computing (PC2) at Paderborn Univers...

#research #paper #devops
5 months ago · ai

[Paper] From sparse recovery to plug-and-play priors, understanding trade-offs for stable recovery with generalized projected gradient descent

We consider the problem of recovering an unknown low-dimensional vector from noisy, underdetermined observations. We focus on the Generalized Projected Gradient...

#research #paper #ai
5 months ago · software

[Paper] Challenges in Developing Secure Software -- Results of an Interview Study in the German Software Industry

The damage caused by cybercrime makes the development of secure software inevitable. Although many tools and frameworks exist to support the development of secu...

#research #paper #software
5 months ago · ai

[Paper] An Asynchronous Mixed-Signal Resonate-and-Fire Neuron

Analog computing at the edge is an emerging strategy to limit data storage and transmission requirements, as well as energy consumption, and its practical imple...

#research #paper #ai
5 months ago · devops

[Paper] Communication-Efficient Serving for Video Diffusion Models with Latent Parallelism

Video diffusion models (VDMs) perform attention computation over the 3D spatio-temporal domain. Compared to large language models (LLMs) processing 1D sequences...

#research #paper #devops
5 months ago · ai

[Paper] Venus: An Efficient Edge Memory-and-Retrieval System for VLM-based Online Video Understanding

Vision-language models (VLMs) have demonstrated impressive multimodal comprehension capabilities and are being deployed in an increasing number of online video ...

#research #paper #ai #machine-learning
5 months ago · ai

[Paper] DCO: Dynamic Cache Orchestration for LLM Accelerators through Predictive Management

The rapid adoption of large language models (LLMs) is pushing AI accelerators toward increasingly powerful and specialized designs. Instead of further complicat...

#research #paper #ai #machine-learning
5 months ago · devops

[Paper] ContinuumConductor: Decentralized Process Mining on the Edge-Cloud Continuum

Process mining traditionally assumes centralized event data collection and analysis. However, modern Industrial Internet of Things systems increasingly operate ...

#research #paper #devops
5 months ago · ai

[Paper] Synchrony-Gated Plasticity with Dopamine Modulation for Spiking Neural Networks

While surrogate backpropagation proves useful for training deep spiking neural networks (SNNs), incorporating biologically inspired local signals on a large sca...

#research #paper #ai