Source

arXiv

5856 posts from this source

Sort:

4 months ago · ai · - · -

[Paper] In Pursuit of Pixel Supervision for Visual Pre-training

At the most basic level, pixels are the source of the visual information through which we perceive the world. Pixels contain information at all levels, ranging ...

#research #paper #ai #computer-vision
4 months ago · ai · - · -

[Paper] DiffusionVL: Translating Any Autoregressive Models into Diffusion Vision Language Models

In recent multimodal research, the diffusion paradigm has emerged as a promising alternative to the autoregressive paradigm (AR), owing to its unique decoding a...

#research #paper #ai #computer-vision
4 months ago · ai · - · -

[Paper] Predictive Concept Decoders: Training Scalable End-to-End Interpretability Assistants

Interpreting the internal activations of neural networks can produce more faithful explanations of their behavior, but is difficult due to the complex structure...

#research #paper #ai #machine-learning #nlp
4 months ago · ai · - · -

[Paper] Gaussian Pixel Codec Avatars: A Hybrid Representation for Efficient Rendering

We present Gaussian Pixel Codec Avatars (GPiCA), photorealistic head avatars that can be generated from multi-view images and efficiently rendered on mobile dev...

#research #paper #ai #computer-vision
4 months ago · ai · - · -

[Paper] Artism: AI-Driven Dual-Engine System for Art Generation and Critique

This paper proposes a dual-engine AI architectural method designed to address the complex problem of exploring potential trajectories in the evolution of art. W...

#research #paper #ai #machine-learning
4 months ago · ai · - · -

[Paper] Multi-View Foundation Models

Foundation models are vital tools in various Computer Vision applications. They take as input a single RGB image and output a deep feature representation that i...

#research #paper #ai #computer-vision
4 months ago · ai · - · -

[Paper] GateFusion: Hierarchical Gated Cross-Modal Fusion for Active Speaker Detection

Active Speaker Detection (ASD) aims to identify who is currently speaking in each frame of a video. Most state-of-the-art approaches rely on late fusion to comb...

#research #paper #ai #computer-vision
4 months ago · ai · - · -

[Paper] Learning Model Parameter Dynamics in a Combination Therapy for Bladder Cancer from Sparse Biological Data

In a mathematical model of interacting biological organisms, where external interventions may alter behavior over time, traditional models that assume fixed par...

#research #paper #ai #machine-learning
4 months ago · ai · - · -

[Paper] Dynamic Rebatching for Efficient Early-Exit Inference with DREX

Early-Exit (EE) is a Large Language Model (LLM) architecture that accelerates inference by allowing easier tokens to be generated using only a subset of the mod...

#research #paper #ai #machine-learning
4 months ago · ai · - · -

[Paper] End-to-End Training for Autoregressive Video Diffusion via Self-Resampling

Autoregressive video diffusion models hold promise for world simulation but are vulnerable to exposure bias arising from the train-test mismatch. While recent w...

#research #paper #ai #computer-vision
4 months ago · ai · - · -

[Paper] VLIC: Vision-Language Models As Perceptual Judges for Human-Aligned Image Compression

Evaluations of image compression performance which include human preferences have generally found that naive distortion functions such as MSE are insufficiently...

#research #paper #ai #computer-vision
4 months ago · ai · - · -

[Paper] FrontierCS: Evolving Challenges for Evolving Intelligence

We introduce FrontierCS, a benchmark of 156 open-ended problems across diverse areas of computer science, designed and reviewed by experts, including CS PhDs an...

#research #paper #ai #machine-learning
4 months ago · ai · - · -

[Paper] Skyra: AI-Generated Video Detection via Grounded Artifact Reasoning

The misuse of AI-driven video generation technologies has raised serious social concerns, highlighting the urgent need for reliable AI-generated video detectors...

#research #paper #ai #computer-vision
4 months ago · ai · - · -

[Paper] mimic-video: Video-Action Models for Generalizable Robot Control Beyond VLAs

Prevailing Vision-Language-Action Models (VLAs) for robotic manipulation are built upon vision-language backbones pretrained on large-scale, but disconnected st...

#research #paper #ai #machine-learning #computer-vision
4 months ago · ai · - · -

[Paper] Multi-Modal Semantic Communication

Semantic communication aims to transmit information most relevant to a task rather than raw data, offering significant gains in communication efficiency for app...

#research #paper #ai #machine-learning
4 months ago · ai · - · -

[Paper] BashArena: A Control Setting for Highly Privileged AI Agents

Future AI agents might run autonomously with elevated privileges. If these agents are misaligned, they might abuse these privileges to cause serious damage. The...

#research #paper #ai #machine-learning
4 months ago · ai · - · -

[Paper] Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning

Reinforcement learning has become essential for strengthening the reasoning abilities of large language models, yet current exploration mechanisms remain fundam...

#research #paper #ai #machine-learning
4 months ago · ai · - · -

[Paper] A Multivariate Statistical Framework for Detection, Classification and Pre-localization of Anomalies in Water Distribution Networks

This paper presents a unified framework, for the detection, classification, and preliminary localization of anomalies in water distribution networks using multi...

#research #paper #ai #machine-learning
4 months ago · ai · - · -

[Paper] High-Dimensional Partial Least Squares: Spectral Analysis and Fundamental Limitations

Partial Least Squares (PLS) is a widely used method for data integration, designed to extract latent components shared across paired high-dimensional datasets. ...

#research #paper #ai #machine-learning
4 months ago · software · - · -

[Paper] A High-level Synthesis Toolchain for the Julia Language

With the push towards Exascale computing and data-driven methods, problem sizes have increased dramatically, increasing the computational requirements of the un...

#research #paper #software
4 months ago · ai · - · -

[Paper] Stylized Synthetic Augmentation further improves Corruption Robustness

This paper proposes a training data augmentation pipeline that combines synthetic image data with neural style transfer in order to address the vulnerability of...

#research #paper #ai #machine-learning #computer-vision
4 months ago · ai · - · -

[Paper] Activation Oracles: Training and Evaluating LLMs as General-Purpose Activation Explainers

Large language model (LLM) activations are notoriously difficult to understand, with most existing techniques using complex, specialized methods for interpretin...

#research #paper #ai #machine-learning #nlp
4 months ago · ai · - · -

[Paper] Explaining the Reasoning of Large Language Models Using Attribution Graphs

Large language models (LLMs) exhibit remarkable capabilities, yet their reasoning remains opaque, raising safety and trust concerns. Attribution methods, which ...

#research #paper #ai #machine-learning #nlp
4 months ago · ai · - · -

[Paper] Stepwise Think-Critique: A Unified Framework for Robust and Interpretable LLM Reasoning

Human beings solve complex problems through critical thinking, where reasoning and evaluation are intertwined to converge toward correct solutions. However, mos...

#research #paper #ai #machine-learning
4 months ago · devops · - · -

[Paper] LeaseGuard: Raft Leases Done Right

Raft is a leading consensus algorithm for replicating writes in distributed databases. However, distributed databases also require consistent reads. To guarante...

#research #paper #devops
4 months ago · ai · - · -

[Paper] PPSEBM: An Energy-Based Model with Progressive Parameter Selection for Continual Learning

Continual learning remains a fundamental challenge in machine learning, requiring models to learn from a stream of tasks without forgetting previously acquired ...

#research #paper #ai #machine-learning #nlp
4 months ago · ai · - · -

[Paper] Characterizing Mamba's Selective Memory using Auto-Encoders

State space models (SSMs) are a promising alternative to transformers for language modeling because they use fixed memory during inference. However, this fixed ...

#research #paper #ai #nlp
4 months ago · ai · - · -

[Paper] VTCBench: Can Vision-Language Models Understand Long Context with Vision-Text Compression?

The computational and memory overheads associated with expanding the context window of LLMs severely limit their scalability. A noteworthy solution is vision-te...

#research #paper #ai #machine-learning #nlp #computer-vision
4 months ago · ai · - · -

[Paper] How Much is Too Much? Exploring LoRA Rank Trade-offs for Retaining Knowledge and Domain Robustness

Large language models are increasingly adapted to downstream tasks through fine-tuning. Full supervised fine-tuning (SFT) and parameter-efficient fine-tuning (P...

#research #paper #ai #machine-learning #nlp
4 months ago · ai · - · -

[Paper] Evaluating Metrics for Safety with LLM-as-Judges

LLMs (Large Language Models) are increasingly used in text processing pipelines to intelligently respond to a variety of inputs and generation tasks. This raise...

#research #paper #ai #machine-learning #nlp
4 months ago · ai · - · -

[Paper] Human-like Working Memory from Artificial Intrinsic Plasticity Neurons

Working memory enables the brain to integrate transient information for rapid decision-making. Artificial networks typically replicate this via recurrent or par...

#research #paper #ai #machine-learning #computer-vision
4 months ago · ai · - · -

[Paper] You Never Know a Person, You Only Know Their Defenses: Detecting Levels of Psychological Defense Mechanisms in Supportive Conversations

Psychological defenses are strategies, often automatic, that people use to manage distress. Rigid or overuse of defenses is negatively linked to mental health a...

#research #paper #ai #nlp
4 months ago · devops · - · -

[Paper] Optimizing Bloom Filters for Modern GPU Architectures

Bloom filters are a fundamental data structure for approximate membership queries, with applications ranging from data analytics to databases and genomics. Seve...

#research #paper #devops
4 months ago · ai · - · -

[Paper] Bolmo: Byteifying the Next Generation of Language Models

We introduce Bolmo, the first family of competitive fully open byte-level language models (LMs) at the 1B and 7B parameter scales. In contrast to prior research...

#research #paper #ai #nlp
4 months ago · software · - · -

[Paper] WuppieFuzz: Coverage-Guided, Stateful REST API Fuzzing

Many business processes currently depend on web services, often using REST APIs for communication. REST APIs expose web service functionality through endpoints,...

#research #paper #software
4 months ago · ai · - · -

[Paper] How Do Semantically Equivalent Code Transformations Impact Membership Inference on LLMs for Code?

The success of large language models for code relies on vast amounts of code data, including public open-source repositories, such as GitHub, and private, confi...

#research #paper #ai #machine-learning
4 months ago · ai · - · -

[Paper] On Assessing the Relevance of Code Reviews Authored by Generative Models

The use of large language models like ChatGPT in code review offers promising efficiency gains but also raises concerns about correctness and safety. Existing e...

#research #paper #ai #machine-learning
4 months ago · software · - · -

[Paper] A Container-based Approach For Proactive Asset Administration Shell Digital Twins

In manufacturing, digital twins, realized as Asset Administration Shells (AAS), have emerged as a prevalent practice. These digital replicas, often utilized as ...

#research #paper #software
4 months ago · software · - · -

[Paper] Insecure Ingredients? Exploring Dependency Update Patterns of Bundled JavaScript Packages on the Web

Reusable software components, typically distributed as packages, are a central paradigm of modern software development. The JavaScript ecosystem serves as a pri...

#research #paper #software
4 months ago · software · - · -

[Paper] Can AI Generate more Comprehensive Test Scenarios? Review on Automated Driving Systems Test Scenario Generation Methods

Ensuring the safety and reliability of Automated Driving Systems (ADS) remains a critical challenge, as traditional verification methods such as large-scale on-...

#research #paper #software
4 months ago · ai · - · -

[Paper] Routing-Led Evolutionary Algorithm for Large-Scale Multi-Objective VNF Placement Problems

Modern data centers contain thousands of servers making them major consumers of electricity. To minimize their environmental impact, it is critical that we use ...

#research #paper #ai
4 months ago · ai · - · -

[Paper] LLMQ: Efficient Lower-Precision Pretraining for Consumer GPUs

We present LLMQ, an end-to-end CUDA/C++ implementation for medium-sized language-model training, e.g. 3B to 32B parameters, on affordable, commodity GPUs. These...

#research #paper #ai #machine-learning
4 months ago · software · - · -

[Paper] Heterogeneous Model Alignment in Digital Twin

Digital twin (DT) technology integrates heterogeneous data and models, along with semantic technologies to create multi-layered digital representation of physic...

#research #paper #software
4 months ago · software · - · -

[Paper] Automating Execution and Verification of BPMN+DMN Business Processes

The increasing and widespread use of BPMN business processes, also embodying DMN tables, requires tools and methodologies to verify their correctness. However, ...

#research #paper #software
4 months ago · ai · - · -

[Paper] Offline Multi-Task Multi-Objective Data-Driven Evolutionary Algorithm with Language Surrogate Model and Implicit Q-Learning

Data-driven evolutionary algorithms has shown surprising results in addressing expensive optimization problems through robust surrogate modeling. Though promisi...

#research #paper #ai #machine-learning
4 months ago · devops · - · -

[Paper] Reexamining Paradigms of End-to-End Data Movement

The pursuit of high-performance data transfer often focuses on raw network bandwidth, and international links of 100 Gbps or higher are frequently considered th...

#research #paper #devops
4 months ago · ai · - · -

[Paper] Dense Associative Memories with Analog Circuits

The increasing computational demands of modern AI systems have exposed fundamental limitations of digital hardware, driving interest in alternative paradigms fo...

#research #paper #ai
4 months ago · devops · - · -

[Paper] Optimizing Sensor Node Localization for Achieving Sustainable Smart Agriculture System Connectivity

The innovative agriculture system is revolutionizing how we farm, making it one of the most critical innovations of our time! Yet it faces significant connectiv...

#research #paper #devops

Newer posts

Older posts