research — Page 111

2 months ago · software · - · -

[Paper] Visualisation for the CIS benchmark scanning results

In this paper, we introduce GraphSecure, a web application that provides advanced analysis and visualisation of security scanning results. GraphSecure enables u...

#research #paper #software
2 months ago · devops · - · -

[Paper] RollMux: Phase-Level Multiplexing for Disaggregated RL Post-Training

Rollout-training disaggregation is emerging as the standard architecture for Reinforcement Learning (RL) post-training, where memory-bound rollout and compute-b...

#research #paper #devops
2 months ago · software · - · -

[Paper] Coverage Isn't Enough: SBFL-Driven Insights into Manually Created vs. Automatically Generated Tests

The testing phase is an essential part of software development, but manually creating test cases can be time-consuming. Consequently, there is a growing need fo...

#research #paper #software
2 months ago · ai · - · -

[Paper] Theoretical Foundations of GPU-Native Compilation for Rapid Code Iteration

Current AI code generation systems suffer from significant latency bottlenecks due to CPU-GPU data transfers during compilation, execution, and testing phases. ...

#research #paper #ai #machine-learning
2 months ago · devops · - · -

[Paper] An LLVM-Based Optimization Pipeline for SPDZ

Actively secure arithmetic MPC is now practical for real applications, but performance and usability are still limited by framework-specific compilation stacks,...

#research #paper #devops
2 months ago · ai · - · -

[Paper] StereoSpace: Depth-Free Synthesis of Stereo Geometry via End-to-End Diffusion in a Canonical Space

We introduce StereoSpace, a diffusion-based framework for monocular-to-stereo synthesis that models geometry purely through viewpoint conditioning, without expl...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] WorldLens: Full-Spectrum Evaluations of Driving World Models in Real World

Generative world models are reshaping embodied AI, enabling agents to synthesize realistic 4D driving environments that look convincing but often fail physicall...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] Empowering Dynamic Urban Navigation with Stereo and Mid-Level Vision

The success of foundation models in language and vision motivated research in fully end-to-end robot navigation foundation models (NFMs). NFMs directly map mono...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] Omni-Attribute: Open-vocabulary Attribute Encoder for Visual Concept Personalization

Visual concept personalization aims to transfer only specific image attributes, such as identity, expression, lighting, and style, into unseen contexts. However...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] SceneMaker: Open-set 3D Scene Generation with Decoupled De-occlusion and Pose Estimation Model

We propose a decoupled 3D scene generation framework called SceneMaker in this work. Due to the lack of sufficient open-set de-occlusion and pose estimation pri...

#research #paper #ai #machine-learning #computer-vision
2 months ago · ai · - · -

[Paper] Bidirectional Normalizing Flow: From Data to Noise and Back

Normalizing Flows (NFs) have been established as a principled framework for generative modeling. Standard NFs consist of a forward process and a reverse process...

#research #paper #ai #machine-learning #computer-vision
2 months ago · ai · - · -

[Paper] Group Diffusion: Enhancing Image Generation by Unlocking Cross-Sample Collaboration

In this work, we explore an untapped signal in diffusion model inference. While all previous methods generate images independently at inference, we instead ask ...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] Hierarchical Dataset Selection for High-Quality Data Sharing

The success of modern machine learning hinges on access to high-quality training data. In many real-world scenarios, such as acquiring data from public reposito...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] E-RayZer: Self-supervised 3D Reconstruction as Spatial Visual Pre-training

Self-supervised pre-training has revolutionized foundation models for languages, individual 2D images and videos, but remains largely unexplored for learning 3D...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] Are We Ready for RL in Text-to-3D Generation? A Progressive Investigation

Reinforcement learning (RL), earlier proven to be effective in large language and multi-modal models, has been successfully extended to enhance 2D image generat...

#research #paper #ai #machine-learning #nlp #computer-vision
2 months ago · ai · - · -

[Paper] ClusIR: Towards Cluster-Guided All-in-One Image Restoration

All-in-One Image Restoration (AiOIR) aims to recover high-quality images from diverse degradations within a unified framework. However, existing methods often f...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] ImplicitRDP: An End-to-End Visual-Force Diffusion Policy with Structural Slow-Fast Learning

Human-level contact-rich manipulation relies on the distinct roles of two key modalities: vision provides spatially rich but temporally slow global context, whi...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] AlcheMinT: Fine-grained Temporal Control for Multi-Reference Consistent Video Generation

Recent advances in subject-driven video generation with large diffusion models have enabled personalized content synthesis conditioned on user-provided subjects...

#research #paper #ai #machine-learning #computer-vision
2 months ago · ai · - · -

[Paper] Mull-Tokens: Modality-Agnostic Latent Thinking

Reasoning goes beyond language; the real world requires reasoning about space, time, affordances, and much more that words alone cannot convey. Existing multimo...

#research #paper #ai #machine-learning #computer-vision
2 months ago · ai · - · -

[Paper] OmniView: An All-Seeing Diffusion Model for 3D and 4D View Synthesis

Prior approaches injecting camera control into diffusion models have focused on specific subsets of 4D consistency tasks: novel view synthesis, text-to-video wi...

#research #paper #ai #machine-learning #computer-vision
2 months ago · ai · - · -

[Paper] Stronger Normalization-Free Transformers

Although normalization layers have long been viewed as indispensable components of deep learning architectures, the recent introduction of Dynamic Tanh (DyT) ha...

#research #paper #ai #machine-learning #nlp #computer-vision
2 months ago · ai · - · -

[Paper] On Decision-Making Agents and Higher-Order Causal Processes

We establish a precise correspondence between decision-making agents in partially observable Markov decision processes (POMDPs) and one-input process functions,...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Empirical evaluation of the Frank-Wolfe methods for constructing white-box adversarial attacks

The construction of adversarial attacks for neural networks appears to be a crucial challenge for their deployment in various services. To estimate the adversar...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Any4D: Unified Feed-Forward Metric 4D Reconstruction

We present Any4D, a scalable multi-view transformer for metric-scale, dense feed-forward 4D reconstruction. Any4D directly generates per-pixel motion and geomet...

#research #paper #ai #machine-learning #computer-vision
