EUNO.NEWS EUNO.NEWS
  • All (21181) +146
  • AI (3169) +10
  • DevOps (940) +5
  • Software (11185) +102
  • IT (5838) +28
  • Education (48)
  • Notice
  • All (21181) +146
    • AI (3169) +10
    • DevOps (940) +5
    • Software (11185) +102
    • IT (5838) +28
    • Education (48)
  • Notice
  • All (21181) +146
  • AI (3169) +10
  • DevOps (940) +5
  • Software (11185) +102
  • IT (5838) +28
  • Education (48)
  • Notice
Sources Tags Search
한국어 English 中文
  • 1 month ago · ai

    [Paper] E-RayZer: Self-supervised 3D Reconstruction as Spatial Visual Pre-training

    Self-supervised pre-training has revolutionized foundation models for languages, individual 2D images and videos, but remains largely unexplored for learning 3D...

    #research #paper #ai #computer-vision
  • 1 month ago · ai

    [Paper] Are We Ready for RL in Text-to-3D Generation? A Progressive Investigation

    Reinforcement learning (RL), earlier proven to be effective in large language and multi-modal models, has been successfully extended to enhance 2D image generat...

    #research #paper #ai #machine-learning #nlp #computer-vision
  • 1 month ago · ai

    [Paper] ClusIR: Towards Cluster-Guided All-in-One Image Restoration

    All-in-One Image Restoration (AiOIR) aims to recover high-quality images from diverse degradations within a unified framework. However, existing methods often f...

    #research #paper #ai #computer-vision
  • 1 month ago · ai

    [Paper] AlcheMinT: Fine-grained Temporal Control for Multi-Reference Consistent Video Generation

    Recent advances in subject-driven video generation with large diffusion models have enabled personalized content synthesis conditioned on user-provided subjects...

    #research #paper #ai #machine-learning #computer-vision
  • 1 month ago · ai

    [Paper] Mull-Tokens: Modality-Agnostic Latent Thinking

    Reasoning goes beyond language; the real world requires reasoning about space, time, affordances, and much more that words alone cannot convey. Existing multimo...

    #research #paper #ai #machine-learning #computer-vision
  • 1 month ago · ai

    [Paper] OmniView: An All-Seeing Diffusion Model for 3D and 4D View Synthesis

    Prior approaches injecting camera control into diffusion models have focused on specific subsets of 4D consistency tasks: novel view synthesis, text-to-video wi...

    #research #paper #ai #machine-learning #computer-vision
  • 1 month ago · ai

    [Paper] Stronger Normalization-Free Transformers

    Although normalization layers have long been viewed as indispensable components of deep learning architectures, the recent introduction of Dynamic Tanh (DyT) ha...

    #research #paper #ai #machine-learning #nlp #computer-vision
  • 1 month ago · ai

    [Paper] Any4D: Unified Feed-Forward Metric 4D Reconstruction

    We present Any4D, a scalable multi-view transformer for metric-scale, dense feed-forward 4D reconstruction. Any4D directly generates per-pixel motion and geomet...

    #research #paper #ai #machine-learning #computer-vision
  • 1 month ago · ai

    Interest in Spoor’s bird-monitoring AI software is soaring

    Spoor's computer vision software can help wind farms, and other industries, track bird populations and migration patterns....

    #computer vision #bird monitoring #wildlife conservation #environmental AI #wind farms #Spoor #migration tracking
  • 1 month ago · ai

    [Paper] GAINS: Gaussian-based Inverse Rendering from Sparse Multi-View Captures

    Recent advances in Gaussian Splatting-based inverse rendering extend Gaussian primitives with shading parameters and physically grounded light transport, enabli...

    #research #paper #ai #computer-vision
  • 1 month ago · ai

    [Paper] ReViSE: Towards Reason-Informed Video Editing in Unified Models with Self-Reflective Learning

    Video unified models exhibit strong capabilities in understanding and generation, yet they struggle with reason-informed visual editing even when equipped with ...

    #research #paper #ai #computer-vision
  • 1 month ago · ai

    [Paper] Splatent: Splatting Diffusion Latents for Novel View Synthesis

    Radiance field representations have recently been explored in the latent space of VAEs that are commonly used by diffusion models. This direction offers efficie...

    #research #paper #ai #computer-vision

Newer posts

Older posts
EUNO.NEWS
RSS GitHub © 2026