EUNO.NEWS EUNO.NEWS
  • All (2571) +248
  • AI (578) +19
  • DevOps (150) +2
  • Software (1091) +156
  • IT (746) +70
  • Education (6) +1
  • Notice
  • All (2571) +248
    • AI (578) +19
    • DevOps (150) +2
    • Software (1091) +156
    • IT (746) +70
    • Education (6) +1
  • Notice
  • All (2571) +248
  • AI (578) +19
  • DevOps (150) +2
  • Software (1091) +156
  • IT (746) +70
  • Education (6) +1
  • Notice
Sources Tags Search
한국어 English 中文
  • 1 hour ago · ai

    YOLOv1 Paper Walkthrough: The Day YOLO First Saw the World

    A detailed walkthrough of the YOLOv1 architecture and its PyTorch implementation from scratch The post YOLOv1 Paper Walkthrough: The Day YOLO First Saw the Worl...

    #YOLOv1 #object detection #computer vision #deep learning #PyTorch #model walkthrough #neural networks
  • 20 hours ago · ai

    [Paper] The Universal Weight Subspace Hypothesis

    We show that deep neural networks trained across diverse tasks exhibit remarkably similar low-dimensional parametric subspaces. We provide the first large-scale...

    #research #paper #ai #machine-learning #computer-vision
  • 20 hours ago · ai

    [Paper] Light-X: Generative 4D Video Rendering with Camera and Illumination Control

    Recent advances in illumination control extend image-based methods to video, yet still facing a trade-off between lighting fidelity and temporal consistency. Mo...

    #research #paper #ai #computer-vision
  • 20 hours ago · ai

    [Paper] Value Gradient Guidance for Flow Matching Alignment

    While methods exist for aligning flow matching models--a popular and effective class of generative models--with human preferences, existing approaches fail to a...

    #research #paper #ai #machine-learning #computer-vision
  • 20 hours ago · ai

    [Paper] Deep infant brain segmentation from multi-contrast MRI

    Segmentation of magnetic resonance images (MRI) facilitates analysis of human brain development by delineating anatomical structures. However, in infants and yo...

    #research #paper #ai #machine-learning #computer-vision
  • 20 hours ago · ai

    [Paper] DraCo: Draft as CoT for Text-to-Image Preview and Rare Concept Generation

    Recent unified multimodal large language models (MLLMs) have shown impressive capabilities, incorporating chain-of-thought (CoT) reasoning for enhanced text-to-...

    #research #paper #ai #machine-learning #nlp #computer-vision
  • 20 hours ago · ai

    [Paper] Splannequin: Freezing Monocular Mannequin-Challenge Footage with Dual-Detection Splatting

    Synthesizing high-fidelity frozen 3D scenes from monocular Mannequin-Challenge (MC) videos is a unique problem distinct from standard dynamic scene reconstructi...

    #research #paper #ai #computer-vision
  • 20 hours ago · ai

    [Paper] ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning

    Reward models are critical for aligning vision-language systems with human preferences, yet current approaches suffer from hallucination, weak visual grounding,...

    #research #paper #ai #computer-vision
  • 20 hours ago · ai

    [Paper] ShadowDraw: From Any Object to Shadow-Drawing Compositional Art

    We introduce ShadowDraw, a framework that transforms ordinary 3D objects into shadow-drawing compositional art. Given a 3D object, our system predicts scene par...

    #research #paper #ai #machine-learning #computer-vision
  • 20 hours ago · ai

    [Paper] NeuralRemaster: Phase-Preserving Diffusion for Structure-Aligned Generation

    Standard diffusion corrupts data using Gaussian noise whose Fourier coefficients have random magnitudes and random phases. While effective for unconditional or ...

    #research #paper #ai #machine-learning #computer-vision
  • 20 hours ago · ai

    [Paper] EvoIR: Towards All-in-One Image Restoration via Evolutionary Frequency Modulation

    All-in-One Image Restoration (AiOIR) tasks often involve diverse degradation that require robust and versatile strategies. However, most existing approaches typ...

    #research #paper #ai #computer-vision
  • 20 hours ago · ai

    [Paper] TV2TV: A Unified Framework for Interleaved Language and Video Generation

    Video generation models are rapidly advancing, but can still struggle with complex video outputs that require significant semantic branching or repeated high-le...

    #research #paper #ai #machine-learning #computer-vision

Newer posts

Older posts
EUNO.NEWS
RSS GitHub © 2025