EUNO.NEWS EUNO.NEWS
  • All (21181) +146
  • AI (3169) +10
  • DevOps (940) +5
  • Software (11185) +102
  • IT (5838) +28
  • Education (48)
  • Notice
  • All (21181) +146
    • AI (3169) +10
    • DevOps (940) +5
    • Software (11185) +102
    • IT (5838) +28
    • Education (48)
  • Notice
  • All (21181) +146
  • AI (3169) +10
  • DevOps (940) +5
  • Software (11185) +102
  • IT (5838) +28
  • Education (48)
  • Notice
Sources Tags Search
한국어 English 中文
  • 1 month ago · ai

    [Paper] DraCo: Draft as CoT for Text-to-Image Preview and Rare Concept Generation

    Recent unified multimodal large language models (MLLMs) have shown impressive capabilities, incorporating chain-of-thought (CoT) reasoning for enhanced text-to-...

    #research #paper #ai #machine-learning #nlp #computer-vision
  • 1 month ago · ai

    [Paper] Splannequin: Freezing Monocular Mannequin-Challenge Footage with Dual-Detection Splatting

    Synthesizing high-fidelity frozen 3D scenes from monocular Mannequin-Challenge (MC) videos is a unique problem distinct from standard dynamic scene reconstructi...

    #research #paper #ai #computer-vision
  • 1 month ago · ai

    [Paper] ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning

    Reward models are critical for aligning vision-language systems with human preferences, yet current approaches suffer from hallucination, weak visual grounding,...

    #research #paper #ai #computer-vision
  • 1 month ago · ai

    [Paper] ShadowDraw: From Any Object to Shadow-Drawing Compositional Art

    We introduce ShadowDraw, a framework that transforms ordinary 3D objects into shadow-drawing compositional art. Given a 3D object, our system predicts scene par...

    #research #paper #ai #machine-learning #computer-vision
  • 1 month ago · ai

    [Paper] NeuralRemaster: Phase-Preserving Diffusion for Structure-Aligned Generation

    Standard diffusion corrupts data using Gaussian noise whose Fourier coefficients have random magnitudes and random phases. While effective for unconditional or ...

    #research #paper #ai #machine-learning #computer-vision
  • 1 month ago · ai

    [Paper] EvoIR: Towards All-in-One Image Restoration via Evolutionary Frequency Modulation

    All-in-One Image Restoration (AiOIR) tasks often involve diverse degradation that require robust and versatile strategies. However, most existing approaches typ...

    #research #paper #ai #computer-vision
  • 1 month ago · ai

    [Paper] TV2TV: A Unified Framework for Interleaved Language and Video Generation

    Video generation models are rapidly advancing, but can still struggle with complex video outputs that require significant semantic branching or repeated high-le...

    #research #paper #ai #machine-learning #computer-vision
  • 1 month ago · ai

    [Paper] SA-IQA: Redefining Image Quality Assessment for Spatial Aesthetics with Multi-Dimensional Rewards

    In recent years, Image Quality Assessment (IQA) for AI-generated images (AIGI) has advanced rapidly; however, existing methods primarily target portraits and ar...

    #research #paper #ai #machine-learning #computer-vision
  • 1 month ago · ai

    See Through Walls: AI's New Eye on Occluded Motion by Arvind Sundararajan

    Ever struggle to get accurate motion capture when hands are intertwined, hidden behind objects, or even just slightly out of view? Standard computer vision syst...

    #computer vision #motion capture #occlusion handling #deformable state space model #visual feature extraction #AI research
  • 1 month ago · ai

    [Paper] SimFlow: Simplified and End-to-End Training of Latent Normalizing Flows

    Normalizing Flows (NFs) learn invertible mappings between the data and a Gaussian distribution. Prior works usually suffer from two limitations. First, they add...

    #research #paper #ai #computer-vision
  • 1 month ago · ai

    [Paper] Unique Lives, Shared World: Learning from Single-Life Videos

    We introduce the 'single-life' learning paradigm, where we train a distinct vision model exclusively on egocentric videos captured by one individual. We leverag...

    #research #paper #ai #computer-vision
  • 1 month ago · ai

    [Paper] PosterCopilot: Toward Layout Reasoning and Controllable Editing for Professional Graphic Design

    Graphic design forms the cornerstone of modern visual communication, serving as a vital medium for promoting cultural and commercial events. Recent advances hav...

    #research #paper #ai #computer-vision

Newer posts

Older posts
EUNO.NEWS
RSS GitHub © 2026