EUNO.NEWS EUNO.NEWS
  • All (21181) +146
  • AI (3169) +10
  • DevOps (940) +5
  • Software (11185) +102
  • IT (5838) +28
  • Education (48)
  • Notice
  • All (21181) +146
    • AI (3169) +10
    • DevOps (940) +5
    • Software (11185) +102
    • IT (5838) +28
    • Education (48)
  • Notice
  • All (21181) +146
  • AI (3169) +10
  • DevOps (940) +5
  • Software (11185) +102
  • IT (5838) +28
  • Education (48)
  • Notice
Sources Tags Search
한국어 English 中文
  • 1 month ago · ai

    [Paper] Radiance Meshes for Volumetric Reconstruction

    We introduce radiance meshes, a technique for representing radiance fields with constant density tetrahedral cells produced with a Delaunay tetrahedralization. ...

    #research #paper #ai #computer-vision
  • 1 month ago · ai

    [Paper] SpaceTools: Tool-Augmented Spatial Reasoning via Double Interactive RL

    Vision Language Models (VLMs) demonstrate strong qualitative visual understanding, but struggle with metrically precise spatial reasoning required for embodied ...

    #research #paper #ai #computer-vision
  • 1 month ago · ai

    [Paper] Stable Signer: Hierarchical Sign Language Generative Model

    Sign Language Production (SLP) is the process of converting the complex input text into a real video. Most previous works focused on the Text2Gloss, Gloss2Pose,...

    #research #paper #ai #nlp #computer-vision
  • 1 month ago · ai

    [Paper] RELIC: Interactive Video World Model with Long-Horizon Memory

    A truly interactive world model requires three key ingredients: real-time long-horizon streaming, consistent spatial memory, and precise user control. However, ...

    #research #paper #ai #computer-vision
  • 1 month ago · ai

    [Paper] Fast & Efficient Normalizing Flows and Applications of Image Generative Models

    This thesis presents novel contributions in two primary areas: advancing the efficiency of generative models, particularly normalizing flows, and applying gener...

    #research #paper #ai #machine-learning #computer-vision
  • 1 month ago · ai

    [Paper] Jina-VLM: Small Multilingual Vision Language Model

    We present Jina-VLM, a 2.4B parameter vision-language model that achieves state-of-the-art multilingual visual question answering among open 2B-scale VLMs. The ...

    #research #paper #ai #machine-learning #nlp #computer-vision
  • 1 month ago · ai

    Measuring What Matters: Objective Metrics for Image Generation Assessment

    Generating high‑quality visuals with state‑of‑the‑art models is becoming increasingly accessible. Open‑source models run on laptops, and cloud services turn tex...

    #image generation #evaluation metrics #generative AI #computer vision #quality assessment #Pruna #P-image #AI model benchmarking
  • 1 month ago · ai

    [Paper] PSA: Pyramid Sparse Attention for Efficient Video Understanding and Generation

    Attention mechanisms are the core of foundation models, but their quadratic complexity remains a critical bottleneck for scaling. This challenge has driven the ...

    #research #paper #ai #machine-learning #computer-vision
  • 1 month ago · ai

    [Paper] On the Temporality for Sketch Representation Learning

    Sketches are simple human hand-drawn abstractions of complex scenes and real-world objects. Although the field of sketch representation learning has advanced si...

    #research #paper #ai #machine-learning #computer-vision
  • 1 month ago · ai

    [Paper] MagicQuillV2: Precise and Interactive Image Editing with Layered Visual Cues

    We propose MagicQuill V2, a novel system that introduces a layered composition paradigm to generative image editing, bridging the gap between the sema...

    #research #paper #ai #computer-vision
  • 1 month ago · ai

    [Paper] CAMEO: Correspondence-Attention Alignment for Multi-View Diffusion Models

    Multi-view diffusion models have recently emerged as a powerful paradigm for novel view synthesis, yet the underlying mechanism that enables their view-consiste...

    #research #paper #ai #computer-vision
  • 1 month ago · ai

    [Paper] OneThinker: All-in-one Reasoning Model for Image and Video

    Reinforcement learning (RL) has recently achieved remarkable success in eliciting visual reasoning within Multimodal Large Language Models (MLLMs). However, exi...

    #research #paper #ai #computer-vision

Newer posts

Older posts
EUNO.NEWS
RSS GitHub © 2026