EUNO.NEWS EUNO.NEWS
  • All (2817) +36
  • AI (590) +2
  • DevOps (157)
  • Software (1229) +31
  • IT (835) +3
  • Education (6)
  • Notice
  • All (2817) +36
    • AI (590) +2
    • DevOps (157)
    • Software (1229) +31
    • IT (835) +3
    • Education (6)
  • Notice
  • All (2817) +36
  • AI (590) +2
  • DevOps (157)
  • Software (1229) +31
  • IT (835) +3
  • Education (6)
  • Notice
Sources Tags Search
한국어 English 中文
  • 2 days ago · ai

    [Paper] PSA: Pyramid Sparse Attention for Efficient Video Understanding and Generation

    Attention mechanisms are the core of foundation models, but their quadratic complexity remains a critical bottleneck for scaling. This challenge has driven the ...

    #research #paper #ai #machine-learning #computer-vision
  • 2 days ago · ai

    [Paper] On the Temporality for Sketch Representation Learning

    Sketches are simple human hand-drawn abstractions of complex scenes and real-world objects. Although the field of sketch representation learning has advanced si...

    #research #paper #ai #machine-learning #computer-vision
  • 3 days ago · ai

    [Paper] MagicQuillV2: Precise and Interactive Image Editing with Layered Visual Cues

    We propose MagicQuill V2, a novel system that introduces a layered composition paradigm to generative image editing, bridging the gap between the sema...

    #research #paper #ai #computer-vision
  • 3 days ago · ai

    [Paper] CAMEO: Correspondence-Attention Alignment for Multi-View Diffusion Models

    Multi-view diffusion models have recently emerged as a powerful paradigm for novel view synthesis, yet the underlying mechanism that enables their view-consiste...

    #research #paper #ai #computer-vision
  • 3 days ago · ai

    [Paper] OneThinker: All-in-one Reasoning Model for Image and Video

    Reinforcement learning (RL) has recently achieved remarkable success in eliciting visual reasoning within Multimodal Large Language Models (MLLMs). However, exi...

    #research #paper #ai #computer-vision
  • 3 days ago · ai

    [Paper] PPTArena: A Benchmark for Agentic PowerPoint Editing

    We introduce PPTArena, a benchmark for PowerPoint editing that measures reliable modifications to real slides under natural-language instructions. In contrast t...

    #research #paper #ai #machine-learning #computer-vision
  • 3 days ago · ai

    [Paper] MultiShotMaster: A Controllable Multi-Shot Video Generation Framework

    Current video generation techniques excel at single-shot clips but struggle to produce narrative multi-shot videos, which require flexible shot arrangement, coh...

    #research #paper #ai #computer-vision
  • 3 days ago · ai

    [Paper] Video4Spatial: Towards Visuospatial Intelligence with Context-Guided Video Generation

    We investigate whether video generative models can exhibit visuospatial intelligence, a capability central to human cognition, using only visual data. To this e...

    #research #paper #ai #machine-learning #computer-vision
  • 3 days ago · ai

    [Paper] ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation

    Despite progress in video-to-audio generation, the field focuses predominantly on mono output, lacking spatial immersion. Existing binaural approaches remain co...

    #research #paper #ai #machine-learning #computer-vision
  • 3 days ago · ai

    [Paper] MAViD: A Multimodal Framework for Audio-Visual Dialogue Understanding and Generation

    We propose MAViD, a novel Multimodal framework for Audio-Visual Dialogue understanding and generation. Existing approaches primarily focus on non-interactive sy...

    #research #paper #ai #computer-vision
  • 3 days ago · ai

    [Paper] SMP: Reusable Score-Matching Motion Priors for Physics-Based Character Control

    Data-driven motion priors that can guide agents toward producing naturalistic behaviors play a pivotal role in creating life-like virtual characters. Adversaria...

    #research #paper #ai #machine-learning #computer-vision
  • 3 days ago · ai

    [Paper] Unrolled Networks are Conditional Probability Flows in MRI Reconstruction

    Magnetic Resonance Imaging (MRI) offers excellent soft-tissue contrast without ionizing radiation, but its long acquisition time limits clinical utility. Recent...

    #research #paper #ai #computer-vision

Newer posts

Older posts
EUNO.NEWS
RSS GitHub © 2025