EUNO.NEWS EUNO.NEWS
  • All (2625) +302
  • AI (581) +22
  • DevOps (151) +3
  • Software (1114) +179
  • IT (773) +97
  • Education (6) +1
  • Notice
  • All (2625) +302
    • AI (581) +22
    • DevOps (151) +3
    • Software (1114) +179
    • IT (773) +97
    • Education (6) +1
  • Notice
  • All (2625) +302
  • AI (581) +22
  • DevOps (151) +3
  • Software (1114) +179
  • IT (773) +97
  • Education (6) +1
  • Notice
Sources Tags Search
한국어 English 中文
  • 3 days ago · ai

    [Paper] EfficientFlow: Efficient Equivariant Flow Policy Learning for Embodied AI

    Generative modeling has recently shown remarkable promise for visuomotor policy learning, enabling flexible and expressive control across diverse embodied AI ta...

    #research #paper #ai #machine-learning #computer-vision
  • 3 days ago · ai

    [Paper] Data-Centric Visual Development for Self-Driving Labs

    Self-driving laboratories offer a promising path toward reducing the labor-intensive, time-consuming, and often irreproducible workflows in the biological scien...

    #research #paper #ai #computer-vision
  • 3 days ago · ai

    [Paper] Visual Sync: Multi-Camera Synchronization via Cross-View Object Motion

    Today, people can easily record memorable moments, ranging from concerts, sports events, lectures, family gatherings, and birthday parties with multiple consume...

    #research #paper #ai #machine-learning #computer-vision
  • 3 days ago · ai

    [Paper] Objects in Generated Videos Are Slower Than They Appear: Models Suffer Sub-Earth Gravity and Don't Know Galileo's Principle...for now

    Video generators are increasingly evaluated as potential world models, which requires them to encode and understand physical laws. We investigate their represen...

    #research #paper #ai #computer-vision
  • 3 days ago · ai

    [Paper] Generative Video Motion Editing with 3D Point Tracks

    Camera and object motions are central to a video's narrative. However, precisely editing these captured motions remains a significant challenge, especially unde...

    #research #paper #ai #computer-vision
  • 3 days ago · ai

    [Paper] TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models

    Unified multimodal models (UMMs) aim to jointly perform multimodal understanding and generation within a single framework. We present TUNA, a native UMM that bu...

    #research #paper #ai #computer-vision
  • 3 days ago · ai

    [Paper] Improved Mean Flows: On the Challenges of Fastforward Generative Models

    MeanFlow (MF) has recently been established as a framework for one-step generative modeling. However, its ``fastforward'' nature introduces key challenges in bo...

    #research #paper #ai #machine-learning #computer-vision
  • 3 days ago · ai

    [Paper] AirSim360: A Panoramic Simulation Platform within Drone View

    The field of 360-degree omnidirectional understanding has been receiving increasing attention for advancing spatial intelligence. However, the lack of large-sca...

    #research #paper #ai #computer-vision
  • 3 days ago · ai

    [Paper] MV-TAP: Tracking Any Point in Multi-View Videos

    Multi-view camera systems enable rich observations of complex real-world scenes, and understanding dynamic objects in multi-view settings has become central to ...

    #research #paper #ai #computer-vision
  • 3 days ago · ai

    [Paper] Learning Visual Affordance from Audio

    We introduce Audio-Visual Affordance Grounding (AV-AG), a new task that segments object interaction regions from action sounds. Unlike existing approaches that ...

    #research #paper #ai #computer-vision
  • 3 days ago · ai

    [Paper] RoaD: Rollouts as Demonstrations for Closed-Loop Supervised Fine-Tuning of Autonomous Driving Policies

    Autonomous driving policies are typically trained via open-loop behavior cloning of human demonstrations. However, such policies suffer from covariate shift whe...

    #research #paper #ai #machine-learning #computer-vision
  • 3 days ago · ai

    [Paper] Chain-of-Ground: Improving GUI Grounding via Iterative Reasoning and Reference Feedback

    GUI grounding aims to align natural language instructions with precise regions in complex user interfaces. Advanced multimodal large language models show strong...

    #research #paper #ai #machine-learning #nlp #computer-vision

Newer posts

Older posts
EUNO.NEWS
RSS GitHub © 2025