EUNO.NEWS
  • All (21181) +146
  • AI (3169) +10
  • DevOps (940) +5
  • Software (11185) +102
  • IT (5838) +28
  • Education (48)
  • Notice
  • 3 weeks ago · ai

    Detecting Adversarial Samples from Artifacts

    Overview: Many AI systems can be fooled by tiny, almost invisible edits to images that cause them to give incorrect answers. Researchers have discovered a simpl...

    #adversarial attacks #uncertainty estimation #model robustness #computer vision #AI safety
  • 3 weeks ago · ai

    Apple releases open-source model that instantly turns 2D photos into 3D views

    Article URL: https://github.com/apple/ml-sharp Comments URL: https://news.ycombinator.com/item?id=46401539 Points: 71 Comments: 23...

    #apple #open-source #3d-reconstruction #computer-vision #machine-learning
  • 3 weeks ago · ai

    [Paper] See Less, See Right: Bi-directional Perceptual Shaping For Multimodal Reasoning

    Large vision-language models (VLMs) often benefit from intermediate visual cues, either injected via external tools or generated as latent visual tokens during ...

    #research #paper #ai #computer-vision
  • 3 weeks ago · ai

    [Paper] ProEdit: Inversion-based Editing From Prompts Done Right

    Inversion-based visual editing provides an effective and training-free way to edit an image or a video based on user instructions. Existing methods typically in...

    #research #paper #ai #computer-vision
  • 3 weeks ago · ai

    [Paper] Learning Association via Track-Detection Matching for Multi-Object Tracking

    Multi-object tracking aims to maintain object identities over time by associating detections across video frames. Two dominant paradigms exist in literature: tr...

    #research #paper #ai #computer-vision
  • 3 weeks ago · ai

    [Paper] Yume-1.5: A Text-Controlled Interactive World Generation Model

    Recent approaches have demonstrated the promise of using diffusion models to generate interactive and explorable worlds. However, most of these methods face cri...

    #research #paper #ai #computer-vision
  • 3 weeks ago · ai

    [Paper] StreamAvatar: Streaming Diffusion Models for Real-Time Interactive Human Avatars

    Real-time, streaming interactive avatars represent a critical yet challenging goal in digital human research. Although diffusion-based human avatar generation m...

    #research #paper #ai #machine-learning #computer-vision
  • 3 weeks ago · ai

    [Paper] MAI-UI Technical Report: Real-World Centric Foundation GUI Agents

    The development of GUI agents could revolutionize the next generation of human-computer interaction. Motivated by this vision, we present MAI-UI, a family of fo...

    #research #paper #ai #computer-vision
  • 3 weeks ago · ai

    [Paper] Backdoor Attacks on Prompt-Driven Video Segmentation Foundation Models

    Prompt-driven Video Segmentation Foundation Models (VSFMs) such as SAM2 are increasingly deployed in applications like autonomous driving and digital pathology,...

    #research #paper #ai #computer-vision
  • 3 weeks ago · ai

    [Paper] Patch-Discontinuity Mining for Generalized Deepfake Detection

    The rapid advancement of generative artificial intelligence has enabled the creation of highly realistic fake facial images, posing serious threats to personal ...

    #research #paper #ai #computer-vision
  • 3 weeks ago · ai

    [Paper] SketchPlay: Intuitive Creation of Physically Realistic VR Content with Gesture-Driven Sketching

    Creating physically realistic content in VR often requires complex modeling tools or predefined 3D models, textures, and animations, which present significant b...

    #research #paper #ai #computer-vision
  • 3 weeks ago · ai

    [Paper] LongFly: Long-Horizon UAV Vision-and-Language Navigation with Spatiotemporal Context Integration

    Unmanned aerial vehicles (UAVs) are crucial tools for post-disaster search and rescue, facing challenges such as high information density, rapid changes in view...

    #research #paper #ai #machine-learning #computer-vision

EUNO.NEWS © 2026