EUNO.NEWS
  • All (21181) +146
  • AI (3169) +10
  • DevOps (940) +5
  • Software (11185) +102
  • IT (5838) +28
  • Education (48)
  • Notice
  • 1 month ago · ai

    [Paper] mimic-video: Video-Action Models for Generalizable Robot Control Beyond VLAs

    Prevailing Vision-Language-Action Models (VLAs) for robotic manipulation are built upon vision-language backbones pretrained on large-scale, but disconnected st...

    #research #paper #ai #machine-learning #computer-vision
  • 1 month ago · ai

    [Paper] Stylized Synthetic Augmentation further improves Corruption Robustness

    This paper proposes a training data augmentation pipeline that combines synthetic image data with neural style transfer in order to address the vulnerability of...

    #research #paper #ai #machine-learning #computer-vision
  • 1 month ago · ai

    [Paper] VTCBench: Can Vision-Language Models Understand Long Context with Vision-Text Compression?

    The computational and memory overheads associated with expanding the context window of LLMs severely limit their scalability. A noteworthy solution is vision-te...

    #research #paper #ai #machine-learning #nlp #computer-vision
  • 1 month ago · ai

    [Paper] Human-like Working Memory from Artificial Intrinsic Plasticity Neurons

    Working memory enables the brain to integrate transient information for rapid decision-making. Artificial networks typically replicate this via recurrent or par...

    #research #paper #ai #machine-learning #computer-vision
  • 1 month ago · software

    The Hot-Reload Magic - Tweak Pipelines Live (No Restarts!)

    Edit your config.toml while the app is running and watch the pipeline update instantly. No recompiling. No stopping the camera. Pure iteration bliss. Why This M...

    #Go #GoCV #hot-reload #config.toml #fsnotify #computer-vision #pipeline #live-reload #OpenCV #devtools
  • 1 month ago · ai

    Data Annotation: Powering Accurate and Scalable AI Systems

    Introduction: Data annotation is a foundational process in artificial intelligence that enables machines to learn from real-world data. It involves adding meani...

    #data annotation #machine learning #training data #labeling #computer vision #natural language processing #speech recognition #AI model accuracy
  • 1 month ago · ai

    AI Background Remover: How AI Detects Objects and Separates Backgrounds

    An AI background remover may feel like magic at first glance. You upload an image, click a button, and the background disappears. Behind that simple interaction...

    #background removal #computer vision #image segmentation #machine learning #deep learning #AI tools
  • 1 month ago · software

    Rendering the Camera with Metal on iOS (AVFoundation + MetalKit)

    Rendering camera video with Metal without AVCaptureVideoPreviewLayer: In this tutorial we will render the camera video directly on screen using...

    #iOS #Metal #AVFoundation #MetalKit #camera #video rendering #Swift #shaders #AR #computer vision #machine learning
  • 1 month ago · ai

    [Paper] MemFlow: Flowing Adaptive Memory for Consistent and Efficient Long Video Narratives

    The core challenge for streaming video generation is maintaining content consistency over long contexts, which places high demands on memory design. Mo...

    #research #paper #ai #computer-vision
  • 1 month ago · ai

    [Paper] TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs

    This paper does not introduce a novel method but instead establishes a straightforward, incremental, yet essential baseline for video temporal grounding (VTG), ...

    #research #paper #ai #machine-learning #nlp #computer-vision
  • 1 month ago · ai

    [Paper] Spherical Leech Quantization for Visual Tokenization and Generation

    Non-parametric quantization has received much attention due to its efficiency on parameters and scalability to a large codebook. In this paper, we present a uni...

    #research #paper #ai #machine-learning #computer-vision
  • 1 month ago · ai

    [Paper] CRISP: Contact-Guided Real2Sim from Monocular Video with Planar Scene Primitives

    We introduce CRISP, a method that recovers simulatable human motion and scene geometry from monocular video. Prior work on joint human-scene reconstruction reli...

    #research #paper #ai #computer-vision

RSS GitHub © 2026