EUNO.NEWS
  • All (21181) +146
  • AI (3169) +10
  • DevOps (940) +5
  • Software (11185) +102
  • IT (5838) +28
  • Education (48)
  • Notice
  • 3 weeks ago · ai

    WiFi DensePose: WiFi-based dense human pose estimation system through walls

    Article URL: https://github.com/ruvnet/wifi-densepose
    Comments URL: https://news.ycombinator.com/item?id=46388904
    Points: 10 Comments: 1...

    #WiFi #DensePose #human pose estimation #computer vision #through walls #deep learning #open-source #research
  • 3 weeks ago · ai

    LAION-400M: Open Dataset of CLIP-Filtered 400 Million Image-Text Pairs

    LAION-400M is a giant public resource designed to spark new ideas. It consists of about 400 million images paired with short captions, cleaned and CLIP‑filtered...

    #LAION-400M #image-text dataset #CLIP-filtered #multimodal AI #open data #machine learning #computer vision
  • 3 weeks ago · ai

    AutoAugment: Learning Augmentation Policies from Data

    Overview: AutoAugment is a method that automatically discovers effective image augmentation policies. By systematically testing many simple transformations, such...

    #autoaugment #data augmentation #computer vision #image classification #machine learning #deep learning #neural networks
  • 3 weeks ago · ai

    [Paper] HiStream: Efficient High-Resolution Video Generation via Redundancy-Eliminated Streaming

    High-resolution video generation, while crucial for digital media and film, is computationally bottlenecked by the quadratic complexity of diffusion models, mak...

    #research #paper #ai #computer-vision
  • 3 weeks ago · ai

    [Paper] Beyond Memorization: A Multi-Modal Ordinal Regression Benchmark to Expose Popularity Bias in Vision-Language Models

    We expose a significant popularity bias in state-of-the-art vision-language models (VLMs), which achieve up to 34% higher accuracy on famous buildings compared ...

    #research #paper #ai #computer-vision
  • 3 weeks ago · ai

    [Paper] Streaming Video Instruction Tuning

    We present Streamo, a real-time streaming video LLM that serves as a general-purpose interactive assistant. Unlike existing online video models that focus narro...

    #research #paper #ai #computer-vision
  • 3 weeks ago · ai

    [Paper] Fast SAM2 with Text-Driven Token Pruning

    Segment Anything Model 2 (SAM2), a vision foundation model, has significantly advanced prompt-driven video object segmentation, yet its practical deployment...

    #research #paper #ai #computer-vision
  • 3 weeks ago · ai

    [Paper] TICON: A Slide-Level Tile Contextualizer for Histopathology Representation Learning

    The interpretation of small tiles in large whole slide images (WSI) often needs a larger image context. We introduce TICON, a transformer-based tile representat...

    #research #paper #ai #computer-vision
  • 3 weeks ago · ai

    [Paper] Does the Data Processing Inequality Reflect Practice? On the Utility of Low-Level Tasks

    The data processing inequality is an information-theoretic principle stating that the information content of a signal cannot be increased by processing the obse...

    #research #paper #ai #machine-learning #computer-vision
  • 3 weeks ago · ai

    [Paper] AndroidLens: Long-latency Evaluation with Nested Sub-targets for Android GUI Agents

    Graphical user interface (GUI) agents can substantially improve productivity by automating frequently executed long-latency tasks on mobile devices. However, ex...

    #research #paper #ai #computer-vision
  • 3 weeks ago · ai

    [Paper] Post-Processing Mask-Based Table Segmentation for Structural Coordinate Extraction

    Structured data extraction from tables plays a crucial role in document image analysis for scanned documents and digital archives. Although many methods have be...

    #research #paper #ai #computer-vision
  • 3 weeks ago · ai

    [Paper] Surgical Scene Segmentation using a Spike-Driven Video Transformer with Real-Time Potential

    Modern surgical systems increasingly rely on intelligent scene understanding to provide timely situational awareness for enhanced intra-operative safety. Within...

    #research #paper #ai #computer-vision

RSS GitHub © 2026