EUNO.NEWS
  • All (21181) +146
  • AI (3169) +10
  • DevOps (940) +5
  • Software (11185) +102
  • IT (5838) +28
  • Education (48)
  • Notice
  • 1 week ago · ai

    [Paper] ImLoc: Revisiting Visual Localization with Image-based Representation

    Existing visual localization methods are typically either 2D image-based, which are easy to build and maintain but limited in effective geometric reasoning, or ...

    #research #paper #ai #computer-vision
  • 1 week ago · ai

    [Paper] Scanner-Induced Domain Shifts Undermine the Robustness of Pathology Foundation Models

    Pathology foundation models (PFMs) have become central to computational pathology, aiming to offer general encoders for feature extraction from whole-slide imag...

    #research #paper #ai #machine-learning #computer-vision
  • 1 week ago · ai

    [Paper] ToTMNet: FFT-Accelerated Toeplitz Temporal Mixing Network for Lightweight Remote Photoplethysmography

    Remote photoplethysmography (rPPG) estimates a blood volume pulse (BVP) waveform from facial videos captured by commodity cameras. Although recent deep models i...

    #research #paper #ai #computer-vision
  • 1 week ago · ai

    [Paper] Diffusion-DRF: Differentiable Reward Flow for Video Diffusion Fine-Tuning

    Direct Preference Optimization (DPO) has recently improved Text-to-Video (T2V) generation by enhancing visual fidelity and text alignment. However, current meth...

    #research #paper #ai #computer-vision
  • 1 week ago · ai

    [Paper] Klear: Unified Multi-Task Audio-Video Joint Generation

    Audio-video joint generation has progressed rapidly, yet substantial challenges still remain. Non-commercial approaches still suffer audio-visual asynchrony, po...

    #research #paper #ai #machine-learning #computer-vision
  • 1 week ago · ai

    [Paper] Wow, wo, val! A Comprehensive Embodied World Model Evaluation Turing Test

    As world models gain momentum in Embodied AI, an increasing number of works explore using video foundation models as predictive world models for downstream embo...

    #research #paper #ai #machine-learning #computer-vision
  • 1 week ago · ai

    [Paper] Pixel-Wise Multimodal Contrastive Learning for Remote Sensing Images

    Satellites continuously generate massive volumes of data, particularly for Earth observation, including satellite image time series (SITS). However, most deep l...

    #research #paper #ai #machine-learning #computer-vision
  • 1 week ago · ai

    [Paper] InfiniteWeb: Scalable Web Environment Synthesis for GUI Agent Training

    GUI agents that interact with graphical interfaces on behalf of users represent a promising direction for practical AI assistants. However, training such agents...

    #research #paper #ai #machine-learning #nlp #computer-vision
  • 1 week ago · ai

    [Paper] MORPHFED: Federated Learning for Cross-institutional Blood Morphology Analysis

    Automated blood morphology analysis can support hematological diagnostics in low- and middle-income countries (LMICs) but remains sensitive to dataset shifts fr...

    #research #paper #ai #machine-learning #computer-vision
  • 1 week ago · ai

    [Paper] Analyzing Reasoning Consistency in Large Multimodal Models under Cross-Modal Conflicts

    Large Multimodal Models (LMMs) have demonstrated impressive capabilities in video reasoning via Chain-of-Thought (CoT). However, the robustness of their reasoni...

    #research #paper #ai #machine-learning #nlp #computer-vision
  • 1 week ago · ai

    [Paper] Better, But Not Sufficient: Testing Video ANNs Against Macaque IT Dynamics

    Feedforward artificial neural networks (ANNs) trained on static images remain the dominant models of the primate ventral visual stream, yet they are intrins...

    #research #paper #ai #computer-vision
  • 1 week ago · ai

    [Paper] Muses: Designing, Composing, Generating Nonexistent Fantasy 3D Creatures without Training

    We present Muses, the first training-free method for fantastic 3D creature generation in a feed-forward paradigm. Previous methods, which rely on part-aware opt...

    #research #paper #ai #computer-vision

EUNO.NEWS © 2026