EUNO.NEWS EUNO.NEWS
  • All (21181) +146
  • AI (3169) +10
  • DevOps (940) +5
  • Software (11185) +102
  • IT (5838) +28
  • Education (48)
  • Notice
  • All (21181) +146
    • AI (3169) +10
    • DevOps (940) +5
    • Software (11185) +102
    • IT (5838) +28
    • Education (48)
  • Notice
  • All (21181) +146
  • AI (3169) +10
  • DevOps (940) +5
  • Software (11185) +102
  • IT (5838) +28
  • Education (48)
  • Notice
Sources Tags Search
한국어 English 中文
  • 1 month ago · ai

    [Paper] PathBench-MIL: A Comprehensive AutoML and Benchmarking Framework for Multiple Instance Learning in Histopathology

    We introduce PathBench-MIL, an open-source AutoML and benchmarking framework for multiple instance learning (MIL) in histopathology. The system automates end-to...

    #research #paper #ai #machine-learning #computer-vision
  • 1 month ago · ai

    [Paper] Generative Refocusing: Flexible Defocus Control from a Single Image

    Depth-of-field control is essential in photography, but getting the perfect focus often takes several tries or special equipment. Single-image refocusing is sti...

    #research #paper #ai #computer-vision
  • 1 month ago · ai

    [Paper] The World is Your Canvas: Painting Promptable Events with Reference Images, Trajectories, and Text

    We present WorldCanvas, a framework for promptable world events that enables rich, user-directed simulation by combining text, trajectories, and reference image...

    #research #paper #ai #computer-vision
  • 1 month ago · ai

    [Paper] Next-Embedding Prediction Makes Strong Vision Learners

    Inspired by the success of generative pretraining in natural language, we ask whether the same principles can yield strong self-supervised visual learners. Inst...

    #research #paper #ai #computer-vision
  • 1 month ago · ai

    [Paper] Differences That Matter: Auditing Models for Capability Gap Discovery and Rectification

    Conventional evaluation methods for multimodal LLMs (MLLMs) lack interpretability and are often insufficient to fully disclose significant capability gaps acros...

    #research #paper #ai #machine-learning #computer-vision
  • 1 month ago · ai

    [Paper] DVGT: Driving Visual Geometry Transformer

    Perceiving and reconstructing 3D scene geometry from visual inputs is crucial for autonomous driving. However, there still lacks a driving-targeted dense geomet...

    #research #paper #ai #machine-learning #computer-vision
  • 1 month ago · ai

    [Paper] EasyV2V: A High-quality Instruction-based Video Editing Framework

    While image editing has advanced rapidly, video editing remains less explored, facing challenges in consistency, control, and generalization. We study the desig...

    #research #paper #ai #machine-learning #computer-vision
  • 1 month ago · ai

    [Paper] AdaTooler-V: Adaptive Tool-Use for Images and Videos

    Recent advances have shown that multimodal large language models (MLLMs) benefit from multimodal interleaved chain-of-thought (CoT) with vision tool interaction...

    #research #paper #ai #computer-vision
  • 1 month ago · ai

    [Paper] StereoPilot: Learning Unified and Efficient Stereo Conversion via Generative Priors

    The rapid growth of stereoscopic displays, including VR headsets and 3D cinemas, has led to increasing demand for high-quality stereo video content. However, pr...

    #research #paper #ai #computer-vision
  • 1 month ago · ai

    [Paper] Depth Any Panoramas: A Foundation Model for Panoramic Depth Estimation

    In this work, we present a panoramic metric depth foundation model that generalizes across diverse scene distances. We explore a data-in-the-loop paradigm from ...

    #research #paper #ai #computer-vision
  • 1 month ago · ai

    [Paper] SFTok: Bridging the Performance Gap in Discrete Tokenizers

    Recent advances in multimodal models highlight the pivotal role of image tokenization in high-resolution image generation. By compressing images into compact la...

    #research #paper #ai #machine-learning #computer-vision
  • 1 month ago · ai

    [Paper] Flowing from Reasoning to Motion: Learning 3D Hand Trajectory Prediction from Egocentric Human Interaction Videos

    Prior works on 3D hand trajectory prediction are constrained by datasets that decouple motion from semantic supervision and by models that weakly link reasoning...

    #research #paper #ai #machine-learning #computer-vision

Newer posts

Older posts
EUNO.NEWS
RSS GitHub © 2026