EUNO.NEWS EUNO.NEWS
  • All (21181) +146
  • AI (3169) +10
  • DevOps (940) +5
  • Software (11185) +102
  • IT (5838) +28
  • Education (48)
  • Notice
  • All (21181) +146
    • AI (3169) +10
    • DevOps (940) +5
    • Software (11185) +102
    • IT (5838) +28
    • Education (48)
  • Notice
  • All (21181) +146
  • AI (3169) +10
  • DevOps (940) +5
  • Software (11185) +102
  • IT (5838) +28
  • Education (48)
  • Notice
Sources Tags Search
한국어 English 中文
  • 1 month ago · ai

    [Paper] Native and Compact Structured Latents for 3D Generation

    Recent advancements in 3D generative modeling have significantly improved the generation realism, yet the field is still hampered by existing representations, w...

    #research #paper #ai #machine-learning #computer-vision
  • 1 month ago · ai

    [Paper] MMGR: Multi-Modal Generative Reasoning

    Video foundation models generate visually realistic and temporally coherent content, but their reliability as world simulators depends on whether they capture p...

    #research #paper #ai #nlp #computer-vision
  • 1 month ago · ai

    [Paper] VASA-3D: Lifelike Audio-Driven Gaussian Head Avatars from a Single Image

    We propose VASA-3D, an audio-driven, single-shot 3D head avatar generator. This research tackles two major challenges: capturing the subtle expression details p...

    #research #paper #ai #machine-learning #computer-vision
  • 1 month ago · ai

    [Paper] ART: Articulated Reconstruction Transformer

    We introduce ART, Articulated Reconstruction Transformer -- a category-agnostic, feed-forward model that reconstructs complete 3D articulated objects from only ...

    #research #paper #ai #computer-vision
  • 1 month ago · ai

    [Paper] EVOLVE-VLA: Test-Time Training from Environment Feedback for Vision-Language-Action Models

    Achieving truly adaptive embodied intelligence requires agents that learn not just by imitating static demonstrations, but by continuously improving through env...

    #research #paper #ai #computer-vision
  • 1 month ago · ai

    [Paper] Enhancing Visual Sentiment Analysis via Semiotic Isotopy-Guided Dataset Construction

    Visual Sentiment Analysis (VSA) is a challenging task due to the vast diversity of emotionally salient images and the inherent difficulty of acquiring sufficien...

    #research #paper #ai #computer-vision
  • 1 month ago · ai

    [Paper] A Multicenter Benchmark of Multiple Instance Learning Models for Lymphoma Subtyping from HE-stained Whole Slide Images

    Timely and accurate lymphoma diagnosis is essential for guiding cancer treatment. Standard diagnostic practice combines hematoxylin and eosin (HE)-stained whole...

    #research #paper #ai #machine-learning #computer-vision
  • 1 month ago · ai

    [Paper] JMMMU-Pro: Image-based Japanese Multi-discipline Multimodal Understanding Benchmark via Vibe Benchmark Construction

    This paper introduces JMMMU-Pro, an image-based Japanese Multi-discipline Multimodal Understanding Benchmark, and Vibe Benchmark Construction, a scalable constr...

    #research #paper #ai #machine-learning #nlp #computer-vision
  • 1 month ago · software

    alpr.watch

    Article URL: https://alpr.watch/ Comments URL: https://news.ycombinator.com/item?id=46290916 Points: 224 Comments: 114...

    #license-plate-recognition #computer-vision #open-source #ALPR #surveillance-tool
  • 1 month ago · ai

    Ai2’s Molmo 2 shows open-source models can rival proprietary giants in video understanding

    Fresh off releasing the latest version of its Olmo foundation model, the Allen Institute for AI Ai2 launched its open-source video model, Molmo 2, on Tuesday, a...

    #Molmo 2 #video understanding #open-source AI #Allen Institute for AI #foundation models #computer vision
  • 1 month ago · ai

    AlphaFlow: Understanding and Improving MeanFlow Models

    AlphaFlow provides a smoother training schedule for MeanFlow image models, reducing the conflict between its two objectives and accelerating learning. Overview...

    #MeanFlow #AlphaFlow #image generation #training optimization #deep learning #computer vision
  • 1 month ago · ai

    [Paper] DiffusionBrowser: Interactive Diffusion Previews via Multi-Branch Decoders

    Video diffusion models have revolutionized generative video synthesis, but they are imprecise, slow, and can be opaque during generation -- keeping users in the...

    #research #paper #ai #machine-learning #computer-vision

Newer posts

Older posts
EUNO.NEWS
RSS GitHub © 2026