EUNO.NEWS EUNO.NEWS
  • All (21181) +146
  • AI (3169) +10
  • DevOps (940) +5
  • Software (11185) +102
  • IT (5838) +28
  • Education (48)
  • Notice
  • All (21181) +146
    • AI (3169) +10
    • DevOps (940) +5
    • Software (11185) +102
    • IT (5838) +28
    • Education (48)
  • Notice
  • All (21181) +146
  • AI (3169) +10
  • DevOps (940) +5
  • Software (11185) +102
  • IT (5838) +28
  • Education (48)
  • Notice
Sources Tags Search
한국어 English 中文
  • 1 month ago · ai

    [Paper] Siamese-Driven Optimization for Low-Resolution Image Latent Embedding in Image Captioning

    Image captioning is essential in many fields including assisting visually impaired individuals, improving content management systems, and enhancing human-comput...

    #research #paper #ai #machine-learning #computer-vision
  • 1 month ago · ai

    [Paper] MatteViT: High-Frequency-Aware Document Shadow Removal with Shadow Matte Guidance

    Document shadow removal is essential for enhancing the clarity of digitized documents. Preserving high-frequency details (e.g., text edges and lines) is critica...

    #research #paper #ai #machine-learning #computer-vision
  • 1 month ago · ai

    [Paper] Skewness-Guided Pruning of Multimodal Swin Transformers for Federated Skin Lesion Classification on Edge Devices

    In recent years, high-performance computer vision models have achieved remarkable success in medical imaging, with some skin lesion classification systems even ...

    #research #paper #ai #computer-vision
  • 1 month ago · ai

    [Paper] Pose-Based Sign Language Spotting via an End-to-End Encoder Architecture

    Automatic Sign Language Recognition (ASLR) has emerged as a vital field for bridging the gap between deaf and hearing communities. However, the problem of sign-...

    #research #paper #ai #nlp #computer-vision
  • 1 month ago · ai

    [Paper] Conditional Morphogenesis: Emergent Generation of Structural Digits via Neural Cellular Automata

    Biological systems exhibit remarkable morphogenetic plasticity, where a single genome can encode various specialized cellular structures triggered by local chem...

    #research #paper #ai #machine-learning #computer-vision
  • 1 month ago · ai

    [Paper] Voxify3D: Pixel Art Meets Volumetric Rendering

    Voxel art is a distinctive stylization widely used in games and digital media, yet automated generation from 3D meshes remains challenging due to conflicting re...

    #research #paper #ai #computer-vision
  • 1 month ago · ai

    [Paper] Relational Visual Similarity

    Humans do not just see attribute similarity -- we also see relational similarity. An apple is like a peach because both are reddish fruit, but the Earth is also...

    #research #paper #ai #machine-learning #computer-vision
  • 1 month ago · ai

    [Paper] UnityVideo: Unified Multi-Modal Multi-Task Learning for Enhancing World-Aware Video Generation

    Recent video generation models demonstrate impressive synthesis capabilities but remain limited by single-modality conditioning, constraining their holistic wor...

    #research #paper #ai #computer-vision
  • 1 month ago · ai

    [Paper] One Layer Is Enough: Adapting Pretrained Visual Encoders for Image Generation

    Visual generative models (e.g., diffusion models) typically operate in compressed latent spaces to balance training efficiency and sample quality. In parallel, ...

    #research #paper #ai #machine-learning #computer-vision
  • 1 month ago · ai

    [Paper] OpenVE-3M: A Large-Scale High-Quality Dataset for Instruction-Guided Video Editing

    The quality and diversity of instruction-based image editing datasets are continuously increasing, yet large-scale, high-quality datasets for instruction-based ...

    #research #paper #ai #computer-vision
  • 1 month ago · ai

    [Paper] WorldReel: 4D Video Generation with Consistent Geometry and Motion Modeling

    Recent video generators achieve striking photorealism, yet remain fundamentally inconsistent in 3D. We present WorldReel, a 4D video generator that is natively ...

    #research #paper #ai #machine-learning #computer-vision
  • 1 month ago · ai

    [Paper] Lang3D-XL: Language Embedded 3D Gaussians for Large-scale Scenes

    Embedding a language field in a 3D representation enables richer semantic understanding of spatial environments by linking geometry with descriptive meaning. Th...

    #research #paper #ai #computer-vision

Newer posts

Older posts
EUNO.NEWS
RSS GitHub © 2026