EUNO.NEWS EUNO.NEWS
  • All (2682) +359
  • AI (585) +26
  • DevOps (156) +8
  • Software (1140) +205
  • IT (795) +119
  • Education (6) +1
  • Notice
  • All (2682) +359
    • AI (585) +26
    • DevOps (156) +8
    • Software (1140) +205
    • IT (795) +119
    • Education (6) +1
  • Notice
  • All (2682) +359
  • AI (585) +26
  • DevOps (156) +8
  • Software (1140) +205
  • IT (795) +119
  • Education (6) +1
  • Notice
Sources Tags Search
한국어 English 中文
  • 3 days ago · ai

    [Paper] MultiShotMaster: A Controllable Multi-Shot Video Generation Framework

    Current video generation techniques excel at single-shot clips but struggle to produce narrative multi-shot videos, which require flexible shot arrangement, coh...

    #research #paper #ai #computer-vision
  • 3 days ago · ai

    [Paper] Video4Spatial: Towards Visuospatial Intelligence with Context-Guided Video Generation

    We investigate whether video generative models can exhibit visuospatial intelligence, a capability central to human cognition, using only visual data. To this e...

    #research #paper #ai #machine-learning #computer-vision
  • 3 days ago · ai

    [Paper] ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation

    Despite progress in video-to-audio generation, the field focuses predominantly on mono output, lacking spatial immersion. Existing binaural approaches remain co...

    #research #paper #ai #machine-learning #computer-vision
  • 3 days ago · ai

    [Paper] Learning Physically Consistent Lagrangian Control Models Without Acceleration Measurements

    This article investigates the modeling and control of Lagrangian systems involving non-conservative forces using a hybrid method that does not require accelerat...

    #research #paper #ai #machine-learning
  • 3 days ago · ai

    [Paper] MAViD: A Multimodal Framework for Audio-Visual Dialogue Understanding and Generation

    We propose MAViD, a novel Multimodal framework for Audio-Visual Dialogue understanding and generation. Existing approaches primarily focus on non-interactive sy...

    #research #paper #ai #computer-vision
  • 3 days ago · ai

    [Paper] SMP: Reusable Score-Matching Motion Priors for Physics-Based Character Control

    Data-driven motion priors that can guide agents toward producing naturalistic behaviors play a pivotal role in creating life-like virtual characters. Adversaria...

    #research #paper #ai #machine-learning #computer-vision
  • 3 days ago · ai

    [Paper] The Moral Consistency Pipeline: Continuous Ethical Evaluation for Large Language Models

    The rapid advancement and adaptability of Large Language Models (LLMs) highlight the need for moral consistency, the capacity to maintain ethically coherent rea...

    #research #paper #ai #machine-learning #nlp
  • 3 days ago · ai

    [Paper] LORE: A Large Generative Model for Search Relevance

    Achievement. We introduce LORE, a systematic framework for Large Generative Model-based relevance in e-commerce search. Deployed and iterated over three years, ...

    #research #paper #ai #machine-learning #nlp
  • 3 days ago · ai

    [Paper] TokenPowerBench: Benchmarking the Power Consumption of LLM Inference

    Large language model (LLM) services now answer billions of queries per day, and industry reports show that inference, not training, accounts for more than 90% o...

    #research #paper #ai #machine-learning
  • 3 days ago · ai

    [Paper] Unrolled Networks are Conditional Probability Flows in MRI Reconstruction

    Magnetic Resonance Imaging (MRI) offers excellent soft-tissue contrast without ionizing radiation, but its long acquisition time limits clinical utility. Recent...

    #research #paper #ai #computer-vision
  • 3 days ago · ai

    [Paper] Distribution-Calibrated Inference time compute for Thinking LLM-as-a-Judge

    Thinking Large Language Models (LLMs) used as judges for pairwise preferences remain noisy at the single-sample level, and common aggregation rules (majority vo...

    #research #paper #ai #machine-learning
  • 3 days ago · ai

    [Paper] In-Context Sync-LoRA for Portrait Video Editing

    Editing portrait videos is a challenging task that requires flexible yet precise control over a wide range of modifications, such as appearance changes, express...

    #research #paper #ai #machine-learning #computer-vision

Newer posts

Older posts
EUNO.NEWS
RSS GitHub © 2025