EUNO.NEWS EUNO.NEWS
  • All (2746) +423
  • AI (587) +28
  • DevOps (157) +9
  • Software (1177) +242
  • IT (819) +143
  • Education (6) +1
  • Notice
  • All (2746) +423
    • AI (587) +28
    • DevOps (157) +9
    • Software (1177) +242
    • IT (819) +143
    • Education (6) +1
  • Notice
  • All (2746) +423
  • AI (587) +28
  • DevOps (157) +9
  • Software (1177) +242
  • IT (819) +143
  • Education (6) +1
  • Notice
Sources Tags Search
한국어 English 中文
  • 4 days ago · ai

    [Paper] Data-Centric Visual Development for Self-Driving Labs

    Self-driving laboratories offer a promising path toward reducing the labor-intensive, time-consuming, and often irreproducible workflows in the biological scien...

    #research #paper #ai #computer-vision
  • 4 days ago · ai

    [Paper] Visual Sync: Multi-Camera Synchronization via Cross-View Object Motion

    Today, people can easily record memorable moments, ranging from concerts, sports events, lectures, family gatherings, and birthday parties with multiple consume...

    #research #paper #ai #machine-learning #computer-vision
  • 4 days ago · ai

    [Paper] Objects in Generated Videos Are Slower Than They Appear: Models Suffer Sub-Earth Gravity and Don't Know Galileo's Principle...for now

    Video generators are increasingly evaluated as potential world models, which requires them to encode and understand physical laws. We investigate their represen...

    #research #paper #ai #computer-vision
  • 4 days ago · ai

    [Paper] Generative Video Motion Editing with 3D Point Tracks

    Camera and object motions are central to a video's narrative. However, precisely editing these captured motions remains a significant challenge, especially unde...

    #research #paper #ai #computer-vision
  • 4 days ago · ai

    [Paper] TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models

    Unified multimodal models (UMMs) aim to jointly perform multimodal understanding and generation within a single framework. We present TUNA, a native UMM that bu...

    #research #paper #ai #computer-vision
  • 4 days ago · ai

    [Paper] Improved Mean Flows: On the Challenges of Fastforward Generative Models

    MeanFlow (MF) has recently been established as a framework for one-step generative modeling. However, its ``fastforward'' nature introduces key challenges in bo...

    #research #paper #ai #machine-learning #computer-vision
  • 4 days ago · ai

    [Paper] Four Over Six: More Accurate NVFP4 Quantization with Adaptive Block Scaling

    As large language models have grown larger, low-precision numerical formats such as NVFP4 have become increasingly popular due to the speed and memory benefits ...

    #research #paper #ai #machine-learning #nlp
  • 4 days ago · ai

    [Paper] AirSim360: A Panoramic Simulation Platform within Drone View

    The field of 360-degree omnidirectional understanding has been receiving increasing attention for advancing spatial intelligence. However, the lack of large-sca...

    #research #paper #ai #computer-vision
  • 4 days ago · ai

    [Paper] The Art of Scaling Test-Time Compute for Large Language Models

    Test-time scaling (TTS) -- the dynamic allocation of compute during inference -- is a promising direction for improving reasoning in large language models (LLMs...

    #research #paper #ai #nlp
  • 4 days ago · ai

    [Paper] MV-TAP: Tracking Any Point in Multi-View Videos

    Multi-view camera systems enable rich observations of complex real-world scenes, and understanding dynamic objects in multi-view settings has become central to ...

    #research #paper #ai #computer-vision
  • 4 days ago · ai

    [Paper] Learning Visual Affordance from Audio

    We introduce Audio-Visual Affordance Grounding (AV-AG), a new task that segments object interaction regions from action sounds. Unlike existing approaches that ...

    #research #paper #ai #computer-vision
  • 4 days ago · ai

    [Paper] AlignSAE: Concept-Aligned Sparse Autoencoders

    Large Language Models (LLMs) encode factual knowledge within hidden parametric spaces that are difficult to inspect or control. While Sparse Autoencoders (SAEs)...

    #research #paper #ai #machine-learning #nlp

Newer posts

Older posts
EUNO.NEWS
RSS GitHub © 2025