ai — Page 65 | EUNO.NEWS

3 weeks ago · ai

[Paper] AirGS: Real-Time 4D Gaussian Streaming for Free-Viewpoint Video Experiences

Free-viewpoint video (FVV) enables immersive viewing experiences by allowing users to view scenes from arbitrary perspectives. As a prominent reconstruction tec...

#research #paper #ai #machine-learning
3 weeks ago · ai

[Paper] FEM-Bench: A Structured Scientific Reasoning Benchmark for Evaluating Code-Generating LLMs

As LLMs advance their reasoning capabilities about the physical world, the absence of rigorous benchmarks for evaluating their ability to generate scientificall...

#research #paper #ai #machine-learning
3 weeks ago · ai

[Paper] SemanticGen: Video Generation in Semantic Space

State-of-the-art video generative models typically learn the distribution of video latents in the VAE space and map them to pixels using a VAE decoder. While th...

#research #paper #ai #computer-vision
3 weeks ago · ai

[Paper] LongVideoAgent: Multi-Agent Reasoning with Long Videos

Recent advances in multimodal LLMs and systems that use tools for long-video QA point to the promise of reasoning over hour-long episodes. However, many methods...

#research #paper #ai #machine-learning #computer-vision
3 weeks ago · ai

[Paper] SpatialTree: How Spatial Abilities Branch Out in MLLMs

Cognitive science suggests that spatial ability develops progressively-from perception to reasoning and interaction. Yet in multimodal LLMs (MLLMs), this hierar...

#research #paper #ai #computer-vision
3 weeks ago · ai

[Paper] Active Intelligence in Video Avatars via Closed-loop World Modeling

Current video avatar generation methods excel at identity preservation and motion alignment but lack genuine agency, they cannot autonomously pursue long-term g...

#research #paper #ai #computer-vision
3 weeks ago · ai

[Paper] Making Large Language Models Efficient Dense Retrievers

Recent work has shown that directly fine-tuning large language models (LLMs) for dense retrieval yields strong performance, but their substantial parameter coun...

#research #paper #ai #nlp
3 weeks ago · ai

[Paper] FedPOD: the deployable units of training for federated learning

This paper proposes FedPOD (Proportionally Orchestrated Derivative) for optimizing learning efficiency and communication cost in federated learning among multip...

#research #paper #ai #machine-learning #computer-vision
3 weeks ago · ai

[Paper] Saddle-to-Saddle Dynamics Explains A Simplicity Bias Across Neural Network Architectures

Neural networks trained with gradient descent often learn solutions of increasing complexity over time, a phenomenon known as simplicity bias. Despite being wid...

#research #paper #ai #machine-learning
3 weeks ago · ai

[Paper] Repurposing Video Diffusion Transformers for Robust Point Tracking

Point tracking aims to localize corresponding points across video frames, serving as a fundamental task for 4D reconstruction, robotics, and video editing. Exis...

#research #paper #ai #computer-vision
3 weeks ago · ai

[Paper] Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning

Large-scale autoregressive models pretrained on next-token prediction and finetuned with reinforcement learning (RL) have achieved unprecedented success on many...

#research #paper #ai #machine-learning
3 weeks ago · ai

[Paper] MoE-DiffuSeq: Enhancing Long-Document Diffusion Models with Sparse Attention and Mixture of Experts

We present MoE-DiffuSeq, a mixture of experts based framework for enhancing diffusion models in long document generation. Existing diffusion based text generati...

#research #paper #ai #nlp

Newer posts

Older posts