paper — Page 15 | EUNO.NEWS

Sort:

4 days ago · ai · - · -

[Paper] Sketch2Colab: Sketch-Conditioned Multi-Human Animation via Controllable Flow Distillation

We present Sketch2Colab, which turns storyboard-style 2D sketches into coherent, object-aware 3D multi-human motion with fine-grained control over agents, joint...

#research #paper #ai #machine-learning #computer-vision
4 days ago · ai · - · -

[Paper] Multi-Head Low-Rank Attention

Long-context inference in large language models is bottlenecked by Key--Value (KV) cache loading during the decoding stage, where the sequential nature of gener...

#research #paper #ai #machine-learning
4 days ago · ai · - · -

[Paper] MAC: A Conversion Rate Prediction Benchmark Featuring Labels Under Multiple Attribution Mechanisms

Multi-attribution learning (MAL), which enhances model performance by learning from conversion labels yielded by multiple attribution mechanisms, has emerged as...

#research #paper #ai #machine-learning
4 days ago · ai · - · -

[Paper] Leveraging Model Soups to Classify Intangible Cultural Heritage Images from the Mekong Delta

The classification of Intangible Cultural Heritage (ICH) images in the Mekong Delta poses unique challenges due to limited annotated data, high visual similarit...

#research #paper #ai #machine-learning #computer-vision
4 days ago · ai · - · -

[Paper] Reservoir Subspace Injection for Online ICA under Top-n Whitening

Reservoir expansion can improve online independent component analysis (ICA) under nonlinear mixing, yet top-n whitening may discard injected features. We formal...

#research #paper #ai #machine-learning
4 days ago · ai · - · -

[Paper] Organizing, Orchestrating, and Benchmarking Agent Skills at Ecosystem Scale

The rapid proliferation of Claude agent skills has raised the central question of how to effectively leverage, manage, and scale the agent skill ecosystem. In t...

#research #paper #ai #nlp
4 days ago · ai · - · -

[Paper] Kiwi-Edit: Versatile Video Editing via Instruction and Reference Guidance

Instruction-based video editing has witnessed rapid progress, yet current methods often struggle with precise visual control, as natural language is inherently ...

#research #paper #ai #machine-learning #computer-vision
4 days ago · ai · - · -

[Paper] GeoDiT: Point-Conditioned Diffusion Transformer for Satellite Image Synthesis

We introduce GeoDiT, a diffusion transformer designed for text-to-satellite image generation with point-based control. Existing controlled satellite image gener...

#research #paper #ai #computer-vision
4 days ago · ai · - · -

[Paper] SageBwd: A Trainable Low-bit Attention

Low-bit attention, such as SageAttention, has emerged as an effective approach for accelerating model inference, but its applicability to training remains poorl...

#research #paper #ai #machine-learning
4 days ago · ai · - · -

[Paper] Bridging the gap between Performance and Interpretability: An Explainable Disentangled Multimodal Framework for Cancer Survival Prediction

While multimodal survival prediction models are increasingly more accurate, their complexity often reduces interpretability, limiting insight into how different...

#research #paper #ai #computer-vision
4 days ago · ai · - · -

[Paper] Scaling Retrieval Augmented Generation with RAG Fusion: Lessons from an Industry Deployment

Retrieval-Augmented Generation (RAG) systems commonly adopt retrieval fusion techniques such as multi-query retrieval and reciprocal rank fusion (RRF) to increa...

#research #paper #ai #machine-learning #nlp
4 days ago · ai · - · -

[Paper] Zero- and Few-Shot Named-Entity Recognition: Case Study and Dataset in the Crime Domain (CrimeNER)

The extraction of critical information from crime-related documents is a crucial task for law enforcement agencies. Named-Entity Recognition (NER) can perform t...

#research #paper #ai #machine-learning #nlp

Newer posts

Older posts