research — Page 105

2 months ago · ai · - · -

[Paper] Native and Compact Structured Latents for 3D Generation

Recent advancements in 3D generative modeling have significantly improved the generation realism, yet the field is still hampered by existing representations, w...

#research #paper #ai #machine-learning #computer-vision
2 months ago · ai · - · -

[Paper] MMGR: Multi-Modal Generative Reasoning

Video foundation models generate visually realistic and temporally coherent content, but their reliability as world simulators depends on whether they capture p...

#research #paper #ai #nlp #computer-vision
2 months ago · ai · - · -

[Paper] CHIP: Adaptive Compliance for Humanoid Control through Hindsight Perturbation

Recent progress in humanoid robots has unlocked agile locomotion skills, including backflipping, running, and crawling. Yet it remains challenging for a humanoi...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Spoken DialogSum: An Emotion-Rich Conversational Dataset for Spoken Dialogue Summarization

Recent audio language models can follow long conversations. However, research on emotion-aware or spoken dialogue summarization is constrained by the lack of da...

#research #paper #ai #machine-learning #nlp
2 months ago · ai · - · -

[Paper] Bias-Variance Trade-off for Clipped Stochastic First-Order Methods: From Bounded Variance to Infinite Mean

Stochastic optimization is fundamental to modern machine learning. Recent research has extended the study of stochastic first-order methods (SFOMs) from light-t...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Early Warning Index for Patient Deteriorations in Hospitals

Hospitals lack automated systems to harness the growing volume of heterogeneous clinical and operational data to effectively forecast critical events. Early ide...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Fast and Accurate Causal Parallel Decoding using Jacobi Forcing

Multi-token generation has emerged as a promising paradigm for accelerating transformer-based large model inference. Recent efforts primarily explore diffusion ...

#research #paper #ai #nlp
2 months ago · ai · - · -

[Paper] VASA-3D: Lifelike Audio-Driven Gaussian Head Avatars from a Single Image

We propose VASA-3D, an audio-driven, single-shot 3D head avatar generator. This research tackles two major challenges: capturing the subtle expression details p...

#research #paper #ai #machine-learning #computer-vision
2 months ago · ai · - · -

[Paper] Beyond Lipschitz Continuity and Monotonicity: Fractal and Chaotic Activation Functions in Echo State Networks

Contemporary reservoir computing relies heavily on smooth, globally Lipschitz continuous activation functions, limiting applications in defense, disaster respon...

#research #paper #ai #machine-learning
2 months ago · software · - · -

[Paper] Reconsidering Conversational Norms in LLM Chatbots for Sustainable AI

LLM-based chatbots have become central interfaces in technical, educational, and analytical domains, supporting tasks such as code reasoning, problem solving, a...

#research #paper #software
2 months ago · ai · - · -

[Paper] ART: Articulated Reconstruction Transformer

We introduce ART, Articulated Reconstruction Transformer -- a category-agnostic, feed-forward model that reconstructs complete 3D articulated objects from only ...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] EVOLVE-VLA: Test-Time Training from Environment Feedback for Vision-Language-Action Models

Achieving truly adaptive embodied intelligence requires agents that learn not just by imitating static demonstrations, but by continuously improving through env...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] Enhancing Visual Sentiment Analysis via Semiotic Isotopy-Guided Dataset Construction

Visual Sentiment Analysis (VSA) is a challenging task due to the vast diversity of emotionally salient images and the inherent difficulty of acquiring sufficien...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] gridfm-datakit-v1: A Python Library for Scalable and Realistic Power Flow and Optimal Power Flow Data Generation

We introduce gridfm-datakit-v1, a Python library for generating realistic and diverse Power Flow (PF) and Optimal Power Flow (OPF) datasets for training Machine...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Segmental Attention Decoding With Long Form Acoustic Encodings

We address the fundamental incompatibility of attention-based encoder-decoder (AED) models with long-form acoustic encodings. AED models trained on segmented ut...

#research #paper #ai #nlp
2 months ago · ai · - · -

[Paper] TiME: Tiny Monolingual Encoders for Efficient NLP Pipelines

Today, a lot of research on language models is focused on large, general-purpose models. However, many NLP pipelines only require models with a well-defined, sm...

#research #paper #ai #machine-learning #nlp
2 months ago · ai · - · -

[Paper] A Multicenter Benchmark of Multiple Instance Learning Models for Lymphoma Subtyping from HE-stained Whole Slide Images

Timely and accurate lymphoma diagnosis is essential for guiding cancer treatment. Standard diagnostic practice combines hematoxylin and eosin (HE)-stained whole...

#research #paper #ai #machine-learning #computer-vision
2 months ago · ai · - · -

[Paper] MuseCPBench: an Empirical Study of Music Editing Methods through Music Context Preservation

Music editing plays a vital role in modern music production, with applications in film, broadcasting, and game development. Recent advances in music generation ...

#research #paper #ai #machine-learning
2 months ago · devops · - · -

[Paper] PruneX: A Hierarchical Communication-Efficient System for Distributed CNN Training with Structured Pruning

Inter-node communication bandwidth increasingly constrains distributed training at scale on multi-node GPU clusters. While compact models are the ultimate deplo...

#research #paper #devops
2 months ago · ai · - · -

[Paper] JMMMU-Pro: Image-based Japanese Multi-discipline Multimodal Understanding Benchmark via Vibe Benchmark Construction

This paper introduces JMMMU-Pro, an image-based Japanese Multi-discipline Multimodal Understanding Benchmark, and Vibe Benchmark Construction, a scalable constr...

#research #paper #ai #machine-learning #nlp #computer-vision
2 months ago · ai · - · -

[Paper] ParaFormer: A Generalized PageRank Graph Transformer for Graph Representation Learning

Graph Transformers (GTs) have emerged as a promising graph learning tool, leveraging their all-pair connected property to effectively capture global information...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Model-Based Reinforcement Learning in Discrete-Action Non-Markovian Reward Decision Processes

Many practical decision-making problems involve tasks whose success depends on the entire system history, rather than on achieving a state with desired properti...

#research #paper #ai #machine-learning
2 months ago · software · - · -

[Paper] MoT: A Model-Driven Low-Code Approach for Simplifying Cloud-of-Things Application Development

The integration of cloud computing and the Internet of Things (IoT) is essential for scalable, intelligent systems. However, developing cloud-of-things (CoT) ap...

#research #paper #software
2 months ago · ai · - · -

[Paper] Towards Nepali-language LLMs: Efficient GPT training with a Nepali BPE tokenizer

Nepali, a low-resource language spoken by over 32 million people, continues to face challenges in natural language processing (NLP) due to its complex grammar, ...

#research #paper #ai #machine-learning #nlp
