research — Page 134

Sort:

3 months ago · ai · - · -

[Paper] Canvas-to-Image: Compositional Image Generation with Multimodal Controls

While modern diffusion models excel at generating high-quality and diverse images, they still struggle with high-fidelity compositional and multimodal control, ...

#image generation #diffusion models #multimodal control #computer vision #research
3 months ago · ai · - · -

[Paper] TraceGen: World Modeling in 3D Trace Space Enables Learning from Cross-Embodiment Videos

Learning new robot tasks on new platforms and in new scenes from only a handful of demonstrations remains challenging. While videos of other embodiments - human...

#research #paper #ai #machine-learning #computer-vision
3 months ago · ai · - · -

[Paper] ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

Large language models are powerful generalists, yet solving deep and complex problems such as those of the Humanity's Last Exam (HLE) remains both conceptually ...

#research #paper #ai #machine-learning #nlp
3 months ago · ai · - · -

[Paper] G$^2$VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning

Vision-Language Models (VLMs) still lack robustness in spatial intelligence, demonstrating poor performance on spatial understanding and reasoning tasks. We att...

#research #paper #ai #machine-learning #nlp #computer-vision
3 months ago · ai · - · -

[Paper] Matrix: Peer-to-Peer Multi-Agent Synthetic Data Generation Framework

Synthetic data has become increasingly important for training large language models, especially when real data is scarce, expensive, or privacy-sensitive. Many ...

#research #paper #ai #machine-learning #nlp
3 months ago · ai · - · -

[Paper] Seeing without Pixels: Perception from Camera Trajectories

Can one perceive a video's content without seeing its pixels, just from the camera trajectory-the path it carves through space? This paper is the first to syste...

#research #paper #ai #computer-vision
3 months ago · ai · - · -

[Paper] On Evolution-Based Models for Experimentation Under Interference

Causal effect estimation in networked systems is central to data-driven decision making. In such settings, interventions on one unit can spill over to others, a...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Revolutionizing Glioma Segmentation & Grading Using 3D MRI - Guided Hybrid Deep Learning Models

Gliomas are brain tumor types that have a high mortality rate which means early and accurate diagnosis is important for therapeutic intervention for the tumors....

#research #paper #ai #computer-vision
3 months ago · ai · - · -

[Paper] Through the telecom lens: Are all training samples important?

The rise of AI in telecommunications, from optimizing Radio Access Networks to managing user experience, has sharply increased data volumes and training demands...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Uncertainty Quantification for Visual Object Pose Estimation

Quantifying the uncertainty of an object's pose estimate is essential for robust control and planning. Although pose estimation is a well-studied robotics probl...

#research #paper #ai #computer-vision
3 months ago · ai · - · -

[Paper] Multi-Crit: Benchmarking Multimodal Judges on Pluralistic Criteria-Following

Large multimodal models (LMMs) are increasingly adopted as judges in multimodal evaluation systems due to their strong instruction following and consistency wit...

#research #paper #ai #computer-vision
3 months ago · ai · - · -

[Paper] Bridging the Unavoidable A Priori: A Framework for Comparative Causal Modeling

AI/ML models have rapidly gained prominence as innovations for solving previously unsolved problems and their unintended consequences from amplifying human bias...

#causal inference #system dynamics #probabilistic modeling #python library #research
3 months ago · ai · - · -

[Paper] Mechanisms of Non-Monotonic Scaling in Vision Transformers

Deeper Vision Transformers often perform worse than shallower ones, which challenges common scaling assumptions. Through a systematic empirical analysis of ViT-...

#research #paper #ai #machine-learning #computer-vision
3 months ago · ai · - · -

[Paper] Qwen3-VL Technical Report

We introduce Qwen3-VL, the most capable vision-language model in the Qwen series to date, achieving superior performance across a broad range of multimodal benc...

#research #paper #ai #machine-learning #computer-vision
3 months ago · ai · - · -

[Paper] The author is dead, but what if they never lived? A reception experiment on Czech AI- and human-authored poetry

Large language models are increasingly capable of producing creative texts, yet most studies on AI-generated poetry focus on English -- a language that dominate...

#research #paper #ai #nlp
3 months ago · ai · - · -

[Paper] Scale-Agnostic Kolmogorov-Arnold Geometry in Neural Networks

Recent work by Freedman and Mulligan demonstrated that shallow multilayer perceptrons spontaneously develop Kolmogorov-Arnold geometric (KAG) structure during t...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] TAGFN: A Text-Attributed Graph Dataset for Fake News Detection in the Age of LLMs

Large Language Models (LLMs) have recently revolutionized machine learning on text-attributed graphs, but the application of LLMs to graph outlier detection, pa...

#research #paper #ai #nlp
3 months ago · ai · - · -

[Paper] On the Origin of Algorithmic Progress in AI

Algorithms have been estimated to increase AI training FLOP efficiency by a factor of 22,000 between 2012 and 2023 [Ho et al., 2024]. Running small-scale ablati...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] ReSAM: Refine, Requery, and Reinforce: Self-Prompting Point-Supervised Segmentation for Remote Sensing Images

Interactive segmentation models such as the Segment Anything Model (SAM) have demonstrated remarkable generalization on natural images, but perform suboptimally...

#research #paper #ai #computer-vision
3 months ago · ai · - · -

[Paper] TAB-DRW: A DFT-based Robust Watermark for Generative Tabular Data

The rise of generative AI has enabled the production of high-fidelity synthetic tabular data across fields such as healthcare, finance, and public policy, raisi...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Visualizing LLM Latent Space Geometry Through Dimensionality Reduction

Large language models (LLMs) achieve state-of-the-art results across many natural language tasks, but their internal mechanisms remain difficult to interpret. I...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] MoGAN: Improving Motion Quality in Video Diffusion via Few-Step Motion Adversarial Post-Training

Video diffusion models achieve strong frame-level fidelity but still struggle with motion coherence, dynamics and realism, often producing jitter, ghosting, or ...

#research #paper #ai #computer-vision
3 months ago · ai · - · -

[Paper] On the Limits of Innate Planning in Large Language Models

Large language models (LLMs) achieve impressive results on many benchmarks, yet their capacity for planning and stateful reasoning remains unclear. We study the...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Model-Based Policy Adaptation for Closed-Loop End-to-End Autonomous Driving

End-to-end (E2E) autonomous driving models have demonstrated strong performance in open-loop evaluations but often suffer from cascading errors and poor general...

#research #paper #ai #machine-learning

Newer posts

Older posts