research — Page 23

Sort:

1 week ago · ai · - · -

[Paper] MediX-R1: Open Ended Medical Reinforcement Learning

We introduce MediX-R1, an open-ended Reinforcement Learning (RL) framework for medical multimodal large language models (MLLMs) that enables clinically grounded...

#research #paper #ai #computer-vision
1 week ago · ai · - · -

[Paper] VGG-T$^3$: Offline Feed-Forward 3D Reconstruction at Scale

We present a scalable 3D reconstruction model that addresses a critical limitation in offline feed-forward methods: their computational and memory requirements ...

#research #paper #ai #computer-vision
1 week ago · ai · - · -

[Paper] Model Agreement via Anchoring

Numerous lines of aim to control model disagreement -- the extent to which two machine learning models disagree in their predictions. We adopt a simple and stan...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] SeeThrough3D: Occlusion Aware 3D Control in Text-to-Image Generation

We identify occlusion reasoning as a fundamental yet overlooked aspect for 3D layout-conditioned generation. It is essential for synthesizing partially occluded...

#research #paper #ai #machine-learning #computer-vision
1 week ago · ai · - · -

[Paper] A Dataset is Worth 1 MB

A dataset server must often distribute the same large payload to many clients, incurring massive communication costs. Since clients frequently operate on divers...

#research #paper #ai #machine-learning #computer-vision
1 week ago · ai · - · -

[Paper] Sensor Generalization for Adaptive Sensing in Event-based Object Detection via Joint Distribution Training

Bio-inspired event cameras have recently attracted significant research due to their asynchronous and low-latency capabilities. These features provide a high dy...

#research #paper #ai #computer-vision
1 week ago · ai · - · -

[Paper] SOTAlign: Semi-Supervised Alignment of Unimodal Vision and Language Models via Optimal Transport

The Platonic Representation Hypothesis posits that neural networks trained on different modalities converge toward a shared statistical model of the world. Rece...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] EvoX: Meta-Evolution for Automated Discovery

Recent work such as AlphaEvolve has shown that combining LLM-driven optimization with evolutionary search can effectively improve programs, prompts, and algorit...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] Scale Can't Overcome Pragmatics: The Impact of Reporting Bias on Vision-Language Reasoning

The lack of reasoning capabilities in Vision-Language Models (VLMs) has remained at the forefront of research discourse. We posit that this behavior stems from ...

#research #paper #ai #nlp #computer-vision
1 week ago · ai · - · -

[Paper] FlashOptim: Optimizers for Memory Efficient Training

Standard mixed-precision training of neural networks requires many bytes of accelerator memory for each model parameter. These bytes reflect not just the parame...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Mean Estimation from Coarse Data: Characterizations and Efficient Algorithms

Coarse data arise when learners observe only partial information about samples; namely, a set containing the sample rather than its exact value. This occurs nat...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Retrieve and Segment: Are a Few Examples Enough to Bridge the Supervision Gap in Open-Vocabulary Segmentation?

Open-vocabulary segmentation (OVS) extends the zero-shot recognition capabilities of vision-language models (VLMs) to pixel-level prediction, enabling segmentat...

#research #paper #ai #computer-vision

Newer posts

Older posts