research — Page 132

Sort:

3 months ago · ai · - · -

[Paper] Thinking by Doing: Building Efficient World Model Reasoning in LLMs via Multi-turn Interaction

Developing robust world model reasoning is crucial for large language model (LLM) agents to plan and interact in complex environments. While multi-turn interact...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] AnyTalker: Scaling Multi-Person Talking Video Generation with Interactivity Refinement

Recently, multi-person video generation has started to gain prominence. While a few preliminary works have explored audio-driven multi-person talking video gene...

#research #paper #ai #computer-vision
3 months ago · ai · - · -

[Paper] ThetaEvolve: Test-time Learning on Open Problems

Recent advances in large language models (LLMs) have enabled breakthroughs in mathematical discovery, exemplified by AlphaEvolve, a closed-source system that ev...

#research #paper #ai #machine-learning #nlp
3 months ago · ai · - · -

[Paper] Visual Generation Tuning

Large Vision Language Models (VLMs) effectively bridge the modality gap through extensive pretraining, acquiring sophisticated visual representations aligned wi...

#research #paper #ai #computer-vision
3 months ago · ai · - · -

[Paper] SmallWorlds: Assessing Dynamics Understanding of World Models in Isolated Environments

Current world models lack a unified and controlled setting for systematic evaluation, making it difficult to assess whether they truly capture the underlying ru...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] The Price of Progress: Algorithmic Efficiency and the Falling Cost of AI Inference

Language models have seen enormous progress on advanced benchmarks in recent years, but much of this progress has only been possible by using more costly models...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Object-Centric Data Synthesis for Category-level Object Detection

Deep learning approaches to object detection have achieved reliable detection of specific object classes in images. However, extending a model's detection capab...

#research #paper #ai #computer-vision
3 months ago · ai · - · -

[Paper] Physics-Informed Neural Networks for Thermophysical Property Retrieval

Inverse heat problems refer to the estimation of material thermophysical properties given observed or known heat diffusion behaviour. Inverse heat problems have...

#research #paper #ai #machine-learning #computer-vision
3 months ago · ai · - · -

[Paper] Provable Benefits of Sinusoidal Activation for Modular Addition

This paper studies the role of activation functions in learning modular addition with two-layer neural networks. We first establish a sharp expressivity gap: si...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] ASTRO: Adaptive Stitching via Dynamics-Guided Trajectory Rollouts

Offline reinforcement learning (RL) enables agents to learn optimal policies from pre-collected datasets. However, datasets containing suboptimal and fragmented...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Accelerated Execution of Bayesian Neural Networks using a Single Probabilistic Forward Pass and Code Generation

Machine learning models perform well across domains such as diagnostics, weather forecasting, NLP, and autonomous driving, but their limited uncertainty handlin...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Towards Continuous Intelligence Growth: Self-Training, Continual Learning, and Dual-Scale Memory in SuperIntelliAgent

We introduce SuperIntelliAgent, an agentic learning framework that couples a trainable small diffusion model (the learner) with a frozen large language model (t...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Hunyuan-GameCraft-2: Instruction-following Interactive Game World Model

Recent advances in generative world models have enabled remarkable progress in creating open-ended game environments, evolving from static scene synthesis towar...

#research #paper #ai #computer-vision
3 months ago · ai · - · -

[Paper] DisMo: Disentangled Motion Representations for Open-World Motion Transfer

Recent advances in text-to-video (T2V) and image-to-video (I2V) models, have enabled the creation of visually compelling and dynamic videos from simple textual ...

#research #paper #ai #computer-vision
3 months ago · ai · - · -

[Paper] Evaluating LLMs for One-Shot Patching of Real and Artificial Vulnerabilities

Automated vulnerability patching is crucial for software security, and recent advancements in Large Language Models (LLMs) present promising capabilities for au...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] MANTA: Physics-Informed Generalized Underwater Object Tracking

Underwater object tracking is challenging due to wavelength dependent attenuation and scattering, which severely distort appearance across depths and water cond...

#research #paper #ai #computer-vision
3 months ago · ai · - · -

[Paper] LFM2 Technical Report

We present LFM2, a family of Liquid Foundation Models designed for efficient on-device deployment and strong task capabilities. Using hardware-in-the-loop archi...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Quantized-Tinyllava: a new multimodal foundation model enables efficient split learning

Split learning is well known as a method for resolving data privacy concerns by training a model on distributed devices, thereby avoiding data sharing that rais...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] MegaChat: A Synthetic Persian Q&A Dataset for High-Quality Sales Chatbot Evaluation

Small and medium-sized enterprises (SMEs) in Iran increasingly leverage Telegram for sales, where real-time engagement is essential for conversion. However, dev...

#research #paper #ai #machine-learning #nlp
3 months ago · ai · - · -

[Paper] Ambiguity Awareness Optimization: Towards Semantic Disambiguation for Direct Preference Optimization

Direct Preference Optimization (DPO) is a widely used reinforcement learning from human feedback (RLHF) method across various domains. Recent research has incre...

#research #paper #ai #nlp
3 months ago · ai · - · -

[Paper] Learning-Augmented Online Bipartite Matching in the Random Arrival Order Model

We study the online unweighted bipartite matching problem in the random arrival order model, with $n$ offline and $n$ online vertices, in the learning-augmented...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Hierarchical AI-Meteorologist: LLM-Agent System for Multi-Scale and Explainable Weather Forecast Reporting

We present the Hierarchical AI-Meteorologist, an LLM-agent system that generates explainable weather reports using a hierarchical forecast reasoning and weather...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] VQRAE: Representation Quantization Autoencoders for Multimodal Understanding, Generation and Reconstruction

Unifying multimodal understanding, generation and reconstruction representation in a single tokenizer remains a key challenge in building unified models. Previo...

#research #paper #ai #computer-vision
3 months ago · ai · - · -

[Paper] Is Passive Expertise-Based Personalization Enough? A Case Study in AI-Assisted Test-Taking

Novice and expert users have different systematic preferences in task-oriented dialogues. However, whether catering to these preferences actually improves user ...

#research #paper #ai #nlp

Newer posts

Older posts