[Paper] Evolving Excellence: Automated Optimization of LLM-based Agents
Agentic AI systems built on large language models (LLMs) offer significant potential for automating complex workflows, from software development to customer sup...
Agentic AI systems built on large language models (LLMs) offer significant potential for automating complex workflows, from software development to customer sup...
Recent advances in diffusion transformers have empowered video generation models to generate high-quality video clips from texts or images. However, world model...
Novel View Synthesis (NVS) has traditionally relied on models with explicit 3D inductive biases combined with known camera parameters from Structure-from-Motion...
Understanding and reconstructing the complex geometry and motion of dynamic scenes from video remains a formidable challenge in computer vision. This paper intr...
We introduce two new benchmarks REST and REST+(Render-Equivalence Stress Tests) to enable systematic evaluation of cross-modal inconsistency in multimodal large...
Text-Aware Image Restoration (TAIR) aims to recover high- quality images from low-quality inputs containing degraded textual content. While diffusion models pro...
Human video demonstrations provide abundant training data for learning robot policies, but video alone cannot capture the rich contact signals critical for mast...
Quantum Error Correction (QEC) decoding faces a fundamental accuracy-efficiency tradeoff. Classical methods like Minimum Weight Perfect Matching (MWPM) exhibit ...
Nighttime environments pose significant challenges for camera-based perception, as existing methods passively rely on the scene lighting. We introduce Lighting-...
In empirical software engineering (SE) research, researchers have considerable freedom to decide how to process data, what operationalizations to use, and which...
Generating high-quality, textured 3D scenes from a single image remains a fundamental challenge in vision and graphics. Recent image-to-3D generators recover re...
Content-aware layout generation is a critical task in graphic design automation, focused on creating visually appealing arrangements of elements that seamlessly...