research — Page 93

2 months ago · software · - · -

[Paper] Process Analytics -- Data-driven Business Process Management

Data-driven analysis of business processes has a long tradition in research. Recently, however, the term process mining is mostly used when referring to data-...

#research #paper #software
2 months ago · ai · - · -

[Paper] SemanticGen: Video Generation in Semantic Space

State-of-the-art video generative models typically learn the distribution of video latents in the VAE space and map them to pixels using a VAE decoder. While th...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] LongVideoAgent: Multi-Agent Reasoning with Long Videos

Recent advances in multimodal LLMs and systems that use tools for long-video QA point to the promise of reasoning over hour-long episodes. However, many methods...

#research #paper #ai #machine-learning #computer-vision
2 months ago · ai · - · -

[Paper] SpatialTree: How Spatial Abilities Branch Out in MLLMs

Cognitive science suggests that spatial ability develops progressively, from perception to reasoning and interaction. Yet in multimodal LLMs (MLLMs), this hierar...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] Active Intelligence in Video Avatars via Closed-loop World Modeling

Current video avatar generation methods excel at identity preservation and motion alignment but lack genuine agency: they cannot autonomously pursue long-term g...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] Making Large Language Models Efficient Dense Retrievers

Recent work has shown that directly fine-tuning large language models (LLMs) for dense retrieval yields strong performance, but their substantial parameter coun...

#research #paper #ai #nlp
2 months ago · ai · - · -

[Paper] FedPOD: the deployable units of training for federated learning

This paper proposes FedPOD (Proportionally Orchestrated Derivative) for optimizing learning efficiency and communication cost in federated learning among multip...

#research #paper #ai #machine-learning #computer-vision
2 months ago · ai · - · -

[Paper] Saddle-to-Saddle Dynamics Explains A Simplicity Bias Across Neural Network Architectures

Neural networks trained with gradient descent often learn solutions of increasing complexity over time, a phenomenon known as simplicity bias. Despite being wid...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Repurposing Video Diffusion Transformers for Robust Point Tracking

Point tracking aims to localize corresponding points across video frames, serving as a fundamental task for 4D reconstruction, robotics, and video editing. Exis...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning

Large-scale autoregressive models pretrained on next-token prediction and finetuned with reinforcement learning (RL) have achieved unprecedented success on many...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] MoE-DiffuSeq: Enhancing Long-Document Diffusion Models with Sparse Attention and Mixture of Experts

We present MoE-DiffuSeq, a mixture of experts based framework for enhancing diffusion models in long document generation. Existing diffusion based text generati...

#research #paper #ai #nlp
2 months ago · ai · - · -

[Paper] Cube Bench: A Benchmark for Spatial Visual Reasoning in MLLMs

We introduce Cube Bench, a Rubik's-cube benchmark for evaluating spatial and sequential reasoning in multimodal large language models (MLLMs). The benchmark dec...

#research #paper #ai #machine-learning #nlp #computer-vision
2 months ago · ai · - · -

[Paper] Leveraging High-Fidelity Digital Models and Reinforcement Learning for Mission Engineering: A Case Study of Aerial Firefighting Under Perfect Information

As systems engineering (SE) objectives evolve from design and operation of monolithic systems to complex System of Systems (SoS), the discipline of Mission Engi...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Automated stereotactic radiosurgery planning using a human-in-the-loop reasoning large language model agent

Stereotactic radiosurgery (SRS) demands precise dose shaping around critical structures, yet black-box AI systems have limited clinical adoption due to opacity ...

#research #paper #ai #machine-learning #nlp
2 months ago · ai · - · -

[Paper] Relu and softplus neural nets as zero-sum turn-based games

We show that the output of a ReLU neural network can be interpreted as the value of a zero-sum, turn-based, stopping game, which we call the ReLU net game. The ...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Can LLMs Predict Their Own Failures? Self-Awareness via Internal Circuits

Large language models (LLMs) generate fluent and complex outputs but often fail to recognize their own mistakes and hallucinations. Existing approaches typicall...

#research #paper #ai #nlp
2 months ago · ai · - · -

[Paper] Improving ML Training Data with Gold-Standard Quality Metrics

Hand-tagged training data is essential to many machine learning tasks. However, training data quality control has received little attention in the literature, d...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Performative Policy Gradient: Optimality in Performative Reinforcement Learning

Post-deployment machine learning algorithms often influence the environments they act in, and thus shift the underlying dynamics that the standard reinforcement...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Fail Fast, Win Big: Rethinking the Drafting Strategy in Speculative Decoding via Diffusion LLMs

Diffusion Large Language Models (dLLMs) offer fast, parallel token generation, but their standalone use is plagued by an inherent efficiency-quality tradeoff. W...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Distilling to Hybrid Attention Models via KL-Guided Layer Selection

Distilling pretrained softmax attention Transformers into more efficient hybrid architectures that interleave softmax and linear attention layers is a promising...

#research #paper #ai #machine-learning #nlp
2 months ago · ai · - · -

[Paper] LEAD: Minimizing Learner-Expert Asymmetry in End-to-End Driving

Simulators can generate virtually unlimited driving data, yet imitation learning policies in simulation still struggle to achieve robust closed-loop performance...

#research #paper #ai #machine-learning #computer-vision
2 months ago · ai · - · -

[Paper] Shallow Neural Networks Learn Low-Degree Spherical Polynomials with Learnable Channel Attention

We study the problem of learning a low-degree spherical polynomial of degree $\ell_0 = \Theta(1) \ge 1$ defined on the unit sphere in $\mathbb{R}^d$ by training an over-parameteri...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] FlashVLM: Text-Guided Visual Token Selection for Large Multimodal Models

Large vision-language models (VLMs) typically process hundreds or thousands of visual tokens per image or video frame, incurring quadratic attention cost and su...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models

Vision-language models (VLMs) excel at general understanding yet remain weak at dynamic spatial reasoning (DSR), i.e., reasoning about the evolvement of object g...

#research #paper #ai #computer-vision
