research — Page 121

3 months ago · ai · - · -

[Paper] QL-LSTM: A Parameter-Efficient LSTM for Stable Long-Sequence Modeling

Recurrent neural architectures such as LSTM and GRU remain widely used in sequence modeling, but they continue to face two core limitations: redundant gate-spec...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Enhancing Retrieval-Augmented Generation with Entity Linking for Educational Platforms

In the era of Large Language Models (LLMs), Retrieval-Augmented Generation (RAG) architectures are gaining significant attention for their ability to ground lan...

#research #paper #ai #machine-learning #nlp
3 months ago · ai · - · -

[Paper] EditThinker: Unlocking Iterative Reasoning for Any Image Editor

Instruction-based image editing has emerged as a prominent research area, which, benefiting from image generation foundation models, has achieved high aestheti...

#research #paper #ai #computer-vision
3 months ago · ai · - · -

[Paper] Training-Time Action Conditioning for Efficient Real-Time Chunking

Real-time chunking (RTC) enables vision-language-action models (VLAs) to generate smooth, reactive robot trajectories by asynchronously predicting action chunks...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Whatever Remains Must Be True: Filtering Drives Reasoning in LLMs, Shaping Diversity

Reinforcement Learning (RL) has become the de facto standard for tuning LLMs to solve tasks involving reasoning. However, growing evidence shows that models tra...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] AQUA-Net: Adaptive Frequency Fusion and Illumination Aware Network for Underwater Image Enhancement

Underwater images often suffer from severe color distortion, low contrast, and a hazy appearance due to wavelength-dependent light absorption and scattering. Si...

#research #paper #ai #machine-learning #computer-vision
3 months ago · ai · - · -

[Paper] M4-RAG: A Massive-Scale Multilingual Multi-Cultural Multimodal RAG

Vision-language models (VLMs) have achieved strong performance in visual question answering (VQA), yet they remain constrained by static training data. Retrieva...

#research #paper #ai #machine-learning #nlp #computer-vision
3 months ago · ai · - · -

[Paper] MaxShapley: Towards Incentive-compatible Generative Search with Fair Context Attribution

Generative search engines based on large language models (LLMs) are replacing traditional search, fundamentally changing how information providers are compensat...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Consequences of Kernel Regularity for Bandit Optimization

In this work we investigate the relationship between kernel regularity and algorithmic performance in the bandit optimization of RKHS functions. While reproduci...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] SIMPACT: Simulation-Enabled Action Planning using Vision-Language Models

Vision-Language Models (VLMs) exhibit remarkable common-sense and semantic reasoning capabilities. However, they lack a grounded understanding of physical dynam...

#research #paper #ai #computer-vision
3 months ago · ai · - · -

[Paper] SymPyBench: A Dynamic Benchmark for Scientific Reasoning with Executable Python Code

We introduce SymPyBench, a large-scale synthetic benchmark of 15,045 university-level physics problems (90/10% train/test split). Each problem is fully parameterized, supp...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Trusted AI Agents in the Cloud

AI agents powered by large language models are increasingly deployed as cloud services that autonomously access sensitive data, invoke external tools, and inter...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Impugan: Learning Conditional Generative Models for Robust Data Imputation

Incomplete data are common in real-world applications. Sensors fail, records are inconsistent, and datasets collected from different sources often differ in sca...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Developing synthetic microdata through machine learning for firm-level business surveys

Public-use microdata samples (PUMS) from the United States (US) Census Bureau on individuals have been available for decades. However, large increases in comput...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Variational Quantum Rainbow Deep Q-Network for Optimizing Resource Allocation Problem

Resource allocation remains NP-hard due to combinatorial complexity. While deep reinforcement learning (DRL) methods, such as the Rainbow Deep Q-Network (DQN), ...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Zoom in, Click out: Unlocking and Evaluating the Potential of Zooming for GUI Grounding

Grounding is a fundamental capability for building graphical user interface (GUI) agents. Although existing approaches rely on large-scale bounding box supervis...

#research #paper #ai #machine-learning #nlp #computer-vision
3 months ago · ai · - · -

[Paper] Designing an Optimal Sensor Network via Minimizing Information Loss

Optimal experimental design is a classic topic in statistics, with many well-studied problems, applications, and solutions. The design problem we study is the p...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Measuring the Effect of Background on Classification and Feature Importance in Deep Learning for AV Perception

Common approaches to explainable AI (XAI) for deep learning focus on analyzing the importance of input features on the classification task in a given model: sal...

#research #paper #ai #machine-learning #computer-vision
3 months ago · ai · - · -

[Paper] Synset Signset Germany: a Synthetic Dataset for German Traffic Sign Recognition

In this paper, we present a synthesis pipeline and dataset for training and testing data in the task of traffic sign recognition that combines the advantages of d...

#research #paper #ai #computer-vision
3 months ago · ai · - · -

[Paper] Physically-Based Simulation of Automotive LiDAR

We present an analytic model for simulating automotive time-of-flight (ToF) LiDAR that includes blooming, echo pulse width, and ambient light, along with steps ...

#research #paper #ai #computer-vision
3 months ago · ai · - · -

[Paper] On the Bayes Inconsistency of Disagreement Discrepancy Surrogates

Deep neural networks often fail when deployed in real-world contexts due to distribution shift, a critical barrier to building safe and reliable systems. An eme...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] A Comparative Study on Synthetic Facial Data Generation Techniques for Face Recognition

Facial recognition has become a widely used method for authentication and identification, with applications for secure access and locating missing persons. Its ...

#research #paper #ai #computer-vision
3 months ago · ai · - · -

[Paper] World Models That Know When They Don't Know: Controllable Video Generation with Calibrated Uncertainty

Recent advances in generative video models have led to significant breakthroughs in high-fidelity video synthesis, specifically in controllable video generation...

#research #paper #ai #machine-learning #computer-vision
3 months ago · ai · - · -

[Paper] BalLOT: Balanced $k$-means clustering with optimal transport

We consider the fundamental problem of balanced k-means clustering. In particular, we introduce an optimal transport approach to alternating minimization called...

#research #paper #ai #machine-learning
