paper — Page 129

1 month ago · ai

[Paper] QL-LSTM: A Parameter-Efficient LSTM for Stable Long-Sequence Modeling

Recurrent neural architectures such as LSTM and GRU remain widely used in sequence modeling, but they continue to face two core limitations: redundant gate-spec...

#research #paper #ai #machine-learning
1 month ago · ai

[Paper] Enhancing Retrieval-Augmented Generation with Entity Linking for Educational Platforms

In the era of Large Language Models (LLMs), Retrieval-Augmented Generation (RAG) architectures are gaining significant attention for their ability to ground lan...

#research #paper #ai #machine-learning #nlp
1 month ago · ai

[Paper] EditThinker: Unlocking Iterative Reasoning for Any Image Editor

Instruction-based image editing has emerged as a prominent research area, which, benefiting from image generation foundation models, have achieved high aestheti...

#research #paper #ai #computer-vision
1 month ago · ai

[Paper] Training-Time Action Conditioning for Efficient Real-Time Chunking

Real-time chunking (RTC) enables vision-language-action models (VLAs) to generate smooth, reactive robot trajectories by asynchronously predicting action chunks...

#research #paper #ai #machine-learning
1 month ago · ai

[Paper] Whatever Remains Must Be True: Filtering Drives Reasoning in LLMs, Shaping Diversity

Reinforcement Learning (RL) has become the de facto standard for tuning LLMs to solve tasks involving reasoning. However, growing evidence shows that models tra...

#research #paper #ai #machine-learning
1 month ago · ai

[Paper] AQUA-Net: Adaptive Frequency Fusion and Illumination Aware Network for Underwater Image Enhancement

Underwater images often suffer from severe color distortion, low contrast, and a hazy appearance due to wavelength-dependent light absorption and scattering. Si...

#research #paper #ai #machine-learning #computer-vision
1 month ago · ai

[Paper] M4-RAG: A Massive-Scale Multilingual Multi-Cultural Multimodal RAG

Vision-language models (VLMs) have achieved strong performance in visual question answering (VQA), yet they remain constrained by static training data. Retrieva...

#research #paper #ai #machine-learning #nlp #computer-vision
1 month ago · ai

[Paper] MaxShapley: Towards Incentive-compatible Generative Search with Fair Context Attribution

Generative search engines based on large language models (LLMs) are replacing traditional search, fundamentally changing how information providers are compensat...

#research #paper #ai #machine-learning
1 month ago · ai

[Paper] Consequences of Kernel Regularity for Bandit Optimization

In this work we investigate the relationship between kernel regularity and algorithmic performance in the bandit optimization of RKHS functions. While reproduci...

#research #paper #ai #machine-learning
1 month ago · ai

[Paper] SIMPACT: Simulation-Enabled Action Planning using Vision-Language Models

Vision-Language Models (VLMs) exhibit remarkable common-sense and semantic reasoning capabilities. However, they lack a grounded understanding of physical dynam...

#research #paper #ai #computer-vision
1 month ago · ai

[Paper] SymPyBench: A Dynamic Benchmark for Scientific Reasoning with Executable Python Code

We introduce, a large-scale synthetic benchmark of 15,045 university-level physics problems (90/10% train/test split). Each problem is fully parameterized, supp...

#research #paper #ai #machine-learning
1 month ago · ai

[Paper] Trusted AI Agents in the Cloud

AI agents powered by large language models are increasingly deployed as cloud services that autonomously access sensitive data, invoke external tools, and inter...

#research #paper #ai #machine-learning

Newer posts

Older posts