[Paper] Predictive Safety Shield for Dyna-Q Reinforcement Learning
Obtaining safety guarantees for reinforcement learning is a major challenge to achieve applicability for real-world tasks. Safety shields extend standard reinfo...
Obtaining safety guarantees for reinforcement learning is a major challenge to achieve applicability for real-world tasks. Safety shields extend standard reinfo...
Alzheimer's disease is a debilitating disorder marked by a decline in cognitive function. Timely identification of the disease is essential for the development ...
A fundamental theoretical question in network analysis is to determine under which conditions community recovery is possible in polynomial time in the Stochasti...
Recent advances in foundation models have shown great promise in domains such as natural language processing and computer vision, and similar efforts are now em...
The key limitation of the verification performance lies in the ability of error detection. With this intuition we designed several variants of pessimistic verif...
Antinuclear antibody (ANA) testing is a crucial method for diagnosing autoimmune disorders, including lupus, Sjögren's syndrome, and scleroderma. Despite its im...
Unlike text, speech conveys information about the speaker, such as gender, through acoustic cues like pitch. This gives rise to modality-specific bias concerns....
Transformer-based models have become state-of-the-art tools in various machine learning tasks, including time series classification, yet their complexity makes ...
Deploying Transformer models on edge devices is limited by latency and energy budgets. While INT8 quantization effectively accelerates the primary matrix multip...
This study proposes Tool-RoCo, a novel benchmark for evaluating large language models (LLMs) in long-term multi-agent cooperation based on RoCo, a multi-robot c...
The effectiveness of deepfake detection methods often depends less on their core design and more on implementation details such as data preprocessing, augmentat...
We propose Cross-Attention-based Non-local Knowledge Distillation (CanKD), a novel feature-based knowledge distillation framework that leverages cross-attention...