[Paper] Titans-as-a-Layer: Test-Time Memory for Conversational Speech Emotion Recognition

Published: 3 days ago (June 7, 2026 at 07:07 AM EDT)

2 min read

Source: arXiv

Source: arXiv - 2606.08573v1

Overview

Speech emotion recognition (SER) is commonly formulated as utterance-level classification, although conversational emotion depends on a speaker’s usual vocal range and the emotional context established by previous utterances. Speech-language models provide strong pretrained acoustic and semantic representations, and can adapts them to SER labels via finetune, but this mechanism still missing per-dialogue state. We study whether test-time neural memory can supply this missing context while leaving the large audio language models (LALMs) backbone intact. Building on Titans, we introduce a plug-and-play Memory-as-a-Layer (MAL) adapter that writes dialogue history into a small neural memory and reads it back as an audio-token-aligned residual update, avoiding changes to the host model’s token positions. Across different audio LLMs and emotion recognition datasets evaluations, our design improves SER performs across different evaluation metrics, supporting test-time memory as a residual contextual mechanism for conversational SER.

Key Contributions

This paper presents research in the following areas:

cs.LG
cs.CL

Methodology

Please refer to the full paper for detailed methodology.

Practical Implications

This research contributes to the advancement of cs.LG.

Authors

Daniel Chen
Qicong Hu
Yang Xiao
Ting Dang
Hong Jia

Paper Information

arXiv ID: 2606.08573v1
Categories: cs.LG, cs.CL
Published: June 7, 2026
PDF: Download PDF

[Paper] Titans-as-a-Layer: Test-Time Memory for Conversational Speech Emotion Recognition

Overview

Key Contributions

Methodology

Practical Implications

Authors

Paper Information

Related posts

[Paper] A Unifying Lens on Supervised Fine-Tuning Through Target Distribution Design

[Paper] Provenance-Grounded Gating and Adaptive Recovery in Synthetic Post-Training Data Curation

[Paper] TRACE: A Unified Rollout Budget Allocation Framework for Efficient Agentic Reinforcement Learning

[Paper] PhantomBench: Benchmarking the Non-existential Threat of Language Models