Back to corpus
research noteexperiment writeup candidatescore 18
Paper 5: Compositional Generalization and Speaker Adaptation in Script-Aware ASR
Workspace document requiring curation.
Full HTML reader
Read the full artifact
Extracted abstract or opening context
# Paper 5: Compositional Generalization and Speaker Adaptation in Script-Aware ASR
## Thesis N'Ko's phonetic transparency advantage (Paper 4) extends beyond controlled CER comparison: N'Ko generalizes better to unseen vocabulary (Exp F), enables zero-shot vocabulary expansion via graph update (Exp H), and adapts faster to new speakers through test-time training (Exp G). Together, these three experiments demonstrate that phonetically transparent scripts produce ASR systems with superior operational lifetime characteristics.
## Core Argument Paper 4 showed that trajectory-biased CTC gives N'Ko -5.25pp CER advantage over Latin at 297K scale. Paper 5 asks: does this advantage persist in deployment scenarios where the model encounters words and speakers it never saw in training?
## Section 1: Introduction - ASR systems degrade in deployment: new vocabulary, new speakers, domain shift - For under-resourced scripts like N'Ko, these problems are amplified (no large-scale fine-tuning data available) - We test three deployment scenarios: compositional generalization, vocabulary expansion, speaker adaptation - All experiments use the same 297K-pair controlled setup from Paper 4
## Section 2: Related Work - Compositional generalization in ASR (cite subword approaches, BPE vs character-level) - Zero-shot vocabulary expansion (cite KG-augmented ASR, semantic priming) - Test-time training / adaptation (cite TTT, few-shot speaker adaptation, meta-learning for ASR) - Script-dependent effects in multilingual ASR (cite Whisper, MMS)
Promotion decision
What has to happen next
Attach run IDs, datasets, metrics, and reproduction commands.
Why this is not always a full paper yet
Corpus pages are public-safe readers for discovered workspace artifacts. They are not automatically final papers. A corpus item becomes a polished paper only after the editable source, evidence checkpoints, references, figures, render path, and release status are attached through the paper schema.