Back to corpus
experimentexperiment writeup candidatescore 32
Whisper Encoder / Feature Path Audit
**Date:** 2026-06-02 **Scope:** Determine whether the current workspace already contains a reusable CoreML Whisper encoder / ANE feature extraction path for the clean anchor ASR serving stack.
Full HTML reader
Read the full artifact
Extracted abstract or opening context
**Date:** 2026-06-02 **Scope:** Determine whether the current workspace already contains a reusable CoreML Whisper encoder / ANE feature extraction path for the clean anchor ASR serving stack.
No reusable Whisper encoder CoreML artifact was found locally for the clean anchor path, so a new exporter was added and validated.
Targeted searches under `[home-path]` and `/Volumes/HD1` found unrelated CoreML models such as MotionMix `ConditioningEncoder.mlpackage`, but no `Whisper*.mlmodel`, `Whisper*.mlmodelc`, `Whisper*.mlpackage`, or equivalent ASR encoder package.
- `ane_ctc_train.py` - `features/*.pt` - `pairs.jsonl` - MLX CTC-head checkpoints under `checkpoints/`
Despite the script title ("ANE+MLX CTC Training — Frozen Whisper encoder on ANE, CTC head on MLX GPU"), `ane_ctc_train.py` is a trainer over already-extracted feature tensors. It does not export or run a Whisper encoder. The training loop loads `.pt` tensors from `features/` and trains a small MLX CTC head.
Promotion decision
What has to happen next
Attach run IDs, datasets, metrics, and reproduction commands.
Why this is not always a full paper yet
Corpus pages are public-safe readers for discovered workspace artifacts. They are not automatically final papers. A corpus item becomes a polished paper only after the editable source, evidence checkpoints, references, figures, render path, and release status are attached through the paper schema.