Back to corpus
experimentexperiment writeup candidatescore 32

Whisper Encoder / Feature Path Audit

**Date:** 2026-06-02 **Scope:** Determine whether the current workspace already contains a reusable CoreML Whisper encoder / ANE feature extraction path for the clean anchor ASR serving stack.

Full HTML reader

Read the full artifact

Open in new tab

Extracted abstract or opening context

**Date:** 2026-06-02 **Scope:** Determine whether the current workspace already contains a reusable CoreML Whisper encoder / ANE feature extraction path for the clean anchor ASR serving stack. No reusable Whisper encoder CoreML artifact was found locally for the clean anchor path, so a new exporter was added and validated. Targeted searches under `[home-path]` and `/Volumes/HD1` found unrelated CoreML models such as MotionMix `ConditioningEncoder.mlpackage`, but no `Whisper*.mlmodel`, `Whisper*.mlmodelc`, `Whisper*.mlpackage`, or equivalent ASR encoder package. - `ane_ctc_train.py` - `features/*.pt` - `pairs.jsonl` - MLX CTC-head checkpoints under `checkpoints/` Despite the script title ("ANE+MLX CTC Training — Frozen Whisper encoder on ANE, CTC head on MLX GPU"), `ane_ctc_train.py` is a trainer over already-extracted feature tensors. It does not export or run a Whisper encoder. The training loop loads `.pt` tensors from `features/` and trains a small MLX CTC head.

Promotion decision

What has to happen next

Attach run IDs, datasets, metrics, and reproduction commands.

Why this is not always a full paper yet

Corpus pages are public-safe readers for discovered workspace artifacts. They are not automatically final papers. A corpus item becomes a polished paper only after the editable source, evidence checkpoints, references, figures, render path, and release status are attached through the paper schema.