research note | experiment writeup candidate | score 36

Pulse Plan: NKO Brain Scanner — arXiv Sprint

Workspace document requiring curation.

Extracted abstract or opening context

## Goal

Execute the Evo3 master plan to take the NKO Brain Scanner from experimental results to arXiv preprint in 10 days. Fix critical evaluation issues, extend the tokenizer, build production artifacts, and write the paper.

## Source

Evo3 master plan at `Desktop/evo-cube-output/nko-brain-scanner-frontier/stage3-expand-master-plan.md`

## Known Problems (from Evo3 stress test)

1. **English eval broken**: Only 4 examples (141 tokens). Every English PPL number is noise.
2. **MLX embedding resize uncharted**: No `resize_token_embeddings()` in MLX. Need manual weight surgery on quantized model.
3. **Architecture mismatch**: Brain scan on Qwen2-72B (80 layers), fine-tuning on Qwen3-8B (36 layers). Need 8B brain scan.
4. **English degradation**: Fine-tuning may have caused catastrophic forgetting (unclear due to broken eval).
5. **No production serving**: Adapters exist but no inference API deployed.
6. **No HuggingFace artifacts**: Model, tokenizer, dataset not published.

## Wave 0: Pre-Flight (Iteration 1) ✅ COMPLETE

- [x] Check Mac5 disk space (27GB free)
- [x] Verify adapter files exist (19.4MB each)
- [x] Check embedding dtype on Mac5: uint32 (quantized, shape 151936x1024)
- [x] Smoke-test cross-script bridge: Bridge class works
- [x] Verify GCS download status (169 uploaded)

## Wave 1: Fix English Eval + True Baselines (Iterations 2-3) ✅ COMPLETE

- [x] Created `eval/build_eval_set.py`: 100 English + 100 N'Ko examples, SHA-256 dedup
- [x] Created `eval/run_corrected_profiler.py`: all 3 stages on frozen eval
- [x] Uploaded to Mac5 and ran profiler
- [x] Corrected results: NKo acc 32.8%, Eng PPL 3.80, translation tax 0.70x (-76%)
- [x] Updated blog + docs, committed (793f540)
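
The SHA-256 dedup step mentioned for `eval/build_eval_set.py` could look roughly like the sketch below. This is not the actual script: the function names `text_fingerprint` and `dedup_examples` are hypothetical, and the normalization (Unicode NFC plus whitespace collapsing before hashing) is an assumption about what counts as a duplicate.

```python
import hashlib
import unicodedata


def text_fingerprint(text: str) -> str:
    """Return the SHA-256 hex digest of a normalized form of `text`."""
    # Assumption: NFC normalization and whitespace collapsing define "same example".
    normalized = " ".join(unicodedata.normalize("NFC", text).split())
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()


def dedup_examples(examples, limit=None):
    """Keep the first occurrence of each distinct example, preserving order.

    `limit` optionally caps the number of examples kept (e.g. 100 per language).
    """
    seen = set()
    unique = []
    for text in examples:
        digest = text_fingerprint(text)
        if digest in seen:
            continue  # exact duplicate after normalization
        seen.add(digest)
        unique.append(text)
        if limit is not None and len(unique) >= limit:
            break
    return unique
```

Under these assumptions, a frozen eval set would be built by calling `dedup_examples(candidates, limit=100)` once per language and writing the result to disk, so later profiler runs score the same examples.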

Promotion decision

What has to happen next

Attach run IDs, datasets, metrics, and reproduction commands.
