research note | experiment writeup candidate | score 36
Pulse Plan: NKO Brain Scanner — arXiv Sprint
Workspace document requiring curation.
## Goal

Execute the Evo3 master plan to take the NKO Brain Scanner from experimental results to arXiv preprint in 10 days. Fix critical evaluation issues, extend the tokenizer, build production artifacts, and write the paper.

## Source

Evo3 master plan at `Desktop/evo-cube-output/nko-brain-scanner-frontier/stage3-expand-master-plan.md`
## Known Problems (from Evo3 stress test)

1. **English eval broken**: Only 4 examples (141 tokens). Every English PPL number is noise.
2. **MLX embedding resize uncharted**: No `resize_token_embeddings()` in MLX. Need manual weight surgery on quantized model.
3. **Architecture mismatch**: Brain scan on Qwen2-72B (80 layers), fine-tuning on Qwen3-8B (36 layers). Need 8B brain scan.
4. **English degradation**: Fine-tuning may have caused catastrophic forgetting (unclear due to broken eval).
5. **No production serving**: Adapters exist but no inference API deployed.
6. **No HuggingFace artifacts**: Model, tokenizer, dataset not published.
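To illustrate why problem 1 makes the English numbers unreliable, here is a minimal sketch of corpus-level perplexity with a rough uncertainty estimate. It assumes per-token natural-log probabilities are already available. The function names are hypothetical and are not taken from the project's profiler. The standard error uses a delta-method approximation that treats tokens as independent, which is optimistic for real text.

```python
import math


def perplexity(token_logprobs):
    """Corpus-level perplexity: exp of the mean negative log-likelihood."""
    if not token_logprobs:
        raise ValueError("need at least one token log-probability")
    mean_nll = -sum(token_logprobs) / len(token_logprobs)
    return math.exp(mean_nll)


def perplexity_stderr(token_logprobs):
    """Approximate standard error of the perplexity (delta method).

    Assumption: tokens are treated as independent samples, so this
    understates the true uncertainty on correlated text.
    """
    n = len(token_logprobs)
    if n < 2:
        raise ValueError("need at least two tokens to estimate variance")
    mean = sum(token_logprobs) / n
    var = sum((x - mean) ** 2 for x in token_logprobs) / (n - 1)
    se_nll = math.sqrt(var / n)  # standard error of the mean NLL
    return perplexity(token_logprobs) * se_nll
```

Because the standard error shrinks only as the square root of the token count, an eval set of 141 tokens leaves a wide band around any reported PPL, and comparisons between training stages on that set are hard to interpret.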
## Wave 0: Pre-Flight (Iteration 1) ✅ COMPLETE

- [x] Check Mac5 disk space (27GB free)
- [x] Verify adapter files exist (19.4MB each)
- [x] Check embedding dtype on Mac5: uint32 (quantized, shape 151936x1024)
- [x] Smoke-test cross-script bridge: Bridge class works
- [x] Verify GCS download status (169 uploaded)
## Wave 1: Fix English Eval + True Baselines (Iterations 2-3) ✅ COMPLETE

- [x] Created `eval/build_eval_set.py`: 100 English + 100 N'Ko examples, SHA-256 dedup
- [x] Created `eval/run_corrected_profiler.py`: all 3 stages on frozen eval
- [x] Uploaded to Mac5 and ran profiler
- [x] Corrected results: N'Ko acc 32.8%, Eng PPL 3.80, translation tax 0.70x (-76%)
- [x] Updated blog + docs, committed (793f540)
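The SHA-256 dedup step mentioned for `eval/build_eval_set.py` could look roughly like the sketch below. The actual script is not shown here, so the function names and the normalization choices (Unicode NFC plus whitespace stripping) are assumptions made for illustration only.

```python
import hashlib
import unicodedata


def text_key(text):
    """Return a SHA-256 hex digest of the normalized text.

    Assumption: NFC normalization and stripping are applied so that
    visually identical strings hash to the same key.
    """
    normalized = unicodedata.normalize("NFC", text).strip()
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()


def dedup(examples):
    """Keep the first occurrence of each distinct example, preserving order."""
    seen = set()
    unique = []
    for text in examples:
        key = text_key(text)
        if key not in seen:
            seen.add(key)
            unique.append(text)
    return unique
```

NFC normalization is worth considering for N'Ko text in particular, since combining diacritics can be encoded in more than one way and would otherwise let near-identical examples slip past an exact-hash check.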
Promotion decision
What has to happen next
Attach run IDs, datasets, metrics, and reproduction commands.
Why this is not always a full paper yet
Corpus pages are public-safe readers for discovered workspace artifacts. They are not automatically final papers. A corpus item becomes a polished paper only after the editable source, evidence checkpoints, references, figures, render path, and release status are attached through the paper schema.