Back to corpus
experimentexperiment writeup candidatescore 24

Edit-Candidate Oracle Report

| condition | CER | delta pp | changed | better/same/worse | |---|---:|---:|---:|---:| | baseline | 0.3514 | +0.00 | 0 | 0/0/0 | | oracle_any | 0.2951 | -5.63 | 1367 | 1367/0/0 | | oracle_preserve | 0.3234 | -2.80 | 669 | 669/0/0 | | acoustic_gate | 0.3497 | -0.18 | 213 | 119/52/42 | | acoustic_preserve_gate | 0.3496 | -0.18 | 194 | 113/47/34 | | acoustic_featural_preserve_gate | 0.3496 | -0.18 | 194 | 113/47/34 |

Full HTML reader

Read the full artifact

Open in new tab

Extracted abstract or opening context

Rows: 1381 Baseline CER: 0.3514 Rows with any bounded local-edit improvement: 1367/1381 (99.0%) | condition | CER | delta pp | changed | better/same/worse | |---|---:|---:|---:|---:| | baseline | 0.3514 | +0.00 | 0 | 0/0/0 | | oracle_any | 0.2951 | -5.63 | 1367 | 1367/0/0 | | oracle_preserve | 0.3234 | -2.80 | 669 | 669/0/0 | | acoustic_gate | 0.3497 | -0.18 | 213 | 119/52/42 | | acoustic_preserve_gate | 0.3496 | -0.18 | 194 | 113/47/34 | | acoustic_featural_preserve_gate | 0.3496 | -0.18 | 194 | 113/47/34 | Acoustic-selected improvements: 119/1381 (8.6%). Hit rate on rows where the candidate set had a true improvement: 8.7%. The oracle rows are not deployable because they use clean references to choose the best candidate. They measure ceiling only. The acoustic rows are the deployable selection test. Artifacts: `experiments/acoustic_gate/overnight/edit_candidate_oracle_v1/oracle_report.json`, `experiments/acoustic_gate/overnight/edit_candidate_oracle_v1/oracle_cases.jsonl`.

Promotion decision

What has to happen next

Attach run IDs, datasets, metrics, and reproduction commands.

Why this is not always a full paper yet

Corpus pages are public-safe readers for discovered workspace artifacts. They are not automatically final papers. A corpus item becomes a polished paper only after the editable source, evidence checkpoints, references, figures, render path, and release status are attached through the paper schema.