experiment, experiment writeup candidate, score 18

CognitiveTwin V3 Evaluation Report

| Score Type | Average |
|------------|---------|
| Policy Compliance | 1.00 |
| Format Adherence | 0.93 |
| Content Quality | 0.65 |


Extracted abstract or opening context

| Metric | Value |
|--------|-------|
| Total Tests | 14 |
| Passed | 13 |
| Failed | 1 |
| **Pass Rate** | **92.9%** |

| Score Type | Average |
|------------|---------|
| Policy Compliance | 1.00 |
| Format Adherence | 0.93 |
| Content Quality | 0.65 |

| Priority | Pass Rate |
|----------|-----------|
| Critical | 100.0% |
| High | 87.5% |

- ✓ **qp_001_clear_directive** (critical) - 0ms
- ✓ **qp_002_implementation** (critical) - 0ms
- ✓ **qp_003_no_option_dump** (high) - 0ms
- ✓ **qp_005_no_let_me_know** (high) - 0ms
- ✓ **fc_001_no_bullets** (high) - 0ms
- ✓ **fc_003_no_omit** (critical) - 0ms
- ✓ **om_001_preserve_all** (critical) - 0ms
- ✓ **om_002_no_placeholders** (high) - 0ms
- ✓ **ha_001_stop_asking** (critical) - 0ms
- ✓ **ha_002_full_content** (critical) - 0ms
- ✓ **ha_003_just_do_it** (high) - 0ms
- ✓ **ec_001_multi_requirement** (high) - 0ms
- ✓ **ec_004_long_code** (high) - 0ms
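The pass rates above can be cross-checked against the listed test results. The following is a minimal sketch, not the evaluation harness itself: the names `PASSED`, `FAILED_BY_PRIORITY`, and `pass_rate` are hypothetical, and the single failing test is assumed to be high priority. That assumption is inferred from the reported 87.5% figure, because the failing test does not appear in the extracted list.

```python
from collections import Counter

# (test_id, priority) pairs for the 13 passing tests listed in the report.
PASSED = [
    ("qp_001_clear_directive", "critical"),
    ("qp_002_implementation", "critical"),
    ("qp_003_no_option_dump", "high"),
    ("qp_005_no_let_me_know", "high"),
    ("fc_001_no_bullets", "high"),
    ("fc_003_no_omit", "critical"),
    ("om_001_preserve_all", "critical"),
    ("om_002_no_placeholders", "high"),
    ("ha_001_stop_asking", "critical"),
    ("ha_002_full_content", "critical"),
    ("ha_003_just_do_it", "high"),
    ("ec_001_multi_requirement", "high"),
    ("ec_004_long_code", "high"),
]

TOTAL_TESTS = 14  # taken from the summary table

# Assumption: the one failing test is high priority (inferred, not listed).
FAILED_BY_PRIORITY = {"critical": 0, "high": 1}


def pass_rate(passed: int, total: int) -> float:
    """Return the pass rate as a percentage rounded to one decimal place."""
    return round(100.0 * passed / total, 1)


# Count passing tests per priority, then add the assumed failures back in
# to recover the per-priority totals.
passed_by_priority = Counter(priority for _, priority in PASSED)
overall = pass_rate(len(PASSED), TOTAL_TESTS)
by_priority = {
    priority: pass_rate(count, count + FAILED_BY_PRIORITY[priority])
    for priority, count in passed_by_priority.items()
}

print(f"overall: {overall}%")
for priority, rate in by_priority.items():
    print(f"{priority}: {rate}%")
```

Under that assumption the counts are 6 of 6 critical and 7 of 8 high, which matches the reported totals of 13 passed out of 14.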

Promotion decision

What has to happen next

Attach run IDs, datasets, metrics, and reproduction commands.

Why this is not always a full paper yet

Corpus pages are public-safe readers for discovered workspace artifacts. They are not automatically final papers. A corpus item becomes a polished paper only after the editable source, evidence checkpoints, references, figures, render path, and release status are attached through the paper schema.