Back to corpus
proposalexperiment writeup candidatescore 24
Phrase Database Enhancement Guide
1. **Sub-segment** existing phrases into shorter, more expressive sub-phrases 2. **Analyze** your database to understand what you have 3. **Enhance** structure while staying CPU-efficient
Full HTML reader
Read the full artifact
Extracted abstract or opening context
You currently have **68 phrases** from **7 files** using **fixed segmentation**. This guide shows how to:
1. **Sub-segment** existing phrases into shorter, more expressive sub-phrases 2. **Analyze** your database to understand what you have 3. **Enhance** structure while staying CPU-efficient
- **Method**: Fixed segmentation (uniform chunks) - **Phrases**: 68 - **Source files**: 7 - **Average phrase length**: ~12-16 bars (estimated)
Apply a **second layer** of segmentation on top of your fixed segments to get more granular structure:
**What this does:** - Takes each existing 12-16 bar phrase - Further segments it into 2-8 bar sub-phrases - Uses lightweight, CPU-efficient methods - Preserves all original phrases - Creates new sub-phrases with `_sub1`, `_sub2` labels
Promotion decision
What has to happen next
Attach run IDs, datasets, metrics, and reproduction commands.
Why this is not always a full paper yet
Corpus pages are public-safe readers for discovered workspace artifacts. They are not automatically final papers. A corpus item becomes a polished paper only after the editable source, evidence checkpoints, references, figures, render path, and release status are attached through the paper schema.