Training And Learning - Source-Grounded Overview
1. **data capture** - MotionMixApp logs 128D SAN input and output frames; 2. **model training** - an offline process that must produce weight artifacts and validation evidence.
Full Public Reader
Training And Learning - Source-Grounded Overview
The system currently has two separate things that must not be confused:
1. data capture - MotionMixApp logs 128D SAN input and output frames;
2. model training - an offline process that must produce weight artifacts and
validation evidence.
The code proves capture and runtime loading exist. It does not, by itself, prove
the current models are well trained.
Verified Capture Path
Source: `MotionMixApp/Services/SANTrajectoryLogger.swift`.
SANService.step(core:)
-> san_get_flat_input(...)
-> san_get_output(...)
-> SANTrajectoryLogger.logFrameDirect(...)
-> Documents/san-training/<session-id>.jsonl
-> optional HTTP NDJSON POST to http://[ip]:9471/san-frameFrame schema:
schema_version = 2
dims = 128
input = [128 floats]
outputs = audio, camera, phrase, pattern, gesture, phase, regime, latencyVerified Runtime Artifacts
Local SAN artifacts:
MotionMixApp/MotionMixApp/Resources/san_weights.bin
MotionMixApp/MotionMixApp/Resources/san_manifest.jsonCurrent inspected manifest:
entries: 76
parameters: 164,248
weights size: 656,992 bytesCoreML generation artifacts:
MotionMixApp/MotionMixApp/MLModels/ConditioningEncoder.mlpackage
MotionMixApp/MotionMixApp/MLModels/FlowGenerator1Step.mlpackage
MotionMixApp/MotionMixApp/Models/MotionToMusic.mlpackageBut `MASTER-TASKS.md` describes the CoreML generation models as shell/zero-init
placeholders at that checkpoint, so quality must be re-verified before claiming
trained generation.
What Must Be Proven Before Saying "The Model Learned"
For SAN:
- exact training data path;
- frame count;
- train/validation split;
- training script path;
- checkpoint path;
- exported `san_weights.bin` and `san_manifest.json`;
- parameter count;
- validation metrics;
- on-device load count;
- live non-flat output;
- `mixFactor` greater than zero in the behavior being evaluated.
For CoreML flow generation:
- exact training data path;
- proof that `ConditioningEncoder` is not a placeholder;
- proof that `FlowGenerator1Step` is not a placeholder;
- output token-grid sanity checks;
- comparison against rule-based fallback;
- latency on device.
For N'Ko motion inscription:
- ClaimBridge detections with source dynamics;
- Convex inscription records;
- confidence thresholds and rejected claims;
- mapping from claim sigil to rendered N'Ko line.
Current Learning Lanes
SAN capture lane
iPhone runtime -> 128D JSONL -> future SAN training
CoreML generation lane
dynamics -> ConditioningEncoder -> one-step flow -> token grid
N'Ko inscription lane
dynamics -> ClaimBridge -> controlled N'Ko claim record
AirDeck gesture lane
camera/body truth -> gesture detector -> K11 safety bridge -> RekordboxThese lanes can share data later, but they are not currently one trained unified
model.
Docs In This Section
| File | Purpose |
|---|---|
| [san-training-v5.md](san-training-v5.md) | Verified SAN artifacts and what old training claims need to prove |
| [data-capture.md](data-capture.md) | Current recording/logging structure and required session format |
| [v6-roadmap.md](v6-roadmap.md) | Source-grounded next training tasks, not speculative model claims |
| [karl-reward.md](karl-reward.md) | Needs separate source audit before treating as current implementation |
Promotion Decision
Promote into a technical note or architecture paper with implementation anchors.
Source Anchor
computational-choreography/05-training-and-learning/overview.md
Detected Structure
Method · Evaluation · Code Anchors · Architecture