Grand Diomande Research · Full HTML Reader

Training And Learning - Source-Grounded Overview

1. **data capture** - MotionMixApp logs 128D SAN input and output frames; 2. **model training** - an offline process that must produce weight artifacts and validation evidence.

Embodied Trajectory Systems architecture technical paper candidate score 46 .md

Full Public Reader

Training And Learning - Source-Grounded Overview

The system currently has two separate things that must not be confused:

1. data capture - MotionMixApp logs 128D SAN input and output frames;
2. model training - an offline process that must produce weight artifacts and
validation evidence.

The code proves capture and runtime loading exist. It does not, by itself, prove
the current models are well trained.

Verified Capture Path

Source: `MotionMixApp/Services/SANTrajectoryLogger.swift`.

text
SANService.step(core:)
  -> san_get_flat_input(...)
  -> san_get_output(...)
  -> SANTrajectoryLogger.logFrameDirect(...)
  -> Documents/san-training/<session-id>.jsonl
  -> optional HTTP NDJSON POST to http://[ip]:9471/san-frame

Frame schema:

text
schema_version = 2
dims = 128
input = [128 floats]
outputs = audio, camera, phrase, pattern, gesture, phase, regime, latency

Verified Runtime Artifacts

Local SAN artifacts:

text
MotionMixApp/MotionMixApp/Resources/san_weights.bin
MotionMixApp/MotionMixApp/Resources/san_manifest.json

Current inspected manifest:

text
entries: 76
parameters: 164,248
weights size: 656,992 bytes

CoreML generation artifacts:

text
MotionMixApp/MotionMixApp/MLModels/ConditioningEncoder.mlpackage
MotionMixApp/MotionMixApp/MLModels/FlowGenerator1Step.mlpackage
MotionMixApp/MotionMixApp/Models/MotionToMusic.mlpackage

But `MASTER-TASKS.md` describes the CoreML generation models as shell/zero-init
placeholders at that checkpoint, so quality must be re-verified before claiming
trained generation.

What Must Be Proven Before Saying "The Model Learned"

For SAN:

  • exact training data path;
  • frame count;
  • train/validation split;
  • training script path;
  • checkpoint path;
  • exported `san_weights.bin` and `san_manifest.json`;
  • parameter count;
  • validation metrics;
  • on-device load count;
  • live non-flat output;
  • `mixFactor` greater than zero in the behavior being evaluated.

For CoreML flow generation:

  • exact training data path;
  • proof that `ConditioningEncoder` is not a placeholder;
  • proof that `FlowGenerator1Step` is not a placeholder;
  • output token-grid sanity checks;
  • comparison against rule-based fallback;
  • latency on device.

For N'Ko motion inscription:

  • ClaimBridge detections with source dynamics;
  • Convex inscription records;
  • confidence thresholds and rejected claims;
  • mapping from claim sigil to rendered N'Ko line.

Current Learning Lanes

text
SAN capture lane
  iPhone runtime -> 128D JSONL -> future SAN training

CoreML generation lane
  dynamics -> ConditioningEncoder -> one-step flow -> token grid

N'Ko inscription lane
  dynamics -> ClaimBridge -> controlled N'Ko claim record

AirDeck gesture lane
  camera/body truth -> gesture detector -> K11 safety bridge -> Rekordbox

These lanes can share data later, but they are not currently one trained unified
model.

Docs In This Section

FilePurpose
[san-training-v5.md](san-training-v5.md)Verified SAN artifacts and what old training claims need to prove
[data-capture.md](data-capture.md)Current recording/logging structure and required session format
[v6-roadmap.md](v6-roadmap.md)Source-grounded next training tasks, not speculative model claims
[karl-reward.md](karl-reward.md)Needs separate source audit before treating as current implementation

Promotion Decision

Promote into a technical note or architecture paper with implementation anchors.

Source Anchor

computational-choreography/05-training-and-learning/overview.md

Detected Structure

Method · Evaluation · Code Anchors · Architecture