Grand Diomande Research · Full HTML Reader

Unified RAG++ Architecture

1. [System Overview](#1-system-overview) 2. [Layer Architecture](#2-layer-architecture) 3. [Foundation Layer: Rust Core](#3-foundation-layer-rust-core) 4. [Data Layer: Supabase Schema](#4-data-layer-supabase-schema) 5. [Ingestion Layer: Prompt Pipeline](#5-ingestion-layer-prompt-pipeline) 6. [ML Layer: CognitiveTwin](#6-ml-layer-cognitivetwin) 7. [Orchestration Layer: Orbit](#7-orchestration-layer-orbit) 8. [Integration Layer: Prompt Logger](#8-integration-layer-prompt-logger) 9. [API Layer: Endpoints Reference](#9

Agents That Account for Themselves architecture technical paper candidate score 62 .md

Full Public Reader

Unified RAG++ Architecture

Complete System Reference

Version: 2.0.0
Last Updated: December 2025
Status: Production

---

1. [System Overview](#1-system-overview)
2. [Layer Architecture](#2-layer-architecture)
3. [Foundation Layer: Rust Core](#3-foundation-layer-rust-core)
4. [Data Layer: Supabase Schema](#4-data-layer-supabase-schema)
5. [Ingestion Layer: Prompt Pipeline](#5-ingestion-layer-prompt-pipeline)
6. [ML Layer: CognitiveTwin](#6-ml-layer-cognitivetwin)
7. [Orchestration Layer: Orbit](#7-orchestration-layer-orbit)
8. [Integration Layer: Prompt Logger](#8-integration-layer-prompt-logger)
9. [API Layer: Endpoints Reference](#9-api-layer-endpoints-reference)
10. [Data Flow Diagrams](#10-data-flow-diagrams)
11. [Configuration Reference](#11-configuration-reference)
12. [Component Interfaces](#12-component-interfaces)

---

1. System Overview

1.1 Purpose

The Unified RAG++ system creates a 5D trajectory knowledge fabric that:

Captures every AI interaction (Claude, ChatGPT, Cursor) as unified `memory_turns`
Computes trajectory coordinates using the Rust core (HNSWIndex + IRCPPropagator)
Trains a CognitiveTwin to learn user reasoning patterns and style
Enables trajectory-aware retrieval for contextual generation

1.2 Core Principles

Principle	Implementation
Unified Data	All prompts become `memory_turns` - single source of truth
Rust Performance	Trajectory computation via PyO3 bindings
Orbit Orchestration	Full control over training, context, and fabric operations
Global Style	One evolving signature across all projects

1.3 System Components

┌─────────────────────────────────────────────────────────────────────────────┐
│                          UNIFIED RAG++ ARCHITECTURE                          │
├─────────────────────────────────────────────────────────────────────────────┤
│                                                                              │
│  ┌──────────────┐  ┌──────────────┐  ┌──────────────┐  ┌──────────────┐    │
│  │   Claude     │  │   Cursor     │  │   ChatGPT    │  │    Speak     │    │
│  │    Code      │  │    Agent     │  │    Export    │  │    Voice     │    │
│  └──────┬───────┘  └──────┬───────┘  └──────┬───────┘  └──────┬───────┘    │
│         │                 │                 │                 │             │
│         └────────────┬────┴────────────┬────┴────────────┬────┘             │
│                      │                 │                 │                  │
│                      ▼                 ▼                 ▼                  │
│  ┌───────────────────────────────────────────────────────────────────┐     │
│  │                       PROMPT LOGGER                                │     │
│  │  ┌─────────────┐  ┌─────────────┐  ┌─────────────┐               │     │
│  │  │ main.py     │  │response_hook│  │ orbit_sync  │               │     │
│  │  │ (capture)   │  │ (response)  │  │ (unified)   │               │     │
│  │  └─────────────┘  └─────────────┘  └──────┬──────┘               │     │
│  └───────────────────────────────────────────┼───────────────────────┘     │
│                                              │                              │
│                                              ▼                              │
│  ┌───────────────────────────────────────────────────────────────────┐     │
│  │                         ORBIT SERVER                              │     │
│  │  ┌─────────────┐  ┌─────────────┐  ┌─────────────┐               │     │
│  │  │ /api/rag/   │  │ Project     │  │ Session     │               │     │
│  │  │ ingest      │  │ Manager     │  │ Handler     │               │     │
│  │  └──────┬──────┘  └─────────────┘  └─────────────┘               │     │
│  └─────────┼─────────────────────────────────────────────────────────┘     │
│            │                                                                │
│            ▼                                                                │
│  ┌───────────────────────────────────────────────────────────────────┐     │
│  │                       PROMPT INGESTER                             │     │
│  │  ┌─────────────┐  ┌─────────────┐  ┌─────────────┐               │     │
│  │  │ Embedding   │  │ Trajectory  │  │ Supabase    │               │     │
│  │  │ (Gemini)    │  │ Compute     │  │ Insert      │               │     │
│  │  └──────┬──────┘  └──────┬──────┘  └──────┬──────┘               │     │
│  └─────────┼────────────────┼────────────────┼───────────────────────┘     │
│            │                │                │                              │
│            │                ▼                │                              │
│  ┌─────────┼────────────────────────────────┼───────────────────────┐     │
│  │         │          RUST CORE             │                        │     │
│  │  ┌──────┴──────┐  ┌─────────────┐  ┌─────┴─────┐                 │     │
│  │  │ HNSWIndex   │  │    IRCP     │  │ Trajectory │                 │     │
│  │  │ (neighbors) │  │ Propagator  │  │ Coord 5D   │                 │     │
│  │  └─────────────┘  └─────────────┘  └───────────┘                 │     │
│  └───────────────────────────────────────────────────────────────────┘     │
│            │                │                │                              │
│            └────────────────┴────────────────┘                              │
│                             │                                               │
│                             ▼                                               │
│  ┌───────────────────────────────────────────────────────────────────┐     │
│  │                         SUPABASE                                  │     │
│  │  ┌─────────────┐  ┌─────────────┐  ┌─────────────┐               │     │
│  │  │ memory_     │  │ memory_     │  │ claude_     │               │     │
│  │  │ turns       │  │ convos      │  │ prompts     │               │     │
│  │  └─────────────┘  └─────────────┘  └─────────────┘               │     │
│  └───────────────────────────────────────────────────────────────────┘     │
│                             │                                               │
│                             ▼                                               │
│  ┌───────────────────────────────────────────────────────────────────┐     │
│  │                    COGNITIVETWIN                                  │     │
│  │  ┌─────────────┐  ┌─────────────┐  ┌─────────────┐               │     │
│  │  │   Hybrid    │  │   Global    │  │   Pattern   │               │     │
│  │  │   Trainer   │  │  Signature  │  │   Memory    │               │     │
│  │  └─────────────┘  └─────────────┘  └─────────────┘               │     │
│  └───────────────────────────────────────────────────────────────────┘     │
│                                                                              │
└─────────────────────────────────────────────────────────────────────────────┘

---

2. Layer Architecture

2.1 Layer Stack (Bottom to Top)

Layer	Components	Responsibility
L0: Foundation	Rust Core, PyO3 Bindings	High-performance vector ops, trajectory computation
L1: Data	Supabase, PostgreSQL, pgvector	Persistent storage, vector search
L2: Ingestion	PromptIngester, Embedder	Transform prompts → memory_turns
L3: ML	CognitiveTwin, HybridTrainer	Learn user patterns, style extraction
L4: Orchestration	Orbit Server	Project management, training triggers
L5: Integration	Prompt Logger, MCP Server	Capture from AI tools, expose to agents

2.2 Dependency Graph

L5: Integration ─────┐
        │            │
        ▼            │
L4: Orchestration ───┤
        │            │
        ▼            │
L3: ML ──────────────┤
        │            │
        ▼            │
L2: Ingestion ───────┤
        │            │
        ▼            │
L1: Data ────────────┤
        │            │
        ▼            │
L0: Foundation ◀─────┘

---

3. Foundation Layer: Rust Core

3.1 Overview

The Rust core provides high-performance implementations of:

HNSWIndex: Hierarchical Navigable Small World graph for approximate nearest neighbor search
IRCPPropagator: Inverse Reasoning Coordinate Propagator for attention computation
TrajectoryCoordinate5D: 5-dimensional coordinate system for conversation topology

3.2 PyO3 Bindings

Location: `crates/python/src/lib.rs`

3.2.1 HNSWIndex

python

from _rag_plusplus_rs import HNSWIndex

index = HNSWIndex()

# Add vectors
index.add("turn-uuid-123", embedding_768d)

# Search neighbors
results = index.search(query_embedding, k=10)
# Returns: List[SearchResult] with id, score

3.2.2 IRCPPropagator

python

from _rag_plusplus_rs import IRCPPropagator, IRCPConfig

config = IRCPConfig(
    forward_weight=0.4,
    inverse_weight=0.3,
    cross_weight=0.3,
)
propagator = IRCPPropagator(config)

# Compute attention weights
weights = propagator.compute_attention(
    query_coord,      # PyTrajectoryCoordinate5D
    context_coords,   # List[PyTrajectoryCoordinate5D]
    query_embedding,  # numpy array
    context_embeddings,  # List[numpy array]
)

3.2.3 compute_trajectory_coordinates

python

from _rag_plusplus_rs import compute_trajectory_coordinates
import numpy as np

coord = compute_trajectory_coordinates(
    embedding=np.array([...], dtype=np.float32),  # 768-dim
    neighbor_similarities=[0.85, 0.72, 0.68],     # from HNSW search
    session_depth=5,                               # depth in session
    turn_index=12,                                 # position in conversation
    sibling_count=1,                               # branches at this depth
    sibling_order=0,                               # position among siblings
    project_phase="implementation",                # phase string
)
# Returns: PyTrajectoryCoordinate5D

3.3 TrajectoryCoordinate5D

Dimension	Symbol	Range	Meaning
Depth	x	0-1	Normalized conversation depth (0=root, 1=deep)
Sibling Order	y	0-1	Position among branches (0=first, 1=last)
Homogeneity	z	0-1	Semantic similarity to neighbors (0=diverse, 1=focused)
Temporal	t	0-1	Normalized time position (0=early, 1=recent)
Complexity	n	0-1	Content complexity (0=simple, 1=complex)

3.3.1 Phase Enumeration

rust

pub enum TrajectoryPhase {
    Exploration,   // Initial topic discovery
    Refinement,    // Narrowing focus, planning
    Application,   // Implementation, execution
    Resolution,    // Debugging, review, completion
}

3.4 Rust Module Structure

crates/
├── core/
│   └── src/
│       ├── lib.rs              # Core exports
│       ├── index/
│       │   ├── hnsw.rs         # HNSW implementation
│       │   └── flat.rs         # Flat index fallback
│       └── trajectory/
│           ├── mod.rs          # Trajectory exports
│           ├── coordinate.rs   # 5D coordinate types
│           ├── ircp.rs         # IRCP propagator
│           └── phase.rs        # Phase inference
└── python/
    └── src/
        └── lib.rs              # PyO3 bindings

---

4. Data Layer: Supabase Schema

4.1 Core Tables

4.1.1 memory_conversations

sql

CREATE TABLE memory_conversations (
    id UUID PRIMARY KEY DEFAULT gen_random_uuid(),
    title TEXT,
    source TEXT NOT NULL,  -- 'chatgpt', 'claude', 'orbit'
    created_at TIMESTAMPTZ DEFAULT NOW(),
    updated_at TIMESTAMPTZ DEFAULT NOW(),
    metadata JSONB DEFAULT '{}'::jsonb
);

4.1.2 memory_turns (The Unified Knowledge Fabric)

sql

CREATE TABLE memory_turns (
    id UUID PRIMARY KEY DEFAULT gen_random_uuid(),
    conversation_id UUID REFERENCES memory_conversations(id),
    content TEXT NOT NULL,
    role TEXT NOT NULL,  -- 'user', 'assistant', 'system'
    turn_index INTEGER NOT NULL,

    -- 768-dim embedding (Gemini)
    embedding vector(768),

    -- 5D Trajectory Coordinates
    trajectory_depth FLOAT,           -- x: depth (0-1)
    trajectory_sibling_order FLOAT,   -- y: sibling position (0-1)
    trajectory_homogeneity FLOAT,     -- z: semantic focus (0-1)
    trajectory_temporal FLOAT,        -- t: time position (0-1)
    trajectory_complexity FLOAT,      -- n: complexity (0-1)
    trajectory_phase TEXT,            -- phase enum string

    -- Salience and importance
    salience_score FLOAT DEFAULT 0.5,

    -- Metadata
    metadata JSONB DEFAULT '{}'::jsonb,
    created_at TIMESTAMPTZ DEFAULT NOW(),

    -- Indexes
    CONSTRAINT fk_conversation FOREIGN KEY (conversation_id)
        REFERENCES memory_conversations(id) ON DELETE CASCADE
);

-- Vector similarity index
CREATE INDEX idx_turns_embedding ON memory_turns
    USING ivfflat (embedding vector_cosine_ops) WITH (lists = 100);

-- Trajectory coordinate indexes
CREATE INDEX idx_turns_trajectory ON memory_turns (
    trajectory_depth, trajectory_temporal, trajectory_phase
);

4.1.3 claude_prompts (Prompt Logger Source)

sql

CREATE TABLE claude_prompts (
    id UUID PRIMARY KEY DEFAULT gen_random_uuid(),
    prompt_id TEXT UNIQUE NOT NULL,
    session_id TEXT NOT NULL,
    prompt_text TEXT NOT NULL,
    prompt_text_length INTEGER,
    source TEXT DEFAULT 'claude',

    -- Timing
    prompt_received_at TIMESTAMPTZ,
    response_complete_at TIMESTAMPTZ,
    total_duration_ms FLOAT,

    -- Git context
    git_repo TEXT,
    git_branch TEXT,
    git_commit_before TEXT,

    -- Link to unified memory_turns (if ingested)
    memory_turn_id UUID REFERENCES memory_turns(id),

    metadata JSONB DEFAULT '{}'::jsonb,
    created_at TIMESTAMPTZ DEFAULT NOW()
);

4.2 Indexes and Performance

sql

-- Full-text search on content
CREATE INDEX idx_turns_content_fts ON memory_turns
    USING gin(to_tsvector('english', content));

-- Metadata JSONB indexing
CREATE INDEX idx_turns_metadata ON memory_turns USING gin(metadata);

-- Source filtering
CREATE INDEX idx_turns_source ON memory_turns ((metadata->>'source'));

-- Orbit project filtering
CREATE INDEX idx_turns_orbit_project ON memory_turns
    ((metadata->>'orbit_project_id'));

4.3 Metadata Schema

jsonc

// memory_turns.metadata for prompts ingested via Orbit
{
    "source": "claude_prompt",
    "prompt_id": "session123-20241229123456-abc12345",
    "orbit_project_id": "uuid-of-project",
    "orbit_session_id": "uuid-of-session",
    "project_name": "cc-rag-plus-plus",
    "working_directory": "/Users/user/project",
    "turn_index": 5
}

---

5. Ingestion Layer: Prompt Pipeline

5.1 PromptIngester

Location: `rag_plusplus/ingestion/prompt_ingester.py`

5.1.1 Class Definition

python

class PromptIngester:
    """Ingests Claude prompts into unified memory_turns table."""

    def __init__(
        self,
        supabase_url: Optional[str] = None,
        supabase_key: Optional[str] = None,
        embedding_fn: Optional[Callable[[str], List[float]]] = None,
        hybrid_trainer: Optional[HybridCognitiveTwinTrainer] = None,
        enable_incremental_learning: bool = True,
    ) -> None:
        ...

5.1.2 Ingestion Flow

python

async def ingest_prompt(
    self,
    prompt_id: str,
    prompt_text: str,
    response_text: str,
    orbit_context: OrbitContext,
) -> IngestedTurn:
    """
    Complete ingestion flow:

    1. Combine prompt + response → content
    2. Compute embedding via Gemini (768-dim)
    3. Find neighbors via Rust HNSWIndex
    4. Compute trajectory via Rust IRCPPropagator
    5. Insert as memory_turn
    6. Trigger incremental learning (optional)
    """

5.2 OrbitContext

python

@dataclass
class OrbitContext:
    """Context from Orbit orchestrator."""
    project_id: str          # Orbit project UUID
    session_id: str          # Orbit session UUID
    project_status: str      # "active", "paused", etc.
    project_name: str        # Human-readable name
    session_depth: int       # Depth within session
    turn_index: int          # Turn index
    working_directory: str   # CWD
    metadata: Dict[str, Any] # Additional context

5.3 TrajectoryCoords

python

@dataclass
class TrajectoryCoords:
    """5D trajectory coordinates."""
    x: float = 0.0   # Depth
    y: float = 0.0   # Sibling order
    z: float = 0.5   # Homogeneity
    t: float = 0.0   # Temporal
    n: float = 0.5   # Complexity
    phase: str = "exploration"

    def to_dict(self) -> Dict[str, Any]:
        return {
            "trajectory_depth": self.x,
            "trajectory_sibling_order": self.y,
            "trajectory_homogeneity": self.z,
            "trajectory_temporal": self.t,
            "trajectory_complexity": self.n,
            "trajectory_phase": self.phase,
        }

5.4 IngestedTurn

python

@dataclass
class IngestedTurn:
    """Result of ingesting a prompt."""
    turn_id: str
    prompt_id: str
    content: str
    role: str
    embedding: List[float]
    trajectory_coords: TrajectoryCoords
    orbit_project_id: str
    orbit_session_id: str
    created_at: datetime
    metadata: Dict[str, Any]

    def to_supabase_row(self, conversation_id: str) -> Dict[str, Any]:
        """Convert to Supabase memory_turns row."""
        ...

---

6. ML Layer: CognitiveTwin

6.1 Architecture Overview

┌───────────────────────────────────────────────────────────────────────┐
│                        COGNITIVETWIN                                   │
│  ┌─────────────────┐  ┌──────────────────┐  ┌─────────────────────┐  │
│  │ ReasoningPattern│  │   Style          │  │   Pattern           │  │
│  │ Encoder         │──│   Projector      │──│   Memory            │  │
│  │ (768 → 1024)    │  │   (1024 → 256)   │  │   (10K patterns)    │  │
│  └────────┬────────┘  └────────┬─────────┘  └──────────┬──────────┘  │
│           │                    │                       │              │
│           └────────────────────┴───────────────────────┘              │
│                                │                                       │
│                                ▼                                       │
│  ┌─────────────────────────────────────────────────────────────────┐  │
│  │                    PromptGenerator                               │  │
│  │  Generates prompts/continuations in user's style                 │  │
│  └─────────────────────────────────────────────────────────────────┘  │
└───────────────────────────────────────────────────────────────────────┘

6.2 HybridCognitiveTwinTrainer

Location: `rag_plusplus/ml/cognitive/hybrid_trainer.py`

6.2.1 Training Modes

python

class TrainingMode(str, Enum):
    BATCH_PROMPTS = "batch_prompts"      # Train on prompt/response pairs
    BATCH_TURNS = "batch_turns"          # Train on memory_turns
    REALTIME_STYLE = "realtime_style"    # Inference-only extraction
    INCREMENTAL = "incremental"          # Update per interaction

6.2.2 Training Triggers

python

class TriggerType(str, Enum):
    SESSION_END = "session_end"    # Orbit session boundary
    SCHEDULED = "scheduled"        # Cron schedule (e.g., 3 AM daily)
    THRESHOLD = "threshold"        # After N incremental updates
    MANUAL = "manual"              # API/UI trigger

6.2.3 Core Methods

python

class HybridCognitiveTwinTrainer:
    async def batch_train_prompts(
        self, prompts: List[PromptResponsePair], epochs: int = 3
    ) -> TrainingMetrics:
        """Train on prompt/response pairs from claude_prompts."""

    async def batch_train_turns(
        self, turns: List[MemoryTurn], epochs: int = 3
    ) -> TrainingMetrics:
        """Train on unified memory_turns with trajectory context."""

    def realtime_style_extract(
        self, text: str, embedding: Optional[Tensor] = None
    ) -> StyleSignatureVector:
        """Extract style without training (inference only)."""

    async def incremental_learn(self, turn: MemoryTurn) -> None:
        """Update global signature after single interaction."""

    # Trigger handlers
    async def on_session_end(self, session_id: str) -> TrainingMetrics:
    async def on_schedule(self) -> TrainingMetrics:
    async def on_manual_trigger(self, mode: TrainingMode, filter: Dict)

6.3 GlobalStyleSignature

Location: `rag_plusplus/ml/cognitive/global_signature.py`

6.3.1 EMA Blending

python

class GlobalStyleSignature:
    """
    Evolving global style signature using Exponential Moving Average.

    signature = momentum * old + (1 - momentum) * weight * new
    """

    def __init__(
        self,
        dim: int = 256,
        momentum: float = 0.9,        # 90% old, 10% new
        snapshot_interval: int = 10,   # Snapshot every 10 updates
        max_history: int = 100,        # Keep last 100 snapshots
    ):
        self.signature = torch.zeros(dim)
        self.confidence = 0.0
        self.update_count = 0

    def blend(
        self,
        new_signature: Tensor,
        weight: float = 0.1,
        source: Optional[str] = None,
    ) -> None:
        """Blend new observation into global signature."""
        new_norm = F.normalize(new_signature.detach(), dim=0)
        self.signature = (
            self.momentum * self.signature +
            (1 - self.momentum) * weight * new_norm
        )
        self.signature = F.normalize(self.signature, dim=0)
        self.update_count += 1
        # Confidence: asymptotic to 1.0
        self.confidence = min(1.0, self.update_count / (self.update_count + 100))

6.3.2 Persistence

python

def save(self, path: Path) -> None:
    torch.save(self.state_dict(), path)

@classmethod
def load(cls, path: Path) -> "GlobalStyleSignature":
    state = torch.load(path, weights_only=True)
    instance = cls(dim=state["dim"], momentum=state["momentum"])
    instance.load_state_dict(state)
    return instance

6.4 StyleSignatureVector

python

@dataclass
class StyleSignatureVector:
    """Current style signature with metadata."""
    vector: Tensor           # 256-dim style embedding
    confidence: float        # 0-1, how confident in signature
    update_count: int        # Number of blending updates
    last_updated: datetime   # When last updated

---

7. Orchestration Layer: Orbit

7.1 Orbit Server

Location: `trajectory-orbit/crates/orbit-server/src/main.rs`

7.1.1 Core Responsibilities

1. Project Management: Track projects, status, git info
2. Session Tracking: Claude session boundaries
3. RAG++ Integration: Ingestion, search, training endpoints
4. Event Broadcasting: WebSocket updates to clients

7.2 RAG++ API Endpoints

Endpoint	Method	Description
`/api/rag/health`	GET	Health check for RAG++ integration
`/api/rag/ingest`	POST	Ingest prompt to memory_turns
`/api/rag/search`	GET	Search unified knowledge fabric
`/api/rag/context/:id`	GET	Get project context
`/api/rag/signature`	GET	Get CognitiveTwin style signature
`/api/rag/train`	POST	Trigger training
`/api/rag/session-end`	POST	Notify session end

7.2.1 Ingest Request

rust

#[derive(Debug, Deserialize)]
struct RagIngestRequest {
    prompt_id: String,
    prompt_text: String,
    response_text: String,
    session_id: String,
    project_id: String,
    project_name: Option<String>,
    working_directory: Option<String>,
    turn_index: Option<i32>,
}

7.2.2 Training Request

rust

#[derive(Debug, Deserialize)]
struct RagTrainRequest {
    mode: Option<String>,     // "batch_prompts" | "batch_turns" | "incremental"
    filter: Option<Value>,    // {"project_id": "..."} optional filter
}

7.3 Project ↔ Session ↔ Turn Hierarchy

Project (Orbit)
    └── Session (Claude Code session)
            └── Turn (memory_turn)
                    ├── content
                    ├── embedding[768]
                    └── trajectory_coords[5]

---

8. Integration Layer: Prompt Logger

8.1 Hook System

Location: `[home-path]`

8.1.1 Hook Chain

UserPromptSubmit
        │
        ▼
    main.py
        │  ├── capture prompt
        │  ├── register pending
        │  └── sync_with_orbit()
        │
        ▼
    Stop (response complete)
        │
        ▼
    response_hook.py
        │  ├── capture response
        │  ├── extract tool calls
        │  ├── capture file diffs
        │  └── sync_full_prompt()
        │
        ▼
    orbit_sync.py
        │  ├── find_project_by_path()
        │  ├── update_project_activity()
        │  └── ingest_to_rag()

8.2 orbit_sync.py

Location: `[home-path]`

8.2.1 Key Functions

python

def sync_with_orbit(
    cwd: str,
    prompt_id: str,
    prompt_text: str,
    session_id: str,
    response_text: str = "",
) -> Optional[str]:
    """Basic sync - find project, update activity, optionally ingest."""

def sync_full_prompt(
    cwd: str,
    prompt_id: str,
    prompt_text: str,
    response_text: str,
    session_id: str,
    turn_index: int = 0,
) -> Tuple[Optional[str], Optional[str]]:
    """Full sync with RAG++ ingestion. Returns (project_name, memory_turn_id)."""

def ingest_to_rag(
    prompt_id: str,
    prompt_text: str,
    response_text: str,
    session_id: str,
    project_id: str,
    project_name: str,
    cwd: str,
    turn_index: int = 0,
) -> Optional[str]:
    """POST to /api/rag/ingest. Returns memory_turn_id."""

def notify_session_end(session_id: str, project_id: str) -> bool:
    """Notify Orbit of session end to trigger training."""

8.3 MCP Server

Location: `[home-path]`

8.3.1 RAG++ Tools

python

# Search unified memory_turns
{
    "name": "rag_search",
    "description": "Search the unified memory_turns knowledge fabric via RAG++.",
    "inputSchema": {
        "properties": {
            "query": {"type": "string"},
            "limit": {"type": "integer", "default": 10},
            "project_id": {"type": "string"}  # optional filter
        },
        "required": ["query"]
    }
}

# Get project context
{
    "name": "rag_context",
    "description": "Get contextual memory turns for a project/session.",
    "inputSchema": {
        "properties": {
            "project_id": {"type": "string"},
            "session_id": {"type": "string"},
            "limit": {"type": "integer", "default": 20}
        },
        "required": ["project_id"]
    }
}

# Get style signature
{
    "name": "rag_style_signature",
    "description": "Get the current global CognitiveTwin style signature."
}

# Trigger training
{
    "name": "rag_trigger_training",
    "description": "Trigger CognitiveTwin training via Orbit.",
    "inputSchema": {
        "properties": {
            "mode": {"enum": ["batch_prompts", "batch_turns", "incremental"]},
            "project_id": {"type": "string"}
        }
    }
}

# Health check
{
    "name": "rag_health",
    "description": "Check RAG++ and Orbit integration health."
}

---

9. API Layer: Endpoints Reference

9.1 Orbit Server Endpoints

9.1.1 Health & Status

Endpoint	Method	Response
`/health`	GET	`{"status": "ok", "service": "orbit-server"}`
`/api/hashes`	GET	Data hashes for change detection

9.1.2 Projects

Endpoint	Method	Description
`/api/projects`	GET	List all projects
`/api/projects`	POST	Create project
`/api/projects/:id`	GET	Get project
`/api/projects/:id`	PUT	Update project
`/api/projects/:id`	DELETE	Delete project
`/api/projects/:id/activity`	POST	Touch last_active_at
`/api/projects/:id/context`	GET	Get project context
`/api/projects/:id/tree`	GET	Get file tree

9.1.3 Sessions

Endpoint	Method	Description
`/api/sessions`	GET	List sessions
`/api/sessions/detect`	GET	Detect Claude sessions
`/api/projects/:id/sessions`	POST	Start session
`/api/sessions/:id`	DELETE	End session

9.1.4 RAG++ Integration

Endpoint	Method	Request	Response
`/api/rag/health`	GET	-	`{"status": "ok", "endpoints": [...]}`
`/api/rag/ingest`	POST	RagIngestRequest	`{"memory_turn_id": "..."}`
`/api/rag/search`	GET	`?query=...&limit=10`	`{"results": [...]}`
`/api/rag/context/:id`	GET	`?session_id=...`	`{"turns": [...]}`
`/api/rag/signature`	GET	-	`{"signature": [...], "confidence": 0.8}`
`/api/rag/train`	POST	RagTrainRequest	`{"triggered": true}`
`/api/rag/session-end`	POST	RagSessionEndRequest	`{"training_triggered": true}`

9.2 MCP Protocol

9.2.1 Resources

URI	Description
`prompt-logs://recent`	Recent prompt entries
`prompt-logs://prompt/{id}`	Full prompt entry
`prompt-logs://prompt/{id}/diffs`	File diffs for prompt
`prompt-logs://plans`	Plans index
`prompt-logs://plan/{slug}`	Plan content

9.2.2 Tools (via `tools/call`)

`prompt_logs_recent`
`prompt_logs_get`
`prompt_logs_diffs`
`rag_search`
`rag_context`
`rag_style_signature`
`rag_trigger_training`
`rag_health`

---

10. Data Flow Diagrams

10.1 Prompt Ingestion Flow

┌─────────────┐
│ User types  │
│ prompt in   │
│ Claude Code │
└──────┬──────┘
       │
       ▼
┌─────────────────┐
│ main.py hook    │
│ captures prompt │
└────────┬────────┘
         │
         ▼
┌─────────────────────────────────────────────────────────────────┐
│                     Claude Code processes                        │
└────────────────────────────┬────────────────────────────────────┘
                             │
                             ▼
┌─────────────────────────────────────────────────────────────────┐
│ response_hook.py captures:                                       │
│   - Full text response                                           │
│   - Tool calls (Read, Edit, Write, Bash)                        │
│   - File diffs                                                   │
│   - Timing metrics                                               │
└────────────────────────────┬────────────────────────────────────┘
                             │
                             ▼
┌─────────────────────────────────────────────────────────────────┐
│ orbit_sync.sync_full_prompt()                                    │
│   ├── find_project_by_path(cwd)                                 │
│   ├── update_project_activity(project_id)                       │
│   └── ingest_to_rag(...)                                        │
└────────────────────────────┬────────────────────────────────────┘
                             │
                             ▼
┌─────────────────────────────────────────────────────────────────┐
│ Orbit Server: POST /api/rag/ingest                              │
│   └── Forwards to PromptIngester                                │
└────────────────────────────┬────────────────────────────────────┘
                             │
                             ▼
┌─────────────────────────────────────────────────────────────────┐
│ PromptIngester.ingest_prompt()                                   │
│   ├── content = f"User: {prompt}\n\nAssistant: {response}"      │
│   ├── embedding = GeminiEmbedder.embed(content)  [768-dim]      │
│   ├── neighbors = HNSWIndex.search(embedding, k=10)             │
│   ├── coords = compute_trajectory_coordinates(...)               │
│   ├── Supabase.insert(memory_turns, {...})                      │
│   └── HybridTrainer.incremental_learn(turn)                     │
└────────────────────────────┬────────────────────────────────────┘
                             │
                             ▼
┌─────────────────────────────────────────────────────────────────┐
│ memory_turns table:                                              │
│   id: uuid                                                       │
│   content: "User: ...\n\nAssistant: ..."                        │
│   embedding: [768 floats]                                        │
│   trajectory_depth: 0.25                                         │
│   trajectory_sibling_order: 0.0                                  │
│   trajectory_homogeneity: 0.72                                   │
│   trajectory_temporal: 0.12                                      │
│   trajectory_complexity: 0.45                                    │
│   trajectory_phase: "implementation"                             │
│   metadata: {"source": "claude_prompt", "orbit_project_id": ...} │
└─────────────────────────────────────────────────────────────────┘

10.2 Training Trigger Flow

┌───────────────────┐     ┌───────────────────┐     ┌───────────────────┐
│   Session End     │     │    Scheduled      │     │    Threshold      │
│   (Orbit event)   │     │    (3 AM daily)   │     │    (50 updates)   │
└─────────┬─────────┘     └─────────┬─────────┘     └─────────┬─────────┘
          │                         │                         │
          ▼                         ▼                         ▼
┌─────────────────────────────────────────────────────────────────────────┐
│                     HybridCognitiveTwinTrainer                          │
│  ┌────────────────────────────────────────────────────────────────┐    │
│  │ on_session_end(session_id)                                      │    │
│  │   └── turns = fetch_session_turns(session_id)                   │    │
│  │       └── batch_train_turns(turns)                              │    │
│  │                                                                  │    │
│  │ on_schedule()                                                    │    │
│  │   └── turns = fetch_recent_turns(since=last_scheduled)          │    │
│  │       └── batch_train_turns(turns)                              │    │
│  │                                                                  │    │
│  │ _trigger_threshold_training()                                    │    │
│  │   └── batch_train_turns(pending_batch)                          │    │
│  └────────────────────────────────────────────────────────────────┘    │
│                                 │                                       │
│                                 ▼                                       │
│  ┌────────────────────────────────────────────────────────────────┐    │
│  │ batch_train_turns(turns):                                       │    │
│  │   1. Stack embeddings → [batch, 768]                            │    │
│  │   2. Stack trajectory_coords → [batch, 5]                       │    │
│  │   3. CognitiveTwin.learn_from_conversation(...)                 │    │
│  │   4. Update GlobalStyleSignature with batch                     │    │
│  │   5. Persist checkpoint                                         │    │
│  └────────────────────────────────────────────────────────────────┘    │
│                                 │                                       │
│                                 ▼                                       │
│  ┌────────────────────────────────────────────────────────────────┐    │
│  │ GlobalStyleSignature.blend_batch(style_embeddings)              │    │
│  │   └── signature = EMA(old_signature, weighted_avg(new))         │    │
│  │   └── confidence = min(1.0, updates / (updates + 100))          │    │
│  └────────────────────────────────────────────────────────────────┘    │
└─────────────────────────────────────────────────────────────────────────┘

10.3 Retrieval Flow

┌─────────────────┐
│ MCP Tool Call:  │
│ rag_search      │
│ query="..."     │
└────────┬────────┘
         │
         ▼
┌─────────────────────────────────────────────────────────────────┐
│ MCP Server → Orbit Server: GET /api/rag/search?query=...       │
└────────────────────────────┬────────────────────────────────────┘
                             │
                             ▼
┌─────────────────────────────────────────────────────────────────┐
│ RAG++ Search Pipeline                                            │
│   1. query_embedding = GeminiEmbedder.embed(query, RETRIEVAL_QUERY)  │
│   2. candidates = Supabase.rpc('match_turns', {                  │
│        query_embedding: [...],                                   │
│        match_count: 20,                                          │
│        similarity_threshold: 0.7                                 │
│      })                                                          │
│   3. reranked = TrajectoryReranker.rerank(candidates, context)   │
│   4. return top_k                                                │
└────────────────────────────┬────────────────────────────────────┘
                             │
                             ▼
┌─────────────────────────────────────────────────────────────────┐
│ Response:                                                        │
│ {                                                                │
│   "query": "...",                                                │
│   "results": [                                                   │
│     {                                                            │
│       "turn_id": "...",                                          │
│       "content": "...",                                          │
│       "similarity": 0.85,                                        │
│       "trajectory": {                                            │
│         "depth": 0.3,                                            │
│         "phase": "implementation"                                │
│       }                                                          │
│     },                                                           │
│     ...                                                          │
│   ]                                                              │
│ }                                                                │
└─────────────────────────────────────────────────────────────────┘

---

11. Configuration Reference

11.1 Environment Variables

bash

# Supabase
SUPABASE_URL=https://project.supabase.co
SUPABASE_ANON_KEY=eyJ...

# Orbit
ORBIT_SERVER_URL=http://localhost:3847
ORBIT_DATABASE_URL=[home-path]
ORBIT_STORAGE_BACKEND=sqlite  # sqlite | supabase | hybrid

# RAG++ Integration
RAG_INGEST_ENABLED=true

# Gemini (for embeddings)
GOOGLE_API_KEY=...

# Training
COGNITIVE_TWIN_CHECKPOINT_DIR=checkpoints/cognitive_twin
GLOBAL_SIGNATURE_PATH=checkpoints/cognitive_twin/global_signature.pt

11.2 HybridConfig

python

@dataclass
class HybridConfig:
    enable_incremental: bool = True        # Run incremental learning
    incremental_weight: float = 0.1        # Weight for incremental blend
    threshold_count: int = 50              # Trigger batch after N updates
    signature_checkpoint_path: Path = ...  # Path to save signature
    style_dim: int = 256                   # Style embedding dimension
    batch_epochs: int = 3                  # Epochs for batch training
    min_batch_size: int = 10               # Minimum samples for batch
    auto_save_signature: bool = True       # Auto-persist after training

11.3 ScheduleConfig

python

@dataclass
class ScheduleConfig:
    daily_hour: int = 3              # Hour for daily training
    daily_minute: int = 0            # Minute for daily training
    enable_daily: bool = True        # Enable daily training
    interval_hours: int = 24         # Interval between trainings
    timezone: str = "UTC"            # Timezone for scheduling
    celery_broker: str = "redis://localhost:6379/0"  # For distributed

11.4 CognitiveTwinConfig Presets

python

# Fast (inference-optimized)
CognitiveTwinConfig.fast()
# - 2 attention heads
# - 128-dim style
# - Approximate retrieval

# Accurate (quality-optimized)
CognitiveTwinConfig.accurate()
# - 8 attention heads
# - 512-dim style
# - Exact retrieval

# Balanced (default)
CognitiveTwinConfig.balanced()
# - 4 attention heads
# - 256-dim style
# - Hybrid retrieval

---

12. Component Interfaces

12.1 PromptIngester Interface

python

class PromptIngester:
    async def ingest_prompt(
        self,
        prompt_id: str,
        prompt_text: str,
        response_text: str,
        orbit_context: OrbitContext,
    ) -> IngestedTurn:
        """Ingest single prompt."""

    async def ingest_batch(
        self,
        prompts: List[Dict[str, Any]],
        orbit_context: OrbitContext,
    ) -> List[IngestedTurn]:
        """Ingest batch of prompts."""

    def add_to_index(self, turn_id: str, embedding: List[float]) -> None:
        """Add turn to HNSW index."""

    def build_index_from_supabase(self, limit: int = 100000) -> int:
        """Build index from existing turns."""

12.2 HybridCognitiveTwinTrainer Interface

python

class HybridCognitiveTwinTrainer:
    # Training modes
    async def batch_train_prompts(self, prompts, epochs) -> TrainingMetrics
    async def batch_train_turns(self, turns, epochs) -> TrainingMetrics
    def realtime_style_extract(self, text, embedding) -> StyleSignatureVector
    async def incremental_learn(self, turn) -> None

    # Triggers
    async def on_session_end(self, session_id) -> TrainingMetrics
    async def on_schedule(self) -> TrainingMetrics
    async def on_manual_trigger(self, mode, filter) -> TrainingMetrics

    # Signature management
    def get_global_signature(self) -> StyleSignatureVector
    def persist_global_signature(self) -> None
    def reset_global_signature(self) -> None

    # History
    def get_training_history(self) -> List[Dict[str, Any]]

12.3 GlobalStyleSignature Interface

python

class GlobalStyleSignature:
    def blend(self, new_signature, weight, source) -> None
        """Blend single signature."""

    def blend_batch(self, signatures, weights) -> None
        """Blend batch of signatures."""

    def current(self) -> StyleSignatureVector
        """Get current signature state."""

    def similarity(self, other) -> float
        """Cosine similarity with another signature."""

    def state_dict(self) -> Dict[str, Any]
        """For checkpointing."""

    def load_state_dict(self, state) -> None
        """Load from checkpoint."""

    def save(self, path) -> None
    def load(cls, path) -> GlobalStyleSignature
    def reset(self) -> None

12.4 TrainingScheduler Interface

python

class TrainingScheduler:
    def start(self) -> None
        """Start scheduler."""

    def stop(self) -> None
        """Stop scheduler."""

    async def trigger_now(self) -> Dict[str, Any]
        """Manual immediate trigger."""

    def add_callback(self, callback) -> None
        """Add completion callback."""

    def get_status(self) -> Dict[str, Any]
        """Get scheduler status."""

12.5 OrbitSessionHandler Interface

python

class OrbitSessionHandler:
    async def on_session_end(self, session_id, project_id) -> Dict[str, Any]
        """Handle session end event."""

    def on_session_start(self, session_id, project_id) -> None
        """Track session start."""

    def get_active_sessions(self) -> Dict[str, str]
        """Get currently active sessions."""

---

Appendix A: File Locations

Component	Path
Rust Core	`crates/core/src/`
PyO3 Bindings	`crates/python/src/lib.rs`
PromptIngester	`rag_plusplus/ingestion/prompt_ingester.py`
HybridTrainer	`rag_plusplus/ml/cognitive/hybrid_trainer.py`
GlobalSignature	`rag_plusplus/ml/cognitive/global_signature.py`
TrainingScheduler	`rag_plusplus/ml/training/scheduler.py`
Orbit Server	`trajectory-orbit/crates/orbit-server/src/main.rs`
Prompt Logger	`[home-path]`
MCP Server	`[home-path]`
orbit_sync	`[home-path]`

---

Appendix B: Glossary

Term	Definition
5D Trajectory	5-dimensional coordinate system (depth, sibling, homogeneity, temporal, complexity)
CognitiveTwin	ML model learning user reasoning patterns and style
EMA	Exponential Moving Average - smoothing technique for signature blending
GlobalStyleSignature	Single evolving style vector representing user across all projects
HNSW	Hierarchical Navigable Small World - approximate nearest neighbor index
IRCP	Inverse Reasoning Coordinate Propagator - attention computation
memory_turns	Unified table storing all AI interactions
MCP	Model Context Protocol - interface for AI assistants to access tools
Orbit	Project orchestrator managing sessions, context, and training
PromptIngester	Component converting prompts to memory_turns
Salience	Importance score for a turn
Trajectory Phase	Exploration, Refinement, Application, or Resolution

---

Appendix C: E2E Verification (Cloud Run)

Use this when validating the full pipeline: Prompt Logger → Orbit → RAG++ → Supabase.

Prereqs:
- Cloud Run services are healthy.
- `SUPABASE_URL` and `SUPABASE_ACCESS_TOKEN` are set in `.env`.
- `ORBIT_SERVER_URL` points to Cloud Run (or is exported in your shell).

Quick command:

bash

scripts/e2e_ingest_test.sh

Expected:
- The ingest call returns a `memory_turn_id`.
- The Supabase query returns a row matching the generated `prompt_id`.

End of Architecture Document

Promotion Decision

Promote into a technical note or architecture paper with implementation anchors.

Source Anchor

Comp-Core/core/retrieval/cc-rag-plus-plus/docs/UNIFIED_RAG_ARCHITECTURE.md

Detected Structure

Method · Evaluation · References · Code Anchors · Architecture

Full Public Reader

Unified RAG++ Architecture

Complete System Reference

Table of Contents

1. System Overview

1.1 Purpose

1.2 Core Principles

1.3 System Components

2. Layer Architecture

2.1 Layer Stack (Bottom to Top)

2.2 Dependency Graph

3. Foundation Layer: Rust Core

3.1 Overview

3.2 PyO3 Bindings

3.2.1 HNSWIndex

3.2.2 IRCPPropagator

3.2.3 compute_trajectory_coordinates

3.3 TrajectoryCoordinate5D

3.3.1 Phase Enumeration

3.4 Rust Module Structure

4. Data Layer: Supabase Schema

4.1 Core Tables

4.1.1 memory_conversations

4.1.2 memory_turns (The Unified Knowledge Fabric)

4.1.3 claude_prompts (Prompt Logger Source)

4.2 Indexes and Performance

4.3 Metadata Schema

5. Ingestion Layer: Prompt Pipeline

5.1 PromptIngester

5.1.1 Class Definition

5.1.2 Ingestion Flow

5.2 OrbitContext

5.3 TrajectoryCoords

5.4 IngestedTurn

6. ML Layer: CognitiveTwin

6.1 Architecture Overview

6.2 HybridCognitiveTwinTrainer

6.2.1 Training Modes

6.2.2 Training Triggers

6.2.3 Core Methods

6.3 GlobalStyleSignature

6.3.1 EMA Blending

6.3.2 Persistence

6.4 StyleSignatureVector

7. Orchestration Layer: Orbit

7.1 Orbit Server

7.1.1 Core Responsibilities

7.2 RAG++ API Endpoints

7.2.1 Ingest Request

7.2.2 Training Request

7.3 Project ↔ Session ↔ Turn Hierarchy

8. Integration Layer: Prompt Logger

8.1 Hook System

8.1.1 Hook Chain

8.2 orbit_sync.py

8.2.1 Key Functions

8.3 MCP Server

8.3.1 RAG++ Tools

9. API Layer: Endpoints Reference

9.1 Orbit Server Endpoints

9.1.1 Health & Status

9.1.2 Projects

9.1.3 Sessions

9.1.4 RAG++ Integration

9.2 MCP Protocol

9.2.1 Resources

9.2.2 Tools (via `tools/call`)

10. Data Flow Diagrams

10.1 Prompt Ingestion Flow

10.2 Training Trigger Flow

10.3 Retrieval Flow

11. Configuration Reference

11.1 Environment Variables

11.2 HybridConfig

11.3 ScheduleConfig

11.4 CognitiveTwinConfig Presets

12. Component Interfaces

12.1 PromptIngester Interface

12.2 HybridCognitiveTwinTrainer Interface

12.3 GlobalStyleSignature Interface