Back to corpus
research noteexperiment writeup candidatescore 24
Getting Started: Build Your Personal AI
You have **289 MB of personal data** across 5 files: - conversations.json (190 MB) - conversations_new.json (64 MB) - conversation_openai.json (8 MB) - notes.json (15 MB) - cc_conversations.json (13 MB)
Full HTML reader
Read the full artifact
Extracted abstract or opening context
Quick guide to building your personalized AI inference system with full context memory.
You have **289 MB of personal data** across 5 files: - conversations.json (190 MB) - conversations_new.json (64 MB) - conversation_openai.json (8 MB) - notes.json (15 MB) - cc_conversations.json (13 MB)
We'll transform this into a **personal AI** that knows everything about you and your projects.
That's it! The `embeddinggemma-300m` model will download automatically on first run (~600 MB).
**What it does**: - ✅ Loads all conversations from all sources - ✅ Extracts clean message threads - ✅ Deduplicates content - ✅ Auto-categorizes by topic (CC, music, business, etc.) - ✅ Creates `data/unified_knowledge.json`
Promotion decision
What has to happen next
Attach run IDs, datasets, metrics, and reproduction commands.
Why this is not always a full paper yet
Corpus pages are public-safe readers for discovered workspace artifacts. They are not automatically final papers. A corpus item becomes a polished paper only after the editable source, evidence checkpoints, references, figures, render path, and release status are attached through the paper schema.