Back to corpus
research noteexperiment writeup candidatescore 24

Getting Started: Build Your Personal AI

You have **289 MB of personal data** across 5 files: - conversations.json (190 MB) - conversations_new.json (64 MB) - conversation_openai.json (8 MB) - notes.json (15 MB) - cc_conversations.json (13 MB)

Full HTML reader

Read the full artifact

Open in new tab

Extracted abstract or opening context

Quick guide to building your personalized AI inference system with full context memory. You have **289 MB of personal data** across 5 files: - conversations.json (190 MB) - conversations_new.json (64 MB) - conversation_openai.json (8 MB) - notes.json (15 MB) - cc_conversations.json (13 MB) We'll transform this into a **personal AI** that knows everything about you and your projects. That's it! The `embeddinggemma-300m` model will download automatically on first run (~600 MB). **What it does**: - ✅ Loads all conversations from all sources - ✅ Extracts clean message threads - ✅ Deduplicates content - ✅ Auto-categorizes by topic (CC, music, business, etc.) - ✅ Creates `data/unified_knowledge.json`

Promotion decision

What has to happen next

Attach run IDs, datasets, metrics, and reproduction commands.

Why this is not always a full paper yet

Corpus pages are public-safe readers for discovered workspace artifacts. They are not automatically final papers. A corpus item becomes a polished paper only after the editable source, evidence checkpoints, references, figures, render path, and release status are attached through the paper schema.