Personal arXiv

Research you can
open and save.

A paper-style HTML archive for the research corpus. Each entry has a full standalone HTML reader on the server, so it can be opened, downloaded, saved, and promoted into the living paper schema when it is ready.

HTML files

1147

Preprint shelf

68

Technical

272

Backlog

807

Preprint shelf

Closest to submission form

HTML here is readable and downloadable. It is not the same as a submitted paper until references, figures, evidence checkpoints, and release status are attached.

preprint structure candidatescore 100

The Anticipatory Transformer: Geometry-Steered Attention for Trajectory-Aware Reasoning

Standard transformers attend based on learned position encodings (sinusoidal, RoPE, ALiBi) that encode *where* tokens are in a sequence but not *what the sequence is doing* as a geometric process. I introduce the Anticipatory Transformer, a modified transformer architecture where seven geometric scalars derived from Anticipation Geometry (commitment, uncertainty, transition pressure, recovery margin, phase stiffness, novelty, stability) steer the multi-head attention mechanism via additive bias. The trajectory bias

preprint structure candidatescore 100

Anticipation Geometry: Domain-General Trajectory Characterization with Knowledge Graph-Grounded Rewards

I present Anticipation Geometry, a mathematical framework that characterizes trajectories through arbitrary state spaces using seven geometric scalars: commitment, uncertainty, transition pressure, recovery margin, phase stiffness, novelty, and stability. These scalars are domain-general, operating on any sequence of vectors in a metric space equipped with a differentiable time parameter. I combine this framework with knowledge graph path-derived reward signals to create a unified system for both trajectory analysi

preprint structure candidatescore 100

Cognitive Twin: Personality Transfer via Small-Model LoRA with Runtime Knowledge Graph Augmentation

We present the Cognitive Twin architecture, a three-component system that produces a faithful digital replica of a human operator's conversational persona without baking volatile domain knowledge into model weights. The architecture separates personality (a LoRA adapter trained on the operator's historical responses), knowledge (a live knowledge graph queried at inference time), and trajectory awareness (geometric scalars characterizing conversation dynamics). We find that a Qwen2.5-3B model with LoRA adapters on a

preprint structure candidatescore 100

Cognitive Twin Synthesis: A Recursive Polymodal Framework for Autonomous Agent Identity from Conversational Corpora

We present a framework for constructing autonomous cognitive twins from large-scale conversational corpora. Building on the Recursive Polymodal Synthesis (RPS) framework, which fuses heterogeneous sensor modalities through Lipschitz-constrained fixed-point iteration, we extend cross-modal coherence theory from physical signals (accelerometer, gyroscope, heart rate) to cognitive modalities: linguistic style ($\mathcal{V}_L$), decision patterns ($\mathcal{V}_D$), domain knowledge ($\mathcal{V}_K$), value alignment ($

preprint render candidatescore 100

Cognitive Twin Synthesis: Theorems, Proofs, and Derivations

This document presents the mathematical foundations for constructing a cognitive twin using Recursive Polymodal Synthesis (RPS). We extend the original RPS framework from sensor modalities (motion, heart rate, audio) to cognitive modalities (linguistic style, decision patterns, knowledge, values, temporal behavior). We prove convergence of the cognitive synthesis operator, derive the coherence energy functional, establish bounds on identity drift, and formalize the autonomy ratchet protocol. All results build on th

preprint structure candidatescore 100

Enhanced Topological Preference Optimization with Spatial Intelligence: A Unified Framework for Conversation Analysis

We present an enhanced Topological Preference Optimization (TPO) system that integrates spatial intelligence and cross-conversation consolidation for advanced conversation analysis. Our unified framework combines the topological structure analysis of TPO with the spatial coordinate systems and ring topology of Ring Contextual Propagation (RCP), creating a comprehensive system for modeling conversation dynamics and generating preference datasets. The system employs 4D spatial coordinates (x, y, z, t) to represent hi

preprint structure candidatescore 100

Enhanced Topological Preference Optimization with Spatial Intelligence: A Unified Framework for Conversation Analysis

We present an enhanced Topological Preference Optimization (TPO) system that integrates spatial intelligence and cross-conversation consolidation for advanced conversation analysis. Our unified framework combines the topological structure analysis of TPO with the spatial coordinate systems and ring topology of Ring Contextual Propagation (RCP), creating a comprehensive system for modeling conversation dynamics and generating preference datasets. The system employs 4D spatial coordinates (x, y, z, t) to represent hi

preprint structure candidatescore 100

CC-MotionGen: Audio-Conditioned Latent Motion Diffusion with Validation-Based Candidate Selection

CC-MotionGen is a diffusion-based generative system that produces time-indexed motion trajectories conditioned on audio features and optional high-level context. The system targets phrase-level generation: it consumes precomputed audio feature tensors and precomputed motion latents, trains a temporal one-dimensional U-Net denoiser under a Gaussian diffusion process, and performs inference by sampling multiple candidate futures and selecting the best output using a two-stage validation pipeline. The validation pipel

preprint structure candidatescore 100

RAG++: Memory-Conditioned Candidate Selection with Trajectory-Aware Attention

Retrieval-Augmented Generation (RAG) systems typically treat retrieved context as a flat collection of documents, ignoring the structural and temporal relationships between conversation turns. We present RAG++, a trajectory-aware retrieval system that positions memories in a 5-dimensional coordinate space (depth, sibling order, homogeneity, temporal position, and complexity) and enforces context admissibility through cryptographically-verified slicing. Our system introduces three key innovations: (1) **Inverse Ring

preprint render candidatescore 100

CognitiveTwin: Architectural Foundations and Empirical Evaluation of Personalized Language Model Adaptation Through Trajectory-Aware Fine-Tuning

The construction of personalized language model instances capable of reproducing individual cognitive patterns, stylistic signatures, and domain-specific conceptual frameworks represents a significant advancement in the development of AI systems that function as cognitive extensions rather than generic tools. This paper presents the CognitiveTwin framework, a comprehensive architecture for creating personalized language model instances through trajectory-aware supervised fine-tuning on conversational interaction hi

preprint structure candidatescore 100

Policy-Governed Context Slicing for Autonomous Agent Systems: A Lightweight Knowledge Graph Approach

Autonomous AI agent systems face a fundamental challenge: constructing reproducible, trustworthy context windows from large conversational histories while enforcing governance policies over what information may influence downstream decisions. We present the **Graph Kernel**, a deterministic context slicing engine implemented as a single Rust binary (~15 KLOC) that combines a lightweight knowledge graph triple store with cryptographically-signed, policy-governed context window construction. Unlike general-purpose gr

preprint render candidatescore 100

Deterministic Provenance Engines for Autonomous Agent Systems: Architecture, Implementation, and Evaluation of the Graph Kernel

Autonomous AI agents making consequential decisions require infrastructure that ensures every reasoning step is traceable, reproducible, and verifiable. We present the \textbf{Graph Kernel}, a deterministic provenance engine implemented as a single Rust binary (${\sim}15$~KLOC) that produces cryptographically-signed, policy-governed context windows---termed \emph{admissible evidence bundles}---for autonomous agent reasoning. Unlike general-purpose graph databases, vector stores, or RAG pipelines, the Graph Kernel i

preprint render candidatescore 100

Policy-Governed Context Slicing for Autonomous Agent Systems: A Lightweight Knowledge Graph Approach

Autonomous AI agent systems face a fundamental challenge: constructing reproducible, trustworthy context windows from large conversational histories while enforcing governance policies over what information may influence downstream decisions. We present the \textbf{Graph Kernel}, a deterministic context slicing engine implemented as a single Rust binary (${\sim}15$ KLOC) that combines a lightweight knowledge graph triple store with cryptographically-signed, policy-governed context window construction. Unlike genera

preprint render candidatescore 100

Graph-Augmented Recursive Language Models for Personal Knowledge Systems

% ============================================================ We present \textbf{Cog-RLM}, a graph-augmented recursive language model architecture for personal knowledge systems that achieves 90.3\% accuracy on a comprehensive 103-question evaluation spanning ten cognitive dimensions, using a stock 3-billion parameter model with zero fine-tuning and zero inference cost. Our system extends the Recursive Language Model (RLM) paradigm~\citep{zhang2025rlm} with three novel contributions: (1)~a local knowledge graph pr

preprint structure candidatescore 100

Anticipation Geometry: Domain-General Trajectory Characterization with Knowledge Graph-Grounded Rewards

We present Anticipation Geometry, a mathematical framework that characterizes trajectories through arbitrary state spaces using seven geometric scalars: commitment, uncertainty, transition pressure, recovery margin, phase stiffness, novelty, and stability. Originally developed for physical motion capture in the Comp-Core system, we prove these scalars are domain-general, operating on any sequence of vectors in a metric space equipped with a differentiable time parameter. We combine this framework with knowledge gra

preprint structure candidatescore 100

Computational Choreography: Deterministic Motion-to-Audio Synthesis via Geometric Anticipation Signals

We present Computational Choreography, a deterministic pipeline that transforms heterogeneous sensor input -- phone accelerometer, smartwatch heart rate, full-body IMU skeleton -- into real-time audio synthesis through geometric anticipation signals. The system guarantees deterministic replay: identical sensor input always produces identical audio output. The key innovation is the Anticipation Kernel, which computes seven geometric scalars (commitment, uncertainty, transition pressure, recovery margin, phase stiffn

preprint structure candidatescore 100

Live Knowledge Graphs: Runtime Graph Integration for Continuous Domain Adaptation in Language Agents

Recent work on Domain-Specific Superintelligence (Belova et al., 2026) demonstrates that knowledge graph-derived training curricula produce domain specialists that outperform models 400x their size. However, this approach treats knowledge graphs as static training scaffolding: constructed once, used for fine-tuning, then discarded at inference. We present an alternative: runtime knowledge graph integration, where the graph is queried live during inference with provenance-tracked context slicing, real-time entity re

preprint structure candidatescore 100

KARL: Advantage-Weighted Training from Full Agent Session Traces

Standard supervised fine-tuning (SFT) for language model agents operates on input-output pairs: a prompt and the response the model should produce. This format captures *what* an agent said but discards *why* it made specific decisions. We present KARL (Knowledge-Augmented Reinforcement Learning), a trajectory intelligence system that trains language model agents from full session traces rather than isolated completions. A trajectory in KARL records every tool call, file read, code edit, bash command, success signa

preprint structure candidatescore 100

Inscription-Conditioned Cognitive Twin: N'Ko Sigil Encoding as Semantic Compression for Long-Context Personality Models

Context window limitations constrain the fidelity of small personality models. A 4B parameter model with a 32K token context can hold roughly 8,000 words of conversation history before truncation begins discarding information critical to persona coherence. We present the Inscription-Conditioned Cognitive Twin (ICCT), an architecture that addresses this bottleneck by encoding conversation history as N'Ko inscriptions rather than English prose. The encoding uses 10 N'Ko sigils, each a single Unicode character derived

preprint structure candidatescore 100

KARL-Edge: Multi-Signal Reinforcement Learning for Software Engineering Agents on Commodity Hardware

We present KARL-Edge, an adaptation of the Knowledge Agents via Reinforcement Learning (KARL) framework to multi-tool software engineering agents running on commodity Apple Silicon hardware. Where the original KARL system (Chang et al., 2026) trains enterprise search agents using full off-policy RL with binary reward signals, our system introduces three architectural contributions: (1) a 5-signal composite reward function that decomposes trajectory quality into outcome, process, efficiency, verification, and consis

preprint structure candidatescore 100

Trajectory Memory Ledger

We present the Trajectory Memory Ledger, implemented in KARL, a schema-normalized experience replay system for improving AI coding agent performance through closed-loop feedback. The ledger records complete tool-use sequences during real coding sessions, normalizes them into an append-only schema, scores them using a six-signal composite reward function (outcome, process, efficiency, verification, consistency, and wasted motion), and uses the highest-scoring trajectories to generate advantage-weighted supervised fi

preprint structure candidatescore 100

Live Knowledge Graphs: Runtime Graph Integration for Continuous Domain Adaptation in Language Agents

Recent work on Domain-Specific Superintelligence (Belova et al., 2026) demonstrates that knowledge graph-derived training curricula produce domain specialists that outperform models 400x their size. However, this approach treats knowledge graphs as static training scaffolding: constructed once, used for fine-tuning, then discarded at inference. We present an alternative: runtime knowledge graph integration, where the graph is queried live during inference with provenance-tracked context slicing, real-time entity re

preprint render candidatescore 100

The Script That Machines Can't Read: Adapting Large Language Models for N'Ko

We present a systematic study of how large language models process N'Ko (\texttt{U+07C0--U+07FF}), an alphabetic script used by over 40 million Manding-language speakers in West Africa. Through activation profiling (``brain scanning'') of Qwen3-8B before and after fine-tuning, we demonstrate that: (1) fine-tuning concentrates N'Ko adaptation in the top 8 transformer layers, reducing activation magnitudes in reasoning layers while amplifying output confidence; (2) a three-stage training pipeline---continued pre-trai

preprint render candidatescore 100

From Dead Circuits to Living Speech: Activation Profiling and Script-Native ASR for N'Ko

N'Ko is an alphabetic script serving over 40 million Manding-language speakers across West Africa, engineered by Solomana Kant\'{e} in 1949 with a strict 1:1 phoneme-to-character mapping, explicit tonal diacritics, and zero spelling exceptions. We present a dual-thread investigation into why large language models (LLMs) fail on N'Ko and how to build audio-to-N'Ko speech recognition that bypasses LLMs entirely. \textbf{Thread 1 (Diagnostic):} We perform activation profiling---a ``brain scan''---of Qwen2-72B-Instruct

preprint render candidatescore 100

The Script That Machines Can't Read: Adapting Large Language Models for N'Ko

We present a systematic study of how large language models process N'Ko (\texttt{U+07C0--U+07FF}), an alphabetic script used by over 40 million Manding-language speakers in West Africa. Through activation profiling (``brain scanning'') of Qwen3-8B before and after fine-tuning, we demonstrate that: (1) fine-tuning concentrates N'Ko adaptation in the top 8 transformer layers, reducing activation magnitudes in reasoning layers while amplifying output confidence; (2) a three-stage training pipeline---continued pre-trai

preprint render candidatescore 100

From Dead Circuits to Living Speech: Activation Profiling, Script-Native Architecture Search, and Finite-State Phonotactics for N'Ko Automatic Speech Recognition

\nko{} is an alphabetic script serving over forty million Manding-language speakers across West Africa, engineered by Solomana Kant\'e in 1949 with a strict one-to-one phoneme-to-character mapping, explicit tonal diacritics, and zero spelling exceptions. We present a dual-thread investigation into why large language models fail on \nko{} and how to construct audio-to-\nko{} speech recognition that bypasses such models entirely. In the diagnostic thread, we perform activation profiling of Qwen2-72B-Instruct (4-bit N

preprint render candidatescore 100

Theorems, Proofs, and Derivations for N'Ko Script-Native ASR

This document collects the formal mathematical results underlying the N'Ko Brain Scanner and ASR system. We present five main theorems with proofs, three derivations of key quantities, and two corollaries that connect the LLM diagnostic thread to the ASR construction thread. The results establish: (1) a phonetic transparency advantage for CTC decoding on bijective scripts, (2) bounds on the translation tax in under-represented scripts, (3) completeness and soundness of the FSM phonotactic validator, (4) a circuit d

preprint structure candidatescore 100

N'Ko as an Extensible Phonemic Substrate for Governed Low-Resource Speech

Low-resource speech systems usually fail twice: first because there is not enough audio/text data, and second because the available evaluation scripts do not preserve the phonemic structure of the language being measured. This paper argues that N'Ko offers a different path. Because N'Ko is a phonetic, right-to-left script designed for Manding languages and equipped with tone, nasalization, and documented foreign-sound diacritics, it can function as an extensible phonemic substrate: a deterministic sound-code for co

preprint render candidatescore 100

Dead Circuits: Activation Profiling and Script Invisibility in Large Language Models

Large language models achieve remarkable performance on languages written in Latin, Cyrillic, CJK, and Arabic scripts. We ask what happens when these models encounter a script that is absent from their pre-training data. We perform activation profiling---a per-layer ``brain scan''---of Qwen3-8B processing 100 parallel English/N'Ko sentence pairs. N'Ko is an alphabetic script serving over 40 million Manding-language speakers across West Africa, engineered in 1949 with a strict phoneme-to-grapheme bijection, explicit

preprint render candidatescore 100

Script Invisibility Is Structural: Activation Profiling Across Three LLM Families

A prior study demonstrated that Qwen3-8B processes N'Ko text with severely diminished neural activation compared to English, a phenomenon termed \emph{script invisibility}. That finding left an open question: is the deficit specific to one model, or is it a structural property of all models trained on corpora where N'Ko is absent? We answer this by performing identical activation profiling---per-layer extraction of L2 norm, Shannon entropy, sparsity, and kurtosis---on three architecturally distinct models: Qwen3-8B

preprint render candidatescore 100

Beyond Controlled Comparison: Deployment Properties of Script-Aware ASR for N'Ko

Controlled experiments show that phonetically transparent scripts yield lower CER for CTC-based ASR. But ASR systems are not evaluated in controlled conditions---they encounter unseen vocabulary, new speakers, and domain shift. This paper assembles deployment-relevant evidence for Bambara ASR systems using N'Ko (bijective script) and Latin (many-to-many script), anchored by the verified 20.57\% N'Ko trajectory checkpoint but drawing on both current and historical experiments. First, \textbf{compositional generaliza

preprint render candidatescore 100

Dead Circuits: Script Invisibility and Representation Failure for N'Ko in Large Language Models

This paper studies \emph{script invisibility}: the condition in which a large language model accepts a writing system as valid Unicode while allocating little functional internal representation to it. The test case is \nko{}, the script designed by Solomana Kante for Manding languages. \nko{} is not a noisy informal encoding of Bambara, Maninka, or Dioula. It is a dedicated alphabetic system in the Unicode block U+07C0--U+07FF, with a close mapping between Manding phonology and written symbols, explicit diacritic m

preprint render candidatescore 100

Against WER: Phonemic Evaluation, Orthographic Transparency, and the Script Advantage for Manding ASR

Automatic speech recognition for Manding languages is usually reported through Latin-script word error rate. This paper argues that the metric is scientifically weak for the research question at hand. If the goal is to evaluate whether an ASR system recognizes Bambara, Maninka, Dioula, or related Manding speech, then the scoring units should preserve the acoustic-phonemic distinctions carried by the language. Latin Bambara orthography is useful and socially real, but it is not a lossless measurement interface: it u

preprint render candidatescore 100

Script-Native ASR for N'Ko: Anticipatory Transformer CTC Decoding and the CER Anchor

This paper preserves the technical ASR center of the \nko{} research program: an archived script-native trajectory checkpoint reporting \anchorcer{} character error rate on a \corpusn{}-pair Bambara corpus snapshot. The model uses frozen Whisper large-v3 acoustic features, a trainable Transformer CTC decoder, and a compact trajectory state that biases attention with speech-dynamic information. The result is the strongest retained ASR artifact in the project and is the correct way to discuss the phrase ``20 CER'' pu

preprint render candidatescore 100

Anticipation Geometry Partition: Row-Level Governance for Script-Native N'Ko ASR Deployment

This paper defines the deployment layer of the \nko{} ASR project: Anticipation Geometry Partition (AGP). AGP is not the acoustic model that produced the archived 20.57\% CER anchor. It begins after ASR. Its role is to convert trajectory and uncertainty signals into row-level decisions about correction, provenance, corpus admission, and deployment eligibility. The motivation is simple: a scalar CER number is not enough to build a trustworthy transcript corpus or a production speech system. A model can make local mi

preprint render candidatescore 100

Recursive Polymodal Synthesis: A Framework for Real-Time Computational Choreography Through Multi-Modal Sensor Fusion

We present Recursive Polymodal Synthesis (RPS), a framework for real-time computational choreography that achieves robust multi-modal sensor fusion through iterative proximal updates with spectral norm constraints, and couples that embodied state to a phrase-conditioned spectrogram diffusion backend for audio generation. The system integrates kinematic, physiological, and rhythmic data streams into a unified embodied representation that drives either smooth control signals or direct audio synthesis in real time. Ou

preprint render candidatescore 100

RESEARCH PAPER TECHNICAL LATEX

We present a mathematically rigorous framework for multi-modal sensor fusion in real-time embodied interaction systems. Our approach, \emph{Recursive Polymodal Synthesis} (RPS), fuses heterogeneous modalities with disparate sampling rates and noise statistics into a coherent latent representation suitable for generative control. The core mechanism is a proximal fixed-point iteration using spectral-norm-constrained relational operators, yielding contraction guarantees and a unique fixed point. We prove geometric con

preprint structure candidatescore 100

Recursive Polymodal Synthesis for Real-Time Embodied Interaction: A Contraction-Based Framework with Provable Convergence

We present a mathematically rigorous framework for multi-modal sensor fusion in real-time embodied interaction systems, coupled to a phrase-conditioned spectrogram diffusion backend for direct audio generation. Our approach, termed Recursive Polymodal Synthesis (RPS), addresses the fundamental challenge of fusing heterogeneous sensor modalities with different noise characteristics, sampling rates, and semantic meanings into a coherent internal representation suitable for generative control. The key innovation is a

preprint structure candidatescore 98

Semantic Kernel for N'Ko Language Processing: A Schema-Locked Approach to Low-Resource Vocabulary Construction

We present a schema-locked, replayable semantic kernel for constructing and validating vocabulary in low-resource languages, with specific application to N'Ko, the indigenous script of the Manding language family. Our system introduces a 7-operator semantic algebra with formal legality grammar, a morphological compiler producing content-addressable forms with stable signatures, and an evidence-driven lifecycle model for vocabulary promotion. The evaluation methodology employs stress-profile-based adversarial testin

preprint structure candidatescore 98

From Dead Circuits to Living Speech: Activation Profiling and Script-Native ASR for N'Ko

N'Ko is an alphabetic script serving over 40 million Manding-language speakers across West Africa, engineered by Solomana Kanté in 1949 with a strict 1:1 phoneme-to-character mapping, explicit tonal diacritics, and zero spelling exceptions. We present a dual-thread investigation into why large language models (LLMs) fail on N'Ko and how to build audio-to-N'Ko speech recognition that bypasses LLMs entirely. **Thread 1 (Diagnostic):** We perform activation profiling — a "brain scan" — of Qwen2-72B-Instruct (4-bit NF4

preprint structure candidatescore 96

Recursive Polymodal Synthesis for Real-Time Embodied Interaction: A Contraction-Based Framework with Provable Convergence

We present a mathematically rigorous framework for multi-modal sensor fusion in real-time embodied interaction systems. Our approach, termed Recursive Polymodal Synthesis (RPS), addresses the fundamental challenge of fusing heterogeneous sensor modalities with different noise characteristics, sampling rates, and semantic meanings into a coherent internal representation suitable for generative control. The key innovation is a proximal fixed-point iteration scheme that enforces cross-modal coherence through spectral-

preprint structure candidatescore 96

Recursive Polymodal Synthesis: A Framework for Real-Time Computational Choreography Through Multi-Modal Sensor Fusion

We present Recursive Polymodal Synthesis (RPS), a framework for real-time computational choreography that achieves robust multi-modal sensor fusion through iterative proximal updates with spectral norm constraints, and couples that embodied state to a phrase-conditioned spectrogram diffusion backend for audio generation. The system integrates kinematic, physiological, and rhythmic data streams into a unified embodied representation that drives either smooth control signals or direct audio synthesis in real time. Ou

preprint structure candidatescore 94

Topological Preference Optimization (TPO): A Novel Training Strategy for Conversational AI

We introduce **Topological Preference Optimization (TPO)**, a novel training methodology that leverages conversation topology and spatial-temporal coordinates to generate preference datasets for language model training. Unlike traditional Direct Preference Optimization (DPO) which relies on human annotations or simple heuristics, TPO extracts preference signals directly from the structural properties of conversation graphs, incorporating hindsight knowledge and topological awareness to create more accurate and cont

preprint structure candidatescore 94

Topological Preference Optimization (TPO): A Novel Training Strategy for Conversational AI

We introduce **Topological Preference Optimization (TPO)**, a novel training methodology that leverages conversation topology and spatial-temporal coordinates to generate preference datasets for language model training. Unlike traditional Direct Preference Optimization (DPO) which relies on human annotations or simple heuristics, TPO extracts preference signals directly from the structural properties of conversation graphs, incorporating hindsight knowledge and topological awareness to create more accurate and cont

preprint structure candidatescore 92

Memory-Augmented Equilibrium Control (MAEC)

This document formalizes **Memory-Augmented Equilibrium Control (MAEC)**, a control-theoretic framework for real-time embodied creative systems. MAEC addresses a class of problems where traditional control theory and reinforcement learning fail: continuous, non-episodic systems that must maintain expressive viability while generating novel outputs. Unlike RL, MAEC has no scalar reward function, no policy optimization loop, and no episodic resets. Instead, it preserves dynamic equilibrium through memory-conditioned

preprint structure candidatescore 90

Retrieval-Centric ASR for N'Ko: Exploiting Script Structure to Beat Sequence-to-Sequence

We present a retrieval-centric automatic speech recognition (ASR) architecture for Bambara, targeting N'Ko script output directly rather than routing through Latin transcription. The central insight is structural: N'Ko enforces a strict 1:1 phoneme-to-grapheme mapping, explicit tonal diacritics, and a mathematically complete syllable inventory of 3,024 entries (all V, VN, CV, and CVN patterns across five tones). This finite, well-structured output space makes retrieval a better fit than sequence-to-sequence decodin

preprint structure candidatescore 88

RAG++: State-Based Retrieval for Life Trajectory Optimization

We present RAG++ (Retrieval-Augmented Generation Plus Plus), a novel retrieval paradigm that extends traditional RAG from semantic text retrieval to **state-space transition retrieval**. Instead of retrieving "relevant documents," RAG++ retrieves **successful state transitions** from a user's personal history and recommends actions based on what worked in similar dynamical regimes. We demonstrate this approach in TrajectoryOS, a life physics modeling system that treats human life as a dynamical system with measurab

preprint structure candidatescore 88

Graph-Augmented Recursive Language Models for Personal Knowledge Systems

We present Cog-RLM, a graph-augmented recursive language model architecture for personal knowledge systems that achieves 90.3% accuracy on a comprehensive 103-question multi-dimensional evaluation using a stock 3-billion parameter model with zero fine-tuning and zero inference cost. Our system extends the Recursive Language Model (RLM) paradigm (Zhang et al., 2025) with three novel contributions: (1) a local knowledge graph providing relationship-aware context retrieval, (2) a hybrid decomposition classifier that s

preprint structure candidatescore 88

Geometric Motifs for Selecting and Routing Coding-Agent Training Data

We present a method for compactly annotating coding agent sessions with behavioral motifs and geometric features, then conditioning training data generation on these annotations. From 834 real multi-project coding sessions spanning 4,633 turn-level records across 50+ applications, we extract 10-category symbolic labels (inscriptions) and 5 continuous geometric scalars. We show that: (1) transition pressure predicts session convergence at 71.8% accuracy (z = 2.72, p < 0.007), (2) advantage-weighted training using th

preprint structure candidatescore 86

Enhanced Topological Preference Optimization: A Unified Framework for Multi-Dimensional Conversation Analysis with Spatial Intelligence and Cross-Conversation Consolidation

We present a comprehensive enhancement to Topological Preference Optimization (TPO) that integrates spatial intelligence, cross-conversation consolidation, and advanced pattern recognition for conversation analysis. Our unified framework processes hierarchical conversation structures through a four-dimensional spatial coordinate system, implements adaptive clustering algorithms for pattern detection, and employs sophisticated natural language processing techniques for knowledge consolidation across conversation bou

preprint structure candidatescore 86

Enhanced Topological Preference Optimization: A Unified Framework for Multi-Dimensional Conversation Analysis with Spatial Intelligence and Cross-Conversation Consolidation

We present a comprehensive enhancement to Topological Preference Optimization (TPO) that integrates spatial intelligence, cross-conversation consolidation, and advanced pattern recognition for conversation analysis. Our unified framework processes hierarchical conversation structures through a four-dimensional spatial coordinate system, implements adaptive clustering algorithms for pattern detection, and employs sophisticated natural language processing techniques for knowledge consolidation across conversation bou

preprint structure candidatescore 86

Mixture of Anticipatory Orthogonal Experts for N'Ko ASR

MAOE-N'Ko, the Mixture of Anticipatory Orthogonal Experts for N'Ko ASR, is a modular speech-language correction architecture that keeps the acoustic model sovereign while allowing language-prior intelligence to act only where it is admissible. The system begins with a verified N'Ko trajectory CTC acoustic model, currently anchored by the Paper 4 reproduction checkpoint with 20.57 percent CER on the locked N'Ko run. Instead of replacing that model with a monolithic audio-language system, MAOE-N'Ko routes each ASR ch

preprint structure candidatescore 84

Organic Vocabulary Acquisition for Low-Resource African Languages: A Video-First Approach to N'Ko and Manding Language Processing

This document presents a novel approach to building state-of-the-art natural language processing systems for N'Ko, Bambara, and related Manding languages spoken by approximately forty million people across West Africa. Unlike traditional corpus-driven methodologies that depend on pre-existing parallel texts such as Bible translations or government documents, we introduce a video-first organic vocabulary discovery system that extracts training data directly from educational YouTube content. The system processes vide

preprint structure candidatescore 82

The First Reference-Backed Proof: How Narrow Repairs Validate the AGP Bridge Architecture

On 2026-04-21, the AGP bridge architecture achieved its first non-synthetic, reference-backed Character Error Rate (CER) improvement: a reduction from 0.7604 to 0.7512 on a curated slice of archived ASR evaluation data. This result, while numerically modest, constitutes a critical architectural validation. It demonstrates that a reference-leakage-free gating system—operating exclusively on hypothesis-side telemetry—can safely admit edits that improve supervised metrics. The improvement was not achieved through broa

preprint structure candidatescore 82

arXiv Submission: Graph-Augmented Recursive Language Models for Personal Knowledge Systems

We present Cog-RLM, a graph-augmented recursive language model architecture for personal knowledge systems that achieves 90.3% accuracy on a comprehensive 103-question evaluation spanning ten cognitive dimensions, using a stock 3-billion parameter model with zero fine-tuning and zero inference cost. Our system extends the Recursive Language Model (RLM) paradigm with three novel contributions: (1) a local knowledge graph providing relationship-aware context retrieval via breadth-first traversal, (2) a hybrid decompo

preprint structure candidatescore 80

research — docs

This paper presents a retrieval-centric architecture for voice-controlled DJ performance that adapts the Speech-to-Order (S2O) streaming pipeline to the domain of professional DJ software, specifically Rekordbox. Instead of parsing transcribed text into intents via a conventional automatic speech recognition (ASR) and natural language understanding stack, the system learns a direct mapping between spoken commands and a catalog of DJ actions derived from Rekordbox’s performance preset mappings. The design combines a

preprint structure candidatescore 80

The Measure-Theoretic Foundation of Inverse Ring Contextual Propagation

We present Inverse Ring Contextual Propagation (I-RCP), a novel mathematical framework for modeling individual conversation dynamics through inverse mapping of response patterns. Unlike traditional approaches that optimize AI responses to match human preferences, I-RCP inverts the learning objective from P(v|u) to P(u|v), creating a direct model of individual response patterns within a rigorous mathematical structure. The framework introduces a three-dimensional coordinate system (x,y,z) that uniquely captures the

preprint structure candidatescore 80

The Measure-Theoretic Foundation of Inverse Ring Contextual Propagation

We present Inverse Ring Contextual Propagation (I-RCP), a novel mathematical framework for modeling individual conversation dynamics through inverse mapping of response patterns. Unlike traditional approaches that optimize AI responses to match human preferences, I-RCP inverts the learning objective from P(v|u) to P(u|v), creating a direct model of individual response patterns within a rigorous mathematical structure. The framework introduces a three-dimensional coordinate system (x,y,z) that uniquely captures the

preprint structure candidatescore 80

RAG++ Specification

RAG++ is a high-performance retrieval engine that provides **statistical priors** from outcome-annotated trajectories. Unlike traditional RAG systems that retrieve text for language model context, RAG++ retrieves structured execution traces and computes distributional statistics for downstream policy conditioning. **Key Insight**: Past execution outcomes encode implicit knowledge about action feasibility, timing, and context-dependent success rates. RAG++ surfaces this knowledge as queryable priors.

preprint structure candidatescore 76

Recursive Language Model Integration: Technical Specification

This document specifies the architecture and implementation of the Recursive Language Model integration within the cc-orchestrator-agent module. The system provides an inference strategy enabling language models to process unbounded-length input contexts through recursive decomposition, treating context as a programmable variable rather than a monolithic prompt payload. This specification covers the theoretical foundation, architectural design, execution semantics, and integration with Graph Kernel memory systems a

preprint structure candidatescore 76

Computational Choreography: A Dense Manuscript for the LUME Stack

Computational choreography is the name for the layer of LUME that interprets a performer's body as a live compositional instrument. It is not a synonym for motion capture, gesture recognition, depth rendering, or visual reactivity, although it depends on all of them. It is the discipline of deciding what the machine believes about the body, how that belief changes over time, how movement becomes intention, and how intention becomes a bounded visual or musical event. The current LUME stack already contains the physi

preprint structure candidatescore 70

Inverse Ring Contextual Propagation: A Mathematical Framework for Learning Individual Response Patterns in Conversational Dynamics

We present Inverse Ring Contextual Propagation (IRCP), a novel mathematical framework for modeling individual conversation dynamics through inverse mapping of response patterns. Unlike traditional approaches that optimize AI responses to match human preferences, IRCP inverts the learning objective from P(v|u) to P(u|v), creating a direct model of individual response patterns within a rigorous mathematical structure. The framework introduces a four-dimensional coordinate system (x,y,z,t) that uniquely captures the d

preprint structure candidatescore 64

Overview

Perfect — here’s a rewritten abstract and overview with the modular breakdown and explicit mention of bidirectional translations across English, French, N’ko, and Bambara.

Technical shelf

Architecture papers

technical paper candidatescore 74

Obsidian Vault Integration — Architecture & Operations Guide

| Store | Type | Weakness | |-------|------|----------| | `memory/*.md` files | Flat Markdown | No linking, manual curation, linear | | Kimi SQLite DB (`kimi_memory.db`) | Structured tables | Queryable but invisible, no graph | | Supabase | Cloud relational | API-only access, no browsing | | Orbit | Semantic memory | Black-box embeddings, no human navigation | | Discord threads | Chat messages | Ephemeral, unsearchable after scroll | | Plan files (`.claude/plans/`) | Task-scoped Markdown | Die when plans complete |

technical paper candidatescore 74

Skill Entity Architecture (SEA) — DEP + Evo-Cubed Analysis

> **Deprecation note (2026-05-13):** Mac3 was the Tier 2 worker host at the time this design doc was authored. Mac3 has since been retired. Forward-looking references to Mac3 (worker pool, async queue, circuit breaker) should be read as **Mac4:8100** (cognitive twin host) in any current/future implementation. The Mac3-era hardware-assignment sections (§2, Step 6, stress-test §🔴 Mac3 Async Worker Reliability) are kept for historical accuracy but are **obsolete for v1.1 onward**. See SOOP-2 launch memory for migrati

technical paper candidatescore 72

The N'Ko Compute Network

Every existing compute network, from Bitcoin to Akash to Render, treats workers as interchangeable machines. The worker's identity, language, and culture are irrelevant to the protocol. This paper proposes a fundamentally different architecture: a compute network where the worker's linguistic and cultural competence IS the valuable computation, and the protocol pays for it in STX on Bitcoin's Layer 2. The N'Ko Compute Network combines three production systems into a single protocol: (1) the EPOCH Protocol, eight Cl

technical paper candidatescore 70

Cross-Pollination Architecture Specification

1. [Overview](#1-overview) 2. [Prediction Engine](#2-prediction-engine) 3. [Safety Rails](#3-safety-rails) 4. [ACC Integration — Swipeable Prediction Cards](#4-acc-integration--swipeable-prediction-cards) 5. [Autonomous Mode](#5-autonomous-mode) 6. [Push Notifications](#6-push-notifications) 7. [Feedback Loop & DPO Training](#7-feedback-loop--dpo-training) 8. [API Reference](#8-api-reference) 9. [Data Models](#9-data-models) 10. [Deployment & Configuration](#10-deployment--configuration)

technical paper candidatescore 70

ARCHITECTURE

3 > **Document Purpose**: Comprehensive operational map of all entities, projects, capabilities, and systems under host management. > > **Last Updated**: 2026-01-18 > > **Document Type**: Living reference — update as entities evolve

technical paper candidatescore 66

NKo Brain Scanner — Unified Architecture

1. **No thin wrappers.** `nko_core/__init__.py` handles all imports from `Desktop/NKo/` via `sys.path`. No separate `phonetics.py`, `transliterate.py`, `morphology.py` wrapper files. If `from nko_core import phonetics` works, no wrapper is needed. 2. **No premature release.** HuggingFace upload happens AFTER mode collapse is fixed and the model generates coherent N'Ko text. Not before. 3. **Architecture matches disk.** Every file listed below exists. Every number is current. If reality changes, this doc gets update

technical paper candidatescore 66

The Architecture of Gemini Live Voice Control for Rekordbox: A Technical Essay

The Gemini Live voice control system for Rekordbox represents a sophisticated orchestration of modern machine learning services, real-time audio processing, and command dispatch mechanisms. At its highest level, this system transforms the ephemeral quality of human speech into precise digital instructions that control professional DJ software. The architecture embodies a philosophy of delegation, where each component performs a specialized role in service of a singular purpose: to translate the DJ's vocal intent in

technical paper candidatescore 66

Unified Agent OS — Architecture Document

The Unified Agent OS (UAOS) merges three autonomous systems — **Pulse** (development), **Heartbeat** (monitoring), and **Dream Weaver / Noosphere** (incubation) — into a single coherent platform. Today these systems share the filesystem implicitly and bridge state through ad-hoc scripts (`noosphere_bridge.py`, `cadence_bridge.py`). The UAOS replaces those stitches with a unified state bus, a single lifecycle model, and formalized handoff protocols.

technical paper candidatescore 62

HUB-2: Threaded Messaging Architecture

Replace Discord's channel model with a threaded architecture tailored to OpenClaw: - **Threaded, not channel-based** — every conversation is a thread with a parent category - **Quad-inspired layout** — 4 concurrent contexts visible (like the terminal quad) - **Feed integration** — 33 Prefect flows post directly to threads (no Discord webhooks) - **Agent-native** — threads can be owned by agents, not just humans - **Voice-first** — every thread supports voice input/output - **Offline-capable** — SwiftData persistenc

technical paper candidatescore 62

DJ Voice Control: Retrieval-Centric Architecture

The DJ Voice Control system adapts the speech-to-order retrieval-centric paradigm for real-time DJ performance control. Instead of matching spoken orders to menu items, we match spoken commands to DJ actions and keyboard shortcuts. This approach provides superior accuracy compared to traditional ASR + NLU pipelines by learning a direct semantic mapping between audio utterances and command intents.

technical paper candidatescore 62

Unified RAG++ Architecture

1. [System Overview](#1-system-overview) 2. [Layer Architecture](#2-layer-architecture) 3. [Foundation Layer: Rust Core](#3-foundation-layer-rust-core) 4. [Data Layer: Supabase Schema](#4-data-layer-supabase-schema) 5. [Ingestion Layer: Prompt Pipeline](#5-ingestion-layer-prompt-pipeline) 6. [ML Layer: CognitiveTwin](#6-ml-layer-cognitivetwin) 7. [Orchestration Layer: Orbit](#7-orchestration-layer-orbit) 8. [Integration Layer: Prompt Logger](#8-integration-layer-prompt-logger) 9. [API Layer: Endpoints Reference](#9

technical paper candidatescore 62

Graph Kernel

**Version**: 1.1.0 **Last Updated**: 2026-01-03 **Status**: Production **Parent**: [02-TRAJECTORY_OS.md](02-TRAJECTORY_OS.md) **Related**: [08-RAG_PLUS_PLUS.md](08-RAG_PLUS_PLUS.md), [09-ORBIT.md](09-ORBIT.md), [17-AGENT_SDK.md](17-AGENT_SDK.md) **Crate (Rust)**: `core/cc-graph-kernel/` **Service**: Cloud Run `graph-kernel` **Tests**: 140+ passing **Schema Version**: 1.0.0

technical paper candidatescore 62

DEP — Pipeline Protocol Architecture Audit

The Pipeline Protocol is a 3-table Supabase schema (`pipeline_definitions`, `pipeline_runs`, `pipeline_step_logs`) with 2 VIEWs, 1 trigger, and a shared TypeScript module consumed by 3 edge functions. It bridges to Nexus observability via a Prometheus exporter, Grafana dashboard, Prefect watcher, and a Next.js portal page.

technical paper candidatescore 62

Server Architecture Integration — Aura

Aura currently has a flat thread model: `HubThread` objects live in `hub_threads`, categorized by `ThreadCategory` and `ThreadType`, with no parent container. The Discord ecosystem, however, operates on three distinct architectural patterns — each representing an evolution in how the Clawdbot gateway handles task dispatch and message delivery. This document describes those three patterns in abstract form and specifies how they integrate into Aura as a **secluded feature** that does not interfere with the existing t

technical paper candidatescore 62

Inscription Architecture Map

This document maps every file, specification, and implementation related to the N'Ko inscription system, sigils, tokenization, the EPOCH protocol, Stacks/Clarity contracts, PsiChain, the cognitive twin, and anticipation geometry. It traces how they interconnect to form a single pipeline that encodes a life's computational dynamics as hash-chained N'Ko inscriptions settled on Bitcoin.

technical paper candidatescore 58

Milkmen Delivery - Shared Agent CRM Architecture Plan

-- Many-to-many: agents can share territories CREATE TABLE territory_agents ( territory_id uuid REFERENCES territories(id), agent_id uuid REFERENCES agents(id), role text NOT NULL CHECK (role IN ('seeder', 'closer', 'both')), visit_order integer NOT NULL, -- 1 = visits first, 2 = visits second active boolean DEFAULT true, PRIMARY KEY (territory_id, agent_id) ); ```

technical paper candidatescore 58

Stage 4: FORGE -- Final Creative Architecture

FirstDate is a production-first reality dating series that treats transparency as its format, not its liability. Three asymmetric roles (Host, Applicant, Viewer) orbit a 10-week seasonal arc set in Miami, where every consent ritual, every sponsor deal, every casting decision is designed to be seen. The app is not a dating platform with a show bolted on. It is a show management system whose public membrane happens to look like a dating app. Swiping is a personality quiz, not a match engine. The episode is the produc

technical paper candidatescore 56

SOOP-2 Architecture

This document is the authoritative architecture reference for SOOP-2. It describes what exists today (SEA at ~40% shipped), what SOOP-2 adds, and exactly how the pieces connect. Every section maps to at least one acceptance criterion from the launch checkpoint.

technical paper candidatescore 56

ELP-2 — Survivor Architecture

Three of four scrutiny layers returned a convergent verdict: ELP-1 as written cannot ship. The CRITICAL findings are structural — a /inject format mismatch that breaks the primary dispatch path, a concurrent SKILL.md write race enabled by a 5-minute claim TTL, and a Syncthing-backed filesystem fallback that is architecturally described but physically unprovisioned. These are not tuning problems; they are root-cause failures.

technical paper candidatescore 54

Beyond -- Anticipation Geometry for NUMU FARE

Beyond is a NUMU FARE package (`numu-beyond`) that uses anticipation geometry to orchestrate three AI paradigms through a single geometric coordination signal. Instead of each paradigm implementing its own convergence detection, retry logic, and stall recovery, Beyond provides a universal orchestration loop driven by four mathematical scalars computed over the trajectory of bus events.

technical paper candidatescore 54

CC AI Pipeline - Complete Implementation

- **335 conversations** from 5 data sources - **9,572 messages** (user + assistant) - **2,158 notes** from personal records - **Auto-categorized** by topic: - music_production: 76 conversations - machine_learning: 47 conversations - personal: 38 conversations - business: 32 conversations - computational_choreography: 23 conversations

technical paper candidatescore 54

IRCP-DLM Fusion Strategy - Complete Analysis

✅ **dlm/models/** - Pydantic models with `ChainCoordinate` (x, y, z, t, n_parts) ✅ **dlm/engine/** - Processing engines including `ircp_embedder.py` (exists!) ✅ **dlm/inference/** - Conversation and prompt managers ✅ **dlm/response/** - Recently refactored with production-grade utilities

technical paper candidatescore 54

TPO Mathematical Supplement: Detailed Formulations and Proofs

**Definition 1.1** (Conversation Graph): A conversation graph $G = (V, E, \mathbf{C}, \mathbf{M})$ where: - $V = \{v_1, v_2, ..., v_n\}$ is the set of message nodes - $E \subseteq V \times V$ is the set of directed edges representing reply relationships - $\mathbf{C}: V \rightarrow \mathbb{R}^5$ maps each node to its DLM coordinates - $\mathbf{M}: V \rightarrow \Sigma^*$ maps each node to its message content