preprint structure candidatescore 100
Standard transformers attend based on learned position encodings (sinusoidal, RoPE, ALiBi) that encode *where* tokens are in a sequence but not *what the sequence is doing* as a geometric process. I introduce the Anticipatory Transformer, a modified transformer architecture where seven geometric scalars derived from Anticipation Geometry (commitment, uncertainty, transition pressure, recovery margin, phase stiffness, novelty, stability) steer the multi-head attention mechanism via additive bias. The trajectory bias
preprint structure candidatescore 100
I present Anticipation Geometry, a mathematical framework that characterizes trajectories through arbitrary state spaces using seven geometric scalars: commitment, uncertainty, transition pressure, recovery margin, phase stiffness, novelty, and stability. These scalars are domain-general, operating on any sequence of vectors in a metric space equipped with a differentiable time parameter. I combine this framework with knowledge graph path-derived reward signals to create a unified system for both trajectory analysi
preprint structure candidatescore 100
We present the Cognitive Twin architecture, a three-component system that produces a faithful digital replica of a human operator's conversational persona without baking volatile domain knowledge into model weights. The architecture separates personality (a LoRA adapter trained on the operator's historical responses), knowledge (a live knowledge graph queried at inference time), and trajectory awareness (geometric scalars characterizing conversation dynamics). We find that a Qwen2.5-3B model with LoRA adapters on a
preprint structure candidatescore 100
We present a framework for constructing autonomous cognitive twins from large-scale conversational corpora. Building on the Recursive Polymodal Synthesis (RPS) framework, which fuses heterogeneous sensor modalities through Lipschitz-constrained fixed-point iteration, we extend cross-modal coherence theory from physical signals (accelerometer, gyroscope, heart rate) to cognitive modalities: linguistic style ($\mathcal{V}_L$), decision patterns ($\mathcal{V}_D$), domain knowledge ($\mathcal{V}_K$), value alignment ($
preprint render candidatescore 100
This document presents the mathematical foundations for constructing a cognitive twin using Recursive Polymodal Synthesis (RPS). We extend the original RPS framework from sensor modalities (motion, heart rate, audio) to cognitive modalities (linguistic style, decision patterns, knowledge, values, temporal behavior). We prove convergence of the cognitive synthesis operator, derive the coherence energy functional, establish bounds on identity drift, and formalize the autonomy ratchet protocol. All results build on th
preprint structure candidatescore 100
We present an enhanced Topological Preference Optimization (TPO) system that integrates spatial intelligence and cross-conversation consolidation for advanced conversation analysis. Our unified framework combines the topological structure analysis of TPO with the spatial coordinate systems and ring topology of Ring Contextual Propagation (RCP), creating a comprehensive system for modeling conversation dynamics and generating preference datasets. The system employs 4D spatial coordinates (x, y, z, t) to represent hi
preprint structure candidatescore 100
We present an enhanced Topological Preference Optimization (TPO) system that integrates spatial intelligence and cross-conversation consolidation for advanced conversation analysis. Our unified framework combines the topological structure analysis of TPO with the spatial coordinate systems and ring topology of Ring Contextual Propagation (RCP), creating a comprehensive system for modeling conversation dynamics and generating preference datasets. The system employs 4D spatial coordinates (x, y, z, t) to represent hi
preprint structure candidatescore 100
CC-MotionGen is a diffusion-based generative system that produces time-indexed motion trajectories conditioned on audio features and optional high-level context. The system targets phrase-level generation: it consumes precomputed audio feature tensors and precomputed motion latents, trains a temporal one-dimensional U-Net denoiser under a Gaussian diffusion process, and performs inference by sampling multiple candidate futures and selecting the best output using a two-stage validation pipeline. The validation pipel
preprint structure candidatescore 100
Retrieval-Augmented Generation (RAG) systems typically treat retrieved context as a flat collection of documents, ignoring the structural and temporal relationships between conversation turns. We present RAG++, a trajectory-aware retrieval system that positions memories in a 5-dimensional coordinate space (depth, sibling order, homogeneity, temporal position, and complexity) and enforces context admissibility through cryptographically-verified slicing. Our system introduces three key innovations: (1) **Inverse Ring
preprint render candidatescore 100
The construction of personalized language model instances capable of reproducing individual cognitive patterns, stylistic signatures, and domain-specific conceptual frameworks represents a significant advancement in the development of AI systems that function as cognitive extensions rather than generic tools. This paper presents the CognitiveTwin framework, a comprehensive architecture for creating personalized language model instances through trajectory-aware supervised fine-tuning on conversational interaction hi
preprint structure candidatescore 100
> **Manuscript Type:** Full Research Paper (V2 — Post-Audit Definitive Edition) > **Track:** AI Systems & Knowledge Infrastructure > **Date:** July 2026 > **Revision:** 2.0 — Incorporates DEP Audit findings, Evo³ roadmap, and implemented improvements
preprint structure candidatescore 100
Autonomous AI agent systems face a fundamental challenge: constructing reproducible, trustworthy context windows from large conversational histories while enforcing governance policies over what information may influence downstream decisions. We present the **Graph Kernel**, a deterministic context slicing engine implemented as a single Rust binary (~15 KLOC) that combines a lightweight knowledge graph triple store with cryptographically-signed, policy-governed context window construction. Unlike general-purpose gr
preprint render candidatescore 100
Autonomous AI agents making consequential decisions require infrastructure that ensures every reasoning step is traceable, reproducible, and verifiable. We present the \textbf{Graph Kernel}, a deterministic provenance engine implemented as a single Rust binary (${\sim}15$~KLOC) that produces cryptographically-signed, policy-governed context windows---termed \emph{admissible evidence bundles}---for autonomous agent reasoning. Unlike general-purpose graph databases, vector stores, or RAG pipelines, the Graph Kernel i
preprint render candidatescore 100
Autonomous AI agent systems face a fundamental challenge: constructing reproducible, trustworthy context windows from large conversational histories while enforcing governance policies over what information may influence downstream decisions. We present the \textbf{Graph Kernel}, a deterministic context slicing engine implemented as a single Rust binary (${\sim}15$ KLOC) that combines a lightweight knowledge graph triple store with cryptographically-signed, policy-governed context window construction. Unlike genera
preprint render candidatescore 100
% ============================================================ We present \textbf{Cog-RLM}, a graph-augmented recursive language model architecture for personal knowledge systems that achieves 90.3\% accuracy on a comprehensive 103-question evaluation spanning ten cognitive dimensions, using a stock 3-billion parameter model with zero fine-tuning and zero inference cost. Our system extends the Recursive Language Model (RLM) paradigm~\citep{zhang2025rlm} with three novel contributions: (1)~a local knowledge graph pr
preprint structure candidatescore 100
We present Anticipation Geometry, a mathematical framework that characterizes trajectories through arbitrary state spaces using seven geometric scalars: commitment, uncertainty, transition pressure, recovery margin, phase stiffness, novelty, and stability. Originally developed for physical motion capture in the Comp-Core system, we prove these scalars are domain-general, operating on any sequence of vectors in a metric space equipped with a differentiable time parameter. We combine this framework with knowledge gra
preprint structure candidatescore 100
We present Computational Choreography, a deterministic pipeline that transforms heterogeneous sensor input -- phone accelerometer, smartwatch heart rate, full-body IMU skeleton -- into real-time audio synthesis through geometric anticipation signals. The system guarantees deterministic replay: identical sensor input always produces identical audio output. The key innovation is the Anticipation Kernel, which computes seven geometric scalars (commitment, uncertainty, transition pressure, recovery margin, phase stiffn
preprint structure candidatescore 100
Recent work on Domain-Specific Superintelligence (Belova et al., 2026) demonstrates that knowledge graph-derived training curricula produce domain specialists that outperform models 400x their size. However, this approach treats knowledge graphs as static training scaffolding: constructed once, used for fine-tuning, then discarded at inference. We present an alternative: runtime knowledge graph integration, where the graph is queried live during inference with provenance-tracked context slicing, real-time entity re
preprint structure candidatescore 100
Standard supervised fine-tuning (SFT) for language model agents operates on input-output pairs: a prompt and the response the model should produce. This format captures *what* an agent said but discards *why* it made specific decisions. We present KARL (Knowledge-Augmented Reinforcement Learning), a trajectory intelligence system that trains language model agents from full session traces rather than isolated completions. A trajectory in KARL records every tool call, file read, code edit, bash command, success signa
preprint structure candidatescore 100
Context window limitations constrain the fidelity of small personality models. A 4B parameter model with a 32K token context can hold roughly 8,000 words of conversation history before truncation begins discarding information critical to persona coherence. We present the Inscription-Conditioned Cognitive Twin (ICCT), an architecture that addresses this bottleneck by encoding conversation history as N'Ko inscriptions rather than English prose. The encoding uses 10 N'Ko sigils, each a single Unicode character derived
preprint structure candidatescore 100
We present KARL-Edge, an adaptation of the Knowledge Agents via Reinforcement Learning (KARL) framework to multi-tool software engineering agents running on commodity Apple Silicon hardware. Where the original KARL system (Chang et al., 2026) trains enterprise search agents using full off-policy RL with binary reward signals, our system introduces three architectural contributions: (1) a 5-signal composite reward function that decomposes trajectory quality into outcome, process, efficiency, verification, and consis
preprint structure candidatescore 100
We present the Trajectory Memory Ledger, implemented in KARL, a schema-normalized experience replay system for improving AI coding agent performance through closed-loop feedback. The ledger records complete tool-use sequences during real coding sessions, normalizes them into an append-only schema, scores them using a six-signal composite reward function (outcome, process, efficiency, verification, consistency, and wasted motion), and uses the highest-scoring trajectories to generate advantage-weighted supervised fi
preprint structure candidatescore 100
Recent work on Domain-Specific Superintelligence (Belova et al., 2026) demonstrates that knowledge graph-derived training curricula produce domain specialists that outperform models 400x their size. However, this approach treats knowledge graphs as static training scaffolding: constructed once, used for fine-tuning, then discarded at inference. We present an alternative: runtime knowledge graph integration, where the graph is queried live during inference with provenance-tracked context slicing, real-time entity re
preprint render candidatescore 100
% Reading Tone from the Signal: % Featural Acoustic Coding for Tone Resolution in N'Ko Speech Recognition % Compiles with pdflatex (MacTeX). N'Ko shown via Unicode codepoints + % transliteration; IPA via tipa; architecture figure via TikZ.
preprint render candidatescore 100
We present a systematic study of how large language models process N'Ko (\texttt{U+07C0--U+07FF}), an alphabetic script used by over 40 million Manding-language speakers in West Africa. Through activation profiling (``brain scanning'') of Qwen3-8B before and after fine-tuning, we demonstrate that: (1) fine-tuning concentrates N'Ko adaptation in the top 8 transformer layers, reducing activation magnitudes in reasoning layers while amplifying output confidence; (2) a three-stage training pipeline---continued pre-trai
preprint render candidatescore 100
N'Ko is an alphabetic script serving over 40 million Manding-language speakers across West Africa, engineered by Solomana Kant\'{e} in 1949 with a strict 1:1 phoneme-to-character mapping, explicit tonal diacritics, and zero spelling exceptions. We present a dual-thread investigation into why large language models (LLMs) fail on N'Ko and how to build audio-to-N'Ko speech recognition that bypasses LLMs entirely. \textbf{Thread 1 (Diagnostic):} We perform activation profiling---a ``brain scan''---of Qwen2-72B-Instruct
preprint render candidatescore 100
We present a systematic study of how large language models process N'Ko (\texttt{U+07C0--U+07FF}), an alphabetic script used by over 40 million Manding-language speakers in West Africa. Through activation profiling (``brain scanning'') of Qwen3-8B before and after fine-tuning, we demonstrate that: (1) fine-tuning concentrates N'Ko adaptation in the top 8 transformer layers, reducing activation magnitudes in reasoning layers while amplifying output confidence; (2) a three-stage training pipeline---continued pre-trai
preprint render candidatescore 100
\nko{} is an alphabetic script serving over forty million Manding-language speakers across West Africa, engineered by Solomana Kant\'e in 1949 with a strict one-to-one phoneme-to-character mapping, explicit tonal diacritics, and zero spelling exceptions. We present a dual-thread investigation into why large language models fail on \nko{} and how to construct audio-to-\nko{} speech recognition that bypasses such models entirely. In the diagnostic thread, we perform activation profiling of Qwen2-72B-Instruct (4-bit N
preprint render candidatescore 100
This document collects the formal mathematical results underlying the N'Ko Brain Scanner and ASR system. We present five main theorems with proofs, three derivations of key quantities, and two corollaries that connect the LLM diagnostic thread to the ASR construction thread. The results establish: (1) a phonetic transparency advantage for CTC decoding on bijective scripts, (2) bounds on the translation tax in under-represented scripts, (3) completeness and soundness of the FSM phonotactic validator, (4) a circuit d
preprint structure candidatescore 100
Low-resource speech systems usually fail twice: first because there is not enough audio/text data, and second because the available evaluation scripts do not preserve the phonemic structure of the language being measured. This paper argues that N'Ko offers a different path. Because N'Ko is a phonetic, right-to-left script designed for Manding languages and equipped with tone, nasalization, and documented foreign-sound diacritics, it can function as an extensible phonemic substrate: a deterministic sound-code for co
preprint render candidatescore 100
\usepackage[margin=1in]{geometry} \usepackage{booktabs} \usepackage{array} \usepackage{amsmath} \usepackage{amssymb} \usepackage{graphicx} \usepackage{hyperref} \usepackage[numbers]{natbib} \usepackage{xcolor} \usepackage{longtable} \usepackage{caption} \usepackage{seqsplit}
preprint render candidatescore 100
Large language models achieve remarkable performance on languages written in Latin, Cyrillic, CJK, and Arabic scripts. We ask what happens when these models encounter a script that is absent from their pre-training data. We perform activation profiling---a per-layer ``brain scan''---of Qwen3-8B processing 100 parallel English/N'Ko sentence pairs. N'Ko is an alphabetic script serving over 40 million Manding-language speakers across West Africa, engineered in 1949 with a strict phoneme-to-grapheme bijection, explicit
preprint render candidatescore 100
\documentclass[11pt]{article} \usepackage{acl} \usepackage{times} \usepackage{latexsym} \usepackage{graphicx} \usepackage{booktabs} \usepackage{amsmath} \usepackage{amssymb} \usepackage{hyperref} \usepackage{multirow} \usepackage{xcolor} \usepackage{enumitem} \usepackage{tipa}
preprint render candidatescore 100
A prior study demonstrated that Qwen3-8B processes N'Ko text with severely diminished neural activation compared to English, a phenomenon termed \emph{script invisibility}. That finding left an open question: is the deficit specific to one model, or is it a structural property of all models trained on corpora where N'Ko is absent? We answer this by performing identical activation profiling---per-layer extraction of L2 norm, Shannon entropy, sparsity, and kurtosis---on three architecturally distinct models: Qwen3-8B
preprint render candidatescore 100
% Does Script Design Matter? Phonetic Transparency and CTC Decoding for N'Ko ASR % Target: Interspeech 2026 / ICASSP 2027
preprint render candidatescore 100
Controlled experiments show that phonetically transparent scripts yield lower CER for CTC-based ASR. But ASR systems are not evaluated in controlled conditions---they encounter unseen vocabulary, new speakers, and domain shift. This paper assembles deployment-relevant evidence for Bambara ASR systems using N'Ko (bijective script) and Latin (many-to-many script), anchored by the verified 20.57\% N'Ko trajectory checkpoint but drawing on both current and historical experiments. First, \textbf{compositional generaliza
preprint render candidatescore 100
This paper studies \emph{script invisibility}: the condition in which a large language model accepts a writing system as valid Unicode while allocating little functional internal representation to it. The test case is \nko{}, the script designed by Solomana Kante for Manding languages. \nko{} is not a noisy informal encoding of Bambara, Maninka, or Dioula. It is a dedicated alphabetic system in the Unicode block U+07C0--U+07FF, with a close mapping between Manding phonology and written symbols, explicit diacritic m
preprint render candidatescore 100
Automatic speech recognition for Manding languages is usually reported through Latin-script word error rate. This paper argues that the metric is scientifically weak for the research question at hand. If the goal is to evaluate whether an ASR system recognizes Bambara, Maninka, Dioula, or related Manding speech, then the scoring units should preserve the acoustic-phonemic distinctions carried by the language. Latin Bambara orthography is useful and socially real, but it is not a lossless measurement interface: it u
preprint render candidatescore 100
This paper preserves the technical ASR center of the \nko{} research program: an archived script-native trajectory checkpoint reporting \anchorcer{} character error rate on a \corpusn{}-pair Bambara corpus snapshot. The model uses frozen Whisper large-v3 acoustic features, a trainable Transformer CTC decoder, and a compact trajectory state that biases attention with speech-dynamic information. The result is the strongest retained ASR artifact in the project and is the correct way to discuss the phrase ``20 CER'' pu
preprint render candidatescore 100
This paper defines the deployment layer of the \nko{} ASR project: Anticipation Geometry Partition (AGP). AGP is not the acoustic model that produced the archived 20.57\% CER anchor. It begins after ASR. Its role is to convert trajectory and uncertainty signals into row-level decisions about correction, provenance, corpus admission, and deployment eligibility. The motivation is simple: a scalar CER number is not enough to build a trustworthy transcript corpus or a production speech system. A model can make local mi
preprint render candidatescore 100
We present Recursive Polymodal Synthesis (RPS), a framework for real-time computational choreography that achieves robust multi-modal sensor fusion through iterative proximal updates with spectral norm constraints, and couples that embodied state to a phrase-conditioned spectrogram diffusion backend for audio generation. The system integrates kinematic, physiological, and rhythmic data streams into a unified embodied representation that drives either smooth control signals or direct audio synthesis in real time. Ou
preprint render candidatescore 100
We present a mathematically rigorous framework for multi-modal sensor fusion in real-time embodied interaction systems. Our approach, \emph{Recursive Polymodal Synthesis} (RPS), fuses heterogeneous modalities with disparate sampling rates and noise statistics into a coherent latent representation suitable for generative control. The core mechanism is a proximal fixed-point iteration using spectral-norm-constrained relational operators, yielding contraction guarantees and a unique fixed point. We prove geometric con
preprint structure candidatescore 100
We present a mathematically rigorous framework for multi-modal sensor fusion in real-time embodied interaction systems, coupled to a phrase-conditioned spectrogram diffusion backend for direct audio generation. Our approach, termed Recursive Polymodal Synthesis (RPS), addresses the fundamental challenge of fusing heterogeneous sensor modalities with different noise characteristics, sampling rates, and semantic meanings into a coherent internal representation suitable for generative control. The key innovation is a
preprint structure candidatescore 98
We present a schema-locked, replayable semantic kernel for constructing and validating vocabulary in low-resource languages, with specific application to N'Ko, the indigenous script of the Manding language family. Our system introduces a 7-operator semantic algebra with formal legality grammar, a morphological compiler producing content-addressable forms with stable signatures, and an evidence-driven lifecycle model for vocabulary promotion. The evaluation methodology employs stress-profile-based adversarial testin
preprint structure candidatescore 98
N'Ko is an alphabetic script serving over 40 million Manding-language speakers across West Africa, engineered by Solomana Kanté in 1949 with a strict 1:1 phoneme-to-character mapping, explicit tonal diacritics, and zero spelling exceptions. We present a dual-thread investigation into why large language models (LLMs) fail on N'Ko and how to build audio-to-N'Ko speech recognition that bypasses LLMs entirely. **Thread 1 (Diagnostic):** We perform activation profiling — a "brain scan" — of Qwen2-72B-Instruct (4-bit NF4
preprint structure candidatescore 96
We present a mathematically rigorous framework for multi-modal sensor fusion in real-time embodied interaction systems. Our approach, termed Recursive Polymodal Synthesis (RPS), addresses the fundamental challenge of fusing heterogeneous sensor modalities with different noise characteristics, sampling rates, and semantic meanings into a coherent internal representation suitable for generative control. The key innovation is a proximal fixed-point iteration scheme that enforces cross-modal coherence through spectral-
preprint structure candidatescore 96
We present Recursive Polymodal Synthesis (RPS), a framework for real-time computational choreography that achieves robust multi-modal sensor fusion through iterative proximal updates with spectral norm constraints, and couples that embodied state to a phrase-conditioned spectrogram diffusion backend for audio generation. The system integrates kinematic, physiological, and rhythmic data streams into a unified embodied representation that drives either smooth control signals or direct audio synthesis in real time. Ou
preprint structure candidatescore 94
We introduce **Topological Preference Optimization (TPO)**, a novel training methodology that leverages conversation topology and spatial-temporal coordinates to generate preference datasets for language model training. Unlike traditional Direct Preference Optimization (DPO) which relies on human annotations or simple heuristics, TPO extracts preference signals directly from the structural properties of conversation graphs, incorporating hindsight knowledge and topological awareness to create more accurate and cont
preprint structure candidatescore 94
We introduce **Topological Preference Optimization (TPO)**, a novel training methodology that leverages conversation topology and spatial-temporal coordinates to generate preference datasets for language model training. Unlike traditional Direct Preference Optimization (DPO) which relies on human annotations or simple heuristics, TPO extracts preference signals directly from the structural properties of conversation graphs, incorporating hindsight knowledge and topological awareness to create more accurate and cont
preprint structure candidatescore 92
This document formalizes **Memory-Augmented Equilibrium Control (MAEC)**, a control-theoretic framework for real-time embodied creative systems. MAEC addresses a class of problems where traditional control theory and reinforcement learning fail: continuous, non-episodic systems that must maintain expressive viability while generating novel outputs. Unlike RL, MAEC has no scalar reward function, no policy optimization loop, and no episodic resets. Instead, it preserves dynamic equilibrium through memory-conditioned
preprint structure candidatescore 90
We present a retrieval-centric automatic speech recognition (ASR) architecture for Bambara, targeting N'Ko script output directly rather than routing through Latin transcription. The central insight is structural: N'Ko enforces a strict 1:1 phoneme-to-grapheme mapping, explicit tonal diacritics, and a mathematically complete syllable inventory of 3,024 entries (all V, VN, CV, and CVN patterns across five tones). This finite, well-structured output space makes retrieval a better fit than sequence-to-sequence decodin
preprint structure candidatescore 88
We present RAG++ (Retrieval-Augmented Generation Plus Plus), a novel retrieval paradigm that extends traditional RAG from semantic text retrieval to **state-space transition retrieval**. Instead of retrieving "relevant documents," RAG++ retrieves **successful state transitions** from a user's personal history and recommends actions based on what worked in similar dynamical regimes. We demonstrate this approach in TrajectoryOS, a life physics modeling system that treats human life as a dynamical system with measurab
preprint structure candidatescore 88
We present Cog-RLM, a graph-augmented recursive language model architecture for personal knowledge systems that achieves 90.3% accuracy on a comprehensive 103-question multi-dimensional evaluation using a stock 3-billion parameter model with zero fine-tuning and zero inference cost. Our system extends the Recursive Language Model (RLM) paradigm (Zhang et al., 2025) with three novel contributions: (1) a local knowledge graph providing relationship-aware context retrieval, (2) a hybrid decomposition classifier that s
preprint structure candidatescore 88
We present a method for compactly annotating coding agent sessions with behavioral motifs and geometric features, then conditioning training data generation on these annotations. From 834 real multi-project coding sessions spanning 4,633 turn-level records across 50+ applications, we extract 10-category symbolic labels (inscriptions) and 5 continuous geometric scalars. We show that: (1) transition pressure predicts session convergence at 71.8% accuracy (z = 2.72, p < 0.007), (2) advantage-weighted training using th
preprint structure candidatescore 86
We present a comprehensive enhancement to Topological Preference Optimization (TPO) that integrates spatial intelligence, cross-conversation consolidation, and advanced pattern recognition for conversation analysis. Our unified framework processes hierarchical conversation structures through a four-dimensional spatial coordinate system, implements adaptive clustering algorithms for pattern detection, and employs sophisticated natural language processing techniques for knowledge consolidation across conversation bou
preprint structure candidatescore 86
We present a comprehensive enhancement to Topological Preference Optimization (TPO) that integrates spatial intelligence, cross-conversation consolidation, and advanced pattern recognition for conversation analysis. Our unified framework processes hierarchical conversation structures through a four-dimensional spatial coordinate system, implements adaptive clustering algorithms for pattern detection, and employs sophisticated natural language processing techniques for knowledge consolidation across conversation bou
preprint structure candidatescore 86
MAOE-N'Ko, the Mixture of Anticipatory Orthogonal Experts for N'Ko ASR, is a modular speech-language correction architecture that keeps the acoustic model sovereign while allowing language-prior intelligence to act only where it is admissible. The system begins with a verified N'Ko trajectory CTC acoustic model, currently anchored by the Paper 4 reproduction checkpoint with 20.57 percent CER on the locked N'Ko run. Instead of replacing that model with a monolithic audio-language system, MAOE-N'Ko routes each ASR ch
preprint structure candidatescore 84
This document presents a novel approach to building state-of-the-art natural language processing systems for N'Ko, Bambara, and related Manding languages spoken by approximately forty million people across West Africa. Unlike traditional corpus-driven methodologies that depend on pre-existing parallel texts such as Bible translations or government documents, we introduce a video-first organic vocabulary discovery system that extracts training data directly from educational YouTube content. The system processes vide
preprint structure candidatescore 82
On 2026-04-21, the AGP bridge architecture achieved its first non-synthetic, reference-backed Character Error Rate (CER) improvement: a reduction from 0.7604 to 0.7512 on a curated slice of archived ASR evaluation data. This result, while numerically modest, constitutes a critical architectural validation. It demonstrates that a reference-leakage-free gating system—operating exclusively on hypothesis-side telemetry—can safely admit edits that improve supervised metrics. The improvement was not achieved through broa
preprint structure candidatescore 82
We present Cog-RLM, a graph-augmented recursive language model architecture for personal knowledge systems that achieves 90.3% accuracy on a comprehensive 103-question evaluation spanning ten cognitive dimensions, using a stock 3-billion parameter model with zero fine-tuning and zero inference cost. Our system extends the Recursive Language Model (RLM) paradigm with three novel contributions: (1) a local knowledge graph providing relationship-aware context retrieval via breadth-first traversal, (2) a hybrid decompo
preprint structure candidatescore 80
This paper presents a retrieval-centric architecture for voice-controlled DJ performance that adapts the Speech-to-Order (S2O) streaming pipeline to the domain of professional DJ software, specifically Rekordbox. Instead of parsing transcribed text into intents via a conventional automatic speech recognition (ASR) and natural language understanding stack, the system learns a direct mapping between spoken commands and a catalog of DJ actions derived from Rekordbox’s performance preset mappings. The design combines a
preprint structure candidatescore 80
We present Inverse Ring Contextual Propagation (I-RCP), a novel mathematical framework for modeling individual conversation dynamics through inverse mapping of response patterns. Unlike traditional approaches that optimize AI responses to match human preferences, I-RCP inverts the learning objective from P(v|u) to P(u|v), creating a direct model of individual response patterns within a rigorous mathematical structure. The framework introduces a three-dimensional coordinate system (x,y,z) that uniquely captures the
preprint structure candidatescore 80
We present Inverse Ring Contextual Propagation (I-RCP), a novel mathematical framework for modeling individual conversation dynamics through inverse mapping of response patterns. Unlike traditional approaches that optimize AI responses to match human preferences, I-RCP inverts the learning objective from P(v|u) to P(u|v), creating a direct model of individual response patterns within a rigorous mathematical structure. The framework introduces a three-dimensional coordinate system (x,y,z) that uniquely captures the
preprint structure candidatescore 80
RAG++ is a high-performance retrieval engine that provides **statistical priors** from outcome-annotated trajectories. Unlike traditional RAG systems that retrieve text for language model context, RAG++ retrieves structured execution traces and computes distributional statistics for downstream policy conditioning. **Key Insight**: Past execution outcomes encode implicit knowledge about action feasibility, timing, and context-dependent success rates. RAG++ surfaces this knowledge as queryable priors.
preprint structure candidatescore 76
This document specifies the architecture and implementation of the Recursive Language Model integration within the cc-orchestrator-agent module. The system provides an inference strategy enabling language models to process unbounded-length input contexts through recursive decomposition, treating context as a programmable variable rather than a monolithic prompt payload. This specification covers the theoretical foundation, architectural design, execution semantics, and integration with Graph Kernel memory systems a
preprint structure candidatescore 76
Computational choreography is the name for the layer of LUME that interprets a performer's body as a live compositional instrument. It is not a synonym for motion capture, gesture recognition, depth rendering, or visual reactivity, although it depends on all of them. It is the discipline of deciding what the machine believes about the body, how that belief changes over time, how movement becomes intention, and how intention becomes a bounded visual or musical event. The current LUME stack already contains the physi
preprint structure candidatescore 70
We present Inverse Ring Contextual Propagation (IRCP), a novel mathematical framework for modeling individual conversation dynamics through inverse mapping of response patterns. Unlike traditional approaches that optimize AI responses to match human preferences, IRCP inverts the learning objective from P(v|u) to P(u|v), creating a direct model of individual response patterns within a rigorous mathematical structure. The framework introduces a four-dimensional coordinate system (x,y,z,t) that uniquely captures the d
preprint structure candidatescore 64
Perfect — here’s a rewritten abstract and overview with the modular breakdown and explicit mention of bidirectional translations across English, French, N’ko, and Bambara.