architecture, technical paper candidate, score 48

SpeakFlow V2 — Architecture & Business Plan

SpeakFlow is a **privacy-first, offline-first voice OS** that replaces typing across every app on Mac, iOS, and eventually Windows. It competes directly with Wispr Flow ($10M ARR, $700M valuation, 270 Fortune 500 customers) by exploiting its three biggest vulnerabilities: cloud-only processing, 800MB RAM bloat, and zero customer support.

Extracted abstract or opening context

| Dimension | Wispr Flow | SpeakFlow |
|-----------|-----------|-----------|
| **Processing** | 100% cloud (audio sent to servers) | 100% on-device (Apple Speech, CoreML) |
| **Privacy** | Screen capture + cloud upload | Zero data leaves device |
| **RAM** | 800MB idle | Target: <80MB |
| **CPU idle** | 8%+ | Target: <1% |
| **Offline** | No. Dead without internet | Full functionality offline |
| **Price** | $12/mo ($144/yr) | $49 lifetime or $4/mo |
| **Latency** | ~700ms (network round-trip) | <200ms (on-device) |
| **Platforms** | Mac, Win, iOS, Android | Mac, iOS (Win later) |
| **Support** | 0% response rate (Trustpilot 2.7/5) | Every ticket, every review |
| **N'Ko** | No | Native transliteration + keyboard |
| **AI Commands** | Cloud LLM (Llama on Baseten) | On-device MLX (Gemma 3 4B) + mesh fallback |
| **Architecture** | Electron (Windows) | Native Swift (all platforms) |

1. **Privacy refugees**: Developers, lawyers, medical professionals actively searching for local alternatives (documented in Reddit threads, Trustpilot cancellations)
2. **Resource-conscious users**: 800MB RAM is absurd for dictation. Position as "dictation that doesn't tax your system"
3. **Price-sensitive users**: $12/mo for a utility feels wrong. $49 lifetime matches proven price points (Voibe, Superwhisper)
4. **Offline workers**: Trains, planes, cafes with bad wifi, rural areas. Wispr is dead without internet.

#### 1. CommandModeService

Wispr Flow's stickiest feature. Voice-driven text editing after dictation.

#### 2. WhisperCoreMLService

Enhanced recognition for accents, code, and whisper-level input.
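The extract does not say how CommandModeService turns a spoken instruction into an edit, so the following is only a minimal sketch of one plausible shape for voice-driven editing after dictation: match the transcribed utterance against a small table of patterns and apply the corresponding edit to the dictated text. It is written in Python for brevity, although the plan calls for native Swift. Every name here (`COMMANDS`, `apply_command`, the three example phrases) is hypothetical and not taken from SpeakFlow.

```python
import re
from typing import Tuple

# Hypothetical edit functions; each takes the dictated text and the regex
# match for the spoken command, and returns the edited text.
def _delete_last_word(text: str, _match: re.Match) -> str:
    # Drop the final word along with the whitespace around it.
    return re.sub(r"\s*\S+\s*$", "", text)

def _replace(text: str, match: re.Match) -> str:
    old, new = match.group("old"), match.group("new")
    # Edit the last occurrence, assuming commands target recent dictation.
    idx = text.rfind(old)
    if idx == -1:
        return text  # nothing to replace; leave the text untouched
    return text[:idx] + new + text[idx + len(old):]

def _capitalize(text: str, _match: re.Match) -> str:
    # Uppercase only the first character of the dictated text.
    return text[:1].upper() + text[1:]

# Assumed command vocabulary, for illustration only.
COMMANDS = [
    (re.compile(r"^delete (the )?last word$", re.I), _delete_last_word),
    (re.compile(r"^replace (?P<old>.+?) with (?P<new>.+)$", re.I), _replace),
    (re.compile(r"^capitali[sz]e( that)?$", re.I), _capitalize),
]

def apply_command(text: str, utterance: str) -> Tuple[str, bool]:
    """Return (edited_text, handled). handled is False for unknown commands."""
    spoken = utterance.strip().rstrip(".!?")  # recognizers often add punctuation
    for pattern, edit in COMMANDS:
        match = pattern.match(spoken)
        if match:
            return edit(text, match), True
    return text, False
```

A caller would pass the current dictated text and the next transcribed utterance, for example `apply_command("Meet at noon on Friday", "Replace noon with 3 pm.")`. When the second element of the result is `False`, the utterance matched no pattern. The AI Commands row of the table suggests that the on-device model would handle such open-ended instructions, but that routing is an assumption here.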

Promotion decision

What has to happen next

Promote into a technical note or architecture paper with implementation anchors.

Why this is not always a full paper yet

Corpus pages are public-safe readers for discovered workspace artifacts. They are not automatically final papers. A corpus item becomes a polished paper only after the editable source, evidence checkpoints, references, figures, render path, and release status are attached through the paper schema.