Grand Diomande Research ยท Full HTML Reader
SpeakFlow ๐๏ธ
Hold **โฅSpace** anywhere on macOS โ speak โ text appears in whatever app is focused. No cloud, no subscription, no latency.
Full Public Reader
SpeakFlow ๐๏ธ
> Wispr Flow competitor โ free, offline, private voice-to-text for macOS + iOS
What It Does
Hold โฅSpace anywhere on macOS โ speak โ text appears in whatever app is focused. No cloud, no subscription, no latency.
Project Status
| Wave | Tasks | Status |
|---|---|---|
| Wave 0: Foundation | SF-0.1 โ SF-0.4 | โ Done โ builds, pipeline wired |
| Wave 1: Mac Injection | SF-1.1 โ SF-1.4 | โฌ Queued |
| Wave 2: AI Cleanup | SF-2.1 โ SF-2.3 | โฌ Queued |
| Wave 3: iOS Keyboard | SF-3.1 โ SF-3.5 | โฌ Queued |
| Wave 4: Ship | SF-4.1 โ SF-4.4 | โฌ Queued |
Building
bash
# Build macOS app (no code signing needed for dev)
cd Desktop/SpeakFlow
xcodebuild -scheme SpeakFlow -destination 'platform=macOS' build \
CODE_SIGN_IDENTITY="" CODE_SIGNING_REQUIRED=NO CODE_SIGNING_ALLOWED=NO GENERATE_INFOPLIST_FILE=YESArchitecture
SpeakFlow (macOS menu bar)
โโโ AppDelegate โ status bar icon, menu
โโโ AppCoordinator โ orchestrates the pipeline
โโโ HotKeyManager โ โฅSpace global hotkey (Carbon API + CGEvent tap)
โโโ SpeechService โ SFSpeechRecognizer, on-device, offline
SpeakFlowCore (shared framework)
โโโ SpeechService.swift โ SF-0.2: STT engine
โโโ HotKeyManager.swift โ SF-0.3: global hotkey (hold pattern)
โโโ PermissionsManager.swift โ accessibility + speech permissions
โโโ TranscriptionResult.swift โ shared models
SpeakFlowMobile (iOS host)
โโโ Minimal app for keyboard extension distribution
SpeakFlowKeyboard (Wave 3)
โโโ TBD โ iOS custom keyboard extensionWave 0 Pipeline
โฅSpace held โ HotKeyManager.onHotKeyDown
โ SpeechService.startRecording()
โโโ AVAudioEngine โ SFSpeechAudioBufferRecognitionRequest
โโโ SFSpeechRecognizer (on-device, en-US)
โโโ AsyncStream<TranscriptionResult>
โโโ console (Wave 1: will inject into focused app)
โฅSpace released โ SpeechService.stopRecording() โ finalTextExisting Code Merged
- `Desktop/Speak/SpeakReader/` โ HotKeyManager, PermissionsManager ported
- `Desktop/Speak/services/macos_speech_service.py` โ API design reference
- `Desktop/cross-script-bridge/ios-keyboard/` โ KeyboardViewController (Wave 3 source)
- `Desktop/voice-corpus/` โ 382 voice notes (Wave 2 training data)
Discord Ops Channel
Track progress: [#speakflow-ops](https://discord.com/channels/1470100197941051623/1473426349849972739)
Promotion Decision
Attach run IDs, datasets, metrics, and reproduction commands.
Source Anchor
NKo/macos/SpeakFlow/README.md
Detected Structure
Method ยท Evaluation ยท Code Anchors ยท Architecture