Back to corpus
research noteexperiment writeup candidatescore 28
Gemini Live Voice Control - High Accuracy DJ Commands
This uses **Google's Gemini Live API** for superior speech recognition accuracy compared to standard speech recognition libraries.
Full HTML reader
Read the full artifact
Extracted abstract or opening context
This uses **Google's Gemini Live API** for superior speech recognition accuracy compared to standard speech recognition libraries.
- **Much Better Accuracy**: Uses Google's advanced AI models - **Real-time Streaming**: Low-latency voice recognition - **Context Understanding**: Understands natural speech patterns - **Voice Activity Detection**: Built-in VAD to filter out background noise
1. **Gemini API Key**: Get one from [https://ai.google.dev/](https://ai.google.dev/) 2. **Python Dependencies**: Install with `pip install -r requirements.txt`
**Left Deck:** - "play left" / "pause left" / "stop left" - "cue 1 left" / "cue 2 left" / ... / "cue 8 left" - "censor left" / "filter left" / "echo left" - "tempo up left" / "faster left"
**Right Deck:** - "play right" / "pause right" / "stop right" - "cue 1 right" / "cue 2 right" / ... / "cue 4 right" - "censor right" / "filter right" / "echo right" - "tempo up right" / "faster right"
Promotion decision
What has to happen next
Attach run IDs, datasets, metrics, and reproduction commands.
Why this is not always a full paper yet
Corpus pages are public-safe readers for discovered workspace artifacts. They are not automatically final papers. A corpus item becomes a polished paper only after the editable source, evidence checkpoints, references, figures, render path, and release status are attached through the paper schema.