Back to corpus
research noteexperiment writeup candidatescore 28

Gemini Live Voice Control - High Accuracy DJ Commands

This uses **Google's Gemini Live API** for superior speech recognition accuracy compared to standard speech recognition libraries.

Full HTML reader

Read the full artifact

Open in new tab

Extracted abstract or opening context

This uses **Google's Gemini Live API** for superior speech recognition accuracy compared to standard speech recognition libraries. - **Much Better Accuracy**: Uses Google's advanced AI models - **Real-time Streaming**: Low-latency voice recognition - **Context Understanding**: Understands natural speech patterns - **Voice Activity Detection**: Built-in VAD to filter out background noise 1. **Gemini API Key**: Get one from [https://ai.google.dev/](https://ai.google.dev/) 2. **Python Dependencies**: Install with `pip install -r requirements.txt` **Left Deck:** - "play left" / "pause left" / "stop left" - "cue 1 left" / "cue 2 left" / ... / "cue 8 left" - "censor left" / "filter left" / "echo left" - "tempo up left" / "faster left" **Right Deck:** - "play right" / "pause right" / "stop right" - "cue 1 right" / "cue 2 right" / ... / "cue 4 right" - "censor right" / "filter right" / "echo right" - "tempo up right" / "faster right"

Promotion decision

What has to happen next

Attach run IDs, datasets, metrics, and reproduction commands.

Why this is not always a full paper yet

Corpus pages are public-safe readers for discovered workspace artifacts. They are not automatically final papers. A corpus item becomes a polished paper only after the editable source, evidence checkpoints, references, figures, render path, and release status are attached through the paper schema.