ghost pepper

what it does

ghost pepper is a macOS hold-to-talk speech-to-text app that runs entirely on local models. hold a key, talk, release, and the text appears. no cloud, no API calls, no data leaving your machine.
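the hold-to-talk loop is simple enough to sketch as a tiny state machine. this is a minimal illustration, not ghost pepper's actual code: the `HoldToTalk` class, its method names, and `fake_transcribe` are all hypothetical, and the transcriber is a placeholder for whatever local model does the real work.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class HoldToTalk:
    """buffer audio while the key is held, transcribe on release (illustrative only)."""

    # hypothetical stand-in for a local speech-to-text model
    transcribe: Callable[[bytes], str]
    recording: bool = False
    _chunks: List[bytes] = field(default_factory=list)

    def key_down(self) -> None:
        # start a fresh recording; repeated key-down events while held are ignored
        if not self.recording:
            self.recording = True
            self._chunks.clear()

    def feed(self, chunk: bytes) -> None:
        # audio that arrives while the key is up is dropped
        if self.recording:
            self._chunks.append(chunk)

    def key_up(self) -> str:
        # stop recording and hand the buffered audio to the local model
        if not self.recording:
            return ""
        self.recording = False
        audio = b"".join(self._chunks)
        self._chunks.clear()
        return self.transcribe(audio)


def fake_transcribe(audio: bytes) -> str:
    # placeholder: a real app would run a local model here
    return f"<{len(audio)} bytes of audio>"


if __name__ == "__main__":
    ptt = HoldToTalk(transcribe=fake_transcribe)
    ptt.feed(b"ignored")  # key is up, so this chunk is dropped
    ptt.key_down()
    ptt.feed(b"hello ")
    ptt.feed(b"world")
    print(ptt.key_up())
```

nothing in the loop touches the network: audio only exists in the buffer between key-down and key-up, and the only thing that leaves it is the text returned by the transcriber.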

why it matters

voice input was the last gap in the local-first AI stack. LLMs run locally, embeddings run locally, but voice still bounced through someone else’s servers. ghost pepper closes that gap.

the creator is already using it as a voice interface for their other agents: not just dictation, but “speech to agent.” in that setup, the input layer goes local before the inference layer does.

key features

self.md relevance

once your voice never leaves your machine, the entire personal AI pipeline can run air-gapped. ghost pepper is the front door to offline agents. pair it with local TTS (Voxtral) for full-duplex voice without touching a server.