Recognize a single sentence from the microphone.

POST /stt/recognize

Starts a cpal capture -> VAD -> Whisper pipeline, returns the first recognized sentence, and then tears the pipeline down. The request long-polls: it stays open until speech is detected or the 60-second timeout expires.
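Because the call can stay open for up to 60 seconds, a client needs a read timeout longer than that. The following is a minimal sketch of such a client using only the Python standard library. The base URL `http://localhost:8080`, the empty request body, and the helper names `build_request` and `recognize_once` are assumptions made for illustration. The response schema is not shown here, so the sketch returns the raw response body as text.

```python
import urllib.request

# Assumption: host and port are placeholders, not taken from the documentation.
BASE_URL = "http://localhost:8080"


def build_request(base_url: str = BASE_URL) -> urllib.request.Request:
    """Build a POST request for /stt/recognize.

    Assumption: the endpoint needs no request body, so an empty one is sent.
    """
    url = base_url.rstrip("/") + "/stt/recognize"
    return urllib.request.Request(url, data=b"", method="POST")


def recognize_once(base_url: str = BASE_URL, client_timeout: float = 75.0) -> str:
    """Block until the server answers and return the raw response body.

    client_timeout is deliberately longer than the server's 60-second
    long-poll window, so the client does not give up first.
    Non-2xx responses raise urllib.error.HTTPError.
    """
    request = build_request(base_url)
    with urllib.request.urlopen(request, timeout=client_timeout) as response:
        return response.read().decode("utf-8")


if __name__ == "__main__":
    print(recognize_once())
```

How the server reports a timeout with no speech is not specified here, so callers may want to catch `urllib.error.HTTPError` and `TimeoutError` and treat both as "nothing recognized".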

Request

Responses

Recognized text