Recognize a single sentence from the microphone.

POST /stt/recognize

Starts a cpal capture -> VAD -> Whisper pipeline, returns the first recognized sentence, then destroys the pipeline. The request long-polls until speech is detected or the 60 s timeout elapses.

Request

Responses

Recognized text
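
Because the endpoint long-polls for up to 60 s, a client needs a read timeout longer than that. The following is a minimal Python sketch of such a client, not a reference implementation. It rests on several assumptions that this page does not confirm: the base URL `http://localhost:8080` is a placeholder, the request is assumed to carry no body, and the response body is returned as raw text because its exact format is not specified here. The names `build_request`, `recognize`, `BASE_URL`, and `CLIENT_TIMEOUT_S` are hypothetical.

```python
import urllib.request
from typing import Optional

# Assumption: the server address is not given on this page; adjust as needed.
BASE_URL = "http://localhost:8080"

# The endpoint long-polls for up to 60 s, so the client waits slightly longer
# to avoid giving up before the server answers.
CLIENT_TIMEOUT_S = 65.0


def build_request(base_url: str = BASE_URL) -> urllib.request.Request:
    """Build a POST request for /stt/recognize (assumed to need no body)."""
    url = base_url.rstrip("/") + "/stt/recognize"
    return urllib.request.Request(url, data=b"", method="POST")


def recognize(base_url: str = BASE_URL,
              timeout: float = CLIENT_TIMEOUT_S) -> Optional[str]:
    """Send the request and return the raw response body, or None on failure.

    The body format (plain text or JSON) is not specified on this page,
    so it is returned undecoded beyond UTF-8.
    """
    try:
        with urllib.request.urlopen(build_request(base_url), timeout=timeout) as resp:
            return resp.read().decode("utf-8")
    except OSError:
        # Covers connection errors, HTTP error statuses, and client-side timeouts.
        return None
```

Calling `recognize()` blocks until the server returns a sentence or the wait ends. A `None` result means the call failed on the client side or the server answered with an error status.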
