kral Documentation

Voice and read-aloud

You can talk to kral and it can talk back. Both directions run through the same gateway as everything else and are billed against the same credit.

Dictating a prompt

Click the microphone in the prompt box, speak, click again to stop. The recording is transcribed by a speech-to-text model (Whisper-grade) and lands in the input as text, so you can correct it before sending. The browser will ask for microphone permission the first time.

Dictation handles natural speech well: punctuation, multiple languages, technical vocabulary. Transcription is billed per second of audio; a short dictation costs a fraction of a cent.

Read-aloud

Every answer has a read-aloud control. The text is converted by a text-to-speech model with natural voices and billed per character. This is useful for listening to long answers while you do something else, or for proofreading your own drafts by ear.
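
Since dictation is billed per second of audio and read-aloud per character, a rough cost estimate is a two-term sum. The sketch below is illustrative only: the function name and parameters are hypothetical, and the rates are deliberately left as arguments because this page does not state them. Look up the actual prices before relying on the result.

```python
def estimate_speech_cost(audio_seconds, tts_characters,
                         rate_per_second, rate_per_character):
    """Estimate the combined cost of dictation and read-aloud.

    audio_seconds      -- length of dictated audio, in seconds
    tts_characters     -- number of characters sent to read-aloud
    rate_per_second    -- transcription price per second (not given here)
    rate_per_character -- text-to-speech price per character (not given here)
    """
    values = (audio_seconds, tts_characters, rate_per_second, rate_per_character)
    if any(v < 0 for v in values):
        raise ValueError("durations, counts, and rates must be non-negative")
    # Dictation scales with audio length, read-aloud with text length.
    return audio_seconds * rate_per_second + tts_characters * rate_per_character
```

The result is in whatever unit the rates are expressed in.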

Voice settings

In settings, the speech tab lets you pick the voice for read-aloud and tune dictation behavior, such as automatically sending the prompt after you stop talking.

API access

The same speech models are available programmatically via /v1/audio/transcriptions and /v1/audio/speech. See Endpoints.
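
As a minimal sketch of what calling these two endpoints might look like, the Python below builds the HTTP requests without sending them. Only the two paths come from this page. The base URL, the bearer-token header, the JSON fields (`input`, `voice`), and the multipart field name (`file`) are assumptions modeled on common speech APIs, so check Endpoints for the actual request format.

```python
import json
import uuid
import urllib.request

BASE_URL = "https://kral.example"  # hypothetical host, replace with the real one
API_KEY = "YOUR_API_KEY"           # placeholder credential


def build_speech_request(text, voice="default",
                         base_url=BASE_URL, api_key=API_KEY):
    """Build a text-to-speech request (JSON field names are assumed)."""
    body = json.dumps({"input": text, "voice": voice}).encode("utf-8")
    return urllib.request.Request(
        url=f"{base_url}/v1/audio/speech",
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
            "Content-Type": "application/json",
        },
    )


def build_transcription_request(audio_bytes, filename="dictation.wav",
                                base_url=BASE_URL, api_key=API_KEY):
    """Build a speech-to-text request as a multipart upload (assumed shape)."""
    boundary = uuid.uuid4().hex
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'
        "Content-Type: application/octet-stream\r\n\r\n"
    ).encode("utf-8")
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")
    return urllib.request.Request(
        url=f"{base_url}/v1/audio/transcriptions",
        data=head + audio_bytes + tail,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
            "Content-Type": f"multipart/form-data; boundary={boundary}",
        },
    )
```

To send either request, pass it to `urllib.request.urlopen`. Under these assumptions the speech response body would hold audio bytes and the transcription response would hold the recognized text.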