Speech-To-Text (STT)
Manages the transcription of audio recordings into text using local AI.
Data Model
Table transcript { id string [pk] sessionId string [note: 'Links to conversation/session'] recordingId string [note: 'Links to audio/audio-recording'] createdAt datetime note: 'Always shared between participants'}
Table transcript_segment { id string [pk] transcriptId string [ref: > transcript.id] participantId string [null, note: 'Speaker; links to conversation/participant'] text string startMs integer endMs integer}API
request-transcription Queues a recording for transcription.
| Parameter | Type | Required | Description |
|---|---|---|---|
recording-id | string | — |
transcription-completed | Parameter | Type | Required | Description |
|---|---|---|---|
session-id | string | — | |
transcript-id | string | — |
transcribe-audio Converts an audio buffer into a structured Transcript with timestamps.
| Parameter | Type | Required | Description |
|---|---|---|---|
audio-data | object | — | Buffer |
| Parameter | Type | Required | Description |
|---|---|---|---|
id | string | ✓ | |
length | number | ✓ |
Declared Errors
transcription-failed
| Error ID | Code | Category | Description |
|---|---|---|---|
transcription-failed | TODO | | Thrown when audio cannot be transcribed locally. |
| Source | Condition | Reaction | Rule |
|---|---|---|---|
session-ended | — | request-transcription | When a session ends, transcription is automatically requested. |
request-transcription | — | transcription-completed | Transcriptions are announced |
request-transcription | — | transcription-failed | Transcriptions are announced |