Behaviour for speech-to-text services. Processes audio frames and produces transcription frames.
@callback run_stt(audio :: binary(), state :: term()) :: {:ok, output_frames :: [struct()], state :: term()} | {:continue, state :: term()}