GoogleApi.ContentWarehouse.V1.Model.ImageRepositorySpeechRecognitionResult (google_api_content_warehouse v0.3.0)

A speech recognition result corresponding to a portion of the audio. This field is copied from cloud/speech/v1p1beta1/cloud_speech.proto. Amarna needs a standalone version because v1p1beta1/cloud_speech.proto is in the form of a versioned proto, which breaks other prod code that depends on Amarna's video schema.


  • alternatives (type: list(GoogleApi.ContentWarehouse.V1.Model.ImageRepositorySpeechRecognitionAlternative.t), default: nil) - May contain one or more recognition hypotheses (up to the maximum specified in max_alternatives). These alternatives are ordered in terms of accuracy, with the top (first) alternative being the most probable, as ranked by the recognizer.
  • channelTag (type: integer(), default: nil) - For multi-channel audio, this is the channel number corresponding to the recognized result for the audio from that channel. For audio_channel_count = N, its output values can range from '1' to 'N'.
  • languageCode (type: String.t, default: nil) - The BCP-47 language tag of the language in this result. This language code was detected to have the most likelihood of being spoken in the audio.
  • resultEndTime (type: String.t, default: nil) - Time offset of the end of this result relative to the beginning of the audio. This field is internal-only and is used to order results based on their timestamps.
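As a rough illustration of how these fields might be read, the sketch below works on a plain map with the same keys as the attributes above. The module name `ResultSketch`, the sample values, and the `transcript`/`confidence` keys on each alternative are assumptions made for the example. So is the `"1.500s"` form of `resultEndTime`, which is not specified on this page.

```elixir
defmodule ResultSketch do
  # Hypothetical helper module, not part of google_api_content_warehouse.

  # The first alternative is the most probable one, as ranked by the recognizer.
  def top_alternative(%{alternatives: [top | _]}), do: top
  def top_alternative(_result), do: nil

  # Assumes resultEndTime is a "<number>s" string such as "1.500s".
  # Returns the offset in seconds as a float, or nil if it does not parse.
  def end_time_seconds(%{resultEndTime: text}) when is_binary(text) do
    case Float.parse(text) do
      {seconds, "s"} -> seconds
      _ -> nil
    end
  end

  def end_time_seconds(_result), do: nil
end

# Sample data shaped like the attributes listed above (values are made up).
result = %{
  alternatives: [
    %{transcript: "hello world", confidence: 0.92},
    %{transcript: "hollow world", confidence: 0.41}
  ],
  channelTag: 1,
  languageCode: "en-US",
  resultEndTime: "1.500s"
}

IO.inspect(ResultSketch.top_alternative(result).transcript)
IO.inspect(ResultSketch.end_time_seconds(result))
```

Because the function heads match on map keys only, the same helpers would also accept a struct with these fields. Every field defaults to nil, so both helpers fall back to nil instead of raising.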





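@type t() ::
  %GoogleApi.ContentWarehouse.V1.Model.ImageRepositorySpeechRecognitionResult{
    alternatives:
      [
        GoogleApi.ContentWarehouse.V1.Model.ImageRepositorySpeechRecognitionAlternative.t()
      ]
      | nil,
    channelTag: integer() | nil,
    languageCode: String.t() | nil,
    resultEndTime: String.t() | nil
  }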


@spec decode(struct(), keyword()) :: struct()

Unwrap a decoded JSON object into its complex fields.
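
One reading of "unwrap" here is that nested values, which a JSON parser leaves as plain maps, are converted into their own model structs. The sketch below illustrates that idea only and is not the library's implementation. The modules `AlternativeSketch` and `ResultDecodeSketch` are hypothetical stand-ins, and the `transcript` and `confidence` keys are assumed for the example.

```elixir
defmodule AlternativeSketch do
  # Hypothetical stand-in for the alternative model; field names are assumed.
  defstruct [:transcript, :confidence]
end

defmodule ResultDecodeSketch do
  # Hypothetical stand-in with the same fields as the model documented here.
  defstruct [:alternatives, :channelTag, :languageCode, :resultEndTime]

  # Same shape as the documented spec: decode(struct(), keyword()) :: struct().
  # Converts each string-keyed map under :alternatives into a struct.
  def decode(%__MODULE__{alternatives: alts} = value, _options) when is_list(alts) do
    %{value | alternatives: Enum.map(alts, &to_alternative/1)}
  end

  # Nothing to unwrap when :alternatives is nil.
  def decode(value, _options), do: value

  defp to_alternative(%{} = map) do
    %AlternativeSketch{
      transcript: map["transcript"],
      confidence: map["confidence"]
    }
  end
end

# A top-level struct whose nested alternatives are still plain maps,
# as they might be straight after JSON parsing.
raw = %ResultDecodeSketch{
  alternatives: [%{"transcript" => "hello world", "confidence" => 0.92}],
  languageCode: "en-US"
}

decoded = ResultDecodeSketch.decode(raw, [])
IO.inspect(decoded.alternatives)
```

After the call, each element of `alternatives` is an `AlternativeSketch` struct, so its fields can be read with dot syntax instead of string keys.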