View Source GoogleApi.Dialogflow.V2.Model.GoogleCloudDialogflowV2SpeechToTextConfig (google_api_dialogflow v0.80.0)

Configures speech transcription for ConversationProfile.

Attributes

  • alternativeLanguageCodes (type: list(String.t), default: nil) - Defines the list of other language codes in addition to the one provided by the conversation profile that may be detected as the language code for the utterances over the conversation. See Language Support for a list of the currently supported language codes.
  • audioEncoding (type: String.t, default: nil) - Audio encoding of the audio content to process.
  • enableWordInfo (type: boolean(), default: nil) - If true, Dialogflow returns SpeechWordInfo in StreamingRecognitionResult with information about the recognized speech words, e.g. start and end time offsets. If false or unspecified, Speech doesn't return any word-level information.
  • languageCode (type: String.t, default: nil) - The language of the supplied audio. Dialogflow does not do translations. See Language Support for a list of the currently supported language codes. Note that queries in the same session do not necessarily need to specify the same language.
  • model (type: String.t, default: nil) - Which Speech model to select. Select the model best suited to your domain to get best results. If a model is not explicitly specified, then Dialogflow auto-selects a model based on other parameters in the SpeechToTextConfig and Agent settings. If enhanced speech model is enabled for the agent and an enhanced version of the specified model for the language does not exist, then the speech is recognized using the standard version of the specified model. Refer to Cloud Speech API documentation for more details. If you specify a model, the following models typically have the best performance: - phone_call (best for Agent Assist and telephony) - latest_short (best for Dialogflow non-telephony) - command_and_search Leave this field unspecified to use Agent Speech settings for model selection.
  • sampleRateHertz (type: integer(), default: nil) - Sample rate (in Hertz) of the audio content sent in the query. Refer to Cloud Speech API documentation for more details.
  • speechModelVariant (type: String.t, default: nil) - The speech model used in speech to text. SPEECH_MODEL_VARIANT_UNSPECIFIED, USE_BEST_AVAILABLE will be treated as USE_ENHANCED. It can be overridden in AnalyzeContentRequest and StreamingAnalyzeContentRequest request. If enhanced model variant is specified and an enhanced version of the specified model for the language does not exist, then it would emit an error.
  • useTimeoutBasedEndpointing (type: boolean(), default: nil) - Use timeout based endpointing, interpreting endpointer sensitivy as seconds of timeout value.

Summary

Functions

Unwrap a decoded JSON object into its complex fields.

Types

@type t() :: %GoogleApi.Dialogflow.V2.Model.GoogleCloudDialogflowV2SpeechToTextConfig{
  alternativeLanguageCodes: [String.t()] | nil,
  audioEncoding: String.t() | nil,
  enableWordInfo: boolean() | nil,
  languageCode: String.t() | nil,
  model: String.t() | nil,
  sampleRateHertz: integer() | nil,
  speechModelVariant: String.t() | nil,
  useTimeoutBasedEndpointing: boolean() | nil
}

Functions

@spec decode(struct(), keyword()) :: struct()

Unwrap a decoded JSON object into its complex fields.