GoogleApi.Dialogflow.V2.Model.GoogleCloudDialogflowV2InputAudioConfig (google_api_dialogflow v0.66.2) View Source

Instructs the speech recognizer how to process the audio content.


  • audioEncoding (type: String.t, default: nil) - Required. Audio encoding of the audio content to process.
  • disableNoSpeechRecognizedEvent (type: boolean(), default: nil) - Only used in Participants.AnalyzeContent and Participants.StreamingAnalyzeContent. If false and recognition doesn't return any result, trigger NO_SPEECH_RECOGNIZED event to Dialogflow agent.
  • enableWordInfo (type: boolean(), default: nil) - If true, Dialogflow returns SpeechWordInfo in StreamingRecognitionResult with information about the recognized speech words, e.g. start and end time offsets. If false or unspecified, Speech doesn't return any word-level information.
  • languageCode (type: String.t, default: nil) - Required. The language of the supplied audio. Dialogflow does not do translations. See Language Support for a list of the currently supported language codes. Note that queries in the same session do not necessarily need to specify the same language.
  • model (type: String.t, default: nil) - Which Speech model to select for the given request. Select the model best suited to your domain to get best results. If a model is not explicitly specified, then we auto-select a model based on the parameters in the InputAudioConfig. If enhanced speech model is enabled for the agent and an enhanced version of the specified model for the language does not exist, then the speech is recognized using the standard version of the specified model. Refer to Cloud Speech API documentation for more details.
  • modelVariant (type: String.t, default: nil) - Which variant of the Speech model to use.
  • phraseHints (type: list(String.t), default: nil) - A list of strings containing words and phrases that the speech recognizer should recognize with higher likelihood. See the Cloud Speech documentation for more details. This field is deprecated. Please use speech_contexts instead. If you specify both phrase_hints and speech_contexts, Dialogflow will treat the phrase_hints as a single additional SpeechContext.
  • sampleRateHertz (type: integer(), default: nil) - Required. Sample rate (in Hertz) of the audio content sent in the query. Refer to Cloud Speech API documentation for more details.
  • singleUtterance (type: boolean(), default: nil) - If false (default), recognition does not cease until the client closes the stream. If true, the recognizer will detect a single spoken utterance in input audio. Recognition ceases when it detects the audio's voice has stopped or paused. In this case, once a detected intent is received, the client should close the stream and start a new request with a new stream as needed. Note: This setting is relevant only for streaming methods. Note: When specified, InputAudioConfig.single_utterance takes precedence over StreamingDetectIntentRequest.single_utterance.
  • speechContexts (type: list(GoogleApi.Dialogflow.V2.Model.GoogleCloudDialogflowV2SpeechContext.t), default: nil) - Context information to assist speech recognition. See the Cloud Speech documentation for more details.

Link to this section Summary


Unwrap a decoded JSON object into its complex fields.

Link to this section Types


t() :: %GoogleApi.Dialogflow.V2.Model.GoogleCloudDialogflowV2InputAudioConfig{
  audioEncoding: String.t() | nil,
  disableNoSpeechRecognizedEvent: boolean() | nil,
  enableWordInfo: boolean() | nil,
  languageCode: String.t() | nil,
  model: String.t() | nil,
  modelVariant: String.t() | nil,
  phraseHints: [String.t()] | nil,
  sampleRateHertz: integer() | nil,
  singleUtterance: boolean() | nil,
    | nil

Link to this section Functions


decode(struct(), keyword()) :: struct()

Unwrap a decoded JSON object into its complex fields.