View Source GoogleApi.ContentWarehouse.V1.Model.ImageRepositoryWordInfo (google_api_content_warehouse v0.4.0)
Word-specific information for recognized words.
Attributes
-
confidence
(type:number()
, default:nil
) - The confidence estimate between 0.0 and 1.0. A higher number indicates an estimated greater likelihood that the recognized words are correct. This field is set only for the top alternative of a non-streaming result or, of a streaming result whereis_final=true
. This field is not guaranteed to be accurate and users should not rely on it to be always provided. The default of 0.0 is a sentinel value indicatingconfidence
was not set. -
endTime
(type:String.t
, default:nil
) - Time offset relative to the beginning of the audio, and corresponding to the end of the spoken word. This field is only set ifenable_word_time_offsets=true
and only in the top hypothesis. This is an experimental feature and the accuracy of the time offset can vary. -
speakerTag
(type:integer()
, default:nil
) - A distinct integer value is assigned for every speaker within the audio. This field specifies which one of those speakers was detected to have spoken this word. Value ranges from '1' to diarization_speaker_count. speaker_tag is set if enable_speaker_diarization = 'true' and only in the top alternative. -
startTime
(type:String.t
, default:nil
) - Time offset relative to the beginning of the audio, and corresponding to the start of the spoken word. This field is only set ifenable_word_time_offsets=true
and only in the top hypothesis. This is an experimental feature and the accuracy of the time offset can vary. -
word
(type:String.t
, default:nil
) - The word corresponding to this set of information.
Summary
Functions
Unwrap a decoded JSON object into its complex fields.
Types
Functions
Unwrap a decoded JSON object into its complex fields.