GoogleApi.ContentWarehouse.V1.Model.SnapshotTextNode (google_api_content_warehouse v0.3.0)
The SnapshotDocument contains a list of TextNodes. Each node contains a string of text from the webpage, its bounding box in the above snapshot image, and its font size (in pixels in the snapshot, which may be fractional since the snapshot image is typically shrunk). This list of text nodes is extracted from the output of the rendering service: htmlrender_webkit_headless_proto.Document. The extraction is done by TrimDocument, defined in ./shared/doctrimmer.cc.
Attributes
- boundingBox (type: GoogleApi.ContentWarehouse.V1.Model.SnapshotBox.t, default: nil)
- fontSize (type: number(), default: nil)
- inLink (type: integer(), default: nil) - One if the current text node is within a link; otherwise zero/not present.
- maxSplit (type: integer(), default: nil) - A value in the range [0,7] (zero if not present) indicating the most "powerful" splitting tag since the last text node. See "enum Category" in mustang/snippets/taginfo.h.
- text (type: String.t, default: nil)
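Because `inLink` and `maxSplit` are integer-coded and may be absent (nil), it can help to read them defensively. The following is a minimal sketch, not part of the library: `Sketch.TextNode` is a hypothetical stand-in struct with the same field names, defined locally so the snippet runs without dependencies, and `in_link?/1` and `max_split/1` are illustrative helpers.

```elixir
defmodule Sketch.TextNode do
  # Hypothetical stand-in for GoogleApi.ContentWarehouse.V1.Model.SnapshotTextNode.
  # Field names follow the attribute list; every field defaults to nil.
  defstruct [:boundingBox, :fontSize, :inLink, :maxSplit, :text]

  # inLink is 1 when the node is inside a link; 0 or nil otherwise.
  def in_link?(%__MODULE__{inLink: 1}), do: true
  def in_link?(%__MODULE__{}), do: false

  # maxSplit is documented as zero when not present, so nil maps to 0.
  def max_split(%__MODULE__{maxSplit: nil}), do: 0
  def max_split(%__MODULE__{maxSplit: n}) when n in 0..7, do: n
end

# A fractional fontSize is plausible because the snapshot is typically shrunk.
node = %Sketch.TextNode{text: "Read more", fontSize: 6.4, inLink: 1}

IO.inspect(Sketch.TextNode.in_link?(node))
IO.inspect(Sketch.TextNode.max_split(node))
```

In this sketch a `maxSplit` outside the documented [0,7] range matches no clause and raises, which surfaces malformed data early instead of silently passing it on.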
Types
@type t() :: %GoogleApi.ContentWarehouse.V1.Model.SnapshotTextNode{
  boundingBox: GoogleApi.ContentWarehouse.V1.Model.SnapshotBox.t() | nil,
  fontSize: number() | nil,
  inLink: integer() | nil,
  maxSplit: integer() | nil,
  text: String.t() | nil
}
Functions
Unwrap a decoded JSON object into its complex fields.
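To illustrate what "unwrapping" means here: a JSON decoder yields plain maps, so a nested `boundingBox` arrives as a map rather than a struct until it is converted. The sketch below imitates that step with stand-in modules. `Sketch.Box`, `Sketch.Node`, and their `from_map/1` helpers are hypothetical, and the box fields `x`, `y`, `width`, and `height` are assumed for illustration only. It is not the library's implementation.

```elixir
defmodule Sketch.Box do
  # Hypothetical stand-in for SnapshotBox; the field names are assumptions.
  defstruct [:x, :y, :width, :height]

  # An absent box stays nil.
  def from_map(nil), do: nil

  def from_map(%{} = map) do
    %__MODULE__{x: map["x"], y: map["y"], width: map["width"], height: map["height"]}
  end
end

defmodule Sketch.Node do
  # Hypothetical stand-in mirroring the fields of SnapshotTextNode.
  defstruct [:boundingBox, :fontSize, :inLink, :maxSplit, :text]

  # Convert a decoded JSON object (string keys) into a struct, turning the
  # nested boundingBox map into a Sketch.Box. Missing keys become nil.
  def from_map(%{} = map) do
    %__MODULE__{
      boundingBox: Sketch.Box.from_map(map["boundingBox"]),
      fontSize: map["fontSize"],
      inLink: map["inLink"],
      maxSplit: map["maxSplit"],
      text: map["text"]
    }
  end
end

# What a JSON decoder might hand back for one text node.
decoded = %{
  "text" => "Hello",
  "fontSize" => 6.4,
  "boundingBox" => %{"x" => 10, "y" => 20, "width" => 80, "height" => 12}
}

parsed = Sketch.Node.from_map(decoded)
IO.inspect(parsed.boundingBox)
```

Scalar fields pass through unchanged; only the nested object needs conversion, which is why the documentation speaks of "complex fields".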