View Source GoogleApi.ContentWarehouse.V1.Model.SnapshotTextNode (google_api_content_warehouse v0.2.0)
The SnapshotDocument contains a list of TextNode's. Each node contains a string of text of the webpage, its bounding box in the agove snapshot image, and its font size (in number of pixels in the snapshot, which could be a fraction number since the snapshot image is typically shrinked). This list of text nodes are extracted from the output from the rendering service: htmlrender_webkit_headless_proto.Document The extraction is done by TrimDocument defined in ./shared/doctrimmer.cc
Attributes
-
boundingBox(type:GoogleApi.ContentWarehouse.V1.Model.SnapshotBox.t, default:nil) - -
fontSize(type:number(), default:nil) - -
inLink(type:integer(), default:nil) - One if the current text node is within a link; otherwise zero/not present. -
maxSplit(type:integer(), default:nil) - A value in the range [0,7] (zero if not present) indicating the most "powerful" splitting tag since the last text node. See "enum Category" in mustang/snippets/taginfo.h. -
text(type:String.t, default:nil) -
Summary
Functions
Unwrap a decoded JSON object into its complex fields.
Types
@type t() :: %GoogleApi.ContentWarehouse.V1.Model.SnapshotTextNode{ boundingBox: GoogleApi.ContentWarehouse.V1.Model.SnapshotBox.t() | nil, fontSize: number() | nil, inLink: integer() | nil, maxSplit: integer() | nil, text: String.t() | nil }
Functions
Unwrap a decoded JSON object into its complex fields.