View Source GoogleApi.ContentWarehouse.V1.Model.SnapshotTextNode (google_api_content_warehouse v0.3.0)

The SnapshotDocument contains a list of TextNode's. Each node contains a string of text of the webpage, its bounding box in the agove snapshot image, and its font size (in number of pixels in the snapshot, which could be a fraction number since the snapshot image is typically shrinked). This list of text nodes are extracted from the output from the rendering service: htmlrender_webkit_headless_proto.Document The extraction is done by TrimDocument defined in ./shared/doctrimmer.cc

Attributes

  • boundingBox (type: GoogleApi.ContentWarehouse.V1.Model.SnapshotBox.t, default: nil) -
  • fontSize (type: number(), default: nil) -
  • inLink (type: integer(), default: nil) - One if the current text node is within a link; otherwise zero/not present.
  • maxSplit (type: integer(), default: nil) - A value in the range [0,7] (zero if not present) indicating the most "powerful" splitting tag since the last text node. See "enum Category" in mustang/snippets/taginfo.h.
  • text (type: String.t, default: nil) -

Summary

Functions

Unwrap a decoded JSON object into its complex fields.

Types

@type t() :: %GoogleApi.ContentWarehouse.V1.Model.SnapshotTextNode{
  boundingBox: GoogleApi.ContentWarehouse.V1.Model.SnapshotBox.t() | nil,
  fontSize: number() | nil,
  inLink: integer() | nil,
  maxSplit: integer() | nil,
  text: String.t() | nil
}

Functions

@spec decode(struct(), keyword()) :: struct()

Unwrap a decoded JSON object into its complex fields.