google_api_document_ai v0.2.0 API Reference
Modules
API client metadata for GoogleApi.DocumentAI.V1beta2.
API calls for all endpoints tagged Projects.
Handle Tesla connections for GoogleApi.DocumentAI.V1beta2.
Response to an batch document processing request. This is returned in the LRO Operation after the operation is complete.
A bounding polygon for the detected image annotation.
Document represents the canonical document resource in Document Understanding AI. It is an interchange format that provides insights into documents and allows for collaboration between users and Document Understanding AI to iterate and optimize for quality.
A phrase in the text that is a known entity type, such as a person, an organization, or location.
Relationship between Entities.
A page in a Document.
A block has a set of lines (collected into paragraphs) that have a common line-spacing and orientation.
Detected language for a structural component.
Dimension for the page.
A form field detected on the page.
Visual element describing a layout unit on a page.
A collection of tokens that a human would perceive as a line. Does not cross column boundaries, can be horizontal, vertical, etc.
A collection of lines that a human would perceive as a paragraph.
A table representation similar to HTML table structure.
A cell representation inside the table.
A row of table cells.
Detected break at the end of a Token.
Detected non-text visual elements e.g. checkbox, signature etc. on the page.
For a large document, sharding may be performed to produce several document shards. Each document shard contains this field to detail which shard it is.
Annotation for common text style attributes. This adheres to CSS conventions as much as possible.
Font size with unit.
Text reference indexing into the Document.text.
A text segment in the Document.text. The indices may be out of bounds which indicate that the text extends into another document shard for large sharded documents. See ShardInfo.text_offset
A translation of the text segment.
The Google Cloud Storage location where the output file will be written to.
The Google Cloud Storage location where the input file will be read from.
The desired input location and metadata.
A vertex represents a 2D point in the image. NOTE: the normalized vertex coordinates are relative to the original image and range from 0 to 1.
Contains metadata for the BatchProcessDocuments operation.
The desired output location and metadata.
Response to a single document processing request.
A vertex represents a 2D point in the image. NOTE: the vertex coordinates are in the same scale as the original image.
Parameters to control AutoML model prediction behavior.
Request to batch process documents as an asynchronous operation. The output is written to Cloud Storage as JSON in the [Document] format.
Response to an batch document processing request. This is returned in the LRO Operation after the operation is complete.
A bounding polygon for the detected image annotation.
Document represents the canonical document resource in Document Understanding AI. It is an interchange format that provides insights into documents and allows for collaboration between users and Document Understanding AI to iterate and optimize for quality.
A phrase in the text that is a known entity type, such as a person, an organization, or location.
Relationship between Entities.
Label attaches schema information and/or other metadata to segments within a Document. Multiple Labels on a single field can denote either different labels, different instances of the same label created at different times, or some combination of both.
A page in a Document.
A block has a set of lines (collected into paragraphs) that have a common line-spacing and orientation.
Detected language for a structural component.
Dimension for the page.
A form field detected on the page.
Visual element describing a layout unit on a page.
A collection of tokens that a human would perceive as a line. Does not cross column boundaries, can be horizontal, vertical, etc.
A collection of lines that a human would perceive as a paragraph.
A table representation similar to HTML table structure.
A cell representation inside the table.
A row of table cells.
Detected break at the end of a Token.
Detected non-text visual elements e.g. checkbox, signature etc. on the page.
For a large document, sharding may be performed to produce several document shards. Each document shard contains this field to detail which shard it is.
Annotation for common text style attributes. This adheres to CSS conventions as much as possible.
Font size with unit.
Text reference indexing into the Document.text.
A text segment in the Document.text. The indices may be out of bounds which indicate that the text extends into another document shard for large sharded documents. See ShardInfo.text_offset
A translation of the text segment.
Parameters to control entity extraction behavior.
Parameters to control form extraction behavior.
The Google Cloud Storage location where the output file will be written to.
The Google Cloud Storage location where the input file will be read from.
The desired input location and metadata.
Reserved for future use.
A vertex represents a 2D point in the image. NOTE: the normalized vertex coordinates are relative to the original image and range from 0 to 1.
Parameters to control Optical Character Recognition (OCR) behavior.
Contains metadata for the BatchProcessDocuments operation.
The desired output location and metadata.
Request to process one document.
Response to a single document processing request.
A hint for a table bounding box on the page for table parsing.
Parameters to control table extraction behavior.
A vertex represents a 2D point in the image. NOTE: the vertex coordinates are in the same scale as the original image.
This resource represents a long-running operation that is the result of a network API call.
The Status type defines a logical error model that is suitable for
different programming environments, including REST APIs and RPC APIs. It is
used by gRPC. Each Status message contains
three pieces of data: error code, error message, and error details.
Represents a color in the RGBA color space. This representation is designed for simplicity of conversion to/from color representations in various languages over compactness; for example, the fields of this representation can be trivially provided to the constructor of "java.awt.Color" in Java; it can also be trivially provided to UIColor's "+colorWithRed:green:blue:alpha" method in iOS; and, with just a little work, it can be easily formatted into a CSS "rgba()" string in JavaScript, as well.