Expand description

Data structures used by operation inputs/outputs.

Modules

Structs

Used to contain the information detected by an AnalyzeID operation.

A Block represents items that are recognized in a document within a group of pixels close to each other. The information returned in a Block object depends on the type of operation. In text detection for documents (for example DetectDocumentText), you get information about the detected words and lines of text. In text analysis (for example AnalyzeDocument), you can also get information about the fields, tables, and selection elements that are detected in the document.

The bounding box around the detected page, text, key-value pair, table, table cell, or selection element on a document page. The left (x-coordinate) and top (y-coordinate) are coordinates that represent the top and left sides of the bounding box. Note that the upper-left corner of the image is the origin (0,0).

The input document, either as bytes or as an S3 object.

The Amazon S3 bucket that contains the document to be processed. It's used by asynchronous operations such as StartDocumentTextDetection.

Information about the input document.

An object used to store information about the Value or Label detected by Amazon Textract.

The structure holding all the information returned by AnalyzeExpense

Breakdown of detected information, seperated into the catagories Type, LabelDetection, and ValueDetection

An object used to store information about the Type detected by Amazon Textract.

Information about where the following items are located on a document page: detected page, text, key-value pairs, tables, table cells, and selection elements.

Shows the results of the human in the loop evaluation. If there is no HumanLoopArn, the input did not trigger human review.

Sets up the human review workflow the document will be sent to if one of the conditions is met. You can also set certain attributes of the image before review.

Allows you to set attributes of the image. Currently, you can declare an image as free of personally identifiable information and adult content.

The structure that lists each document processed in an AnalyzeID operation.

Structure containing both the normalized type of the extracted information and the text associated with it. These are extracted as Type and Value respectively.

A structure that holds information about the different lines found in a document's tables.

A grouping of tables which contain LineItems, with each table identified by the table's LineItemGroupIndex.

Contains information relating to dates in a document, including the type of value, and the value.

The Amazon Simple Notification Service (Amazon SNS) topic to which Amazon Textract publishes the completion status of an asynchronous document operation, such as StartDocumentTextDetection.

Sets whether or not your output will go to a user created bucket. Used to set the name of the bucket, and the prefix on the output file.

The X and Y coordinates of a point on a document page. The X and Y values that are returned are ratios of the overall document page size. For example, if the input document is 700 x 200 and the operation returns X=0.5 and Y=0.25, then the point is at the (350,50) pixel coordinate on the document page.

Information about how blocks are related to each other. A Block object contains 0 or more Relation objects in a list, Relationships. For more information, see Block.

The S3 bucket name and file name that identifies the document.

A warning about an issue that occurred during asynchronous text analysis (StartDocumentAnalysis) or asynchronous document text detection (StartDocumentTextDetection).

Enums