Expand description

Document represents the canonical document resource in Document Understanding AI. It is an interchange format that provides insights into documents and allows for collaboration between users and Document Understanding AI to iterate and optimize for quality.

Activities

This type is used in activities, which are methods you may call on this type or where this type is involved in. The list links the activity name, along with information about where it is used (one of request and response).

Fields

content: Option<String>

Optional. Inline document content, represented as a stream of bytes. Note: As with all bytes fields, protobuffers use a pure binary representation, whereas JSON representations use base64.

entities: Option<Vec<GoogleCloudDocumentaiV1beta2DocumentEntity>>

A list of entities detected on Document.text. For document shards, entities in this list may cross shard boundaries.

entity_relations: Option<Vec<GoogleCloudDocumentaiV1beta2DocumentEntityRelation>>

Placeholder. Relationship among Document.entities.

error: Option<GoogleRpcStatus>

Any error that occurred while processing this document.

labels: Option<Vec<GoogleCloudDocumentaiV1beta2DocumentLabel>>

Labels for this document.

mime_type: Option<String>

An IANA published MIME type (also referred to as media type). For more information, see https://www.iana.org/assignments/media-types/media-types.xhtml.

pages: Option<Vec<GoogleCloudDocumentaiV1beta2DocumentPage>>

Visual page layout for the Document.

revisions: Option<Vec<GoogleCloudDocumentaiV1beta2DocumentRevision>>

Placeholder. Revision history of this document.

shard_info: Option<GoogleCloudDocumentaiV1beta2DocumentShardInfo>

Information about the sharding if this document is sharded part of a larger document. If the document is not sharded, this message is not specified.

text: Option<String>

Optional. UTF-8 encoded text in reading order from the document.

text_changes: Option<Vec<GoogleCloudDocumentaiV1beta2DocumentTextChange>>

Placeholder. A list of text corrections made to [Document.text]. This is usually used for annotating corrections to OCR mistakes. Text changes for a given revision may not overlap with each other.

text_styles: Option<Vec<GoogleCloudDocumentaiV1beta2DocumentStyle>>

Placeholder. Styles for the Document.text.

uri: Option<String>

Optional. Currently supports Google Cloud Storage URI of the form gs://bucket_name/object_name. Object versioning is not supported. See Google Cloud Storage Request URIs for more info.

Trait Implementations

Returns a copy of the value. Read more

Performs copy-assignment from source. Read more

Formats the value using the given formatter. Read more

Returns the “default value” for a type. Read more

Deserialize this value from the given Serde deserializer. Read more

Serialize this value into the given Serde serializer. Read more

Auto Trait Implementations

Blanket Implementations

Gets the TypeId of self. Read more

Immutably borrows from an owned value. Read more

Mutably borrows from an owned value. Read more

Returns the argument unchanged.

Instruments this type with the provided Span, returning an Instrumented wrapper. Read more

Instruments this type with the current Span, returning an Instrumented wrapper. Read more

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

The resulting type after obtaining ownership.

Creates owned data from borrowed data, usually by cloning. Read more

🔬 This is a nightly-only experimental API. (toowned_clone_into)

Uses borrowed data to replace owned data, usually by cloning. Read more

The type returned in the event of a conversion error.

Performs the conversion.

The type returned in the event of a conversion error.

Performs the conversion.

Attaches the provided Subscriber to this type, returning a WithDispatch wrapper. Read more

Attaches the current default Subscriber to this type, returning a WithDispatch wrapper. Read more