gcp-bigquery-client 0.4.0

An ergonomic async client library for GCP BigQuery.
Documentation

GCP BigQuery Client (Rust)

An ergonomic async client library for GCP BigQuery.

  • Support for dataset, table, streaming API and query (see status section for an exhaustive list of supported API endpoints)
  • Support Service Account Key authentication (other OAuth flows will be added later)
  • Create tables and rows via builder patterns
  • Persist complex Rust structs in structured BigQuery tables
  • Async API

Other OAuth flows will be added later.


NOTE This is my first crate and it's still a work-in-progress. So please post your suggestions and ideas on this GitHub discussion section.

Most of the Rust structures defined in the directory 'model' are derived from this Google API Explorer document.


Example

The following example performs the following operations:

  • Load a set of environment variables to set $PROJECT_ID, $DATASET_ID, $TABLE_ID and $GOOGLE_APPLICATION_CREDENTIALS
  • Init the BigQuery client
  • Create a dataset in the GCP project $PROJECT_ID
  • Create a table in the previously created dataset (table schema)
  • Insert a set of rows in the previously created table via the BigQuery Streaming API. The inserted rows are based on a regular Rust struct implementing the trait Serialize.
  • Perform a select query on the previously created table
  • Drop the table previously created
  • Drop the dataset previously created
// Read configuration parameters from environment variables
let (ref project_id, ref dataset_id, ref table_id, ref gcp_sa_key) = env_vars();

// Init BigQuery client
let client = gcp_bigquery_client::Client::from_service_account_key_file(gcp_sa_key).await;

// Create dataset
let created_dataset = client.dataset().create(project_id, Dataset::new(dataset_id)).await?;
println!(
    "Dataset '{}.{}' created",
    created_dataset.project_id(),
    created_dataset.dataset_id()
);

// Create table schema
let table = Table::new(
    project_id,
    dataset_id,
    table_id,
    TableSchema::new(vec![
        TableFieldSchema::integer("int_value"),
        TableFieldSchema::float("float_value"),
        TableFieldSchema::bool("bool_value"),
        TableFieldSchema::string("string_value"),
        TableFieldSchema::record(
            "record_value",
            vec![
                TableFieldSchema::integer("int_value"),
                TableFieldSchema::string("string_value"),
                TableFieldSchema::record(
                    "record_value",
                    vec![
                        TableFieldSchema::integer("int_value"),
                        TableFieldSchema::string("string_value"),
                    ],
                ),
            ],
        ),
    ]),
);

let created_table = client.table().create(project_id, dataset_id, table).await?;
println!(
    "Table '{}.{}.{}' created",
    created_table.project_id(),
    created_table.dataset_id(),
    created_table.table_id()
);

// Insert data via BigQuery Streaming API
let mut insert_request = TableDataInsertAllRequest::new();
insert_request.add_row(
    None,
    MyRow {
        int_value: 1,
        float_value: 1.0,
        bool_value: false,
        string_value: "first".into(),
        record_value: FirstRecordLevel {
            int_value: 10,
            string_value: "sub_level_1.1".into(),
            record_value: SecondRecordLevel {
                int_value: 20,
                string_value: "leaf".to_string(),
            },
        },
    },
)?;
insert_request.add_row(
    None,
    MyRow {
        int_value: 2,
        float_value: 2.0,
        bool_value: true,
        string_value: "second".into(),
        record_value: FirstRecordLevel {
            int_value: 11,
            string_value: "sub_level_1.2".into(),
            record_value: SecondRecordLevel {
                int_value: 21,
                string_value: "leaf".to_string(),
            },
        },
    },
)?;

client
    .tabledata()
    .insert_all(project_id, dataset_id, table_id, insert_request)
    .await?;

// Query
let mut rs = client
    .job()
    .query(
        project_id,
        QueryRequest::new(format!(
            "SELECT COUNT(*) AS c FROM `{}.{}.{}`",
            project_id, dataset_id, table_id
        )),
    )
    .await?;
while rs.next_row() {
    println!("Number of rows inserted: {}", rs.get_i64_by_name("c")?.unwrap());
}

// Delete the table previously created
client.table().delete(project_id, dataset_id, table_id).await?;

// Delete the dataset previously created
client.dataset().delete(project_id, dataset_id, true).await?;

Main dependencies

  • Yup-OAuth2 5.x
  • Hyper 0.14 (+ RusTLS)
  • Tokio 1.x
  • Reqwest 0.11
  • Serde JSON 1.x
  • ThisError 1.x

Status

List of endpoints implemented:

  • Dataset
    • Delete
    • Get
    • Insert (create)
    • List
    • Patch
    • Update
  • Table
    • Delete
    • Get
    • GetIamPolicy
    • Insert
    • List
    • Patch
    • SetIamPolicy
    • TestIamPermissions
    • Update
  • Tabledata
    • InsertAll
    • List
  • Job
    • Cancel
    • Get
    • GetQueryResult
    • Insert
    • List
    • Query
  • Model
    • Delete
    • Get
    • List
    • Patch
  • Project
    • GetServiceAccount
    • List
  • Routine
    • Delete
    • Get
    • Insert
    • List
    • Update

License