Module sqlx_exasol::etl 
source · Expand description
This module provides the building blocks for creating IMPORT and EXPORT jobs. These are represented by a query that gets executed concurrently with some ETL workers, both of which are obtained by building the ETL job. The data format is always CSV, but there are some customizations that can be done on the builders such as row or column separator, etc.
The query execution is driven by a future obtained from building the job, and will rely
on using the workers (also obtained from building the job) to complete and return. The
results is of type ExaQueryResult which can give you the number of affected rows.
IMPORT jobs are constructed through the ImportBuilder type and will generate workers of type
ExaImport. The workers can be used to write data to the database and the query execution
ends when all the workers have been closed (by explicitly calling close().await).
EXPORT jobs are constructed through the ExportBuilder type and will generate workers of type
ExaExport. The workers can be used to read data from the database and the query execution
ends when all the workers receive EOF. They can be dropped afterwards.
ETL jobs can use TLS, compression, or both and will do so in a
consistent manner with the ExaConnection they are executed on.
That means that if the connection uses TLS / compression, so will the ETL job.
NOTE: Trying to run ETL jobs with TLS without an ETL TLS feature flag results in a runtime error. Furthermore, enabling more than one ETL TLS feature results in a compile time error.
Atomicity
IMPORT jobs are not atomic by themselves. If an error occurs during the data ingestion,
some of the data might be already sent and written in the database. However, since
IMPORT is fundamentally just a query, it can be transactional. Therefore,
beginning a transaction and passing that to the ImportBuilder::build method will result in
the import job needing to be explicitly committed:
use std::env;
use sqlx_exasol::{etl::*, *};
let pool = ExaPool::connect(&env::var("DATABASE_URL").unwrap()).await?;
let mut con = pool.acquire().await?;
let mut tx = con.begin().await?;
let (query_fut, writers) = ImportBuilder::new("SOME_TABLE").build(&mut *tx).await?;
// concurrently use the writers and await the query future
tx.commit().await?;IMPORTANT
Exasol doesn’t really like it when ExaImport workers are closed without ever sending any
data. The underlying socket connection to Exasol will be closed, and Exasol will just try to
open a new one. However, workers only listen on the designated sockets once, so the connection
will be refused (even if it weren’t, the cycle might just repeat since we’d still be sending no
data).
Therefore, it is wise not to build IMPORT jobs with more workers than required, depending on the amount of data to be imported and especially if certain workers won’t be written to at all.
Additionally, Exasol expects that all ExaExport are read in their entirety (until EOF is
reached). Failing to do so will result in the query execution returning an error. If, for some
reason, you do not want to exhaust the readers, be prepared to handle the error returned by the
EXPORT query.
Structs
- A builder for an ETL EXPORT job. Calling [build().await] will ouput a future that drives the EXPORT query execution and aVec<ExaReader>which must be concurrently used to read data from Exasol.
- A builder for an ETL IMPORT job. Calling [build().await] will ouput a future that drives the IMPORT query execution and aVec<ExaWriter>which must be concurrently used to ingest data into Exasol.
Enums
- An ETL EXPORT worker.
- An ETL IMPORT worker.
- The EXPORT source type, which can either directly be a table or an entire query.
- Trim options for IMPORT