encryptable-tokio-fs
A drop-in, API-identical replacement for tokio::fs with transparent opt-in, non-framing stream cipher encryption.
This crate is aims to be a full API mirror of tokio::fs. When a cryptographic key is provided, data is automatically encrypted/decrypted using the XChaCha20 stream cipher during file reads and writes, requiring zero application-side code changes.
To use it:
- Search and replace all
tokio::fsforencryptable-tokio-fs::fs - To enable encryption, call
encryptable-tokio-fs::fs::set_key(). All file operations, from that point on, will be encrypted. - By not setting a key, file operations will be exactly the same as
tokio::fs-- a.k.a., "plain-text".
Example
async
Use Case and Security Context
Use Case
This crate is designed to obfuscate data at rest for programs deployed to external premises (e.g., client environments). Its primary function is to deter casual analysis and make it harder for an adversary to immediately locate and view sensitive data (configuration, proprietary content, etc.) within the program's files.
Security Note
It is critical to understand that this crate provides security through obscurity, not robust, long-term cryptographic protection -- a weakness of using client-side data protection without a secure hardware module.
If an attacker is able to execute the program in a debugging environment (e.g., using GDB or WinDbg), the encryption key will be, eventually, loaded into memory. Once the program is under an analyst's control, they can easily:
- Inspect memory to extract the plaintext key.
- Set breakpoints on the decryption function to view the data immediately after decryption.
Therefore, the protection offered is ephemeral once active reverse engineering begins.
For a more comprehensive defense, this encryption strategy must be paired with strong anti-debugging and code obfuscation techniques.
Please see the Security Model and Mitigation sections at the end of this document for details on a layered approach.
Pre-release API
We are in the pre-release API, where only a small -- but very useful -- portion of the whole tokio::fs API is implemented.
Specifically, the CryptorAsyncReader & CryptorAsyncWriter are fully implemented, tested, and optimized,
effectively allowing using the replacement File object for encryption / decryption.
On the other side, we are still lacking:
- File Name & Path encryption
- Traversing the filesystem (encrypted portion of it)
- Seek support
- Append support (this would require seek, as the cypher needs also to read the headers and seek the cypher to the end position)
Implementation Status
Although the API is still incomplete, the implementation is efficient, secure, decoupled, and has been fully tested.
Your inputs are welcome to guide further developments. Please create a Github Issue with requests or suggestions.
Global context vs Instantiated
On the above usage example, a single key would be used for all file operations -- since the easiest integration path is
to keep using the global context, exactly as tokio does when using file APIs under tokio::fs.
Nonetheless, there will be APIs to instantiate the cryptor FS layer -- allowing multiple keys to be used simultaneously.
Using the global context is easier, but has the downside to require the key to be stored in RAM until the process ends, which may be of concern when executing it in adverse environments.
Security model
This crate, currently, provides confidentiality only: it encrypts bytes so they are unreadable without the key. It does not provide integrity or authenticity.
What you get: secrecy of file contents (assuming key secrecy).
What you do not get: detection of modifications, truncation, re-ordering, or header tampering. Any bit flips or edits to the encrypted file will decrypt to some bytes without error.
Corruption/tampering: disk glitches or malicious edits are not detected. We keep an integrity parity with plaintext: Using this crate for encrypted files provides no more and no less modification detection than storing plain-text. If your application requires modification detection, we recommend you to add your own checks for it to work for both encrypted or non-encrypted contents -- If you only care about accidental corruption (not adversaries), add a filesystem-level checksum (e.g., CRC32/64) or hash stored elsewhere and verify before use.
Threat model: equivalent to a plain-text file with OS-level permissions for integrity. If an attacker can modify the file, they can change the decrypted output without this crate noticing.
We do not defend against malicious changes.
However, there is a caveat: if any of the first 192 bits of any encrypted file are changed (the nonce header), the full contents of the file will become garbage.
Why this design: files remain seekable with raw stream encryption; adding per-frame authentication would change that trade-off. This crate intentionally keeps the plain-text-like ergonomics for reading/seek.
Mitigations
No seek, possibly support append, but solving the integrity & authentication issues Trade-off
By incorporating compression, we may solve all the issues raised in the security model above -- at the expense of losing seek support. Maybe we can work with both variants:
encryptable-tokio-fs::fs::set_key()-- enables encryption (but no integrity nor authentication) and support for the fulltokio::fsAPI.encryptable-tokio-fs::fs::set_compressor()-- enables compression (on top of encryption): provides integrity & authentication (immune to "casual attacks") but disablesseekand, possibly,append.
To further improve a little bit on the security -- provided an attacker is not able to conduct a debugging session:
- No compressor error message will leak. It will just fail with "tempered data";
- Before shouting "tempered data", a sleep of 1 second will be enforced;
- Users are strongly advised to hide -- as much as possible -- the contents of the encrypted files.
If unavoidable, the further delaying presenting this information the better. E.g.: if
--verboseis enabled, sleep for 1 second before starting the program.
With these additions, we estimate the cost for a determined attacker to effectively change a "license expiry date" on a yaml file to be at around ~40k. Please do not use this crate to secure higher valuable assets.
The above cost holds true provided you obfuscate the binary enough to make debugging sessions fruitless:
- Binary building options: strip debug info, use aggressive linking optimizations (fat), use codegen-units = 1, panic = abort, statically link as most as possible.
and bail out if a debugger has been detected:
- Linux: Check the
P_TRACEDflag in the/proc/self/statusfile or use theptracesystem call (the well-known "ptrace trick") - Windows: Call the
IsDebuggerPresent()function from the WinAPI, or check theBeingDebuggedflag in the Process Environment Block (PEB) - You can also search among process names for known debuggers: gdb, lldb, x64dbg, ollydbg, windbg, ...