Struct EmbrFS

Source

pub struct EmbrFS {
    pub manifest: Manifest,
    pub engram: Engram,
    pub resonator: Option<Resonator>,
    /* private fields */
}

Expand description

EmbrFS - Holographic Filesystem with Guaranteed Reconstruction

§100% Reconstruction Guarantee

EmbrFS guarantees bit-perfect file reconstruction through a layered approach:

Encode: Data chunks → SparseVec via reversible encoding
Verify: Immediately decode and compare to original
Correct: Store minimal correction if any difference exists
Extract: Decode + apply correction = exact original bytes

This guarantee holds regardless of:

Data content (binary, text, compressed, encrypted)
File size (single byte to gigabytes)
Number of files in the engram
Superposition crosstalk in bundles

§Holographic Mode

When created with new_holographic(), uses ReversibleVSAEncoder which achieves ~94% uncorrected accuracy through position-aware VSA binding. This results in <10% correction overhead instead of the ~200%+ overhead of legacy encoding.

§Examples

use embeddenator_fs::EmbrFS;
use std::path::Path;

// Legacy mode (not recommended)
let mut fs_legacy = EmbrFS::new();

// Holographic mode (recommended - minimal storage overhead)
let mut fs = EmbrFS::new_holographic();
assert_eq!(fs.manifest.total_chunks, 0);
assert_eq!(fs.manifest.files.len(), 0);

Fields§

§manifest: Manifest§engram: Engram§resonator: Option<Resonator>

Implementations§

Source §

impl EmbrFS

Source

pub fn new() -> Self

👎Deprecated since 0.25.0: Use new_holographic() instead for ~94% encoding accuracy and <10% storage overhead

Create a new empty EmbrFS instance (legacy mode - NOT RECOMMENDED)

This constructor creates an EmbrFS with legacy encoding that has only ~10% accuracy, resulting in ~200%+ storage overhead due to verbatim corrections.

Use new_holographic() instead for production use.

§Examples

use embeddenator_fs::EmbrFS;

// Legacy mode - high storage overhead
let fs = EmbrFS::new();
assert_eq!(fs.manifest.files.len(), 0);

Source

pub fn new_holographic() -> Self

Create a new EmbrFS with holographic encoding (RECOMMENDED)

Uses ReversibleVSAEncoder which achieves ~94% uncorrected accuracy through position-aware VSA binding. This results in <10% correction overhead instead of the ~200%+ overhead of legacy encoding.

§Examples

use embeddenator_fs::EmbrFS;

let fs = EmbrFS::new_holographic();
assert_eq!(fs.manifest.files.len(), 0);
assert_eq!(fs.manifest.total_chunks, 0);
assert!(fs.is_holographic());

Source

pub fn is_holographic(&self) -> bool

Check if holographic mode is enabled

Source

pub fn chunk_size(&self) -> usize

Get the chunk size being used

Source

pub fn set_resonator(&mut self, resonator: Resonator)

Set the resonator for enhanced pattern recovery during extraction

Configures a resonator network that can perform pattern completion to recover missing or corrupted data chunks during filesystem extraction. The resonator acts as a content-addressable memory that can reconstruct lost information by finding the best matching patterns in its trained codebook.

§How it works

The resonator maintains a codebook of known vector patterns
During extraction, missing chunks are projected onto the closest known pattern
This enables robust recovery from partial data loss or corruption

§Why this matters

Provides fault tolerance for holographic storage systems
Enables reconstruction even when some chunks are unavailable
Supports graceful degradation rather than complete failure

§Arguments

resonator - A trained resonator network for pattern completion

§Examples

use embeddenator_fs::{EmbrFS, Resonator};

let mut fs = EmbrFS::new();
let resonator = Resonator::new();
fs.set_resonator(resonator);
// Now extraction will use resonator-enhanced recovery

Source

pub fn correction_stats(&self) -> CorrectionStats

Get correction statistics for this engram

Returns statistics about how many chunks needed correction and the overhead incurred by storing corrections.

§Examples

use embeddenator_fs::EmbrFS;

let fs = EmbrFS::new();
let stats = fs.correction_stats();
assert_eq!(stats.total_chunks, 0);

Source

pub fn ingest_directory<P: AsRef<Path>>( &mut self, dir: P, verbose: bool, config: &ReversibleVSAConfig, ) -> Result<()>

Ingest an entire directory into engram format

Source

pub fn ingest_directory_with_prefix<P: AsRef<Path>>( &mut self, dir: P, logical_prefix: Option<&str>, verbose: bool, config: &ReversibleVSAConfig, ) -> Result<()>

Ingest a directory into the engram, optionally prefixing all logical paths.

When logical_prefix is provided, all ingested file paths become: {logical_prefix}/{relative_path_from_dir}.

Source

pub fn ingest_file<P: AsRef<Path>>( &mut self, file_path: P, logical_path: String, verbose: bool, config: &ReversibleVSAConfig, ) -> Result<()>

Ingest a single file into the engram with guaranteed reconstruction

This method encodes file data into sparse vectors and stores any necessary corrections to guarantee 100% bit-perfect reconstruction.

§Correction Process

For each chunk:

Encode: chunk_data → SparseVec
Decode: SparseVec → decoded_data
Compare: chunk_data == decoded_data?
If different: store correction in CorrectionStore

§Arguments

file_path - Path to the file on disk
logical_path - Path to use in the engram manifest
verbose - Print progress information
config - VSA encoding configuration

§Returns

io::Result<()> indicating success or failure

Source

pub fn add_file<P: AsRef<Path>>( &mut self, file_path: P, logical_path: String, verbose: bool, config: &ReversibleVSAConfig, ) -> Result<()>

Add a new file to an existing engram (incremental update)

This method enables efficient incremental updates by adding a single file to an existing engram without requiring full re-ingestion. The new file’s chunks are bundled with the existing root vector using VSA’s associative bundle operation.

§Algorithm

Encode new file into chunks (same as ingest_file)
Bundle each chunk with existing root: root_new = root_old ⊕ chunk
Add chunks to codebook with new chunk IDs
Update manifest with new file entry

§Performance

Time complexity: O(n) where n = number of chunks in new file
Does not require reading or re-encoding existing files
Suitable for production workflows with frequent additions

§Arguments

file_path - Path to the file on disk
logical_path - Path to use in the engram manifest
verbose - Print progress information
config - VSA encoding configuration

§Returns

io::Result<()> indicating success or failure

§Examples

use embeddenator_fs::{EmbrFS, ReversibleVSAConfig};
use std::path::Path;

let mut fs = EmbrFS::new();
let config = ReversibleVSAConfig::default();

// Ingest initial dataset
fs.ingest_directory("./data", false, &config).unwrap();

// Later, add a new file without full re-ingestion
fs.add_file("./new_file.txt", "new_file.txt".to_string(), true, &config).unwrap();

Source

pub fn remove_file(&mut self, logical_path: &str, verbose: bool) -> Result<()>

Remove a file from the engram (mark as deleted for incremental update)

This method marks a file as deleted in the manifest without modifying the root vector. This is because VSA bundling is a lossy operation and there’s no clean inverse. The chunks remain in the codebook but won’t be extracted.

§Algorithm

Find file in manifest by logical path
Mark file entry as deleted
Chunks remain in codebook (for potential recovery or compaction)
File won’t appear in future extractions

§Note on VSA Limitations

Bundle operation is associative but not invertible:

(A ⊕ B) ⊕ C = A ⊕ (B ⊕ C) ✓ (can add)
(A ⊕ B) ⊖ B ≠ A ✗ (can’t cleanly remove)

To truly remove chunks from the root, use compact() which rebuilds the engram without deleted files.

§Arguments

logical_path - Path of the file to remove
verbose - Print progress information

§Returns

io::Result<()> indicating success or failure

§Examples

use embeddenator_fs::{EmbrFS, ReversibleVSAConfig};

let mut fs = EmbrFS::new();
let config = ReversibleVSAConfig::default();

fs.ingest_directory("./data", false, &config).unwrap();
fs.remove_file("old_file.txt", true).unwrap();
// File marked as deleted, won't be extracted

Source

pub fn modify_file<P: AsRef<Path>>( &mut self, file_path: P, logical_path: String, verbose: bool, config: &ReversibleVSAConfig, ) -> Result<()>

Modify an existing file in the engram (incremental update)

This method updates a file’s content by removing the old version and adding the new version. It’s equivalent to remove_file + add_file.

§Algorithm

Mark old file as deleted
Re-encode new file content
Bundle new chunks with root
Add new file entry to manifest

§Trade-offs

Old chunks remain in codebook (use compact() to clean up)
Root contains both old and new chunk contributions (slight noise)
Fast operation, doesn’t require rebuilding entire engram

§Arguments

file_path - Path to the file on disk (new content)
logical_path - Path of the file in the engram
verbose - Print progress information
config - VSA encoding configuration

§Returns

io::Result<()> indicating success or failure

§Examples

use embeddenator_fs::{EmbrFS, ReversibleVSAConfig};
use std::path::Path;

let mut fs = EmbrFS::new();
let config = ReversibleVSAConfig::default();

fs.ingest_directory("./data", false, &config).unwrap();

// Later, modify a file
fs.modify_file("./data/updated.txt", "data/updated.txt".to_string(), true, &config).unwrap();

Source

pub fn compact( &mut self, verbose: bool, config: &ReversibleVSAConfig, ) -> Result<()>

Compact the engram by rebuilding without deleted files

This operation rebuilds the engram from scratch, excluding all files marked as deleted. It’s the only way to truly remove old chunks from the root vector and codebook.

§Algorithm

Create new empty engram
Re-bundle all non-deleted files
Reassign chunk IDs sequentially
Replace old engram with compacted version

§Performance

Time complexity: O(N) where N = total bytes of non-deleted files
Expensive operation, run periodically (not after every deletion)
Recommended: compact when deleted files exceed 20-30% of total

§Benefits

Reclaims space from deleted chunks
Reduces root vector noise from obsolete data
Resets chunk IDs to sequential order
Maintains bit-perfect reconstruction of kept files

§Arguments

verbose - Print progress information
config - VSA encoding configuration

§Returns

io::Result<()> indicating success or failure

§Examples

use embeddenator_fs::{EmbrFS, ReversibleVSAConfig};

let mut fs = EmbrFS::new();
let config = ReversibleVSAConfig::default();

fs.ingest_directory("./data", false, &config).unwrap();
fs.remove_file("old1.txt", false).unwrap();
fs.remove_file("old2.txt", false).unwrap();

// After many deletions, compact to reclaim space
fs.compact(true, &config).unwrap();

Source

pub fn save_engram<P: AsRef<Path>>(&self, path: P) -> Result<()>

Save engram to file

Source

pub fn load_engram<P: AsRef<Path>>(path: P) -> Result<Engram>

Load engram from file

Source

pub fn save_manifest<P: AsRef<Path>>(&self, path: P) -> Result<()>

Save manifest to JSON file

Source

pub fn load_manifest<P: AsRef<Path>>(path: P) -> Result<Manifest>

Load manifest from JSON file

Source

pub fn load<P: AsRef<Path>, Q: AsRef<Path>>( engram_path: P, manifest_path: Q, ) -> Result<Self>

Load an EmbrFS from engram and manifest files

Automatically detects if the engram was created with holographic mode and sets up the appropriate encoder for extraction.

§Arguments

engram_path - Path to the engram file
manifest_path - Path to the manifest JSON file

§Returns

io::Result<EmbrFS> with the loaded engram and manifest

Source

pub fn extract<P: AsRef<Path>>( engram: &Engram, manifest: &Manifest, output_dir: P, verbose: bool, config: &ReversibleVSAConfig, ) -> Result<()>

Extract files from engram to directory with guaranteed reconstruction

This method guarantees 100% bit-perfect reconstruction by applying stored corrections after decoding each chunk.

§Reconstruction Process

For each chunk:

Decode: SparseVec → decoded_data
Apply correction: decoded_data + correction → original_data
Verify: Hash matches stored hash (guaranteed by construction)

§Arguments

engram - The engram containing encoded data and corrections
manifest - File metadata and chunk mappings
output_dir - Directory to write extracted files
verbose - Print progress information
config - VSA decoding configuration

§Returns

io::Result<()> indicating success or failure

Source

pub fn extract_with_resonator<P: AsRef<Path>>( &self, output_dir: P, verbose: bool, config: &ReversibleVSAConfig, ) -> Result<()>

Extract files using resonator-enhanced pattern completion with guaranteed reconstruction

Performs filesystem extraction with intelligent recovery capabilities powered by resonator networks. When chunks are missing from the codebook, the resonator attempts pattern completion to reconstruct the lost data, enabling extraction even from partially corrupted or incomplete engrams.

§Reconstruction Guarantee

Even with resonator-assisted recovery, corrections are applied to guarantee bit-perfect reconstruction. The process is:

Try to get chunk from codebook
If missing, use resonator to recover approximate chunk
Apply correction from CorrectionStore
Result is guaranteed bit-perfect (if correction exists)

§How it works

For each file chunk, check if it exists in the engram codebook
If missing, use the resonator to project a query vector onto known patterns
Apply stored corrections for guaranteed accuracy
Reconstruct the file from available and recovered chunks
If no resonator is configured, falls back to standard extraction

§Why this matters

Enables 100% reconstruction even with missing chunks
Provides fault tolerance for distributed storage scenarios
Supports hierarchical recovery at multiple levels of the storage stack
Maintains data integrity through pattern-based completion

§Arguments

output_dir - Directory path where extracted files will be written
verbose - Whether to print progress information during extraction
config - VSA configuration for encoding/decoding

§Returns

io::Result<()> indicating success or failure of the extraction operation

§Examples

use embeddenator_fs::{EmbrFS, Resonator, ReversibleVSAConfig};
use std::path::Path;

let mut fs = EmbrFS::new();
let resonator = Resonator::new();
let config = ReversibleVSAConfig::default();
fs.set_resonator(resonator);

// Assuming fs has been populated with data...
let result = fs.extract_with_resonator("/tmp/output", true, &config);
assert!(result.is_ok());

Source

pub fn bundle_hierarchically( &self, max_level_sparsity: usize, verbose: bool, _config: &ReversibleVSAConfig, ) -> Result<HierarchicalManifest>

Perform hierarchical bundling with path role binding and permutation tagging

Creates multi-level engram structures where path components are encoded using permutation operations to create distinct representations at each level. This enables efficient hierarchical retrieval and reconstruction.

§How it works

Split file paths into components (e.g., “a/b/c.txt” → [“a”, “b”, “c.txt”])
For each level, apply permutation based on path component hash
Bundle representations level-by-level with sparsity control
Create sub-engrams for intermediate nodes

§Why this matters

Enables scalable hierarchical storage beyond flat bundling limits
Path-based retrieval without full engram traversal
Maintains semantic relationships through permutation encoding
Supports efficient partial reconstruction

§Arguments

max_level_sparsity - Maximum non-zero elements per level bundle
verbose - Whether to print progress information

§Returns

HierarchicalManifest describing the multi-level structure

§Examples

use embeddenator_fs::{EmbrFS, ReversibleVSAConfig};

let fs = EmbrFS::new();
let config = ReversibleVSAConfig::default();
// Assuming files have been ingested...

let hierarchical = fs.bundle_hierarchically(500, false, &config);
assert!(hierarchical.is_ok());

Source

pub fn bundle_hierarchically_with_options( &self, max_level_sparsity: usize, max_chunks_per_node: Option<usize>, verbose: bool, _config: &ReversibleVSAConfig, ) -> Result<HierarchicalManifest>

Like bundle_hierarchically, but supports an optional deterministic cap on chunk_ids per node.

If max_chunks_per_node is set and a node would exceed that many chunk_ids, the node becomes a router with empty chunk_ids, and deterministic shard children are created each containing a bounded subset of chunk_ids.

Source

pub fn extract_hierarchically<P: AsRef<Path>>( &self, hierarchical: &HierarchicalManifest, output_dir: P, verbose: bool, config: &ReversibleVSAConfig, ) -> Result<()>

Extract files from hierarchical manifest with manifest-guided traversal

Performs hierarchical extraction by traversing the manifest levels and reconstructing files from sub-engrams. This enables efficient extraction from complex hierarchical structures without loading the entire engram.

§How it works

Traverse manifest levels from root to leaves
For each level, locate relevant sub-engrams
Reconstruct file chunks using inverse permutation operations
Assemble complete files from hierarchical components

§Why this matters

Enables partial extraction from large hierarchical datasets
Maintains bit-perfect reconstruction accuracy
Supports efficient path-based queries and retrieval
Scales to complex directory structures

§Arguments

hierarchical - The hierarchical manifest to extract from
output_dir - Directory path where extracted files will be written
verbose - Whether to print progress information during extraction

§Returns

io::Result<()> indicating success or failure of the hierarchical extraction

§Examples

use embeddenator_fs::{EmbrFS, ReversibleVSAConfig};

let fs = EmbrFS::new();
let config = ReversibleVSAConfig::default();
// Assuming hierarchical manifest was created...
// let hierarchical = fs.bundle_hierarchically(500, true).unwrap();

// fs.extract_hierarchically(&hierarchical, "/tmp/output", true, &config)?;

Trait Implementations§

Source §

impl Default for EmbrFS

Source §

fn default() -> Self

Returns the “default value” for a type. Read more

Auto Trait Implementations§

§

impl UnwindSafe for EmbrFS

Blanket Implementations§

Source §

impl<T> Any for T
where T: 'static + ?Sized,

Source §

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more

Source §

impl<T> Borrow<T> for T
where T: ?Sized,

Source §

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more

Source §

impl<T> BorrowMut<T> for T
where T: ?Sized,

Source §

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more

Source §

impl<T> From<T> for T

Source §

fn from(t: T) -> T

Returns the argument unchanged.

Source §

impl<T, U> Into for T
where U: From<T>,

Source §

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

Source §

impl<T> IntoEither for T

Source §

fn into_either(self, into_left: bool) -> Either<Self, Self>

Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise. Read more

Source §

fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
where F: FnOnce(&Self) -> bool,

Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise. Read more

Source §

impl<T> Pointable for T

Source §

const ALIGN: usize

The alignment of pointer.

Source §

type Init = T

The type for initializers.

Source §

unsafe fn init(init: <T as Pointable>::Init) -> usize

Initializes a with the given initializer. Read more

Source §

unsafe fn deref<'a>(ptr: usize) -> &'a T

Dereferences the given pointer. Read more

Source §

unsafe fn deref_mut<'a>(ptr: usize) -> &'a mut T

Mutably dereferences the given pointer. Read more

Source §

unsafe fn drop(ptr: usize)

Drops the object pointed to by the given pointer. Read more

Source §

impl<T> Same for T

Source §

type Output = T

Should always be Self

Source §

impl<T, U> TryFrom for T
where U: Into<T>,

Source §

type Error = Infallible

The type returned in the event of a conversion error.

Source §

fn try_from(value: U) -> Result<T, <T as TryFrom>::Error>

Performs the conversion.

Source §

impl<T, U> TryInto for T
where U: TryFrom<T>,

Source §

type Error = >::Error

The type returned in the event of a conversion error.

Source §

fn try_into(self) -> Result<U, >::Error>

Performs the conversion.

Source §

impl<V, T> VZip<V> for T
where V: MultiLane<T>,

Source §

Struct EmbrFS Copy item path

§100% Reconstruction Guarantee

§Holographic Mode

§Examples

Fields§

Implementations§

impl EmbrFS

pub fn new() -> Self

§Examples

pub fn new_holographic() -> Self

§Examples

pub fn is_holographic(&self) -> bool

pub fn chunk_size(&self) -> usize

pub fn set_resonator(&mut self, resonator: Resonator)

§How it works

§Why this matters

§Arguments

§Examples

pub fn correction_stats(&self) -> CorrectionStats

§Examples

pub fn ingest_directory<P: AsRef<Path>>( &mut self, dir: P, verbose: bool, config: &ReversibleVSAConfig, ) -> Result<()>

pub fn ingest_directory_with_prefix<P: AsRef<Path>>( &mut self, dir: P, logical_prefix: Option<&str>, verbose: bool, config: &ReversibleVSAConfig, ) -> Result<()>

pub fn ingest_file<P: AsRef<Path>>( &mut self, file_path: P, logical_path: String, verbose: bool, config: &ReversibleVSAConfig, ) -> Result<()>

§Correction Process

§Arguments

§Returns

pub fn add_file<P: AsRef<Path>>( &mut self, file_path: P, logical_path: String, verbose: bool, config: &ReversibleVSAConfig, ) -> Result<()>

§Algorithm

§Performance

§Arguments

§Returns

§Examples

pub fn remove_file(&mut self, logical_path: &str, verbose: bool) -> Result<()>

§Algorithm

§Note on VSA Limitations

§Arguments

§Returns

§Examples

pub fn modify_file<P: AsRef<Path>>( &mut self, file_path: P, logical_path: String, verbose: bool, config: &ReversibleVSAConfig, ) -> Result<()>

§Algorithm

§Trade-offs

§Arguments

§Returns

§Examples

pub fn compact( &mut self, verbose: bool, config: &ReversibleVSAConfig, ) -> Result<()>

§Algorithm

§Performance

§Benefits

§Arguments

§Returns

§Examples

pub fn save_engram<P: AsRef<Path>>(&self, path: P) -> Result<()>

pub fn load_engram<P: AsRef<Path>>(path: P) -> Result<Engram>

pub fn save_manifest<P: AsRef<Path>>(&self, path: P) -> Result<()>

pub fn load_manifest<P: AsRef<Path>>(path: P) -> Result<Manifest>

pub fn load<P: AsRef<Path>, Q: AsRef<Path>>( engram_path: P, manifest_path: Q, ) -> Result<Self>

§Arguments

§Returns

pub fn extract<P: AsRef<Path>>( engram: &Engram, manifest: &Manifest, output_dir: P, verbose: bool, config: &ReversibleVSAConfig, ) -> Result<()>

§Reconstruction Process

§Arguments

§Returns

pub fn extract_with_resonator<P: AsRef<Path>>( &self, output_dir: P, verbose: bool, config: &ReversibleVSAConfig, ) -> Result<()>

§Reconstruction Guarantee

§How it works

§Why this matters

§Arguments

§Returns

§Examples

pub fn bundle_hierarchically( &self, max_level_sparsity: usize, verbose: bool, _config: &ReversibleVSAConfig, ) -> Result<HierarchicalManifest>

§How it works

§Why this matters

§Arguments

§Returns

§Examples

pub fn bundle_hierarchically_with_options( &self, max_level_sparsity: usize, max_chunks_per_node: Option<usize>, verbose: bool, _config: &ReversibleVSAConfig, ) -> Result<HierarchicalManifest>

pub fn extract_hierarchically<P: AsRef<Path>>( &self, hierarchical: &HierarchicalManifest, output_dir: P, verbose: bool, config: &ReversibleVSAConfig, ) -> Result<()>

§How it works

§Why this matters

§Arguments

Struct EmbrFS

impl<T> Any for T
where T: 'static + ?Sized,

impl<T> Borrow<T> for T
where T: ?Sized,

impl<T> BorrowMut<T> for T
where T: ?Sized,

impl<T, U> Into<U> for T
where U: From<T>,

fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
where F: FnOnce(&Self) -> bool,

impl<T, U> TryFrom<U> for T
where U: Into<T>,

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

impl<V, T> VZip<V> for T
where V: MultiLane<T>,