Skip to main content

Crate datasynth_core

Crate datasynth_core 

Source
Expand description

§synth-core

Core domain models, traits, and distributions for synthetic accounting data generation.

This crate provides the foundational types used throughout the synthetic data factory:

  • Journal Entry models (header and line items)
  • Chart of Accounts structures
  • SAP HANA ACDOCA/BSEG compatible event log formats
  • Generator and Sink traits for extensibility
  • Statistical distribution samplers based on empirical research
  • Templates for realistic data generation (names, descriptions, references)
  • Resource management (memory, disk, CPU) with graceful degradation
  • Streaming infrastructure for real-time data generation

Re-exports§

pub use cpu_monitor::*;
pub use degradation::*;
pub use disk_guard::*;
pub use distributions::*;
pub use error::*;
pub use memory_guard::*;
pub use models::*;
pub use rate_limit::*;
pub use resource_guard::*;
pub use streaming::*;
pub use templates::*;
pub use traits::*;
pub use uuid_factory::*;

Modules§

accounts
Centralized GL account constants for consistent account mapping.
causal
Causal and counterfactual data generation.
compliance
Regulatory compliance module for EU AI Act, GDPR, and related frameworks.
cpu_monitor
CPU load monitoring for preventing system overload.
degradation
Graceful degradation system for handling resource pressure.
diffusion
Diffusion model abstraction for statistical data generation.
disk_guard
Disk space management and guardrails for preventing disk exhaustion.
distributions
Statistical distribution samplers for realistic data generation.
error
Error types for the synthetic data generation system.
llm
LLM provider abstraction for AI-augmented data generation.
memory_guard
Memory management and guardrails for preventing OOM conditions.
models
Domain models for synthetic accounting data generation.
plugins
Built-in plugin examples demonstrating the Plugin SDK.
rate_limit
Rate limiting for controlling generation throughput.
resource_guard
Unified resource guard combining memory, disk, and CPU monitoring.
streaming
Streaming infrastructure for real-time data generation.
templates
Template system for realistic data generation.
traits
Core traits for generators, output sinks, post-processors, and plugins.
uuid_factory
Deterministic UUID generation factory for reproducible synthetic data.