Struct ThresholdedForest

Source

pub struct ThresholdedForest<const D: usize> { /* private fields */ }

Expand description

Adaptive-threshold detector composed of a RandomCutForest plus a running EMA of the anomaly-score stream.

Instantiate via crate::ThresholdedForestBuilder. The type parameter D is the per-point dimensionality, pinned at compile time exactly like the bare RandomCutForest.

§Examples

use anomstream_core::ThresholdedForestBuilder;

let mut detector = ThresholdedForestBuilder::<2>::new()
    .num_trees(50)
    .sample_size(64)
    .min_observations(4)
    .seed(42)
    .build()
    .unwrap();
for i in 0..64 {
    let v = f64::from(i) * 0.01;
    let _ = detector.process([v, v + 0.5]).unwrap();
}
let verdict = detector.process([10.0, 10.0]).unwrap();
assert!(verdict.ready());

Implementations§

Source §

impl<const D: usize> ThresholdedForest<D>

Source

pub fn attribution_stability( &self, point: &[f64; D], ) -> RcfResult<AttributionStability>

Inter-tree dispersion of the attribution on point. Delegates to the underlying forest — the threshold layer does not influence attribution.

§Errors

Same as RandomCutForest::attribution_stability.

Source §

impl<const D: usize> ThresholdedForest<D>

Source

pub fn bootstrap(&mut self, points: I) -> RcfResult<BootstrapReport>
where I: IntoIterator<Item = [f64; D]>,

Replay historical points through the thresholded detector, folding each one into the forest and the score-stream EMA so the adaptive threshold is hot before the first live point.

Graded verdicts produced during the replay are discarded — they would be misleading for historical data. The detector is ready for live traffic as soon as BootstrapReport::final_observations passes the configured min_observations threshold.

Non-finite points are skipped and tallied in the report.

§Errors

Propagates Self::process failures other than RcfError::NaNValue (absorbed and counted as a skip).

Source §

impl<const D: usize> ThresholdedForest<D>

Source

pub fn group_scores( &self, point: &[f64; D], groups: &FeatureGroups, ) -> RcfResult<GroupScores>

Decompose the anomaly attribution of point over groups. Delegates to the underlying forest’s attribution — the adaptive threshold layer does not influence the decomposition.

§Errors

Same as crate::RandomCutForest::group_scores.

Source §

impl<const D: usize> ThresholdedForest<D>

Source

pub fn to_bytes(&self) -> RcfResult<Vec<u8>>

Serialise the thresholded detector into a versioned binary blob.

The payload carries the underlying forest, the threshold configuration, and the EMA statistics — enough for a receiver to resume scoring and emitting graded verdicts without a warmup gap.

§Errors

Returns RcfError::SerializationFailed when the underlying postcard encoder rejects the payload.

Source

pub fn from_bytes(bytes: &[u8]) -> RcfResult<Self>

Reload a thresholded detector previously produced by to_bytes.

§Errors

RcfError::DeserializationFailed when the byte slice is too short to hold the version prefix, longer than MAX_DESERIALIZE_BYTES, or the postcard payload is malformed.
RcfError::IncompatibleVersion when the embedded version does not match THRESHOLDED_PERSISTENCE_VERSION.

§Security

Designed for trusted checkpoints — see the module-level # Security section. Use Self::from_bytes_with_max_size when the deployment’s expected payload exceeds MAX_DESERIALIZE_BYTES.

Source

pub fn from_bytes_with_max_size(bytes: &[u8], max: usize) -> RcfResult<Self>

Variant of Self::from_bytes with a caller-supplied byte-length cap.

§Errors

Same as Self::from_bytes with max replacing MAX_DESERIALIZE_BYTES.

§Security

See module-level # Security notes.

Source

pub fn to_path(&self, path: impl AsRef<Path>) -> RcfResult<()>

Atomically serialise the thresholded detector to path. Same atomic write discipline as RandomCutForest::to_path.

§Errors

RcfError::SerializationFailed for any filesystem or encoder failure.

Source

pub fn from_path(path: impl AsRef<Path>) -> RcfResult<Self>

Reload a thresholded detector from path.

§Errors

RcfError::DeserializationFailed when the file cannot be read, exceeds MAX_DESERIALIZE_BYTES, or the payload is malformed.
RcfError::IncompatibleVersion when the embedded version does not match THRESHOLDED_PERSISTENCE_VERSION.

§Security

Inherits the trust model of Self::from_bytes.

Source §

impl<const D: usize> ThresholdedForest<D>

Source

pub fn from_parts( forest: RandomCutForest<D>, thresholded: ThresholdedConfig, ) -> RcfResult<Self>

Low-level constructor used by crate::ThresholdedForestBuilder::build.

Both forest and thresholded are expected to have been validated upstream; this function only wires them together and constructs the EMA.

§Errors

Propagates EmaStats::new failures (non-finite decay etc.).

Source

pub fn with_metrics_sink(self, sink: Arc<dyn MetricsSink>) -> Self

Install a crate::MetricsSink — every subsequent process / score_only call emits counters and histograms into it. Does not propagate to the underlying forest; install on the forest separately if you also want low-level rcf_updates_total / rcf_score / rcf_deletes_total events.

Source

pub fn metrics_sink(&self) -> &Arc<dyn MetricsSink>

Read-only handle to the installed threshold-layer sink.

Source

pub fn forest(&self) -> &RandomCutForest<D>

Read-only access to the underlying forest.

Source

pub fn forest_config(&self) -> &RcfConfig

Read-only access to the forest configuration.

Source

pub fn thresholded_config(&self) -> &ThresholdedConfig

Threshold-layer configuration.

Source

pub fn stats(&self) -> &EmaStats

Running statistics of the anomaly-score stream.

Source

pub fn current_threshold(&self) -> f64

Current adaptive threshold. Clamped to the configured floor whenever the detector has not yet accumulated enough observations to trust the running statistic (stddev under ZSigma, quantile under Quantile).

Source

pub fn process(&mut self, point: [f64; D]) -> RcfResult<AnomalyGrade>

Score point against the current forest, grade it against the adaptive threshold, insert it into the forest, then fold the score into the running statistics.

The first call returns a warming-up verdict (ready = false, is_anomaly = false) because the forest holds no leaves yet. Subsequent calls within the min_observations warmup window also return ready = false.

§Errors

RcfError::NaNValue when the point contains a non-finite component.
Any error bubbled up from RandomCutForest::update or RandomCutForest::score.

Source

pub fn score_only(&self, point: &[f64; D]) -> RcfResult<AnomalyGrade>

Score point and grade it without touching the forest or the running statistics. Useful for re-evaluating a point against a snapshot of the model without contaminating the training stream.

On an empty forest (no points have been inserted yet), returns a warming-up verdict (ready = false, is_anomaly = false) rather than an error — mirrors Self::process’s cold-start handling so callers can query the detector during the warmup window without special-casing the empty case.

§Errors

RcfError::NaNValue when the point contains a non-finite component.
Any other error bubbled up from RandomCutForest::score.

Source

pub fn attribution(&self, point: &[f64; D]) -> RcfResult<DiVector>

Compute the per-feature attribution of point’s anomaly score against the underlying forest. Forwarded to RandomCutForest::attribution; the threshold layer has no bearing on attribution.

§Errors

Same as RandomCutForest::attribution.

Source

pub fn score_only_many( &self, points: &[[f64; D]], ) -> RcfResult<Vec<AnomalyGrade>>

Bulk-score a batch of points without touching the threshold layer’s stats. Returns AnomalyGrades graded against the current adaptive threshold — identical to what Self::score_only would emit per point, but parallelised across the batch via rayon when the parallel feature is enabled.

§Errors

Propagates any Self::score_only error hit while processing the batch.

Source

pub fn attribution_many(&self, points: &[[f64; D]]) -> RcfResult<Vec<DiVector>>

Bulk per-feature attribution. Delegates to RandomCutForest::attribution_many.

§Errors

Same as RandomCutForest::attribution_many.

Source

pub fn forensic_baseline( &self, point: &[f64; D], ) -> RcfResult<ForensicBaseline<D>>

Imputation-like forensic baseline. Delegates to RandomCutForest::forensic_baseline.

§Errors

Same as RandomCutForest::forensic_baseline.

Source

pub fn score_many_early_term( &self, points: &[[f64; D]], config: EarlyTermConfig, ) -> RcfResult<Vec<EarlyTermScore>>

Bulk early-termination scoring. Delegates to RandomCutForest::score_many_early_term — the threshold layer does not alter the scoring path.

§Errors

Same as RandomCutForest::score_many_early_term.

Source

pub fn score_early_term( &self, point: &[f64; D], config: EarlyTermConfig, ) -> RcfResult<EarlyTermScore>

Early-termination variant of the scoring path — delegates to RandomCutForest::score_early_term. Does not update the thresholded layer’s stats (this is a read path, not a training path).

§Errors

Same as RandomCutForest::score_early_term.

Source

pub fn reset_stats(&mut self)

Drop every statistic and warm-up sample. The underlying forest is left untouched — callers who want a full reset should rebuild via the builder. Used by tests and by callers that want to re-enter a warmup phase after a major regime change.

Source

pub fn delete(&mut self, point_idx: usize) -> RcfResult<bool>

Retract a previously-observed point from the underlying forest by its point_idx. Delegates to RandomCutForest::delete — the threshold layer’s stats are left untouched (they already reflect the score that was emitted when the point was processed).

§Errors

Same as RandomCutForest::delete.

Source

pub fn delete_by_value(&mut self, point: &[f64; D]) -> RcfResult<usize>

Retract every point whose stored value bit-matches point. Delegates to RandomCutForest::delete_by_value.

§Errors

Same as RandomCutForest::delete_by_value.

Source

pub fn process_indexed( &mut self, point: [f64; D], ) -> RcfResult<(usize, AnomalyGrade)>

Same as Self::process but returns the point_idx the underlying forest assigned to the fresh observation, paired with the usual graded verdict. Callers that want to later retract the observation via Self::delete should store the index from this call.

§Errors

Same as Self::process.

Source

pub fn process_at( &mut self, point: [f64; D], timestamp: u64, ) -> RcfResult<AnomalyGrade>

Timestamped variant of Self::process — tags the freshly inserted point with timestamp so callers can later prune history via RandomCutForest::delete_before. Returns the same graded verdict as Self::process.

§Errors

Same as Self::process.

Source

pub fn process_indexed_at( &mut self, point: [f64; D], timestamp: u64, ) -> RcfResult<(usize, AnomalyGrade)>

Timestamped variant of Self::process_indexed — records the caller-supplied timestamp against the fresh point_idx.

§Errors

Same as Self::process_indexed.

Source

pub fn delete_before(&mut self, cutoff: u64) -> RcfResult<usize>

Retract every point whose timestamp is strictly less than cutoff. Forwards to RandomCutForest::delete_before.

§Errors

Propagates RandomCutForest::delete_before failures.

Trait Implementations§

Source §

impl<const D: usize> Debug for ThresholdedForest<D>

Source §

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter. Read more

Source §

impl<'de, const D: usize> Deserialize<'de> for ThresholdedForest<D>

Source §

fn deserialize<D>(deserializer: D) -> Result<Self, D::Error>
where __D: Deserializer<'de>,

Deserialize this value from the given Serde deserializer. Read more

Source §

impl<const D: usize> ForestSnapshot for ThresholdedForest<D>

Source §

fn snapshot_num_trees(&self) -> usize

Number of trees in the forest.

Source §

fn snapshot_sample_size(&self) -> usize

Per-tree reservoir capacity.

Source §

fn snapshot_dimension(&self) -> usize

Per-point compile-time dimensionality.

Source §

fn snapshot_live_points(&self) -> usize

Live points currently referenced by at least one tree.

Source §

fn snapshot_updates_seen(&self) -> u64

Total update calls observed since construction.

Source §

fn snapshot_memory_estimate(&self) -> usize

Pessimistic upper bound on the forest’s memory footprint in bytes (point store + tree arenas + samplers + RNGs).

Source §

impl<const D: usize> Serialize for ThresholdedForest<D>

Source §

fn serialize<S>(&self, serializer: S) -> Result<S::Ok, S::Error>
where S: Serializer,

Serialize this value into the given Serde serializer. Read more

Auto Trait Implementations§

§

impl<const D: usize> !UnwindSafe for ThresholdedForest<D>

Blanket Implementations§

Source §

impl<T> Any for T
where T: 'static + ?Sized,

Source §

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more

Source §

impl<T> Borrow<T> for T
where T: ?Sized,

Source §

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more

Source §

impl<T> BorrowMut<T> for T
where T: ?Sized,

Source §

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more

Source §

impl<T> From<T> for T

Source §

fn from(t: T) -> T

Returns the argument unchanged.

Source §

impl<T, U> Into for T
where U: From<T>,

Source §

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

Source §

impl<T> IntoEither for T

Source §

fn into_either(self, into_left: bool) -> Either<Self, Self>

Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise. Read more

Source §

fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
where F: FnOnce(&Self) -> bool,

Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise. Read more

Source §

impl<T> Pointable for T

Source §

const ALIGN: usize

The alignment of pointer.

Source §

type Init = T

The type for initializers.

Source §

unsafe fn init(init: <T as Pointable>::Init) -> usize

Initializes a with the given initializer. Read more

Source §

unsafe fn deref<'a>(ptr: usize) -> &'a T

Dereferences the given pointer. Read more

Source §

unsafe fn deref_mut<'a>(ptr: usize) -> &'a mut T

Mutably dereferences the given pointer. Read more

Source §

unsafe fn drop(ptr: usize)

Drops the object pointed to by the given pointer. Read more

Source §

impl<T, U> TryFrom for T
where U: Into<T>,

Source §

type Error = Infallible

The type returned in the event of a conversion error.

Source §

fn try_from(value: U) -> Result<T, <T as TryFrom>::Error>

Performs the conversion.

Source §

impl<T, U> TryInto for T
where U: TryFrom<T>,

Source §

type Error = >::Error

The type returned in the event of a conversion error.

Source §

fn try_into(self) -> Result<U, >::Error>

Performs the conversion.

Source §

impl<V, T> VZip<V> for T
where V: MultiLane<T>,

Source §

fn vzip(self) -> V

Source §

Struct ThresholdedForest Copy item path

§Examples

Implementations§

impl<const D: usize> ThresholdedForest<D>

pub fn attribution_stability( &self, point: &[f64; D], ) -> RcfResult<AttributionStability>

§Errors

impl<const D: usize> ThresholdedForest<D>

pub fn bootstrap<I>(&mut self, points: I) -> RcfResult<BootstrapReport>where I: IntoIterator<Item = [f64; D]>,

§Errors

impl<const D: usize> ThresholdedForest<D>

pub fn group_scores( &self, point: &[f64; D], groups: &FeatureGroups, ) -> RcfResult<GroupScores>

§Errors

impl<const D: usize> ThresholdedForest<D>

pub fn to_bytes(&self) -> RcfResult<Vec<u8>>

§Errors

pub fn from_bytes(bytes: &[u8]) -> RcfResult<Self>

§Errors

§Security

pub fn from_bytes_with_max_size(bytes: &[u8], max: usize) -> RcfResult<Self>

§Errors

§Security

pub fn to_path(&self, path: impl AsRef<Path>) -> RcfResult<()>

§Errors

pub fn from_path(path: impl AsRef<Path>) -> RcfResult<Self>

§Errors

§Security

impl<const D: usize> ThresholdedForest<D>

pub fn from_parts( forest: RandomCutForest<D>, thresholded: ThresholdedConfig, ) -> RcfResult<Self>

§Errors

pub fn with_metrics_sink(self, sink: Arc<dyn MetricsSink>) -> Self

pub fn metrics_sink(&self) -> &Arc<dyn MetricsSink>

pub fn forest(&self) -> &RandomCutForest<D>

pub fn forest_config(&self) -> &RcfConfig

pub fn thresholded_config(&self) -> &ThresholdedConfig

pub fn stats(&self) -> &EmaStats

pub fn current_threshold(&self) -> f64

pub fn process(&mut self, point: [f64; D]) -> RcfResult<AnomalyGrade>

§Errors

pub fn score_only(&self, point: &[f64; D]) -> RcfResult<AnomalyGrade>

§Errors

pub fn attribution(&self, point: &[f64; D]) -> RcfResult<DiVector>

§Errors

pub fn score_only_many( &self, points: &[[f64; D]], ) -> RcfResult<Vec<AnomalyGrade>>

§Errors

pub fn attribution_many(&self, points: &[[f64; D]]) -> RcfResult<Vec<DiVector>>

§Errors

pub fn forensic_baseline( &self, point: &[f64; D], ) -> RcfResult<ForensicBaseline<D>>

§Errors

pub fn score_many_early_term( &self, points: &[[f64; D]], config: EarlyTermConfig, ) -> RcfResult<Vec<EarlyTermScore>>

§Errors

pub fn score_early_term( &self, point: &[f64; D], config: EarlyTermConfig, ) -> RcfResult<EarlyTermScore>

§Errors

pub fn reset_stats(&mut self)

pub fn delete(&mut self, point_idx: usize) -> RcfResult<bool>

§Errors

pub fn delete_by_value(&mut self, point: &[f64; D]) -> RcfResult<usize>

§Errors

pub fn process_indexed( &mut self, point: [f64; D], ) -> RcfResult<(usize, AnomalyGrade)>

§Errors

pub fn process_at( &mut self, point: [f64; D], timestamp: u64, ) -> RcfResult<AnomalyGrade>

§Errors

pub fn process_indexed_at( &mut self, point: [f64; D], timestamp: u64, ) -> RcfResult<(usize, AnomalyGrade)>

§Errors

pub fn delete_before(&mut self, cutoff: u64) -> RcfResult<usize>

§Errors

Trait Implementations§

impl<const D: usize> Debug for ThresholdedForest<D>

fn fmt(&self, f: &mut Formatter<'_>) -> Result

impl<'de, const D: usize> Deserialize<'de> for ThresholdedForest<D>

fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where __D: Deserializer<'de>,

impl<const D: usize> ForestSnapshot for ThresholdedForest<D>

fn snapshot_num_trees(&self) -> usize

fn snapshot_sample_size(&self) -> usize

fn snapshot_dimension(&self) -> usize

fn snapshot_live_points(&self) -> usize

fn snapshot_updates_seen(&self) -> u64

fn snapshot_memory_estimate(&self) -> usize

impl<const D: usize> Serialize for ThresholdedForest<D>

fn serialize<__S>(&self, __serializer: __S) -> Result<__S::Ok, __S::Error>where __S: Serializer,

Auto Trait Implementations§

Struct ThresholdedForest

pub fn bootstrap<I>(&mut self, points: I) -> RcfResult<BootstrapReport>
where I: IntoIterator<Item = [f64; D]>,

fn deserialize<D>(deserializer: D) -> Result<Self, D::Error>
where __D: Deserializer<'de>,

fn serialize<S>(&self, serializer: S) -> Result<S::Ok, S::Error>
where S: Serializer,

impl<T> Any for T
where T: 'static + ?Sized,

impl<T> Borrow<T> for T
where T: ?Sized,

impl<T> BorrowMut<T> for T
where T: ?Sized,

impl<T, U> Into<U> for T
where U: From<T>,

fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
where F: FnOnce(&Self) -> bool,

impl<T, U> TryFrom<U> for T
where U: Into<T>,

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

impl<V, T> VZip<V> for T
where V: MultiLane<T>,

impl<T> DeserializeOwned for T
where T: for<'de> Deserialize<'de>,