split_by_discriminant 0.6.1

A small utility for partitioning a sequence of items by enum discriminant
Documentation

split_by_discriminant

split_by_discriminant is a lightweight Rust utility for partitioning a sequence of items by the discriminant of an enum.

Table of contents

Core API

  • split_by_discriminant — the simple grouping operation.
  • map_by_discriminant — a more flexible variant that applies separate mapping closures to matched and unmatched items, allowing you to change the output types on the fly.

Both are useful when you need to gather all values of a particular variant, operate on them, and then return them to the original collection.

Feature flags

  • indexmap: use IndexMap/IndexSet for deterministic key iteration order
  • proc_macro: enables macros re-exported from the library to the proc-macro crate

Complete method families

Methods are organized into four families by access pattern and ownership:

Family Removes group? Self borrow Lifetime of returned refs
Immutable ref No &self tied to &self borrow
Mutable ref No &mut self tied to &mut self borrow
Take Yes &mut self full 'items lifetime
Remove Yes &mut self full 'items lifetime

Single-item methods

Family DiscriminantMap SplitWithExtractor
Immutable ref as_ref_simple(id) · as_ref<U>(id) · as_ref_with<S>(id) · map_as_ref(id, f)
Mutable ref map_as_mut(id, f) as_mut_simple(id) · as_mut<U>(id) · as_mut_with<S>(id) · map_as_mut(id, f)
Take take_simple(id) · take_extracted<S>(id)
Remove remove(id) · remove_mapped(id, f) · remove_with(id, f) · remove_others() (same, forwarded)

Batch methods (multiple discriminants at once)

Family DiscriminantMap SplitWithExtractor
Immutable ref map_as_ref_multiple(ids, f) as_ref_multiple_simple(ids) · as_ref_multiple<U>(ids) · as_ref_multiple_with<S>(ids) · map_as_ref_multiple(ids, f)
Mutable ref map_as_mut_multiple(ids, f) as_mut_multiple_simple(ids) · as_mut_multiple<U>(ids) · as_mut_multiple_with<S>(ids) · map_as_mut_multiple(ids, f)
Take take_multiple_simple(ids) · take_multiple_extracted<S>(ids)
Remove remove_multiple(ids) · remove_multiple_mapped(ids, f) · remove_multiple_with(ids, f) (same, forwarded)

extract_with(id, f) / extract_multiple_with(ids, f) (on both types): borrow mutably and return owned U values without removing any groups. They bridge the mutable-ref and take families.

Choose immutable ref (map_as_ref*) when a &self borrow is required or you only need to read. Choose mutable ref (as_mut*, map_as_mut*) when you want to mutate or inspect without removing. Choose take or remove when the extracted values need to outlive the map.

Primary API

split_by_discriminant

Generic function that takes:

  1. An iterable of items (items) whose element type R implements Borrow<T> (e.g. &T, &mut T, or T).
  2. An iterable of discriminants (kinds) to match against; duplicates are ignored.

Returns a DiscriminantMap<T, R> containing:

  • entries: a map from discriminant to a Vec<R> of matching items.
  • others: a Vec<R> of items whose discriminant was not requested.

Type inference normally deduces the return type; you rarely need to annotate it explicitly.

map_by_discriminant

A more flexible variant of split_by_discriminant that accepts two mapping closures. The first closure is applied to items whose discriminant is requested, and the second handles all others. This allows the types of grouped elements and the "others" bucket to differ, and lets you perform on-the-fly transformations during partitioning.

DiscriminantMap<T, G, O> struct

The result of a split operation. Every parameter has a clear responsibility:

Parameter Role
T The underlying enum (or any type with a Discriminant). Used to compute the map keys (Discriminant<T>) and for Borrow<T> bounds on input items.
G Type stored inside each matching group. Defaults to the iterator's item type, but may be transformed by map_by_discriminant (e.g. String, &mut i32, etc.).
O Type stored in the “others” bucket. Defaults to G to make the common case ergonomic, but you can choose a different type to handle unmatched items specially (e.g. map them to () or a count).

The generic trio lets you express use cases where the group and others types differ without resorting to enum or Box<dyn>.

Methods:

Inspection

  • others(&self) — borrow the unmatched items as &[O]. Takes &self; safe to call without a mutable borrow.
  • others_mut(&mut self) — mutably borrow the unmatched items as &mut [O].
  • get(&self, id) — borrow a particular group by discriminant as &[G].
  • get_mut(&mut self, id) — mutably borrow a particular group as GroupMut<'_, G>. The GroupMut wrapper exposes iteration, sorting, and Index<usize> read access while intentionally omitting IndexMut to prevent accidentally writing the wrong variant back into a slot.
  • for_each_group_mut(&mut self, ids, f) — call f(discriminant, GroupMut) once for each discriminant in ids that is present. Use this to mutate several groups in a single pass without tying borrow scopes together.
  • extract_with(&mut self, id, f) — borrow a group mutably and map each &mut T through f: FnMut(&mut T) -> Option<U>, collecting owned U values without removing the group.
  • extract_multiple_with(&mut self, ids, f) — batch variant of extract_with; returns a map of Vec<U> per discriminant.

Move (remove) — remove a group and take ownership of its elements

  • remove(&mut self, id) — remove and return the group as Vec<G>, preserving the full original lifetime when G is a reference.
  • remove_mapped<U>(&mut self, id, f: FnMut(G) -> U) — remove a group and map every element through f by value; returns Option<Vec<U>>.
  • remove_with<U>(&mut self, id, f: FnMut(G) -> Option<U>) — remove a group and filter-map every element through f by value; returns Option<Vec<U>>. Full lifetime preservation.
  • remove_others(&mut self) — remove and return the others vector as Vec<O>. Unlike into_parts, self remains usable for further remove* calls afterward. A second call returns an empty Vec.

Consuming

  • into_parts(self) — consume and return (Map<Discriminant<T>, Vec<G>>, Vec<O>). The concrete map type is HashMap by default; enable the indexmap feature for IndexMap/IndexSet instead.
  • map_all(self, f) — transform every group at once, consuming self.
  • map_others(self, f) — transform the others vector, consuming self.

GroupMut<'a, G>

A newtype wrapper over &'a mut [G] returned by get_mut and yielded by for_each_group_mut. It exposes len, is_empty, as_slice, iter, iter_mut, sort_by, sort_unstable_by, reverse, and Index<usize> for position-based read access. IndexMut is deliberately omitted: writing group[i] = wrong_variant through IndexMut would silently corrupt the discriminant map's invariants. Use iter_mut to mutate field values in place.

Extraction Traits

Three traits handle mutable-extraction scenarios:

  • SimpleExtractFrom<T> — single-variant extractors with zero-annotation call site
  • VariantExtractFrom<T, U> — multi-variant extractors with binding-inferred U
  • ExtractFrom<T, Selector> — multi-field or complex outputs with explicit selector

Three read-only counterpart traits mirror the above but take &T instead of &mut T, enabling the as_ref_* family with only a &self borrow on SplitWithExtractor:

  • SimpleReadFrom<T> — single-variant read-only; enables as_ref_simple (annotation-free) and via blankets as_ref<U> and as_ref_with::<()>.
  • VariantReadFrom<T, U> — multi-variant read-only; enables as_ref<U> for each U.
  • ReadFrom<T, Selector> — GAT-based read-only; enables as_ref_with<S> for any selector.

No automatic blanket from SimpleExtractFromSimpleReadFrom: because extract_from takes &mut T, it is impossible to soundly derive read_from(&T) from it. Both impls must be written separately — the bodies differ only in removing mut from the match arm. #[derive(ExtractFrom)] generates both sets automatically.

See Four-Crate Pattern Guide for trait selection, implementation guidance, and decision trees. The guide covers all traits, blanket impls, and patterns for factory-crate authors.

SplitWithExtractor<T, G, O, E> struct

A thin wrapper around DiscriminantMap that pairs it with an extractor value E. The four type parameters serve these roles:

  • T – the enum/Discriminant target, carried through from the inner split.
  • G – group element type; forwarded from DiscriminantMap.
  • O – others element type; also forwarded and defaults to G when the split is originally constructed.
  • E – the extractor type that implements ExtractFrom<T, S> for one or more selector types S. The extractor is usually a zero-sized local struct; its purpose is to give you a constraint that allows extract::<S> to disambiguate between multiple output types without a closure. Because the impl lives on your local type, the orphan rule is satisfied even when T and the output are foreign.

With this design every parameter can vary independently and has a real use case in the docs and tests.

Methods available directly on SplitWithExtractor:

Inspection

  • others — forwarded from the inner split.
  • others_mut — forwarded from the inner split.
  • get — forwarded from the inner split.
  • get_mut — forwarded from the inner split; returns GroupMut<'_, G>.
  • for_each_group_mut — forwarded from the inner split.

Move (remove) — remove a group and take ownership of its elements

  • remove — forwarded from the inner split; full lifetime preservation.
  • remove_mapped — forwarded from the inner split.
  • remove_with — forwarded from the inner split.
  • remove_others — forwarded from the inner split.
  • take_simple(&mut self, id) — consuming counterpart of as_mut_simple; requires E: SimpleExtractFrom<T>. No turbofish, no annotation — the return type is fully determined by E and T. Returned elements carry the full 'items lifetime.
  • take_extracted<S>(&mut self, id) — like remove_with but uses the bound extractor instead of a closure. Requires E: TakeFrom<G, S>, which is satisfied automatically for any E: ExtractFrom<T, S> when G = &mut T.

Immutable borrow — borrow from a group by shared reference

  • as_ref_simple(&self, id) — fully annotation-free read-only access; requires E: SimpleReadFrom<T>. Takes &self and works with maps built from immutable slices (G = &T).
  • as_ref<U>(&self, id) — read-only access with U inferred from the binding; requires E: VariantReadFrom<T, U>. Every SimpleReadFrom<T> blankets VariantReadFrom<T, Output> automatically.
  • as_ref_with<S>(&self, id) — read-only access with explicit selector; requires E: ReadFrom<T, S>. Supports GAT outputs such as multi-field tuple references.
  • map_as_ref(&self, id, f) — read-only access via inline closure; no extractor trait required.

Mutable reborrow — borrow into a group without removing it

  • as_mut_simple(&mut self, id) — fully annotation-free mutable extraction; requires E: SimpleExtractFrom<T>. The return type is determined entirely by E and T.
  • as_mut<U>(&mut self, id) — mutable extraction with U inferred from binding; requires E: VariantExtractFrom<T, U>. Call once per variant in a separate scope so borrows do not overlap.
  • as_mut_with<S>(&mut self, id) — mutable extraction with explicit selector; requires E: ExtractFrom<T, S>. Use for multi-field outputs or when VariantExtractFrom is not sufficient.

Consuming

  • into_inner(self) -> DiscriminantMap<T, G, O> — unwrap to reach consuming methods (into_parts, map_all, map_others).

Construct with SplitWithExtractor::new(split, extractor).

Four-crate Pattern

The factory crate pattern solves the Rust orphan rule for extractors on foreign enums. A factory crate defines an extractor type and implements extraction traits; downstream callers then use it without needing to implement the traits themselves.

See Four-Crate Pattern Guide for detailed guidance, decision trees, and implementation examples.

Quick example:

# use split_by_discriminant::split_by_discriminant;
# use std::mem::discriminant;
#[derive(Debug)]
enum E { A(i32), B }

let mut data = vec![E::A(1), E::B, E::A(2)];
let a_disc = discriminant(&E::A(0));

// move — returned refs carry full 'items lifetime
let ints: Vec<&mut i32> = {
    let mut split = split_by_discriminant(&mut data, &[a_disc]);
    split.remove_with(a_disc, |e| if let E::A(v) = e { Some(v) } else { None })
        .unwrap()
};
assert_eq!(ints.len(), 2);

Examples

use split_by_discriminant::{split_by_discriminant, SplitWithExtractor, VariantExtractFrom};
use std::mem::discriminant;

#[derive(Debug)]
enum E { A(i32), B(String), C }

struct EExtractor;

impl VariantExtractFrom<E, i32> for EExtractor {
    fn extract_from<'a>(&self, t: &'a mut E) -> Option<&'a mut i32> {
        if let E::A(v) = t { Some(v) } else { None }
    }
}
impl VariantExtractFrom<E, String> for EExtractor {
    fn extract_from<'a>(&self, t: &'a mut E) -> Option<&'a mut String> {
        if let E::B(s) = t { Some(s) } else { None }
    }
}

let mut data = vec![E::A(1), E::B("hello".into()), E::A(2), E::C];
let a_disc = discriminant(&E::A(0));
let b_disc = discriminant(&E::B(String::new()));

let split = split_by_discriminant(&mut data, &[a_disc, b_disc]);
let mut extractor = SplitWithExtractor::new(split, EExtractor);

// U inferred from binding — each call lives in its own scope so &mut borrows
// do not overlap.
{ let ints: Vec<&mut i32>    = extractor.as_mut(a_disc).unwrap(); assert_eq!(ints.len(), 2); }
{ let strs: Vec<&mut String> = extractor.as_mut(b_disc).unwrap(); assert_eq!(strs.len(), 1); }

// Consuming methods are reached via into_inner().
let (_, others) = extractor.into_inner().into_parts();
assert_eq!(others.len(), 1); // E::C

Move-style extraction with full lifetime preservation

When you need the extracted references to outlive the SplitWithExtractor, use take_extracted:

use split_by_discriminant::{split_by_discriminant, SplitWithExtractor, SimpleExtractFrom};
use std::mem::discriminant;

#[derive(Debug, PartialEq)]
enum E { A(i32), B }
struct EExtractor;
impl SimpleExtractFrom<E> for EExtractor {
    type Output = i32;
    fn extract_from<'a>(&self, t: &'a mut E) -> Option<&'a mut i32> {
        if let E::A(v) = t { Some(v) } else { None }
    }
}

let mut data = [E::A(1), E::A(2), E::B];
let a_disc = discriminant(&E::A(0));

// ints outlives the SplitWithExtractor — full 'items lifetime preserved
let mut ints: Vec<&mut i32> = {
    let split = split_by_discriminant(&mut data[..], &[a_disc]);
    let mut ex = SplitWithExtractor::new(split, EExtractor);
    ex.take_extracted::<()>(a_disc).unwrap()
};

*ints[0] = 99;
drop(ints);
assert_eq!(data[0], E::A(99));

remove_mapped — transform every element by value

use split_by_discriminant::split_by_discriminant;
use std::mem::discriminant;

#[derive(Debug)] enum E { A(i32), B }
let mut data = [E::A(1), E::A(2), E::B];
let a_disc = discriminant(&E::A(0));

let mut split = split_by_discriminant(&mut data[..], &[a_disc]);
let labels: Vec<String> = split
    .remove_mapped(a_disc, |e| format!("{:?}", e))
    .unwrap();
assert_eq!(labels, ["A(1)", "A(2)"]);

remove_others — retrieve unmatched items without consuming self

use split_by_discriminant::split_by_discriminant;
use std::mem::discriminant;

#[derive(Debug)] enum E { A(i32), B, C }
let mut data = [E::A(1), E::A(2), E::B, E::C];
let a_disc = discriminant(&E::A(0));

let mut split = split_by_discriminant(&mut data[..], &[a_disc]);

// Remove the unmatched items — split remains usable.
let others: Vec<&mut E> = split.remove_others();
assert_eq!(others.len(), 2); // B and C

// Groups are still intact.
let group: Vec<&mut E> = split.remove(a_disc).unwrap();
assert_eq!(group.len(), 2); // A(1) and A(2)

Other supported input types

You can also pass an owned iterator:

use split_by_discriminant::split_by_discriminant;
use std::mem::discriminant;

#[derive(Debug)] enum E { A(i32), B(String) }

let owned = vec![E::A(4), E::B(String::new())];
let a_disc = discriminant(&E::A(0));
let split = split_by_discriminant(owned.into_iter(), &[a_disc]);
let (groups, _) = split.into_parts();
assert_eq!(groups[&a_disc].len(), 1);

Or use immutable references — as_ref_* and map_as_ref methods are available; as_mut_* and take_* require G = &mut T:

use split_by_discriminant::{split_by_discriminant, DiscriminantMap};
use std::mem::discriminant;

#[derive(Debug)] enum E { A(i32), B(String) }

let data = [E::A(2), E::B(String::new())];
let a_disc = discriminant(&E::A(0));
let split: DiscriminantMap<_, &E> = split_by_discriminant(&data[..], &[a_disc]);
assert_eq!(split.get(a_disc).unwrap().len(), 1);

Use map_by_discriminant when you need to transform matched and unmatched items during partitioning:

use split_by_discriminant::map_by_discriminant;
use std::mem::discriminant;

#[derive(Debug)]
enum E { A(i32), B }

let data = [E::A(1), E::B];
let a_disc = discriminant(&E::A(0));
let b_disc = discriminant(&E::B);

let mut split = map_by_discriminant(&data[..], &[a_disc, b_disc],
    |e| format!("match:{:?}", e),
    |e| format!("other:{:?}", e),
);
assert_eq!(split.get(a_disc).unwrap(), &["match:A(1)".to_string()][..]);

Proc Macros

split_by_discriminant_macros provides a derive macro and helpers for extractor generation. For full API details, configuration options, and examples, see split_by_discriminant_macros/README.md.

Quickstart: #[derive(ExtractFrom)]

The derive macro generates a zero-sized extractor type named <EnumName>Extractor. Use it with SplitWithExtractor to perform extraction without manually writing an extractor type.

use split_by_discriminant_macros::ExtractFrom;
use split_by_discriminant::{split_by_discriminant, SplitWithExtractor};
use std::mem::discriminant;

#[derive(Debug, ExtractFrom)]
enum E { A(i32), B }

let mut data = vec![E::A(1), E::B];
let a_disc = discriminant(&E::A(0));

let split = split_by_discriminant(&mut data, &[a_disc]);
let mut extractor = SplitWithExtractor::new(split, EExtractor);
// EExtractor implements SimpleExtractFrom<E> — no turbofish needed
let ints: Vec<&mut i32> = extractor.as_mut_simple(a_disc).unwrap();

Customizing #[derive(ExtractFrom)] names

The derive macro supports a #[extract_from(...)] attribute to override the generated helper names.

Custom extractor name

By default #[derive(ExtractFrom)] generates a zero-sized extractor named <EnumName>Extractor. Use:

use split_by_discriminant_macros::ExtractFrom;

#[derive(ExtractFrom)]
#[extract_from(extractor = "MyExtractor")]
enum E { A(i32) }

Custom selector name

When the derive must generate selector types (multi-field variants or duplicate field types), the default is Select{Enum}{Variant}. You can override it on a per-variant basis or globally via a format string.

Per-variant override:

use split_by_discriminant_macros::ExtractFrom;

#[derive(ExtractFrom)]
enum E {
    #[extract_from(selector = "MySelector")]
    A(i32, String),
}

Global override (format string, supports {} or {enum}/{variant}):

use split_by_discriminant_macros::ExtractFrom;

#[derive(ExtractFrom)]
#[extract_from(selector = "Custom{enum}{variant}")]
enum E { A(i32, String) }

(The default format is Select{}{}, with the first {} substituted by the enum name and the second by the variant name.)

Empty enum support

By default #[derive(ExtractFrom)] on an empty enum is an error, because no extraction behavior can be generated. You can override this with skip_empty to allow empty enums to compile as a no-op derive:

#[derive(ExtractFrom)]
#[extract_from(skip_empty)]
enum Empty {}

FAQ

When should I use as_ref_* vs as_mut_*?

Use the as_ref_* / map_as_ref family when you only need to read inner fields:

  • Takes &self — no exclusive borrow, so multiple reads can coexist without conflicting borrows.
  • Works with maps built from immutable slices (G = &T), where BorrowMut<T> would not be satisfiable.
  • Requires the extractor to implement SimpleReadFrom<T>, VariantReadFrom<T, U>, or ReadFrom<T, S>. #[derive(ExtractFrom)] generates these automatically.

Use the as_mut_* / map_as_mut family when you need to mutate inner fields in-place. These require G: BorrowMut<T> and take &mut self.

Why does map_as_* require a closure?

The as_mut_simple, as_mut<U>, and as_mut_with<S> methods dispatch extraction through extractor traits compiled into E, which lets the compiler infer the output type from the binding at the call site — zero annotations needed.

The map_as_* variants accept a closure instead. Use them for one-off transformations, foreign enums, or any situation where a dedicated extractor type would be overkill.

Should I implement SimpleExtractFrom or ExtractFrom directly?

  • SimpleExtractFrom<T> — the right choice when your extractor covers exactly one variant with exactly one field. The associated Output type lets as_mut_simple and take_simple work with no annotations.
  • ExtractFrom<T, S> — use this for multi-field outputs (tuples), multiple variants with the same field type, or when you need GAT lifetime parameters in the return type. Use a distinct selector ZST per logical extraction target.

#[derive(ExtractFrom)] from split_by_discriminant_macros automatically picks the right strategy. See the Four-Crate Pattern Guide for full decision trees.

When do I need take_* instead of as_mut_*?

When the extracted references need to outlive the SplitWithExtractor. The as_mut_* family reborrows elements through the map's &mut self borrow; the result cannot escape that scope. The take_* family removes the group from the map and moves each element by value, preserving the full 'items lifetime.

See docs/lifetime-model.md for an annotated walkthrough.

Should I implement SimpleReadFrom alongside SimpleExtractFrom?

Implement SimpleReadFrom<T> whenever as_ref_* access is useful — either as a complement to SimpleExtractFrom<T> (giving both mutable and read-only access) or by itself (making as_mut_* unavailable, enforcing read-only access at the type level).

There is no automatic blanket from SimpleExtractFromSimpleReadFrom because extract_from takes &mut T. When both are wanted, write both impls explicitly — the bodies differ only in removing mut from the match arm. #[derive(ExtractFrom)] generates both impls automatically.

Supported inputs

  • &mut [T] or &mut Vec<T>DiscriminantMap<T, &mut T>
  • &[T] or &Vec<T>DiscriminantMap<T, &T>
  • Any owning iterator, e.g. Vec<T>::into_iter()R = T

Features

  • indexmap — use IndexMap/IndexSet instead of HashMap/HashSet. Enables deterministic iteration order over groups.

Documentation

  • Four-Crate Pattern Guide — Complete implementation guide for factory-crate authors. Covers all mutable and read-only extraction traits, blanket impls, decision trees, and selector patterns.
  • API Matrix Dimensions — Reference guide explaining the five dimensions that organize the full method matrix.
  • Lifetime Model — Annotated walkthrough of reborrow vs. move/take lifetime semantics.
  • v0.5 to v0.6 Migration Guide — Upgrading from v0.5. New idiomatic method names for reference access.
  • v0.4 to v0.5 Migration Guide — Upgrading from v0.4. Method renames and trait changes.

Notes

  • Discriminants can be precomputed with std::mem::discriminant and stored in consts for reuse.
  • Items not matching any requested discriminant are preserved in others in original order.
  • The remove_* methods work on any group element type, including owned values and immutable references.
  • remove_others returns Vec<O> directly (not Option); a second call returns an empty Vec.
  • Source code is human written and carefully reviewed - documentation and tests AI generated to keep them up to date.

Testing

Integration tests and unit tests live in the tests/ directory alongside src/