pub enum DataCommands {
Audit {
file: PathBuf,
num_classes: usize,
input_column: String,
label_column: String,
preamble_prefix: Option<String>,
},
Split {
file: PathBuf,
train: f64,
val: f64,
test: f64,
label_column: String,
seed: u64,
output: PathBuf,
},
Decontaminate {
file: PathBuf,
reference: Vec<PathBuf>,
ngram: usize,
threshold: f64,
json: bool,
},
Balance {
file: PathBuf,
strategy: String,
label_column: String,
num_classes: Option<usize>,
seed: u64,
output: Option<PathBuf>,
},
}Expand description
Data quality pipeline subcommands (powered by alimentar).
Thin CLI wrappers around alimentar’s data utilities.
Variants§
Audit
Audit a JSONL classification dataset for quality issues
Fields
Split
Stratified train/val/test split preserving class proportions
Fields
Decontaminate
Check training data for benchmark contamination via n-gram overlap
Fields
Balance
Resample dataset to address class imbalance
Fields
Trait Implementations§
Source§impl Debug for DataCommands
impl Debug for DataCommands
Source§impl FromArgMatches for DataCommands
impl FromArgMatches for DataCommands
Source§fn from_arg_matches(__clap_arg_matches: &ArgMatches) -> Result<Self, Error>
fn from_arg_matches(__clap_arg_matches: &ArgMatches) -> Result<Self, Error>
Source§fn from_arg_matches_mut(
__clap_arg_matches: &mut ArgMatches,
) -> Result<Self, Error>
fn from_arg_matches_mut( __clap_arg_matches: &mut ArgMatches, ) -> Result<Self, Error>
Source§fn update_from_arg_matches(
&mut self,
__clap_arg_matches: &ArgMatches,
) -> Result<(), Error>
fn update_from_arg_matches( &mut self, __clap_arg_matches: &ArgMatches, ) -> Result<(), Error>
Assign values from
ArgMatches to self.Source§fn update_from_arg_matches_mut<'b>(
&mut self,
__clap_arg_matches: &mut ArgMatches,
) -> Result<(), Error>
fn update_from_arg_matches_mut<'b>( &mut self, __clap_arg_matches: &mut ArgMatches, ) -> Result<(), Error>
Assign values from
ArgMatches to self.Source§impl Subcommand for DataCommands
impl Subcommand for DataCommands
Source§fn augment_subcommands<'b>(__clap_app: Command) -> Command
fn augment_subcommands<'b>(__clap_app: Command) -> Command
Source§fn augment_subcommands_for_update<'b>(__clap_app: Command) -> Command
fn augment_subcommands_for_update<'b>(__clap_app: Command) -> Command
Append to
Command so it can instantiate self via
FromArgMatches::update_from_arg_matches_mut Read moreSource§fn has_subcommand(__clap_name: &str) -> bool
fn has_subcommand(__clap_name: &str) -> bool
Test whether
Self can parse a specific subcommandAuto Trait Implementations§
impl Freeze for DataCommands
impl RefUnwindSafe for DataCommands
impl Send for DataCommands
impl Sync for DataCommands
impl Unpin for DataCommands
impl UnsafeUnpin for DataCommands
impl UnwindSafe for DataCommands
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more
Source§impl<T> Instrument for T
impl<T> Instrument for T
Source§fn instrument(self, span: Span) -> Instrumented<Self>
fn instrument(self, span: Span) -> Instrumented<Self>
Source§fn in_current_span(self) -> Instrumented<Self>
fn in_current_span(self) -> Instrumented<Self>
Source§impl<T> IntoEither for T
impl<T> IntoEither for T
Source§fn into_either(self, into_left: bool) -> Either<Self, Self>
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left is true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read moreSource§fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left(&self) returns true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read more