Expand description
Audio processing for AxonML.
Transforms (MelSpectrogram via rustfft O(n log n), MFCC, Resample, NormalizeAudio, AddNoise, TimeStretch, PitchShift, TrimSilence) and synthetic datasets (SyntheticCommandDataset, SyntheticMusicDataset, SyntheticSpeakerDataset) for command recognition, music genre classification, and speaker identification tasks.
§File
crates/axonml-audio/src/lib.rs
§Author
Andrew Jewell Sr. — AutomataNexus LLC ORCID: 0009-0005-2158-7060
§Updated
April 14, 2026 11:15 PM EST
§Disclaimer
Use at own risk. This software is provided “as is”, without warranty of any kind, express or implied. The author and AutomataNexus shall not be held liable for any damages arising from the use of this software.
Re-exports§
pub use transforms::AddNoise;pub use transforms::MFCC;pub use transforms::MelSpectrogram;pub use transforms::NormalizeAudio;pub use transforms::PitchShift;pub use transforms::Resample;pub use transforms::TimeStretch;pub use transforms::TrimSilence;pub use datasets::AudioClassificationDataset;pub use datasets::AudioSeq2SeqDataset;pub use datasets::SyntheticCommandDataset;pub use datasets::SyntheticMusicDataset;pub use datasets::SyntheticSpeakerDataset;
Modules§
- datasets
- Audio Datasets - Dataset implementations for audio processing tasks
- prelude
- Common imports for audio processing.
- transforms
- Audio Transforms - Signal Processing and Augmentation