Crate axonml_audio

Axonml Audio - Audio Processing Utilities

This crate provides audio processing functionality for the Axonml ML framework:

  • Transforms: Resampling, spectrograms, MFCC, pitch shifting, noise addition
  • Datasets: Audio classification, command recognition, music genre, speaker ID
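To make the transform categories above concrete, here is a minimal std-only sketch of two of the simplest waveform operations: peak normalization and linear-interpolation resampling. The function names `normalize_peak` and `resample_linear` are hypothetical and chosen for illustration only. This is not the implementation behind `NormalizeAudio` or `Resample` in this crate, which may use different algorithms and signatures.

```rust
/// Illustrative only: scale a waveform so its largest absolute sample
/// equals `target_peak`. Silent (all-zero) or empty input is returned unchanged.
pub fn normalize_peak(samples: &[f32], target_peak: f32) -> Vec<f32> {
    let peak = samples.iter().fold(0.0_f32, |m, s| m.max(s.abs()));
    if peak == 0.0 {
        return samples.to_vec();
    }
    let gain = target_peak / peak;
    samples.iter().map(|s| s * gain).collect()
}

/// Illustrative only: resample by linear interpolation between neighbouring
/// samples. No anti-aliasing filter is applied, so downsampling can alias.
pub fn resample_linear(samples: &[f32], from_hz: u32, to_hz: u32) -> Vec<f32> {
    if samples.is_empty() || from_hz == 0 || to_hz == 0 {
        return Vec::new();
    }
    // Output length scales with the ratio of the two sample rates.
    let out_len = (samples.len() as u64 * to_hz as u64 / from_hz as u64) as usize;
    // Distance, in input samples, between consecutive output samples.
    let step = from_hz as f64 / to_hz as f64;
    let last = samples.len() - 1;
    (0..out_len)
        .map(|i| {
            let pos = i as f64 * step;
            let idx = pos.floor() as usize;
            let frac = (pos - idx as f64) as f32;
            // Clamp indices so the final output samples hold the last input value.
            let a = samples[idx.min(last)];
            let b = samples[(idx + 1).min(last)];
            a + (b - a) * frac
        })
        .collect()
}
```

Linear interpolation is the easiest resampler to reason about, but a production resampler would normally low-pass filter before reducing the sample rate.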

Example

use axonml_audio::prelude::*;

// Create a synthetic command dataset
let dataset = SyntheticCommandDataset::small();

// Apply mel spectrogram transform
let mel = MelSpectrogram::new(16000);
let (waveform, label) = dataset.get(0).unwrap();
let spectrogram = mel.apply(&waveform);
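For background on what a mel spectrogram's frequency axis represents, the sketch below converts between hertz and mels and spaces band centres evenly on the mel scale. It assumes the common HTK-style formula, mel = 2595 · log10(1 + f / 700). Whether `MelSpectrogram` uses this convention is not stated in these docs, and the helper names `hz_to_mel`, `mel_to_hz`, and `mel_center_frequencies` are hypothetical.

```rust
/// Illustrative only: HTK-style hertz-to-mel conversion (an assumption;
/// other conventions, such as Slaney's, give different values).
pub fn hz_to_mel(hz: f64) -> f64 {
    2595.0 * (1.0 + hz / 700.0).log10()
}

/// Inverse of `hz_to_mel`.
pub fn mel_to_hz(mel: f64) -> f64 {
    700.0 * (10.0_f64.powf(mel / 2595.0) - 1.0)
}

/// Centre frequencies (in Hz) of `n_mels` bands spaced evenly on the mel
/// scale strictly between `f_min` and `f_max`.
pub fn mel_center_frequencies(n_mels: usize, f_min: f64, f_max: f64) -> Vec<f64> {
    let (m_min, m_max) = (hz_to_mel(f_min), hz_to_mel(f_max));
    // n_mels interior points split the mel range into n_mels + 1 equal steps.
    let step = (m_max - m_min) / (n_mels as f64 + 1.0);
    (1..=n_mels)
        .map(|i| mel_to_hz(m_min + step * i as f64))
        .collect()
}
```

Because the mel scale is roughly logarithmic above about 1 kHz, evenly spaced mel bands are narrow at low frequencies and wide at high frequencies.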

Version: 0.1.0
Author: AutomataNexus Development Team

Re-exports

pub use transforms::AddNoise;
pub use transforms::MelSpectrogram;
pub use transforms::NormalizeAudio;
pub use transforms::PitchShift;
pub use transforms::Resample;
pub use transforms::TimeStretch;
pub use transforms::TrimSilence;
pub use transforms::MFCC;
pub use datasets::AudioClassificationDataset;
pub use datasets::AudioSeq2SeqDataset;
pub use datasets::SyntheticCommandDataset;
pub use datasets::SyntheticMusicDataset;
pub use datasets::SyntheticSpeakerDataset;

Modules

  • datasets: Audio Datasets - Dataset implementations for audio processing tasks
  • prelude: Common imports for audio processing.
  • transforms: Audio Transforms - Signal Processing and Augmentation