Skip to main content

AudioFormatConverter

speech_prep::converter

Struct AudioFormatConverter

pub struct AudioFormatConverter;

Expand description

High-level audio format converter.

Detects common audio container formats and converts WAV/PCM input to the standard format: mono, 16kHz, normalized f32 samples.

§Pipeline Architecture

Input Bytes
    ↓
[Format Detection] ← 6 formats: WAV, MP3, FLAC, Opus, WebM, AAC
    ↓
[WAV Decoding] ← 16/24-bit PCM normalization
    ↓
[Resampling] ← Arbitrary rate → 16kHz (linear interpolation)
    ↓
[Channel Mixing] ← Multi-channel → Mono (simple averaging)
    ↓
StandardAudio (mono, 16kHz, f32)

§Current Scope Limitations

Formats: Only WAV decoding implemented; other formats detected but not decoded
Channel Counts: 1, 2, 4, 6 channels supported
Bit Depths: 16-bit and 24-bit PCM only
Resampling: Linear interpolation (sinc reserved for future)

§Example

use speech_prep::converter::AudioFormatConverter;

let wav_bytes = std::fs::read("audio.wav")?;
let standard = AudioFormatConverter::convert_to_standard(&wav_bytes)?;

assert!(standard.metadata.original_sample_rate > 0);
assert!(standard.metadata.original_channels > 0);

Implementations§

impl AudioFormatConverter

pub const fn new() -> Self

Create a new audio format converter instance.

pub fn convert_to_standard(audio_bytes: &[u8]) -> Result<StandardAudio>

Convert WAV audio bytes to standard format: mono, 16kHz, f32.

This is the primary entry point for the audio normalization pipeline. It composes all 4 stages: format detection, decoding, resampling, and channel mixing.

§Arguments

audio_bytes - Raw audio file bytes

§Returns

StandardAudio with mono 16kHz samples and complete conversion metadata.

§Errors

Returns Error::InvalidInput if:

Format detection fails (not a recognized audio format)
Format is detected but not WAV (only WAV decoding supported)
WAV decoding fails (malformed file, unsupported codec)
Resampling fails (invalid sample rates)
Channel mixing fails (unsupported channel count)

§Performance

Target: <10ms for 3-second audio clip on reference hardware. Actual timing captured in ConversionMetadata.conversion_time_ms.

§Example

use speech_prep::converter::AudioFormatConverter;

let audio_bytes = std::fs::read("recording.wav")?;
let standard = AudioFormatConverter::convert_to_standard(&audio_bytes)?;

assert_eq!(standard.metadata.original_format.as_str(), "wav");
assert!(standard.samples.len() > 0);

Trait Implementations§

impl Clone for AudioFormatConverter

fn clone(&self) -> AudioFormatConverter

Returns a duplicate of the value. Read more

1.0.0 · Source§

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source. Read more

impl Debug for AudioFormatConverter

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter. Read more

impl Default for AudioFormatConverter

fn default() -> AudioFormatConverter

Returns the “default value” for a type. Read more

impl Copy for AudioFormatConverter

Auto Trait Implementations§

impl Freeze for AudioFormatConverter

impl RefUnwindSafe for AudioFormatConverter

impl Send for AudioFormatConverter

impl Sync for AudioFormatConverter

impl Unpin for AudioFormatConverter

impl UnsafeUnpin for AudioFormatConverter

impl UnwindSafe for AudioFormatConverter

Blanket Implementations§

impl<T> Any for T
where T: 'static + ?Sized,

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more

impl<T> Borrow<T> for T
where T: ?Sized,

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more

impl<T> BorrowMut<T> for T
where T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more

impl<T> CloneToUninit for T
where T: Clone,

unsafe fn clone_to_uninit(&self, dest: *mut u8)

🔬This is a nightly-only experimental API. (clone_to_uninit)

Performs copy-assignment from self to dest. Read more

impl<T> From<T> for T

fn from(t: T) -> T

Returns the argument unchanged.

impl<T> Instrument for T

fn instrument(self, span: Span) -> Instrumented<Self>

Instruments this type with the provided Span, returning an Instrumented wrapper. Read more

fn in_current_span(self) -> Instrumented<Self>

Instruments this type with the current Span, returning an Instrumented wrapper. Read more

impl<T, U> Into<U> for T
where U: From<T>,

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

impl<F, T> IntoSample<T> for F
where T: FromSample<F>,

fn into_sample(self) -> T

impl<T> ToOwned for T
where T: Clone,

type Owned = T

The resulting type after obtaining ownership.

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning. Read more

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning. Read more

impl<T, U> TryFrom<U> for T
where U: Into<T>,

type Error = Infallible

The type returned in the event of a conversion error.

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.

impl<T> WithSubscriber for T

fn with_subscriber<S>(self, subscriber: S) -> WithDispatch<Self>
where S: Into<Dispatch>,

Attaches the provided Subscriber to this type, returning a WithDispatch wrapper. Read more

fn with_current_subscriber(self) -> WithDispatch<Self>

Attaches the current default Subscriber to this type, returning a WithDispatch wrapper. Read more