Skip to main content

AudioFormatConverter

Struct AudioFormatConverter 

Source
pub struct AudioFormatConverter;
Expand description

High-level audio format converter.

Detects common audio container formats and converts WAV/PCM input to the standard format: mono, 16kHz, normalized f32 samples.

§Pipeline Architecture

Input Bytes
    ↓
[Format Detection] ← 6 formats: WAV, MP3, FLAC, Opus, WebM, AAC
    ↓
[WAV Decoding] ← 16/24-bit PCM normalization
    ↓
[Resampling] ← Arbitrary rate → 16kHz (linear interpolation)
    ↓
[Channel Mixing] ← Multi-channel → Mono (simple averaging)
    ↓
StandardAudio (mono, 16kHz, f32)

§Current Scope Limitations

  • Formats: Only WAV decoding implemented; other formats detected but not decoded
  • Channel Counts: 1, 2, 4, 6 channels supported
  • Bit Depths: 16-bit and 24-bit PCM only
  • Resampling: Linear interpolation (sinc reserved for future)

§Example

use speech_prep::converter::AudioFormatConverter;

let wav_bytes = std::fs::read("audio.wav")?;
let standard = AudioFormatConverter::convert_to_standard(&wav_bytes)?;

assert!(standard.metadata.original_sample_rate > 0);
assert!(standard.metadata.original_channels > 0);

Implementations§

Source§

impl AudioFormatConverter

Source

pub const fn new() -> Self

Create a new audio format converter instance.

Source

pub fn convert_to_standard(audio_bytes: &[u8]) -> Result<StandardAudio>

Convert WAV audio bytes to standard format: mono, 16kHz, f32.

This is the primary entry point for the audio normalization pipeline. It composes all 4 stages: format detection, decoding, resampling, and channel mixing.

§Arguments
  • audio_bytes - Raw audio file bytes
§Returns

StandardAudio with mono 16kHz samples and complete conversion metadata.

§Errors

Returns Error::InvalidInput if:

  • Format detection fails (not a recognized audio format)
  • Format is detected but not WAV (only WAV decoding supported)
  • WAV decoding fails (malformed file, unsupported codec)
  • Resampling fails (invalid sample rates)
  • Channel mixing fails (unsupported channel count)
§Performance

Target: <10ms for 3-second audio clip on reference hardware. Actual timing captured in ConversionMetadata.conversion_time_ms.

§Example
use speech_prep::converter::AudioFormatConverter;

let audio_bytes = std::fs::read("recording.wav")?;
let standard = AudioFormatConverter::convert_to_standard(&audio_bytes)?;

assert_eq!(standard.metadata.original_format.as_str(), "wav");
assert!(standard.samples.len() > 0);

Trait Implementations§

Source§

impl Clone for AudioFormatConverter

Source§

fn clone(&self) -> AudioFormatConverter

Returns a duplicate of the value. Read more
1.0.0 · Source§

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source. Read more
Source§

impl Debug for AudioFormatConverter

Source§

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter. Read more
Source§

impl Default for AudioFormatConverter

Source§

fn default() -> AudioFormatConverter

Returns the “default value” for a type. Read more
Source§

impl Copy for AudioFormatConverter

Auto Trait Implementations§

Blanket Implementations§

Source§

impl<T> Any for T
where T: 'static + ?Sized,

Source§

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more
Source§

impl<T> Borrow<T> for T
where T: ?Sized,

Source§

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more
Source§

impl<T> BorrowMut<T> for T
where T: ?Sized,

Source§

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more
Source§

impl<T> CloneToUninit for T
where T: Clone,

Source§

unsafe fn clone_to_uninit(&self, dest: *mut u8)

🔬This is a nightly-only experimental API. (clone_to_uninit)
Performs copy-assignment from self to dest. Read more
Source§

impl<T> From<T> for T

Source§

fn from(t: T) -> T

Returns the argument unchanged.

Source§

impl<T> Instrument for T

Source§

fn instrument(self, span: Span) -> Instrumented<Self>

Instruments this type with the provided Span, returning an Instrumented wrapper. Read more
Source§

fn in_current_span(self) -> Instrumented<Self>

Instruments this type with the current Span, returning an Instrumented wrapper. Read more
Source§

impl<T, U> Into<U> for T
where U: From<T>,

Source§

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

Source§

impl<F, T> IntoSample<T> for F
where T: FromSample<F>,

Source§

fn into_sample(self) -> T

Source§

impl<T> ToOwned for T
where T: Clone,

Source§

type Owned = T

The resulting type after obtaining ownership.
Source§

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning. Read more
Source§

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning. Read more
Source§

impl<T, U> TryFrom<U> for T
where U: Into<T>,

Source§

type Error = Infallible

The type returned in the event of a conversion error.
Source§

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.
Source§

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

Source§

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.
Source§

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.
Source§

impl<T> WithSubscriber for T

Source§

fn with_subscriber<S>(self, subscriber: S) -> WithDispatch<Self>
where S: Into<Dispatch>,

Attaches the provided Subscriber to this type, returning a WithDispatch wrapper. Read more
Source§

fn with_current_subscriber(self) -> WithDispatch<Self>

Attaches the current default Subscriber to this type, returning a WithDispatch wrapper. Read more