RustWAV
I was dissatisfied with the hound library - its API was poor, functionality limited, and iterator implementation subpar. Thus, I decided to reinvent the WAV wheel myself.
Language 语言
English | 简体中文
Features
Audio Reader:
- Cross-platform. You may have noticed that this thing has some dependencies about Windows. No, this thing works on Linux or other systems.
- Supports reading WAV audio files over 4GB in size.
- Supports embedded formats including PCM, PCM-aLaw, PCM-muLaw, ADPCM-MS, ADPCM-IMA, ADPCM-YAMAHA, MP3, Opus, Ogg Vorbis etc.
- Resampler support assists in modifying sample rates.
- Downmixer support assists in downmixing multi-channel audio to stereo or mono audio.
- Generates corresponding iterators via generic parameters to retrieve audio frames, with sample formats in each frame strictly converted to specified generic types according to their numerical ranges.
- Supported generic types:
i8
, i16
, i24
, i32
, i64
, u8
, u16
, u24
, u32
, u64
, f32
, f64
- Regardless of original audio storage format, iterators can convert to the above generic formats.
- No conversion occurs when original format matches the specified generic type.
- Reads music metadata information:
- Special handling for system-specific string encodings (e.g., Code Page 936/GB2312 on Windows):
- On Windows builds, calls
GetACP()
to detect code page, retrieves corresponding encoding, and converts to UTF-8 using the encoding crate.
- Supports ID3 metadata.
- Allows creating audio readers using any
Read + Seek
trait implementer as input. In this mode, a temporary file stores the audio's data
section.
- The temporary file will be deleted when the
WaveReader
drops or the executable stops.
- No temporary files created when using file paths to initialize readers.
- Supports reading WAV files with randomly distributed Chunk storage.
- No
panic!
except for explicit parameter errors.
Audio Writer
- Supports writing WAV audio files over 4GB in size.
- Supports embedded formats including PCM, PCM-aLaw, PCM-muLaw, ADPCM-MS, ADPCM-IMA, ADPCM-YAMAHA, MP3, Opus, etc.
write_frame()
function accepts generic parameters, encoding input samples for storage.
- Writes music metadata and can copy all metadata from other audio readers.
- No
panic!
except for explicit parameter errors.
Other Features
Usage Example
use std::{env::args, error::Error, process::ExitCode};
use format_specs::*;
use options::*;
use resampler::Resampler;
pub const FORMATS: [(&str, DataFormat); 16] = [
("pcm", DataFormat::Pcm),
("pcm-alaw", DataFormat::PcmALaw),
("pcm-ulaw", DataFormat::PcmMuLaw),
("adpcm-ms", DataFormat::Adpcm(AdpcmSubFormat::Ms)),
("adpcm-ima", DataFormat::Adpcm(AdpcmSubFormat::Ima)),
("adpcm-yamaha", DataFormat::Adpcm(AdpcmSubFormat::Yamaha)),
(
"mp3",
DataFormat::Mp3(Mp3EncoderOptions {
channels: Mp3Channels::NotSet,
quality: Mp3Quality::Best,
bitrate: Mp3Bitrate::Kbps320,
vbr_mode: Mp3VbrMode::Off,
id3tag: None,
}),
),
(
"opus",
DataFormat::Opus(OpusEncoderOptions {
bitrate: OpusBitrate::Max,
encode_vbr: false,
samples_cache_duration: OpusEncoderSampleDuration::MilliSec60,
}),
),
(
"flac",
DataFormat::Flac(FlacEncoderParams {
verify_decoded: false,
compression: FlacCompression::Level8,
channels: 2,
sample_rate: 44100,
bits_per_sample: 32,
total_samples_estimate: 0,
}),
),
(
"vorbis",
DataFormat::OggVorbis(OggVorbisEncoderParams {
mode: OggVorbisMode::NakedVorbis,
channels: 2,
sample_rate: 44100,
stream_serial: None,
bitrate: Some(OggVorbisBitrateStrategy::Vbr(320_000)),
minimum_page_data_size: None,
}),
),
(
"oggvorbis1",
DataFormat::OggVorbis(OggVorbisEncoderParams {
mode: OggVorbisMode::OriginalStreamCompatible,
channels: 2,
sample_rate: 44100,
stream_serial: None,
bitrate: Some(OggVorbisBitrateStrategy::Vbr(320_000)),
minimum_page_data_size: None,
}),
),
(
"oggvorbis2",
DataFormat::OggVorbis(OggVorbisEncoderParams {
mode: OggVorbisMode::HaveIndependentHeader,
channels: 2,
sample_rate: 44100,
stream_serial: None,
bitrate: Some(OggVorbisBitrateStrategy::Vbr(320_000)),
minimum_page_data_size: None,
}),
),
(
"oggvorbis3",
DataFormat::OggVorbis(OggVorbisEncoderParams {
mode: OggVorbisMode::HaveNoCodebookHeader,
channels: 2,
sample_rate: 44100,
stream_serial: None,
bitrate: Some(OggVorbisBitrateStrategy::Vbr(320_000)),
minimum_page_data_size: None,
}),
),
(
"oggvorbis1p",
DataFormat::OggVorbis(OggVorbisEncoderParams {
mode: OggVorbisMode::OriginalStreamCompatible,
channels: 2,
sample_rate: 44100,
stream_serial: None,
bitrate: Some(OggVorbisBitrateStrategy::Abr(320_000)),
minimum_page_data_size: None,
}),
),
(
"oggvorbis2p",
DataFormat::OggVorbis(OggVorbisEncoderParams {
mode: OggVorbisMode::HaveIndependentHeader,
channels: 2,
sample_rate: 44100,
stream_serial: None,
bitrate: Some(OggVorbisBitrateStrategy::Abr(320_000)),
minimum_page_data_size: None,
}),
),
(
"oggvorbis3p",
DataFormat::OggVorbis(OggVorbisEncoderParams {
mode: OggVorbisMode::HaveNoCodebookHeader,
channels: 2,
sample_rate: 44100,
stream_serial: None,
bitrate: Some(OggVorbisBitrateStrategy::Abr(320_000)),
minimum_page_data_size: None,
}),
),
];
pub fn get_rounded_up_fft_size(sample_rate: u32) -> usize {
for i in 0..31 {
let fft_size = 1usize << i;
if fft_size >= sample_rate as usize {
return fft_size;
}
}
0x1_00000000_usize
}
pub fn transfer_audio_from_decoder_to_encoder(decoder: &mut WaveReader, encoder: &mut WaveWriter) {
let decode_spec = decoder.spec();
let encode_spec = encoder.spec();
let decode_channels = decode_spec.channels;
let encode_channels = encode_spec.channels;
let decode_sample_rate = decode_spec.sample_rate;
let encode_sample_rate = encode_spec.sample_rate;
let fft_size = get_rounded_up_fft_size(std::cmp::max(encode_sample_rate, decode_sample_rate));
let resampler = Resampler::new(fft_size);
assert_eq!(encode_channels, decode_channels);
let process_size = resampler.get_process_size(fft_size, decode_sample_rate, encode_sample_rate);
match encode_channels {
1 => {
let mut iter = decoder.mono_iter::<f32>().unwrap();
loop {
let block: Vec<f32> = iter.by_ref().take(process_size).collect();
if block.is_empty() {
break;
}
let block = audioutils::do_resample_mono(
&resampler,
&block,
decode_sample_rate,
encode_sample_rate,
);
encoder.write_mono_channel(&block).unwrap();
}
}
2 => {
let mut iter = decoder.stereo_iter::<f32>().unwrap();
loop {
let block: Vec<(f32, f32)> = iter.by_ref().take(process_size).collect();
if block.is_empty() {
break;
}
let block = audioutils::do_resample_stereo(
&resampler,
&block,
decode_sample_rate,
encode_sample_rate,
);
encoder.write_stereos(&block).unwrap();
}
}
_ => {
let mut iter = decoder.frame_iter::<f32>().unwrap();
loop {
let block: Vec<Vec<f32>> = iter.by_ref().take(process_size).collect();
if block.is_empty() {
break;
}
let block = audioutils::do_resample_frames(
&resampler,
&block,
decode_sample_rate,
encode_sample_rate,
);
encoder.write_frames(&block).unwrap();
}
}
}
}
pub fn test(arg1: &str, arg2: &str, arg3: &str, arg4: &str) -> Result<(), Box<dyn Error>> {
let mut data_format = DataFormat::Unspecified;
for format in FORMATS {
if arg1 == format.0 {
data_format = format.1;
break;
}
}
if data_format == DataFormat::Unspecified {
return Err(std::io::Error::new(
std::io::ErrorKind::InvalidInput,
format!(
"Unknown format `{arg1}`. Please input one of these:\n{}",
FORMATS
.iter()
.map(|(s, _v)| { s.to_string() })
.collect::<Vec<String>>()
.join(", ")
),
)
.into());
}
println!("======== TEST 1 ========");
println!("{:?}", data_format);
let mut wavereader = WaveReader::open(arg2).unwrap();
let orig_spec = wavereader.spec();
let mut spec = Spec {
channels: orig_spec.channels,
channel_mask: 0,
sample_rate: orig_spec.sample_rate,
bits_per_sample: 16,
sample_format: SampleFormat::Int,
};
match data_format {
DataFormat::Mp3(ref mut options) => match spec.channels {
1 => options.channels = Mp3Channels::Mono,
2 => options.channels = Mp3Channels::JointStereo,
o => panic!("MP3 format can't encode {o} channels audio."),
},
DataFormat::Opus(ref options) => {
spec.sample_rate = options.get_rounded_up_sample_rate(spec.sample_rate);
}
DataFormat::Flac(ref mut options) => {
options.channels = spec.channels;
options.sample_rate = spec.sample_rate;
options.bits_per_sample = spec.bits_per_sample as u32;
}
DataFormat::OggVorbis(ref mut options) => {
options.channels = spec.channels;
options.sample_rate = spec.sample_rate;
}
_ => (),
}
#[allow(unused_imports)]
use options::FileSizeOption::{AllowLargerThan4GB, ForceUse4GBFormat, NeverLargerThan4GB};
let mut wavewriter = WaveWriter::create(arg3, spec, data_format, NeverLargerThan4GB).unwrap();
transfer_audio_from_decoder_to_encoder(&mut wavereader, &mut wavewriter);
wavewriter.inherit_metadata_from_reader(&wavereader, true);
dbg!(&wavereader);
dbg!(&wavewriter);
drop(wavereader);
drop(wavewriter);
println!("======== TEST 2 ========");
let spec2 = Spec {
channels: spec.channels,
channel_mask: 0,
sample_rate: orig_spec.sample_rate,
bits_per_sample: 16,
sample_format: SampleFormat::Int,
};
let mut wavereader_2 = WaveReader::open(arg3).unwrap();
let mut wavewriter_2 = WaveWriter::create(arg4, spec2, DataFormat::Pcm, NeverLargerThan4GB).unwrap();
transfer_audio_from_decoder_to_encoder(&mut wavereader_2, &mut wavewriter_2);
wavewriter_2.inherit_metadata_from_reader(&wavereader_2, true);
dbg!(&wavereader_2);
dbg!(&wavewriter_2);
drop(wavereader_2);
drop(wavewriter_2);
Ok(())
}
#[allow(dead_code)]
pub fn test_wav() -> ExitCode {
let args: Vec<String> = args().collect();
if args.len() < 5 {
return ExitCode::from(1);
}
let input_wav = &args[1];
let output_wav = &args[2];
let reinput_wav = &args[3];
let reoutput_wav = &args[4];
match test(input_wav, output_wav, reinput_wav, reoutput_wav) {
Ok(_) => ExitCode::from(0),
Err(e) => {
eprintln!("{:?}", e);
ExitCode::from(2)
}
}
}