aud2txt 0.5.0

Audio to text tool using ggerganov's whisper.cpp

Coverage
0%
0 out of 2 items documented0 out of 1 items with examples
Size
Source code size: 73.72 kB This is the summed size of all the files inside the crates.io package for this release.
Documentation size: 322.16 kB This is the summed size of all files generated by rustdoc for all configured targets
Ø build duration
this release: 58s Average build duration of successful builds.
all releases: 58s Average build duration of successful builds in releases after 2024-10-23.
Links
xandkar/aud2txt
0 0 0
crates.io
Dependencies
Versions
- 0.5.0 (2025-05-10)
Owners

aud2txt

Audio to text tool, using ggerganov's whisper.cpp via whisper-rs and FFmpeg.

install

install FFmpeg (via your package manager or directly)
ensure ffmpeg command is available
cargo install aud2txt

usage

TL;DR

aud2txt <INPUT_FILE>

where <INPUT_FILE> is any media file readable by ffmpeg.

Also see the demo script.

options

Usage: aud2txt [OPTIONS] <INPUT_FILE>

Arguments:
  <INPUT_FILE>  Input audio file

Options:
  -l, --log <LOG_LEVEL>            [default: error]
  -m, --model-file <MODEL_FILE>
  -N, --no-normalize               Disable audio normalization before conversion to text
  -o, --output-file <OUTPUT_FILE>  Output text file
  -h, --help                       Print help

If --model-file argument is omitted, aud2txt will try to download and use the default model from: https://huggingface.co/ggerganov/whisper.cpp/resolve/main/ggml-base.en.bin

If --no-normalize flag is passed, the normalization step will be skiped, removing the runtime dependency on ffmpeg.