native-whisperx 0.1.1

WhisperX-style transcription workflows composed from moritzbrantner Rust building-block crates.
Documentation

native-whisperx

Reusable workflow library for composing moritzbrantner-* transcription, alignment, diarization, and transcript crates into a WhisperX-style pipeline.

This crate owns workflow configuration and output writing. The reusable audio, text, model-runtime, and speaker primitives remain in rust-packages.

Release Role

native-whisperx is the library crate for Workflow Composition APIs. It is published before native-whisperx-cli. After that manual release order, Cargo users install the CLI package with:

cargo install native-whisperx-cli

The installed terminal command is native-whisperx.

Workflow Surface

The crate composes native transcription, default wav2vec2 alignment, optional speaker diarization, optional segment-level post-ASR translation, and output writing. The user-facing parity target is the Python WhisperX CLI. Delegated features remain delegated where the repository has not yet replaced them with a Rust-Native Parity path.

Default json output is WhisperX JSON. Use native-json through the CLI when you need the Rust transcript contract shape.

Installing the CLI package does not make transcription resources available. Model bundles, cache entries, CUDA, Python WhisperX compatibility resources, and gated Hugging Face assets are resolved by the invoked workflow. Delegated Feature paths remain delegated until separate Rust-Native Parity work replaces them.

Feature Flags

Feature Purpose
native Native Candle Whisper and wav2vec2 alignment composition. Enabled by default.
translation Helsinki-NLP OPUS-MT/Marian post-ASR segment translation. Enabled by native.
cuda CUDA-backed Candle execution for hosts with a local CUDA toolchain.
media-decode Non-WAV media/container decode through the audio I/O crate.
diarization Heuristic speaker diarization composition.
onnx-diarization Explicit ONNX speaker embedding diarization path.
pyannote-diarization Explicit native pyannote community diarization bundle path.
silero-vad Explicit Silero ONNX VAD path.
pyannote-vad Explicit local pyannote ONNX VAD path.
whisperx-compat External Python WhisperX command compatibility and parity checks.

Documentation

The canonical public API documentation target is https://docs.rs/native-whisperx. Repository release and model-resource setup details are in https://github.com/moritzbrantner/native-whisperx.