voxudio 0.1.0

A real-time audio processing library with ONNX runtime support

Coverage
56.67%
17 out of 30 items documented7 out of 7 items with examples
Size
Source code size: 1.64 MB This is the summed size of all the files inside the crates.io package for this release.
Documentation size: 4.07 MB This is the summed size of all files generated by rustdoc for all configured targets
Ø build duration
this release: 1m 3s Average build duration of successful builds.
all releases: 40s Average build duration of successful builds in releases after 2024-10-23.
Links
mzdk100/voxudio
10 1 0
crates.io
Dependencies
Versions
Owners

Voxudio

English Version

Voxudio 是一个用 Rust 编写的高性能音频处理库，专注于语音处理和音色转换功能。它提供了一套完整的工具，用于音频采集、播放、语音活动检测、说话人特征提取和音色转换。

特性

🎤 音频设备管理
- 音频采集：支持从各种音频输入设备采集数据
- 音频播放：支持实时音频播放
🔍 语音活动检测 (VAD)
- 实时检测音频中的语音活动
- 精确区分语音和非语音部分
👤 说话人特征提取 (SEE)
- 提取 256 维说话人特征向量
- 支持说话人识别和验证
🎭 音色转换 (TCC)
- 实时音色转换
- 保持原始语音内容和情感
📱 跨平台支持
- 支持 Windows、Linux、macOS
- Android 平台支持
- iOS 平台尚未全面测试

安装

将以下依赖添加到你的 Cargo.toml 文件中：

cargo add voxudio

使用示例

音频采集
音频播放
语音活动检测
音色转换
Android 使用示例运行 Android 示例：
1. 确保已安装 Android SDK 和 NDK
2. 进入 examples/android 目录
3. 在 Windows 上运行：
```
run.bat

``

在 Linux/macOS 上运行：

```bash

./run.sh
```

性能优化

使用 ONNX Runtime 进行高效的模型推理
基于 Tokio 的异步处理
优化的音频数据处理流程

许可证

本项目采用 Apache-2.0 许可证。详见 LICENSE 文件。

贡献

欢迎提交 Issue 和 Pull Request！

Fork 本项目
创建你的特性分支 (git checkout -b feature/amazing-feature)
提交你的修改 (git commit -m 'Add some amazing feature')
推送到分支 (git push origin feature/amazing-feature)
开启一个 Pull Request

联系我们

如果你有任何问题或建议，欢迎提交 Issue 或通过以下方式联系我们：

Issue Tracker: github.com/mzdk100/voxudio/issues
Email: mzdk100@foxmail.com