panlabel 0.6.0

The universal annotation converter
Documentation
# Contributing to Panlabel

Thanks for your interest in contributing! Whether you're fixing a typo, adding
a new format adapter, improving error messages, or writing tests — every
contribution helps make annotation conversion less painful for everyone.

## Getting started

```sh
git clone https://github.com/strickvl/panlabel.git
cd panlabel
cargo build
cargo test
```

If all tests pass, you're ready to go.

## Where to change what

| What you're changing | Where to look |
|---|---|
| CLI behavior, command args, auto-detection | `src/lib.rs` |
| COCO format adapter | `src/ir/io_coco_json.rs` |
| CVAT format adapter | `src/ir/io_cvat_xml.rs` |
| Label Studio format adapter | `src/ir/io_label_studio_json.rs` |
| TFOD format adapter | `src/ir/io_tfod_csv.rs` |
| YOLO format adapter | `src/ir/io_yolo.rs` |
| Pascal VOC format adapter | `src/ir/io_voc_xml.rs` |
| Lossiness and conversion policy | `src/conversion/mod.rs` |
| Stable conversion issue codes | `src/conversion/report.rs` |
| CLI integration tests | `tests/cli.rs` |
| Format roundtrip tests | `tests/*_roundtrip.rs` |

## Testing

```sh
cargo test                             # unit + integration + proptests
cargo test --test cli                  # CLI integration tests only
cargo test --test proptest_ir_json
cargo test --test proptest_coco
cargo test --test proptest_tfod
cargo test --test proptest_label_studio
cargo test --test proptest_voc
cargo test --test proptest_yolo
cargo test --test proptest_cvat
cargo test --test proptest_cross_format

PROPTEST_CASES=1000 cargo test --test proptest_ir_json   # deeper local exploration
```

## Fuzzing

Requires nightly and `cargo-fuzz`:

> Note: `fuzz/Cargo.toml` enables panlabel's `fuzzing` feature so the YOLO line-parser fuzz entrypoint is available to the fuzz crate.

```sh
cargo +nightly fuzz run coco_json_parse
cargo +nightly fuzz run voc_xml_parse
cargo +nightly fuzz run cvat_xml_parse
cargo +nightly fuzz run tfod_csv_parse
cargo +nightly fuzz run label_studio_json_parse
cargo +nightly fuzz run ir_json_parse
cargo +nightly fuzz run yolo_label_line_parse
```

Seed corpora are tracked under `fuzz/corpus/<target>/`.

## Benchmarks

```sh
cargo bench    # Run Criterion benchmarks (COCO parsing, TFOD writing)
```

## Generating synthetic test data

The included generator creates realistic COCO and TFOD datasets:

```sh
pip install numpy  # or: uv pip install numpy
python scripts/dataset_generator.py --num_images 1000 --annotations_per_image 10 --output_dir ./assets
```

## What kinds of contributions are most useful?

Here are some areas where help is especially welcome:

- **New format adapters** — Label Studio or other common formats
- **Better error messages** — if a panlabel error confused you, that's a bug worth fixing
- **Test coverage** — especially edge cases and roundtrip tests
- **Documentation improvements** — clearer explanations, better examples, typo fixes

For larger changes (new formats, IR schema changes), please
[open an issue](https://github.com/strickvl/panlabel/issues) first so we can
discuss the approach before you invest time writing code.

## Keeping docs in sync

When your change affects user-visible behavior, please update the relevant docs
in the same PR:

| Behavior change | Update |
|---|---|
| CLI flags or commands | `docs/cli.md` and `README.md` examples |
| Format read/write behavior | `docs/formats.md` |
| Task or use-case support | `docs/tasks.md` |
| Lossiness, report schema, or issue codes | `docs/conversion.md` |

Also update `CLAUDE.md` when repository structure or workflow guidance changes,
and `ROADMAP.md` when priorities shift.

## Guidelines

- **Don't claim unsupported capabilities.** Current scope is detection bounding
  boxes only (not segmentation, pose/keypoints, OBB, or classification-only).
- **Prefer small, concrete examples** that match tested behavior.
- **Write tests** for new behavior — if it's not tested, it can silently break.