Crate nom_exif

source ·
Expand description

nom-exif is an Exif/metadata parsing library written in pure Rust with nom.

Both image (jpeg/heif/heic/jpg/tiff etc.) and video/audio (mov/mp4/3gp/webm/mkv/mka, etc.) files are supported.

In addition to the high performance brought by Rust and nom, the library is also specifically designed and optimized for batch file processing scenarios. At the same time, the API design should be as simple and easy to use as possible.

§Key Features

  • Ergonomic Design

    • Unified multimedia file processing process.

      No need to check the file extensions! nom-exif can automatically detect supported file formats and parse them correctly.

      The API has been carefully designed so that various types of multimedia files can be easily processed using the same processing process.

      Compared with the way the user judges the file name and then decides which parsing function to call (such as parse_jpg, parse_mp4, etc.), this work flow is simpler, more reliable, and more versatile (can be applied to non-file scenarios, such as TcpStream).

      The usage is demonstrated in the following examples. [examples/rexiftool] is also a good example.

    • Two style APIs for Exif: iterator style (ExifIter) and get style (Exif). The former is parse-on-demand, and therefore, more detailed error information can be captured; the latter is simpler and easier to use.

  • Performance

    • Zero-copy when appropriate: Use borrowing and slicing instead of copying whenever possible.

    • Minimize I/O operations: When metadata is stored at the end/middle of a large file (such as a QuickTime file does), Seek rather than Read to quickly locate the location of the metadata (if the reader supports Seek).

    • Pay as you go: When working with ExifIter, all entries are lazy-parsed. That is, only when you iterate over ExifIter will the IFD entries be parsed one by one.

    • Share I/O and parsing buffer between multiple parse calls: This can improve performance and avoid the overhead and memory fragmentation caused by frequent memory allocation. This feature is very useful when you need to perform batch parsing.

  • Robustness and stability: Through long-term Fuzz testing, and tons of crash issues discovered during testing have been fixed. Thanks to @sigaloid for pointing this out!

  • Supports both sync and async interfaces.

§Supported File Types

  • Image
    • *.heic, *.heif, etc.
    • *.jpg, *.jpeg
    • *.tiff, *.tif
  • Video/Audio
    • ISO base media file format (ISOBMFF): *.mp4, *.mov, *.3gp, etc.
    • Matroska based file format: *.webm, *.mkv, *.mka, etc.

§Unified multimedia file processing process

By using MediaSource + MediaParser, we can easily unify the parsing process of multiple different types of multimedia files.

use nom_exif::*;

fn main() -> Result<()> {
    let mut parser = MediaParser::new();

    let files = [
        "./testdata/exif.heic",
        "./testdata/exif.jpg",
        "./testdata/tif.tif",
        "./testdata/meta.mov",
        "./testdata/meta.mp4",
        "./testdata/webm_480.webm",
        "./testdata/mkv_640x360.mkv",
        "./testdata/mka.mka",
        "./testdata/3gp_640x360.3gp"
    ];

    for f in files {
        let ms = MediaSource::file_path("./testdata/exif.heic")?;

        if ms.has_exif() {
            // Parse the file as an Exif-compatible file
            let mut iter: ExifIter = parser.parse(ms)?;
            // ...
        } else if ms.has_track() {
            // Parse the file as a track
            let info: TrackInfo = parser.parse(ms)?;
            // ...
        }
    }

    Ok(())
}

§MediaSource + MediaParser, AsyncMediaSource + AsyncMediaParser

  • MediaSource is an abstraction of multimedia data sources, which can be created from any object that implements the Read trait, and can be parsed by MediaParser.

    See MediaSource & MediaParser for more information.

  • Likewise, AsyncMediaParser is an abstraction for asynchronous multimedia data sources, which can be created from any object that implements the AsyncRead trait, and can be parsed by AsyncMediaParser.

    See [AsyncMediaSource] & [AsyncMediaParser] for more information.

use nom_exif::*;

fn main() -> Result<()> {
    let mut parser = MediaParser::new();
     
    let ms = MediaSource::file_path("./testdata/exif.heic")?;
    assert!(ms.has_exif());
     
    let mut iter: ExifIter = parser.parse(ms)?;
    let exif: Exif = iter.into();
    assert_eq!(exif.get(ExifTag::Make).unwrap().as_str().unwrap(), "Apple");

    let ms = MediaSource::file_path("./testdata/meta.mov")?;
    assert!(ms.has_track());
     
    let info: TrackInfo = parser.parse(ms)?;
    assert_eq!(info.get(TrackInfoTag::Make), Some(&"Apple".into()));
    assert_eq!(info.get(TrackInfoTag::Model), Some(&"iPhone X".into()));
    assert_eq!(info.get(TrackInfoTag::GpsIso6709), Some(&"+27.1281+100.2508+000.000/".into()));
    assert_eq!(info.get_gps_info().unwrap().latitude_ref, 'N');
    assert_eq!(
        info.get_gps_info().unwrap().latitude,
        [(27, 1), (7, 1), (68, 100)].into(),
    );

    // `MediaSource` can also be created from a `TcpStream`:
    // let ms = MediaSource::tcp_stream(stream)?;

    // Or from any `Read + Seek`:
    // let ms = MediaSource::seekable(stream)?;
     
    // From any `Read`:
    // let ms = MediaSource::unseekable(stream)?;
     
    Ok(())
}

§Async API Usage

Enable async feature flag for nom-exif in your Cargo.toml:

[dependencies]
nom-exif = { version = "1", features = ["async"] }

For detailed usage, please refer to: [AsyncMediaParser::parse].

§GPS Info

ExifIter provides a convenience method for parsing gps information. (Exif also provides a get_gps_info mthod).

use nom_exif::*;

fn main() -> Result<()> {
    let mut parser = MediaParser::new();
     
    let ms = MediaSource::file_path("./testdata/exif.heic")?;
    let iter: ExifIter = parser.parse(ms)?;

    let gps_info = iter.parse_gps_info()?.unwrap();
    assert_eq!(gps_info.format_iso6709(), "+43.29013+084.22713+1595.950CRSWGS_84/");
    assert_eq!(gps_info.latitude_ref, 'N');
    assert_eq!(gps_info.longitude_ref, 'E');
    assert_eq!(
        gps_info.latitude,
        [(43, 1), (17, 1), (2446, 100)].into(),
    );
    Ok(())
}

§Video

Please refer to: MediaParser / [AsyncMediaParser].

For more usage details, please refer to the API documentation.

Structs§

Enums§

  • Represent a parsed entry value.
  • Defines recognized Exif tags. All tags can be parsed, no matter if it is defined here. This enum definition is just for ease of use.
  • FileFormatDeprecated
    Deprecated: Please use [MediaType] instead.
  • Try to keep the tag name consistent with crate::ExifTag, and add some unique to video/audio, such as DurationMs.

Functions§

Type Aliases§