acetylene_parser 0.1.0

A parser for chemical formulas.

acetylene-parser

A string parser for several chemical nomenclature systems.

functions

  • tokenize(string, type="formula") -> Substance

    Tokenizes a string describing a chemical and returns a Substance, optionally with functional groups that correspond to its more fundamental components. The type argument selects the notation the string is written in:

    • "formula" (the default) expects a simple element-symbol formula of the kind taught in secondary school, e.g. C2H2.

    • "smiles" expects a string written in SMILES notation.

    • TODO: "iupac" expects a chemical name that follows IUPAC nomenclature.
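
    To illustrate what the "formula" type could do, here is a minimal sketch in Python. It is not this package's implementation: the Substance fields (source, elements) and the helper name tokenize_formula are assumptions made for the example, and it only handles flat formulas without parentheses, charges, or hydrates.

```python
import re
from dataclasses import dataclass, field


@dataclass
class Substance:
    """Hypothetical stand-in; the package's real Substance is not documented here."""
    source: str
    elements: dict[str, int] = field(default_factory=dict)


# One element symbol (capital letter, optional lowercase letter) plus an optional count.
_TOKEN = re.compile(r"([A-Z][a-z]?)(\d*)")


def tokenize_formula(string: str) -> Substance:
    """Split a flat formula such as 'C2H2' into element counts (assumed behaviour)."""
    substance = Substance(source=string)
    pos = 0
    while pos < len(string):
        match = _TOKEN.match(string, pos)
        if match is None:
            raise ValueError(f"unexpected character {string[pos]!r} at position {pos}")
        symbol, count = match.groups()
        # A missing count means one atom; repeated symbols are summed.
        substance.elements[symbol] = substance.elements.get(symbol, 0) + int(count or 1)
        pos = match.end()
    return substance
```

    With these assumptions, tokenize_formula("C2H2").elements evaluates to {"C": 2, "H": 2}. Grouping atoms into functional groups, as the description above mentions, would need a richer structure than a flat element count.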

roadmap

  • implement SMILES parsing
  • improve struct based on SMILES findings
  • decide whether InChI implementation is worth it

references

SMILES

IUPAC

InChI

data