## Textract
A Rust library for extracting text from various file types.

Supported file extensions:
* txt
* odf
* ods
* odt
* pptx
* xlsx
* pdf
## Installation and usage
Use Cargo to install textract.
```
// Assumes a PDF file exists at ./tmp.pdf.
let content = textract::extract("tmp.pdf", "pdf").unwrap();
// `content` now holds the raw text of the PDF; use it however you like.
```
`main.rs` contains an example of using the textract library.
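
The call above takes the file type as a second argument. If you would rather derive it from the file name, something like the following might work. This is only a sketch: `supported_extension` and `SUPPORTED` are hypothetical names, not part of textract, and it assumes the second argument is simply the lowercase extension from the list above.

```rust
use std::path::Path;

/// Extensions listed as supported in this README.
const SUPPORTED: [&str; 7] = ["txt", "odf", "ods", "odt", "pptx", "xlsx", "pdf"];

/// Hypothetical helper (not part of textract): returns the lowercase
/// extension of `path` if it appears in the supported list above.
fn supported_extension(path: &str) -> Option<String> {
    // `extension()` yields only the part after the last dot, e.g. "gz" for "a.tar.gz".
    let ext = Path::new(path).extension()?.to_str()?.to_ascii_lowercase();
    if SUPPORTED.contains(&ext.as_str()) {
        Some(ext)
    } else {
        None
    }
}
```

The returned string could then be passed along, for example `textract::extract(path, &ext)`, assuming the argument types match the example above.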
### Command line
The command line is simple: pass the file path followed by the file type.
```
textract tmp.pdf pdf
```
## Roadmap
This library is in beta and currently supports only a few file types. Support for more file types will keep growing, since this project is part of [achoz](https://github.com/kcubeterm/achoz).
* Support for compressed files and tar archives.
* Use libmagic to guess file types.
* Support for all types of document files.
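
To give a rough idea of what guessing file types from content could look like, here is a small sketch that checks the leading bytes of a file. It is not libmagic and not how textract works today; `Guess` and `guess_from_bytes` are hypothetical names. It relies only on two well-known signatures: PDF files begin with `%PDF-`, and ZIP-based formats (pptx, xlsx, odt, ods) begin with `PK\x03\x04`.

```rust
/// Hypothetical result type for this sketch (not part of textract).
#[derive(Debug, PartialEq)]
enum Guess {
    Pdf,
    /// pptx, xlsx, odt and ods are all ZIP archives, so the leading
    /// bytes alone cannot tell them apart.
    ZipContainer,
    Unknown,
}

/// Coarse guess based on the first bytes of a file.
fn guess_from_bytes(head: &[u8]) -> Guess {
    if head.starts_with(b"%PDF-") {
        Guess::Pdf
    } else if head.starts_with(b"PK\x03\x04") {
        Guess::ZipContainer
    } else {
        Guess::Unknown
    }
}
```

A real implementation would need to look inside the ZIP container to distinguish the individual document formats, which is the kind of work libmagic already does.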