ragit 0.4.5

git-like rag pipeline
Documentation
rag-ls-terms(1)
==========

NAME
----
rag-ls-terms - Show information about search terms

SYNOPSIS
--------
[verse]
'rag ls-terms' [--term-only | --stat-only] [--json | -j] [<uid>..]

DESCRIPTION
-----------
Run `rag help uid-query` to read how uid-query works. If the uid is of an
image, it dies. Later versions will allow `rag ls-terms` on images.

Ragit uses tfidf algorithm to retrieve texts, and tfidf uses a tokenizer.
This command shows how your documents are tokenized. It shows how many times
each term in a chunk appears. It's also known as TF (term-frequency). If you
provide a uid of a file, it merges TFs of chunks in the file. It's NOT a TF
of merged chunks, but a merged TF of chunks. It's different because chunks have
sliding windows. If a term appears once in a file, but is included in a sliding
window. `rag ls-terms` will tell you that the term appears twice.

If no uid is given, it shows a merged TF of all the chunks.

If `--json` is set, it dumps a valid json to stdout. Its schema is,

- no flags: `{ term: term_count }`
- `--term-only`: `[term: string]`
- `--stat-only`: `{ "terms": integer, "unique terms": integer }`