Expand description
Python bindings for scirs2-text
This module provides Python bindings for text processing operations, including tokenization, vectorization, sentiment analysis, stemming, string similarity metrics, and text cleaning.
Structs§
- PyCharacter
Tokenizer - Character tokenizer
- PyCount
Vectorizer - Count vectorizer (bag-of-words)
- PyLancaster
Stemmer - Lancaster stemmer
- PyLexicon
Sentiment Analyzer - Lexicon-based sentiment analyzer
- PyNgram
Tokenizer - N-gram tokenizer
- PyPorter
Stemmer - Porter stemmer
- PyRegex
Tokenizer - Regex tokenizer
- PySentence
Tokenizer - Sentence tokenizer
- PySnowball
Stemmer - Snowball stemmer
- PyTfidf
Vectorizer - TF-IDF vectorizer
- PyWhitespace
Tokenizer - Whitespace tokenizer
- PyWord
Tokenizer - Word tokenizer
Functions§
- register_
module - Python module registration