Module text

Text processing transformers for feature extraction

This module provides utilities for converting text data into numerical features suitable for machine learning algorithms.
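To make "converting text data into numerical features" concrete, here is a minimal bag-of-words sketch that uses only the Rust standard library. It does not use this module's API: `tokenize`, `build_vocabulary`, and `count_vectors` are hypothetical helper names chosen for illustration, and the tokenization rule (lowercase, split on non-alphanumeric characters) is an assumption rather than the crate's documented behavior.

```rust
use std::collections::BTreeMap;

/// Lowercase the text and split it on any non-alphanumeric character.
/// (Illustrative tokenizer; the crate's own rules may differ.)
fn tokenize(text: &str) -> Vec<String> {
    text.to_lowercase()
        .split(|c: char| !c.is_alphanumeric())
        .filter(|t| !t.is_empty())
        .map(str::to_string)
        .collect()
}

/// Map every distinct token in the corpus to a column index.
/// A BTreeMap keeps tokens sorted, so indices follow alphabetical order.
fn build_vocabulary(docs: &[&str]) -> BTreeMap<String, usize> {
    let mut vocab: BTreeMap<String, usize> = BTreeMap::new();
    for doc in docs {
        for tok in tokenize(doc) {
            vocab.entry(tok).or_insert(0);
        }
    }
    // Assign column indices in sorted key order.
    for (i, idx) in vocab.values_mut().enumerate() {
        *idx = i;
    }
    vocab
}

/// Turn each document into a row of term counts, one column per vocabulary entry.
/// Tokens that are not in the vocabulary are ignored.
fn count_vectors(docs: &[&str], vocab: &BTreeMap<String, usize>) -> Vec<Vec<u32>> {
    docs.iter()
        .map(|doc| {
            let mut row = vec![0u32; vocab.len()];
            for tok in tokenize(doc) {
                if let Some(&j) = vocab.get(&tok) {
                    row[j] += 1;
                }
            }
            row
        })
        .collect()
}
```

For the two documents `"the cat sat"` and `"the cat saw the dog"`, the vocabulary is `cat, dog, sat, saw, the` and the resulting rows are `[1, 0, 1, 0, 1]` and `[1, 1, 0, 1, 2]`.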

Structs

CountVectorizer
Count vectorizer for converting text documents to term frequency vectors
HashingVectorizer
Hashing vectorizer for memory-efficient text vectorization
StreamingCountVectorizer
Streaming count vectorizer that can learn vocabulary incrementally
TfidfVectorizer
TF-IDF vectorizer for converting text to TF-IDF features
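
The ideas behind the hashing and TF-IDF vectorizers can be sketched in plain Rust as well. This is an illustration of the general techniques, not the implementation of the structs listed above: `fnv1a`, `hashing_vector`, `smoothed_idf`, and `tfidf_row` are hypothetical names, FNV-1a is an assumed choice of hash function, and the smoothed IDF formula shown is the one scikit-learn uses by default, which this crate may or may not follow.

```rust
/// 64-bit FNV-1a hash (assumed here only because it is short and deterministic).
fn fnv1a(bytes: &[u8]) -> u64 {
    let mut h: u64 = 0xcbf29ce484222325; // FNV offset basis
    for &b in bytes {
        h ^= b as u64;
        h = h.wrapping_mul(0x100000001b3); // FNV prime
    }
    h
}

/// Hashing trick: map each token straight to a column via `hash % n_features`.
/// No vocabulary is stored, so memory stays fixed, but distinct tokens may collide.
fn hashing_vector(text: &str, n_features: usize) -> Vec<u32> {
    assert!(n_features > 0, "n_features must be positive");
    let mut row = vec![0u32; n_features];
    for tok in text.split_whitespace() {
        let h = fnv1a(tok.to_lowercase().as_bytes());
        let j = (h % n_features as u64) as usize;
        row[j] += 1;
    }
    row
}

/// Smoothed inverse document frequency: ln((1 + n) / (1 + df)) + 1.
fn smoothed_idf(n_docs: usize, doc_freq: usize) -> f64 {
    ((1.0 + n_docs as f64) / (1.0 + doc_freq as f64)).ln() + 1.0
}

/// Weight a row of raw term counts by per-term IDF values (no normalization).
fn tfidf_row(counts: &[u32], idf: &[f64]) -> Vec<f64> {
    counts.iter().zip(idf).map(|(&c, &w)| c as f64 * w).collect()
}
```

The trade-off is visible in the signatures: `hashing_vector` needs only a fixed `n_features`, while the TF-IDF weights depend on document frequencies gathered from the whole corpus. A term that appears in every document gets the minimum weight of 1.0 under this formula, and rarer terms get larger weights.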