Expand description
Various transformers for chunking, embedding and transforming data
These transformers are generic over their implementation and many require a swiftide integration to be configured.
Transformers that prompt have a default prompt configured. Prompts can be customized and tailored, supporting Jinja style templating based on tera.
See swiftide_core::prompt::Prompt
and [swiftide_core::template::Template
]
Re-exports§
pub use chunk_markdown::ChunkMarkdown;
pub use chunk_text::ChunkText;
pub use embed::Embed;
pub use metadata_keywords::MetadataKeywords;
pub use metadata_qa_text::MetadataQAText;
pub use metadata_summary::MetadataSummary;
pub use metadata_title::MetadataTitle;
pub use sparse_embed::SparseEmbed;
Modules§
- chunk_
markdown - Chunk markdown content into smaller pieces
- chunk_
text - Chunk text content into smaller pieces
- embed
- Generic embedding transformer
- metadata_
keywords - Extract keywords from a node and add them as metadata
This module defines the
MetadataKeywords
struct and its associated methods, which are used for generating metadata in the form of keywords for a given text. It interacts with a client (e.g.,OpenAI
) to generate the keywords based on the text chunk in aNode
. - metadata_
qa_ text - Generates questions and answers from a given text chunk and adds them as metadata.
This module defines the
MetadataQAText
struct and its associated methods, which are used for generating metadata in the form of questions and answers from a given text. It interacts with a client (e.g.,OpenAI
) to generate these questions and answers based on the text chunk in anNode
. - metadata_
summary - Generate a summary and adds it as metadata
This module defines the
MetadataSummary
struct and its associated methods, which are used for generating metadata in the form of a summary for a given text. It interacts with a client (e.g.,OpenAI
) to generate the summary based on the text chunk in anNode
. - metadata_
title - Generate a title and adds it as metadata
This module defines the
MetadataTitle
struct and its associated methods, which are used for generating metadata in the form of a title for a given text. It interacts with a client (e.g.,OpenAI
) to generate these questions and answers based on the text chunk in anNode
. - sparse_
embed - Generic embedding transformer