Crate callm

Source
Expand description

callm allows you to easily run Generative AI models (like Large Language Models) directly on your hardware, offline.

Modulesยง

device
This module provides computation device configuration.
error
This module provides custom Error type.
loaders
This module provides loaders for different model formats.
models
This module provides various model implementations for different architectures.
pipelines
This module provides pipelines.
templates
This module provides template implementations for different templating engines.
utils
This module provides utility functions.