pub struct Chat<M: CreateChatSession> { /* private fields */ }
Chat is a chat interface that builds on top of crate::ChatModel and crate::StructuredChatModel. It makes it easy to create a chat session with streaming responses and constrained generation.
Let’s start with a simple chat application:
// Before you create a chat session, you need a model. Llama::new_chat will create a good default chat model.
let model = Llama::new_chat().await.unwrap();
// Then you can build a chat session that uses that model
let mut chat = model.chat()
// The builder exposes methods for settings like the system prompt and constraints the bot response must follow
.with_system_prompt("The assistant will act like a pirate");
loop {
// To use the chat session, you need to add messages to it
let mut response_stream = chat(&prompt_input("\n> ").unwrap());
// And then display the response stream to the user
response_stream.to_std_out().await.unwrap();
}

If you run the application, you may notice that it takes more time for the assistant to start responding to long prompts. The LLM needs to read and transform the prompt into a format it understands before it can start generating a response. Kalosm stores that state in the chat session, which can be saved to and loaded from the filesystem so that existing conversations resume faster.
You can save and load chat sessions from the filesystem using the ChatSession::to_bytes and ChatSession::from_bytes methods:
// First, create a model to chat with
let model = Llama::new_chat().await.unwrap();
// Then try to load the chat session from the filesystem
let save_path = std::path::PathBuf::from("./chat.llama");
let mut chat = model.chat();
if let Some(old_session) = std::fs::read(&save_path)
.ok()
.and_then(|bytes| LlamaChatSession::from_bytes(&bytes).ok())
{
chat = chat.with_session(old_session);
}
// Then you can add messages to the chat session as usual
let mut response_stream = chat(&prompt_input("\n> ").unwrap());
// And then display the response stream to the user
response_stream.to_std_out().await.unwrap();
// After you are done, you can save the chat session to the filesystem
let session = chat.session().unwrap();
let bytes = session.to_bytes().unwrap();
std::fs::write(&save_path, bytes).unwrap();

LLMs are powerful because of their generality, but sometimes you need more control over the output. For example, you might want the assistant to start with a certain phrase, or to follow a certain format.
In kalosm, you can use constraints to guide the model's response. Constraints are a way to specify the format of the output. When generating with constraints, the model will always respond in the specified format.
Let’s create a chat application that uses constraints to guide the assistant’s response to always start with “Yes!”:
let model = Llama::new_chat().await.unwrap();
// Create constraints that parse "Yes!" and then stop at the end of the assistant's response
let constraints = LiteralParser::new("Yes!")
.then(model.default_assistant_constraints());
// Create a chat session with the model and the constraints
let mut chat = model.chat();
// Chat with the user
loop {
let mut output_stream = chat(&prompt_input("\n> ").unwrap()).with_constraints(constraints.clone());
output_stream.to_std_out().await.unwrap();
}
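Beyond a hand-written parser, the same with_constraints hook can be driven by a parser derived from a Rust type, which is the structured side of this API that crate::StructuredChatModel hints at. The sketch below is hedged: it assumes the Parse derive macro from kalosm's prelude and the new_parser() constructor it generates, and the Character type with its fields is only a placeholder:

// A response format described with the `Parse` derive (assumed to come from kalosm's prelude)
#[derive(Parse, Clone, Debug)]
struct Character {
    name: String,
    description: String,
}

let model = Llama::new_chat().await.unwrap();
let mut chat = model.chat();
// Constrain the assistant's reply with the parser generated by `#[derive(Parse)]`
let mut output_stream = chat(&prompt_input("\n> ").unwrap())
    .with_constraints(Character::new_parser());
// Stream the constrained text to stdout, just like the literal-parser example above
output_stream.to_std_out().await.unwrap();

Only the raw constrained text is shown here; how the parsed Character value itself is retrieved depends on the response builder API in your kalosm version.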
Implementations§

impl<M: CreateChatSession> Chat<M>

pub fn new(model: M) -> Chat<M>
Create a new chat session with the default settings.
§Example
// Before you create a chat session, you need to create a model. Llama::new_chat will create a good default chat model.
let model = Llama::new_chat().await.unwrap();
// If you don't need to customize the chat session, you can use the `new` method to create a chat session with the default settings
let mut chat = Chat::new(model);

pub fn with_system_prompt(self, system_prompt: impl ToString) -> Self
Adds a system prompt to the chat. The system prompt guides the model to respond in a certain way. If no system prompt is added, a default system prompt is used that instructs the model to respond in a way that is safe and respectful.
§Example
let model = Llama::new_chat().await.unwrap();
let mut chat = model
.chat()
.with_system_prompt("The assistant will act like a pirate.");

pub fn with_session(self, session: M::ChatSession) -> Self
Starts the chat instance with the given model session. This can be useful for resuming a chat session with a long context that has already been processed.
§Example
let model = Llama::new_chat().await.unwrap();
// Load the model session from the filesystem
let session =
LlamaChatSession::from_bytes(std::fs::read("chat.llama").unwrap().as_slice()).unwrap();
// Start the chat session with the cached session
let mut chat = model.chat().with_session(session);

pub fn add_message(&mut self, message: impl IntoChatMessage) -> ChatResponseBuilder<'_, M>
Adds a user message to the chat session and streams the bot response.
§Example
let model = Llama::new_chat().await.unwrap();
let mut chat = model.chat();
let prompt = prompt_input("\n> ").unwrap();
// You can add the user message to the chat session with the `add_message` method
let mut response_stream = chat.add_message(prompt);
// And then stream the result to std out
response_stream.to_std_out().await.unwrap();
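Successive calls to add_message continue the same conversation, because the chat session keeps the accumulated history. A minimal sketch of a two-turn exchange built only from the calls shown above (the prompts are placeholder text):

let model = Llama::new_chat().await.unwrap();
let mut chat = model.chat();
// First turn: the session records the user message and the assistant's reply
chat.add_message("My name is Alice.").to_std_out().await.unwrap();
// Second turn: the model can refer back to the earlier turn stored in the session
chat.add_message("What is my name?").to_std_out().await.unwrap();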
pub fn into_add_message(self, message: impl IntoChatMessage) -> ChatResponseBuilder<'static, M>
Adds a user message to the chat session and streams the bot response while consuming the chat session.
§Example
let model = Llama::new_chat().await.unwrap();
let mut chat = model.chat();
let prompt = prompt_input("\n> ").unwrap();
// You can add the user message and consume the chat session with the `into_add_message` method
let mut response_stream = chat.into_add_message(prompt);
// And then stream the result to std out
response_stream.to_std_out().await.unwrap();
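Because into_add_message consumes the chat and returns a ChatResponseBuilder<'static, M>, the response can be driven from a background task. A minimal sketch, assuming a tokio runtime and that the response builder is Send for your model:

let model = Llama::new_chat().await.unwrap();
let chat = model.chat();
let prompt = prompt_input("\n> ").unwrap();
// The 'static response builder owns the chat state it needs, so it can move into the spawned task
let response_stream = chat.into_add_message(prompt);
let handle = tokio::spawn(async move {
    // Stream the response to stdout from the background task
    response_stream.to_std_out().await.unwrap();
});
handle.await.unwrap();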
pub fn session(&self) -> Result<impl Deref<Target = M::ChatSession> + use<'_, M>, &M::Error>
Get a reference to the chat session or an error if the session failed to load.
You can use the session to save the chat for later:
let model = Llama::new_chat().await.unwrap();
let mut chat = model.chat();
let session = chat.session().unwrap();
let bytes = session.to_bytes().unwrap();
std::fs::write("./chat.llama", bytes).unwrap();Or get the chat history:
let model = Llama::new_chat().await.unwrap();
let mut chat = model.chat();
// Add a message to the chat history
chat("Hello, world!").to_std_out().await.unwrap();
// Get the chat session
let session = chat.session().unwrap();
// Get the chat history
let history = session.history();
println!("{:?}", history);