Struct Blob 

pub struct Blob {
    pub id: Option<String>,
    pub metadata: HashMap<String, Value>,
    pub data: Option<BlobData>,
    pub mimetype: Option<String>,
    pub encoding: String,
    pub path: Option<PathBuf>,
}

Raw data abstraction for document loading and file processing.

Represents raw bytes or text, held either in memory or by reference to a file. Document loaders use it to decouple data loading from parsing.

Inspired by the Blob interface of the Web API, as documented by Mozilla (MDN).

§Examples

Initialize a blob from in-memory data:

use agent_chain_core::documents::Blob;

let blob = Blob::from_data("Hello, world!");

// Read the blob as a string
assert_eq!(blob.as_string().unwrap(), "Hello, world!");

// Read the blob as bytes
assert_eq!(blob.as_bytes().unwrap(), b"Hello, world!");

Create a blob from in-memory data and specify its MIME type and metadata:

use agent_chain_core::documents::Blob;
use std::collections::HashMap;

let blob = Blob::builder()
    .data("Hello, world!")
    .mime_type("text/plain")
    .metadata(HashMap::from([("source".to_string(), serde_json::json!("https://example.com"))]))
    .build()
    .unwrap();

Fields§

§id: Option<String>

An optional identifier for the blob.

§metadata: HashMap<String, Value>

Arbitrary metadata associated with the blob.

§data: Option<BlobData>

Raw data associated with the blob (bytes or string).

§mimetype: Option<String>

MIME type, not to be confused with a file extension.

§encoding: String

Encoding used when decoding the bytes into a string. Defaults to utf-8.
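
For the default utf-8 case, decoding amounts to validating the bytes. The sketch below uses only the standard library. decode_utf8 is a hypothetical helper and not part of this crate; how Blob handles other encodings is not shown here.

```rust
/// Hypothetical helper: decode bytes as UTF-8, mirroring the default encoding.
fn decode_utf8(bytes: &[u8]) -> Result<String, std::string::FromUtf8Error> {
    String::from_utf8(bytes.to_vec())
}

fn main() {
    // Valid UTF-8 decodes to the original text.
    let text = decode_utf8(b"Hello, world!").unwrap();
    assert_eq!(text, "Hello, world!");

    // 0xFF can never appear in UTF-8, so decoding fails instead of guessing.
    assert!(decode_utf8(&[0xff, 0xfe]).is_err());
}
```

Returning a Result rather than substituting replacement characters keeps invalid input visible to the caller, which matches the fallible signature of as_string.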

§path: Option<PathBuf>

Location where the original content was found.

Implementations§

impl Blob

pub fn builder() -> BlobBuilder

Create a new Blob builder.

pub fn from_data(data: impl Into<String>) -> Self

Create a Blob from in-memory data (string).

pub fn from_bytes(data: Vec<u8>) -> Self

Create a Blob from in-memory bytes.

pub fn from_path(
    path: impl AsRef<Path>,
    mime_type: Option<String>,
    encoding: Option<String>,
    metadata: Option<HashMap<String, Value>>,
) -> Self

Load the blob from a path.

The data is not loaded immediately; the blob treats the path as a reference to the underlying data.

§Arguments
  • path - Path to the file
  • mime_type - Optional MIME type (will be guessed from extension if not provided)
  • encoding - Encoding to use (defaults to “utf-8”)
  • metadata - Optional metadata
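
The deferred loading and MIME guessing described above can be sketched with the standard library alone. LazyBlob, guess_mime, and the extension table are hypothetical stand-ins, not the crate's implementation; the actual guessing rules may differ.

```rust
use std::fs;
use std::io;
use std::path::{Path, PathBuf};

/// Hypothetical stand-in for a path-backed blob; not the crate's Blob.
struct LazyBlob {
    path: PathBuf,
    mimetype: Option<String>,
}

/// Assumed extension table for illustration; the crate's rules may differ.
fn guess_mime(path: &Path) -> Option<String> {
    let ext = path.extension()?.to_str()?.to_ascii_lowercase();
    let mime = match ext.as_str() {
        "txt" => "text/plain",
        "json" => "application/json",
        "html" | "htm" => "text/html",
        _ => return None,
    };
    Some(mime.to_string())
}

impl LazyBlob {
    /// Records the path only; no file I/O happens here.
    fn from_path(path: impl AsRef<Path>, mime_type: Option<String>) -> Self {
        let path = path.as_ref().to_path_buf();
        // An explicit MIME type wins; otherwise fall back to the extension.
        let mimetype = mime_type.or_else(|| guess_mime(&path));
        LazyBlob { path, mimetype }
    }

    /// The file is read only when the bytes are requested.
    fn as_bytes(&self) -> io::Result<Vec<u8>> {
        fs::read(&self.path)
    }
}

fn main() {
    // Constructing the blob succeeds even though the file does not exist.
    let blob = LazyBlob::from_path("no-such-dir/missing.txt", None);
    assert_eq!(blob.mimetype.as_deref(), Some("text/plain"));

    // The missing file only surfaces as an error at read time.
    assert!(blob.as_bytes().is_err());
}
```

Because construction performs no I/O, a loader can create many such references cheaply and pay the read cost only for the blobs it actually parses.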

pub fn source(&self) -> Option<String>

Returns the source location of the blob as a string if known, otherwise None.

If the metadata contains a field called 'source', that value is used. Otherwise, if a path is associated with the Blob, the path location is used.
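
A minimal sketch of this precedence, assuming string-valued metadata for simplicity (the real field stores serde_json Values). resolve_source is a hypothetical free function, not the crate's method.

```rust
use std::collections::HashMap;
use std::path::{Path, PathBuf};

/// Hypothetical resolver: a 'source' metadata entry wins over the path.
fn resolve_source(metadata: &HashMap<String, String>, path: Option<&Path>) -> Option<String> {
    metadata
        .get("source")
        .cloned()
        .or_else(|| path.map(|p| p.to_string_lossy().into_owned()))
}

fn main() {
    let path = PathBuf::from("docs/readme.txt");
    let mut metadata = HashMap::new();

    // No metadata entry: the path is the source.
    assert_eq!(
        resolve_source(&metadata, Some(path.as_path())),
        Some("docs/readme.txt".to_string())
    );

    // An explicit 'source' entry overrides the path.
    metadata.insert("source".to_string(), "https://example.com".to_string());
    assert_eq!(
        resolve_source(&metadata, Some(path.as_path())),
        Some("https://example.com".to_string())
    );

    // Neither is set: the source is unknown.
    assert_eq!(resolve_source(&HashMap::new(), None), None);
}
```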

pub fn as_string(&self) -> Result<String>

Read data as a string.

§Errors

Returns an error if the blob cannot be represented as a string.

pub fn as_bytes(&self) -> Result<Vec<u8>>

Read data as bytes.

§Errors

Returns an error if the blob cannot be represented as bytes.

pub fn as_bytes_io(&self) -> Result<Box<dyn Read>>

Read data as a byte stream (returns a reader).

§Errors

Returns an error if the blob cannot be represented as a byte stream.
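
To illustrate what a boxed reader offers a caller, the sketch below wraps in-memory bytes in std::io::Cursor. bytes_reader is a hypothetical helper; how as_bytes_io builds its reader internally is an assumption not confirmed by this page.

```rust
use std::io::{Cursor, Read};

/// Hypothetical helper: expose in-memory bytes through the Read trait.
fn bytes_reader(data: Vec<u8>) -> Box<dyn Read> {
    Box::new(Cursor::new(data))
}

fn main() -> std::io::Result<()> {
    let mut reader = bytes_reader(b"Hello, world!".to_vec());

    // Consume the stream in small chunks, as a parser might.
    let mut first = [0u8; 5];
    reader.read_exact(&mut first)?;
    assert_eq!(&first, b"Hello");

    // The rest of the stream picks up where the last read stopped.
    let mut rest = String::new();
    reader.read_to_string(&mut rest)?;
    assert_eq!(rest, ", world!");
    Ok(())
}
```

Returning Box<dyn Read> lets in-memory and file-backed data share one signature, at the cost of dynamic dispatch on each read.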

Trait Implementations§

impl Clone for Blob

fn clone(&self) -> Blob

Returns a duplicate of the value.

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source.

impl Debug for Blob

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter.

impl<'de> Deserialize<'de> for Blob

fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>
where __D: Deserializer<'de>,

Deserialize this value from the given Serde deserializer.

impl Display for Blob

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter.

impl PartialEq for Blob

fn eq(&self, other: &Blob) -> bool

Tests for self and other values to be equal, and is used by ==.

fn ne(&self, other: &Rhs) -> bool

Tests for !=. The default implementation is almost always sufficient, and should not be overridden without very good reason.

impl Serialize for Blob

fn serialize<__S>(&self, __serializer: __S) -> Result<__S::Ok, __S::Error>
where __S: Serializer,

Serialize this value into the given Serde serializer.

impl StructuralPartialEq for Blob

Auto Trait Implementations§

impl Freeze for Blob

impl RefUnwindSafe for Blob

impl Send for Blob

impl Sync for Blob

impl Unpin for Blob

impl UnwindSafe for Blob

Blanket Implementations§

impl<T> Any for T
where T: 'static + ?Sized,

fn type_id(&self) -> TypeId

Gets the TypeId of self.

impl<T> Borrow<T> for T
where T: ?Sized,

fn borrow(&self) -> &T

Immutably borrows from an owned value.

impl<T> BorrowMut<T> for T
where T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value.

impl<T> CloneToUninit for T
where T: Clone,

unsafe fn clone_to_uninit(&self, dest: *mut u8)

This is a nightly-only experimental API. (clone_to_uninit)
Performs copy-assignment from self to dest.

impl<T> From<T> for T

fn from(t: T) -> T

Returns the argument unchanged.

impl<T> Instrument for T

fn instrument(self, span: Span) -> Instrumented<Self>

Instruments this type with the provided Span, returning an Instrumented wrapper.

fn in_current_span(self) -> Instrumented<Self>

Instruments this type with the current Span, returning an Instrumented wrapper.

impl<T, U> Into<U> for T
where U: From<T>,

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

impl<T> PolicyExt for T
where T: ?Sized,

fn and<P, B, E>(self, other: P) -> And<T, P>
where T: Policy<B, E>, P: Policy<B, E>,

Create a new Policy that returns Action::Follow only if self and other return Action::Follow.

fn or<P, B, E>(self, other: P) -> Or<T, P>
where T: Policy<B, E>, P: Policy<B, E>,

Create a new Policy that returns Action::Follow if either self or other returns Action::Follow.

impl<T> ToOwned for T
where T: Clone,

type Owned = T

The resulting type after obtaining ownership.

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning.

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning.

impl<T> ToString for T
where T: Display + ?Sized,

fn to_string(&self) -> String

Converts the given value to a String.

impl<T> ToStringFallible for T
where T: Display,

fn try_to_string(&self) -> Result<String, TryReserveError>

ToString::to_string, but without panic on OOM.

impl<T, U> TryFrom<U> for T
where U: Into<T>,

type Error = Infallible

The type returned in the event of a conversion error.

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.

impl<V, T> VZip<V> for T
where V: MultiLane<T>,

fn vzip(self) -> V

impl<T> WithSubscriber for T

fn with_subscriber<S>(self, subscriber: S) -> WithDispatch<Self>
where S: Into<Dispatch>,

Attaches the provided Subscriber to this type, returning a WithDispatch wrapper.

fn with_current_subscriber(self) -> WithDispatch<Self>

Attaches the current default Subscriber to this type, returning a WithDispatch wrapper.

impl<T> DeserializeOwned for T
where T: for<'de> Deserialize<'de>,