DeviceConfig

Struct DeviceConfig 

pub struct DeviceConfig {
    pub cpu_config: CpuConfig,
    pub ram_config: RamConfig,
    pub use_gpu: bool,
    pub cuda_config: Option<CudaConfig>,
    pub error_on_config_issue: bool,
    pub layer_count: Option<u64>,
    pub average_layer_size_bytes: Option<u64>,
    pub local_model_path: String,
}

Configuration for device-specific settings in LLM inference.

Fields§

§cpu_config: CpuConfig

CPU configuration, which controls the thread count used for inference.

§ram_config: RamConfig

RAM configuration for non-GPU inference on Windows and Unix.

This setting is used when GPU acceleration is not available or not enabled.

§use_gpu: bool

Indicates whether to use any available GPUs for inference.

If true, the system will attempt to use GPU acceleration. If false, inference will be performed on CPU only.

§cuda_config: Option<CudaConfig>

CUDA configuration for GPU inference on non-macOS platforms.

This field is only available on platforms other than macOS. If None, default CUDA settings will be used when GPU is enabled.
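The interaction between use_gpu and cuda_config described above can be sketched as follows. This is only an illustration under stated assumptions: CudaConfigSketch and its main_gpu field are hypothetical stand-ins, not the crate's real CudaConfig, and effective_cuda_config is an invented helper that does not exist in the library.

```rust
/// Hypothetical stand-in for the crate's `CudaConfig`; the single field
/// is an assumption made for this example only.
#[derive(Debug, Clone, Default, PartialEq)]
struct CudaConfigSketch {
    main_gpu: Option<u32>,
}

/// Resolves the CUDA settings that would apply for a given configuration.
/// Returns `None` when GPU use is disabled; otherwise returns the supplied
/// settings, or the defaults when no settings were supplied.
fn effective_cuda_config(
    use_gpu: bool,
    cuda_config: Option<CudaConfigSketch>,
) -> Option<CudaConfigSketch> {
    if !use_gpu {
        // CPU-only inference: CUDA settings are ignored entirely.
        return None;
    }
    // GPU enabled: a missing config falls back to `Default::default()`.
    Some(cuda_config.unwrap_or_default())
}
```

With this sketch, effective_cuda_config(true, None) yields the default settings, while effective_cuda_config(false, ...) always yields None regardless of what was supplied.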

§error_on_config_issue: bool

Determines error handling behavior for configuration issues.

If true, the system will return an error when encountering configuration issues. If false (default), issues will be logged and execution will continue if possible.

This flag is useful for debugging purposes.
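The strict versus lenient behavior described above might look like the following minimal sketch. ConfigIssue and resolve_thread_count are hypothetical names invented for this example, and the fallback rule (clamp to the available thread count) is an assumption, not the crate's documented behavior.

```rust
/// Hypothetical error type describing a configuration problem.
#[derive(Debug)]
struct ConfigIssue(String);

/// Validates a requested thread count against what the machine offers.
/// With `error_on_config_issue` set, an invalid request is returned as an
/// error; otherwise it is logged to stderr and a usable fallback is chosen.
fn resolve_thread_count(
    requested: usize,
    available: usize,
    error_on_config_issue: bool,
) -> Result<usize, ConfigIssue> {
    if requested > 0 && requested <= available {
        return Ok(requested);
    }
    let issue = ConfigIssue(format!(
        "requested {requested} threads, but {available} are available"
    ));
    if error_on_config_issue {
        // Strict mode: surface the problem to the caller.
        Err(issue)
    } else {
        // Lenient mode: log and continue with at least one thread.
        eprintln!("config issue (continuing): {}", issue.0);
        Ok(available.max(1))
    }
}
```

In lenient mode a request for 16 threads on an 8-thread machine resolves to 8; in strict mode the same request returns an error, which makes misconfiguration visible while debugging.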

§layer_count: Option<u64>

The number of layers in the model.

This is set at runtime.

§average_layer_size_bytes: Option<u64>

The average size of a layer in bytes.

This is set at runtime.

§local_model_path: String

The file system path to the local model.

This is set at runtime.
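The runtime-set fields layer_count and average_layer_size_bytes are declared as Option<u64>, while the accessor methods of the same names return Result<u64>. One plausible reading, shown here purely as an assumption, is that the accessors turn an unset value into an error. RuntimeFieldsSketch and its String error type are hypothetical; the real crate's error type is not shown on this page.

```rust
/// Hypothetical stand-in holding only the two runtime-set numeric fields.
#[derive(Debug, Default)]
struct RuntimeFieldsSketch {
    layer_count: Option<u64>,
    average_layer_size_bytes: Option<u64>,
}

impl RuntimeFieldsSketch {
    /// Returns the layer count, or an error if it has not been set yet.
    fn layer_count(&self) -> Result<u64, String> {
        self.layer_count
            .ok_or_else(|| "layer_count has not been set".to_string())
    }

    /// Returns the average layer size, or an error if it has not been set yet.
    fn average_layer_size_bytes(&self) -> Result<u64, String> {
        self.average_layer_size_bytes
            .ok_or_else(|| "average_layer_size_bytes has not been set".to_string())
    }
}
```

Under this assumption, calling an accessor before the value has been populated fails loudly instead of silently yielding a placeholder such as 0.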

Implementations§

impl DeviceConfig

pub fn initialize(&mut self) -> Result<()>

pub fn available_memory_bytes(&self) -> Result<u64>

pub fn average_layer_size_bytes(&self) -> Result<u64>

pub fn layer_count(&self) -> Result<u64>

pub fn main_gpu(&self) -> Result<u32>

pub fn gpu_count(&self) -> usize

pub fn allocate_layers_to_gpus(&self, buffer_layer_per_gpu: u64, buffer_layer_main_gpu: u64) -> Result<Vec<GpuDevice>>
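This page does not document how allocate_layers_to_gpus distributes layers, so the following is only a sketch of one plausible strategy, not the crate's algorithm. It assumes each GPU can hold available_bytes / average_layer_size_bytes layers, reserves buffer_layer_per_gpu layers of headroom on every GPU plus buffer_layer_main_gpu extra on the main GPU, and fills GPUs greedily in order. GpuSketch and allocate_layers_sketch are hypothetical names, and the String error type is a placeholder.

```rust
/// Hypothetical stand-in for the crate's `GpuDevice`.
#[derive(Debug, Clone, PartialEq)]
struct GpuSketch {
    ordinal: u32,
    available_bytes: u64,
    allocated_layers: u64,
    is_main: bool,
}

/// Greedily assigns `layer_count` layers to `gpus`, leaving buffer headroom
/// (measured in layers) on each device. Returns an error if the layers do
/// not all fit or if the layer size is zero.
fn allocate_layers_sketch(
    gpus: &mut [GpuSketch],
    layer_count: u64,
    average_layer_size_bytes: u64,
    buffer_layer_per_gpu: u64,
    buffer_layer_main_gpu: u64,
) -> Result<(), String> {
    if average_layer_size_bytes == 0 {
        return Err("average layer size must be non-zero".to_string());
    }
    let mut remaining = layer_count;
    for gpu in gpus.iter_mut() {
        // Headroom for this device; the main GPU reserves extra layers.
        let mut reserved = buffer_layer_per_gpu;
        if gpu.is_main {
            reserved = reserved.saturating_add(buffer_layer_main_gpu);
        }
        // Layers that fit in memory after subtracting the reserved headroom.
        let capacity = (gpu.available_bytes / average_layer_size_bytes)
            .saturating_sub(reserved);
        let take = capacity.min(remaining);
        gpu.allocated_layers = take;
        remaining -= take;
    }
    if remaining > 0 {
        Err(format!("{remaining} layers do not fit on the available GPUs"))
    } else {
        Ok(())
    }
}
```

For example, with two GPUs that each fit 10 layers, a per-GPU buffer of 1 and a main-GPU buffer of 2, the main GPU can take 7 layers and the second GPU 9, so a 12-layer model splits 7 and 5 while a 20-layer model is rejected.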

Trait Implementations§

impl Clone for DeviceConfig

fn clone(&self) -> DeviceConfig

Returns a duplicate of the value.

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source.

impl Debug for DeviceConfig

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter.

impl Default for DeviceConfig

fn default() -> Self

Returns the “default value” for a type.

impl Display for DeviceConfig

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter.

Auto Trait Implementations§

Blanket Implementations§

impl<T> Any for T
where T: 'static + ?Sized,

fn type_id(&self) -> TypeId

Gets the TypeId of self.

impl<T> Borrow<T> for T
where T: ?Sized,

fn borrow(&self) -> &T

Immutably borrows from an owned value.

impl<T> BorrowMut<T> for T
where T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value.

impl<T> CloneToUninit for T
where T: Clone,

unsafe fn clone_to_uninit(&self, dest: *mut u8)

🔬 This is a nightly-only experimental API. (clone_to_uninit)
Performs copy-assignment from self to dest.

impl<T> From<T> for T

fn from(t: T) -> T

Returns the argument unchanged.

impl<T> Instrument for T

fn instrument(self, span: Span) -> Instrumented<Self>

Instruments this type with the provided Span, returning an Instrumented wrapper.

fn in_current_span(self) -> Instrumented<Self>

Instruments this type with the current Span, returning an Instrumented wrapper.

impl<T, U> Into<U> for T
where U: From<T>,

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

impl<T> ToOwned for T
where T: Clone,

type Owned = T

The resulting type after obtaining ownership.

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning.

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning.

impl<T> ToString for T
where T: Display + ?Sized,

fn to_string(&self) -> String

Converts the given value to a String.

impl<T, U> TryFrom<U> for T
where U: Into<T>,

type Error = Infallible

The type returned in the event of a conversion error.

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.

impl<T> WithSubscriber for T

fn with_subscriber<S>(self, subscriber: S) -> WithDispatch<Self>
where S: Into<Dispatch>,

Attaches the provided Subscriber to this type, returning a WithDispatch wrapper.

fn with_current_subscriber(self) -> WithDispatch<Self>

Attaches the current default Subscriber to this type, returning a WithDispatch wrapper.