pub struct OutputSafetyConfig {
pub enabled: bool,
pub toxicity_enabled: bool,
pub toxicity_threshold: f32,
pub block_on_critical: bool,
pub hallucination_enabled: bool,
pub hallucination_model: String,
pub hallucination_threshold: f32,
pub hallucination_min_response_length: usize,
}
Output safety configuration for response content analysis.
When enabled, the proxy analyses LLM response content for toxicity, PII leakage, secret exposure, and hallucinations. This is a post-processing step that runs after the upstream response is received.
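A minimal sketch of constructing this configuration in Rust, with values mirroring the YAML example below (the struct is mirrored here so the snippet compiles standalone; the value choices are illustrative, not defaults):

```rust
// Mirrored from the struct definition above so this snippet is self-contained.
pub struct OutputSafetyConfig {
    pub enabled: bool,
    pub toxicity_enabled: bool,
    pub toxicity_threshold: f32,
    pub block_on_critical: bool,
    pub hallucination_enabled: bool,
    pub hallucination_model: String,
    pub hallucination_threshold: f32,
    pub hallucination_min_response_length: usize,
}

fn main() {
    // Toxicity analysis on, hallucination detection off, matching the YAML example.
    let cfg = OutputSafetyConfig {
        enabled: true,
        toxicity_enabled: true,
        toxicity_threshold: 0.7,
        block_on_critical: false,
        hallucination_enabled: false,
        hallucination_model: "vectara/hallucination_evaluation_model".to_string(),
        hallucination_threshold: 0.5,
        hallucination_min_response_length: 50,
    };
    assert!(cfg.enabled && cfg.toxicity_enabled);
    println!("toxicity threshold: {}", cfg.toxicity_threshold);
}
```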
Example (YAML)
output_safety:
  enabled: true
  toxicity_enabled: true
  toxicity_threshold: 0.7
  block_on_critical: false
  hallucination_enabled: false
  hallucination_model: "vectara/hallucination_evaluation_model"
  hallucination_threshold: 0.5
  hallucination_min_response_length: 50

Fields
enabled: bool
Enable output safety analysis on LLM responses.

toxicity_enabled: bool
Enable toxicity detection on response content.

toxicity_threshold: f32
Confidence threshold for toxicity detection (0.0–1.0).

block_on_critical: bool
Block (replace) the response if critical toxicity is detected.

hallucination_enabled: bool
Enable hallucination detection on response content. When enabled, response sentences are scored against the user's prompt for factual consistency using a cross-encoder model.

hallucination_model: String
HuggingFace model ID for hallucination detection.

hallucination_threshold: f32
Threshold below which a sentence is considered potentially hallucinated (0.0–1.0). Sentences scoring below this are flagged.

hallucination_min_response_length: usize
Minimum response length (in characters) to run hallucination detection. Responses shorter than this are skipped to save compute.
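The gating described above can be sketched as a small helper. This is a hypothetical illustration, not part of the crate's API; the struct subset mirrors the fields documented here, and `should_check_hallucination` is an invented name:

```rust
// Minimal subset of the config, mirrored for illustration only.
struct OutputSafetyConfig {
    enabled: bool,
    hallucination_enabled: bool,
    hallucination_min_response_length: usize,
}

// Hypothetical gate: run the (expensive) hallucination model only when the
// feature is on and the response is long enough to be worth scoring.
fn should_check_hallucination(cfg: &OutputSafetyConfig, response: &str) -> bool {
    cfg.enabled
        && cfg.hallucination_enabled
        && response.chars().count() >= cfg.hallucination_min_response_length
}

fn main() {
    let cfg = OutputSafetyConfig {
        enabled: true,
        hallucination_enabled: true,
        hallucination_min_response_length: 50,
    };
    // An 11-character response is below the 50-character floor, so it is skipped.
    assert!(!should_check_hallucination(&cfg, "short reply"));
    println!("short responses are skipped");
}
```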
Trait Implementations
impl Clone for OutputSafetyConfig

fn clone(&self) -> OutputSafetyConfig
Returns a copy of the value.

fn clone_from(&mut self, source: &Self)
Performs copy-assignment from source.