Skip to main content

EvalRecord

mur_common::eval

Struct EvalRecord

pub struct EvalRecord {Show 15 fields
    pub schema_version: u32,
    pub test_suite: EvalSuite,
    pub test_id: String,
    pub attack_category: String,
    pub agent_decision: EvalDecision,
    pub expected: EvalDecision,
    pub passed: bool,
    pub hook_decisions: Vec<EvalHookDecision>,
    pub tokens_input: Option<u64>,
    pub tokens_output: Option<u64>,
    pub wall_clock_ms: u64,
    pub llm_backend: EvalLlmBackend,
    pub llm_model: String,
    pub run_id: String,
    pub timestamp: String,
}

Expand description

One test case’s result — written as a single JSONL line by the Python harness per case, parsed by mur agent eval report to build the markdown summary.

Fields§

§schema_version: u32

Wire-version of this struct; mismatches abort report generation rather than producing silently-wrong aggregates.

§test_suite: EvalSuite§test_id: String

Stable identifier — <suite>:<env>:<id> for AgentDojo, <suite>:<behavior_id> for HarmBench.

§attack_category: String

Free-form upstream tag, e.g. "data_exfil", "prompt_injection", "agentic_misuse". Used to bucket the markdown report by category.

§agent_decision: EvalDecision§expected: EvalDecision§passed: bool§hook_decisions: Vec<EvalHookDecision>

B0 hook chain trace for this test, in order. Empty if the Python harness ran in fast-only mode (no hook capture).

§tokens_input: Option<u64>

LLM token usage. None for the stub backend (no real tokens).

§tokens_output: Option<u64>§wall_clock_ms: u64§llm_backend: EvalLlmBackend§llm_model: String

Free-form model identifier — "claude-sonnet-4-6", "stub", "llama3.2:3b", etc.

§run_id: String

Run-id this record belongs to; aggregator groups by this. Format: ULID so records sort by time.

§timestamp: String

RFC3339 timestamp of when this case finished.

Trait Implementations§

impl Clone for EvalRecord

fn clone(&self) -> EvalRecord

Returns a duplicate of the value. Read more

1.0.0 (const: unstable) · Source§

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source. Read more

impl Debug for EvalRecord

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter. Read more

impl<'de> Deserialize<'de> for EvalRecord

fn deserialize<D>(deserializer: D) -> Result<Self, D::Error>
where __D: Deserializer<'de>,

Deserialize this value from the given Serde deserializer. Read more

impl PartialEq for EvalRecord

fn eq(&self, other: &EvalRecord) -> bool

Tests for self and other values to be equal, and is used by ==.

1.0.0 (const: unstable) · Source§

fn ne(&self, other: &Rhs) -> bool

Tests for !=. The default implementation is almost always sufficient, and should not be overridden without very good reason.

impl Serialize for EvalRecord

fn serialize<S>(&self, serializer: S) -> Result<S::Ok, S::Error>
where S: Serializer,

Serialize this value into the given Serde serializer. Read more

impl Eq for EvalRecord

impl StructuralPartialEq for EvalRecord

Auto Trait Implementations§

impl Freeze for EvalRecord

impl RefUnwindSafe for EvalRecord

impl Send for EvalRecord

impl Sync for EvalRecord

impl Unpin for EvalRecord

impl UnsafeUnpin for EvalRecord

impl UnwindSafe for EvalRecord

Blanket Implementations§

impl<T> Any for T
where T: 'static + ?Sized,

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more

impl<T> AnyEq for T
where T: Any + PartialEq,

fn equals(&self, other: &(dyn Any + 'static)) -> bool

fn as_any(&self) -> &(dyn Any + 'static)

impl<T> Borrow<T> for T
where T: ?Sized,

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more

impl<T> BorrowMut<T> for T
where T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more

impl<T> CloneToUninit for T
where T: Clone,

unsafe fn clone_to_uninit(&self, dest: *mut u8)

🔬This is a nightly-only experimental API. (clone_to_uninit)

Performs copy-assignment from self to dest. Read more

impl<Q, K> Equivalent<K> for Q
where Q: Eq + ?Sized, K: Borrow<Q> + ?Sized,

fn equivalent(&self, key: &K) -> bool

Checks if this value is equivalent to the given key. Read more

impl<Q, K> Equivalent<K> for Q
where Q: Eq + ?Sized, K: Borrow<Q> + ?Sized,

fn equivalent(&self, key: &K) -> bool

Compare self to key and return true if they are equal.

impl<T> From<T> for T

fn from(t: T) -> T

Returns the argument unchanged.

impl<T> Instrument for T

fn instrument(self, span: Span) -> Instrumented<Self>

Instruments this type with the provided Span, returning an Instrumented wrapper. Read more

fn in_current_span(self) -> Instrumented<Self>

Instruments this type with the current Span, returning an Instrumented wrapper. Read more

impl<T, U> Into<U> for T
where U: From<T>,

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

impl<T> Same for T

type Output = T

Should always be Self

impl<T> ToOwned for T
where T: Clone,

type Owned = T

The resulting type after obtaining ownership.

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning. Read more

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning. Read more

impl<T, U> TryFrom<U> for T
where U: Into<T>,

type Error = Infallible

The type returned in the event of a conversion error.

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.

impl<V, T> VZip<V> for T
where V: MultiLane<T>,

fn vzip(self) -> V

impl<T> WithSubscriber for T

fn with_subscriber<S>(self, subscriber: S) -> WithDispatch<Self>
where S: Into<Dispatch>,

Attaches the provided Subscriber to this type, returning a WithDispatch wrapper. Read more

fn with_current_subscriber(self) -> WithDispatch<Self>

Attaches the current default Subscriber to this type, returning a WithDispatch wrapper. Read more

impl<ST, DT> CastableFrom<ST, Initialized, Initialized> for DT
where ST: ?Sized, DT: ?Sized,

impl<ST, DT> CastableFrom<ST, Uninit, Uninit> for DT
where ST: ?Sized, DT: ?Sized,

impl<T> DeserializeOwned for T
where T: for<'de> Deserialize<'de>,

impl<T> Read<Exclusive, BecauseExclusive> for T
where T: ?Sized,