Module task_definition

Expand description

Task definition types for benchmark harness.

Defines the JSON schema for benchmark tasks, including setup, verification, and metrics collection. Tasks run in isolated workspaces with their own .git directories to avoid polluting the main repository.

§Example

use ralph_core::task_definition::{TaskDefinition, TaskSuite, Verification};

let task = TaskDefinition::builder("hello-world", "tasks/hello-world/PROMPT.md", "TASK_COMPLETE")
    .verification_command("python hello.py | grep -q 'Hello, World!'")
    .max_iterations(5)
    .expected_iterations(1)
    .complexity("simple")
    .build();

assert_eq!(task.name, "hello-world");
assert!(task.verification.command.contains("Hello, World!"));

Structs§

SuiteMetadata: Suite-level metadata.
TaskDefinition: A single benchmark task definition.
TaskDefinitionBuilder: Builder for constructing task definitions.
TaskSetup: Setup configuration for task workspace.
TaskSuite: A suite of benchmark tasks loaded from a JSON file.
Verification: Verification configuration for a task.

Enums§

TaskDefinitionError: Errors that can occur when working with task definitions.

Module task_definition

Module task_definition Copy item path

§Example

Structs§

Enums§

Module task_definition