Module evals

Re-exports

pub use crate::types::responses::EasyInputMessage;
pub use crate::types::responses::InputTextContent;
pub use crate::types::responses::ReasoningEffort;

Structs

CreateEvalCompletionsRunDataSource: Completions run data source.
CreateEvalCustomDataSourceConfig
CreateEvalCustomDataSourceConfigArgs: Builder for CreateEvalCustomDataSourceConfig.
CreateEvalJsonlRunDataSource: JSONL run data source.
CreateEvalLabelModelGrader: Label model grader for creating an eval.
CreateEvalLabelModelGraderArgs: Builder for CreateEvalLabelModelGrader.
CreateEvalLogsDataSourceConfig: Logs data source config for creating an eval.
CreateEvalLogsDataSourceConfigArgs: Builder for CreateEvalLogsDataSourceConfig.
CreateEvalRequest
CreateEvalRequestArgs: Builder for CreateEvalRequest.
CreateEvalResponsesRunDataSource: Responses run data source.
CreateEvalRunRequest: Request to create an eval run.
CreateEvalRunRequestArgs: Builder for CreateEvalRunRequest.
DeleteEvalResponse: Response from deleting an eval.
DeleteEvalRunResponse: Response from deleting an eval run.
Eval: An Eval object with a data source config and testing criteria. An Eval represents a task to be done for your LLM integration.
EvalApiError: An object representing an error response from the Eval API.
EvalCustomDataSourceConfig: Custom data source config.
EvalGraderLabelModel: Label model grader.
EvalGraderPython: Python grader. Also in the OpenAPI spec as GraderPython.
EvalGraderScoreModel: Score model grader. Also in the OpenAPI spec as GraderScoreModel.
EvalGraderStringCheck: String check grader.
EvalGraderTextSimilarity: Text similarity grader.
EvalItem
EvalJsonlContentItem: JSONL content item.
EvalJsonlFileContentSource: JSONL file content source.
EvalJsonlFileIdSource: JSONL file ID source.
EvalList: List of evals.
EvalLogsDataSourceConfig: Logs data source config.
EvalResponsesSamplingParams
EvalResponsesSource: An EvalResponsesSource object describing a run data source configuration.
EvalRun: A schema representing an evaluation run.
EvalRunList: List of eval runs.
EvalRunModelUsage: Usage statistics for each model during the evaluation run.
EvalRunOutputItem: A schema representing an evaluation run output item.
EvalRunOutputItemList: List of eval run output items.
EvalRunOutputItemResult: A single grader result for an evaluation run output item.
EvalRunOutputItemSample: A sample containing the input and output of the evaluation run.
EvalRunOutputItemUsage: Token usage details for the sample.
EvalRunResultCounts: Counters summarizing the outcomes of the evaluation run.
EvalRunTestingCriteriaResult: Results per testing criteria applied during the evaluation run.
EvalSamplingParams: Sampling parameters for the model.
EvalStoredCompletionsDataSourceConfig: Stored completions data source config (deprecated).
EvalStoredCompletionsSource: Stored completions source.
InputImage
ItemReference
OutputText
SamplingParams
SimpleInputMessage
SimpleOutputMessage
TemplateInputMessages
UpdateEvalRequest: Request to update an eval.
UpdateEvalRequestArgs: Builder for UpdateEvalRequest.
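
Several structs above are paired with an `...Args` type described as a builder for them. The sketch below illustrates that general builder convention using stand-in types defined locally; `DemoRequest`, `DemoRequestArgs`, their fields, and the error type are all hypothetical and are not this module's definitions. Consult each struct's own page for its actual fields and builder methods.

```rust
// Stand-in types for illustration only; NOT this module's real definitions.
#[derive(Debug, Clone, PartialEq)]
pub struct DemoRequest {
    /// Optional field: may be left unset on the builder.
    pub name: Option<String>,
    /// Required field: `build` fails if it was never set.
    pub criteria: Vec<String>,
}

/// Hypothetical builder mirroring the `Foo` / `FooArgs` naming seen above.
#[derive(Debug, Default, Clone)]
pub struct DemoRequestArgs {
    name: Option<String>,
    criteria: Option<Vec<String>>,
}

impl DemoRequestArgs {
    /// Sets the optional name and returns the builder for chaining.
    pub fn name(&mut self, name: impl Into<String>) -> &mut Self {
        self.name = Some(name.into());
        self
    }

    /// Sets the required criteria and returns the builder for chaining.
    pub fn criteria(&mut self, criteria: Vec<String>) -> &mut Self {
        self.criteria = Some(criteria);
        self
    }

    /// Validates required fields and produces the finished value.
    pub fn build(&self) -> Result<DemoRequest, String> {
        let criteria = self
            .criteria
            .clone()
            .ok_or_else(|| "`criteria` must be set".to_string())?;
        Ok(DemoRequest {
            name: self.name.clone(),
            criteria,
        })
    }
}

/// Usage: start from `default()`, chain setters, then call `build()`.
pub fn demo() -> Result<DemoRequest, String> {
    DemoRequestArgs::default()
        .name("my-eval")
        .criteria(vec!["string_check".to_string()])
        .build()
}
```

Separating the `Args` type from the finished struct lets required fields be checked once, in `build()`, rather than at every use site.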

Enums

CreateEvalDataSourceConfig
CreateEvalItem: A chat message that makes up the prompt or context.
CreateEvalRunDataSource: Details about the run’s data source.
CreateEvalTestingCriterion
EvalCompletionsSource: Completions source.
EvalDataSourceConfig: Configuration of data sources used in runs of the evaluation.
EvalInputMessages: Input messages for completions.
EvalItemContent: Inputs to the model - can contain template strings.
EvalItemRole: The role of the message input.
EvalJsonlSource: JSONL source.
EvalResponsesRunSource: Responses source.
EvalRunDataSource: Information about the run’s data source.
EvalRunStatus: Status of an evaluation run.
EvalTestingCriterion: A list of testing criteria.
TextSimilarityMetric: Text similarity metric.
