Module evals

Module evals 

Source

Re-exports§

pub use crate::types::responses::EasyInputMessage;
pub use crate::types::responses::InputTextContent;
pub use crate::types::responses::ReasoningEffort;

Structs§

CreateEvalCompletionsRunDataSource
Completions run data source.
CreateEvalCustomDataSourceConfig
CreateEvalCustomDataSourceConfigArgs
Builder for CreateEvalCustomDataSourceConfig.
CreateEvalJsonlRunDataSource
JSONL run data source.
CreateEvalLabelModelGrader
Label model grader for creating an eval.
CreateEvalLabelModelGraderArgs
Builder for CreateEvalLabelModelGrader.
CreateEvalLogsDataSourceConfig
Logs data source config for creating an eval.
CreateEvalLogsDataSourceConfigArgs
Builder for CreateEvalLogsDataSourceConfig.
CreateEvalRequest
CreateEvalRequestArgs
Builder for CreateEvalRequest.
CreateEvalResponsesRunDataSource
Responses run data source.
CreateEvalRunRequest
Request to create an eval run.
CreateEvalRunRequestArgs
Builder for CreateEvalRunRequest.
DeleteEvalResponse
Response from deleting an eval.
DeleteEvalRunResponse
Response from deleting an eval run.
Eval
An Eval object with a data source config and testing criteria. An Eval represents a task to be done for your LLM integration. Like:
EvalApiError
An object representing an error response from the Eval API.
EvalCustomDataSourceConfig
Custom data source config.
EvalGraderLabelModel
Label model grader.
EvalGraderPython
Python grader. also in openapi spec: GraderPython
EvalGraderScoreModel
Score model grader. also in openapi spec: GraderScoreModel
EvalGraderStringCheck
String check grader.
EvalGraderTextSimilarity
Text similarity grader.
EvalItem
EvalJsonlContentItem
JSONL content item.
EvalJsonlFileContentSource
JSONL file content source.
EvalJsonlFileIdSource
JSONL file ID source.
EvalList
List of evals.
EvalLogsDataSourceConfig
Logs data source config.
EvalResponsesSamplingParams
EvalResponsesSource
A EvalResponsesSource object describing a run data source configuration.
EvalRun
A schema representing an evaluation run.
EvalRunList
List of eval runs.
EvalRunModelUsage
Usage statistics for each model during the evaluation run.
EvalRunOutputItem
A schema representing an evaluation run output item.
EvalRunOutputItemList
List of eval run output items.
EvalRunOutputItemResult
A single grader result for an evaluation run output item.
EvalRunOutputItemSample
A sample containing the input and output of the evaluation run.
EvalRunOutputItemUsage
Token usage details for the sample.
EvalRunResultCounts
Counters summarizing the outcomes of the evaluation run.
EvalRunTestingCriteriaResult
Results per testing criteria applied during the evaluation run.
EvalSamplingParams
Sampling parameters for the model.
EvalStoredCompletionsDataSourceConfig
Stored completions data source config (deprecated).
EvalStoredCompletionsSource
Stored completions source.
InputImage
ItemReference
OutputText
SamplingParams
SimpleInputMessage
SimpleOutputMessage
TemplateInputMessages
UpdateEvalRequest
Request to update an eval.
UpdateEvalRequestArgs
Builder for UpdateEvalRequest.

Enums§

CreateEvalDataSourceConfig
CreateEvalItem
A chat message that makes up the prompt or context.
CreateEvalRunDataSource
Details about the run’s data source.
CreateEvalTestingCriterion
EvalCompletionsSource
Completions source.
EvalDataSourceConfig
Configuration of data sources used in runs of the evaluation.
EvalInputMessages
Input messages for completions.
EvalItemContent
Inputs to the model - can contain template strings.
EvalItemRole
The role of the message input.
EvalJsonlSource
JSONL source.
EvalResponsesRunSource
Responses source.
EvalRunDataSource
Information about the run’s data source.
EvalRunStatus
Status of an evaluation run.
EvalTestingCriterion
A list of testing criteria.
TextSimilarityMetric
Text similarity metric.