pub fn evaluate(config: &Config, seeds: &[usize]) -> Result<Vec<EvaluationRecord>>
Executes the submission code and the local tester for each seed, and collects the score and the execution time of each run.
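The per-seed loop described above can be sketched roughly as follows. This is an illustrative sketch under stated assumptions, not the crate's implementation: `SeedRecord` and `evaluate_with` are hypothetical names, the closure stands in for launching the submission and the local tester, and stopping at the first failing seed is an assumption about error handling that the documentation does not confirm.

```rust
use std::time::{Duration, Instant};

/// Hypothetical stand-in for the crate's `EvaluationRecord`;
/// the real type's fields are not documented here.
#[derive(Debug, Clone, PartialEq)]
pub struct SeedRecord {
    pub seed: usize,
    pub score: u64,
    pub elapsed: Duration,
}

/// Runs `run_case` once per seed and measures the wall-clock time of each call.
/// `run_case` is a placeholder for "execute the submission and the tester".
/// Assumption: the first error aborts the whole evaluation.
pub fn evaluate_with<F, E>(seeds: &[usize], mut run_case: F) -> Result<Vec<SeedRecord>, E>
where
    F: FnMut(usize) -> Result<u64, E>,
{
    let mut records = Vec::with_capacity(seeds.len());
    for &seed in seeds {
        let start = Instant::now();
        // Propagate the error of a failing seed to the caller.
        let score = run_case(seed)?;
        records.push(SeedRecord {
            seed,
            score,
            elapsed: start.elapsed(),
        });
    }
    Ok(records)
}
```

A caller would pass the seeds together with a closure that returns the score for one seed, then read `score` and `elapsed` from each returned record. Measuring with `Instant` captures wall-clock time around the whole call, which may differ from how the real function measures execution time.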