EvoEval: Evolving Coding Benchmarks via LLM

EvoEval1 is a holistic benchmark suite created by evolving HumanEval problems:

1 coincidentally similar pronunciation with 😈 EvilEval