MBPP EvalPlus Benchmark

MBPP EvalPlus Benchmark | Phaseo

Organisation	Model	Reported	Top Score	Info	Self Reported	Source
Meta	Llama 2 70B Chat	20 Jun 2023	0.88	inferred family alias from llama-3.3-70b-instruct (score=0.3129; benches=9)	Yes	Source

MBPP EvalPlus