How do you balance the different dimensions of an evaluation?
Is a new school improvement program a success if it does a better job of teaching mathematics but a worse job of language? Is it a success if it works better for most students but leads to a higher rate of school drop out? What if the drop out rate has increased for the most disadvantaged? And what about the costs of the program? Is it a success if the program gets better results but costs more?