diff - scaled evaluation vs. binary pass or fail evaluation