Measuring and Mitigating Overoptimization