Transformer on MSN | Opinion

GPT-5.6 cheats so much METR couldn't measure it

OpenAI’s new model broke rules and exploited loopholes more than any model METR has tested to date ...