Summary of METR's predeployment evaluation of GPT-5.6 Sol

(metr.org)

10 points | by pongogogo | 2 days ago

6 comments