Research: AI's ability to complete long and complex software engineering tasks doubles every 6-7 months, but there is a "messiness tax" for real-world tasks (Boaz Barak/Windows On Theory)

Boaz Barak / Windows On Theory: Research: AI's ability to complete long and complex software engineering tasks doubles every 6-7 months, but there is a “messiness tax” for real-world tasks. METR has published a very influential paper by Kwa and West et al. on measuring AI's ability to complete long tasks.
