How the AI safety technical landscape has changed in the last year, according to some practitioners

By tlevin @ 2024-07-26T19:06 (+83)

This is a crosspost, probably from LessWrong. Try viewing it there.

null
Chris Leong @ 2024-07-27T02:48 (+7)

I don’t know the exact dates, but: a)proof-based methods seem to be receiving a lot of attention b) def/acc is becoming more of a thing c) more focus on concentration of power risk (tbh, while there are real risks here, I suspect most work here is net-negative)