“Clean” vs. “messy” goal-directedness (Section 2.2.3 of “Scheming AIs”)

By Joe_Carlsmith @ 2023-11-29T16:32 (+7)

This is a crosspost, probably from LessWrong. Try viewing it there.

null