“Clean” vs. “messy” goal-directedness (Section 2.2.3 of “Scheming AIs”)
By Joe_Carlsmith @ 2023-11-29T16:32 (+7)
This is a crosspost, probably from LessWrong. Try viewing it there.