Varieties of fake alignment (Section 1.1 of “Scheming AIs”)

By Joe_Carlsmith @ 2023-11-21T15:00 (+6)

This is a crosspost, probably from LessWrong. Try viewing it there.

null