Varieties of fake alignment (Section 1.1 of “Scheming AIs”)
By Joe_Carlsmith @ 2023-11-21T15:00 (+6)
This is a crosspost, probably from LessWrong. Try viewing it there.
nullBy Joe_Carlsmith @ 2023-11-21T15:00 (+6)
This is a crosspost, probably from LessWrong. Try viewing it there.
null