Can we ever ensure AI alignment if we can only test AI personas?

By Karl von Wendt @ 2025-03-16T08:06 (+8)

This is a crosspost, probably from LessWrong. Try viewing it there.

null