Teun van der Weij

Posts

How to mitigate sandbagging
by Teun van der Weij @ 2025-03-23 | +3 | 0 comments
The Elicitation Game: Evaluating capability elicitation techniques
by Teun van der Weij @ 2025-02-27 | +3 | 0 comments
[Paper] AI Sandbagging: Language Models can Strategically Underperform on...
by Teun van der Weij @ 2024-06-13 | +22 | 0 comments
Beyond Humans: Why All Sentient Beings Matter in Existential Risk
by Teun van der Weij @ 2023-05-31 | +12 | 0 comments
Teun_Van_Der_Weij's Quick takes
by Teun van der Weij @ 2022-06-23 | +1 | 0 comments