Neel Nanda

A Pragmatic Vision for Interpretability
by Neel Nanda @ 2025-12-03 | +9 | 0 comments

How To Become A Mechanistic Interpretability Researcher
by Neel Nanda @ 2025-09-02 | +31 | 0 comments

Neel Nanda MATS Applications Open (Due Aug 29)
by Neel Nanda @ 2025-07-30 | +20 | 0 comments

Advice for Sending Cold Messages to Busy People at EAG
by Neel Nanda, Jemima @ 2025-06-02 | +122 | 0 comments

Socratic Persuasion: Giving Opinionated Yet Truth-Seeking Advice
by Neel Nanda @ 2025-05-26 | +66 | 0 comments

Highly Opinionated Advice on How to Write ML Papers
by Neel Nanda @ 2025-05-12 | +22 | 0 comments

Interpretability Will Not Reliably Find Deceptive AI
by Neel Nanda @ 2025-05-04 | +74 | 0 comments

My Research Process: Understanding and Cultivating Research Taste
by Neel Nanda @ 2025-05-01 | +9 | 0 comments

My Research Process: Key Mindsets - Truth-Seeking, Prioritisation, Moving Fast
by Neel Nanda @ 2025-04-27 | +36 | 0 comments

How I Think About My Research Process: Explore, Understand, Distill
by Neel Nanda @ 2025-04-26 | +45 | 0 comments

Neel Nanda's Quick takes
by Neel Nanda @ 2025-04-06 | +8 | 0 comments

Good Research Takes are Not Sufficient for Good Strategic Takes
by Neel Nanda @ 2025-03-22 | +121 | 0 comments

MATS Applications + Research Directions I'm Currently Excited About
by Neel Nanda @ 2025-02-06 | +31 | 0 comments

Concrete open problems in mechanistic interpretability: a technical overview
by Neel Nanda @ 2023-07-06 | +27 | 0 comments

Concrete Steps to Get Started in Transformer Mechanistic Interpretability
by Neel Nanda @ 2022-12-26 | +18 | 0 comments

A Barebones Guide to Mechanistic Interpretability Prerequisites
by Neel Nanda @ 2022-11-29 | +54 | 0 comments

An Extremely Opinionated Annotated List of My Favourite Mechanistic...
by Neel Nanda @ 2022-10-18 | +19 | 0 comments

Concrete Advice for Forming Inside Views on AI Safety
by Neel Nanda @ 2022-08-17 | +58 | 0 comments

Things That Make Me Enjoy Giving Career Advice
by Neel Nanda @ 2022-06-17 | +33 | 0 comments

How I Formed My Own Views About AI Safety
by Neel Nanda @ 2022-02-27 | +134 | 0 comments

Simplify EA Pitches to "Holy Shit, X-Risk"
by Neel Nanda @ 2022-02-11 | +189 | 0 comments

My Overview of the AI Alignment Landscape: A Bird’s Eye View
by Neel Nanda @ 2021-12-15 | +45 | 0 comments

Optimisation-focused introduction to EA podcast episode
by Neel Nanda @ 2021-01-15 | +8 | 0 comments

Retrospective on Teaching Rationality Workshops
by Neel Nanda @ 2021-01-03 | +43 | 0 comments

Local Group Event Idea: EA Community Talks
by Neel Nanda @ 2020-12-20 | +26 | 0 comments

Make a Public Commitment to Writing EA Forum Posts
by Neel Nanda @ 2020-11-18 | +21 | 0 comments

Helping each other become more effective
by Neel Nanda @ 2020-10-30 | +10 | 0 comments

What altruism means to me
by Neel Nanda @ 2020-08-15 | +14 | 0 comments

The world is full of wasted motion
by Neel Nanda @ 2020-08-05 | +21 | 0 comments