What Is Mechanistic Interpretability? Neel Nanda Explains: Detailed Analysis & Overview
- Clipped from episode 19 of AXRP; transcript of that episode: ...
- This is a talk I gave to my MATS 9.0 training scholars about the big picture of mech interp - as of Oct 2025, what had changed?
- How can we reverse engineer what a neural network is doing?
- In this IASEAI '25 session, An Introduction to ...
- This is a talk I gave to my MATS scholars, with a stylised history of the field of ...
- Part 1 of a walkthrough of our paper, Progress Measures for Grokking via ...
- Visit our sponsor 80000 hours - grab their free career guide and check out their podcast! Use our ...
- What's happening inside an AI model as it thinks? Why are AI models sycophantic, and why do they hallucinate? Are AI models ...
- How good are we at understanding the internal computation of advanced machine learning models, and do we have a hope at ...
- A surprising fact about modern large language models is that nobody really knows how they work internally. At Anthropic, the ...
- A talk I gave to my MATS 9.0 training program about reasoning model ...
- See part 2 here: Implementing GPT-2 from Scratch; template notebook: ...