upcarta
  • Sign In
  • Sign Up
  • Explore
  • Search

Neel Nanda (at ICLR)

neelnanda.io
1 Follower
community-curated profile

Mechanistic Interpretability research @DeepMind. Formerly @AnthropicAI, independent In this to reduce AI X-risk. Neural networks can be understood, let's do it!

Overview Posts Content Recommendations
Most Recommended Recent
  • Paper
Paper May 2, 2023
Progress measures for grokking via mechanistic interpretability
by Neel Nanda (at ICLR)
Post Add to Collection Mark as Completed
Recommended by 1 person
1 mention
Paper
Finding Neurons in a Haystack: Case Studies with Sparse Probing
by Wes Gurnee and Neel Nanda (at ICLR)
Post Add to Collection Mark as Completed
Recommended by 1 person
1 mention
Paper Feb 6, 2023
A Toy Model of Universality: Reverse Engineering How Networks Learn Group Operations
by Bilal Chughtai and Neel Nanda (at ICLR)
Post Add to Collection Mark as Completed
  • upcarta ©2025
  • Home
  • About
  • Terms
  • Privacy
  • Cookies
  • @upcarta