Tweet by OpenAI on Twitter. AI to study AI

b4F1eVpJ_normal.jpg spacer.png
OpenAI
⁦‪@OpenAI‬⁩
logo_twitter-1497383721365.png
spacer_464x1-1582829598167.png
We applied GPT-4 to interpretability — automatically proposing explanations for GPT-2’s 300k neurons — and found neurons responding to concepts like similes, “things done correctly,” or expressions of certainty. We aim to use Al to help us understand Al: openai.com/research/langu… pic.twitter.com/knCUxnL5CY
5/9/23, 1:05 PM

Joseph Thornton

Leave a comment