Post by Ethan Mollick on X. Got benchmarks reasoning

3AAOlqrX_normal.jpg spacer.png
Ethan Mollick
⁦‪@emollick‬⁩
logo_twitter-1497383721365.png
spacer_464x1-1582829598167.png
One contentious debate in AI research is about the ability of LLMs to reason like a human does. While there is still a lot of uncertainty, this paper from Google has a handy chart on the kinds of reasoning tasks AIs are tested on & how they do (GPT-4 wins) arxiv.org/pdf/2312.17661… pic.twitter.com/ZGC9NFVPS8
1/1/24, 3:17 PM

Joseph Thornton

Leave a comment