22 Comments
Mar 21, 2023 · Liked by Arvind Narayanan

You guys are amazing! Crush the hype! Expose the parrot! :)

Re "Melanie Mitchell gives an example of an MBA test question where changing some details in a way that wouldn’t fool a person is enough to fool ChatGPT (running GPT-3.5). A more elaborate experiment along these lines would be valuable." A quick experiment with GPT-4 suggests that, unlike ChatGPT, GPT-4 is far less sensitive to changes in wording. In our (brief) experiments with the same prompt Mitchell used, GPT-4 output the correct answer every time.

The first job that will disappear with AI is that of scientists who use quantitative measures to predict which jobs will disappear with AI.

So is it fair to say that standardized tests are a completely bogus way of estimating how someone will perform in the real world (based on the above thesis)? Or that they could even be directionally damaging? Imagine a doctor who aces their standardized test but does poorly in the real world. Or is the thesis that, for humans, standardized tests are a proxy for real-world knowledge, but not so for AI? As in, humans build upon and adapt, whereas AI learning stops at memorized training data? Thanks; I'll look for your answer in the comments.

Isn't the biggest issue that ChatGPT is accessing a database, period?

This is exactly like a prospective lawyer being able to bring cheat notes of infinite length into a bar exam. Whether it is a relational database (RDBMS) or the equivalent of a pointer-based database is irrelevant; 175B parameters perform like a gigantic database.

And no, 175B parameters are not comparable to human memorization; they amount to far more. The entire Encyclopedia Britannica is about 23 gigabytes, compressed, so the ChatGPT parameters are at least the equivalent of having memorized the entire Encyclopedia Britannica, and likely far, far more.

You guys share amazing information with us.

Whoa, this is a great take on GPT-4! If you gave me internet access during all of the AP tests, I could probably do about as well as GPT-4.

In mid-2022, Bill Gates suggested using AP Bio (Advanced Placement biology exam) to judge GPT.

He told OpenAI to make it "capable of answering questions that it hasn’t been specifically trained for" and that AP Bio was *THE* test to prove their work is a revolutionary "breakthrough". They came back after a few months and GPT aced the test. See

https://www.gatesnotes.com/The-Age-of-AI-Has-Begun?WT.mc_id=20230321100000_Artificial-Intelligence_BG-TW_&WT.tsrc=BGTW

Was Gates fooled by OpenAI, or might unintended "contamination" (as described above) have fooled them all?

While I responded to the thread, I felt this adds to the discussion and merits its own comment thread, if the authors are so inclined. This is from Microsoft Research, posted in the last 20 hours: https://arxiv.org/pdf/2303.12712.pdf

Very interesting read. Thank you.