Mystery AI Hype Theater 3000, Episode 44: OpenAI's Ridiculous 'Reasoning'
The company behind ChatGPT is back with a bombastic claim that their new o1 model is capable of so-called "complex reasoning." Ever-faithful, Alex and Emily tear it apart. Plus the flaws in a tech publication's new 'AI hype index,' and some palate-cleansing new regulation against data-scraping worker surveillance.
References:
OpenAI: Learning to reason with LLMs
- How reasoning works
- GPQA, a 'graduate-level' Q&A benchmark system
Fresh AI Hell:
MIT Technology Review's 'AI hype index'
CFPB Takes Action to Curb Unchecked Worker Surveillance
You can check out future streams at http://twitch.tv/dair_institute.
Follow Emily at https://twitter.com/EmilyMBender and https://dair-community.social/@EmilyMBender
Follow Alex at https://twitter.com/alexhanna and https://dair-community.social/@alex
Music: Toby Menon.
Production: Christie Taylor.
Graphic Design: Naomi Pleasure-Park.