A discrepancy between first- and third-party benchmark results for OpenAI’s o3 AI model is raising questions about the company’s transparency and model testing practices. When OpenAI unveiled o3 in December, the company claimed the model could answer just over a fourth of questions on FrontierMath, a challenging set of math problems. That score blew the […]