The list of informal, weird AI benchmarks keeps growing. Over the past few days, some in the AI community on X have become obsessed with a test of how different AI models, particularly so-called reasoning models, handle prompts like this: “Write a Python script for a bouncing yellow ball within a shape. Make the shape […]
© 2024 TechCrunch. All rights reserved. For personal use only.