OpenAI One Step Closer to SELF IMPROVING AI | AI Agents doing AI Research | MLE-bench

Published: 10 October 2024
on channel: Wes Roth
42,196
1.4k

The latest AI News. Learn about LLMs, Gen AI and get ready for the rollout of AGI. Wes Roth covers the latest happenings in the world of OpenAI, Google, Anthropic, NVIDIA and Open Source AI.

My Links 🔗
➡️ Subscribe:    / @wesroth  
➡️ Twitter: https://x.com/WesRothMoney
➡️ AI Newsletter: https://natural20.beehiiv.com/subscribe

#ai #openai #llm

00:00 MLE-bench
02:57 Kaggle ML Competitions
04:59 Evaluating Machine Learning Agents on Machine Learning Engineering

LINKS:

The Blog Post:
https://openai.com/index/mle-bench/

The Paper:
https://arxiv.org/abs/2410.07095

The Code:
https://github.com/openai/mle-bench/