We created a benchmark for open-ended computer science problems. It includes algorithmic problems handpicked by former IOI gold medalists and research problems in various areas from PhD students. See the blog and leaderboard for how frontier models and popular evolve frameworks perform.