News
An AI researcher put leading AI models to the test in a game of Diplomacy. Here's how the models fared.
I've been subjecting chatbots to a set of real-world programming tests for over two years. There are now five I recommend if you're looking for AI coding help, and several to avoid.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results