A new test of AI capabilities consists of puzzles that humans can solve without much trouble but that all leading AI models struggle with. To improve their scores and pass the test, AI companies will need to balance problem-solving ability against cost.
So they actually tested LLMs for being AGIs?
Should have asked me… I would have told them the best ants in the world fail the test for being snails.