Source: https://news.ycombinator.com/item?id=40179232 https://www.strangeloopcanon.com/p/what-can-llms-never-do Wordle
https://www.wolfram.com/llm-benchmarking-project/
https://help.kagi.com/kagi/ai/llm-benchmark.html
https://arxiv.org/html/2405.19616
https://arxiv.org/html/2406.02061v1