Simple tasks showing reasoning breakdown in state-of-the-art LLMs
349 points by tosh | 370 comments on Hacker News.