Claude has no problem with this: https://imgur.com/a/ifSNOVU Maybe older models?

andix · 2025-11-15T14:30:10 1763217010

Try to twist around words and phrases, at some point it might start to fail.

I tried it again yesterday with GPT. GPT-5 manages quite well too in thinking mode, but starts crackling in instant mode. 4o completely failed.

It's not that LLMs are unable to solve things like that at all, but it's really easy to find some variations that make them struggle really hard.