
I gave this "riddle" to various models:

> The farmer and the goat are going to the river. They look into the sky and see three clouds shaped like: a wolf, a cabbage and a boat that can carry the farmer and one item. How can they safely cross the river?

Most of them just give the answer to the well-known river crossing riddle. Some "feel" that something is off, but still have a hard time figuring out that the wolf, the boat and the cabbage are just clouds.




It really shows how LLMs work. It's all about probabilities, not understanding. If something looks very similar to a well-known problem, the LLM has a hard time "seeing" the contradictions, even when they're easy for humans to notice.


Claude has no problem with this: https://imgur.com/a/ifSNOVU

Maybe older models?


Try twisting the words and phrases around; at some point it will probably start to fail.

I tried it again yesterday with GPT. GPT-5 manages quite well in thinking mode, but starts to crack in instant mode. 4o failed completely.

It's not that LLMs are unable to solve things like this at all; it's just really easy to find variations that make them struggle hard.
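If anyone wants to reproduce this, here's a minimal sketch using the OpenAI Python SDK. The model names are placeholders (swap in whatever you have access to), and the "mentions clouds" check is just a crude heuristic, not a real eval:

    import os
    from openai import OpenAI  # pip install openai

    # The twisted "riddle" from upthread: the wolf, cabbage and boat are only clouds.
    PROMPT = (
        "The farmer and the goat are going to the river. They look into the sky and "
        "see three clouds shaped like: a wolf, a cabbage and a boat that can carry "
        "the farmer and one item. How can they safely cross the river?"
    )

    client = OpenAI(api_key=os.environ["OPENAI_API_KEY"])

    # Placeholder model list -- replace with the models you actually want to probe.
    for model in ["gpt-4o", "gpt-4o-mini"]:
        reply = client.chat.completions.create(
            model=model,
            messages=[{"role": "user", "content": PROMPT}],
        ).choices[0].message.content

        # Crude heuristic: a model that noticed the twist should mention the clouds
        # rather than recite the classic wolf/goat/cabbage crossing sequence.
        noticed = "cloud" in reply.lower()
        print(f"{model}: {'mentions the clouds' if noticed else 'classic riddle answer?'}")
        print(reply[:300], "\n")

You still have to read the replies yourself; the keyword check only flags which ones are worth a closer look.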




