I've been struggling all week trying to get Claude Code to write code to produce...

poszlem · 2025-11-14T20:55:45 1763153745

I’m not sure if it's just me, but I've also noticed Claude becoming even more lazy. For example, I've asked it several times to fix my tests. It'll fix four or five of them, then start struggling with the next couple, and suddenly declare something like: "All done, fixed 5 out of 10 tests. I can’t fix the remaining ones", followed by a long, convoluted explanation about why that’s actually a good thing.

__MatrixMan__ · 2025-11-15T11:12:51 1763205171

I don't know if it has gotten worse, but I definitely find Claude is way too eager to celebrate success when it has done nothing.

It's annoying but I prefer it to how Gemini gets depressed if it takes a few tries to make progress. Like, thanks for not gaslighing me, but now I'm feeling sorry for a big pile of numbers, which was not a stated goal in my prompt.

rossant · 2025-11-14T20:27:20 1763152040

Have you tried OpenAI Codex with GPT5.1? I'm using it for similar GPU rendering stuff and it appears to do an excellent job.

fancy_pantser · 2025-11-14T20:09:33 1763150973

Have you given using MCPs to provide documentation and examples a shot? I always have to bring in docs since I don't work in Python and TS+React (which it seems more capable at) and force it to review those in addition to any specification. e.g. Context7

ryandrake · 2025-11-14T21:40:59 1763156459

Haven't looked into MCPs yet. Thanks for the suggestion!

jamilton · 2025-11-14T20:48:44 1763153324

I know this has been said many times before, but I wonder why this is such a common outcome. Maybe from negative outcomes being underrepresented in the training data? Maybe that plus being something slightly niche and complex?

The screenshot method not working is unsurprising to me, VLLMs visual reasoning is very bad with details because they (as far as I understand) do not really have access to those details, just the image embedding and maybe an OCR'd transcript.