Is it a bubble?

f154hfds · 2025-12-10T19:28:28 1765394908

The post script was pretty sobering. It's kind of the first time in my life that I've been actively hoping for a technology to out right not deliver on its promise. This is a pretty depressing place to be, because most emerging technologies provide us with exciting new possibilities whereas this technology seems only exciting for management stressed about payroll.

It's true that the technology currently works as an excellent information gathering tool (which I am happy to be excited about) but that doesn't seem to be the promise at this point, the promise is about replacing human creativity with artificial creativity which.. is certainly new and unwelcome.

stack_framer · 2025-12-10T19:56:02 1765396562

> It's kind of the first time in my life that I've been actively hoping for a technology to out right not deliver on its promise.

Same here, and I think it's because I feel like a craftsman. I thoroughly enjoy the process of thinking deeply about what I will build, breaking down the work into related chunks, and of course writing the code itself. It's like magic when it all comes together. Sometimes I can't even believe I get to do it!

I've spent over a decade learning an elegant language that allows me to instruct a computer—and the computer does exactly what I tell it. It's a miracle! I don't want to abandon this language. I don't want to describe things to the computer in English, then stare at a spinner for three minutes while the computer tries to churn out code.

I never knew there was an entire subclass of people in my field who don't want to write code.

I want to write code.

zparky · 2025-12-10T20:22:54 1765398174

It's been blowing my mind reading HN the past year or so and seeing so many comments from programmers that are excited to not have to write code. It's depressing.

IanCal · 2025-12-10T23:47:30 1765410450

There are three takes that I think are not depressing:

* Being excited to be able to write the pieces of code they want, and not others. When you sit down to write code, you do not do everything from scratch, you lean on libraries, compilers, etc. Take the most annoying boilerplate bit of code you have to write now - would you be happy if a new language/framework popped up that eliminated it?

* Being excited to be able to solve more problems because the code is at times a means to an end. I don't find writing CSS particularly fun but I threw together a tool for making checklists for my kids in very little time using llms and it handled all of the css for printing vs on the screen. I'm interested in solving an optimisation issue with testing right now, but not that interested in writing code to analyse test case perf changes so the latter I got written for me in very little time and it's great. It wasn't really a choice of me or machine, I do not really have the time to focus on those tasks.

* Being excited that others can get the outcomes I've been able to get for at least some problems, without having to learn how to code.

As is tradition, to torture a car analogy, I could be excited for a car that autonomously drives me to the shops despite loving racing rally cars.

wakawaka28 · 2025-12-11T01:46:15 1765417575

Those are all good outcomes, up to a point. But if this stuff works TOO well, most or maybe all of us will have to start looking at other career options. Whatever autonomy you think you have in deciding what the AI does, that can ultimately be trained as well, and it will be the more people use it.

I personally don't like it when others who don't know how to code are able to get results using AI. I spent many years of my life and a small fortune learning scarce skills that everyone swore would be the last to ever be automated. Now, in a cruel twist of fate, those skills are being automated and there is seemingly no worthwhile job that can't be automated given enough investment. I am hopeful because the AI still has a long way to go, but even with the improvements it currently has, it might ultimately destroy the tech industry. I'm hoping that Say's Law proves true in this case, but even before the AI I was skeptical that we would find work for all the people trying to get into the software industry.

badsectoracula · 2025-12-11T10:27:02 1765448822

> I personally don't like it when others who don't know how to code are able to get results using AI.

Sounds like for many programmers AI is the new Visual Basic 6 :-P

wakawaka28 · 2025-12-11T15:02:26 1765465346

It's worse than that lol. At least with VB 6 and similar scripting languages, there is still code getting written. Now we have complete morons who think they're software developers because they got some AI to shit out an app for them. This is going to affect how people view the profession of software engineering all around.

ares623 · 2025-12-11T03:39:20 1765424360

Except in this case you won't be able to afford going to the shops anymore. Or even if the shops will still be around. What use is an autonomous car if you can't use it.

zahlman · 2025-12-11T00:33:05 1765413185

I suspect, rather strongly, that what really specifically wears programmers down is boilerplate.

AI is addressing that problem extremely well, but by putting up with it rather than actually solving it.

I don't want the boilerplate to be necessary in the first place.

projektfu · 2025-12-11T01:58:11 1765418291

Or, for me, yak shaving. I start a project with enthusiasm and then 8 hours later I'm debugging an nginx config file or something rather than working on the core project. AI gets a lot of that out of the way if you let it, and you can at least let it grind on that stuff while you think about other things.

zahlman · 2025-12-11T02:02:40 1765418560

For me, the yak shaving is the part where I get the next project idea...

seanmcdirmid · 2025-12-10T23:44:51 1765410291

It is fun. It takes some skill to organize a pipeline to generate code that would be tedious to write and maintain otherwise. You are still writing stuff to instruct the computer, but now you have something taking natural language instructions and generating code and code test assets.

There might have been people who were happy to write assembly that got bummed about compilers. This AI stuff judt feels like a new way to write code.

youoy · 2025-12-11T08:00:26 1765440026

I think that the main missunderstanding is that we used to think programming=coding, but this is not the case. LLMs allow people to use natural language as a programming language, but you still need to program. As with every programing language, it requires you to learn how to use it.

Not everyone needs to be excited about LLMs, in the same way that C++ developers dont need to be excited about python.

xyzwave · 2025-12-11T16:02:39 1765468959

I hate writing code, but love debugging. LLMs have been a godsend for banging out boilerplate and getting things 95% of the way there. Now I spend most of my time on the hard stuff (debugging, refactoring), while building things that would have taken weeks in days. It’s honestly made the act of building software more enjoyable and rewarding.

xnx · 2025-12-11T00:06:32 1765411592

Some carpenters like to make cabinets. Some just like to hammer nails.

DevDesmond · 2025-12-11T00:11:32 1765411892

Perhaps consider that I still think coding by prompting is just another layer of abstraction on top of coding.

I'm my mind, writing the prompt that generates the code is somewhat analogous to writing the code that generates the assembly. (Albeit, more stochastically, the way psychology research might be analogous to biochemistry research).

Different experts are still required at different layers of abstraction, though. I don't find it depressing when people show preference for working at different levels of complexity / tooling, nor excitement about the emergence of new tools that can enable your creativity to build, automate, and research. I think scorn in any direction is vapid.

layer8 · 2025-12-11T01:05:13 1765415113

One important reason people like to write code is that it has well-defined semantics, allowing to reason about it and predict its outcome with high precision. Likewise for changes that one makes to code. LLM prompting is the diametrical opposite of that.

youoy · 2025-12-11T08:10:33 1765440633

It completely depends on the way you prompt the model. Nothing prevents you from telling it exactly what you want, to the level of specifying the files and lines to focus on. In my experience anything other than that is a recepy for failure in sufficiently complex projects.

layer8 · 2025-12-11T14:06:23 1765461983

Several comments can be made here: (1) You only control what the LMM generates to the extent that you specify precisely what it should generate. You cannot reasons about what it will generate for what you don't specify. (2) Even for what you specify precisely, you don't actually have full control, because the LLM is not reliable in a way you can reason about. (3) The more you (have to) specify precisely what it should generate, the less benefit using the LLM has. After all, regular coding is just specifying everything precisely.

The upshot is, you have to review everything the LLM generates, because you can't predict the qualities or failures of its output. (You cannot reason in advance about what qualities and failures it definitely will or will not exhibit.) This is different from, say, using a compiler, whose output you generally don't have to review, and whose input-to-output relation you can reason about with precision.

Note: I'm not saying that using an LLM for coding is not workable. I'm saying that it lacks what people generally like about regular coding, namely the ability to reason with absolute precision about the relation between the input and the behavior of the output.

yunwal · 2025-12-11T04:43:02 1765428182

You’re still allowed to reason about the generated output. If it’s not what you want you can even reject it and write it yourself!

palmotea · 2025-12-11T07:17:21 1765437441

>> One important reason people like to write code is that it has well-defined semantics, allowing to reason about it and predict its outcome with high precision. Likewise for changes that one makes to code. LLM prompting is the diametrical opposite of that.

> You’re still allowed to reason about the generated output. If it’s not what you want you can even reject it and write it yourself!

You missed the key point. You can't predict and LLM's "outcome with high precision."

Looking at the output and evaluating it after the fact (like you describe) is an entirely different thing.

yunwal · 2025-12-11T11:59:26 1765454366

For many things you can though. If I ask an LLM to create an alert in terraform that triggers when 10% of requests fail over a 5 minute period and sends an email to some address, with the html on the email looking a certain way, it will do exactly the same as if I looked at the documentation, and figured out all of the fields 1 by 1. It’s just how it works when there’s one obvious way to do things. I know software devs love to romanticize about our jobs but I don’t know a single dev who writes 90% meaningful code. There’s always boilerplate. There’s always fussing with syntax you’re not quite familiar with. And I’m happy to have an AI do it

palmotea · 2025-12-11T15:05:44 1765465544

I think you're still missing the point. This cousin comment does a decent job of explaining it: https://news.ycombinator.com/item?id=46231510

rester324 · 2025-12-11T02:16:08 1765419368

I love to write code too. But what usually happens is that I go through running the gauntlet of proving how brilliant code I can write in a job interview, and then later conversely being paid for listening to really dumb conversations of our stakeholders and sitting in project planning, etc meetings just so that finally everybody can harass me to implement something that a million programmer implemented before me a million times, at which point the only metric that matters to either my fellow developers or my managers or the stakeholders is the speed of churning the code out, quality or design be damned. So for this reason in most cases in my work I use LLMs.

How any of that comes down to an investment portfolio manager as writing "world class code" by LLMs is a mistery to me.

citrin_ru · 2025-12-11T15:08:12 1765465692

> I never knew there was an entire subclass of people in my field who don't want to write code.

Some people don't enjoy writing code and went into software development only because it's a well paid and a stable job. Now this trade is under the thread and they are happy to switch to prompting LLMs. I do like to code so use LLMs less then many my colleagues.

Though I don't expect to see many from this crowd in HM, instead I expect here to see entrepreneurs who need a product to sell and don't care if it is written by humans or by LLMs.

doug_durham · 2025-12-10T21:19:16 1765401556

Writing code is my passion, and like you I'm amazed I get paid to do it. That said in any new project there is a large swath of code that needs to be written that I've written many times before. I'm happy to let the LLM write the low value code so I can work on the interesting parts. Examples of this type of code are argument parsers and interfacing with REST interfaces. I add no value there.

averageRoyalty · 2025-12-10T20:52:51 1765399971

So write code.

Maybe post renaissance many artists no longer had patrons, but nothing was stopping them from painting.

If your industry truely is going in the direction where there's no paid work for you to code (which is unlikely in my opinion), nobody is stopping you. It's easier than ever, you have decades of personal computing at your fingertips.

Most people with a thing they love do it as a hobby, not a job. Maybe you've had it good for a long time?

tjr · 2025-12-10T21:27:00 1765402020

From the GNU Manifesto:

I could answer that nobody is forced to be a programmer. Most of us cannot manage to get any money for standing on the street and making faces. But we are not, as a result, condemned to spend our lives standing on the street making faces, and starving. We do something else.

https://www.gnu.org/gnu/manifesto.en.html

harimau777 · 2025-12-11T03:07:35 1765422455

That's tough to do without time and money. Which is something we certainly won't have if the decent jobs get automated out of existence.

marcosdumay · 2025-12-10T20:27:28 1765398448

I'm quite ok with only writing code in my personal time. In fact, if I could solve the problems there faster, I'd be delighted.

Instead, I've reacted to the article from the opposite direction. All those grand claims about stuff this tech doesn't do and can't do. All that trying to validate the investment as rational when it's absolutely obvious it's at least 2 orders of magnitude larger than any arguably rational value.

kace91 · 2025-12-11T06:56:30 1765436190

>I never knew there was an entire subclass of people in my field who don't want to write code.

Regardless of AI this has been years in the making. “Learn to code” has been the standard grinder cryptobro advice for “follow the money” for a while, there’s a whole generation of people getting into the industry for financial reasons (which is not wrong, just a big cultural shift).

georgeecollins · 2025-12-10T20:49:13 1765399753

I also love to code, though it's not what people pay to do anymore.

You should never hope for a technology to not deliver on its promise. Sooner or later it usually does. The question is, does it happen in two years or a hundred years? My motto: don't predict, prepare.

djeastm · 2025-12-11T10:42:47 1765449767

>You should never hope for a technology to not deliver on its promise. Sooner or later it usually does.

Lots of wiggle room between "never" or "usually". We're not all riding Segways or wearing VR goggles. Seems wiser to work on case-by-case basis here.

gspr · 2025-12-10T23:13:11 1765408391

> You should never hope for a technology to not deliver on its promise. Sooner or later it usually does.

Really? Are you sure there isn't a lot of confirmation bias in this? Do you really have a good handle on 100-year-old tech hypes that didn't deliver? All I can think of is "flying everything".

stego-tech · 2025-12-11T02:28:51 1765420131

I'm right there with you, and it's been my core gripe since ChatGPT burst onto the stage. Believe it or not, my environmental concerns came about a year later, once we had data on how datacenters were being built and their resource consumption rates; I had no idea how big things had very suddenly and violently exploded into, and that alone gave me serious pause about where things are going.

In my heart, I firmly believe in the ability of technology to uplift and improve humanity - and have spent much of my career grappling with the distressing reality that it also enables a handful of wealthy people to have near-total control of society in the process. AI promises a very hostile, very depressing, very polarized world for everyone but those pulling the levers, and I wish more people evaluated technology beyond the mere realm of Computer Science or armchair economics. I want more people to sit down, to understand its present harms, its potential future harms, and the billions of people whose lives it will profoundly and negatively impact under current economic systems.

It's equal parts sobering and depressing once you shelve personal excitement or optimism and approach it objectively. Regardless of its potential as a tool, regardless of the benefit it might bring to you, your work day, your productivity, your output, your ROI, I desperately wish more people would ask one simple question:

Is all of that worth the harm I'm inflicting on others?

simianwords · 2025-12-11T03:32:23 1765423943

Some person asked this same question about computers back in the day.

stego-tech · 2025-12-11T04:53:13 1765428793

The fact the question has been asked before does not make it any less valuable or worthwhile to ask now, and history is full of the sort of pithy replies like yours masquerading as profound philosophical insights. I’d like to think the question is asked at every invention, every revolution, because we must doubt our own creations lest we blind ourselves to the consequences of our actions.

Nothing is inevitable. Systems can be changed if we decide to do so, and AI is no different. To believe in inevitability is to embrace fatalism.

oytis · 2025-12-11T16:58:58 1765472338

I dunno, I might be getting old, but I think the idea that people absolutely need a job to stay sane betrays lack of imagination. Of course getting paid just enough for survival is pretty depressing, but if I can have healthy food, a spacious place to live, ability to travel and all the free time I can have, I'd be absolutely happy without a job. Maybe I'd be even writing code, just not commercially useful one.

some-guy · 2025-12-11T03:24:22 1765423462

There are a few areas where I have found LLMs to be useful (anything related to writing code, as a search engine) and then just downright evil and upsetting in every other instance of using it, especially as a replacement for human creativity and personal expression.

Night_Thastus · 2025-12-10T23:40:40 1765410040

Don't worry that much about 'AI' specifically. LLMs are an impressive piece of technology, but at the end of the day they're just language predictors - and bad ones a lot of the time. They can reassemble and remix what's already been written but with no understanding of it.

It can be an accelerator - it gets extremely common boiler-plate text work out of the way. But it can't replace any job that requires a functioning brain, since LLMs do not have one - nor ever will.

But in the end it doesn't matter. Companies do whatever they can to slash their labor requirements, pay people less, dodge regulations, etc. If not 'AI' it'll just be something else.

DevDesmond · 2025-12-11T00:51:31 1765414291

Text is an LLMs input and output, but, under the hood, the transformer network is capable of far more than mere re-assembly and remix of text. Transformers can approximate turing completeness as their size scales, and they can encode entire algorithms in their weights. Therefore, I'd argue they can do far more than reassemble and remix. These aren't just Markov models anymore.

(I'd also argue that "understanding" and "functional brain" are unfalsifiable comparisons. What exactly distinguishes a functional brain from a turing machine? Chess once required a functional brain to play, but has now been surpassed by computation. Saying "jobs that require a human brain" is tautological without any further distinction).

Of course, LLMs are definitely missing plenty of brain skills like working in continuous time, with persistent state, with agency, in physical space, etc. But to say that an LLM "never will" is either semantic, (you might call it something other than an LLM when next generation capabilities are integrated), tautological (once it can do a human job, it's no longer a job that requires a human), or anthropocentric hubris.

That said, who knows what the time scale looks like for realizing such improvements – (decades, centuries, millennia).

mrdependable · 2025-12-11T01:54:05 1765418045

What I don't understand is, will every company really want to be beholden to some AI provider? If they get rid of the workers, all of a sudden they are on the losing end of the bargaining table. They have incredible leverage as things stand.

asdff · 2025-12-10T23:22:31 1765408951

I think it just reflects on the sort of businesses that these companies are vs others. Of course we worry about this in the context of companies that dehumanize us, reduce us to line item costs and seek to eliminate us.

Now imagine a different sort of company. A little shop where the owner's first priority is actually to create good jobs for their employees that afford a high quality life. A shop like that needn't worry about AI.

It is too bad that we put so much stock as a society in businesses operating in this dehumanizing capacity instead of ones that are much more like a family unit trying to provide for each other.

0manrho · 2025-12-10T23:45:02 1765410302

Regarding that PS:

> This strikes me as paradoxical given my sense that one of AI’s main impacts will be to increase productivity and thus eliminate jobs.

The allegation that an "Increase of productivity will reduce jobs" has been proven false by history over and over again it's so well known it has a name, "Jevons Paradox" or "Jevons Effect"[0].

> In economics, the Jevons paradox (sometimes Jevons effect) occurs when technological advancements make a resource more efficient to use [...] results in overall demand increasing, causing total resource consumption to rise.

The "increase in productivity" does not inherently result in less jobs, that's a false equivalence. It's likely just as false as it was in 1915 with the the assembly line and the Model T as it is in 2025 with AI and ChatGPT. This notion persists because as we go through inflection points due to something new changing up market dynamics, there is often a GROSS loss (as in economics) of jobs that often precipitates a NET gain overall as the market adapts, but that's not much comfort to people that lost or are worried about losing their jobs due to that inflection point changing the market.

The two important questions in that context for individuals in the job market during those inflections points (like today) are: "how difficult is it to adapt (to either not lose a job, or to benefit from or be a part of that net gain)?" and "Should you adapt?" Afterall, the skillsets that the market demands and the skillsets it supplies are not objectively quantifiable things; the presence of speculative markets is proof that this is subjective, not objective. Anyone who's ever been involved in the hiring process knows just how subjective this is. Which leads me to:

> the promise is about replacing human creativity with artificial creativity which.. is certainly new and unwelcome.

Disagree that that's what the promise about. That IS happening, I don't disagree there, but that's not the promise that corporate is so hyped about. If we're being honest and not trying to blow smoke up people's ass to artificially inflate "value," AI is fundamentally about being more OBJECTIVE than SUBJECTIVE with regard to costs and resources of labor, and it's outputs. Anyone who knows what OKR's are and has been subject to a "performance review" in a self professed "Data driven company" knows how much modern corporate America, especially the tech market, loves it's "quantifiables." It's less about how much better it can allegedly do something, but the promise of how much "better" it can be quantified vs human labor. As long as AI has at least SOME proven utility (which it does), this promise of quantifiables combined with it's other inherent potential benefits (Doesn't need time off, doesn't sleep, doesn't need retirement/health benefits, no overtime pay, no regulatory limitations on hours worked, no "minimum wage") means that so long as the monied interests perceive it as continuing to improve, then they can dismiss it's inefficiencies/ineffectiveness in X or Y by the promise of it's potential to overcome that eventually.

It's the fundamental reason why people are so concerned about AI replacing Humans. Especially when you consider one of the things that AI excels at is quickly delivering an answer with confidence (people are impressed with speed and a sucker for confidence), and another big strength is it's ability to deal with repetitive minutia in known and solved problem spaces(a mainstay of many office jobs). It can also bullshit with best of them, fluff your ego as much as you want (and even when you don't), and almost never says "No" or "You're wrong" unless you ask it to.

In other words, it excels at the performative and repetitive bullshit and blowing smoke up your boss' ass and empowers them to do the same for their boss further up the chain, all while never once ruffling HR's feathers.

Again, it has other, much more practical and pragmatic utility too, it's not JUST a bullshit oracle, but it IS a good bullshit oracle if you want it to be.

0: https://en.wikipedia.org/wiki/Jevons_paradox

harimau777 · 2025-12-11T03:09:29 1765422569

If that's the case, then why do we live in this late capitalist hell hole? Any technology that gets developed will be used for its worst, most dehumanizing purpose possible. That's just the reality of the shity society we live in.

0manrho · 2025-12-11T07:53:54 1765439634

You're a cheerful one, aren't you?

All it takes for evil to persevere is good people to sit by and do nothing. Don't like the situation you're in, do something about it. Preferably other than doomscrolling, but hey, you do you.

Joel_Mckay · 2025-12-10T20:27:48 1765398468

LLM slop doesn't have aspirations at all, its just click bait nonsense.

https://www.youtube.com/watch?v=_zfN9wnPvU0

Drives people insane:

https://www.youtube.com/watch?v=yftBiNu0ZNU

And LLM are economically and technologically unsustainable:

https://www.youtube.com/watch?v=t-8TDOFqkQA

These have already proven it will be unconstrained if AGI ever emerges.

https://www.youtube.com/watch?v=Xx4Tpsk_fnM

The LLM bubble will pass, as it is already losing money with every new user. =3

supongo · 2025-12-11T22:23:40 1765491820

I've had some success in using Claude Code, with caveats.

To give some context - I started developing a tactical RPG. I had an MVP prior to using Claude Code. I continued to work on the project, but lost motivation due to work burnout and prioritizing other hobbies.

I gave Claude Code a try to see whether it's any use. It helped more than I expected it to - it helped me produce something while dealing with burnout by building on the MVP I developed prior to AI assisted development.

The main issues I ran into were:

1) A lot of effort into reviewing the output. Main difference from peer review is that there's quicker feedback.

2)It throws out some absolutely wild solutions sometimes. It build on my existing architecture, so it was easier to catch issues. If I hadn't developed the architecture without AI assistance, things could have gone badly.

3)I only pay for the $20 Claude plan. Anything useful Claude produces for me requires it to consume a lot of tokens due to back-and-forth questions and asking Claude to dig into source file.

The most significant issue I ran into with Claude is when it suggested solutions I don't have the background to review. I don't know much about optimization, so I ran into issues with both rendering and the ECS (entity component system) library. Claude gave me recommendations, but I didn't know how to evaluate the code due to lacking that experience.

Claude was good for things I know how to do but don't want to do. It's been helpful when I want to work on something without being motivated enough to put 100% (or even 70%) into it.

If it's things I don't know how to do (like game optimization) it's harmful.

artur44 · 2025-12-10T18:52:32 1765392752

A lot of the debate here swings between extremes. Claims like “AI writes most of the code now” are obviously exaggerated especially coming from a nontechnical author but acting like any use of AI is a red flag is just as unrealistic. Early stage teams do lean on LLMs for scaffolding, tests and boilerplate, but the hard engineering work is still human. Is there a bubble? Sure, valuations look frothy. But like the dotcom era, a correction doesn’t invalidate the underlying shift it just clears out the noise. The hype is inflated, the technology is real.

artur44 · 2025-12-11T20:07:35 1765483655

I think some wires got crossed. My point wasn’t that LLMs can’t produce useful infra or complex code clearly they can, as many examples here show. It’s just that neither extreme narrative AI writes everything now vs. you can’t trust it for anything serious reflects how teams actually work. LLMs are great accelerators for boilerplate, declarative configs, and repetitive logic, but they don’t replace engineering judgement they shift where that judgement is applied. That’s why I see AI as real, transformative tech inside an overhyped investment cycle, not as magic that removes humans from the loop.

Daishiman · 2025-12-11T00:38:15 1765413495

> Early stage teams do lean on LLMs for scaffolding, tests and boilerplate, but the hard engineering work is still human.

I no longer believe this. A friend of mine just did a stint a startup doing fairly sophisticated finance-related coding and LLMs allowed them to bootstrap a lot of new code, get it up and running in scalable infra with terraform, and onboard new clients extremely quickly and write docs for them based on specs and plans elaborated by the LLMs.

This last week I extended my company's development tooling by adding a new service in a k8s cluster with a bunch of extra services, shared variables and configmaps, and new helm charts that did exactly what I needed after asking nicely a couple of times. I have zero knowledge of k8s, helm or configmaps.

xdc0 · 2025-12-11T01:59:57 1765418397

If you are in charge of that tooling, how do you ensure the correctness of the work? Or is it that at this point the responsibility goes one level higher now where implementation details are not important or relevant at all and all it matters is it behaves as described?

yunnpp · 2025-12-11T03:33:46 1765424026

Just look at what they are stating:

> that did exactly what I needed

> I have zero knowledge of k8s, helm or configmaps.

Obviously this is not anything resembling engineering, or anything a respectful programmer would do. An elevator that is cut lose when you press 0 also works very well until you press 0. The claims of AI writing significant chunks of code come from these sort of people with little experience in programming or engineering in general, SPA vibe coders and what not. You should tremble at the thought of using any of the resulting systems in production, and certainly not try to replicate that workflow yourself. Which gives you a sense of how overblown these claims are.

Daishiman · 2025-12-11T04:40:45 1765428045

> The claims of AI writing significant chunks of code come from these sort of people with little experience in programming or engineering in general, SPA vibe coders and what not.

I'm sorry man but I've been doing this for 25 years and I've worked and studied with some extremely bright and productive engineers. I vouch for the code that I write or that I delegate to an LLM, and believe it or not it doesn't take a magician to write a k8s spec file, just patience to write 10 levels of nested YAMLs to describe the most boring, normal and predictable code to tell your cluster what volume mounts and env variables to load.

noodletheworld · 2025-12-11T07:49:35 1765439375

> I have zero knowledge of k8s, helm or configmaps

…

> I vouch for the code that I write or that I delegate to an LLM, and believe it or not it doesn't take a magician to write a k8s spec file…

I have been writing code since 1995.

That has zero relevance to my skill at rolling out deployments in a technology I know nothing about.

One of the two things you’ve said is false:

Either a) you do know what you’re talking about, or b) you are not confident in the results.

It can’t be both.

It sounds to me like you’re subscribed heavily into a hype train; that’s fine, but your position, as described, leaves a lot to desired, if you’re trying to describe some wide trend.

Here my anecdote: major cloudflare outages.

Hard things are hard. AI doesn’t solve that. Scaffolding is easy; ai can solve that.

Scaffolding is a reliable thing to rely on with ai.

Doing it for K8s configuration, if you don’t know k8s is stupid. I know what I’m talking about when I say that. Having it help you if you do know what you’re doing is perfectly legit.

Claiming it did help when claiming you have, and I quote, “zero knowledge” (but you actually do) is hype. Leave it on LinkedIn dude. :(

Daishiman · 2025-12-11T15:07:02 1765465622

> Either a) you do know what you’re talking about, or b) you are not confident in the results. It can’t be both.

You've been coding for a lifetime yet you don't seem to get that certainty in software is a spectrum? I have sufficient confidence in the output of LLMs to sign my name under the code it writes when putting up a PR for a specialist to read. That's good enough for 90% of the work that we do day-to-day. You think that's not hype-worthy?

> Doing it for K8s configuration, if you don’t know k8s is stupid. I know what I’m talking about when I say that. Having it help you if you do know what you’re doing is perfectly legit.

"Knowing" k8s is an oxymoron. K8s is a profoundly complicated piece of tech that can don insanely complicated things while also serving as a replacement for docker-compose or basic services that could have been hosted on ECR. The concepts behind basic k8s functionality are not difficult, but I saved myself two weeks of reading how to write helm spec files, a piece of knowledge I have no interest in learning because it doesn't add any appreciable value to the software I produce, and was instead able to focus on getting what I needed out of my cluster automation scripts.

This really isn't that complicated to understand. I don't care for being a k8s expert and I don't care for syntactical minutiae behind it. It isn't hype that I now I only need to understand the essential conceptual basics behind the software to get it working for what I need instead of doing a deep dive like I had to do years ago in when reading similar docs for similar IaC producs to get lesser functionality going.

Daishiman · 2025-12-11T04:39:38 1765427978

Because after 25 years of coding and a dozen infrastructure description languages I know that you test your code and you get someone expert in the field to look at your PRs.

LLMs are _really_ good at writing infra code if you know how infra works, believe it or not. And the ultimate responsibility still lies in human beings for code ownership.

biophysboy · 2025-12-11T01:56:15 1765418175

It depends on the task though, right? I promise I'm not in denial; I use these things all the time. Sometimes it works immediately; sometimes it doesn't. I have no way of predicting when it will or won't.

Daishiman · 2025-12-11T04:52:47 1765428767

* Infra code description languages like Terraform and K8s/helm spec files are like magic; they get 90% of the code right 90% of the time. In my experience that's about half of the work; the other half is spent debugging and correcting details that matter, but still applies to the code that I write myself.

* SQL works almost as good. It's especially useful when you need to generate queries with long lists of fields and complex query criteria. Give it a schema and let it rip.

* Python code works reasonably well. If your description is terse and clear it will generally do the right thing. It has a knack for being excessive in comments and will sometimes do things in ways that feel unnatural, but business code will be as good as the context that surrounds it. For boring, repetitive tasks like setting up program args, annotating types, and writing generic request/response cycles with common frameworks it will do boring old vanilla code. You'll likely want to touch it up and adapt it to your personal preference.

* Debugging is very much or miss. It has been absolutely fantastic at troubleshooting failed and stuck k8s jobs and service configuration issues, having no qualms about creating its own shell or python scripts to investigate ports or logs, and writing JSON parsing scripts that are snoozefest for a human to write. The regexes that I'd barely be arsed to write to parse enormous logs it writes trivially. For business logic, the more convoluted your logic the harder the time it will have, and for most debugging issues I prefer to let it run and list some hypotheses and potential issues and my intent is to learn and understand the problem myself deeply before committing to a fix.

biophysboy · 2025-12-11T15:55:46 1765468546

It sounds like it works better for declarative schema than imperative scripting/debugging (speaking loosely here). Do you agree? Seems like a good heuristic for me to keep in mind

jillesvangurp · 2025-12-11T07:52:52 1765439572

The thing to remember about the dotcom era was that while there were a lot of bad companies at the time with a lot of clueless investors behind them, quite a few companies made it through the implosion of that bubble and then prospered. Amazon, Google, eBay, etc. are still around.

More importantly, the web is now dominant for enterprise SaaS applications, which is a category of software that did not really exist before the web. And the web post–dot-com bubble spawned a lot of unicorns.

In short, there was an investment bubble. But the core tech was fine.

AI feels like one of those things where the tech is similarly transformational (even more so, actually). It’s another investment bubble predicated on the price of GPUs, which is mostly making Nvidia very rich right now.

Right now the model makers are getting most of the funding and then funneling non-trivial amounts to Nvidia (and their competitors). But actually the value creation is in applications using the models these companies create. And the innovation for that isn’t coming from the likes of Anthropic, OpenAI, Mistral, X.ai, etc. They are providing core technology, but they seem to be struggling to do productive things in terms of UX and use cases. Most of the interesting things in this space are coming from smaller companies figuring out how to use the models these companies produce. Models and GPUs are infrastructure, not end-user products.

And with the rise of open-source models, open algorithms, and exponentially dropping inference costs, the core infrastructure technology is not as much of a moat as it may seem to investors. OpenAI might be well funded, but their main UI (ChatGPT) is surprisingly limited and riddled with bugs. That doesn’t look like the polished work of a company that knows what they are doing. It’s all a bit hesitant and copycat. It’s never going to be a magic solution to everyone’s problems.

From where I’m sitting, there is clear untapped value in the enterprise space for AI to be used. And it’s going to take more than a half-assed chat UI to unlock that. It’s actually going to be a lot of work to build all of that. Coding tools are, so far, the most promising application of reasoning models. It’s easy to see how that could be useful in the context of ERP/manufacturing, CRM, traditional office applications, and the financial world.

Those each represent verticals with many established players trying to figure out how to use all this new stuff — and loads more startups eager to displace them. That’s where the money is going to be post-bubble. We’ve seen nothing yet. Just like after the dot-com bubble burst, all the money is going to be in new applications on top of the new infrastructure. It’s untapped revenue. And it’s not going to be about buying GPUs or offering benchmark-beating models. That’s where all the money is going currently. That’s why it is a bubble.

liampulles · 2025-12-11T21:40:44 1765489244

> Coding, which we called “computer programming” 60 years ago, is the canary in the coal mine in terms of the impact of AI. In many advanced software teams, developers no longer write the code; they type in what they want, and AI systems generate the code for them. Coding performed by AI is at a world-class level, something that wasn’t so just a year ago. According to my guide here, “There is no speculation about whether or not human replacement will take place in that vertical.”

I'm starting to believe that AI coding optimism/pessimism maps to how much one actually cares about system longevity.

If a given developer just takes on board the demands for speed from the business and/or does not care about long-term maintainability (and I mean hey, some businesses foster that, and scaling quickly is important in many cases), then I can totally understand why they would embrace AI agents.

If you care about theory building, and domain driven design, and making a system comprehensive enough to extend in a year or two's time, then I can understand the resistance for the AI to let-it-rip. I admit to falling in this camp.

Am I off the mark here? I'd really like to hear from people who care about the long term who also let agents run relatively wild.

sp4cec0wb0y · 2025-12-10T18:26:13 1765391173

> In many advanced software teams, developers no longer write the code; they type in what they want, and AI systems generate the code for them.

What a wild and speculative claim. Is there any source for this information?

sethammons · 2025-12-10T19:05:35 1765393535

At $WORK, we have a bot that integrates with Slack that sets up minor PRs. Adjusting tf, updating endpoints, adding simple handlers. It does pretty well.

Also in a case of just prose to code, Claude wrote up a concurrent data migration utility in Go. When I reviewed it, it wasn't managing goroutines or waitgroups well, and the whole thing was a buggy mess and could not be gracefully killed. I would have written it faster by hand, no doubt. I think I know more now and the calculus may be shifting on my AI usage. However, the following day, my colleague needed a nearly identical temporary tool. A 45 minute session with Claude of "copy this thing but do this other stuff" easily saved them 6-8 hours of work. And again, that was just talking with Claude.

I am doing a hybrid approach really. I write much of my scaffolding, I write example code, I modify quick things the ai made to be more like I want, I set up guard rails and some tests then have the ai go to town. Results are mixed but trending up still.

FWIW, our CEO has declared us to be AI-first, so we are to leverage AI in everything we do which I think is misguided. But you can bet they will be reviewing AI usage metrics and lower wont be better at $WORK.

yellow_lead · 2025-12-10T23:44:04 1765410244

You should periodically ask Claude to review random parts of code to pump your metrics.

giancarlostoro · 2025-12-11T01:57:30 1765418250

Has the net benefit that it points out things that are actually wrong and overlooked.

strken · 2025-12-11T07:12:17 1765437137

AI reviews have the benefit of making me feel like an idiot in one bullet point and then a genius in the next.

rasz · 2025-12-11T02:54:04 1765421644

But also points out tons of your deliberate design choices as bugs, and will recommend removing things it doesnt understand.

rgbrgb · 2025-12-11T05:58:31 1765432711

just like any junior dev

rozap · 2025-12-11T06:27:51 1765434471

consider rewriting in rust

s1mplicissimus · 2025-12-11T09:18:39 1765444719

that's gonna be painful, as the borrow checker really trips up LLMs

jmalicki · 2025-12-11T16:37:21 1765471041

I do a lot of LLM work in rust, I find the type system is a huge defense against errors and hallucinations vs JavaScript or even Typescript.

giancarlostoro · 2025-12-11T14:39:49 1765463989

Great time to research if those choices are still valid or if there's a better way. In any regard, its just an overview, not a total rewrite from the AI's perspective.

lovich · 2025-12-11T05:03:33 1765429413

why periodically? Just set it up in an agentic workflow and have it work until your token limit is hit.

If companies want to value something as dumb as LoC then they get what they incentivized

oneeyedpigeon · 2025-12-11T10:35:03 1765449303

> we are to leverage AI in everything we do

Sounds like the extremely well-repeated mistake of treating everything like a nail because hammers are being hyped up this month.

roncesvalles · 2025-12-11T20:22:30 1765484550

The risk is that lay people read comments like this and conclude "ergo, we need fewer programmers."

Nothing that the LLM is outputting is useful in the hands of somebody who couldn't have done it themselves (at least, given a reasonable amount of time).

The most apt analogy is that of pilot and autopilot. Autopilot makes the job of the pilot more pleasant, but it doesn't even slightly obviate the need for the pilot, nor does it lower the bar for the people that you can train as pilots.

The benefits of LLM programming are mostly going to be subsumed by the operator, to make their lives easier. Very little is gonna go to their employer (despite all the pressure), and this is not due to some principal-agent breakdown; it's just intrinsic to the nature of this work.

nomel · 2025-12-11T21:12:00 1765487520

> ergo, we need fewer programmers.

How so? And in what context?

Where I am, headcount is based on "can we finish and sustain these planned and present required projects". If these automations allow a developer to burn less time, it reduces the need for headcount. As a direct result of this approach to hiring based on need, the concept of a "layoff" doesn't exist where I am.

roncesvalles · 2025-12-11T21:43:19 1765489399

>If these automations allow a developer to burn less time, it reduces the need for headcount.

This is exactly the fallacy, and it's very hard to see why it's a fallacy if you've never professionally written code (and even then).

Software development work fills to occupy the time allotted to it. That's because there is always a tradeoff between time and quality. If you have time available, you will fundamentally alter your approach to writing that piece of software. A rough analogy: air travel doesn't mean we take fewer vacations -- it just means we take vacations to farther away places.

Because of this effect, a dev can really finish a project in as little time as you want (up to a reasonable minimum). It just comes down to how much quality loss and risk can be tolerated. I can make a restaurant website in 1 hour (on Wix/Squarespace) or in 3 months (something hand-crafted and sophisticated). The latter is not "wasted time", it just depends on where you move the lever.

However, sometimes this is a false tradeoff. It isn't always necessary that the place you flew 3 hours will give you a better vacation than some place you could've driven to in 3 hours. You only hope it's better.

>As a direct result of this approach to hiring based on need, the concept of a "layoff" doesn't exist where I am.

LLMs or not, you could've just hired fewer people and made it work anyway. It's not like if you hired 3 people instead of 6 before the LLM era, it was impossible to do.

The gist of it is that LLMs are mostly just devs having fun and tinkering about, or making their quality of life better, or implementing some script, tooling, or different approach that they might've avoided before LLMs. There's no powertrain from that stuff to business efficiency.

nomel · 2025-12-11T23:12:16 1765494736

> This is exactly the fallacy, and it's very hard to see why it's a fallacy if you've never professionally written code (and even then).

This was not necessary or appropriate, and completely discredits your reply.

engineer_22 · 2025-12-11T21:34:23 1765488863

> The benefits of LLM programming are mostly going to be subsumed by the operator, to make their lives easier. Very little is gonna go to their employer

your boss is going to let you go home if you get all your work done early?

shuckles · 2025-12-11T01:55:46 1765418146

It took me a while to realize you were using "$WORK" as a shell variable, not as a reference to Slack's stock ticker prior to its acquisition by $CRM.

Terr_ · 2025-12-11T22:09:54 1765490994

Now I'm imagining a world where all publicly traded stocks are identified by reverse-order domain names.

re-thc · 2025-12-11T06:34:00 1765434840

You never know. Could be both.

palmotea · 2025-12-11T06:43:35 1765435415

> FWIW, our CEO has declared us to be AI-first, so we are to leverage AI in everything we do which I think is misguided. But you can bet they will be reviewing AI usage metrics and lower wont be better at $WORK.

I've taken some pleasure in having GitHub copilot review whitespace normalization PRs. It says it can't do it, but I hope I get my points anyway.

chickensong · 2025-12-10T19:52:40 1765396360

> it wasn't managing goroutines or waitgroups well, and the whole thing was a buggy mess and could not be gracefully killed

First pass on a greenfield project is often like that, for humans too I suppose. Once the MVP is up, refactor with Opus ultrathink to look for areas of weakness and improvement usually tightens things up.

Then as you pointed out, once you have solid scaffolding, examples, etc, things keep improving. I feel like Claude has a pretty strong bias for following existing patterns in the project.

ProllyInfamous · 2025-12-11T14:30:25 1765463425

This is a great response, even for a blue collar worker understanding none of its complexities (I have no code creation abilities, whatsoever — I can adjust parameters, and that's about it... I am a hardware guy).

My layperson anecdote about LLM coding is that using Perplexity is the first time I've ever had the confidence (artificial, or not) to actually try to accomplish something novel with software/coding. Without judgments, the LLM patiently attempts to turn my meat-speak into code. It helps explain [very simple stuff I can assure you!] what its language requires for a hardware result to occur, without chastising you. [Raspberry Pi / Arduino e.g.]

LLMs have encouraged me to explore the inner workings of more technologies, software and not. I finally have the knowledgeable apprentice to help me with microcontroller implementations, albeit slowly and perhaps somewhat dangerously [1].

----

Having spent the majority of my professional life troubleshooting hardware problems, I often benefit from rubber ducky troubleshooting [0], going back to the basics when something complicated isn't working. LLMs have been very helpful in this roleplay (e.g. garage door openers, thermostat advanced configurations, pin-outs, washing machine not working, etc.).

[0] <https://en.wikipedia.org/wiki/Rubber_duck_debugging>

[1] "He knows just enough to be dangerous" —proverbial electricians

¢¢

giardini · 2025-12-11T19:23:28 1765481008

As a software guy going way back, this post may be the death knell of software development as I've known it. I have never seen a good hardware guy who could code his way out of a paper bag. If hardware guys succeed in developing software with LLM coding, then it's time to abandon ship (reaches for life preserver pension).

ProllyInfamous · 2025-12-11T19:37:36 1765481856

I'm'bout'ta flash your PLC Ladder Logic firmwares, friend.

j/k don't worry I'm an idiot — but somebody else WILL.

mrwrong · 2025-12-11T14:55:29 1765464929

what really comes through in this description is a fear of judgement from other people, which I think is extremely relatable for anyone who's ever posted a question on stack overflow. I don't think it's a coincidence that the popularity of these tools is coinciding with a general atmosphere of low trust and social cohesion in the US and other societies this last decade

ProllyInfamous · 2025-12-11T19:35:13 1765481713

On her deathbed, years ago, my beloved mother lamented that she often felt mentally bullied by her three brilliant sons [0], even decades into our adulthoods; embarassed, she would censor her own knowledge-seeking from the people she trusted most [2].

She didn't live long enough to use ChatGPT [1] (she would have been flabbergasted at its ability to understand people/situations), but even with her "normal" intelligence she would have been a master to its perceptions/trainings.

[0] "Beyond just teasing."

[1] We did briefly wordplay with GPT-2 right before she died via thisworddoesnotexist.com exchanges, but nothing conversive.

[2] Relavent example, to the best of my understanding of hers: I would never ask my brilliant engineer programmer hardwarebro for coding help on any personal project, never. Just as I don't ask lawyerbro for personal legal advice.

----

About a year later (~2023), my dentist friend experienced a sudden life change (wife sick @35); in his grieving/soul-seeking, I recommended that he share some of his mental chaos with an LLM, even just if to role-play as his sick family member. Dr. Friend later thanked me for recommending the resource — particularly "the entire lack of any judgments" — and shared his own brilliant discoveries using creative prompt structuring.

----

Particularly as a big dude, it's nice to not always have to be the tough guy, to even admit weakness. Unfortunately I think the overall societal benefits of generative AI are going to increase anti-social behaviour, but it's nice to have a friendly apprentice that knows something about almost everything... any time... any reason.

sbuttgereit · 2025-12-11T01:07:39 1765415259

I think your experience matches well with mine. There are certain workloads and use cases where these tools really do well and legitimately save time; these tend to be more concise tasks and well defined with good context from which to draw from. The wrong tasking and the results can be pretty bad and a time sink.

I think the difficulty is exercising the judgement to know where that productive boundary sits. That's more difficult than it sounds because we're not use to adjudicating machine reasoning which can appear human-like ... So we tend to treat it like a human which is, of course, an error.

TheOtherHobbes · 2025-12-11T12:04:49 1765454689

I find ChatGPT excellent for writing scripts in obscure scripting languages - AppleScript, Adobe Cloud products, IntelliJ plugin development, LibreOffice, and others.

All of these have a non-trivial learning curve and/or poor and patchy docs.

I could master all of these the hard way, but it would be a huge and not very productive time sink. It's much easier to tell a machine what I want and iterate with error reports if it doesn't solve my problem immediately.

So is this AGI? It's not self-training. But it is smart enough to search docs and examples and pull them together into code that solves a problem. It clearly "knows" far more than I do in this particular domain, and works much faster.

So I am very clearly getting real value from it. And there's a multiplier effect, because it's now possible to imagine automating processes that weren't possible before, and glue together custom franken-workflows that link supposedly incompatible systems and save huge amounts of time.

returnInfinity · 2025-12-11T05:15:41 1765430141

My thoughts as well, good at somethings and terrible for somethings and you will lose time.

Somethings are best written by yourself.

And this is with the mighty claude opus 4.5

blitzar · 2025-12-11T10:27:36 1765448856

The CEO obviously wants one of those trophies that chatgpt gives out.

kscarlet · 2025-12-10T19:20:54 1765394454

The line right after this is much worse:

> Coding performed by AI is at a world-class level, something that wasn’t so just a year ago.

Wow, finance people certainly don't understand programming.

mcv · 2025-12-10T19:55:27 1765396527

World class? Then what am I? I frequently work with Copilot and Claude Sonnet, and it can be useful, but trusting it to write code for anything moderately complicated is a bad idea. I am impressed by its ability to generate and analyse code, but its code almost never works the first time, unless it's trivial boilerplate stuff, and its analysis is wrong half the time.

It's very useful if you have the knowledge and experience to tell when it's wrong. That is the absolutely vital skill to work with these systems. In the right circumstances, they can work miracles in a very short time. But if they're wrong, they can easily waste hours or more following the wrong track.

It's fast, it's very well-read, and it's sometimes correct. That's my analysis of it.

malfist · 2025-12-10T20:31:14 1765398674

Is this why AI is telling us our every idea is brilliant and great? Because their code doesn't stand up to what we can do?

AmericanOP · 2025-12-11T01:08:09 1765415289

Whichever PM sold glazing as a core feature should be ejected into space.

RHSman2 · 2025-12-11T15:18:27 1765466307

Because people who can’t code but now can have zero understanding of the ‘path to production quality code’

Of course it is mind blowing for them.

formerly_proven · 2025-12-10T20:31:03 1765398663

Copilot is easily the worst (and probably slowest) coding agent. SOTA and Copilot don't even inhabit similar planes of existence.

RobinL · 2025-12-11T07:22:25 1765437745

I've found Opus 4.5 in copilot to be very impressive. Better than codex CLI in my experience. I agree Copilot definitely used to be absolutely awful.

whimsicalism · 2025-12-11T16:06:51 1765469211

cursor is better than both, i wish this weren’t the case tbph

skydhash · 2025-12-11T01:09:04 1765415344

> I frequently work with Copilot and Claude Sonnet, and it can be useful, but trusting it to write code for anything moderately complicated is a bad idea

This sentence and the rest of the post reads like an horoscope advice. Like "It can be good if you use it well, it may be bad if you don't". It's pretty much the same as saying a coin may land on head or on tail.

hatthew · 2025-12-11T02:39:14 1765420754

saying "a coin may land on head or on tail" is useful when other people are saying "we will soon have coins that always land on heads"

bdangubic · 2025-12-11T12:25:15 1765455915

this is doable, you just have to rig the coin

selectodude · 2025-12-10T20:10:45 1765397445

They don’t. I’ve gone from rickety and slow excel sheets and maybe some python functions to automate small things that I can figure out to building out entire data pipelines. It’s incredible how much more efficient we’ve gotten.

sshadmand · 2025-12-11T18:01:22 1765476082

Finance people are funny. They are so wrong when you hear their logic and references, but I also realized it doesn't matter. It is trends they try to predict, fuzzy directional signals, not facts of the moment.

2025-12-10T20:01:36 1765396896

[deleted]

n8cpdx · 2025-12-10T20:23:44 1765398224

> Including how it looks at the surrounding code and patterns.

Citation needed. Even with specific examples, “follow the patterns from the existing tests”, etc copilot (gpt 5) still insists on generating tests using the wrong methods (“describe” and “it” in a codebase that uses “suite” and “test”).

An intern, even an intern with a severe cognitive disability, would not be so bad at pattern following.

formerly_proven · 2025-12-10T20:45:01 1765399501

Do you think smart companies seeking to leverage AI effectively in their engineering orgs are using the 20$ slopify subscription from Microsoft?

You get what you pay for.

n8cpdx · 2025-12-10T23:06:25 1765407985

Every time a new model or tool comes out, the AI boosters love to say that n-1 was garbage and finally AI vibecoding is the real deal and it will make you 10x more productive.

Except six months ago n-1 was n and the boosters were busy ruining their credibility saying that their garbage tier AI was world class and making them 10x more productive.

Today’s leading world-class agentic model is tomorrow’s horrible garbage tier slop generator that was patently never good enough to be taken seriously.

This has been going on for years, the pattern is obvious and undeniable.

clickety_clack · 2025-12-10T20:38:59 1765399139

Ask ChatGPT “is AI programming world class?”

venturecruelty · 2025-12-11T02:43:09 1765420989

Of course not, why would they? They understand making money, and what makes money right now? What would be antithetical to making money? Why might we be doing one thing and not another? The lines are bright and red and flashing.

throwaway2037 · 2025-12-11T01:56:09 1765418169

I completely agree. This guy is way outside his area of expertise. For those unaware, Howard Marks is a legendary investment manager with a decades-long impressive track record. Additionally, these "insights" letters are also legendary in the money management business. Personally, I would say his wisdom is one notch below Warren Buffett. I am sure he is regularly asked (badgered?) by investors what he thinks about the current state and future of AI (LLMs) and how it will impact his investment portfolio. The audience of this letter is investors (real and potential), as well as other investment managers.

throwaway2037 · 2025-12-11T01:59:57 1765418397

Follow-up: This letter feels like a "jump the shark" moment.

Ref: https://blog.codinghorror.com/has-joel-spolsky-jumped-the-sh...

dmurvihill · 2025-12-11T18:37:58 1765478278

It's funny, because this decision by Joel in 2006 prefigures TypeScript six years later. VBA was a terrible bet for a target language and Joel was crazy to think his little company could sustain a language ecosystem, but Microsoft had the same idea and nailed it.

urxvtcd · 2025-12-11T08:43:42 1765442622

First time reading this. It's actually funny how disliking exceptions seemed crazy then but it's pretty normal now. And writing a new programming language for a certain product, well, it could turn out to be pretty cool, right? It's how we get all those Elms and so on.

alterom · 2025-12-11T13:19:26 1765459166

That's how we got Rust.

whoknowsidont · 2025-12-10T19:02:42 1765393362

It's not. And if your team is doing this you're not "advanced."

Lots of people are outing themselves these days about the complexity of their jobs, or lack thereof.

Which is great! But it's not a +1 for AI, it's a -1 for them.

NewsaHackO · 2025-12-10T19:38:11 1765395491

Part of the issue is that I think you are underestimating the number of people not doing "advanced" programming. If it's around ~80-90%, then that's a lot of +1s for AI

friendzis · 2025-12-11T07:54:00 1765439640

Wrong. 80% of code not being advanced is quite strictly not the same as 80% people not doing advanced programming.

NewsaHackO · 2025-12-11T15:03:27 1765465407

I completely understand the difference, and I am standing by my statement that 80-90% of programmers are not doing advanced programming at all.

whoknowsidont · 2025-12-10T20:56:28 1765400188

Why do you feel like I'm underestimating the # of people not doing advanced programming?

NewsaHackO · 2025-12-10T21:07:47 1765400867

Theoretically, if AI can do 80-90% of programming jobs (the ones not in the "advanced" group), that would be an unequivocal +1 for AI.

whoknowsidont · 2025-12-10T22:27:00 1765405620

I think you're crossing some threads here.

NewsaHackO · 2025-12-10T22:33:33 1765406013

"It's not. And if your team is doing this you're not "advanced." Lots of people are outing themselves these days about the complexity of their jobs, or lack thereof.

Which is great! But it's not a +1 for AI, it's a -1 for them.

" Is you, right?

whoknowsidont · 2025-12-10T22:54:09 1765407249

Yes. You can see my name on the post.

NewsaHackO · 2025-12-10T23:14:48 1765408488

OK, just making sure. Have a blessed day :)

9rx · 2025-12-10T23:50:33 1765410633

It's true for me. I type in what I want and then the AI system (compiler) generates the code.

Doesn't everyone work that way?

zahlman · 2025-12-11T00:07:19 1765411639

Describing a compiler as "AI" is certainly a take.

conradev · 2025-12-11T06:53:17 1765435997

I used to hand roll the assembly, but now I delegate that work to my agent, clang. I occasionally override clang or give it hints, but it usually gets it right most of the time.

clang doesn't "understand" the hints because it doesn't "understand" anything, but it knows what to do with them! Just like codex.

lm28469 · 2025-12-11T08:10:37 1765440637

Given an input clang will always give the same output, not quite the same for llms. Also nobody ever claimed compilers were intelligent or that they "understood" things

conradev · 2025-12-11T19:25:45 1765481145

The determinism depends on the architecture of the model!

Symbolica is working on more deterministic/quicker models: https://www.symbolica.ai

I also wish it was that easy, but compiler determinism is hard, too: https://reproducible-builds.org

9rx · 2025-12-11T13:14:01 1765458841

An LLM will also give the same output for the same input when the temperature is zero[1]. It only becomes non-deterministic if you choose for it to be. Which is the same for a C compiler. You can choose to add as many random conditionals as you so please.

But there is nothing about a compiler that implies determinism. A compiler is defined by function (taking input on how you want something to work and outputting code), not design. Implementation details are irrelevant. If you use a neural network to compile C source into machine code instead of more traditional approaches, it most definitely remains a compiler. The function is unchanged.

[1] "Faulty" hardware found in the real world can sometimes break this assumption. But a C compiler running on faulty hardware can change the assumption too.

whimsicalism · 2025-12-11T16:57:58 1765472278

currently LLMs from majorvproviders are not deterministic with temp=0, there are startups focusing on this issue (among others) https://thinkingmachines.ai/blog/defeating-nondeterminism-in...

lm28469 · 2025-12-11T13:22:02 1765459322

You can test that yourself in 5 seconds and see that even at a temp of 0 you never get the same output

9rx · 2025-12-11T14:06:00 1765461960

Works perfectly fine for me.

Did you do that stupid HN thing where you failed to read the entire comment and then went off to try it on faulty hardware?

lm28469 · 2025-12-11T14:45:03 1765464303

No I did that HN thing where I went to an LLM, set temp to 0, pasted your comments in and got widely different outputs every single time I did so

9rx · 2025-12-11T14:53:30 1765464810

"Went" is a curious turn of phrase, but I take it to mean that you used an LLM on someone else's hardware of unknown origin? How are you ensuring that said hardware isn't faulty? It is a known condition. After all, I already warned you of it.

Now try it on deterministic hardware.

NewsaHackO · 2025-12-11T15:07:26 1765465646

Was the seed set to the same value everytime?

whimsicalism · 2025-12-11T16:59:02 1765472342

https://thinkingmachines.ai/blog/defeating-nondeterminism-in...

bewo001 · 2025-12-11T10:56:23 1765450583

Hm, some things compilers do during optimization would have been labelled AI during the last AI bubble.

agumonkey · 2025-12-11T00:19:36 1765412376

it's something that crossed my mind too honestly. natural-language-to-code translation.

skydhash · 2025-12-11T01:11:23 1765415483

You can also do search query to code translation by using GitHub or StackOverflow.

parliament32 · 2025-12-11T01:15:25 1765415725

Compilers are probably closer to "intelligence" than LLMs.

rfrey · 2025-12-11T02:47:52 1765421272

I understand what you're getting at, but compilers are deterministic. AI isn't just another tool, or just a higher level of program specification.

7952 · 2025-12-11T09:05:04 1765443904

This is all a bit above my head. But the effects a compiler has on the computer are certainly not deterministic. It might do what you want or it might hit a weird driver bug or set off a false positive in some security software. And the more complex stacks get the he more this happens.

dust42 · 2025-12-11T08:39:07 1765442347

And so is "AI". Unless you add randomness AKA raise the temperature.

rfrey · 2025-12-11T16:29:39 1765470579

If you and I put the same input into GCC, we will get the same output (counting flags and config as input). The same is not true for an LLM.

9rx · 2025-12-11T18:15:52 1765476952

> The same is not true for an LLM.

Incorrect. LLMs are designed to be deterministic (when temperature=0). Only if you choose for them to be non-deterministic are they so. Which is no different in the case of GCC. You can add all kinds of random conditionals if you had some reason to want to make it non-deterministic. You never would, but you could.

There are some known flaws in GPUs that can break that assumption in the real world, but in theory (and where you have working, deterministic hardware) LLMs are absolutely deterministic. GCC also stops being deterministic when the hardware breaks down. A cosmic bit flip is all it takes to completely defy your assertion.

9rx · 2025-12-11T03:13:13 1765422793

> but compilers are deterministic.

Are they, though? Obviously they are in some cases, but it has always been held that a natural language compiler is theoretically possible. But a natural language compiler cannot be deterministic, fundamentally. It is quite apparent that determinism is not what makes a compiler.

In fact, the dictionary defines compiler as: "a program that converts instructions into a machine-code or lower-level form so that they can be read and executed by a computer." Most everyone agrees that it is about function, not design.

> AI isn't just another tool

AI is not a tool, that is true. I don't know, maybe you stopped reading too soon, but it said "AI systems". Nobody was ever talking about AI. If you want to participate in the discussions actually taking place, not just the one you imagined in your head, what kind of system isn't just another tool?

rfrey · 2025-12-11T16:25:44 1765470344

> Nobody was ever talking about AI. If you want to participate in the discussions actually taking place, not just the one you imagined in your head

Wow. No, I actually don't want to participate in a discussion where the default is random hostility and immediate personal attack. Sheesh.

9rx · 2025-12-11T17:14:20 1765473260

You don't want to participate, so you continue to participate? Uhh... Thanks for clearing up that you are not coming here from a place of logic, just bad faith emotionalism. We almost were starting to think you had something of value to add.

XenophileJKO · 2025-12-11T01:42:23 1765417343

I beginning to think most "advanced" programmers are just poor communicators.

It really comes mostly down to being able to concisely and eloquently define what you want done. It also is important to understand what the default tendencies and biases of the model are so you know where to lean in a little. Occasionally you need to provide reference material.

The capabilities have grown dramatically in the last 6 months.

I have an advantage because I have been building LLM powered products so I know mechanically what they are and are not good with. For example.. want it to wire up an API with 250+ endpoints with a harness? You better create (or have it create) a way to cluster and audit coverage.

Generally the failures I hear often with "advanced" programmers are things like algorithmic complexity, concurrency, etc.. and these models can do this stuff given the right motivation/context. You just need to understand what "assumptions" the model it making and know when you need to be explicit.

Actually one thing most people don't understand is they try to say "Do (A), Don't do (B)", etc. Defining granular behavior which is fundamentally a brittle way to interact with the models.

Far more effective is defining the persona and motivation for the agent. This creates the baseline behavior profile for the model in that context.

Not "don't make race conditions", more like "You value and appreciate elegant concurrent code."

tjr · 2025-12-11T05:39:51 1765431591

Some of the best programmers I know are very good at writing and/or speaking and teaching. I struggle to believe that “advanced programmers” are poor communicators.

XenophileJKO · 2025-12-11T05:59:10 1765432750

Genuine reflection question, are these excellent communicators good at using llms to write code?

My supposition was: Many programmers that say their programming domain was too advanced and llms didn't work for their kind of code are simply bad at describing concisely what is required.

tjr · 2025-12-11T06:12:35 1765433555

Most good programmers that I know personally work, as do I, in aerospace, where LLMs have not been adopted as quickly as some other fields, so I honestly couldn’t say.

interstice · 2025-12-11T04:09:17 1765426157

> I beginning to think most "advanced" programmers are just poor communicators.

This is a interesting take take considering that programmers are experts in communicating what someone has asked for (however vaguely) into code.

I think you're referring to is the transition from 'write code that does X' which is very concrete to 'trick an AI into writing the code I would have written, only faster', which feels like work that's somewhere between an art form and asking a magic box to fix things over and over again until it stops being broken (in obvious ways, at least).

Understandably people that prefer engineered solutions do not like the idea of working this way very much.

XenophileJKO · 2025-12-11T05:56:28 1765432588

When you oversee a team technically as a tech lead or an architect, you need communication skills.

1. Basing on how the engineer just responded to my comment, what is the understanding gap?

2. How do I describe what I want in a concise and intuitive way?

3. How do I tell an engineer what is important in this system and what are the constraints?

4. What assumptions will an engineer likely make that are will cause me to have to make a lot of corrections?

Etc.. this is all human to human.

These skills are all transferrable to working with an LLM.

So I guess if you are not used to technical leadership, you may not have used those skills as much.

interstice · 2025-12-11T18:38:23 1765478303

The issue here is that LLM’s are not human and so having a human mental model of how to communicate doesn’t really work. If I communicate to my engineer to do X I know all kinds of things about them, like their coding style, strengths and weaknesses, and that they have some familiarity with the code they are working with and won’t bring the entirety of stack overflow answers to the context we are working in. LLM’s are nothing like this even when working with large amounts of context, they fail in extremely unpredictable ways from one prompt to the next. If you disagree I’d be interested in what stack or prompting you are using that avoids this.

mjr00 · 2025-12-11T02:56:58 1765421818

> It really comes mostly down to being able to concisely and eloquently define what you want done.

We had a method for this before LLMs; it was called "Haskell".

XenophileJKO · 2025-12-11T01:48:26 1765417706

One added note. This rigidness of instruction is a real problem that the models themselves will magnify and you need to be aware of. For example if you ask a Claude family of models to write a sub-agent for you in Claude Code. 99% of the time it will define a rigid process with steps and conditions instead of creating a persona with motivations (and if you need it suggested courses of action).

projektfu · 2025-12-11T02:02:46 1765418566

I have heard many software developers confidently tell me "pilots don't really fly the planes anymore" and, well, that's patently false but also the jetliners autopilots do handle much of the busy work during cruise, and sometimes during climb-out and approach. And they can sometimes land themselves, but not efficiently enough for a busy airport.

coffeebeqn · 2025-12-11T10:41:09 1765449669

Autopilot based on a LLM would guarantee I’d never fly again

its_ethan · 2025-12-10T23:47:19 1765410439

Is it not sort of implied by the stats later: "Revenues from Claude Code, a program for coding that Anthropic introduced earlier this year, already are said to be running at an annual rate of $1 billion. Revenues for the other leader, Cursor, were $1 million in 2023 and $100 million in 2024, and they, too, are expected to reach $1 billion this year."

Surely that revenue is coming from people using the services to generate code? Right?

Windchaser · 2025-12-11T00:12:45 1765411965

A back-of-the-napkin estimate of software developer salaries:

There are some ~1.5 million software developers in the US per BLS data, or ~4 million if using a broader definition Median salary is $120-140k. Let's say $120k to be conservative.

This puts total software developer salaries at $180 billion.

So, that puts $1 billion in Claude revenue in perspective; only about 0.5% of software developer salaries. Even if it only improved productivity 5%, it'd be paying for itself handily - which means we can't take the $1 billion in revenues to indicate that it's providing a big boost in productivity.

dmurvihill · 2025-12-11T07:43:29 1765439009

If it makes a 5% improvement, that would make it a $9 billion dollar per year industry. What’s our projected capex for AI projects next five years again?

lovich · 2025-12-11T05:05:49 1765429549

You are ignoring costs

The AI companies are currently lighting dollars on fire if you pay them a few pennies to do so.

The AI models are actually accomplishing something, but the unit economics aren't there to support it being profitable

browningstreet · 2025-12-11T00:00:31 1765411231

Generating code isn’t the same as running it, running it on production, and living with it over time.

In time I’m sure it will, but it’s still early days, land grab time.

halfcat · 2025-12-11T00:20:13 1765412413

> Surely that revenue is coming from people using the services to generate code? Right?

Yes. And all code is tech debt. Now generated faster than ever.

jv22222 · 2025-12-11T07:07:06 1765436826

Hmm maybe that’s a bit reductive? I’ve used claud to help with some really great refactoring sessions tbh.

brulard · 2025-12-10T19:27:02 1765394822

I'm on a team like that and I see it happening in more and more companies around. Maybe "many" does a heavy lifting in the quoted text, but it is definitely happening.

loloquwowndueo · 2025-12-10T18:27:05 1765391225

Probably their googly-eyed vibe coder friend told them this and they just parroted it.

RajT88 · 2025-12-10T18:49:21 1765392561

Right. The author is non-technical and said so up front.

interstice · 2025-12-10T18:43:43 1765392223

If true I’d like to know who is doing this so I can have exactly nothing to do with them.

20after4 · 2025-12-10T20:57:05 1765400225

I've had claude code compose complex AWS infrastructure (using pulumi IAC) that mostly works from a one-shot prompt.

no_wizard · 2025-12-11T01:25:32 1765416332

Here's the lede they buried:

>The key is to not be one of the investors whose wealth is destroyed in the process of bringing on progress.

They are a VC group. Financial folks. They are working largely with other people's money. They simply need not hold the bag to be successful.

Of course they don't care if its a bubble or not, at the end of the day, they only have to make sure they aren't holding the bag when it all implodes.

venturecruelty · 2025-12-11T02:44:34 1765421074

They have "capital" in their domain name. Of course they're going to be, well... on the side of capital. This shouldn't be hotly debated... "Mining company says mine they own is full of ore and totally not out of ore."

PurpleRamen · 2025-12-11T10:10:40 1765447840

Yes and no. There is the infamous quote of Microsoft, about 30%(?) of their code being written by AI now. And technically, it's probably not that such a wild claim in certain areas. AI is very good at barfing up common popular patterns, and companies have a huge amount of patternized software, like UIs, tests, documentation or marketing-fluff. So it's quite easy to "outsource" such grunt-work if AI has the necessary level.

But to say that they don't write any code at all, it's really stretched. Maybe I'm not good enough at AI-assisted and vibe coding, but code-quality always seems to drop down really hard the moment one steps a bit outside the common patterns.

grumbelbart2 · 2025-12-11T10:47:01 1765450021

I found LLLMs to be very good of writing (unit) tests for my code, for example. They just don't get tired iterating over all corner cases. Those tests easily, in LoC, dwarf the actual implementation. Not sure if that would count towards the 30%, for example.

whimsicalism · 2025-12-11T06:15:52 1765433752

Wow, reading these comments and I feel like I've entered a parallel reality. My job involves implementing research ML and I use it literally all the time, very fascinating to see how many have such strong negative reactions. As long as you are good at reviewing code, spec-ing carefully, and make atomic changes - why would you not be using this basically all the time?

kkapelon · 2025-12-11T18:54:33 1765479273

> As long as you are good at reviewing code, spec-ing carefully, and make atomic changes - why would you not be using this basically all the time?

This implies that you are an expert/seasoned programmer. And not everybody is an expert on this industry (especially the reviewing code part).

whimsicalism · 2025-12-11T19:12:44 1765480364

I thought this was a forum for seasoned engineers? But yes, I agree that this widens the skill gap and makes the on-ramp steeper.

kkapelon · 2025-12-11T19:56:09 1765482969

What happens if you work in a team?

If a team has one senior/seasoned person and 3 juniors will adopting ai be a total positive move? Or the senior person will just become the bottleneck for the junior devs?

qsort · 2025-12-11T09:29:00 1765445340

It's one of the failure modes of online forums. Everyone piles on and you get an unrealistic opinion sample. I'm not exactly trying to shove AI into everything, I'm weary of over hyping and mostly conservative in my technology choices. Still, I get a lot out of LLMs and agents for coding tasks.

whimsicalism · 2025-12-11T15:26:15 1765466775

i have trouble understanding how a forum of supposedly serious coders can be so detached from reality, but I do know that this is one of HN’s pathologies

qsort · 2025-12-11T15:32:46 1765467166

I think it's more of a thread-bound dynamic rather than HN as a whole. If the thread starts positive you get "AGI tomorrow", if the thread starts negative you get "stochastic parrot".

But I see what you mean, there have been at least a few insane comment sections for sure.

LtWorf · 2025-12-11T07:06:12 1765436772

Because carefully spec-ing to the level an llm needs, and ultra carefully checking the output is easily slower and more tiring than just doing it yourself.

Kinda like having a child "help" you cook basically.

But for the child you do it because they actually learn. llms do not learn in that sense.

whimsicalism · 2025-12-11T15:23:28 1765466608

not at all true for the latest generation of models in my experience. they are overly verbose but except for the simplest simplest changes it is faster to ask first

LtWorf · 2025-12-11T15:46:16 1765467976

For the simplest changes you have to first review the code fully, ask for the change, do a new full review and so on.

whimsicalism · 2025-12-11T15:54:47 1765468487

no, you just have to ask for the change - wait ~minute, review. and if it’s a small change, review goes fast. typically i’ll have a zellij/tmux with lazygit one pane, a cli agent (cursor-agent or codex) in the other, and a pop up vim pane. i can see the changes in lazygit as they’re made and review immediately and commit

agumonkey · 2025-12-11T00:17:59 1765412279

Seen it first hand. scan your codebase, plan extension or rewrite or both, iterate with some hand holding and off you go. And it was not even an advanced developer driving the feature (which is concerning).

Illniyar · 2025-12-11T02:44:12 1765421052

I think he might be misrepresenting it a bit, but from what I've seen every software company I know of heavily uses agentic AI to create code (except some highly regulated industries).

It has become a standard tool, in the same way that most developers code with an IDE, most developers use agentic AI to start a task (if not to finish it).

stretchwithme · 2025-12-11T02:58:00 1765421880

It's often true. But not when it's easier to code than to explain.

qsort · 2025-12-10T20:23:19 1765398199

Everyone is doing this extreme pearl clutching around the specific wording. Yeah, it's not 100% accurate for many reasons, but the broader point was about employment effects, it doesn't need to completely replace every single developer to have a sizable impact. Sure, it's not there yet and it's not particularly close, but can you be certain that it will never be there?

Error bars, folks, use them.

thenaturalist · 2025-12-11T08:39:09 1765442349

No, but there are huuuuuge incentives by people publishing such statements.

johnfn · 2025-12-10T19:14:41 1765394081

I only write around 5% of the code I ship, maybe less. For some reason when I make this statement a lot of people sweep in to tell me I am an idiot or lying, but I really have no reason to lie (and I don't think I'm an idiot!). I have 10+ years of experience as an SWE, I work at a Series C startup in SF, and we do XXMM ARR. I do thoroughly audit all the code that AI writes, and often go through multiple iterations, so it's a bit of a more complex picture, but if you were to simply say "a developer is not writing the code", it would be an accurate statement.

Though I do think "advanced software team" is kind of an absurd phrase, and I don't think there is any correlation with how "advanced" the software you build is and how much you need AI. In fact, there's probably an anti-correlation: I think that I get such great use out of AI primarily because we don't need to write particularly difficult code, but we do need to write a lot of it. I spend a lot of time in React, which AI is very well-suited to.

EDIT: I'd love to hear from people who disagree with me or think I am off-base somehow about which particular part of my comment (or follow-up comment https://news.ycombinator.com/item?id=46222640) seems wrong. I'm particularly curious why when I say I use Rust and code faster everyone is fine with that, but saying that I use AI and code faster is an extremely contentious statement.

MontyCarloHall · 2025-12-10T19:25:30 1765394730

>I only write around 5% of the code I ship, maybe less.

>I do thoroughly audit all the code that AI writes, and often go through multiple iterations

Does this actually save you time versus writing most of the code yourself? In general, it's a lot harder to read and grok code than to write it [0, 1, 2, 3]. For me, one of the biggest skills for using AI to efficiently write code is a) chunking the task into increments that are both small enough for me to easily grok the AI-generated code and also aligned enough to the AI's training data for its output to be ~100% correct, b) correctly predicting ahead of time whether reviewing/correcting the output for each increment will take longer than just doing it myself, and c) ensuring that the overhead of a) and b) doesn't exceed just doing it myself.

[0] https://mattrickard.com/its-hard-to-read-code-than-write-it

[1] https://www.joelonsoftware.com/2000/04/06/things-you-should-...

[2] https://trishagee.com/presentations/reading_code/

[3] https://idiallo.com/blog/writing-code-is-easy-reading-is-har...

johnfn · 2025-12-10T19:44:26 1765395866

Yes, I save an incredible amount of time. I suspect I’m likely 5-10x more productive, though it depends exactly what I’m working on. Most of the issues that you cite can be solved, though it requires you to rewire the programming part of your brain to work with this new paradigm.

To be honest, I don’t really have a problem with chunking my tasks. The reason I don’t is because I don’t really think about it that way. I care a lot more about chunks and AI could reasonably validate. Instead of thinking “what’s the biggest chunk I could reasonably ask AI to solve” I think “what’s the biggest piece I could ask an AI to do that I can write a script to easily validate once it’s done?” Allowing the AI to validate its own work means you never have to worry about chunking again. (OK, that's a slight hyperbole, but the validation is most of my concern, and a secondary concern is that I try not to let it go for more than 1000 lines.)

For instance, take the example of an AI rewriting an API call to support a new db library you are migrating to. In this case, it’s easy to write a test case for the AI. Just run a bunch of cURLs on the existing endpoint that exercise the existing behavior (surely you already have these because you’re working in a code base that’s well tested, right? right?!?), and then make a script that verifies that the result of those cURLs has not changed. Now, instruct the AI to ensure it runs that script and doesn’t stop until the results are character for character identical. That will almost always get you something working.

Obviously the tactics change based on what you are working on. In frontend code, for example, I use a lot of Playwright. You get the idea.

As for code legibility, I tend to solve that by telling the AI to focus particularly on clean interfaces, and being OK with the internals of those interfaces be vibecoded and a little messy, so long as the external interface is crisp and well-tested. This is another very long discussion, and for the non-vibe-code-pilled (sorry), it probably sounds insane, and I feel it's easy to lose one's audience on such a polarizing topic, so I'll keep it brief. In short, one real key thing to understand about AI is that it makes the cost of writing unit tests and e2e tests drop significantly, and I find this (along with remaining disciplined and having crisp interfaces) to be an excellent tool in the fight against the increased code complexity that AI tools bring. So, in short, I deal with legibility by having a few really really clean interfaces/APIs that are extremely readable, and then testing them like crazy.

EDIT

There is a dead comment that I can't respond to that claims that I am not a reliable narrator because I have no A/B test. Behold, though: I am the AI-hater's nightmare, because I do have a good A/B test! I have a website that sees a decent amount of traffic (https://chipscompo.com/). Over the last few years, I have tried a few times to modernize and redesign the website, but these attempts have always failed because the website is pretty big (~50k loc) and I haven't been able to fit it in a single week of PTO.

This Thanksgiving, I took another crack at it with Claude Code, and not only did I finish an entire redesign (basically touched every line of frontend code), but I also got in a bunch of other new features, too, like a forgot password feature, and a suite of moderation tools. I then IaC'd the whole thing with Terraform, something I only dreamed about doing before AI! Then I bumped React a few majors versions, bumped TS about 10 years, etc, all with the help of AI. The new site is live and everyone seems to like it (well, they haven't left yet...).

If anything, this is actually an unfair comparison, because it was more work for the AI than it was for me when I tried a few years ago, because because my dependencies became more and more out of date as the years went on! This was actually a pain for AI, but I eventually managed to solve it.

no_wizard · 2025-12-11T01:33:09 1765416789

Use case mapping matters. I use AI tools at work (have for a few years now, first Copilot from GitHub, now I use Gemini and Claude tools primarily). When the use case maps well, it is great. You can typically assume anything with a large corpus of fairly standard problems will map well in a popular language. JavaScript, HTML, CSS, these have huge training datasets from open source alone.

The combination of which, deep training dataset + maps well to how AI "understands" code, it can be a real enabler. I've done it myself. All I've done with some projects is write tests, point Claude at the tests and ask it to write code till those tests pass, then audit said code, make adjustments as required, and ship.

That has worked well and sped up development of straightforward (sometimes I'd argue trivial) situations.

Where it falls down is complex problem sets, major refactors that cross cut multiple interdependent pieces of code, its less robust with less popular languages (we have a particular set of business logic in Rust due to its sensitive nature and need for speed, it does a not great job with that) and a host of other areas I have hit limitations with it.

Granted, I work in a fairly specialized way and deal with alot of business logic / rules rather than boiler plate CRUD, but I have hit walls on things like massive refactors in large codebases (50K is small to me, for reference)