I use agents at work and yeah they read api keys and the like. If my employer doesn’t care, I care even less. At home I don’t use agents yet, but if I have to use them I’ll probably do what I do to run untrusted code at home: use a VM
I think in the future (in 10 years?) we are going to see a lot of disposable/throwaway software. I don’t know, imagine this: I need to buy tickets for a concert. I ask my AI agent that I want tickets. The agent creates code on the fly and uses it to purchase my tickets. The code could be simple curl command, or a full app with nice ui/ux. As a user I don’t need to see the code.
If I want to buy more tickets the same day, the ai agent will likely reuse the same code. But if i buy tickets again in one year, the agent will likely rebuild the code to adjust to the new API version the ticket company now offers.
Seems wasteful but it’s more dynamic. Vendors only need to provide raw APIs and your agent can create the ui experience you want. In that regard nobody but the company that owns your agent can inject malware into the software you use. Some software will last more than others (e.g., the music player your agent provided won’t probably be rebuilt unless you want a new look and feel or extra functionality). I think we’ll adopt the “cattle, not pets” approach to software too.
I know people have done truly amazing things with AI lately, but I feel this in my bones. Almost every demo I see is like, uh, I don't need these extremely simple things in my life automated. I can just go to Delta and buy a plane ticket. I actually want to write my own email to my mom or wife. Of course a demo is just a demo, but also come on
It's easy to buy one plane ticket when a person has a specific plan -- to attend a meeting or a conference, or to match up with an airbnb timeslot or something.
It's harder to buy one plane ticket for the lowest cost amongst all the different ways that plane tickets can be bought, and harder yet to do so with a lack of specificity.
So, for instance: Maybe I don't have a firm plan. Maybe I'm very flexible.
Maybe all I want to do is say "Hey, bot. I want to go visit my friend in Florida sometime in the next couple of weeks and spend a few days there as inexpensively as I can. He's in Orlando. I can fly out of Detroit or Cleveland; all the same to me. If I drive to the airport myself, I'll need a place to keep my car at or near the airport. I also want to explore renting a car in Orlando. I pack light; personal bag only. Cattle class is OK, but I prefer a window seat. Present to me a list of the cheapest options, with itinerary."
That's all stuff that a human can sort out, but it takes time to manually fudge around dates and locations and deal with different systems and tabulate the results. And there's nuances that need covered, like parking at DTW is weird: It's all off-site, and it can be cheaper and better to rent a room for one night in a nearby hotel that includes long-term parking than to pay for parking by itself.
So the hypothetical bot does a bunch of API hits, applies its general knowledge of how things flow, and comes back with a list of verified-good options for me to review. And then I get to pick around that list, and ask questions, and mold it to best fit my ideal vision of an inexpensive trip to go spend time with a friend.
In English, and without ever dealing with any travel websites myself.
"Right. So I go to Detroit on Tuesday and check in at the hotel any time after noon, and take the free shuttle to the airport the next morning at around 0400 to the Evans terminal. Also, thanks for pointing out that this airport is like a ghost town until 0600 and I might want to bring a snack. Anyway, I get on the flight, land at Orlando, and they'll have a cheap car waiting for me at Avis. This will all cost me a total of $343, which sounds great. If that's all I need to know right now, then make it so. Pay for it and put it on my calendar."
(And yeah, this is a problem that I actually have from time to time. I'd love to have a bot that could just sort this stuff out with a few paragraphs.)
But who is really going to put together the infrastructure and harness to make all that work? My dad certainly isn't. My mother in law won't
What you describe will just end up a feature on Expedia. The highly technical builders of stuff that love to tinker vastly overestimate how much BS the general public will put up with
Indeed. I have zero desire to put such a thing together just for my own use.
I didn't address that concept at all above, but I think the notion of a million people each independently using the bot to write a million bespoke programs that each do the same things is...kind of a non-starter. It's something that can only happen in some weird reality where software isn't essentially free to copy, and where people are motivated neither by laziness, nor the size of their pocketbook.
If/when someone does put the work into getting it to happen, then I expect to find it on Github for people to lazily copy and use, or for them to make it available as a website or app for anybody to use (with even more laziness) -- and for them to monetize it.
I think it's a fallacy that if you make creating anything easier, more useful things will be created. In reality, you just end up with more useless things being created. Like with art, when it gets easier to create you don't end up with more good art. And with software - it's not like the quality of software has gone up as it's gotten easier to build, it's gotten much worse.
A related fallacy is that great things are easier to build when you can rapidly create stuff. That isn't really how great ideas are generated, it's not a slot machine where if you pull the lever 1000 times you generate a good idea and thus a successful piece of software can be made. This seems like a distinctly Silicon Valley, SFBA type mentality. Steve Jobs didn't invent the iPhone by creating 1000 different throwaway products to test the market. Etc etc.
> I think it's a fallacy that if you make creating anything easier, more useful things will be created. In reality, you just end up with more useless things being created.
Well, if you lower the competence bar required to do something, then more people of lower competence will do that thing.
Why would I do that if the gateway to the internet becomes these LLM interfaces? How is it not easier to ask or type 'buy me tickets for Les Mis'? In the ideal world it will just figure it out, or I frustratingly have to interact with a slightly different website to purchase tickets for each separate event I want to see.
One of the benefits that I see is as much as I love tech and writing software, I really really do not want to interface with a vast majority of the internet that has been designed to show the maximum amount of ads in the given ad space.
The internet sucks now, anything that gets me away from having ads shoved in my face constantly and surrounded by uncertainty that you could always be talking to a bot.
I'm sympathetic to this view too, but I don't think the solution is to have LLM's generate bespoke code to do it. We absolutely should be using them for more natural language interfaces tho.
Yeah, that can also work. But I don’t see the future of software is to keep building multimillion line of code systems in a semi manual way (with or without llms). I think we will reach a phase in which we’ll have to treat code as disposable. I don’t think we are there yet, though.
We probably need higher levels of abstraction, built upon more composable building blocks and more interplay between various systems. To me that requires less disposable code though.
Personally the experience getting tickets at the moment is horrible.
Endless queues, scalpers grabbing tickets within a second. Having to wait days/weeks periodically checking to see if a ticket is available.
The only platform I’m aware of that does guarantee a ticket can be purchased if available is Dice once you join a wait list. You get given a reasonable time to purchase it in too.
So I can see why people would prefer to defer this to an agent and not care about the implementation, I personally would. In the past I’ve been able to script notifications for it for myself and can see more people benefiting from it.
seriously. I don't even wanna compile code when binaries are available in a repository. the thought of everybody preferring vibe-coding something on their own over using something that's battle-tested and available to the collective is just crazy to me.
My point is: such apps wouldn’t need to exist if agents can provide in the future the same functionality for a fraction of the cost. Sure if ticketmaster is here to stay forever and keep their app up to date, we can keep using it. But what about new products? Would companies decide to build a single fixed app that all the users have to use, instead of, well, not building it? Sure the functionality would still need to be provided by the company (e.g., like offered in the form of an api), so they keep getting profit.
It’s like we usually say: companies should focus on their core value. And typically the ui/ux is not the core value of companies.
So we burn the planet up to deploy individually craft UIs on demand? I mean, I've read your comment three times, and I just don't see it. If we end up in that future, we're doomed.
> And typically the ui/ux is not the core value of companies
Huh? The user experience is basically ALL of the core product of a company.
If it's so easy for an AI to create ticket purchasing software that people can generate it themselves, then it's also true that the company can also use AI to generate that software for users who then don't need to generate it themselves. Obviously I think neither of these things are true or likely to happen.
> Huh? The user experience is basically ALL of the core product of a company.
Thats the case now, but I think it’s because there’s no other way around it nowadays. But if agents in the future provide a better or more natural ui/ux for many use cases, then companies core value will shift more into their inner core (which in software translates typically to the domain model)
> If it's so easy for an AI to create ticket purchasing software that people can generate it themselves, then it's also true that the company can also use AI to generate that software for users who then don't need to generate it themselves.
I think the generation of software per se will be transparent to the user. Users won’t think in terms of software created but wishes their agents make true.
Aren’t we kinda realising that disposable/throwaway stuff is, like, bad? Why do we have to go down this wasteful and hyper-consumptive route AGAIN. Can we try and see the patterns here and move forwards?
Agree in general. I don’t see how making an agent create software is more wasteful than making dozens of engineers create the same thing. The latter seems more wasteful.
We have compilers creating binaries every single day. We don’t say thats wasteful.
Well ticketmaster (for example) is used by millions of people. It seems to me like spinning up millions of LLMs to produce a million different apps is way more wasteful than having a dozen developers produce one efficient app that everyone can use?
What to use? A website where you can quickly buy the stuff you want? Or an LLM where you specify how to buy the the thing you want, wait a while, then actually do the buying, and praying in the meantime, it's not throwing your money away?
I don't know if this is the future or not, but it seems to serve no real purpose other than to enrich LLM company profits. There is real value in well designed code that has been battle tested and hardened over years of bugfixes and iteration. It's reliable, it's reusable, it's efficient and it's secure. The opposite of hastily written and poorly understood vibe code that may or may not even do what you want it to do, even while you think it's doing what you want it to do.
there is software and software. lots of enterprise software gets re-written every 2-5 years, some projects are in rubbish bin as soon as finished (if finished)
This is also where I think we end up. If the behavior of the system is specified well enough, then the code itself is cheap and throwaway. Why have a static system that is brittle to external changes when you can just reconstruct the system on the fly?
Might be quite awhile before you can do this with large systems but we already see this on smaller contextual scales such as Claude Code itself
The specification for most systems _is the code_. English cannot describe business rules as succinctly as code, and most business rules end up being implied from a spec rather than directly specified, at least in my experience.
The thought of converting an app back into a spec document or list of feature requests seems crazy to me.
Why would it be? If you can describe an approximation of a system and regenerate it to be, let’s say, 98% accurate in 1% of the time that it would take to generate it by hand (and that’s being generous, it’s probably more like 0.1% in today’s day and age and that decimal is only moving left) aren’t there a giant set of use cases where the approximation of the system is totally fine? People will always bring up “but what about planes and cars and medicine and critical life or death systems”. Yeah sure, but a vast majority of the systems an end user interacts with every day do not have that level of risk tolerance
You are just validating the point that code is spec.
For your proposed system to work one must have a deterministic way of sending said spec to a system(Compiler?) and getting the same output everytime.
Input/Output is just one thing, software does a lot of 'side effect' kind of work, and has security implications. You don't leave such things to luck. Things either work or don't.
Absolutely let’s not do away with the determinism entirely. But we can decouple generation of the code from its deterministic behavior. If you are adequately able to identify the boundaries of the system and run deterministic tests to validate those boundaries that should be sufficient enough. It’s not like human written code was often treated with even that much scrutiny in the before times. I would validate human written code in the exact same way.
>>If the behavior of the system is specified well enough, then the code itself is cheap and throwaway. Why have a static system that is brittle to external changes when you can just reconstruct the system on the fly?
You mean to say if the unit and functional tests cases are given the system must generate code for you? You might want to look at Prolog in that case.
>>Might be quite awhile before you can do this with large systems but we already see this on smaller contextual scales such as Claude Code itself
We have been able to do something like this reliably for like 50 years now.
> If the behavior of the system is specified well enough
Then it becomes code: a precise symbolic representation of a process that can be unambiguously interpreted by a computer. If there is ambiguity, then that will be unsuitable for many systems.
The word “many” is carrying a lot of weight here. Given the probabilistic nature of AI I suspect that systems that are 98% correct will be just fine for all but the “this plane will crash” or “this person will get cancer” use cases. If the recreation of the system failed in that 2% by slightly annoying some end user, who gives a shit? If the stakes are low, and indeed they are for a large majority of software use cases, probabilistic approximation of everyone’s open source will do just fine.
If you’re worried about them achieving the 98%, worry no more, due to the probabilistic nature it will eventually converge on 9’s. Just keep sending the system through the probabilistic machine until it reaches your desired level of nines
I love the aosa book. I learned a lot about systems design from it. Ironically, I usually fail the Systems Design interviews at fancy companies because they only ask about LBs, sharding, obscure data structures like CRDTs, and what not.
They are so wildly used, and the companies usually tell you if they use your prompts for training. But again, at the current use case your code gets lost in the noise even if they do use it. Anthropic says they don’t. Microsoft allows you to disable, same with ChatGPT. Unclear on Goole.
But if you are worried, you can use an inference only solution like Groq.
Please provide a PDF as well. I cannot read books in HTML format because I need to keep track of where i left. That means I either have to leave the browser tab open (but this is prone to accidentally closing it) or I need to bookmark every progress, which creates noise in my bookmarks. With a PDF I simply leave it, the reader remembers my last page. I also have a sense of progress with pdf.
Depends. If it’s with ICs, sure. But as soon as some manager or someone with a leadership position joins, then it’s just plain work. So I dislike chitchatting with such people.
Same! If I see the first people joining are managers or above, I just wait until I see other engineers join. I hate managers talk, I couldn’t care less about them.
HN is so depressing, but at the same time so Im addicted to it. It’s like tiktok but for people who enjoy plain text and hacking related stuff. When I first visited HN more than 10 years ago (without account) like, 90% of the content was exciting and you got to learn something. Nowadays it’s about 40-50%, and the rest is crap (including comments). I have been trying to leave HN, let’s see if I can do it in 2026.
particularly ironic comment from an HN/lobsters celebrity account lol
this website isn't turning into Reddit, this website has been a pretentious orange subreddit for well over a decade if not right from the get go and a link to this site's Reddiquette page (just as ignored as on any subreddit!) is evidence TO that effect, and not against it!
the fact that the link petuously denies reality notwithstanding!
I mean, I'm not saying I think it's some sort of bastion of intellectual superiority, just that "have people been saying this place has been going downhill for a long time" is true.
It's still really early 2000's! We have over 900 years left :)
---
On topic: discussions like these are as old as human discussion forums and communities. I think that the participants each grow and change on an individual level just as much as the community and platform does. I think humans have a hard time identifying how much of their feelings of nostalgia are based in reality.
Maybe the platform has not actually changed in the ways people fear, and instead, peoples' opinions on what is interesting, important, or valuable has changed?
Since this thread has been discussing politics-adjacent things, let's consider Senator John Fetterman from the United States. Mr. Fetterman is notably different today from when he first started his campaign, regarding what he believes is important and valuable. (Mr. Fetterman suffered a stroke, which is suspected to have brought about personality changes and shifts in political ideology.)
---
I think we, as individuals, should always be focusing our first line of questioning on how _we're_ changing, rather than trying to figure out how the world, or the zeitgeist, or Hacker News, etc. is changing.
Sometimes we outgrow things that we hold dear, and instead of accepting that it's not really the place for us anymore and moving on to a different environment, we try to shape our current environment around our new personality by instituting new rules or adding new features.
I don't get people who use "you say [thing] is getting worse but someone X years ago said the same!" as an argument that somehow proves [thing] isn't getting worse. Things can become progressively worse over long periods of time, it's not an instant change that can only happen once.
Another context where I often see this "argument" is major Windows versions. People rightfully say they want to stay on Windows 10 because 11 is objectively worse in many ways, and someone jumps in to say "you said the same about 7 to 10" as if it's some sort of gotcha. Both complaints can be right, each new version can be worse than the last.
Right now, we have at least one aspect in which HN has become objectively worse in the past years: AI-generated content. It didn't exist a decade ago, so good luck using that "argument" there. Thankfully, its prevalence is still nowhere near as bad as on Reddit (it's impossible to browse that site for 10 minutes without noticing bots posting blatant ChatGPT responses everywhere and getting hundreds of upvotes), but still.
I do feel like 40-50% signal ratio is still good compared to 90%
HN did give me some leads in the start of just cool things to follow and I have been able to make an understanding of what things interest me and what don't due to it. And this has also been the reason I read a lot of comments etc. and content here, maybe more than I should.
I don't know to me, building my own website and forum etc. are possible but they feel complicated and I still can't seem to get eye balls. On Hackernews Comments its easier personally to write something, get feedback on it, (improve?/learn?)
Of course if one wants to optimize for eyeballs, they can probably go for reddit or twitter maxxing or similar because cmon this is exactly the stuff the article is talking about from what I see.
Hackernews does indeed sit on the perfect spot. I feel like if you want more informationally dense topics, perhaps lobsters can be good for ya.
Their UX is not steamlined. They seem to also opt you in by default to every conceivable category of notifications. It feels like a clown website. If they fixed some of this it could genuinely be enjoyable though of course I get the point that it's employment networking as opposed to a social media 'connect with friends' site
Its alright, were not all like that. I found the site cute, at least there are people standing up to the bullshit. I have been blogging about it on my site to https://www.scottrlarson.com/publications/
If I build a web app i still need to pay for a domain, for a server for egress.
We are just renting. Wouldn’t be surprised if in the future this gets even more depressing
reply