If you watch how agents attempt a task, fail, try to figure out what went wrong, try again, repeat a couple more times, then finally succeed -- don't you see the similarity?
LLMs don't do this. They can't think. If you just use one for like five minutes it's obvious that just because the text on the screen says "Sorry, I made a mistake, there are actually 5 r's in strawberry", that doesn't mean there's any thought behind it.
I mean, you can literally watch their thought process. They try to figure out why something went wrong and then identify solutions, often in ways that require real deduction and creativity, and they have quite a high success rate.
If that's not thinking, then I don't know what is.
You just haven't added the right tools together with the right system/developer prompt. Add an `add_memory` and a `list_memory` tool (or automatically inject the right memories for the right prompts/LLM responses) and you have something that can learn.
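Something like this, roughly. This is just a minimal sketch assuming an OpenAI-style function-calling setup and a plain JSON file as the store; the tool names are the ones above, the file name and handlers are purely illustrative:

```python
import json
from pathlib import Path

MEMORY_FILE = Path("memories.json")  # hypothetical storage location

def add_memory(text: str) -> str:
    """Append one lesson/memory string to the store."""
    memories = json.loads(MEMORY_FILE.read_text()) if MEMORY_FILE.exists() else []
    memories.append(text)
    MEMORY_FILE.write_text(json.dumps(memories, indent=2))
    return "memory saved"

def list_memory() -> str:
    """Return all stored memories as one block of text for the model to read."""
    if not MEMORY_FILE.exists():
        return "no memories yet"
    return "\n".join(json.loads(MEMORY_FILE.read_text()))

# Tool schemas in the JSON-schema shape most chat/tool-calling APIs accept.
TOOLS = [
    {
        "type": "function",
        "function": {
            "name": "add_memory",
            "description": "Store a lesson learned for future tasks.",
            "parameters": {
                "type": "object",
                "properties": {"text": {"type": "string"}},
                "required": ["text"],
            },
        },
    },
    {
        "type": "function",
        "function": {
            "name": "list_memory",
            "description": "Retrieve previously stored lessons.",
            "parameters": {"type": "object", "properties": {}},
        },
    },
]
```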
You can also take it a step further and add automatic fine-tuning once you start gathering a ton of data, which will rewire the model somewhat.
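I.e., every time a mistake gets corrected, append the corrected exchange to a dataset you can later fine-tune on. A rough sketch, assuming the chat-style JSONL format most fine-tuning endpoints accept; the file path and helper name are made up:

```python
import json

DATASET = "finetune_data.jsonl"  # hypothetical output path

def record_correction(user_prompt: str, corrected_answer: str) -> None:
    """Append one (prompt, fixed answer) pair as a training example."""
    example = {
        "messages": [
            {"role": "user", "content": user_prompt},
            {"role": "assistant", "content": corrected_answer},
        ]
    }
    with open(DATASET, "a") as f:
        f.write(json.dumps(example) + "\n")
```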
I guess it depends on what you understand "learn" to mean.
But in my mind, if I tell the LLM to do something and it does it wrong, then I ask it to fix it, and if in the future I ask for the same thing and it avoids the mistake it made the first time, then I'd say it has learned to avoid that pitfall. I know very well it hasn't "learned" the way a human would; I just added the correction to the right place. But for all intents and purposes, it "learned" how to avoid the same mistake.
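"Added it to the right place" means something like prepending the stored lessons to the system prompt on the next run. A tiny sketch, assuming the same hypothetical memories.json store as in the earlier sketch:

```python
import json
from pathlib import Path

MEMORY_FILE = Path("memories.json")  # same hypothetical store as above

def build_system_prompt(base_prompt: str) -> str:
    """Prepend stored lessons so the next request starts with the earlier fix."""
    if not MEMORY_FILE.exists():
        return base_prompt
    lessons = "\n".join(json.loads(MEMORY_FILE.read_text()))
    return f"{base_prompt}\n\nLessons from past mistakes:\n{lessons}"
```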