
Making LeCun report to Wang was the most boneheaded move imaginable. But… I suppose Zuckerberg knows what he wants, which is AI slopware and not truly groundbreaking foundation models.


In industry research, someone in a chief position like LeCun should know how to balance long-term research with short-term projects. However, for whatever reason, he consistently shows hostility toward LLMs and engineering projects, even though Llama and PyTorch are two of the most influential projects from Meta AI. His attitude doesn’t really match what is expected from a Chief position at a product company like Facebook. When Llama 4 got criticized, he distanced himself from the project, stating that he only leads FAIR and that the project falls under a different organization. That kind of attitude doesn’t seem suitable for the face of AI at the company. It's not a surprise that Zuck tried to demote him.


These are the types that want academic freedom in a cut-throat industry setup and conversely never fit into academia because their profiles and growth ambitions far exceed what an academic research lab can afford (barring some marquee names). It's an unfortunate paradox.


Maybe it's time for Bell Labs 2?

I guess everyone is racing towards AGI in a few years or whatever so it's kind of impossible to cultivate that environment.


The Bell Labs we look back on was only the result of government intervention in the telecom monopoly. The 1956 consent decree forced Bell to license thousands of its patents, royalty free, to anyone who wanted to use them. Any patent not listed in the consent decree was to be licensed at "reasonable and nondiscriminatory rates."

The US government basically forced AT&T to use revenue from its monopoly to do fundamental research for the public good. Could the government do the same thing to our modern megacorps? Absolutely! Will it? I doubt it.

https://www.nytimes.com/1956/01/25/archives/att-settles-anti...


There used to be Google X. Not sure at what scale it operated. But if any state/central bank were clever, they would subsidize this. That's a better trickle-down strategy. Until we get to AGI and all new discoveries are autonomously led by AI, that is :p


Google X is a complete failure. Maybe they had Fei-Fei Li on staff for a short while, but most of her work was done elsewhere.


  > Google X is a complete failure

  - Google Brain
  - Google Watch/Wear OS
  - Gcam/Pixel Camera
  - Insight (indoor GMaps)
  - Waymo
  - Verily
It is a moonshot factory after all, not a "we're only going to do things that are likely to succeed" factory. It's an internal startup space, which comes with high failure rates. But these successes seem pretty successful. Even the failed Google Glass seems to have led to learning, though they probably should have kept the team going, considering the success of the Meta Ray-Bans and things like Snap's glasses.

https://x.company/projects/#graduate

https://en.wikipedia.org/wiki/X_Development#Graduated_projec...


Didn't the current LLMs stem from this...? Or maybe that was Google Brain instead. For Google X, there is Waymo. I know a lot of stuff didn't pan out; that's expected. These were 'moonshots'.

But the principle is there. I think that when a company sits on a load of cash, that's what they should do. Either that or become a kind of alternative investments allocator. These are risky bets. But they should be incentivized to take those risks. From a fiscal policy standpoint for instance. Well it probably is the case already via lower taxation of capital gains and so on. But there should probably exist a more streamlined framework to make sure incentives are aligned.

And/or assigned government projects? Besides implementing their Cloud infrastructure that is...


It seems DeepMind is the closest thing to a well-funded blue-sky AI research group, even after the merger with Google Brain and the shift toward more of a product focus.


Google DeepMind is the closest lab to that idea because Google is the only entity that is big enough to get close to the scale of AT&T. I was skeptical that the DeepMind and Google Brain merger would be successful, but it seems to have worked surprisingly well. They are killing it with LLMs and image editing models. They are also backing the fastest growing cloud business in the world and collecting Nobel prizes along the way.


https://www.startuphub.ai/ai-news/ai-research/2025/sam-altma...

Like the new spin out Episteme from OpenAI?


I thought that was Google. Regulators pretend not to notice their monopoly, they probably get large government contracts for social engineering and surveillance laundered through advertising, and the “don’t be evil” part is they make some open source contributions


The fact that people invest in the architecture that keeps getting increasingly better results is a feature, not a bug.

If LLMs actually hit a plateau, then investment will flow towards other architectures.


At which point companies that had the foresight to investigate those architectures earlier on will have the lead.


I'd argue SSI and Thinking Machines Lab seem to be the environment you are thinking about: industry labs that focus on research without immediate product requirements.


I don't think that quite matches because those labs have very clear directions of research in LLMs. The theming is a bit more constrained and I don't know if a line of research as vague as what LeCun is pursuing would be funded by those labs.


I am of the opinion that splitting AT&T and hence Bell Labs was a net negative for America and rest of the world.

We have yet to create a lab as foundational as Bell Labs.


Why would Bell Labs be a good fit? It was famous for embedding engineers with the scientists to direct research in a more results-oriented fashion.


We call it “legacy DeepMind”


> I guess everyone is racing towards AGI in a few years

A pipe dream sustaining the biggest stock market bubble in history. Smart investors are jumping to the next bubble already...Quantum...


> A pipe dream sustaining the biggest stock market bubble in history

This is why we're losing innovation.

Look at electric cars, batteries, solar panels, rare earths and many more. Bubble or struggle for survival? Right, because if the US has no AI, the world will have no AI? That's the real bubble - being stuck in an ancient world view.

Meta's stock has already tanked for "over" investing in AI. Bubble, where?



2 Trillion dollars in Capex to get code generators with hallucinations, that run at a loss, and you ask where is the Bubble?


> 2 Trillion dollars in Capex to get code generators with hallucinations

You assume that's the only use of it.

And are people not using these code generators?

Is this an issue with a lost generation that forgot what capex is? We've moved from capex to opex and now the notion is lost, is it? You can hire an army of software developers but you can't build hardware.

Is it better when everyone buys DeepSeek or a non-US version? Well then you don't need to spend Capex but you won't have revenue either.


Deepseek somehow didn't need $2T to happen.


Because you know how much they spent.

And that $2T you're referring to includes infrastructure like energy, data centers, servers and many things. DeepSeek rents from others. Someone is paying.


All that led up to Deepseek needed more. Don't forget where it all comes from.


I think the argument can be made that Deepseek is a state-sponsored needle looking to pop another state's bubble.

If Deepseek is free it undermines the value of LLMs, so the value of these US companies is mainly speculation/FOMO over AGI.


> the argument can be made that Deepseek is a state-sponsored needle looking to pop another state's bubble

Who says they don't make money? Same with open source software that offer a hosted version.

> If Deepseek is free it undermines the value of LLMs, so the value of these US companies is mainly speculation/FOMO over AGI

Freemium, open source and other models all exist. Does it undermine the value of e.g. Salesforce?


"Big Tech Needs $2 Trillion In AI Revenue By 2030 or They Wasted Their Capex" - https://www.wheresyoured.at/big-tech-2tr/


This sounds crazy. We don't even know or can't define what human intelligence is or how it works, but we're trying to replicate it with AGI?


Man, why did no one tell the people who invented bronze that they weren’t allowed to do it until they had a correct definition for metals and understood how they worked? I guess the person saying something can’t be done should stay out of the way of the people doing it.


>> I guess the person saying something can’t be done should stay out of the way of the people doing it.

I'll happily step out of the way once someone simply tells me what it is you're trying to accomplish. Until you can actually define it, you can't do "it".


The big tech companies are trying to make machines that replace all human labor. They call it artificial intelligence. Feel free to argue about definitions.


No no, let's define labor (labour?) first.


Whatever you're doing for money that you wouldn't do if you didn't need money.


no bro, others have done 'it' without even knowing what they were doing!


I'm not sure what 'inventing bronze' is supposed to be. 'Inventing' AGI is pretty much equivalent to creating new life from scratch. And we don't have any idea how to do that either, or how life came to be.


Intelligence and human health can't be defined neatly. They are what we call suitcase words. If there is a physiological tradeoff in medical research between living to 500 years and being able to lift 1000 kg in one's youth, those are different dimensions along which we can make progress. The same happens with intelligence. I think we are on the right track.


If an LLM can pass a bar exam, isn't that at least a decent proof of concept or working model?


I don't think the bar exam is scientifically designed to measure intelligence, so that was an odd example. Citing the bar exam is like saying it passes a "Game of Thrones trivia" exam so it must be intelligent.

As for IQ tests and the like, to the extent they are "scientific" they are designed based on empirical observations of humans. They are not designed to measure the intelligence of a statistical system containing a compressed version of the internet.


Or does this just prove lawyers are artificially intelligent?

yes, a glib response, but think about it: we define an intelligence test for humans, which by definition is an artificial construct. If we then get a computer to do well on the test we haven't proved it's on par with human intelligence, just that both meet some of the markers that the test makers are using as rough proxies for human intelligence. Maybe this helps signal or judge if AI is a useful tool for specific problems, but it doesn't mean AGI


I love this application of AI the most but as many have stated elsewhere: mathematical precision in law won't work, or rather, won't be tolerated.


Do you have an example that you rely on for that kind of statement?


Hi there! :) Just wanted to gently flag that one of the terms (beginning with the letter "r") in your comment isn't really aligned with the kind of inclusive language we try to encourage across the community. Totally understand it was likely unintentional - happens to all of us! Going forward, it'd be great to keep things phrased in a way that ensures everyone feels welcome and respected. Thanks so much for taking the time to share your thoughts here!


My apologies, I have edited my comment.


stretching the infinite game is exactly that, yes, "This is the way"


[flagged]


If you are (obviously) interested in the matter, you might find one of these Bell Labs articles discussed on HN:

"Why Bell Labs Worked" [1]

"The Influence of Bell Labs" [2]

"Bringing back the golden days of Bell Labs" [3]

"Remembering Bell Labs as legendary idea factory prepares to leave N.J. home" [4] or

"Innovation and the Bell Labs Miracle" [5]

interesting too.

[1] https://news.ycombinator.com/item?id=43957010 [2] https://news.ycombinator.com/item?id=42275944 [3] https://news.ycombinator.com/item?id=32352584 [4] https://news.ycombinator.com/item?id=39077867 [5] https://news.ycombinator.com/item?id=3635489


I became interested in the matter reading this thread and vaguely remember reading a couple of the articles. Saved them all in NotebookLM to get an audio overview and to read later. Thanks!


I always take a bird's eye kind of view on things like that, because however close I get, it always loops around to make no sense.

> is massively monopolistic and have unbounded discretionary research budget

That is the case for most megacorps, if you look at all the financial instruments.

Modern monopolies are not equal to single-corporation domination. Modern monopolies are portfolios that do business using the same methods and strategies.

The problem is that private interests strive mostly for control, not money or progress. If they have to spend a lot of money to stay in control of (their (share of the)) segments, they will do that, which is why stuff like the current graph of investments of, by and for AI companies and industries works.

A modern equivalent and "breadth" of a Bell Labs (et al.) kind of R&D speed could not be controlled and would 100% result in actual Artificial Intelligence vs all those white labelababbebel (sry) AI toys we get now.

Post-WWI and WWII "business psychology" has built a culture that cannot thrive in a free world (free as in undisturbed and left to all devices available) for a variety of reasons, but mostly because of elements with a medieval/dark-age kind of aggressive tendency to come to power and maintain it that way.

In other words: not having a Bell Labs kind of setup anymore ensures that the variety of approaches taken on large scales aka industry-wide or systemic, remains narrow enough.


More importantly, even if you do want it, and there are business situations that support your ambitions, you still have to get into the managerial power play, which quite honestly takes a separate kind of skill set, time and effort - which I'm guessing the academia-oriented people aren't willing to invest.

It's pretty much dog eat dog at top management positions.

It's not exactly a space for free-thinking timelines.


It is not a free-thinking paradise in academia either. Different groups fighting for hiring, promotions and influence exist there, too. And it tends to be more pronounced: it is much easier in industry to find a comparable job to escape a toxic environment, so a lot of problems in academic settings fester forever.

But the skill set needed to avoid and survive personnel issues in academia is different from industry. My 2c.


> It's not exactly a space for free-thinking timelines.

Same goes for academia. People's visions compete for other people's financial budgets, time and other resources. Some dogs get to eat, study, train at the frontier and with top tools in top environments while the others hope to find a good enough shelter.


Meta has the financial oomph to run multiple Bell Labs within its organization.

Why they decided not to do that is kind of a puzzle.


Because the business hierarchy clearly couldn't support it. Take that for what you will.


As I understand it, Bell Labs' mandate was to improve the network, which had tons of great threads to pull on: plastics for handsets, transistors for amplification, information theory for capacity on fixed copper.

Google and Meta are ads businesses with a lot less surface area for such a mandate to have similar impact and, frankly, exciting projects people want to do.

Meanwhile they still have tons of cash so, why not, throw money at solving Atari or other shiny programs.

Also, for cultural reasons, there's been a huge shift to expensive monolithic "moonshot programs" whose expenses require on-demand progress to justify them, and which are simply slower and way less innovative.

Three passionate designers hiding deep inside Apple can side-hustle up the key gestures that make multi-touch baked enough to see a path to an iPhone - long before the iPhone was any sort of endgame direction they were being managed toward.

Innovation thrives on lots of small teams mostly failing in the search for something worth doubling down on.

Google et al. have a new approach - aim for the moon, budget and staff for the moon, then burn cash while no one ever really polishes up the fundamental enabling pieces that, in hindsight, they needed to succeed.


I would pose the question differently: under his leadership, did Meta achieve a good outcome?

If the answer is yes, then it's better to keep him, because he has already proven himself and you can win in the long term. With Meta's pockets, you can always create a new department specifically for short-term projects.

If the answer is no, then nothing to discuss here.


Meta did exactly that, kept him but reduced his scope. Did the broader research community benefit from his research? Absolutely. But did Meta achieve a good outcome? Probably not.

If you follow LeCun on social media, you can see that the way FAIR’s results are assessed is very narrow-minded and still follows the academic mindset. He mentioned that his research is evaluated by: "Research evaluation is a difficult task because the product impact may occur years (sometimes decades) after the work. For that reason, evaluation must often rely on the collective opinion of the research community through proxies such as publications, citations, invited talks, awards, etc."

But as an industry researcher, he should know how his research fits with the company vision and be able to assess that easily. If the company's vision is to be the leader in AI, then as of now, he seems to have failed that objective, even though he has been at Meta for more than 10 years.


Also he always sounds like "I know this will not work". Dude are you a researcher? You're supposed to experiment and follow the results. That's what separates you from oracles and freaking philosophers or whatever.


Philosophers are usually more aware of their not knowing than you seem to give them credit for. (And oracles are famously vague, too).


Do you know that all formally trained researchers have Doctor of Philosophy or PhD to their name? [1]

[1] Doctor of Philosophy:

https://en.wikipedia.org/wiki/Doctor_of_Philosophy


If academia is in question, then so are their titles. When I see "PhD", I read "we decided that he was at least good enough for the cause" PhD, or PhD (he fulfilled the criteria).


he probably predicted the asymptote everyone is approaching right now


So did I after trying llama/Meta AI


He's speaking to the entire feedforward Transformer-based paradigm. He sees little point in continuing to try to squeeze more blood out of that stone and instead move on to more appropriate ways to model ontologies per se rather than the crude-for-what-we-use-them-for embedding-based methods that are popular today.

I really resonate with his view due to my background in physics and information theory. I for one welcome his new experimentation in other realms while so many still hack away at their LLMs in pursuit of SOTA benchmarks.


If the LLM hype doesn't cool down fast, we're probably looking at another AI winter. Appears to me like he's just trying to ensure he'll have funding for chasing the global maximum going forward.


> If the LLM hype doesn't cool down fast, we're probably looking at another AI winter.

Is the real bubble ignorance? Maybe you'll cool down but the rest of the world? There will just be more DeepSeek and more advances until the US loses its standing.


How is it a foregone conclusion that squeezing the stone will continue to produce blood?


I believe the fact that Chinese models are beating the crap out of Llama means it's a huge no.


Why? The Chinese are very capable. Most DL papers have at least one Chinese name on them. That doesn't mean the authors are Chinese, but it's telling.


Is an American model Chinese because Chinese people were on the team?


There is no need for that tone here.


OP edited the post.


What are these Chinese labs made of?


500 remote indian workers (/s)


Most papers are also written in the same language; what's your point?


LeCun was always part of FAIR, doing research, not part of the LLM/product group, who reported to someone else.


Wasn't the original LLaMA developed by FAIR Paris?


I hadn't heard that, but he was heavily involved in Galactica, an LLM for scientific knowledge that was pulled shortly after its release.


Yeah that stuff generated embarrassingly wrong scientific 'facts' and citations.

That kind of hallucination is somewhat acceptable for something marketed as a chatbot, less so for an assistant helping you with scientific knowledge and research.


I thought it was weird at the time how much hate Galactica got for its hallucinations compared to hallucinations of competing models. I get your point and it partially explains things. But it's not a fully satisfying explanation.


I guess another aspect is - being too early is not too different from being wrong.


then we should ask: will Meta come close enough to the fulfillment of the promises made, or will it keep achieving good enough outcomes?


Meta had a two prong AI approach - product-focused group working on LLMs, and blue-sky research (FAIR) working on alternate approaches, such as LeCun's JEPA.

It seems they've given up on the research and are now doubling down on LLMs.


Product companies with deprioritized R&D wings are the first ones to die.


Apple doesn't have an "R&D wing". It's a bad idea to split your company into the cool part and the boring part.


Isn't that why Siri is worse today than it was thirteen years ago?


It's better in ways you don't think about.

(It works offline, it works in other languages, the TTS is much better.)


None of it matters if it does not understand the audio input and map it to the correct actions on your device.

The first one is a solved problem now, and the second one, while not solved, is where a little bit of research can really make a difference.


And apparently that doesn't stop people from buying their products


It doesn't.

Apple makes the best hardware, period.

It makes sense that people are willing to overlook subpar software for top notch hardware.


Hasn't happened to Google yet


Has Google depriortized R&D?


None of Meta's revenue has anything to do with AI at all. (Other than GenAI slop in old people's feeds.) Meta is in the strange position of investing very heavily in multiple fields where they have no successful product: VR, hardware devices, and now AI. Ad revenue funds it all.


LLMs help ads efficiency a lot: policy labels, targeting, adaptive creatives, landing page evals, etc.


Underrated comment


LeCun truly believes the future is in world models. He’s not alone. Good for him to now be in the position he’s always wanted and hopefully prove out what he constantly talks about.


He seems stuck in the GOFAI development philosophy where they just decide humans have something called a "world model" because they said so, and then decide that if they just develop some random thing and call it a "world model" it'll create intelligence because it has the same name as the thing they made up.

And of course it doesn't work. Humans don't have world models. There's no such thing as a world model!


I do agree humans don't have a world model. It is really more than that. We exist in the world. We don't need a world model because we exist in the world.

It is like saying a fish has a water model. It makes no sense when the fish existence is intertwined with water.

That is not to say that a computer with a model of the world would not most likely be extremely useful compared to something like an LLM, which has none. A world model would be the best we could do to create a machine that simulates being in the world.


I don't think the focus is really on world models per se, but rather on animal-like intelligence based around predicting the real world - and to predict it, you need to model it in some sense.


IMO the issue is that animals can't have a specific "world model" system, because if you create a model ahead of time, you mostly waste energy, since most of the model is never used.

And animals' main concern is energy conservation, so they must be doing something else.


There are many factors playing into "survival of the fittest", and energy conservation is only one. Animals build mental models to predict the world because this superpower of seeing into the future is critical to survival - predict where the water is in a drought, where the food is, and how to catch it, etc, etc.

The animal learns as it encounters learning signals - prediction failures - which is the only way to do it. Of course you need to learn/remember something before you can use it in the future, so in that sense it's "ahead of time", but the reason it's done that way is that evolution has found that learning patterns will ultimately prove beneficial.


It doesn't necessarily need to model the world to learn how to perform actions though. That was the topic of this old GOFAI research:

https://aaai.org/papers/00268-aaai87-048-pengi-an-implementa...

It instead works by "doing the thing that worked last time".

As an example, you don't usually need to know what is in your garbage in order to take out the trash.


Right - I've no idea how LeCun thinks about it, but I don't see that an animal needs or would have any more of a "world model" than something like an LLM. I'm sure all the research into rats in mazes etc has something to say about their representations of location/etc, but given a goal of prediction it seems that all is needed is a combination of pattern recognition and sequence prediction - not an actual explicit "declarative" model.

It seems that things like place cells and grandmother cells are a part of the pattern recognition component, but recognizing landmarks and other predictive-relevant information doesn't mean we have a complete coherent model of the environments we experience - perhaps more likely a fragmented one of task-relevant memories. It seems like our subjective experience of driving is informative - we don't have a mental road map but rather familiarity with specific routes and landmarks. We know to turn right at the gas station, etc.


Yann was never a good fit for Meta.


Agreed, I am surprised he is happy to stay this long. He would have been on paper a far better match at a place like pre-Gemini-era Google


LLM hostility was warranted. The overhyped, downright charlatan nature of AI hype and marketing threatens another AI winter. It happened to cybernetics; it'll happen to us too. The finance folks will be fine, they'll move to the next big thing to overhype; it is the researchers who suffer the fallout. I am considered anti-LLM (anti-transformer, anyway) for this reason. I like the architecture; it is cool and rather capable at its problem set, which is a unique set, but it isn't going to deliver any of what has been promised, any more than a plain DNN or a CNN will.


Meta is in last place among the big tech companies making an AI push because of LeCun's LLM hostility. Refusing to properly invest in the biggest product breakthrough of this century was not even a little bit warranted. He had more than enough resources available to do the research he wanted and create a fantastic open source LLM.


Meta has made some fantastic LLMs publicly available, many of which continue to outperform all but the Qwen series in real-world applications.

LLMs cannot deliver on any of the major claims made for them, so competing at the current frontier is a massive waste of resources.

Right now a locally running 8B model with a large context window (10k+ tokens) beats Google/OpenAI models easily on any task you like.

Why would anyone then pay for something that can run on consumer hardware with higher tokens/second throughput and better performance? What exactly have the billions invested given Google/OpenAI in return? Nothing more than an existential crisis, I'd say.

Companies aren't trying to force AI costs into their subscription models in dishonest ways because they've got a winning product.


I don't really agree with your perception of current LLMs, but the point is it doesn't even matter. This is a PR war. LeCun lost it for Meta. Meta needs to be thought of as an AI leader to gain traction in their metaverse stuff. They can live with everyone thinking they're evil, but if everyone thinks they're lame has-beens they are fucked.


Are they thought of as lame has-beens? Or even on a trajectory to be thought of that way? I don't think that's true, at least not in my circles. Like you said, evil, sure, but not has-beens.


This is the right take. He is obviously a pioneer and much more knowledgeable than Wang in the field, but if you no longer have the product mind to serve the company's business interests in both a short-term and a long-term capacity, you may as well stay in academia and be your own research director rather than a chief executive at one of the largest public companies.


It's very hard (and almost irreconcilable) to lead both Applied Research -- that optimizes for product/business outcomes -- and Fundamental Research -- that optimizes for novel ideas -- especially at the scale of Meta.

LeCun had chosen to focus on the latter. He can't be blamed for not having taken on the second hat as well.


Yes he can. If he wanted to focus on fundamental research he shouldn’t have accepted a leadership position at a product company. He knew going in that releasing products was part of his job and largely blew it.


Yann was in charge of FAIR, which has nothing to do with Llama 4 or the product-focused AI orgs. In general your comment is filled with misrepresentations. Sad.


FAIR having shit for products is the whole reason he is being demoted/fired. Yes, he had nothing to do with applied research, that was the problem.


Lecun has also consistently tried to redefine open source away from the open source definition.


Tbf, transformers are hugely wasteful from a developmental perspective. They're long-range stable, sure, but the whole training process requires so much power/data compared to even slightly simpler model designs that I can see why people are drawn to alternative, complex model designs that downplay the reliance on pure attention.


I totally agree. He appeared to act against his employer and actively undermined Meta's effort to attract talent by his behavior visible on X.

And I stopped reading him, since he - in my opinion - trashed on autopilot everything the other 99% did - and those 99% were already beyond two standard deviations of greatness.

It is even more problematic if you have absolutely no results, e.g. products, to back your claims.


He is also not very interested in LLMs, and that seems to be Zuck's top priority.


Yeah, I think LeCun is underestimating the impact that LLMs and diffusion models are going to have, even considering the huge impact they're already having. That's no problem, as I'm sure whatever LeCun is working on is going to be amazing as well, but an enterprise like Facebook can't have their top researcher work on risky things when there are surefire paths to success still available.


I politely disagree - it is exactly an industry researcher's purpose to do the risky things that may not work, simply because the rest of the corporation cannot take such risks but must walk on more well-trodden paths.

Corporate R&D teams are there to absorb risk, innovate, disrupt, create new fields, not for doing small incremental improvements. "If we know it works, it's not research." (Albert Einstein)

I also agree with LeCun that LLMs in their current form - are a dead end. Note that this does not mean that I think we have already exploited LLMs to the limit, we are still at the beginning. We also need to create an ecosystem in which they can operate well: for instance, to combine LLMs with Web agents better we need a scalable "C2B2C" (customer delegated to business to business) micropayment infrastructure, because as these systems have already begun talking to each other, in the longer run nobody would offer their APIs for free.

I work on spatial/geographic models, inter alia, which by coincidence is one of the directions mentioned in the LeCun article. I do not know what his reasoning is, but mine was/is: LMs are language models and should (only) be used as such. We need other models - in particular a knowledge model (KM/KB) - to cleanly separate knowledge from text generation; it looks to me right now that only that will solve hallucination.
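To make that last point concrete, here is a toy sketch in Python (purely my own illustration of the separation, not an actual system design; the names are made up): the knowledge model is an explicit store of facts, and the text-generation layer only verbalizes what the store returns - and refuses when it returns nothing, so there is nothing to hallucinate from.

  # Toy KM/KB + LM split (illustrative only).
  KB = {
      ("Bell Labs", "invented", "the transistor"),
      ("Bell Labs", "was owned by", "AT&T"),
  }

  def query_kb(subject):
      """The knowledge model: return every stored fact about a subject."""
      return [t for t in KB if t[0] == subject]

  def verbalize(facts):
      """Stand-in for the language model: it only renders retrieved facts."""
      if not facts:
          return "I don't have that in my knowledge base."  # refuse, don't guess
      return " ".join(f"{s} {p} {o}." for s, p, o in facts)

  print(verbalize(query_kb("Bell Labs")))   # grounded answer
  print(verbalize(query_kb("Xerox PARC")))  # explicit "don't know"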


Knowledge models, like ontologies, always seem suspect to me; like they promise a schema for crisp binary facts, when the world is full of probabilistic and fuzzy information loosely categorized by fallible humans based on an ever slowly shifting social consensus.

Everything from the sorites paradox to leaky abstractions; everything real defies precise definition when you look closely at it, and when you try to abstract over it, to chunk up, the details have an annoying way of making themselves visible again.

You can get purity in mathematical models, and in information systems, but those imperfectly model the world and continually need to be updated, refactored, and rewritten as they decay and diverge from reality.

These things are best used as tools by something similar to LLMs, models to be used, built and discarded as needed, but never a ground source of truth.


>Knowledge models, like ontologies, always seem suspect to me; like they promise a schema for crisp binary facts, when the world is full of probabilistic and fuzzy information loosely categorized by fallible humans based on an ever slowly shifting social consensus.

I don't disagree that the world is full of fuzziness. But the problem I have with this portrayal is that formal models are often normative rather than analytical. They create reality rather than being an interpretation or abstraction of reality.

People may well have a fuzzy idea of how their credit card works, but how it really works is formally defined by financial institutions. And this is not just true for software products. It's also largely true for manufactured products. Our world is very much shaped by artifacts and man-made rules.

Our probabilistic, fuzzy concepts are often simply a misconception. That doesn't mean it's not important of course. It is important for an AI to understand how people talk about things even if their idea of how these things work is flawed.

And then there is the sort of semi-formal language used in legal or scientific contexts that often has to be translated into formal models before it can become effective. Law makers almost never write algorithms (when they do, they are often buggy). But tax authorities and accounting software vendors do have to formally model the language in the law and then potentially change those formal definitions after court decisions.

My point is that the way in which the modeled, formal world interacts with probabilistic, fuzzy language and human actions is complex. In my opinion we will always need both. AIs ultimately need to understand both and be able to combine them just like (competent) humans do. AI "tool use" is a stop-gap. It's not a sufficient level of understanding.


> People may well have a fuzzy idea of how their credit card works, but how it really works is formally defined by financial institutions.

> Our probabilistic, fuzzy concepts are often simply a misconception.

How eg a credit card works today is defined by financial institutions. How it might work tomorrow is defined by politics, incentives, and human action. It's not clear how to model those with formal language.

I think most systems we interact with are fuzzy because they are in a continual state of change due to the aforementioned human society factors.


To some degree I think that our widely used formal languages may just be insufficient and could be improved to better describe change.

But ultimately I agree with you that this entire societal process is just categorically different. It's simply not a description or definition of something, and therefore the question of how formal it can be doesn't really make sense.

Formalisms are tools for a specific but limited purpose. I think we need those tools. Trying to replace them with something fuzzy makes no sense to me either.


I believe the formalisms can be constructed by something fuzzy. Humans are fuzzy; they create imperfect formalisms that work until they break, and then they're abandoned or adapted.

I don't see how LLMs are significantly different. I don't think the formalisms are an "other". I believe they could be tools, both leveraged and maintained by the LLM, in much the same way as most software engineers, when faced with a tricky problem that is amenable to brute force computation, will write up a quick script to answer it rather than try and work it out by hand.


I think AI could do this in principle but I haven't seen a convincing demonstration or argument that Transformer based LLMs can do it.

I believe what makes the current Transformer based systems different to humans is that they cannot reliably decide to simulate a deterministic machine while linking the individual steps and the outcomes of that application to the expectations and goals that live in the fuzzy parts of our cognitive system. They cannot think about why the outcome is undesirable and what the smallest possible change would be to make it work.

When we ask them to do things like that, they can do _something_, but it is clearly based on having learned how people talk about it rather than actually applying the formalism themselves. That's why their performance drops off a cliff as soon as the learned patterns get too sparse (I'm sure there's a better term for this that any LLM would be able to tell you :)

Before developing new formalisms you first have to be able to reason properly. Reasoning requires two things. Being able to learn a formalism without examples. And keeping track of the state of a handful of variables while deterministically applying transformation rules.

The fact that the reasoning performance of LLMs drops off a cliff after a number of steps tells me that they are not really reasoning. The 1000th rule-based transformation, depending only on the previous state of the system, should not be more difficult or error-prone than the first one, because every step _is_ the first one in a sense. There is no such cliff edge for humans.
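To illustrate the "every step is the first one" property with a toy example of my own (nothing from any paper): a program that applies one deterministic rule has the same per-step reliability at step 1000 as at step 1, which is exactly the cliff-edge-free behaviour I mean.

  # One deterministic transformation rule, applied repeatedly.
  def collatz_step(n):
      return n // 2 if n % 2 == 0 else 3 * n + 1

  def apply_rule(n, steps):
      state = n
      for _ in range(steps):
          state = collatz_step(state)  # only the current state matters
      return state

  # The 1000th application is exactly as easy and as reliable as the 1st.
  print(apply_rule(27, 10))
  print(apply_rule(27, 1000))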


You're basically describing the knowledge problem vs. model structure: how to even begin to design a system that self-updates/dynamically learns vs. one that is trained and deployed.

Cracking that is a huge step. Pure multi-modal trained models will probably give us a hint, but I think we're some way from seeing a pure multi-modal open model that can be pulled apart/modified. Even then they're still train-and-deploy, not dynamically learning. I worry we're just going to see LSTM design bolted onto deep LLMs because we don't know where else to go, and it will be fragile and take eons to train.

And the less said about the crap of "but inference is doing some kind of minimization within the context window" the better; it's vacuous and not where great minds should be looking for a step forward.


I have vague notions of there being an entire hidden philosophical/political battlefield (massacre?) behind the whole "are knowledge models/ontologies a realistic goal" debate.

Starting with the sophomoric questions of the optimist who mistakes the possible for the viable: how definite of a thing is "the world", how knowable is it, what is even knowledge... and then back through the more pragmatic: by whom is it knowable, to what degree, and by what means. The mystics: is "the world" the same thing as "the sum of information about the world"? The spooks: how does one study those fields of information which are already agentic and actively resist being studied by changing themselves, such as easily emerge anywhere more than n(D) people gather?

Plenty of food for thought from why ontologies are/aren't a thing. The classical example of how this plays out in the market being search engines winning over internet directories. But that's one turn of the wheel. Look at what search engines grew into quarter century later. What their outgrowths are doing to people's attitude towards knowledge. Different timescale, different picture.

Fundamentally, I don't think human language has sufficient resolution to model large spans of reality within the limited human attention span. The physical limits of human language as information processing device have been hit at some point in the XX century. Probably that 1970s divergence between productivity and wages.

So while LLMs are "computers speak language now" and it's amazing if sad that they cracked it by more data and not by more model, what's more amazing is how many people are continually ready to mistake language for thought. Are they all P-zombies or just obedience-conditioned into emulating ones?!?!?

Practically, what we lack is not the right architecture for "big knowing machine", but better tools for ad-hoc conceptual modeling of local situations. And, just like poetry that rhymes, this is exactly what nobody has a smidgen of interest to serve to consumers, thus someone will just build it in their basement in the hope of turning the tables on everyone. Probably with the help of LLMs as search engines and code generators. Yall better hurry. They're almost done.


Nice commentary and I enjoyed the poetic turn of phrase. I had to respond to it with my own thoughts if only to bookmark it for myself.

> how many people are continually ready to mistake language for thought

This is a fundamental illusion, where rote memory and names and words get mistaken for understanding. This was wonderfully illustrated here [1]. Few really grok what understanding actually is. This is an unfortunate by-product of our education system.

> Are they all P-zombies or just obedience-conditioned into emulating ones?!?!?

Brilliant way to state the fundamental human condition. ie, we are all zombies conditioned to imitate rather than understand. Social media amplifies the zombification, and now LLMs do that too.

> Starting with the sophomoric questions of the optimist who mistakes the possible for the viable

This is the fundamental tension between operationalized meaning and imagination. A grokking soul gathers mists from the cosmic chaos and creates meaning and operationalizes it for its own benefit and then continually adapts it.

> it's amazing if sad that they cracked it by more data and not by more model

I was speaking to experts in the sciences (chemistry). They were shocked that the underlying architecture is brute force. They expected a compact, information-compressed theory which is able to model independently of data. The problem with brute-force approaches is that they don't scale and don't capture the essences which are embodied in theories.

> The physical limits of human language as information processing device have been hit at some point in the XX century

That goes back 2000 years, to when humans realized that formalism was needed to operationalize meaning, and that natural language was too vague to capture and communicate it. The world model that natural language captures encompasses "everything", whereas making it "useful" requires limiting it via formalism.

[1] https://news.ycombinator.com/item?id=2483976


I disagree with most of what you said.


Is it that fuzzy though? If it were, would language be able to grasp and model our realities as adequately as it does? And what about the physical world itself: animals are modeling the world adequately enough to navigate it. There are significant gains to be made from modeling _enough_ of the world, without falling into the hallucinations of an LLM's purely statistical associations.


World models are trivial. E.g. narratives are world models, and they provide only prefrontal simulation, i.e. they are synthetically prey-predation. No animal uses world models for survival, and it's doubtful they exist (maps are not models); a world model doesn't conform to optic flow, i.e. instantaneous use and response. Anything like a world model isn't shallow (the basic premise of oscillatory command); it's needlessly deep, nothing like brains. This is just a frontier hail-mary for the current age.


> it is exactly a researcher's purpose to do the risky things that may not work

Maybe at university, but not at a trillion dollar company. That job as chief scientist is leading risky things that will work to please the shareholders.


They knew what Yann LeCun was when they hired him. If anything, those brilliant academics who have done what they're told and loyally pursued corporate objectives the way the corporation wanted (e.g. Karpathy when he was at Tesla) haven't had great success either.


>They knew what Yann LeCun was when they hired him.

Yes, but he was hired in the ZIRP era, when all SV companies were hiring every opinionated academic and giving them free rein and unlimited money to burn in the hope that maybe they'll create the next big thing eventually.

These are very different economic times, now that the Fed's infinite money glitch has been patched out, so people do need to adjust and start actually making products of value commensurate with their seven-figure costs to their employers, or end up being shown the door.


Some employees even need to be physically present at the office.


so your message is to short OpenAI before it implodes and gets absorbed into Cortana or equivalent ;)


Unless you're an insider, currently you'd need to express that short via something else.


> risky things that will work

Things known to work are not risky. Risky things can fail by definition.


What exactly does it mean for something to be a "risky thing that will work"?


“Risky things that will work” - contradiction in terms. If companies only did things they knew would work, we probably still wouldn’t have microchips.

Also, like… it’s Facebook. It has a history of ploughing billions into complete nonsense (see metaverse). It is clearly not particularly risk averse.


> I also agree with LeCun that LLMs in their current form - are a dead end.

Well then you and he are clearly dead wrong.


How do you know that for sure? How can you be absolutely certain that LLMs are what will lead to AGI?


I’ve yet to meet a single person who claims AGI will happen without recycling the same broken reasoning the peak-oil retards were peddling a decade ago.

Talking to these people is exhausting, so I cut straight to the chase: name the exact, unavoidable conditions that would prove AGI won’t happen.

Shockingly, nobody has an answer. They’ve never even considered it.

That’s because their whole belief is unfalsifiable.


Either that, or just tautological, given that LLM tech is continually morphing and improving.


LLMs and Diffusion solve a completely different problem than world models.

If you want to predict future text, you use an LLM. If you want to predict future frames in a video, you go with Diffusion. But what both of them lack is object permanence. If a car isn't visible in the input frame, it won't be visible in the output. But in the real world, there are A LOT of things that are invisible (image) or not mentioned but only implied (text) that still strongly affect the future. Every kid knows that when you roll a marble behind your hand, it'll come out on the other side. But LLMs and Diffusion models routinely fail to predict that, as for them the object disappears when it stops being visible.

Based on what I heard from others, world models are considered the missing ingredient for useful robots and self-driving cars. If that's halfway accurate, it would make sense to pour A LOT of money into world models, because they will unlock high-value products.


Sure, if you only consider the model they have no object permanence. However you can just put your model in a loop, and feed the previous frame into the next frame. This is what LLM agent engineers do with their context histories, and it's probably also what the diffusion engineers do with their video models.

Messing with the logic in the loop and combining models has an enormous potential, but it's more engineering than researching, and it's just not the sort of work that LeCun is interested in. I think the conflict lies there, that Facebook is an engineering company, and a possible future of AI lies in AI engineering rather than AI research.
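A minimal sketch of that outer loop, to be clear about where the "memory" lives (my own toy code, not how any real video or agent stack is implemented): the model itself is stateless, and whatever object permanence exists is carried by the state the loop keeps feeding back in.

  from dataclasses import dataclass, field

  @dataclass
  class WorldState:
      visible: set = field(default_factory=set)
      remembered: set = field(default_factory=set)  # things that left the frame

  def fake_model(state, event):
      """Stand-in for a learned frame/token predictor (stateless by itself)."""
      obj, action = event.split()
      if action == "appears":
          state.visible.add(obj)
          state.remembered.discard(obj)
      elif action == "hides":
          state.visible.discard(obj)
          state.remembered.add(obj)  # the loop's state, not the net, remembers it
      return state

  state = WorldState()
  for event in ["marble appears", "marble hides"]:
      state = fake_model(state, event)  # previous output becomes next input

  print(state.visible, state.remembered)  # set() {'marble'}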


>But what both of them lack is object permanence.

This is something that was true last year, but it's hanging on by a thread this year. Genie shows this off really well, but it's in the video models as well.[1]

[1]https://storage.googleapis.com/gdm-deepmind-com-prod-public/...


I think world models are the way to go for superintelligence. One of the patents I saw already going in this direction, for autonomous mobility, is https://patents.google.com/patent/EP4379577A1 where synthetic data generation (visualization) is the missing step in terms of our human intelligence.


This is the first time I have heard of world models. Based on my brief reading it does look like this is the ideal model for autonomous driving. I wonder if the self-driving companies are already using this architecture or something close to it.


I thoroughly disagree; I believe world models will be critical in some respect for text generation too. A predictive world model can help to validate your token prediction. Take a look at the Code World Model, for example.


lol what is this? We already have world models based on diffusion and autoregressive algorithms.


> but an enterprise like Facebook can't have their top researcher work on risky things when there's surefire paths to success still available.

Bell Labs


> I think LeCun is underestimating the impact that LLM's and Diffusion models

No, I think he's suggesting that "world models" are more impactful. The issue for him inside Meta is that there is already a research group looking at that, and it is wildly more successful (in terms of getting research to product) and way fucking cheaper to run than FAIR.

Also LeCun is stuck weirdly in product land, rather than research (RL-R) which means he's not got the protection of Abrash to isolate him from the industrial stupidity that is the product council.


Which other group is that?


Hard to tell.

The last time LeCun disagreed with the AI mainstream was when he kept working on neural nets while everyone thought they were a dead end. He might be entirely right in his LLM scepticism. It's hardly a surefire path. He didn't prevent Meta from working on LLMs anyway.

The issue is more that his position is not compatible with short-term investors' expectations, and that's fatal in a company like Meta at the position LeCun occupies.


> Facebook can't have their top researcher work on risky things when there's surefire paths to success still available.

How did you determine that there are "surefire paths to success still available"? Most academics agree that LLMs (or LLMs alone) are not going to lead us to AGI. How are you so certain?


I don't believe we need more academic research to achieve AGI. The sort of applications that are solving the recent AGI challenges are just severely resource-constrained AGI. The only difference between those systems and human intelligence is resources and incentives.

Not that I believe AGI is the measure of success; there are probably much more efficient ways to achieve company goals than simulating humans.


Unless I've missed a few updates, much of the JEPA stuff didn't really bear a lot of fruit in the end.


I don't think he's given up on it.

How many decades did it take for neural nets to take off?

The reason we're even talking about LeCun today is because he was early in seeing the promise of neural nets and stuck with it through the whole AI winter when most people thought it was a waste of time.


But neural nets were always popular, they just went through phases of hype depending on the capacity of hardware at the time. The only limitation of neural nets at the time was computational power to scale up. AI winters came when other techniques became available that required less compute. Once GPGPU became available, all of that work became immediately viable.

No similar limitations exist today for JEPA, to my knowledge.


Depends on how far back you are going. There was the whole 1969 Minsky Perceptrons flap, where he said ANNs (i.e. perceptrons) were useless because they can't learn XOR (and no one at the time knew how to train multi-layer ANNs), which stifled ANN research and funding for a while. It would then be almost 20 years until the 1986 PDP handbook published LeCun and Hinton's rediscovery of backpropagation as a way to train multi-layer ANNs, thereby making them practical.

The JEPA parallel is just that it's not a popular/mainstream approach (at least in terms of well-funded research), but it may eventually win out over LLMs in the long term. Modern GPUs provide plenty of power for almost any artificial-brain type of approach, but are of course expensive at scale, so lack of funding can be a barrier in and of itself.
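As a quick aside on the XOR point, the standard textbook demonstration (my own quick sketch, not taken from Minsky's book or the PDP volume) is that no single linear layer can separate XOR, while a tiny two-layer net trained with backprop learns it easily:

  import numpy as np

  rng = np.random.default_rng(0)
  X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
  y = np.array([[0], [1], [1], [0]], dtype=float)  # XOR targets

  def sigmoid(z):
      return 1.0 / (1.0 + np.exp(-z))

  W1 = rng.normal(size=(2, 4)); b1 = np.zeros((1, 4))  # hidden layer
  W2 = rng.normal(size=(4, 1)); b2 = np.zeros((1, 1))  # output layer

  for _ in range(20000):
      h = sigmoid(X @ W1 + b1)
      out = sigmoid(h @ W2 + b2)
      # backpropagate the squared error through both layers
      d_out = (out - y) * out * (1 - out)
      d_h = (d_out @ W2.T) * h * (1 - h)
      W2 -= 0.5 * h.T @ d_out; b2 -= 0.5 * d_out.sum(0, keepdims=True)
      W1 -= 0.5 * X.T @ d_h;   b1 -= 0.5 * d_h.sum(0, keepdims=True)

  # Should approach [0, 1, 1, 0]; exact values depend on the random seed.
  print(np.round(out, 2).ravel())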


>the huge impact they're already having

In the software development world, yes; outside of that, virtually none. Yes, you can transcribe a video call in Office, but that's not groundbreaking. I dare you to list 10 impacts on different fields, excluding tech and including at least half blue-collar fields and at least half white-collar fields, at different levels from the lowest to the highest in the company hierarchy, that LLM/diffusion models are having. Impact here specifically means a significant reduction of costs or a significant increase of revenue. Go on.


I'm also not sure it even drives a ton of value in software engineering. It makes the easy part easier and the hard part harder. Typing out software in your mind was never the difficult part. Figuring out what to write, how to interpret specs in context, how to make your code work within the context of a broader whole, how to be extensible, maintainable, reliable, etc. That's hard, and LLMs really don't help.

Even when writing, it shifts the mental burden from an easy thing (writing code) to a very hard thing (reading that code, validating that it's right and hallucination-free, and then refactoring it to match your team's code style and patterns).

It's great for building a first-order approximation of a tech demo app that you then throw out and build from scratch, and auto-complete. In my experience, anyways. I'm sure others have had different experiences.


You already mentioned two fields they have a huge impact on, software development and NLP (the latter the most impacted so far). Another field that comes to mind is academic research, which is getting an important boost as well via semantic search or more advanced stuff like Google's biological cell model, which has already uncovered new treatments. I'm sure I'm missing a lot of other fields I'm less familiar with (legal, for example). But just the impacts I listed are huge, and they will indirectly have a huge impact on all other areas of human industry; it's just a matter of time. "Software will eat the world" and all that.


Personally, I find myself using LLMs more than Google now, even for non-development tasks. I think this shift is going to become the new normal (if it isn't already).


And what's the end result? All one can see is just a bigger representation of those who confidently subscribe to false information and become arrogant when its validity is questioned, as the LLM writing style has convinced them it's some sort of authority. Even people on this website are so misinformed as to believe that ChatGPT has developed its own reasoning, despite it being, at its core, an advanced learning algorithm trained on an enormous amount of human-generated data.

And let's not speak of those so deep in sloth that they put it to use to degrade, not augment as they claim, humane creative recreational activities.

https://archive.ph/fg7HE


This seems a bit self-contradictory: you say LLMs mislead people and can't reason, then fault them for being good at helping people solve puzzles or win trivia games. You can't have it both ways.


> you say LLMs mislead people and can't reason

Why would you postulate these two to be mutually exclusive?

> then fault them for being good at helping people solve puzzles or win trivia games

They only 'help' in the same sense that a calculator would 'help' win at a hypothetical mental-math competition; that is the gist: robbing people of the creative and mentally stimulating processes that make the game(s) fun. But I've come to realize this is an unpopular opinion on this website, where being fiercely competitive is the only remarkable personality trait, so I guess it may be useful for this particular population.


I don't think you'll find many here believing anything outside tech is worth investing in; it's schizophrenic, isn't it.


While I agree with your point, “Superintelligence” is a far cry from what Meta will end up delivering with Wang in charge. I suppose that, at the end of the day, it’s all marketing. What else should we expect from an ads company :?


The Meta Super-Intelligence can dwell in the Metaverse with the 23 other active users there.


Not sure I agree. AI seems to be following the same three-stage path as many inventions: innovation > adoption > diffusion. LeCun and co. focus on the first, and LLMs in their current form appear to be incremental improvements; we're still using the same basis from more than ten years ago. FB and the industry are signalling a focus on harvesting the innovation, and that could last - but also take - many years or decades. Your fundamental researchers are not interested in (or the right people for) that position.


He's quoted in OP as calling them 'useful but fundamentally limited'; that seems correct, and not at all like he's denying their utility.


Yeah honestly I'm with the LLM people here

If you think LLMs are not the future then you need to come up with something better.

If you have a theoretical idea that's great, but take it to at least GPT-2 level first before writing off LLMs.

Theoretical people love coming up with "better ideas" that fall flat or have hidden gotchas when they get to practical implementation

As Linus says, "talk is cheap, show me the code".


Do you? Or is it possible to acknowledge a plateau in innovation without necessarily having an immediate solution cooked up and ready to go?

Are all critiques of the obvious decline in physical durability of American-made products invalid unless they figure out a solution to the problem? Or may critics of a subject exist without necessarily being accredited engineers themselves?


LLMs are probably always going to be the fundamental interface; the problem they solved was related to the flexibility of human languages, allowing us to have decent mimicries.

And while we've been able to approximate the world behind the words, it's just full of hallucinations, because the AIs lack axiomatic systems beyond a lot of manually constructed machinery.

You can probably expand the capabilities by attaching to the front-end, but I suspect that Yann is seeing limits to this and wants to go back and build up from the back-end of world reasoning, and then _among other things_ attach LLMs at the front-end (but maybe on equal terms with vision models, allowing for seamless integration of LLM interfacing _combined_ with vision for proper autonomous systems).


> because the AIs lack axiomatic systems beyond a lot of manually constructed machinery.

Oh god, that is massively underselling their learning ability. These models are able to extract and explain why jokes are funny without even being given basic vocabulary, yet there are hand-coded models out there with linguistic rules baked in from day one which still struggle with basic grammar.

The _point_ of LLMs, arguably, is their ability to learn any pattern thrown at them, given enough compute. The exception is learning how logical processes work, and pure LLMs only see "time" in the sense that a paragraph begins and ends.

At the least they have taught computers "how to language," which, in regard to how we interact with machines, is a _huge_ step forward.

Unfortunately the financial incentives are split between agentic model usage (taking the idea of a computerised butler further), maximizing model memory and raw learning capacity (answering all problems at any time), and long-range consistency (longer ranges give more stable results for a few reasons, but we're some way from seeing an LLM with 128k experts and 10e18 active tokens).

I think in terms of building the perfect monkey butler we already have most or all of the parts. With regard to a model which can dynamically learn on the fly... LLMs are not the end of the story, and we need something to allow the models to more closely tie their LS to the context. Frankly, the fact that DeepSeek gave us an LLM with LS was a huge leap, since previous attempts had been overly complex and had failed in training.


>If you think LLMs are not the future then you need to come up with something better

The problem isn't LLMs; the problem is that everyone is trying to build bigger/better LLMs or manually code agents around LLMs. Meanwhile, projects like MuZero are forgotten, despite being vastly more important for things like self-driving.


Why not both? LLMs probably have a lot more potential than what is currently being realized, but so do world models.


Isn't that exactly why he's starting a new company?


Of course the challenge with that is it's often not obvious until after quite a bit of work and refinement that something else is, in fact, better.


LLMs are the present. We will see what the future holds.


Well, we will see if Yann can.


The role of basic research is to get off the beaten path.

LLMs aren’t basic research when they have 1 billion users


That was obviously him getting sidelined. And it's easy to see why.

LLMs get results. None of Yann LeCun's pet projects do. He had ample time to prove that his approach is promising, and he didn't.


I agree. I never understood LeCun's statement that we need to pivot toward the visual aspects of things because the bitrate of text is low while visual input through the eye is high.

Text and language contain structured information and encode a lot of real-world complexity (or at least a model of it).

Not saying we won't pivot to visual data or world simulations, but he was clearly not the type of person to compete with other LLM research labs, nor did he propose any alternative that could be used to create something interesting for end-users.


Text and language contain only approximate information, filtered through human eyes and brains. Also, animals don't have language and can show quite advanced capabilities compared to what we can currently do in robotics. And if you do enough mindfulness you can dissociate cognition/consciousness from language. I think we are lured in because of how important language is for us humans, but intuitively it's obvious to me that language (and LLMs) are only a subcomponent, or even irrelevant for, say, self-driving or robotics.


Seems like that "approximation" is perfectly sufficient for just about any task.

That whole take about language being basically useless without a human mind to back it lost its legs in 2022.

In the meanwhile, what do those "world model" AIs do? Video generation? Meta didn't release anything like that. Robotics, self-driving? Also basically nothing from Meta there.

Meanwhile, other companies are perfectly content with bolting multimodal transformers together for robotics tasks. Gemini Robotics is a research example, while the modern Tesla FSD stack is a production-grade one. Gemini even uses a language transformer as a key part of its stack.


That's where the research is leading.

The issue is context. Trying to make an AI assistant with text-only inputs is doable but limiting. You need to know the _context_ of all the data, and without visual input most of it is useless.

For example "Where is the other half of this" is almost impossible to solve unless you have an idea of what "this" is.

But to do that you need to have cameras, and to use cameras you need position, object, and people tracking. And that is a hard problem that's not solved.

The hypothesis is that "world models" solve that with an implicit understanding of the world and the objects in context.


If LeCun's research had made Meta a powerhouse of video generation or general-purpose robotics - the two promising directions that benefit from working with visual I/O and world modeling as LeCun sees it - it could have been a justified detour.

But that sure didn't happen.


"LLMs get results" is quite the bold statement. If they get results, they should be getting adopted, and they should be making money. This is all built on hazy promises. If you had marketable results, you wouldn't have to hide 20+ billion dollars of debt financing in an obscure SPV. LLMs are the most baffling piece of tech. They are incredible, and yet marred by their non-deterministic, hallucinatory nature, and bound to fail in adoption unless you convince everyone that they don't need precision and accuracy and can do their business at 75% quality, just with less human overhead. It's quite the thing to convince people of, and that's why it requires the spend it's getting. A lot of we-need-to-stay-in-the-loop CEOs and bigwigs got infatuated with the idea, and most probably they just had their companies get addicted to the tech equivalent of crack cocaine. A reckoning is coming.


LLMs get results, yes. They are getting adopted, and they are making money.

Frontier models are all profitable. Inference is sold with a damn good margin, and the amount of inference AI companies sell keeps rising. This necessitates putting more and more money into infrastructure. AI R&D is extremely expensive too, and this necessitates even more spending.

A mistake I see people make over and over again is keeping track of the spending but overlooking the revenue altogether. Which sure is weird: you don't get from $0B in revenue to $12B in revenue in a few years by not having a product anyone wants to buy.

And I find all the talk of "non-deterministic hallucinatory nature" to be overrated. Because humans suffer from all of that too, just less severely. On top of a number of other issues current AIs don't suffer from.

Nonetheless, we use human labor for things. All AI has to do is provide a "good enough" alternative, and it often does.


In this comment you proceeded to basically reinvent the meaning of "profitable company", but sure. I won't even get into the point of comparing LLMs to humans, because I choose not to engage with anyone who doesn't have the human decency, humanistic compass, or basic philosophical understanding to see that putting LLMs and human labor on the same level to justify hallucinations and non-determinism is deranged and morally bankrupt.


You should go and work in a call center for a year, on the first line.

Then come back and tell me how replacing human labor with AI is "deranged and morally bankrupt".


Red herring. Just because some jobs are bad (and maybe shouldn't exist like that in the first place) doesn't make this movement humanistic.


> Frontier models are all profitable.

They generate revenue, but most companies are in the hole for the research capital outlay.

If open source models from China become popular, then the only thing that matters is distribution / moat.

Can these companies build distribution advantage and moats?


> Frontier models are all profitable.

This is an extraordinary claim and needs extraordinary proof.

LLMs are raising lots of investor money, but that's a completely different thing from being profitable.


You don't even need insider info - it lines up with external estimates.

We have estimates that range from 30% to 70% gross margin on API LLM inference prices at major labs, with 50% as a middle-of-the-road figure, and 10% to 80% gross margin on user-facing subscription services, with massively inflated error bars. We also have many reports that inference compute has come to outmatch training run compute for frontier models by a factor of x10 or more over the lifetime of a model.

The only source of uncertainty is: how much inference do the free tier users consume? Which is something that the AI companies themselves control: they are in charge of which models they make available to the free users, and what the exact usage caps for free users are.

Adding that up? Frontier models are profitable.
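
To make "adding that up" concrete, here is a purely illustrative back-of-envelope in Python. The 50% margin and the ~10x lifetime inference-to-training ratio come from the estimates above; the paid-usage share is a guess of mine, not a reported figure:

  # Hypothetical unit economics for one frontier model, in normalized units.
  training_cost  = 1.0                     # lifetime training-run cost (assumed)
  inference_cost = 10.0 * training_cost    # lifetime inference compute, ~10x training (assumed)
  paid_fraction  = 0.7                     # share of inference that is paid for (a guess)
  gross_margin   = 0.5                     # margin on paid inference (middle of the estimate range)

  paid_inference_cost = inference_cost * paid_fraction
  revenue = paid_inference_cost / (1.0 - gross_margin)   # price implied by the assumed margin

  print(revenue - (training_cost + inference_cost))   # +3.0 here; at paid_fraction = 0.5 it flips to -1.0

The sign of the result swings entirely on that free-tier assumption, which is exactly the uncertainty noted above.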

This goes against the popular opinion, which is where the disbelief is coming from.

Note that I'm talking LLMs rather than things like image or video generation models, which may have vastly different economics.


what about training?


I literally mentioned that:

> We also have many reports that inference compute has come to outmatch training run compute for frontier models by a factor of x10 or more over the lifetime of a model.


thank you, I literally can't read


Dario Amodei from Anthropic has made the claim that if you looked at each model as a separate business, it would be profitable [1], i.e. each model brings in more revenue over its lifetime than the total of training + inference costs. It's only because you're simultaneously training the next generation of models, which are larger and more expensive to train, but aren't generating revenue yet, that the company as a whole loses money in a given year.

Now, it's not like he opened up Anthropic's books for an audit, so you don't necessarily have to trust him. But you do need to believe that either (a) what he is saying is roughly true or (b) he is making the sort of fraudulent statements that could get you sent to prison.

[1] https://www.youtube.com/watch?v=GcqQ1ebBqkc&t=1014s
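
To illustrate the per-model accounting he describes, here is a toy sketch with made-up numbers (nothing below is from Anthropic's actual books):

  # The current model ("gen N"), viewed as its own business:
  gen_n_revenue   = 3.0   # revenue it brings in this year
  gen_n_inference = 1.0   # cost of serving that revenue
  gen_n_training  = 1.0   # its own training cost, paid before launch
  print(gen_n_revenue - gen_n_inference - gen_n_training)   # +1.0: profitable in isolation

  # Meanwhile the company is also training "gen N+1", which is bigger,
  # more expensive, and earning nothing yet:
  gen_n1_training = 3.0
  print(gen_n_revenue - gen_n_inference - gen_n1_training)  # -1.0: company-level loss this year

Each generation pencils out on its own, yet the company keeps reporting losses for as long as the next training run keeps growing.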


He's speaking in a purely hypothetical sense. The title of the video even makes sure to note "in this example". If it turned out this wasn't true of Anthropic, it certainly wouldn't be fraud.


OpenAI and Anthropic are making north of $4B/year in revenue, so some companies have figured out the money-making part. ChatGPT has some 800M users according to some calculations. Whether it's enough money today, or enough money tomorrow, is of course a question, but there is a lot of money. Users would not use them at this scale if they did not solve their problems.


OpenAI lost 12bn last quarter


It’s easy to make 1 billion by spending 10 billion. That’s not “making money” though, it is lighting it on fire.


People used to say this about Amazon all the time. Remember how Amazon basically didn't turn any real profits for two decades? The joke was that Amazon was a charitable organisation being funded by Wall Street for the benefit of humankind.

That didn’t last. People in the know knew that once you have a billion users and insane revenue and market power and have basically bought or driven out of business most of your competitors (Diapers.com, Jet.com, etc) you can eventually slow down your physical expansion, tighten the screws on your suppliers, increase efficiencies, and start printing money.

The VCs who are funding these companies are hoping that they have found the next Amazon. Many will probably go out of business, but some might join the ranks of trillion dollar companies.


So every company that doesn't turn any profits is actually Amazon in disguise?


If you've got nearly a billion users, and are multiplying your revenue on an annual basis, then yes. You're effectively showing that you're on a hyper-growth trajectory.

Hyper-growth is expensive because it's usually capital intensive. The trick is, once that growth phase is over, can you then start milking your customers while keeping a lid on costs? Not everyone can, but Amazon did, and most investors think OpenAI and Anthropic can as well.


This gets brought up a lot, and the reality is that the scale of Amazon's losses is completely dwarfed by what's going on now.


Amazon had paper losses and great cashflow. OpenAI has actual, real losses and terrible cashflow.


There is someone else at Facebook whose pet projects do not get results...


If you hire a house cleaner to clean your house, and the cleaner doesn't do well, would you eject yourself from the house? You would not. You would switch to a new cleaner.


But if we hire someone to do R&D on fully automating the house-cleaning process, we might not necessarily expect their office to be kept in a clean state by the researchers themselves every time we enter the room.


As far as the analogy goes, whatever happens and whoever messes up, Zuck, as the owner, of course takes the final blame.

That's why he is changing the team.

Still, people bring up this weird point that would be equivalent to you giving up your house equity for free because you fucked up on hiring and managing your house cleaner.


Sure, but that "someone else" is the man writing the checks. If the roles were reversed, he'd be the one being fired now.


Who are you referring to?


I think he means Zuckerberg himself; the metaverse isn't exactly a major success. But this is a false equivalency: the way he organized the company, only his vote matters, so he does what he wants.


LeCun is great and smart, of course. But he had his chance. It didn't go that well. Now Zuck wants somebody else to try.

Messi is the best footballer of our era. It doesn't mean he would play well in any team.


Messi would only play well in Barcelona. LeCun can produce high-quality research anywhere. It's not a great comparison.


It's a great comparison, and we are on the same page.

Zuck wants both great products and research. LeCun is great at producing high-quality research. So LeCun isn't quite a good fit at Facebook.


I agree, when you put it in product vs research context, it actually makes sense.


I don't think Messi could do it on a wet night in Stoke. Ronaldo could, though.

/s


Zuck hired John Carmack and got nothing out of it. On the other hand, it was arguably only LeCun keeping Meta from going 100% evil creepy mode, too.


And Carmack complained about the bureaucracy hell that is Facebook.


Coming from a small company like id Software, it must have been quite the shock.


Doesn't matter. Maybe read the whole memo.


Carmack laid the foundation for the all-in-one VR headsets.


Hopefully one day, in a galaxy far far away, someone builds something on those foundations.


You joke, but the Star Wars games - especially the pinball one, for me at least - are some of the best experiences available on Quest headsets. I've been playing software pinball (as well as the real thing) since the 80s, and this is one of my favorite ways to do it now, which I will keep coming back to.


> But… I suppose Zuckerberg knows what he wants, which is AI slopware and not truly groundbreaking foundation models.

When did they make groundbreaking foundation models though? DeepMind and OpenAI have done plenty of revolutionary things; what did Meta AI do while being led by LeCun?


I won't be surprised if Musk hires him. But I hear LeCun hates Musk's guts.


Musk doesn't appear interested in AI research - he's basically doing the same as Meta and just pursuing me-too SOTA LLMs and image generation at X.ai.


Musk only cares about AI as far as it can be used to replace all sources of information with versions that will say whatever he wants and spout his worldview of the day.

Musk cares about AI research as much as he cared about Path of Exile


Musk wants people who can deliver results, and fast.

If LeCun can't cough up some research that's directly applicable to Grok or Optimus, Musk wouldn't want him.


What does Meta even want with AI?

I suppose they could solve superintelligence and cure cancer and build fusion reactors with it, but that's 100% outside their comfort zone - if they manage to build synthetic conversation partners and synthetic content generators as good as or better than the real thing, the value of having every other human on the planet registered to one of their social networks goes to zero.

Which is impossible anyway - I use Facebook to maintain real human connections and keep up with people who I care about, not to consume infinite content.


At a $1.6T market cap it's very hard to grow the company 10x or more anymore by doing what's in their comfort zone, and they've got a lot of money to play with to find easier-to-grow opportunities. If Zuckerberg were convinced he could do that by selling toothpicks, they'd have a go at the toothpick business. They went after the "metaverse" first, then AI. Both are just very-fast-growth options which happen to be tech-focused, because that's the only way you generate new comparable value as a company (unless you're sitting on a lot of state-owned oil) in the current markets.


You missed an opportunity to use paperclips instead of toothpicks, as your example.

Would be very inline with the AI angle.


they are out for your clicks and attention minutes

If OpenAI can build a "social" network of completely generated content, that could kill Meta. Even today I venture to guess that most of the engagement on their platforms is not driven by real friends, so an AI-driven platform won't be too different, or it might make content generation so easy that your friends engage again.

Apart from that, the ludicrous vision of the metaverse seems much more plausible with highly realistic world models.


How do LLMs help with clicks and attention minutes? Why do they spend $100B+ a year on AI capex, more than Google and Microsoft, which actually rent AI compute to clients? What are they going to do with all that compute? It's all so confusing.


Browse TikTok and you already see AI-generated videos popping up. It could well be that the platform with the most captivating content will not be a "social" network but one consisting of a tailor-made feed for you. That could undermine the business model of the existing social networks - unless they just fill them with AI-generated content themselves. In other words: Facebook should really invest in good video-generation models to keep their platforms ahead.


It might be just me, but in my opinion Facebook's platforms are way past the "content from your friends" phase and are full of cheaply peddled viral content.

If that content becomes even cheaper, of higher quality, and highly tailored to you, that is probably worth a lot of money, or at least worth not losing your entire company to a new competitor.


But practically speaking, is Meta going to be generating text or video content itself? Are they going to offer some kind of creator tools so you, as a user, can create video, and they need the compute for that? Do they even have a video-generation model?

The future is here, folks: join us as we build this giant slop machine in order to sell new socks to boomers.


For all of your questions Meta would need a huge research/GPU investment, so that still holds.

In any case, if I had to guess, we will see shallow things like the Sora app, a video-generation TikTok-style social network, and deeper integrations like fake influencers and content generation that fits your preferences and ad publishers' preferences.

A more evil incarnation of this might be a social network where you aren't sure who is real and who isn't. This will probably be a natural evolution of the need to bootstrap a social network with people and then replace them with LLMs.


Sad to hear it has come to attention minutes, used to be seconds.


Zuckerberg knows what he wants, but he rarely knows how to get it. That's been his problem all along. Unlike others, though, he isn't scared to throw ridiculous amounts of money at a problem and buy companies that do things he can't get done himself.


There's also the aspect of control - because of how the shares and ownership are organized, he answers essentially to no one. In other companies, burning this much cash, as with VR and now AI, without any tangible results would have gotten him ejected a long time ago.


Zuck did this on purpose, humiliating LeCun so he would leave. Despite being proved wrong about LLM capabilities such as reasoning, LeCun remained extremely negative, which is not exactly inspiring leadership for the Meta AI team; he had to go.


But LLMs still can't reason... in any reasonable sense. No matter how you look at it, it is still a statistical model that guesses the next word; it doesn't think/reason per se.


It does not guess the next word; the sampler chooses subword tokens. Your explanation can't even account for why it generates coherent words.
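
For what it's worth, here is a minimal sketch of what "the sampler chooses subword tokens" means, with a toy vocabulary and made-up logits rather than a real model:

  import math, random

  # The model emits one logit per subword token; the sampler turns them
  # into probabilities (softmax with temperature) and draws one token.
  vocab  = ["re", "ason", "ing", " is", " hard"]   # toy subword pieces
  logits = [2.1, 0.3, 1.7, 0.9, -0.5]              # made-up model outputs
  temperature = 0.8

  scaled = [l / temperature for l in logits]
  m      = max(scaled)
  exps   = [math.exp(s - m) for s in scaled]       # numerically stable softmax
  probs  = [e / sum(exps) for e in exps]

  token = random.choices(vocab, weights=probs, k=1)[0]
  print(token)   # a subword piece, not necessarily a whole word

Coherent words emerge because training pushes probability mass toward valid continuations of the subword sequence so far, not because the model looks whole words up.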


It is insane to think this in 2025 unless you define "reasoning" as "the thing I can do that LLMs cannot"


Reasoning is the act of figuring out how to solve a problem for which you have no previous training set. If an AI can reason, and you give it a standard task of "write me a Python file that does x and y", it should be able to complete that task without ever being trained on Python code. Or English in general.

The way it would solve that problem would look more like some combination of Hebbian learning and MuZero, where it starts to explore the space around it in terms of interactions, information gathering, information parsing, and forming associations, until it eventually understands that your task involves writing bytes to a file in a certain structure that, when executed, produces certain output, and the rules around that structure that make it give that output.

And it would be able to do this whether running as a model on your computer or as a robot that can type on a keyboard, all from the same code.

LLMs appear to "reason" because most people don't actually reason - a lot of people, even in technical fields, operate on a principle of information lookup. I.e., they look at the things that they have been taught to do, figure out which problem fits closest, and repeat the steps with a few modifications along the way. LLMs pretty much do the same thing. If you operate like this, then sure, LLMs "reason". But there is a reason why LLMs are barely useful in actual technical work - under the hood, to make them do things autonomously, you basically have to write wrapper code/prompts that often take as long to write and fine-tune as the actual code itself.


It is insane to think this in 2025 unless you define "reasoning" as some mechanical information lookup. This thinking (ironically) degrades the meaning of reasoning and of intelligence in general.


> not truly groundbreaking foundation models.

Where is any proof that Yann LeCun is able to deliver that? He's had way more resources than any other lab during his tenure, and yet has nothing substantial to show for it.


Would love to have been a fly on the wall during one of their 1:1’s.


> slopware

Damn did you just invent that? That's really catchy.


Slop is already a noun.


When I first saw their LLM integration on Facebook I thought the screenshot was fake and a joke


Yes, that was such a bizarre move.


Meta had John Carmack and squandered him. It seems like Meta can get amazing talent but has no idea how to get any value or potential out of them.


No, it was because LeCun had no talent for running real-life teams and was stuck in a weird place where he hated LLMs. Frankly, he was wasting Meta's resources. And making him report to Wang was a way to force him out.


Oh wow, is that true? They made him report to the director of the Slop Factory? Brilliant!


It wasn’t boneheaded. It was done to make Yann leave. Meta doesn’t want Yann for good reason.

Yann was largely wrong about AI. He derided LLMs as stochastic parrots and a dead end. It's now utterly clear how much utility LLMs have, and that whatever these LLMs are doing is much more than stochastic parroting.

I wouldn't give money to Yann; the guy is a stubborn idiot and closed-minded. Whatever he's doing won't even touch LLM technology. He was so public in deriding LLMs that I see no way he will backpedal from that.

I don't think LLMs are the end of the story for AGI. But I think they are a stepping stone. Whatever AGI is in the end, LLMs or something close to them will be a modular component or aspect of the final product. For LeCun to dismiss even the possibility of this is idiotic. It's a horrible investment move to give money to Yann to pursue AGI without even considering LLMs.



