How you can “vet” something technical that you can’t even do yourself is beyond me.
Vetting things is very likely harder than doing the thing correctly.
Especially when the thing you are vetting is designed to look correct more than to actually be correct.
You can picture a physics class where the teacher gives a trick problem/solution and 95% of the class doesn’t catch it until the teacher walks back through it and explains.
Hey, I just replied to a sibling comment of yours that sort of addresses your commentary; mentioning it here in case you didn't see it, since I didn't reply to you directly. One thing that reply didn't cover, which I'll add here: I disagree that an LLM is actually designed to look correct more than it tries to be correct. I might have a blind spot, but I don't think that's a logical conclusion about LLMs; if you have special insight into why it's the case, please do share. Looking correct without being correct does happen, of course, but I don't think it's intentional, part of the explicit design, or even inherent to the design. As I said, open to being educated otherwise.
> designed to look correct more than it's trying to actually be correct
This might not quite be true, strictly speaking, but a very similar statement definitely is. LLMs are highly prone to hallucinations, a term you've probably heard a lot in this context. One reason is that they are trained to predict the next word in a sequence, and in that game it's almost always better to guess than to output 'I'm not sure' when you might be wrong. LLMs therefore don't really build up a model of the limits of their own 'knowledge'; they just guess until their guesses get better.
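To make that concrete, here's a toy sketch (my own illustration, not anything from how LLMs are actually trained or scored; all the probabilities are made up) of why an objective that only rewards correct answers favors guessing over admitting uncertainty:

```python
import random

# Toy illustration: under an accuracy-style objective there is no credit
# for saying "I'm not sure", so a model that always guesses scores higher
# in expectation than one that abstains when uncertain, even though it is
# wrong (i.e. confidently hallucinating) more often.

random.seed(0)
N = 100_000
P_KNOWN = 0.7         # fraction of questions the model actually "knows"
P_LUCKY_GUESS = 0.25  # chance a blind guess happens to be right

def score(always_guess: bool) -> float:
    correct = 0
    for _ in range(N):
        if random.random() < P_KNOWN:
            correct += 1                                   # knows the answer
        elif always_guess and random.random() < P_LUCKY_GUESS:
            correct += 1                                   # lucky guess
        # abstaining ("I'm not sure") scores 0, the same as a wrong guess
    return correct / N

print(f"always guess:        {score(True):.3f}")   # ~0.775
print(f"abstain when unsure: {score(False):.3f}")  # ~0.700
```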
These hallucinations are often hard to catch, partly because the LLM sounds equally confident whether it is hallucinating or not. It's this tendency that makes me nervous about your use case. I recently asked an LLM about world energy consumption, and when it couldn't find an answer online in the units I asked for, it took a number from a website and changed (not converted) the units. I almost missed it, because the number itself matched the source website.
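For what it's worth, this is roughly the sanity check I ended up doing by hand; the units and the figure below are made up for illustration, not the actual numbers from that exchange:

```python
# If a source reports energy in TWh but you asked for EJ, the number has to
# be converted, not just relabeled. (Illustrative value, not the real one.)

TWH_TO_EJ = 3.6e15 / 1e18     # 1 TWh = 3.6e15 J; 1 EJ = 1e18 J

reported_twh = 170_000        # hypothetical source value in TWh
converted_ej = reported_twh * TWH_TO_EJ

print(f"{reported_twh} TWh = {converted_ej:.0f} EJ")  # 612 EJ
# Relabeling "170,000 TWh" as "170,000 EJ" -- what the LLM did -- is off
# by a factor of roughly 278.
```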
Stepping back, I actually agree that you can learn new things like this from LLMs, but you either need to be able to verify the output or the stakes need to be low enough that it doesn't matter if you can't. In this case, even if you can verify the math, can you be sure that it's doing the right calculation in the right way? Did it point out the common mistakes that beginners make? Did it notice that you're attaching the support beam incorrectly?
Chances are, you've built everything correctly and it will be fine. But the chances of a mistake are clearly much higher than if you talked to an experienced human (professional or otherwise).