Justine may have addressed unreliable output by using `--temp 0` [0]. I'd agree that while that makes output deterministic, there are other axes of reliability on which it may still be poorly suited for pipes.
[0]
> Notice how I'm using the --temp 0 flag again? That's so output is deterministic and reproducible. If you don't use that flag, then llamafile will use a randomness level of 0.8 so you're certain to receive unique answers each time. I personally don't like that, since I'd rather have clean reproducible insights into training knowledge.
`--temp 0` makes it deterministic. What can make output reliable is `--grammar`, which the blog post discusses in detail. It's really cool. For example, the BNF expression `root ::= "yes" | "no"` forces the LLM to only give you a yes/no answer.
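The mechanism behind `--grammar` can be sketched in a few lines: at each decoding step the sampler masks out any token the grammar cannot accept, so the model's preference for rambling never reaches the output. A toy version (hypothetical vocabulary and scores; the real llamafile/llama.cpp implementation walks a GBNF state machine over the logits):

```python
# Toy sketch of grammar-constrained sampling. The vocabulary and scores
# here are invented for illustration; real grammars operate per-token.

def constrained_pick(scores, allowed):
    """Pick the highest-scoring token among the grammar-legal ones."""
    legal = {tok: s for tok, s in scores.items() if tok in allowed}
    return max(legal, key=legal.get)

# The model may "want" other continuations, but root ::= "yes" | "no"
# only admits two of them.
scores = {"yes": 1.2, "no": 0.7, "maybe": 3.1, "It": 2.5}
print(constrained_pick(scores, allowed={"yes", "no"}))  # -> yes
```

Even though `"maybe"` and `"It"` score higher, the grammar excludes them, which is exactly why a yes/no grammar always yields a parseable answer.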
That only works up to a point. If you are trying to transform text-based CLI output into a JSON object, even with a grammar, you can get variation in the output. A simple example is field or list ordering. Omission is the really problematic one.
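A quick sketch of the distinction (field names invented for illustration): two grammar-valid JSON outputs that differ only in key order can be normalized downstream, but an output that silently drops a field cannot be repaired after the fact.

```python
import json

# Three outputs that could all satisfy the same JSON grammar:
a = '{"name": "eth0", "mtu": 1500}'
b = '{"mtu": 1500, "name": "eth0"}'  # same fields, different order
c = '{"name": "eth0"}'               # field silently omitted

def canon(s):
    """Canonicalize JSON text so key ordering stops mattering."""
    return json.dumps(json.loads(s), sort_keys=True)

print(canon(a) == canon(b))  # True  -- ordering is fixable downstream
print(canon(a) == canon(c))  # False -- omission is not
```

Ordering variation is an annoyance you can engineer around; omission means the information never made it into the pipe at all.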