No. Basically, requests are processed together in batches, and the order they're listed in affects the results, because the grid of tiles the GPU ends up processing is different depending on the order the requests came in.
So if you want batching + determinism, you need the same batch with the same order, which obviously doesn't work when there are N+1 clients instead of just one.
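A toy illustration of why order matters (my own sketch, nothing to do with any specific inference engine): floating-point addition isn't associative, so summing the same values grouped differently, which is effectively what a different batch layout does to a GPU reduction, can change the result.

```python
# Same four values, two different groupings of the additions.
# 1e16 + 1.0 rounds back to 1e16 in float64, so the grouping decides
# whether the small terms survive the reduction or get absorbed.
vals = [1e16, 1.0, -1e16, 1.0]

order_a = ((vals[0] + vals[1]) + vals[2]) + vals[3]  # left-to-right
order_b = (vals[0] + vals[2]) + (vals[1] + vals[3])  # pairwise grouping

print(order_a, order_b)  # 1.0 2.0 -- same inputs, different answers
```

Scale that up to thousands of parallel reductions per attention layer and you get outputs that depend on how requests were packed into the batch.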
Small, subtle errors that only show up on certain execution paths could be one reason. You might place things differently on the GPU depending on how large the batch is, if you've found one approach to be faster for batch_size < 1024 but another for batch_size > 1024. As the number of concurrent incoming requests goes up, you increase batch_size. That's just one possibility; there could be a multitude of reasons, and it's really hard to reason about until you sit with the data in front of you. vLLM has had bugs with this sort of thing too, so it wouldn't surprise me.
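To make the batch-size-dependent dispatch concrete, here's a hypothetical sketch (names and threshold invented, not from any real kernel): both reduction strategies are "correct", but they group the additions differently, so results can drift between the small-batch and large-batch paths.

```python
THRESHOLD = 1024  # hypothetical cutoff where the "wide" strategy wins

def sum_sequential(xs):
    # Simple running sum, as a small-batch path might do it.
    total = 0.0
    for x in xs:
        total += x
    return total

def sum_pairwise(xs):
    # Tree reduction, as a wide parallel kernel might do it.
    if len(xs) == 1:
        return xs[0]
    mid = len(xs) // 2
    return sum_pairwise(xs[:mid]) + sum_pairwise(xs[mid:])

def reduce_batch(xs):
    # The dispatch described above: strategy depends on batch size.
    if len(xs) < THRESHOLD:
        return sum_sequential(xs)
    return sum_pairwise(xs)

xs = [1e16, 1.0, -1e16, 1.0]
print(sum_sequential(xs), sum_pairwise(xs))  # 1.0 vs 0.0
```

Both functions are reasonable implementations of "sum", yet they disagree on the same input, so a bug (or just drift) can hide entirely on one side of the threshold.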
No, I'm not sure how that'd make sense. Either you're making the correct (expected) calculations, or you're getting them wrong. Depending on the type of error and how wrong it is, the output could range from "used #2 in attention instead of #1", so "blue" instead of "Blue" or whatever, all the way to completely incoherent, garbled text.
I accept that errors are more likely to decrease "intelligence". But I don't see how increased load, through batching, is any more likely to increase errors than to decrease them.
Apart from the skepticism and anti-AI hype, one downside that gets pointed out is that AI has the potential to dissuade people from learning. Why go through the pains of learning something when an AI can do it faster and better?
I do get the argument that it's a tool and everyone will have to adapt around it. But at some point it can be extremely demoralizing that a PhD project that took months or years can be done by AI in a fraction of the time.
3 minutes is too long for exploratory searches, where I'm not sure what I'm even looking for. And 3 minutes feels too short for deep research, where I'm expected to trust some complex result that I either don't know enough about myself (that's why I'm searching for it) or know well enough that the AI probably can't do anything I couldn't do within a couple of minutes.
I think the sweet spot for AI results is around 10-30 seconds. That's fast enough that I'm willing to wait for the results even if I'm not sure I'm exploring the right topic, and fast enough that even if I knew exactly what to search for, it can give me summarized results faster than I could read through them on my own.
Can someone explain to me why you would want to do something like in the example of calculating age based on birthdate? Why wouldn't you do that within an app or within code rather than having a database function?
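For reference, the app-side version the question has in mind is short; a sketch in Python (my own illustration, not from the article):

```python
from datetime import date

def age(birthdate, today=None):
    # Age in whole years as of `today`.
    today = today or date.today()
    years = today.year - birthdate.year
    # Subtract one if this year's birthday hasn't happened yet.
    if (today.month, today.day) < (birthdate.month, birthdate.day):
        years -= 1
    return years

print(age(date(1990, 6, 15), today=date(2024, 6, 14)))  # 33
print(age(date(1990, 6, 15), today=date(2024, 6, 15)))  # 34
```

The usual argument for putting this in the database instead is that the logic then lives next to the data, so every client (reports, ad-hoc SQL, other apps) computes it the same way; but I'd be curious to hear the article's reasoning too.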
Do we really know why LLMs seem to score highest on Python-related coding tasks? I would think there are equally good examples of JavaScript/C++/Java code to train on, but I always see Python with the highest scores.
I'm genuinely asking this out of curiosity and a bit of naivety — are there as many international students pursuing advanced degrees in China as there are in the U.S.? I don't know the answer and would love to hear from folks who do.
I believe this is actually the second time Google has tried to buy this company, too. They had to give them an offer that was too good to refuse.
While it seems like we aren't getting many people in the comments who have actually used the product, I can tell you it checks a lot of the boxes that help people sleep better at night with customer data in the cloud.