I looked into this. If I am remembering correctly, the price was higher. It is just easier to connect a mini PC to an HDMI port and bypass all of the built-in TV functionality.
There has long been speculation that a smart TV could connect to an open wireless access point or, more realistically, refuse to operate without internet access, perhaps after a certain number of power-on hours.
It’s not a process monitor, really, but to me the AWS Lightsail monitor tab feels like this. The “sustainable” line hits me right in the OCD, keeping me grinding on the workload's CPU usage to keep extra spend at zero.
I have this as well, but run a heavily locked-down and isolated BIND server, with NSD and Unbound for external authoritative and internal caching DNS, respectively.
It's easy to feed an RBL to Unbound to do Pi-hole-type work. I use pf to transparently redirect all external DNS requests to my local Unbound server, but I still get the BIND automation around things like DNSSEC, DHCP DDNS, and ACME cert renewals.
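A rough sketch of what that setup can look like; the interface name, address, and blocked zones are placeholders, and the pf rule below uses the OpenBSD-style rdr-to form (FreeBSD's pf still uses the older rdr syntax):

    # unbound.conf: blocklist entries generated from an RBL feed
    server:
        local-zone: "ads.example.test." always_nxdomain
        local-zone: "tracker.example.test." always_nxdomain

    # pf.conf: catch DNS queries headed anywhere else and force them to the local resolver
    int_if = "em1"             # LAN interface (placeholder)
    dns_host = "192.168.1.1"   # box running Unbound (placeholder)
    pass in quick on $int_if inet proto { tcp, udp } \
        from any to ! $dns_host port 53 rdr-to $dns_host port 53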
It's also easy to run the 120b on CPU if you have the resources. I had it running on my home LLM CPU inference box in only as long as it took to download the GGUFs, git pull, and rebuild llama-server.
I had it running at 40 t/s with zero effort and 50 t/s with a bit of tweaking.
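Roughly the workflow, for anyone who wants to reproduce it; the GGUF filename, thread count, and context size below are placeholders, not recommendations:

    # update and rebuild llama.cpp's server
    git pull
    cmake -B build
    cmake --build build --config Release -j

    # serve the downloaded GGUF (filename is hypothetical)
    ./build/bin/llama-server -m gpt-oss-120b.gguf \
        --threads 64 --ctx-size 8192 --host 0.0.0.0 --port 8080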
It's just too bad that even the 120b isn't really worth running compared to the other models that are out there.
It really is amazing what ggerganov and the llama.cpp team have done to democratize LLMs for individuals that can't afford a massive GPU farm worth more than the average annual salary.
2xEPYC Genoa w/768GB of DDR5-4800 and an A5000 24GB card.
I built it in January 2024 for about $6k and have thoroughly enjoyed running every new model as it gets released. Some of the best money I’ve ever spent.
I've seen some mentions of pure-CPU setups being successful for large models, using old EPYC/Xeon workstations off eBay with 40+ cores. Interesting approach!
Wow, that's not bad. It's strange; for me it is much, much slower on a Radeon Pro VII (also 16GB, with a memory bandwidth of 1TB/s!) and a Ryzen 5 5600, also with 64GB. It's basically unworkably slow. Also, when I check ollama ps I only see 100% CPU; the GPU is not being used at all :( It's also counterproductive because the model is just too large for 64GB.
I wonder what makes it work so well on yours! My CPU isn't much slower and my GPU is probably faster.
AMD basically decided they wanted to focus on HPC and data center customers rather than consumers, and so GPGPU driver support for consumer cards has been non-existent or terrible[1].
The Radeon Pro VII is not a consumer card, though, and works well with ROCm. It even has datacenter-"grade" HBM2 memory that most Nvidia cards don't have. Official support has since been dropped, but ROCm of course still works fine. It's nearly as fast in Ollama as my 4090 (which I don't use for AI regularly, but I play with it sometimes).
I generally download the safetensors and make my own GGUFs, usually at Q8_0.
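In case it's useful, this is roughly that workflow with llama.cpp's own tooling; the model path and file names are placeholders:

    # HF safetensors -> GGUF at f16, then quantize to Q8_0
    python convert_hf_to_gguf.py /models/some-model \
        --outtype f16 --outfile some-model-f16.gguf
    ./build/bin/llama-quantize some-model-f16.gguf some-model-Q8_0.gguf Q8_0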
Is there any measurable benefit to your dynamic quants at that quant level?
I looked at your dynamic quant 2.0 page, but all the charts and graphs appear to cut off at Q4.
There definitely is a benefit to dynamically selecting layers to be at different bit rates - I wrote about the difference between naively quantizing and selectively quantizing: https://unsloth.ai/blog/deepseekr1-dynamic
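Our actual pipeline does more than this, but as a rough illustration of the general idea with stock llama.cpp, llama-quantize can already hold a few sensitive tensors at higher precision while the bulk of the weights drop lower (the tensor choices and file names here are just examples):

    # quantize most tensors to Q4_K_M but keep embeddings and the output tensor at Q8_0
    ./build/bin/llama-quantize \
        --token-embedding-type q8_0 \
        --output-tensor-type q8_0 \
        model-f16.gguf model-selective.gguf Q4_K_M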
Thanks Daniel. I know you upload them, but I was hoping for some solid numbers on your dynamic q8 vs a naive quant. There doesn't seem to be anything on either of those links to show improvement at those quant levels.
My gut feeling is that there's not enough benefit to outweigh the risk of putting a middleman in the chain of custody from the original model to my NVMe.
However, I can't know for sure without more testing than I have the time or inclination for, which is why I was hoping there had been some analysis you could point me to.
Trees are largely carbon. I have heard a number of weak “yeah, but…” arguments that try to diminish the fact, but a central, common-sense thesis remains.
If we are truly worried about climate change and are unable to curb our consumption, then we should plant as many trees as we can and aggressively shift as much of our long-lived infrastructure as possible to wood products.
There are good reasons to green up our cities, but [edit: capturing] global CO2 isn't one of them.
Living things typically don't store carbon long-term unless you take extra steps, like burying them in bogs. Even if we were to collectively invest in sequestration, it'd be more effective with trees that are lower-maintenance, more densely and conveniently situated, and planted where residents won't complain that a tree needs to be kept longer or removed sooner. Perhaps we'd choose something else entirely, like algae.
Even if it's not typical, when circumstances are right they can store a lot of carbon in a hurry.
My garage is on the same level as my basement, so there's a 5' retaining wall on either side of it. Leaves blow around and get trapped in the corners. Once I didn't bother cleaning them up for several years, and when I finally did, I had to move several hundred pounds of new soil into my back yard because of how many leaves had decayed there. Small trees were growing in it.
Similar story with the drainage on the side of my house. Not long after I moved in, a heavy rain filled my basement with water. I had to rent a machine to dig a trench on either side so that the back yard would stop becoming a pond when it rained. I'm sure this wasn't a problem in the '60s when the house was built, but over time the decaying leaves from my neighbor's tree raised the ground level by something like 1.5 ft and spoiled the original slope (I eventually found the original grade; there was a whole brick patio down there).
We may have to be a bit more intentional than "plant a bunch of trees" to get this effect, but I think it's worth exploiting.
I'm a generally pro-tree person, but I do caution against this gung-ho sentiment because it tends to lead people down the path of 1) seeing a forest as just the trees and 2) seeing it as a single species of tree. That's how you get monocultures, and the lack of biological diversity in monocultures threatens the entire fake forest you worked so hard to plant.
So, plant trees, yeah, but smartly: in areas initially protected from the animals that will eat the saplings, with more than one kind of tree, and introducing other vegetation over time. All of the extra complexity will slow the work down and get people questioning why it's taking so long to get a forest, but at least you'll get something resembling a forest that will be able to sustain itself without human intervention long after we're dead.
Are you sure? There are currently 3 trillion trees on Earth, and they only absorb about 20% of greenhouse gas emissions (~9.5 GT of CO2) per year [1]. Apparently not every tree absorbs as much CO2 as your assumption implies. Adding 1 trillion more trees would have a negligible effect.
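Back-of-the-envelope from those figures: ~9.5 GT spread across 3 trillion trees works out to roughly 3 kg of CO2 per tree per year, so an extra trillion trees at that average rate would absorb on the order of 3 GT per year, a single-digit percentage of total emissions.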
It's not just the absorption, as any stroll anywhere near a forest should tell you. Trees somehow cool areas dramatically, not just through shade, and change local systems substantially. Anyhow, if you want papers, there was one discussed here just recently. [1]
It's expected that planting a trillion trees (amounting to global land coverage of ~8%), which is analogous to pre-industrial times, would reduce overall heating by some 25% (!!) by itself. This also opens the door to yet another poorly understood feedback system - CO2 increases greenery, which increases trees, which decreases temperatures far more than previously expected.
I'm totally on board with planting trees, but as a climate solution the accounting doesn't make sense. We're burning a hundred million barrels of oil a day or so. If you try to compensate with forests, you'll quickly start to wonder where you're going to fit them all, and where the water is coming from.
It's almost always going to be vastly easier to reduce emissions than to try to re-absorb them.
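To put rough numbers on it: burning a barrel of crude releases something like 0.4 tonnes of CO2, so a hundred million barrels a day is on the order of 15 GT of CO2 a year from oil alone. At the commonly cited ~20 kg of CO2 absorbed per mature tree per year, offsetting just the oil would take somewhere in the neighborhood of 750 billion additional mature trees.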
In my opinion, GPT-SoVITS is the best if you can put in the effort. I'm still using v2 since the output is so good.
It's also the best multilingual one in my testing on Japanese inputs.