Hacker Newsnew | past | comments | ask | show | jobs | submit | noman-land's commentslogin

Great domain name. Great site. Bookmarked immediately and shared.

Thank you!

It's not striking because a person who wears an eye patch and has a tendency towards dark clothing is stastically more likely to be involved in an accident where seeing and being seen are important.

The original commentor found it striking.

Hey, thanks for this. Been trying it out and it's very fast but seems to hear more speakers than are in the audio. I didn't see a way to tweak speaker similarity settings or merge speakers in some way. Any advice?

Thanks for checking it out!

Yeah unfortunately, since the diarization is acoustic features based, it really does require high recorded voice fidelity/quality to get the best results. However, I just added another knob to the Diarizer class called mer_cos, which controls the speaker merging threshold. The default is 0.875, so perhaps try lowering to 0.8. That should help.

I'll also get around to adding a oracle/min/max speakers feature at some point, for cases where you know the exact number of speakers ahead of time, or wanna set upper/lower bounds. Gotten busy with another project, so haven't done it yet. PR's welcome though! haha


Thanks, `mer_cos` definitely gets me closer. I appreciate that. Yeah, I was thinking providing a param for the expected number of speakers would be nice. I'll check out the codebase and see if that's something I can contribute :).

Yeah would love contributions! Here's a brief overview of how I think it can be done:

Senko has two clustering types, (1) spectral for audio < 20 mins in length, and (2) UMAP+HDBSCAN for >= 20 mins. In the clustering code, spectral actually already supports orcale/min/max speakers, but UMAP+HDBSCAN doesn't. However, someone forked Senko and added min/max speakers to that here (for oracle, I guess min = max): https://github.com/DedZago/senko/commit/c33812ae185a5cd420f2...

So I think all that's required is basically just testing this thoroughly to make sure it doesn't introduce any regressions in clustering quality. And then just wiring the oracle/min/max parameters to the Diarizer class, or diarize() func.


You doing code completion and agentic stuff successfully with local models? Got any tips? I've been out of the game for [checks watch] a few months and am behind on the latest. Is Cline the move?

I haven't bothered doing code completion locally yet, though its something I want to try with the QWEN model. I'm mostly using it to generate/fix code CLI style.

I had some pretty decent but very non-state-of-the-art success with it even cobbled together with LM Studio and VSCode plugins. I'm excited to keep trying it over the next months and years.

As a practitioner I also inherently believe in well written software but as a lifelong learner, things change, and evolve. There is absolutely no reason why software today has to be written like software of yesterday.

There is no need to be so prescriptive about how software is made. In the end the best will win on the merits. The bad software will die under its own weight with no think pieces necessary.

On the other hand, code might be becoming more like clay than like LEGO bricks. The sculptor is not minding each granule.

We don't know yet if there's long term merit in this new way of crafting software and telling people not to try it both won't work, and honestly looks like old people yelling at clouds.


> There is absolutely no reason why software today has to be written like software of yesterday.

I get what you're saying, but the irony is that AI tools have sort of frozen the state of the art of software development in time. There is now less incentive to innovate on language design, code style, patterns, etc., when it goes outside the range of what an LLM has been trained on and will produce.


> frozen the state of the art

Personally I am experimenting with a lot more data-driven, declarative, correct-by-construction work by default now.

AI handles the polyglot grunt work, which frees you to experiment above the language layer.

I have a dimensional analysis typing metacompiler that enforces physical unit coherence (length + time = compile error) across 25 languages. 23,000 lines of declarative test specs compile down to language-specific validation suites. The LLM shits out templates; it never touches the architecture.

We are still at very very early days.

Specs for my hobby physical types metacompiler tests:

https://gist.github.com/ctoth/c082981b2766e40ad7c8ad68261957...


I have been writing my own DSL (and LSP) for making web apps and LLMs can do a pretty decent job of writing this new language.

https://github.com/williamcotton/webpipe/tree/webpipe-2.0

https://github.com/williamcotton/webpipe-lsp/tree/webpipe-2....


Citation needed. I see no reason at all why that's true any more than the screwdriver freezing the state of home design in time.

LLMs aren't like a screwdriver at all, the analogy doesn't work. I think I was clear. LLMs aren't useful outside the domain of what they were trained on. They are copycats. To really innovate on software design means going outside what has been done before, which an LLM won't help you do.

No, you weren't clear, nor are you correct: you shared FUD about something it seems you have not tried, because testing your claims with a recent agentic system would dispel them.

I've had great success teaching Claude Code use DSLs I've created in my research. Trivially, it has never seen exactly these DSLs before -- yet it has correctly created complex programs using those DSLs, and indeed -- they work!

Have you had frontier agents work on programs in "esoteric" (unpopular) languages (pick: Zig, Haskell, Lisp, Elixir, etc)?

I don't see clarity, and I'm not sure if you've tried any of your claims for real.


This is desperate rebuttal from ignorance.

My point stands. You haven't innovated, you've just leaned on an LLM to work with your unoriginal DSL. I'm sure it's worth 100 megawatt-hours.


> In the end the best will win on the merits.

The last six decades of commercial programming don't exactly bear this out...

The real lesson is that writing software is such a useful, high-leverage activity that even absolutely awful software can be immensely valuable. But that doesn't tell us that better software is useless, it just tells us it is not absolutely necessary.


Software engineers are desperate to have their work be like machining aircraft parts.

It’s a tool. No one cares about code quality because the person using your code isn’t affected by it. There are better and worse tools. No one cares whether a car is made with SnapOn tools or milled on HAAS machines. Only that it functions.

We know there is no long term merit to this idea just looking back at the last 40 years of coding.


There needs to be a RICO case on the entire criminal abuse network which operated as an open secret for over three decades.

Andrew Windsor needs to be extradited to the US and face trial.

We need a referendum about whether we want child rape to be a tool of US policy. I say no.


> Andrew Windsor needs to be extradited to the US and face trial.

Reciprocity is a thing...


Yes! I don't understand how any American can support this kind of hypocrisy. For those who didn't know

https://en.wikipedia.org/wiki/Death_of_Harry_Dunn


I want to see a tax audit.

Epstein was running what he said was legal tax avoidance system for his clients.

I'm not saying they were breaking any laws in it, but Epstein is a dodgy guy (also he used to be involved in IIRC a Ponzi scheme and was previously sacked from a big firm - red flags galore with this guy) and the scheme worked off asset prices and trusts.

If a bunch of billionaires could manipulate asset prices (selling illiquid assets like mansions and artwork between each other and their trusts) I suspect they could really bring down their tax bills. This would be illegal (I think) but you'd need to untangle a large web of transactions to prove it.


> I want to see a tax audit.

Yes. IMHO, the parts that they really don't want to come out are the financial ties. It's the connections of money (and power and influence) that is being covered up, more then the child sex crimes that are now known.


The tax fraud can get a conviction, since there's a paper trail, and juries are more likely to think "yeah this billionaire might not be a monster but they probably cheated in their taxes".

Epstein didn't rape any children, he had a harem of young women. Some of the women were under 18, but all of them were old enough to be held responsible for their part in the crime, that is, being prostitutes. Many even have admitted to comitting "child" sex trafficking by inviting girls from their high schools to be prostitutes.

Now first of all, women below the age of 18 are children. Feels strange to actually having to spell this out, but okay.

More importantly, you seem to know an awful lot more than anybody else right now, or how would you be so certain?


Du bist total falsch. Das Recht deines eigenen Landes sagt Mädels über 14 Jahre dürfen zum Sex freilich annehmen.

Of course the perpetrators of child abuse would be adjacent to child abuse material. That's why the material needs to be indexed, to make sense of it and round up the rest of the perpetrators.

correct

i can say that 90% of the work was in making sense of the data, indexing, processing, etc


Go herd goats. You don't need to wait for AI to destroy your livelihood.

Herding goats doesn't solve the interesting technical problem I'm trying to solve.

Point is: if that problem is solvable without me, that's the win condition for everyone. Then I go herd goats (and have this nifty tool that helps me spec out an optimal goat fence while I'm at it).


> Point is: if that problem is solvable without me, that's the win condition for everyone.

The problem is solvable without you. I don't even need to know what the problem actually is, because the odds of you being one of the handful of the people in the world who are so critical that the world notices their passing is so low, I have a better chance of winning a lottery jackpot than of you being some critical piece of some solution.


I completely disagree - I think it’s the other way around.

Solving the problem - no matter what problem it is - is extremely dependent on you and every single human being (or animal for that matter) is a critical piece of their environment and circumstances.


Yeah, but there's nothing like some sweet, sweet justification.

I need that sweet AI-enabled UBI first to do it comfortably

This basically never happens.

What basically never happen? People testing in Firefox, or websites not working properly?

I don't have this problem.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: