> But now that most code is written by LLMs, it's as "hard" for the LLM to write Python as it is to write Rust/Go
The LLM still benefits from the abstraction provided by Python (fewer tokens and less cognitive load). I could see a pipeline working where one model writes in Python or so, then another model is tasked to compile it into a more performant language
It's very good (in our experience, YMMV of course) to have the LLM write the prototype in Python and then port it automatically 1:1 to Rust for perf. We write prototypes in JS and Python and they then get auto-ported to Rust; we have been doing this for about a year for all our projects where it makes sense. In the past months it has been incredibly good with Claude Code; it is absolutely automatic: we run it in a loop until all tests (many handwritten in the original language) succeed.
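For anyone wondering what that loop looks like in practice, here is a minimal sketch. It assumes the Claude Code CLI (`claude -p`) and a Rust crate with a test suite; the prompt, paths, and attempt cap are illustrative, not the commenter's actual setup:

```python
import subprocess

# Hypothetical port-until-green loop: ask Claude Code to port the prototype,
# then run the Rust tests; repeat until they pass or we give up.
PROMPT = (
    "Port src/prototype.py to Rust 1:1 in rust/src/lib.rs. "
    "Do not change behavior; make `cargo test` pass."
)

for attempt in range(10):  # cap the loop so it can't run forever
    subprocess.run(["claude", "-p", PROMPT], check=True)
    tests = subprocess.run(["cargo", "test"], cwd="rust")
    if tests.returncode == 0:
        print(f"all tests green after {attempt + 1} attempt(s)")
        break
else:
    raise SystemExit("port did not converge; needs human review")
```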
Sorry, so basically you're saying there are two separate guidelines, one for Python and one for Rust, and you have the LLM write it first in Python and then in Rust. But I still don't understand why that would be any better than writing the code in Rust in one go. Why would "priming" it in Python improve the result in any way?
Also, what happens when bug fixes are needed? Again first in Py and then in Rs?
Sorry, yes. LLMs write code that's then checked by human reviewers. Maybe it will be checked less in the future. But I'm not seeing fully-autonomous AI on the horizon.
At that point, the legibility and prevalence of humans who can read the code becomes almost more important than which language the machine "prefers."
Well, verification is easier than creation (cf. P ≠ NP). I think humans who can quickly verify that something works will be in more demand than those who know how to write it. Even better: since LLMs aren't as creative as humans (in-distribution thinking), test-writers will be in more demand (out-of-distribution thinkers). Both of these mean that humans will still be needed, but for other reasons.
The experienced generalists with verification-testing techniques are the winners [0] in this.
But one thing you cannot do is openly admit, or be found out saying, something like "I don't know a single line of Rust/Go/TypeScript/$LANG code, but I used an AI to do all of it" when the system breaks down and you can't fix it.
It would be quite difficult to take a SWE seriously who prides themselves on having zero understanding of or experience in building production systems and runs the risk of losing the company time and money.
This is fair, but this seems like the only way to test this type of thing while avoiding the risk of harassing tons of farmers with AI emails. In the end, the performance will be judged on how much of a human harness is given
People will pay extra for Opus over Sonnet and often describe the $200 Max plan as cheap because of the time it saves. Paying for a somewhat better harness follows the same logic
The game looks really good, although I think it'd be improved if the sphere was a bit smaller. It feels like it takes too long for the game to become difficult
Because I learned JS before ECMAScript 6 was widely supported by browsers and haven't written a ton of it targeting modern browsers. You're right that it's unnecessary.
Easy up to ~70, interesting between 80-110, very hard around 120-130. I think scores above 200 are pretty sus, there is very little room on the sphere at that point (using the cheat from sibling comment). Anything >400 is definitely made up.
A few months ago, there was a lot of news lambasting tech companies for extending the depreciation lifespan of GPUs from ~3 years to ~5 years. Do these price hikes suggest that a longer lifespan actually is the right way to think about how long these GPUs will stay valuable?
Not a finance guy, so fully prepared to be wrong here, but my interpretation is that an increase in price corresponds to a shorter lifespan: less time to make money, so you need to charge more to get the same return.
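A back-of-envelope sketch of that reasoning, with made-up numbers: the break-even hourly rate has an amortization term that scales inversely with lifespan, so cutting the assumed lifespan pushes the price up.

```python
# Break-even hourly rate to recoup a GPU over its lifespan.
# All numbers are invented for illustration.
capex = 30_000            # purchase price, $
power_cost_per_hour = 1.50
utilization = 0.7         # fraction of hours actually billed

for lifespan_years in (3, 5):
    billable_hours = lifespan_years * 365 * 24 * utilization
    rate = capex / billable_hours + power_cost_per_hour
    print(f"{lifespan_years}y lifespan -> break-even rate ${rate:.2f}/h")
```

With these numbers, the 3-year assumption needs roughly $3.13/h versus $2.48/h for 5 years.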
It could also be a supply/demand issue, generally price increases are caused by either 1. demand increasing, or 2. supply decreasing.
In this case we can interpret a shorter lifespan as decreased supply, but it can also be because the demand for GPU compute has gone up. I think in this case we're seeing a bit of both, but it's hard to tell without more data.
We could also consider the supply/demand elasticity changing; e.g., if demand has become more price-inelastic, that alone could result in a higher price.
The thing historically about GPUs has not been the actual lifespan of the hardware (at least half of the hardware will probably work fine for 10 or more years). The problem is that work/watt keeps improving with newer hardware, so there's a point where, even if you had an equivalent quantity of 10-year-old GPUs, powering them for some period costs $40k while a single brand-new GPU costs $40k to buy but only $20k to power for the same period; that crossover arrives in less than a few years.
I don't think we're seeing any decrease in supply though, ignoring 2020 I'm pretty sure the number of GPUs manufactured has been steadily increasing. It might be the case that projected manufacturing was higher than what actually happened, which is not the same thing as a decrease in supply, but companies like Amazon will talk about it like it is, and from the standpoint of their pricing it essentially is.
> The problem is that work/watt keeps improving with newer hardware, so there's a point where, even if you had an equivalent quantity of 10-year-old GPUs, powering them for some period costs $40k
Sell the old-gen GPUs to on-prem users (including home consumers) who are going to run them a small percentage of the time (so power use is more or less negligible to them compared to acquisition cost); problem solved.
The same math applies for on-prem/home users. If you actually have some workload where it makes sense to get a free GPU that costs $40/hour to power because you only need it for a few hours a month, it's probably cheaper to rent a more efficient GPU from someone who can power it at a lower cost.
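To make the crossover concrete with the rough numbers from upthread ($40k/period to power the old fleet, versus $40k to buy a new GPU that costs $20k/period to power):

```python
# Cumulative cost of keeping the old "free" GPUs vs buying a new one,
# using the illustrative numbers from the comment above.
old_power = 40_000   # $/period to power the old fleet
new_capex = 40_000   # $ to buy the new GPU
new_power = 20_000   # $/period to power it

for periods in range(1, 5):
    old = old_power * periods
    new = new_capex + new_power * periods
    marker = "  <- new GPU wins" if new < old else ""
    print(f"after {periods} period(s): old=${old:,} new=${new:,}{marker}")
```

The two lines cross after two periods; beyond that, the "free" hardware is the more expensive option.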
Oh my reasoning was coming at this from a different angle: H200s were released in November of 2023, so they're over 2 years old at this point while still being valuable
It mainly depends on how much NVIDIA is overselling the improvements.
With RL workloads being added, prefill and decode being separated onto different chips, NVFP4, and lots of other architectural changes, efficiency on the most valuable tasks goes up, as long as the algorithms don't change significantly.
The people who think that high-end GPUs don't last for ~5 years (really it's 6) do not know what they are talking about. 6 years is likely too low. If the cooling is good and they don't fail early, most of these GPUs could still keep going past 6 years.
I'm convinced that anything with more than 80 GB of VRAM will be worth it for closer to 10 years at this point.
> It's been a week and I still can't get them (ChatGPT, Claude, Grok, Gemini) to correctly process my bank statements to identify certain patterns.
Can you give any more details on what you mean? This feels like a task they should be great at, even if you're not paying the $20/mo for any lab's higher tier model
I have a couple banks that are peculiar in the way they handle transactions made in a different currency while traveling etc. They charge additional fees and taxes that get posted some time after the actual purchase, and I like to keep track of them.
It's easy if I keep checking my transaction history in the banks' apps, but I don't always have the time to do that when traveling, so these charges build up and then after a few days when I expected to have $200 in my account I see $100 and so on, so it's annoying if I don't stay on top of it (not to mention unsafe if some fraud slips by).
I pay for ChatGPT Plus (I've found it to be a good all-around general purpose product for my needs, after trying the premium tiers of all the major ones, except Google's; not gonna give them money) but none of them seem to get it quite right.
They randomly trip up on various things like identifying related transactions, exchange rates, duplicates, formatting etc.
> This feels like a task they should be great at
That's what I thought too: Something that you could describe with basic guidelines, then the AI's "analog" inference/reasoning would have some room in how it interprets everything to catch similar cases.
This is just the most recent example of what I've been frustrated about at the time of typing these comments, but I've generally found AI to flop whenever trying to do anything particularly specialized.
If you installed Claude Code, put all your statements into a local folder, and asked it to process them, it could do literally anything you could come up with, all the way up to setting up an AWS instance with a website that gives nifty visualizations of your spending. Or anything else you are thinking of.
I may try that, but at this point it's already more work wrestling with the AI than just doing it myself.
The most important factor is confidence: After seeing them get some things mixed up a few times, I would have to manually verify the output myself anyway.
----
Re: the multiple comments that suggest asking the AI for code instead of feeding data to the chatbot:
I get what you mean, but I WANT the AI's non-deterministic AIness in this case!
For example, in some countries there are these "omni apps" that can be used for ride hailing, ordering food, etc. The bank statement lists all such transactions with the same merchant name. I want the AI to do its AI thing and guess which transactions were rides and which were food deliveries, based on the prices and times etc.: if there are multiple small transactions, those are taxis, and the most expensive transactions during a day are my lunch and dinner.
And there are other cases like that; it would be too much "imperative" code, which would fail anyway.
Like I said, this is a task that any human could do easily after a short explanation, but takes a hell of a lot of wrangling with AI.
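One possible middle ground between feeding the statement to a chatbot and writing purely imperative rules is to let deterministic code do the grouping and reserve the LLM call for the fuzzy judgment. A rough sketch; the client setup, model name, transaction schema, and prompt wording are all placeholders:

```python
from openai import OpenAI

client = OpenAI()  # assumes an API key in the environment

def classify(merchant: str, txns: list[dict]) -> str:
    """Ask the model to apply the fuzzy ride-vs-food judgment to one
    deterministically grouped day of transactions (schema is hypothetical)."""
    lines = "\n".join(f"{t['time']}  {t['amount']:.2f}" for t in txns)
    prompt = (
        f"Merchant '{merchant}' is an omni app (rides + food delivery).\n"
        f"Transactions for one day:\n{lines}\n"
        "Label each line 'ride' or 'food'. Hints: small repeated charges are "
        "usually rides; the largest charges around midday/evening are meals."
    )
    resp = client.chat.completions.create(
        model="gpt-4o-mini",  # placeholder model id
        messages=[{"role": "user", "content": prompt}],
    )
    return resp.choices[0].message.content
```

The code stays testable and deterministic about what goes into the prompt, while the "AIness" is confined to the one labeling step.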
I had the same vague impression as you did when using AI via browser/chat interaction. Like it’s very impressive but how useful is it really?
Using it via the CLI approach is an entirely different experience. It's literally shocking what you can do.
For context, among many other things I have done this exact thing I am recommending. I just hit export on a Quickbooks instance of a complex multimillion dollar business and had Claude Code generate reports on various things I wanted to optimize and it just handles it in seconds.
The real limit to these tools is knowing what to ask for and stating the requirements clearly and incrementally. Once you get the hang of it, it’s literally shocking how many use cases you can find.
I think a good mental model of what you can expect from a chatbot is imagining that somebody read the bank statement to you and then asked you a bunch of questions. Could you follow that, not make any mistakes, not forget anything? Can you perform the task off the top of your head, not writing anything down, not pulling up Excel or a calculator? If you can, there's a good chance AI will be able to do that too. The fact that it sometimes can do more is pure miracle. And if you want it to do those things consistently, you need to provide it with access to the tools you'd need to perform this task consistently.
It's simple, I can do it myself:
Go row by row. See a certain phrase in the transaction description? Look a few rows ahead. Spot associated fees with just a glance. Write that group of transactions down somewhere else.
That's it.
I tried different kinds of prompts, from imperative to declarative, including telling the AI to write a script for its own internal use, but they just don't seem to get it.
AI has a purely linear input channel: it gets tokens one by one, and context is a form of short-term memory. I know that because you give it written text, it seems like you've provided a document it should be able to process any way it likes, but the system is set up as if you read the document to the AI word by word and asked questions about it that it needs to answer "off the top of its head".
> It's simple, I can do it myself:
> Go row by row. See a certain phrase in the transaction description? Look a few rows ahead.
Can you do it without looking at the document? Just by ear? Every time correctly? Without missing something?
This is exactly why you have it write code instead of analyzing the data. You can have tests, you can inspect the code, you know that the process will be deterministic. The chatbot LLMs are a bad match for bulk data analysis on regular, structured data, but they're often quite decent at writing code.
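As a concrete illustration, the row-by-row fee-matching pass described upthread is only a few lines once the statement is exported. A minimal sketch, assuming a CSV with date/description/amount columns; the column names and matching phrases are hypothetical:

```python
import csv

LOOKAHEAD = 5                       # how many rows ahead fees may post
FEE_PHRASE = "FOREIGN TRANSACTION FEE"

with open("statement.csv", newline="") as f:
    rows = list(csv.DictReader(f))  # expects date, description, amount columns

# Walk the statement; when a foreign-currency purchase appears,
# collect the late-posted fees within the lookahead window.
groups = []
for i, row in enumerate(rows):
    if "FX" in row["description"]:
        fees = [r for r in rows[i + 1 : i + 1 + LOOKAHEAD]
                if FEE_PHRASE in r["description"]]
        groups.append({"purchase": row, "fees": fees})

for g in groups:
    total = float(g["purchase"]["amount"]) + sum(float(f["amount"]) for f in g["fees"])
    print(g["purchase"]["date"], g["purchase"]["description"], f"total {total:.2f}")
```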
> Like I said, this is a task that any human could do easily after a short explanation, but takes a hell of a lot of wrangling with AI.
Replying to your edit: it just doesn’t. It’s almost effortless and fast to do exactly what you’re describing, capturing the subjective judgement of the AI to do what you want.
It took me a couple of weeks to get very, very good at it, with good results in the first day or two. If you’re a competent programmer you’ll have the same experience, and quickly, if you get into the flow that’s being described to you.
I’m the ultimate skeptic, and I understand where you’re coming from, but these workflows are crazy powerful.
This is the right answer. Don't just feed the data to a chatbot; have it write code to do what you want, repeatably and testably. You can probably have working Python (and a Docker container for it) in under 30 min.
I don't think the commenter above is saying that an AI should necessarily apply the redaction. Rather, an AI can serve as an objective-ish way of determining what should be redacted. This seems somewhat analogous to how (non-AI) models can be used to evaluate how gerrymandered a map is.
Using voice transcription is nice for fully expressing what you want, so the model doesn't need to make guesses. I'm often voicing 500-word prompts. If you talk in a winding way that would look awkward as text, that's fine; the model will almost certainly be able to tell what you mean. Using voice-to-text is my biggest suggestion for people who want to use AI for programming.
(I'm not a particularly slow typer. I can go 70-90 WPM on a typing test. However, this speed drops quickly when I need to also think about what I'm saying. Typing that fast is also kinda tiring, whereas talking/thinking at 100-120 WPM feels comfortable. In general, I think just this lowered friction makes me much more willing to fully describe what I want)
You can also ask it, "do you have any questions?" I find that saying "if you have any questions, ask me, otherwise go ahead and build this" rarely produces questions for me. However, if I say "Make a plan and ask me any questions you may have" then it usually has a few questions
I've also found a lot of success when I tell Claude Code to emulate some specific piece of code I've previously written, either within the same project or something I've pasted in.
> I'm not a particularly slow typer. I can go 70-90 WPM on a typing test. However, this speed drops quickly when I need to also think about what I'm saying. Typing that fast is also kinda tiring, whereas talking/thinking at 100-120 WPM feels comfortable.
This doesn't feel relatable at all to me. If my writing speed is bottlenecked by thinking about what I'm writing, and my talking speed is significantly faster, that just means I've removed the bottleneck by not thinking about what I'm saying.
It's often better to segregate creative and inhibitive systems even if you need the inhibitive systems to produce a finished work. There's a (probably apocryphal) conversation between George RR Martin and Stephen King that goes something like:
GRRM: How do you write so many books?... Don't you ever spend hours staring at the page, agonizing over which of two words to use, and asking 'am I actually any good at this?'
SK: No.
That's fair. I sometimes find myself pausing or just talking in circles as I'm deciding what I want. I think when I'm speaking, I feel freer to use less precise/formal descriptions, but the model can still correctly interpret the technical meaning
In either case, different strokes for different folks, and what ultimately matters is whether you get good results. I think the upside is high, so I broadly suggest people try it out
Alternatively: some people are just better at / more comfortable thinking in auditory mode than visual mode & vice versa.
In principle I don't see why they should have different amounts of thought. That'd be bounded by how much time it takes to produce the message, I think. Typing permits backtracking via editing, but speaking permits 'semantic backtracking' which isn't equivalent but definitely can do similar things. Language is powerful.
And importantly, to backtrack in visual media I tend to need to re-saccade through the text with physical eye motions, whereas with audio my brain just has an internal buffer I can replay at the speed of thought.
Typed messages might have higher _density_ of thought per token, though how valuable is that really, in LLM contexts? There are diminishing returns on how perfect you can get a prompt.
Also, audio permits a higher bandwidth mode: one can scan and speak at the same time.
It's kind of the point. If you start writing it, you'll start correcting it and moving things around and adding context and fiddling and more and more.
And your 5-minute prompt just turned into half an hour of typing.
With voice you get on with it, and then start iterating, getting Claude to plan with you.
Not been impressed with agentic coding myself so far, but I did notice that using voice works a lot better imo, keeping me focused on getting on with letting the agent do the work.
I've also found it good for stopping me doing the same thing in Slack messages. I ramble my general essay to ChatGPT/Claude, then get them to summarize/rewrite it as a few lines in my own voice. Stops me spending an hour crafting a Slack message, and it tends to soften it.
I prefer writing myself, but I could see the appeal of producing a first draft of a prompt by dumping a verbal stream of consciousness into ChatGPT. That might actually be kind of fun to try while going on a walk or something.
That's definitely cool too. I was just suggesting an intermediary text prompt step as a compromise between 100% writing and 100% voice. So instead of getting home to actual code, you'd get home to a draft of relatively detailed requirements to review and revise before incurring the cost of throwing a coding agent at it.
I don’t feel restricted by my typing speed; speaking is just so much easier and more convenient. The vast majority of my ChatGPT usage is on my phone, and that makes s2t a no-brainer.
100% this, I built laboratory.love almost entirely with my voice and (now-outdated) Claude models
My go-to prompt finisher, which I have mapped to a hotkey due to frequent use, is "Before writing any code, first analyze the problem and requirements and identify any ambiguities, contradictions, or issues. Ask me to clarify any questions you have, and then we'll proceed to writing the code"
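For anyone who wants the same hotkey trick, here is a minimal sketch using the third-party `keyboard` package (`pip install keyboard`); the hotkey choice is arbitrary, and the package needs elevated privileges on some platforms:

```python
import keyboard  # third-party global-hotkey/typing library

FINISHER = (
    "Before writing any code, first analyze the problem and requirements "
    "and identify any ambiguities, contradictions, or issues. Ask me to "
    "clarify any questions you have, and then we'll proceed to writing the code."
)

# When the hotkey fires, type the canned prompt finisher into the focused app.
keyboard.add_hotkey("ctrl+alt+q", lambda: keyboard.write(FINISHER))
keyboard.wait()  # keep the script alive, listening for the hotkey
```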
It's an AI. You might do better by phrasing it, 'Make a plan, and have questions'. There's nobody there, but if it's specifically directed to 'have questions' you might find they are good questions! Why are you asking, if you figure it'd be better to get questions? Just say to have questions, and it will.
It's like a reasoning model. Don't ask, prompt 'and here is where you come up with apropos questions' and you shall have them, possibly even in a useful way.
> surprised AI companies are not making this workflow possible instead of leaving it up to users to figure out how to get voice text into the prompt.
Claude on macOS and iOS has native voice-to-text transcription. Haven't tried it, but since you can access Claude Code from the apps now, I wonder if you can use the Claude app's transcription as input to Claude Code.
> Claude on macOS and iOS has native voice-to-text transcription
Yeah, Claude/ChatGPT/Gemini all offer this, although Gemini's is basically unusable because it will immediately send the message if you stop talking for a few seconds
I imagine you totally could use the app transcript and paste it in, but keeping the friction to an absolute minimum (e.g., just needing to press one hotkey) feels nice
Love Handy. I use it too when dealing with LLMs. The other day I asked ChatGPT to generate interview questions based on a job description, and then I answered using Handy. So cool!
I use Spokenly with local Parakeet 0.6B v3 model + Cerebras gpt-oss-120b for post-processing (cleaning up transcription errors and fixing technical mondegreens, e.g., `no JS` → `Node.js`). Almost imperceptible transcription and processing delay. Trigger transcription with right ⌥ key.
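The post-processing step in a setup like that can be a single chat call to an OpenAI-compatible endpoint. A sketch, where the base URL, model id, and system prompt are assumptions to adapt to your own provider:

```python
from openai import OpenAI

# Hypothetical OpenAI-compatible endpoint; swap in your provider's URL/key.
client = OpenAI(base_url="https://api.cerebras.ai/v1", api_key="YOUR_KEY")

def clean(transcript: str) -> str:
    """Fix transcription errors and technical mondegreens in raw dictation."""
    resp = client.chat.completions.create(
        model="gpt-oss-120b",  # model id as named in the comment above
        messages=[
            {"role": "system",
             "content": "Correct transcription errors and technical "
                        "mondegreens (e.g. 'no JS' -> 'Node.js'). "
                        "Return only the corrected text."},
            {"role": "user", "content": transcript},
        ],
    )
    return resp.choices[0].message.content
```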
I use Raycast + Whisper Dictation. I don't think there is anything novel about it, but it integrates nicely into my workflow.
My main gripe is that when the recording window loses focus, I haven't found a way to bring it back and continue the recording session. So occasionally I have to start from scratch, which is particularly annoying if it happens during a long-winded brain dump.
I built my own open-source tool to do exactly this, so that I can run something like `claude $(hns)` in my terminal and then start speaking; after I'm done, claude receives the transcript and starts working. See this workflow here: https://hns-cli.dev/docs/drive-coding-agents/
There are a few apps nowadays for voice transcription. I've used Wispr Flow and Superwhisper, and both seem good. You can map some hotkey (e.g., ctrl + windows) to start recording, then when you press it again to stop, it'll get pasted into whatever text box you have open
Superwhisper offers some AI post-processing of the text (e.g., making nice bullets or fixing grammar), but this doesn't seem necessary and just makes things a bit slower.
I do the same. On Mac I use MacWhisper. The transcription does not have to be correct: lots of times it writes the wrong word when I'm talking about technical stuff, but Claude understands which word I mean from context.
My regular workflow is to talk (I use VoiceInk for transcription) and then say “tell me what you understood”. This puts your words into a well-structured format, and you can also make sure the CLI agent got it; expressing it explicitly likely also helps it stay on track.
I use a keyboard shortcut to start and stop recording and it will put the transcription into the clipboard so I can paste into any app.
It's a huge productivity boost. OP is correct about not overthinking it or trying to be that coherent; the models are very good at knowing what you mean (Opus 4.5 with Claude Code in my case).
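A minimal sketch of that record-transcribe-clipboard pipeline, assuming openai-whisper, sounddevice, scipy, and pyperclip are installed; a fixed-length recording stands in for a real start/stop hotkey:

```python
import sounddevice as sd
import pyperclip
import whisper
from scipy.io import wavfile

RATE = 16_000
SECONDS = 15  # fixed-length recording keeps the sketch simple

# Record from the default microphone, then write a WAV file for Whisper.
audio = sd.rec(int(SECONDS * RATE), samplerate=RATE, channels=1)
sd.wait()
wavfile.write("clip.wav", RATE, audio)

# Transcribe locally and drop the text on the clipboard, ready to paste.
model = whisper.load_model("base")
text = model.transcribe("clip.wav")["text"].strip()
pyperclip.copy(text)
print(text)
```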
I just installed this app and it is very nice. The UX is very clean and whatever I say it transcribes it correctly. In fact I'm transcribing this comment with this app just now.
I am using Whisper Medium. The only problem I see is that at the end of the message it sometimes adds a "bye" or a "thank you", which is kind of annoying.
I'm quite ready to believe that with LLMs it's not worth trying to be too coherent: I have successfully used LLMs to make sense of what incoherent-sounding people say (in text).
Aquavoice (a YC company) is really good. Got it after doing a bit of research on here; there's something for Mac that's supposed to be good too.
If you want local transcription, locally running models aren't quite good enough yet.
They use right-ctrl as their trigger. I've set mine to double tap and then I can talk with long pauses/thinking and it just keeps listening till I tap to finish.
I'm using Wispr flow, but I've also tried Superwhisper. Both are fine. I have a convenient hotkey to start/end recording with one hand. Having it just need one hand is nice. I'm using this with the Claude Code vscode extension in Cursor. If you go down this route, the Claude Code instance should be moved into a separate window outside your main editor or else it'll flicker a lot
Another option is MacWhisper, if someone is on macOS and doesn't want to pay for a subscription (just a one-time payment). Pretty much all of those apps these days use Parakeet from NVIDIA, which is the fastest and best open-source model that can run on edge devices.
Also, I haven't tried it, but on the latest macOS 26 Apple updated their STT models, so their built-in voice dictation may be good enough.
Voice transcription is silly when someone can hear you talking to something that isn't exactly human; imagine explaining that you were talking to an AI. Still, when it's more than one sentence, I use voice too.