More

ianbutler · 2026-02-03T19:50:53 1770148253

I'd argue we jumped that shark since the shift in focus to post training. Labs focus on getting good at specific formats and tasks. The generalization argument was ceded (not in the long term but in the short term) to the need to produce immediate value.

Now if a format dominates it will be post trained for and then it is in fact better.

Der_Einzige · 2026-02-03T20:24:50 1770150290

Anthropic and Gemini still release new pre-training checkpoints regularly. It's just OpenAI who got stupid on that. RIP GPT-4.5

ianbutler · 2026-02-03T20:44:06 1770151446

All models released from those providers go through stages of post training too, none of the models you interact with go from pre-training to release. An example of the post training pipeline is tool calling, that is to my understanding a part of post training and not pre training in general.

I can't speak to what the exact split is or what is a part of post training versus pre training at various labs but I am exceedingly confident all labs post train for effectiveness in specific domains.

Der_Einzige · 2026-02-03T20:46:55 1770151615

I did not claim that post training doesn't happen on these models, and you are being extremely patronizing (I publish quite a bit of research on LLMs at top conferences).

I claimed that OpenAI overindexed on getting away with aggressive post-training on old pre-training checkpoints. Gemini / Anthropic correctly realized that new pre-training checkpoints need to happen to get the best gains in their latest model releases (which get post-trained too).

ianbutler · 2026-02-03T23:48:01 1770162481

If you read that as patronizing that says more about you than me personally, I have no idea who you are so your own insecurity at what is a rather unloaded explanation perplexes me.

ianbutler · 2026-02-02T06:15:16 1770012916

Except Akka in Java and for the entirety of Erlang and its children Elixir and Gleam. You obviously can scale those to multiple systems, but they provide a lot of benefit in local single process scenarios too imo.

Things like data pipelines, and games etc etc.

katzenversteher · 2026-02-02T09:36:25 1770024985

If I'm not mistaken ROOM (ObjecTime, Rational Rose RealTime) was also heavily based on it. I worked in a company that developed real time software for printing machines with it and liked it a lot.

lmm · 2026-02-02T06:16:44 1770013004

I've worked on a number of systems that used Akka in a non-distributed way and it was always an overengineered approach that made the system more complex for no benefit.

ianbutler · 2026-02-02T06:26:42 1770013602

Fair, I worked a lot on data pipelines and found the actor model worked well in that context. I particularly enjoyed it in the Elixir ecosystem where I was building on top of Broadway[0]

Probably has to do with not fighting the semantics of the language.

[0] https://elixir-broadway.org/

weatherlight · 2026-02-02T08:19:10 1770020350

Really depends of the ergonomics of the language. In erlang/elixir/beam langs etc, its incredibly ergonomic to write code that runs on distributed systems.

you have to try really hard to do the inverse. Java's ergonomics, even with Akka, lends its self to certain design patterns that don't lend itself to writing code for distributed systems.

ianbutler · 2026-01-31T21:14:53 1769894093

It has full web search I often have CC search out docs and compare opinions on an approach

ianbutler · 2026-01-20T17:51:39 1768931499

In practice the system is such that 'everyone' doesn't really seem to include the people making a lot of money they are effectively outside the system the rest of us have to deal with.

ianbutler · 2026-01-15T09:30:24 1768469424

It sounds more like a depression and stress brained reduction to me. Tends to put you in a very binary and extreme thinking mode in my lived experience.

Also I inherently disagree with the idea of meaninglessness the author presents there. Meaning is relative to man, man makes meaning. There is no objective meaning and so you have to choose it for yourself.

ianbutler · 2026-01-15T00:59:32 1768438772

Yeah I've had problems with this recently. "Oh those are just warnings." Yes but leaving them will make this codebase shit in short time.

I do use AI heavily so I resorted to actually turning on warnings as errors in the rust codebases I work in.

micimize · 2026-01-15T17:03:27 1768496607

Easiest to have different agents or turns that set aside the top-level goal via hooks/skills/manual prompt/etc. Heuristically, a human will likely ignore a lot of warnings until they've wired up the core logic, then go back and re-evaluate, but we still have to apply steering to get that kind of higher-order cognitive pattern.

Product is still fairly beta, but in Sculptor[^1] we have an MCP that provides agent & human with suggestions along the lines of "the agent didn't actually integrate the new module" or "the agent didn't actually run the tests after writing them." It leads to some interesting observations & challenges - the agents still really like ignoring tool calls compared to human messages b/c they "know better" (and sometimes they do).

[^]: https://imbue.com/sculptor/

ianbutler · 2026-01-14T20:18:39 1768421919

We shouldn't ban social media we should ban algorithmically curated feeds that push any specific type of content. Outrage sells and so platform curated feeds have curated outrage and extreme content.

In practice I haven't seen much useful political discourse by the average person, but as long as we don't selectively amplify voices through machine signals and they NATURALLY accrue followings then whatever I guess.

gherkinnn · 2026-01-14T21:27:21 1768426041

Ban targeted ads while you're at it and throw around the most savage fines for companies who don't comply.

The world will be better for it.

oplumb · 2026-01-15T11:08:44 1768475324

I'd go even further and say ban ads and the selling and collecting of personal data in general.

Maybe someone can help me there, but I fail to see neither the inherent public good of it nor the one of any service that requires it to function.

ianbutler · 2026-01-11T21:35:31 1768167331

MetroUI in Windows 8 was pretty universally panned. I thought it was pretty good on tablets and such, but it left a lot to be desired on desktops and hid a lot of functionality, it went too mobile for a lot of people's tastes.

Disclaimer: I was one of the dozens who used a windows phone. The Nokia Lumia 920 was great, you can fight me.

stevage · 2026-01-11T22:00:05 1768168805

I think a lot of people liked the Windows mobile experience. Shame it didn't quite get enough market share.

Nextgrid · 2026-01-11T22:08:03 1768169283

Resetting the app ecosystem 3 fucking times by breaking app compatibility didn't help. Windows Phone 7 - Windows Phone 8 -> Windows (Phone?) 10.

hyperrail · 2026-01-11T23:16:50 1768173410

Wrong. There was full app compat of WP7 apps in WP8 and Win10 Mobile, and for WP8 apps in W10M. The only full backward app compat break was from WM6.5/WP6.5 to WP7.

I'll give you the benefit of the doubt and assume you're thinking of the lack of device OS upgrades: from WP6.5 to WP7, from WP7 to WP8, and from older WP8 devices to W10M. So no forward compat, but absolutely yes to backward compat.

sirwhinesalot · 2026-01-12T07:32:42 1768203162

That's not what they mean. As a developer, the API you used to develop your app was now deprecated with no migration path. That meant your app was deprecated, with no migration path.

For an app platform already a distant third place and struggling to attract developers, pissing off the few devs you do have TWICE was not a smart move.

hyperrail · 2026-01-12T08:21:21 1768206081

Even then, that happened at most twice as you say, not three times as the other poster said.

And I disagree with your implicit claim that the WP7 & WP8 Silverlight -> Win10 UWP transition had no migration path. There was >= 90% source code similarity, bolstered if you had already adopted the Win8.1/WP8.1 "universal" project templates. And Microsoft provided tooling to ease the transition. Sometimes it was literally just s/Microsoft.Phone/Windows.UI/g.

Games were a different matter, I'll admit. XNA as an app platform had no direct replacement that was binary compatible with Win10 desktop, but even then, not only was DirectX already available from WP8.0, but Microsoft invested in MonoGame as an XNA replacement precisely because they knew the end of XNA would hit hard. (In fact it was the Windows Phone division that had essentially kept XNA on life support in its final years so that WP7 games would not break.)

vjvjvjvjghv · 2026-01-12T15:46:37 1768232797

"the API you used to develop your app was now deprecated with no migration path."

Seems that's the standard now for .NET desktop dev. Every 2 or 3 years MS crank out a new XAML based framework that's not compatible with the previous and never gets completed before a new framework comes out.

sirwhinesalot · 2026-01-13T13:23:26 1768310606

Nobody in their right mind should be touching any Microsoft provided API that isn't already well established (like Win32 and Direct3D).

I'm happy they're at least maintaining (to a limited extent) Windows Forms and WPF and updating their styles to fit with their fancy Fluent design.

But even that is a pretty sad state of affairs, since Windows Forms should be able to get that info from uxtheme (which Microsoft fumbled) and WPF should be able to get that info from the style distributed with the system-installed .NET framework (which Microsoft fumbled and now only exists for backcompat).

For the company with the best track record for backwards compatibility (with Windows), they sure suck at developing and evolving the same API for long.

ulbu · 2026-01-11T22:18:07 1768169887

i guess they needed to release all that pent up backwards incompatibility

SuperNinKenDo · 2026-01-11T22:51:14 1768171874

You joke, but I honestly wonder if this period and projects didn't involve a bunch of Microsoft employees who got a little overexcited when they were told that they didn't need to maintain the insane, sometimes bug-for-bug, compatibility layers with 20-40 year old software that they had had to deal with their entire career there.

Must have felt incredibly liberating, and maybe they got a little too into the whole idea of "fresh start"(s).