Besides the fact that you do have conflicts of interest (disclosing them doesn't negate them), you don't seem to understand that, given how all of the Big AI players have shown themselves to be ruthless and shamelessly dishonest (hoovering up people's creative work without concern for its licensing, aggressively scraping websites to the point of DDoSing them, disregarding robots.txt and using residential IPs to disguise their traffic, and all the while denying everything), when you assume the role of their most enthusiastic shill, people will naturally start to doubt your integrity.
EDIT: To put it another way, I remember a talk where you dismissed people's concerns about their data being used for training after AI was integrated into a product, citing the company's (a big AI player) denial--as if that should just be taken at face value, because they say so--a perspective that many of us view as naive or disingenuous.
The purpose of disclosure is to allow people to make their own decisions about how trustworthy I am as a source of information.
If I started rejecting access to early models out of a desire to avoid conflicts of interest, my coverage would be less useful to people. I think most of my regular readers understand that.
I was responsible for one of the first widely read reports on the ethics of model training back in 2022 when I collaborated with Andy Baio to cover Stable Diffusion's unlicensed training data: https://waxy.org/2022/08/exploring-12-million-of-the-images-...
Calling me "their most enthusiastic shill" is not justified. Have you seen what's out there on LinkedIn/Twitter etc?
The reason I show up on Hacker News so often is that I'm clearly not their most enthusiastic shill.
This is a disappointing thread to find - HN is usually a little more thoughtful than throwing around insults like "insufferable AI cheerleader".
If I can provide a different perspective, I find your writing on LLMs to be useful. I've referenced your writing to coworkers in an effort to be a little more rigorous when it comes to how we use these new (often unintuitive) tools.
I think the level of disclosure you do is fine. It's certainly a better effort at transparency than most writers are willing to make.
It's called having standards.
If I'm reading propaganda, I'd at least like something in return.
This whole "I'm so positive haha I just wanna help humanity" act might fly on LinkedIn, but the whole point of this place is to have interesting information.
BTW why was this thread on the front page with 1 upvote? I'm sure there's no funny business going on here lol.
>inb4 flagged
I stand by my opinion that if a major AI company says they aren't training on something, it means they aren't training on that thing.
I continue to be annoyed that they won't confirm what they ARE training on, though. Saying "we don't train on data submitted to our API" isn't exactly transparent; I want to know what they ARE training on.
That lack of transparency is why they have a trust crisis in the first place!