badlogic · 2026-01-21T11:04:27 1768993467

> differential rendering

Now where have I seen that before.

badlogic · 2026-01-21T00:55:38 1768956938

It is very 2026 that this exists for the product of a company that goes all in on vibe coding. Kudos for the creative solution.

donw · 2026-01-21T01:02:30 1768957350

I mentioned this to Claude and this was the response:

Ha! The irony is not lost on anyone.

"We've built the world's most advanced AI coding assistant. It can refactor entire codebases, debug complex issues, and ship production features autonomously. Anyway, here's a terminal bug that makes your screen look like a slot machine. We'll get to it eventually."

venturecruelty · 2026-01-21T03:17:57 1768965477

[flagged]

dang · 2026-01-22T03:54:23 1769054063

"Don't be curmudgeonly. Thoughtful criticism is fine, but please don't be rigidly or generically negative."

https://news.ycombinator.com/newsguidelines.html

Edit: unfortunately your account has done basically nothing except break the site guidelines. I've therefore banned it. If you've read https://news.ycombinator.com/newsguidelines.html, it should be obvious that this is not what HN is for, and destroys what it is for.

If you don't want to be banned, you're welcome to email hn@ycombinator.com and give us reason to believe that you'll follow the rules in the future.

p.s. I suppose I'd better add that no, this is not because of your views, it's because we're trying to preserve this place for thoughtful conversation. Accounts that pour snark and cynicism over everything have the effect that pouring salt on a slug has.

Yes, I'm comparing HN to a slug, and we'd rather be a happy one than a salted one.

badlogic · 2025-12-24T19:21:51 1766604111

Neat. Any reason why the MCP server doesn't expose a JavaScript/eval tool? Current models excel at writing JS to drive and inspect the DOM. They aren't great at driving browsers via screenshots.

coty · 2025-12-24T19:45:03 1766605503

FWIW, if you have Claude Code or the like, you can quickly prompt your way to an eval function in MCP. It already exists in clicker and the client API. You can use it to get the accessibility tree, for example, and use that to find what to fill out and click.

hugs · 2025-12-24T19:26:49 1766604409

> why the MCP server doesn't expose a JavaScript/eval tool?

no reason other than my #1 goal was "ship something". i only started the actual coding on dec 11. it's been a bit of a sprint the last two weeks!

though "image-based" vs "dom-based" testing approaches is a very big topic! (look forward to researching that more in the future.)

v1 announcement: https://github.com/VibiumDev/vibium/blob/main/docs/updates/2...

badlogic · 2025-12-13T01:39:07 1765589947

Create a single markdown file. For each SKILL.md of the skills you want to use, put its frontmatter in that file along with the full path to the SKILL.md file. On session start, tell Gemini to read that file. If you reference it in your AGENTS.md, you don't have to instruct Gemini each time. And if you keep your skills in a known folder, let Gemini write a small script that generates that markdown file for you.
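
For illustration, here is a minimal sketch of what such a generator script could look like. It assumes each SKILL.md starts with frontmatter fenced by `---` lines and that all skills live under one root folder; the function names and the output layout are hypothetical, not something any particular tool prescribes.

```python
from pathlib import Path


def read_frontmatter(skill_file: Path) -> str:
    """Return the raw text between the leading '---' fences, or '' if absent."""
    lines = skill_file.read_text(encoding="utf-8").splitlines()
    if not lines or lines[0].strip() != "---":
        return ""
    for i, line in enumerate(lines[1:], start=1):
        if line.strip() == "---":
            return "\n".join(lines[1:i])
    return ""  # opening fence without a closing one: treat as no frontmatter


def build_skill_index(skills_root: Path) -> str:
    """Collect the frontmatter and absolute path of every SKILL.md under skills_root."""
    sections = []
    for skill_file in sorted(skills_root.rglob("SKILL.md")):
        frontmatter = read_frontmatter(skill_file)
        if not frontmatter:
            continue  # skip files without frontmatter
        # Assumed layout: one heading per skill (its full path), frontmatter below.
        sections.append(f"## {skill_file.resolve()}\n\n{frontmatter}\n")
    return "# Available skills\n\n" + "\n".join(sections)
```

Writing the result of `build_skill_index(Path("skills"))` to a file and pointing AGENTS.md at it would give the agent every skill's description up front, while the full SKILL.md is only read when needed.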

badlogic · 2025-12-08T09:02:33 1765184553

Loved the fun write up. Now that we know that LLM-based vision is lossy, here's a different challenge:

Give the LLM access to the site's DOM and let it recreate the site with modern CSS. LLMs are much better with source code, aka text, right? :)

badlogic · 2025-11-22T02:12:11 1763777531

I can speak for the gov. site in my European home country: they too are buying GPUs for chat ...

badlogic · 2025-11-17T00:26:41 1763339201

Oh, I didn't intend this to come across as MCP being useless. I've written this from the perspective of someone who uses LLMs mostly for coding/computer tasks, where I found MCP to be less than ideal for my use cases.

I actually think MCP can be a multiplier for non-technical users, were it not for some nits like being a bit too technical and the various security footguns many MCP servers hand you.

ripley12 · 2025-11-17T14:18:53 1763389133

That makes sense to me, thanks for the clarification.

badlogic · 2025-11-17T00:09:41 1763338181

Also not disagreeing with your argument. Just want to point out that you can achieve the same by putting minimal info about your CLI tools in your global or project-specific CLAUDE.md.

The only downside here is that it's more work than `claude mcp add x -- npx x@latest`. But you get composability in return, as well as the intermediate tool outputs not having to pass through the model's context.

badlogic · 2025-10-21T18:16:44 1761070604

Yes, the only reason they are building a browser is to gobble up more data.

https://x.com/badlogicgames/status/1980698199649317287

ncr100 · 2025-10-22T16:36:53 1761151013

Yikes!

Market capture, again. :sigh: It's such a common motivator for digital (-adjacent) product decisions in business these days.

badlogic · 2025-10-21T16:00:43 1761062443

I run a few production RAG systems, some dating back to the end of 2023, and arrived at the same conclusions.

Query expansion and non-naive chunking give the biggest bang for the buck, with chunking being the most resource-intensive task, if the input data is chunk (pun intended).
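
The comment doesn't say which chunking strategy is meant. As one example of "non-naive" chunking, the sketch below packs whole paragraphs into chunks instead of cutting at a fixed character offset. The function name and the `max_chars` limit are assumptions made for illustration; real systems often split on headings, sentences, or tokens instead.

```python
def chunk_by_paragraph(text: str, max_chars: int = 1000) -> list[str]:
    """Pack whole paragraphs into chunks of at most max_chars where possible."""
    # Paragraphs are assumed to be separated by blank lines.
    paragraphs = [p.strip() for p in text.split("\n\n") if p.strip()]
    chunks: list[str] = []
    current = ""
    for para in paragraphs:
        candidate = f"{current}\n\n{para}" if current else para
        # A single oversized paragraph is kept whole rather than cut mid-sentence.
        if len(candidate) <= max_chars or not current:
            current = candidate
        else:
            chunks.append(current)
            current = para
    if current:
        chunks.append(current)
    return chunks
```

Compared with slicing every N characters, this keeps each chunk semantically coherent, at the cost of uneven chunk sizes.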