Hacker Newsnew | past | comments | ask | show | jobs | submit | rester324's commentslogin

You appear to have written a lot. Without understanding anything.

"would put it to good use - including tackling the deterioration of water pipes"

There is your faith in action. Zero concrete promise, no accountability.


* In the culture that also produced this comment. This is not a universal problem, just a societies unable to produce a high trust environment problem.

You would be surprised to see how much the japanese IT industry is behind the times (a decade at least IMO). There is only a very limited startup culture here (both in size and talentpool and business ideas), there is no real risk taking venture capital market here (maybe Masayoshi Son is the exception here, but again he tends to invest in the US mostly) and most software companies use very very very outdated management practices. On top of that most software development had been/has been outsourced to India, Vietnam, China, etc, so management see no value in software talent... SW engineers' social recognition here are mostly on the level of accountants. Under such circumstances japan will never have a chance to contribute to AI meaningfully (other than niche academic research)

Seems like the Japanese have had this major blind spot in software engineering since the 90's. Even Sony didn't bother to use what they learned from the PlayStations to produce their own TV OS, outsourcing it to Google. It's as if the 5th generation stuff not working out just burned out that circuit in Japan entirely.

IMO we are way past beyond the point where these definitions are necessary. The consensus is that social media sites are objectively mentally harmful

I am curious. What do you want us to see in that github repo?


You can implement this yourself, who is stopping you?


Citation needed


I use iocaine[0] to generate a tarpit. Yesterday it served ~278k "pages" consisting of ~500MB of gibberish (and that's despite banning most AI scrapers in robots.txt.)

[0] https://iocaine.madhouse-project.org


Can't seem to access this.

It flashes some text briefly then gives me an 418 TEAPOT response. I wonder if it's because I'm on Linux?

EDIT: Begrudgingly checked Chrome, and it loads. I guess it doesn't like Firefox?


Doesn't work on my firefox either.

Friendly fire, I suppose.


Works on my Firefox. Mac and Linux


Nor Safari on iOS.


Works fine on my iOS Safari - maybe there's some extension that's tickling it just the wrong way?


It still fails with all of my extensions disabled (wipr, privacy redirect). I just get a download dialog. I don't know what the HTTP status code is, however.

I found a flagged HN submission about it and it has just about the same result for me and for others. My first tap failed in a weird way (showed some text then redirected quickly to its git repo) and all subsequent taps trigger a download.

https://news.ycombinator.com/item?id=44538010


Unfortunately and you kind of have to count this as the cost of the Internet. You've wasted 500Mb of bandwidth.

I've had colocation for eight years+. My monthly b/w cost is now around 20-30Gb a month given to scrapers where I was only be using 1-2Gb a month, years prior.

I pay for premium bandwidth (it's a thing) and only get 2TB of usable data. Do I go offline or let it continue?


> You've wasted 500Mb of bandwidth.

Yep, it sucks, but on the positive side, I'm feeding 500Mb of garbage into them every day and that feels like enough of a small win for me.

> My monthly b/w cost is now around 20-30Gb a month given to scrapers [...] 1-2Gb a month

That definitely sucks.

> Do I go offline or let it continue?

Might be time to start blocking entire IP ranges and ASNs and see if that helps.


i have no idea what this does because the site is rejecting my ordinary firefox browser with "Error code: 418 I'm a teapot". Even from a private browser.

If I hit it with Chrome, now I can see a site.

Seems pretty not ready for prime time as a lot of my viewers use Firefox


One of the most popular ones is Anubis. It uses a proof of work and can even do poisoning: https://anubis.techaro.lol/

They even mention iocaine. I know, inconceivable!: https://iocaine.madhouse-project.org/

There's also tons of HN posts on the topic with varying solutions:

https://news.ycombinator.com/item?id=45935729

https://news.ycombinator.com/item?id=45711094

https://news.ycombinator.com/item?id=44142761

https://news.ycombinator.com/item?id=44378127


Anubis is the only tool that claims to have heuristics to identify a bot, but my understanding is that it does this by presenting obnoxious challenges to all users. Not really feasible. Old school approaches like ip blocking or even ASN blocking are obsolete - these crawlers purposely spam from thousands of IPs, and if you block them on a common ASN, they come back a few days later from thousands of unique ASNs. So this is not really a "roll your own" situation, especially if you are running off the shelf software that doesn't have some straightforward means of building in these various approaches of endless page mazes (which I would still have to serve anyway).


https://forge.hackers.town/hackers.town/nepenthes

> Citation needed

this reply kinda sucks :)


So are you proud of yourself? Or why are you advertising your negligence?


"Move fast and break things"

I could spend an extra 5 minutes doing it "right" or I can get what I need done and have a 0.001% chance of there ever being a problem (since there are other security measure in place, like firewalls, api key rotation, etc.)

Even when security gaps are exploited, the fallout tends to be minimal. Companies that had their entire database of very sensitive information leaked are still growing users and at worst paid a tiny fine.


> Companies that had their entire database of very sensitive information leaked are still growing users and at worst paid a tiny fine.

Or end up bankrupt with criminal charges for CEO: https://yle.fi/a/74-20027665


Bankrupt? I didn't read about any financial penalties in that article. The board fired him back in 2020 when they found out, and then he blamed 2 IT people. Instead, he got 3 months suspended sentence (in a Finnish jail, which is not exactly like a US jail). The company still exists btw.


It got bankrupt in 2021 in an aftermath of the breach. I think they sold some of their operations forward before that.

The actual breach wasn’t that advanced hacking. They had copied their production data with all the patient information to test database which was publicly available and had default credentials.


How many times do you have to roll the dice with .001% of disaster before it strikes? How often are you using the tool in this way?


About 100,000 rolls on average.

About 2.7 rolls per day for 100 years


Or continue being an ass and kissing asses, and watch the workforce unionize and see how the people YOU disdain shows you who has the real power


Thanks for sharing, I found the blogpost hypocritical even without knowing about this


Buying a phone is a non-dispensable part of life today. There are some government services in many countries which are digital only (and phone only in particular), and restaurants, hotels, etc in the service industry which all require you having a phone, otherwise you can't use their services. And this trend is growing. So if you are the type who wants to live in a cave, or hang yourself on a tree rather than accepting that modern societies require a modern phone, thats's your choice. But others rather accept this. We are beyond the point where this trend can be reversed. On the other hand AI is not that integral part of people's lives yet, and it's better to protest now as long as it has an impact


> Buying a phone is a non-dispensable part of life today.

1. it absolutely is not. I have two friends who do not have a phone

2. even if you say you must have a phone to live you can buy ethical one :)


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: