Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Slightly flip, but it's interesting that no one believes in or brags about cost savings via statistical sampling techniques these days.


well, I can save money by eating only lentils, but I prefer a richer diet. As do BI folks in a highly profitable company.


It's a terribly inelegant and inefficient solution that no one should be "proud" of. The only time you need N=all is for the general ledger.


> The only time you need N=all is for the general ledger.

If you're predicting for each user, you need all of the data.

And generally you probably wouldn't want to sample too much for BI as it could lead to people making wrong decisions.

But yeah, in general sampling rocks and is super effective.


Winning comment.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: