
Arguably Spark solves a problem that no longer exists: single-node performance with tools like DuckDB and Polars is now so good that more complex orchestration is unnecessary, and these tools are user-friendly enough that there is little point in switching to Pandas for smaller datasets.



