Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Author here; well, PDFBox is good for simple text stripping. If I wanted to print all the text on the PDF, that would be very straightforward and not much code. However, the PDF chart here is in essence a representation of structured data. I wanted to get the content in that format so that I could both serialize to JSON plus have an SDK to boot.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: