fagnerbrack 41 minutes ago
Conveniently, that's also roughly where the cost math stops working for a free tool. Scanned PDFs are best-effort OCR. Multi-page tables spanning sheets are still a weak spot.
Here's a link you can check:
https://people.math.harvard.edu/~ctm/home/text/others/shanno...
Feel free to try with your own PDF links to see what breaks, it will help improving the crawl logic and the parser (I still need to get some rate limits up)