2 tools
A toolkit for training language models to work with PDF documents in the wild. Online demo: https://olmocr.allenai.org/
Comes with a highly efficient offline OCR engine. On sufficiently powerful hardware, it can be faster than online OCR services.