arxiv EfficientOCR: An Extensible, Open-Source Package for Efficiently Digitizing World Knowledge