NanoNets/docext
stableAn on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)
documentdocument-analysisdocument-data-extractiondocument-information-extractionextractionllm-ocrllmsmachine-learning