Invoice PDF to JSON output
Has anyone come across a library that extracts information from invoices (PDF input, JSON output)? Something like https://mindee.com/, but open source?
3 Replies
Nope... Only something like Tesseract, but that's a general-purpose OCR engine.
I haven't used it personally, but I found pdf2json, which seems to be regularly maintained:
https://github.com/modesty/pdf2json
Have you tried it?
modesty/pdf2json on GitHub: A PDF file parser that converts PDF binaries to text-based JSON, powered by a fork of PDF.JS.
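Note that a general text extractor only gets you the raw text; turning that into invoice fields is a separate step. Here is a rough sketch of that step in Python, assuming the text has already been pulled out of the PDF by some tool. The labels ("Invoice No", "Date", "Total"), the regex patterns, the `extract_invoice_fields` helper, and the sample text are all made up for illustration; real invoices vary a lot, so treat this as a starting point, not a working parser.

```python
import json
import re

# Hypothetical label patterns; real invoices differ by vendor and locale.
# The leading \b keeps "Total" from matching inside "Subtotal".
FIELD_PATTERNS = {
    "invoice_number": re.compile(
        r"\bInvoice\s*(?:Number|No\.?|#)\s*:\s*([A-Z0-9-]+)", re.IGNORECASE
    ),
    "date": re.compile(r"\bDate\s*:\s*(\d{4}-\d{2}-\d{2})", re.IGNORECASE),
    "total": re.compile(
        r"\bTotal\s*:\s*\$?\s*([0-9]+(?:\.[0-9]{2})?)", re.IGNORECASE
    ),
}


def extract_invoice_fields(text: str) -> dict:
    """Return the first match for each field, or None if the label is absent."""
    fields = {}
    for name, pattern in FIELD_PATTERNS.items():
        match = pattern.search(text)
        fields[name] = match.group(1) if match else None
    return fields


# Made-up sample standing in for OCR or PDF-parser output.
sample_text = """ACME Ltd
Invoice No: INV-2024-001
Date: 2024-03-15
Subtotal: 100.00
Total: 120.00
"""

print(json.dumps(extract_invoice_fields(sample_text), indent=2))
```

Regexes like these break quickly on scanned or multi-column layouts, which is probably why hosted services exist for this.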
Not yet, but I will, thanks!