C#C
C#3y ago
Alex Frost

❔ Adding OCR text to PDF file

I have PDF files with scanned pages. I want to keep source image without resizing it, performing OCR and adding OCR text to the file.
What I have for now is, I use PdfSplitter which converts pages to images, upscaling them, then tesseract to OCR the images and construct a PDF file.
Due to upscaling, the resulting file gains significant size and if OCR is performed multiple times due to un-desired results, the file keeps gaining size.
Was this page helpful?