A simple desktop app that converts scanned/image PDFs to searchable text PDFs.
- 📄 Convert multiple PDFs at once
- 🌍 Multi-language OCR support (English, Arabic, German, French, Spanish, Chinese, Japanese)
- ⏭️ Skip pages that already have text
- 🎨 Modern dark-themed interface
📥 Download PDF_OCR_Converter.exe
Note: Requires Tesseract OCR to be installed. The app will prompt you if Tesseract is not detected.
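The detection step could look roughly like the sketch below. It is an illustration, not the app's actual code: the helper name `find_tesseract` is hypothetical, and the fallback path is only the assumed default location used by the Windows installer.

```python
import shutil
from pathlib import Path
from typing import Optional

# Assumed default install location of the Windows installer;
# adjust it if you chose a different path during setup.
ASSUMED_WINDOWS_PATH = Path(r"C:\Program Files\Tesseract-OCR\tesseract.exe")


def find_tesseract(fallback: Path = ASSUMED_WINDOWS_PATH) -> Optional[str]:
    """Return the path to the tesseract executable, or None if not found."""
    on_path = shutil.which("tesseract")  # searches the PATH environment variable
    if on_path:
        return on_path
    if fallback.is_file():
        return str(fallback)
    return None


if __name__ == "__main__":
    location = find_tesseract()
    print(location or "Tesseract not detected; install it and add it to PATH.")
```

If this prints a path, Tesseract is reachable from Python; otherwise follow the installation steps for your platform.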
Windows:
- Download installer from: https://github.com/UB-Mannheim/tesseract/wiki
- Run the installer and note the installation path
- Add Tesseract to your PATH environment variable
Or use Chocolatey:
choco install tesseract
Install the Python dependencies:
pip install -r requirements.txt
For Arabic support:
# The Windows installer includes language selection
# Or download language files from: https://github.com/tesseract-ocr/tessdata-
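To confirm that a language pack such as Arabic (`ara`) is actually installed, you can ask the Tesseract CLI with `tesseract --list-langs`. The following is a minimal sketch under the assumption that `tesseract` is on your PATH; the helper names `parse_langs` and `installed_langs` are hypothetical and not part of the app.

```python
import subprocess
from typing import List


def parse_langs(output: str) -> List[str]:
    """Extract language codes from `tesseract --list-langs` output."""
    lines = [line.strip() for line in output.splitlines() if line.strip()]
    # The first line is a header such as "List of available languages (3):".
    return [line for line in lines if not line.lower().startswith("list of")]


def installed_langs(tesseract_cmd: str = "tesseract") -> List[str]:
    """Ask the Tesseract CLI which language packs are installed."""
    result = subprocess.run(
        [tesseract_cmd, "--list-langs"],
        capture_output=True,
        text=True,
        check=True,
    )
    # Some Tesseract versions print the list to stderr instead of stdout.
    return parse_langs(result.stdout or result.stderr)
```

Calling `installed_langs()` and checking `"ara" in installed_langs()` tells you whether Arabic OCR is available before you start a conversion.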
Run the app:
python ocr_app.py
- Click "Select PDF Files" to choose your PDFs
- Select the OCR language
- Click "Convert to Searchable PDF"
- Find output files with the _ocr suffix in the same folder
The converted files are saved in the same directory as the input with _ocr appended to the filename:
document.pdf → document_ocr.pdf
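The naming rule can be expressed in a few lines of Python. This is a sketch of the rule as described here, not the app's own code; the function name `ocr_output_path` is hypothetical.

```python
from pathlib import Path


def ocr_output_path(input_pdf: str) -> Path:
    """Return the sibling path with "_ocr" appended to the file stem."""
    source = Path(input_pdf)
    # with_name keeps the parent directory and swaps only the filename.
    return source.with_name(f"{source.stem}_ocr{source.suffix}")


print(ocr_output_path("document.pdf"))  # document_ocr.pdf
```

Because only the filename changes, the output always lands in the same directory as the input.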
