PDF OCR Converter

A simple desktop app that converts scanned/image PDFs to searchable text PDFs.

Features

📄 Convert multiple PDFs at once
🌍 Multi-language OCR support (English, Arabic, German, French, Spanish, Chinese, Japanese)
⏭️ Skip pages that already have text
🎨 Modern dark-themed interface

Download

📥 Download PDF_OCR_Converter.exe

Note: Requires Tesseract OCR to be installed. The app will prompt you if Tesseract is not detected.

Installation (For Developers)

1. Install Tesseract OCR

Windows:

Download the installer from: https://github.com/UB-Mannheim/tesseract/wiki
Run the installer and note the installation path.
Add the Tesseract installation folder to your PATH environment variable.

Or use Chocolatey:

choco install tesseract
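
To confirm that Tesseract is reachable after installation, a check along the following lines may help. This is a minimal sketch, not the detection logic used by ocr_app.py (which this README does not describe); the function name `find_tesseract` and the `extra_candidates` parameter are hypothetical.

```python
import shutil
from pathlib import Path
from typing import Iterable, Optional


def find_tesseract(extra_candidates: Iterable[str] = ()) -> Optional[str]:
    """Return the path to the tesseract executable, or None if it is not found.

    The PATH is searched first. `extra_candidates` is a hypothetical hook for
    passing the installation path you noted while running the installer.
    """
    on_path = shutil.which("tesseract")
    if on_path:
        return on_path
    for candidate in extra_candidates:
        # Accept only candidates that point at an existing file.
        if Path(candidate).is_file():
            return str(Path(candidate))
    return None
```

Calling `find_tesseract()` with no arguments returns `None` when Tesseract is not on your PATH, which usually means the PATH step above still needs to be done.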

2. Install Python Dependencies

pip install -r requirements.txt

3. Install Language Packs (Optional)

For Arabic support, select Arabic in the Windows installer's language selection step, or download the language files from: https://github.com/tesseract-ocr/tessdata
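
Tesseract stores each language as a `<code>.traineddata` file in its tessdata folder, so you can check which packs are present before converting. The sketch below assumes a mapping from the language names listed under Features to Tesseract language codes. The mapping, the function name `missing_language_packs`, and the choice of `chi_sim` (Simplified Chinese) for "Chinese" are assumptions, not taken from ocr_app.py.

```python
from pathlib import Path
from typing import Iterable, List

# Assumed mapping from the languages named in this README to Tesseract codes.
# "Chinese" is mapped to Simplified Chinese here; use "chi_tra" for Traditional.
TESSERACT_LANG_CODES = {
    "English": "eng",
    "Arabic": "ara",
    "German": "deu",
    "French": "fra",
    "Spanish": "spa",
    "Chinese": "chi_sim",
    "Japanese": "jpn",
}


def missing_language_packs(tessdata_dir: str, languages: Iterable[str]) -> List[str]:
    """Return the languages whose .traineddata file is absent from tessdata_dir."""
    tessdata = Path(tessdata_dir)
    missing = []
    for name in languages:
        code = TESSERACT_LANG_CODES[name]
        if not (tessdata / f"{code}.traineddata").is_file():
            missing.append(name)
    return missing
```

Pass the tessdata folder inside your Tesseract installation path as `tessdata_dir`. Any language returned needs its file downloaded from the tessdata repository linked above.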

Usage

1. Run the app:
```
python ocr_app.py
```
2. Click "Select PDF Files" to choose your PDFs.
3. Select the OCR language.
4. Click "Convert to Searchable PDF".
5. Find the output files, named with an _ocr suffix, in the same folder as the inputs.

Output

Converted files are saved in the same directory as the input, with _ocr appended to the filename before the .pdf extension:

document.pdf → document_ocr.pdf
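
The naming rule can be sketched with `pathlib` as follows. This only illustrates the convention described here; the function name `ocr_output_path` is hypothetical and may not match how ocr_app.py builds the path.

```python
from pathlib import Path


def ocr_output_path(input_pdf: str) -> Path:
    """Return the output path: same folder, with _ocr added before the extension."""
    source = Path(input_pdf)
    # with_name replaces only the final path component, so the folder is kept.
    return source.with_name(f"{source.stem}_ocr{source.suffix}")
```

For example, `ocr_output_path("document.pdf")` yields `document_ocr.pdf` in the same directory as the input.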
