OCR Reader & Translator

Welcome to the OCR Reader & Translator, an innovative full-stack application designed to extract text from images and PDFs using Optical Character Recognition (OCR), detect code snippets, and translate content across multiple languages. This project combines a sleek React-based frontend with a robust Flask-powered backend, offering a seamless user experience for processing and translating text.

Overview

This project is a powerful tool for digitizing documents and translating terms effortlessly. The frontend provides an intuitive interface with features like text input, file uploads, image pasting, and theme switching, while the backend handles advanced OCR and translation using state-of-art models like TrOCR and M2M100. This project is a subcomponent of another project, available at https://github/deoanshdeo/Project-starts.

Tech Stack
- Frotend: React, Tailwind CSS, Axios, React Icons, React Transition Group
- Backend: Python, Flask, Transformers, EasyOCR, pytesseract, OpenCV
Key Features:
- Multi-engine OCR (Tesseract, EasyOCR, TrOCR)
- Code detection and extraction from Image
- Multilingual translation (Englisj, Hindi, French, Spanish, etc.)
- Drag-and-drop and paste image support
Status:
- The project is developed. Now I am working on improving it.

Project Structure

The project is organized as follows:

OCR_Reader/
- backend/
  - app/
    - __init__.py (Flask application factory)
    - ocr.py (OCR processing logic)
    - routes.py (API route definitions)
    - translate.py (Translation logic)
    - main.py (Entry point)
  - README.md (Backend documentation)
  - requirements.txt (Backend dependencies)
- frontend/
  - public/
    - index.html (Main HTML file)
    - manifest.json (PWA configuration)
  - src/
    - components/
      - Form.js (Main form component)
      - Popup.js (Result popup component)
      - ThemeSwitch.js (Theme toggle component)
    - App.js (Main app component)
    - App.test.js (Test file)
    - index.css (Custom CSS)
    - index.js (Entry point)
  - package.json (Frontend dependencies)
  - tailwind.config.js (Tailwind CSS configuration)
- README.md (This file!)

Getting Started

Follow these steps to set up and run the project locally.

Prerequisites:
- Node.js and npm (for frontend)
- Python 3.9+ (Setting up Anaconda environment will be a much more viable option.)
- Tesseract OCR installed.
  - Set path in backend/app/ocr.py
  - E.g., pytesseract.pytesseract.tesseract_cmd = r'/usr/bin/tesseract'
Installation:
1. Clone the Repository
```
https://github.com/deoanshdeo/OCR_Reader.git
```
2. Download and install the Tesseract OCR for the image-extraction feature
  - You can make use of the following steps for this:
  - Update package lists:
```
sudo apt update
```
  - Install Tesseract OCR core:
```
sudo apt install tesseract-ocr
```
    This installs the main Tesseract OCR engine, which provides the core functionality for optical character recognition.
  - Install language data packages.
```
sudo apt install tesseract-ocr-eng
sudo apt install tesseract-ocr-hin
```
    These commands install specific language data for Tesseract:
    - tesseract-ocr-eng: English language support
    - tesseract-ocr-hin: Hindi language support You can install these simultaneously with:
      sudo apt install tesseract-ocr-eng tesseract-ocr-hin
  - Verify Installation
```
 tesseract --version
```
    This confirms that Tesseract is installed and shows you the version number.
  - Locate the Tesseract executable:
```
 which tesseract
```
    This shows the full path to the Tesseract executable (usually /usr/bin/tesseract), which can be useful when configuring applications to use Tesseract.
3. Set Up the Backend
  - Navigate to the backend directory
```
cd backend
```
  - Install the required Python dependencies:
```
pip install -r requirements.txt
```
  - Run the backend server:
```
python main.py
```
    The backend should now be running at http://0.0.0.0:5000
4. Setting Up the Frontend
  - Move to the frontend directory
```
cd ../frontend
```
  - Install the frontend dependencies
```
npm install
```
  - Start the development server
```
npm start
```
5. Access the App
  - Open http://localhost:3000 in your browser to start using the app

Features

Text Extraction: Extract text from images and PDFs using multiple OCR engines.
Code Detection: Identify and extract code snippets with specialized preprocessing.
Multilingual Support: Proces and translate text in languages like English, Hindi, French, and Spanish.
Image Handling: Upload files, drag-and-drop images, or paste images directly.
Translation: Translate text using the M2M100 model with auto-detection or manual language selection.
User Experience:
- Light/Dark theme with a particle background effect.
- Responsive design with a glass-morphism and neon glow effects.
- Copy-to-clipboard functionality for results.

Contact

Author: Deoansh Deo
Email: deoanshdeo@gmail.com
GitHub: github.com/deoanshdeo
LinkedIn: linkedin.com/in/deoansh-deo-b0456922a

Acknowledgements

Thanks to the open-source community for tools like React, Flask, Transformers, and Tailwind CSS, Hugginface and the Tesseract-OCR.
Inspired by the need for efficient, multilingual document processing and translation solutions.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

OCR Reader & Translator

Overview

Tech Stack

Key Features:

Status:

Project Structure

OCR_Reader/

backend/

frontend/

public/

src/

Getting Started

Prerequisites:

Installation:

Features

Contact

Acknowledgements

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
backend		backend
frontend		frontend
README.md		README.md

deoanshdeo/OCR_Reader

Folders and files

Latest commit

History

Repository files navigation

OCR Reader & Translator

Overview

Tech Stack

Key Features:

Status:

Project Structure

OCR_Reader/

backend/

frontend/

public/

src/

Getting Started

Prerequisites:

Installation:

Features

Contact

Acknowledgements

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages