IDparser is a modern web application that extracts key information from identification documents (passports and driver's licenses) using OCR technology. It provides a clean, user-friendly interface for document uploads and displays extracted information including names, document numbers, and expiration dates.
- 📷 Image upload and preview
- 🔍 OCR processing using Google Cloud Vision API
- 📝 Extracts key information:
- Name
- Document Number
- Expiration Date
- ⚡ Real-time processing
- 🎯 Confidence scoring for extracted data
- 🖼️ Image preprocessing for better accuracy
- Node.js with Express
- TypeScript
- Google Cloud Vision API
- Sharp for image preprocessing
- Multer for file uploads
- React 18
- TypeScript
- Vite
- Tailwind CSS
- Lucide React for icons
- Node.js (v16 or higher)
- Google Cloud Vision API credentials
- npm or yarn
- Clone the repository:
git clone https://github.com/SxxAq/IDparser.git cd idparser - Install backend dependencies:
cd backend
npm install
- Install frontend dependencies:
cd ../frontend
npm install
-
Set up environment variables:
-
Create a
.envfile in the backend directory -
Add your Google Cloud Vision API credentials
-
Configure other necessary environment variables
- Start the backend server:
cd backend
npm start
- Start the frontend development server:
cd frontend
npm run dev
The application will be available at:
- Frontend: http://localhost:5173
- Backend API: http://localhost:3000
Uploads and processes an identification document.
Request:
- Method: POST
- Content-Type: multipart/form-data
- Body: Form data with 'document' field containing the image file
Response:
{{
"success": true,
"data": {
"name": "John Doe",
"documentNumber": "ABC123456",
"expirationDate": "2025-12-31"
},
"confidence": "high"
}}
- Open the application in your web browser.
- Click on the "Upload a file" button or drag and drop an image of a passport or driver's license.
- Once the image is uploaded, click on "Extract Information".
- The application will process the image and display the extracted information.
Contributions are welcome! Please follow these steps:
- Fork the repository
- Create your feature branch (
git checkout -b feature/AmazingFeature) - Commit your changes (
git commit -m 'Add some AmazingFeature') - Push to the branch (
git push origin feature/AmazingFeature) - Open a Pull Request
This project is licensed under the MIT License - see the LICENSE file for details.
- Google Cloud Vision API for OCR processing
- The React and Vite communities for excellent documentation
- All contributors who help improve this project
If you encounter any issues or have questions, please file an issue on the GitHub repository.