Technologies Involved:
OpenCV
Spring Boot
MongoDB
Java
Project Description
  • Insta Doc is a platform that extracts data from identity documents such as the PAN Card, Voter ID Card, Aadhaar Card (an Indian identity card), Passport, and Driver License.
  • The platform provides an upload field for the document, alongside form fields such as Name and Date of Birth.
  • Once the user uploads a document, OCR runs on it and the fields mentioned above are filled in automatically from the scanned content.
  • The platform uses OpenCV to preprocess each image by cropping and aligning it, then sends the result to the Google Vision API, which extracts the OCR data from the image.
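The form-filling step described above can be illustrated with a minimal sketch. It is not Insta Doc's actual parser: the class name `IdFieldParser`, the field keys, and the assumed text layout (a "Name" label, a date printed as DD/MM/YYYY, and a PAN-style number of five letters, four digits, and one letter) are all assumptions made for illustration. Java is used because it appears under Technologies Involved.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** Hypothetical helper: pulls a few form fields out of raw OCR text from a PAN card. */
final class IdFieldParser {
    // A PAN is 10 characters: five letters, four digits, one letter.
    private static final Pattern PAN = Pattern.compile("\\b[A-Z]{5}[0-9]{4}[A-Z]\\b");
    // Assumption: dates are printed as DD/MM/YYYY.
    private static final Pattern DATE = Pattern.compile("\\b(\\d{2})/(\\d{2})/(\\d{4})\\b");
    // Assumption: a line starts with a "Name" label, optionally followed by ':' or '-',
    // and the value follows on the same line or the next one.
    private static final Pattern NAME =
            Pattern.compile("(?im)^\\s*Name\\b\\s*[:\\-]?\\s*(.+?)\\s*$");

    private IdFieldParser() {}

    /** Returns only the fields that were found; missing fields are left for manual entry. */
    static Map<String, String> parse(String ocrText) {
        Map<String, String> fields = new LinkedHashMap<>();
        Matcher m = NAME.matcher(ocrText);
        if (m.find()) {
            fields.put("name", m.group(1));
        }
        m = DATE.matcher(ocrText);
        if (m.find()) {
            // Assumption: the first date on the card is the date of birth.
            // Stored as YYYY-MM-DD.
            fields.put("dateOfBirth", m.group(3) + "-" + m.group(2) + "-" + m.group(1));
        }
        m = PAN.matcher(ocrText);
        if (m.find()) {
            fields.put("documentNumber", m.group());
        }
        return fields;
    }
}
```

Calling `IdFieldParser.parse(ocrText)` returns a map that the upload form could use to prefill its inputs. Other document types would need their own patterns, and real OCR output is usually noisier than this sketch assumes.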
Scope Of Work
  • User Interface (UI) Development:
    • Design and develop a user-friendly interface for document upload.
    • Implement fields such as Name and Date of Birth that users can also fill in manually if necessary.
  • OCR Integration:
    • Integrate OCR functionality to extract data from uploaded documents automatically.
    • Ensure accurate extraction of information from various types of identity documents.
  • OpenCV Integration:
    • Implement OpenCV for image preprocessing tasks.
    • Develop algorithms for image cropping and alignment to enhance OCR accuracy.
  • Google Vision API Integration:
    • Configure communication with the Google Vision API for data extraction.
    • Develop mechanisms to handle API requests and responses securely and efficiently.
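As a rough illustration of the Google Vision API item above, the sketch below builds the JSON body for the API's REST `images:annotate` method, which accepts a base64-encoded image and a list of requested features. The class name `VisionRequestBuilder` is hypothetical, and choosing `DOCUMENT_TEXT_DETECTION` over `TEXT_DETECTION` is an assumption. The project's real integration may instead use Google's client library.

```java
import java.util.Base64;

/** Hypothetical builder for the JSON body of a Vision API images:annotate request. */
final class VisionRequestBuilder {
    // REST endpoint for synchronous image annotation; HTTPS protects the upload in transit.
    static final String ENDPOINT = "https://vision.googleapis.com/v1/images:annotate";

    private VisionRequestBuilder() {}

    /** Wraps the preprocessed image bytes in a single-image annotate request. */
    static String buildBody(byte[] imageBytes) {
        // The standard Base64 alphabet needs no JSON escaping.
        String content = Base64.getEncoder().encodeToString(imageBytes);
        return "{\"requests\":[{"
                + "\"image\":{\"content\":\"" + content + "\"},"
                // Assumption: dense document OCR is wanted.
                + "\"features\":[{\"type\":\"DOCUMENT_TEXT_DETECTION\"}]"
                + "}]}";
    }
}
```

The resulting string would be sent as the body of a POST to `ENDPOINT` with the project's credentials attached. In the response, the recognized text is typically read from `responses[0].fullTextAnnotation.text`. Credentials should stay on the server and not be embedded in the frontend.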
Our Solution

Insta Doc combines OCR and OpenCV to extract data from the PAN Card, Voter ID Card, Aadhaar Card, Passport, and Driver License. When a user uploads a document, OCR runs and extracts fields such as Name and Date of Birth. OpenCV first preprocesses the image by cropping and aligning it, which improves accuracy, and the processed image is then sent to the Google Vision API for OCR data extraction. The backend, written in Python, integrates the Tesseract OCR and OpenCV libraries. Requests to the Google Vision API are sent over HTTPS to protect data in transit. The frontend is built with HTML, CSS, and JavaScript and handles document upload and display. The platform is deployed on cloud infrastructure for scalability, with CI/CD pipelines automating deployment. Together, these pieces provide efficient and accurate data extraction and a smoother user experience in identity verification.

Related Projects