✨ This document for OCR ✨
Problem:
Extract information from image of Personal Account Number(PAN) Card
by OCR in proper format[Standard according Indian Govt.].
Information like -
Name, Father's Name, Date of Birth, PAN
Solution:
Steps:
-> Take image
-> crop to box(which has text in it)
-> convert into gray scale(mono crome)
-> give to tesseract
-> text(output of tesseract)
Now we will process this text means we will get meaningful information from it.
-> find name using name database
-> find father's name(assuming that second will be father's name)
-> find year of birth
-> find for PAN
Dependent packages
-python
-opencv
-numpy
-pytesseract
-JSON
-difflib
-csv
-PIL
-SciPy
-dataparser
Structure and Usage
Directories:
src-
which contains code files
testcases-
which contains testing images
result
it contains JSON object
Usage:
python file_name.py [input image]
Output will be JSON object name
💯