OCR(Optical Character Recognition)

Optical character recognition (OCR) is sometimes referred to as text recognition. An OCR program extracts and repurposes data from scanned documents, camera images and image-only pdfs. OCR software singles out letters on the image, puts them into words and then puts the words into sentences, thus enabling access to and editing of the original content. It also eliminates the need for manual data entry.

Requirements

This project requires OpenCV and also Tesseract
to install OpenCV run

pip install opencv-python

to Download tesseract click

Tesseract(32 bit)

Tesseract(64 bit)

Screenshots

This is the input Image that we send to the Python script to perform OCR.

This is the output in the form of Image that we get while processing the Image.

This is the exact output that we get after completion of the program which is the image as a string.

Saving a Text File in Python
After learning about opening a File in Python, let’s see the ways to save it. Opening
a new file in write mode will create a file and after closing the file, the files get
saved automatically. However, we can also write some text to the file. Python
provides two methods for the same.

This is the Image as audio feature in use

saved_audio.mp4

Image Manipulation

Black-White Image
Rotate angle of the Image
Inverted Image
Remove Noice from Image
Remove Border from Image
Insert custom White Border to Image
Shrink Font Weight (from thick to thin)
Swell or Increase Font Weight (from thin to thick)
Boxing object in Image

Features

Image to Text
Image to Audio

🚀 About Me

I'm a Data Analyst and I am developing my skills towards Data Science and Machine Learning. I started with Basics of Python in my free time, messed with errors and bugs, took help from the internet, and sometimes also my friends helped me. I then cracked some competitive stuff, started developing my very own Optical Character Recognition (OCR) modal with python and Aha! I made one.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
.github		.github
__pycache__		__pycache__
data		data
temp		temp
README.md		README.md
border.py		border.py
bounding_boxes-index.py		bounding_boxes-index.py
bounding_boxes.py		bounding_boxes.py
capture_sidenotes.py		capture_sidenotes.py
data.txt		data.txt
detected.py		detected.py
display.py		display.py
font.py		font.py
main.py		main.py
noice.py		noice.py
saperate_footnotes.py		saperate_footnotes.py
saved_audio.mp4		saved_audio.mp4
saved_audio.wav		saved_audio.wav
skew.py		skew.py
sound.py		sound.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

OCR(Optical Character Recognition)

Requirements

Screenshots

Image Manipulation

Features

🚀 About Me

Authors

🔗 Links

Feedback

About

Releases

Sponsor this project

Packages

Languages

D-ENCODER/OCR-WITH-PYTHON

Folders and files

Latest commit

History

Repository files navigation

OCR(Optical Character Recognition)

Requirements

Screenshots

Image Manipulation

Features

🚀 About Me

Authors

🔗 Links

Feedback

About

Topics

Resources

Stars

Watchers

Forks

Releases

Sponsor this project

Packages 0

Languages

Packages