The notebook of Preprocessor class #60

priyanshi-git · 2022-08-12T05:32:09Z

I am facing a different issue now i.e., although I installed ocred package the Preprocessor is not being imported it shows an ImportError

codecov · 2022-08-12T05:48:53Z

Codecov Report

Merging #60 (23ba713) into main (420aa7a) will decrease coverage by 0.08%.
The diff coverage is n/a.

❗ Current head 23ba713 differs from pull request most recent head c3e778a. Consider uploading reports for the commit c3e778a to get more accurate results

@@            Coverage Diff             @@
##             main      #60      +/-   ##
==========================================
- Coverage   97.09%   97.00%   -0.09%     
==========================================
  Files           3        3              
  Lines         172      167       -5     
==========================================
- Hits          167      162       -5     
  Misses          5        5

Impacted Files	Coverage Δ
ocred/ocr.py	`95.14% <0.00%> (-0.23%)`	⬇️

📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more

Saransh-cpp · 2022-08-12T15:31:00Z

@priyanshi-git can you try !pip install -U ocred --no-cache --force-reinstall?

priyanshi-git · 2022-08-12T16:10:02Z

@priyanshi-git can you try !pip install -U ocred --no-cache --force-reinstall?

Yes this worked. Now the issue of displaying the result persists. The code shows the output as True and do not display the preprocessed.png image on its own.

Saransh-cpp · 2022-08-13T08:18:37Z

Nice!! Page.png is already preprocessed; hence, I think the Preprocessor class won't change it much. Could you try using a different image? Perhaps one of the Cosmos ones.

Additionally, the notebook should be broken up into smaller cells -

preprocessed = Preprocessor("/content/Images/Page.png")

# scan the image and copy the scanned image
scanned = preprocessed.scan(inplace=True)
orig = scanned.copy()

# display scanned image here

# remove noise
noise_free = preprocessed.remove_noise(
    inplace=True, overriden_image=scanned
)

# display noiseless image here

# thicken the ink to draw Hough lines better
thickened = preprocessed.thicken_font(
    inplace=True, overriden_image=noise_free
)

# display thickened image here

and so on...

priyanshi-git · 2022-08-13T16:47:27Z

Hey !! I made the changes and everything seems fine except the last step and I don't understand why that's happening. Let me know what can be done

Saransh-cpp · 2022-08-14T10:47:12Z

The remove_noise call outputs the original image (which it should not) as the return value of the method was wrong. This has been fixed here - #62.

I will make a new release today, which should fix it in the PyPI version,

Saransh-cpp · 2022-08-14T10:48:09Z

@all-contributors please add @priyanshi-git for bug

allcontributors · 2022-08-14T10:48:17Z

@Saransh-cpp

I've put up a pull request to add @priyanshi-git! 🎉

Saransh-cpp · 2022-08-14T10:51:23Z

Merge conflict: @all-contributors please add @priyanshi-git for bug

allcontributors · 2022-08-14T10:51:25Z

@Saransh-cpp

@priyanshi-git already contributed before to bug

Saransh-cpp · 2022-08-14T10:55:29Z

#65

priyanshi-git · 2022-08-15T06:31:36Z

Hey @Saransh-cpp I guess its working perfectly now

Saransh-cpp · 2022-08-16T10:26:58Z

Looks good now! Could you convert this to a markdown file and upload it in this branch? Just copy-pasting the Jupyter notebook content, would be better for the reviews!

Converted the .ipynb to .md

…e-ex

priyanshi-git · 2022-08-17T13:49:26Z

Hey @Saransh-cpp, I created the notebook for examples of OCR, but the output of signboard doesn't look fine to me, some characters are wrong, the same was happening with the invoice example as well.

Saransh-cpp · 2022-08-17T17:56:23Z

Hey @Saransh-cpp, I created the notebook for examples of OCR, but the output of signboard doesn't look fine to me, some characters are wrong, the same was happening with the invoice example as well.

Some characters will always be wrong, don't worry about them. I'll try to find a workaround for the failing check.

Saransh-cpp

Thank you for working on this, @priyanshi-git! Apologies for the late review, and the large review :) The overall structure of the example looks good! I have addressed some changes below -