Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Stream does not detect similar tables in the same document #156

Open
daniambrosio opened this issue Mar 4, 2022 · 1 comment
Open

Stream does not detect similar tables in the same document #156

daniambrosio opened this issue Mar 4, 2022 · 1 comment

Comments

@daniambrosio
Copy link

Using Lattice on this bank statement pdf results in no tables found. I thought using backgorund = True for Lattice would work, but no.
So I tried with Stream. And it works for some pages. For others, it gets messy. I mean, the text from the second column is returned inside the first column (the one with the dates).

This one works fine (print of the PDF)
image

Corresponding print of the extracted tables:
image

This one gets messy:
image

Corresponding print of the extracted tables:
image

@daniambrosio
Copy link
Author

Anyone would suggest any approach here on this case?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant