Uploading , Parsing and Filtering Documents #1620
asheeshmathur
started this conversation in
General
Replies: 1 comment
-
I think contents are extracted once it reached ES server just before pushing. Another option is to store the actual binay docs in Firestore etc. and establish a link between two, Regading Google programmable search engine may not work in my case. As those documents should have a reference on web page for it to crawl. Please advise. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hello,
Glad to post my queries in this forum, FSCrawler is a path breaking solution/ alternate to attachment plugin.
Hopefully elastic recognize and embed in its offering.
I plan to build a job search engine based on PDF/Doc/text/RTF CVs.
Seacrching contents and external attributes entered while uploading.
As suggested by David, we should filter relevant contents before uploading the extracted document to elastic search.
I could upload documents it's extracted contents & external attibutes via PHP.
My Queries:
Looking forward to a word from this community of experts.
Best Regards
Asheesh
Beta Was this translation helpful? Give feedback.
All reactions