Asynchronous PDF to CSV Extraction

This project enables the extraction of data from PDF files and converts them into CSV format. It utilizes AWS Textract, AWS Lambda, and AWS S3 services to automate the extraction process.

Project Overview

The goal of this project is to extract data from PDF files and save it in CSV format for easy analysis and further processing. The extraction is performed asynchronously using AWS Textract, which accurately extracts text and data from scanned documents.

Usage

Upload PDF Files: Place the PDF files in the designated location.
Extraction Process: The system will automatically trigger the extraction process.
CSV Output: Extracted data will be saved in CSV format for each PDF file.
Accessing Results: Retrieve the CSV files for further analysis and use.

Requirements

AWS Account with necessary permissions.
AWS CLI installed and configured.
Python installed for deploying AWS Lambda functions.

Resources

AWS Textract: Service for extracting data from scanned documents.
AWS Lambda: Serverless compute service to run code without managing servers.
AWS S3: Scalable object storage service for storing data.

Contributing

Contributions are welcome! If you encounter issues or have suggestions, please open an issue on the GitHub repository.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
README.md		README.md
Sample csv output desired.csv		Sample csv output desired.csv
Sample input.pdf		Sample input.pdf
Statement of Work.docx		Statement of Work.docx
lambda function.py		lambda function.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Asynchronous PDF to CSV Extraction

Project Overview

Usage

Requirements

Resources

Contributing

About

Releases

Packages

Languages

beladiyaraj/csv-from-pdf-using-aws

Folders and files

Latest commit

History

Repository files navigation

Asynchronous PDF to CSV Extraction

Project Overview

Usage

Requirements

Resources

Contributing

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages