Skip to content

Latest commit

 

History

History
24 lines (18 loc) · 725 Bytes

README.md

File metadata and controls

24 lines (18 loc) · 725 Bytes

linkedin pdf parsing

Parsing resumes in a PDF format from linkedIn. The script takes a folder with PDF files, goes through every one of them looking for Experience and Education sections, extracts all data that is found there and creates a database with following structure:

alt tag

Requirements

Python 2.7

PDFMiner

Usage

 script.py -i inputfolder -o outputfile

Script will search 'inputfolder' for PDF files and will create a database with 'outputfile' path.

Example usage:

python path/to/script.py -i home/mypdfs -o home/mydb.db