Extracting relevant information from resumes using Deep Learning.
The code runs successfully with:
OS: Ubuntu 18.04
python 3.6.5
numpy 1.13.3
pandas 0.23.0
keras 2.1.5
pdfminer
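
To compare a local setup against the versions listed above, a small check along these lines may help. This is a sketch, not part of the repository: the function name check_environment and the TESTED_VERSIONS table are made up for illustration, and it assumes each package exposes a __version__ attribute (packages that do not are reported as "unknown"). No version is listed for pdfminer, so only its presence is checked.

```python
import importlib

# Versions copied from the list above. pdfminer has no version listed,
# so None means "only check that it can be imported".
TESTED_VERSIONS = {
    "numpy": "1.13.3",
    "pandas": "0.23.0",
    "keras": "2.1.5",
    "pdfminer": None,
}


def check_environment(tested=None):
    """Return {package: (installed_version_or_None, tested_version)}.

    Hypothetical helper: installed_version is None when the package
    cannot be imported, and "unknown" when it has no __version__.
    """
    if tested is None:
        tested = TESTED_VERSIONS
    report = {}
    for name, wanted in tested.items():
        try:
            module = importlib.import_module(name)
            installed = getattr(module, "__version__", "unknown")
        except Exception:
            # ImportError is the usual case; some packages raise other
            # errors at import time (for example a missing backend).
            installed = None
        report[name] = (installed, wanted)
    return report
```

Calling check_environment() and printing each entry shows which packages are missing or differ from the tested versions. A mismatch does not necessarily mean the code will fail; these are simply the versions the code is known to run with.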
Getting training data is the most challenging part because no resume dataset is publicly available. Currently, you have to create the training data yourself by manually labeling the lines of each resume.
1. Put the resumes in the data/training_data folder. Currently only the .pdf and .docx formats are supported.
2. Run utils/create_training_data.py. A popup window will open; annotate the lines of each resume in it.
3. Run start_training.py.
4. The trained model will be saved in the trained_model folder.
5. Put the resumes to parse in the data/resume_to_parse folder. Only the .pdf and .docx formats are supported.
6. Run predict.py.
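
The steps above can be sketched as a single driver script. This is only an illustration under a few assumptions: the scripts are run from the repository root, they take no command-line arguments, and file extensions are matched case-insensitively (the repository itself may be stricter). The helper names supported_resumes, run_script and train_and_predict are hypothetical and do not exist in the repository.

```python
import subprocess
import sys
from pathlib import Path

# Formats the README says are supported.
SUPPORTED_EXTENSIONS = {".pdf", ".docx"}


def supported_resumes(folder):
    """Return the sorted .pdf/.docx files in `folder` (empty if it is missing)."""
    folder = Path(folder)
    if not folder.is_dir():
        return []
    return sorted(
        p for p in folder.iterdir()
        if p.is_file() and p.suffix.lower() in SUPPORTED_EXTENSIONS
    )


def run_script(script):
    """Run one of the repository scripts with the current Python interpreter."""
    # check=True raises CalledProcessError if the script exits with an error.
    subprocess.run([sys.executable, script], check=True)


def train_and_predict():
    """Steps 1-6 in order; assumes the current directory is the repository root."""
    if not supported_resumes("data/training_data"):
        raise SystemExit("No .pdf or .docx files found in data/training_data")
    run_script("utils/create_training_data.py")  # opens the annotation popup
    run_script("start_training.py")              # model is saved in trained_model
    if not supported_resumes("data/resume_to_parse"):
        raise SystemExit("No .pdf or .docx files found in data/resume_to_parse")
    run_script("predict.py")
```

Checking the folders first gives a clear message when a resume is missing or in an unsupported format, instead of a failure inside one of the scripts. The annotation step is interactive, so train_and_predict() waits until create_training_data.py exits.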