This repo contains topic modelling of all the books of the Bible.
This is the Bible corpus.
Source: https://www.kaggle.com/oswinrh/bible
It contains several versions of the Bible (such as KJV and BBE). You can choose which version to use for the analysis. The analysis here currently uses the BBE (Bible in Basic English) version. You can find more information on Bible versions here.
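As a rough illustration of how a version could be selected and turned into one document per book, here is a minimal Python sketch. It is not the notebook's code. It assumes the version files sit in the "bible" folder of this repo, are named t_<version>.csv (for example t_bbe.csv), and have a book-number column b and a verse-text column t. The helper names version_path and load_books are hypothetical, and the file and column names should be checked against the actual data.

```python
import csv
from collections import defaultdict
from pathlib import Path

# Assumption: the data folder is "bible" in the repo root, and each version
# is stored as t_<version>.csv (e.g. t_bbe.csv). Check the actual file names.
DATA_DIR = Path("bible")


def version_path(version, data_dir=DATA_DIR):
    """Return the assumed CSV path for a version code such as 'bbe' or 'kjv'."""
    return Path(data_dir) / f"t_{version.lower()}.csv"


def load_books(version, data_dir=DATA_DIR):
    """Group verse texts by book number, giving one document per book.

    Assumes the CSV has a header row with columns 'b' (book number)
    and 't' (verse text).
    """
    verses_by_book = defaultdict(list)
    with open(version_path(version, data_dir), newline="", encoding="utf-8") as f:
        for row in csv.DictReader(f):
            verses_by_book[int(row["b"])].append(row["t"])
    # Join the verses of each book into a single string, ordered by book number.
    return {book: " ".join(verses) for book, verses in sorted(verses_by_book.items())}
```

Under these assumptions, load_books("bbe") would return a dictionary mapping each book number to its full text, and switching to another version is a matter of passing a different code such as "kjv".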
The requirements to run this code are minimal. Install the packages listed in requirements.txt (for example, with pip install -r requirements.txt). The code is in a Jupyter Notebook.
After downloading or cloning this repo, open a terminal, change directory to the root of the repo, and run the following command to open the Jupyter Notebook:
jupyter notebook BibleTopicModelling.ipynb
The data files are in a folder called "bible". All other files are in the repo root.
Python 3.7.1
Upasana Parashar