Skip to content

The most read book in the world is the Bible. It's actually a collection of 66 books. I've performed Topic Modelling on Bible data to divide the books into various topics.

License

Notifications You must be signed in to change notification settings

UpasanaAbhishek/BibleTopicModelling

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

This repo contains topic modelling done on all the books in the Bible data.

Data

This is the Bible corpus.

Source: https://www.kaggle.com/oswinrh/bible

It contains various versions of the Bible (Like KJV, BBE etc.). The user can choose which version would they like to use for the analysis. I have currently done it on BBE (Bible in Basic English) version. You can find more information on Bible versions here.

Pre-requisites

The requirements to run this code are minimal. Please install the packages in the requirements.txt file. The code is in a Jupyter Notebook.

Usage

After downloading/cloning this repo, open your terminal. Change directory to the root of the repo and run the following to see the Jupyter Notebook.

jupyter notebook BibleTopicModelling.ipynb

Files

The data files are in a folder called "bible". All other files are in the root.

Version

Python 3.7.1

Author

Upasana Parashar

Note: If you have inputs, feel free to share. :)

About

The most read book in the world is the Bible. It's actually a collection of 66 books. I've performed Topic Modelling on Bible data to divide the books into various topics.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published