Analysis and visualisations of millions of data points from nonprofit organization DonorsChoose that allows individuals to donate directly to public school classroom projects.
The main focus of my analysis are Donors and their donations. Used technologies include Spark, Hadoop, Elasticsearch, Logstash and Kibana.
To reproduce this work you need to first download data from https://www.kaggle.com/donorschoose/io and unzip it into BIG/logtash/datasets/ folder. Then all you have to do is follow the instructions in documentation where you will also find visualiasations and project results. Sadly, the documentation is only in Czech language.
All Kibana objects (visualisations, dashboard) are in the kibana_objects.json file.
Gained maximum points obtainable.
- docker and docker-compose