BIG

Semestral project for Database Technologies for Big Data course (2018/19)

Visualisations

About project

Analysis and visualisations of millions of data points from DonorsChoose, a nonprofit organization that lets individuals donate directly to public school classroom projects.
The analysis focuses mainly on donors and their donations. The technologies used include Spark, Hadoop, Elasticsearch, Logstash and Kibana.

Reproducibility

To reproduce this work, first download the data from https://www.kaggle.com/donorschoose/io and unzip it into the BIG/logtash/datasets/ folder. Then follow the instructions in the documentation, which also contains the visualisations and project results. The documentation is available only in Czech.
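
The unzip step can also be scripted. The following is a minimal sketch, assuming the Kaggle download has been saved as a single zip archive; the helper name `extract_dataset` and the archive name `io.zip` are hypothetical and not part of this repository.

```python
import zipfile
from pathlib import Path


def extract_dataset(archive: Path, target_dir: Path) -> list[str]:
    """Extract every member of `archive` into `target_dir`.

    Returns the sorted member names so the caller can check what was unpacked.
    """
    # Create the target folder (and any missing parents) if it is not there yet.
    target_dir.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(archive) as zf:
        zf.extractall(target_dir)
        return sorted(zf.namelist())


# Example call (the archive name is an assumption; use the file you downloaded):
# extract_dataset(Path("io.zip"), Path("BIG/logtash/datasets"))
```

The target path mirrors the folder named above, so the remaining steps in the documentation should find the files where they expect them.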

All Kibana objects (visualisations, dashboard) are in the kibana_objects.json file.

Overall result

The project received the maximum number of points obtainable.

Dependencies:

  • docker and docker-compose