Overview | |
---|---|
Course | CS-449 Systems for Data Science |
Year | Spring 2021 |
Final Grade | 5.75 / 6 |
Languages | Scala, Apache Spark |
Team size | 1 |
This repository showcases the different milestones realized within the EPFL CS-449 Systems for Data Science class in which I achieved a final grade of 5.75 on a 1 to 6 scale. Throughout the project, we implemented recommender systems of increasing complexity in Apache Spark and Scala. The project is organized as a succession of instructions to implement and argue within the milstone's report. Thus it is not a standalone project but a demonstration of how I can use Scala and Spark to handle Big Data.
The source code of each milestone is located under MilestoneX/src/main/scala/
, for instance the predictions of the first Milestone are implemented in Predictor.scala.
A typical example of what the project consists of:
Instruction | |
---|---|
Implementation |
If you are interested in more details, you can find the instructions of the 3 milestones in instructions.pdf and the report of each milestone is named report.pdf
in its respective folder.