Cerner DevCon 2013 Higher Order Data Processing

This project serves as a companion to a presentation given at Cerner DevCon 2013. It gives an example of how to build natural processing workflows and pipelines for MapReduce using higher-order libraries and frameworks such as Oozie and Crunch. The same could be accomplished with alternative technologies such as Cascading and Azkaban 2.

The example takes you through some artificially created scenarios using video game data obtained from VgChartz.com and metacritic.com. It is not meant to be a model of finely tuned or highly efficient processing code; instead, it helps illustrate the natural way in which you can describe a problem with these tools rather than with plain MapReduce.