kaggle-titanic

This is an attempt to use an entropy-based decision tree to solve the Titanic challenge at Kaggle.com.

The implementation is a pretty standard one, with one exception: it uses a columnar dataset. The CSV is lazily loaded into columns instead of being read as a set of row vectors. Computing the entropy of a split then only needs to scan the attribute column and the label column, which makes entropy computation pretty fast.
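To make the columnar idea concrete, here is a minimal sketch in Python. It is not the Clojure code from this repository. The function names (`entropy`, `information_gain`), the dict-of-lists layout, and the toy rows are all assumptions made up for illustration. It shows how the entropy-based score for a candidate split can be computed by reading just two columns.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a sequence of class labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    return -sum((c / n) * log2(c / n) for c in Counter(labels).values())


def information_gain(columns, feature, target):
    """Entropy reduction from splitting on `feature`.

    `columns` maps a column name to a list of values (hypothetical layout).
    Only the feature column and the target column are touched.
    """
    feature_col = columns[feature]
    target_col = columns[target]

    # Group the target labels by the value of the feature.
    groups = {}
    for value, label in zip(feature_col, target_col):
        groups.setdefault(value, []).append(label)

    n = len(target_col)
    remainder = sum(len(g) / n * entropy(g) for g in groups.values())
    return entropy(target_col) - remainder


# Made-up toy data, not taken from the Kaggle CSV.
columns = {
    "sex": ["male", "female", "female", "male"],
    "pclass": [1, 1, 3, 3],
    "survived": [0, 1, 1, 0],
}

gain_sex = information_gain(columns, "sex", "survived")
gain_pclass = information_gain(columns, "pclass", "survived")
```

In this toy data `sex` separates the labels perfectly while `pclass` does not separate them at all, so a tree builder that picks the highest gain would split on `sex` first.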

Refer to `src/kaggle_titanic/example.clj` for an overview of how to use it.