Bandit Problem A study on Contextual Multi-armed bandit problem, simulated by offline evaluation Members K.Zhou X.Sun