Reward Imputation with Sketching for Contextual Batched Bandits

This repository is the official implementation of Reward Imputation with Sketching for Contextual Batched Bandits.

This paper was accepted by NeurIPS 2023.

Requirements

To install requirements:

pip install -r requirements.txt

To train any model from the paper except DFM-S, run the following command, replacing Algorithm_name with the name of the algorithm:

python algo_main.py Algorithm_name

To train DFM-S, run this command:

python algo_main2.py

We recommend tuning hyper-parameters with the nni module, which is what we used in our experiments.
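As a rough illustration of how a training script typically talks to nni, the sketch below reports one average reward per episode as an intermediate result and the overall mean as the final result. It is not the code in algo_main.py: the function `run_trial`, the `learning_rate` parameter, and the random placeholder reward are all hypothetical. Only `nni.get_next_parameter`, `nni.report_intermediate_result`, and `nni.report_final_result` are real nni calls, and the script falls back to running without them if nni is not installed.

```python
import random

try:
    import nni  # present when the script is launched by an NNI experiment
except ImportError:
    nni = None  # run without reporting if nni is not installed


def run_trial(num_episodes=5, seed=0):
    """Run hypothetical episodes and report each episode's average reward."""
    # Hypothetical default hyper-parameter; NNI may override it.
    params = {"learning_rate": 0.1}
    if nni is not None:
        params.update(nni.get_next_parameter() or {})

    rng = random.Random(seed)
    rewards = []
    for _ in range(num_episodes):
        # Placeholder for the real average reward of one episode.
        avg_reward = rng.random() * params["learning_rate"]
        rewards.append(avg_reward)
        if nni is not None:
            nni.report_intermediate_result(avg_reward)

    final_reward = sum(rewards) / len(rewards)
    if nni is not None:
        nni.report_final_result(final_reward)
    return rewards, final_reward


if __name__ == "__main__":
    print(run_trial())
```

Reporting per-episode values as intermediate results is what makes them available to the `--intermediate` export option later on.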

The code computes the average reward in each episode, so after a run you can export the per-episode reward data with nni. To export it, run:

nnictl experiment export [experiment_id] --filename [file_path] --type json --intermediate
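Once the JSON file is exported, the per-episode rewards can be read back with a few lines of Python. The sketch below makes assumptions about the file layout: it assumes a JSON list of trial records, each holding its intermediate results under a key named `intermediate`, with values that are plain numbers or JSON-encoded numbers. Check your own export and adjust the `key` argument if it differs. The helper names `load_intermediate_rewards` and `mean_curve` are illustrative and not part of this repository.

```python
import json


def _to_float(value):
    """Convert a number, or a JSON-encoded number string, to float."""
    if isinstance(value, str):
        value = json.loads(value)
    return float(value)


def load_intermediate_rewards(path, key="intermediate"):
    """Return one list of per-episode rewards for each exported trial.

    Assumes the file is a JSON list of records and that `key` maps to the
    list of intermediate results (an assumption about the export layout).
    """
    with open(path, encoding="utf-8") as f:
        records = json.load(f)
    return [[_to_float(v) for v in record.get(key, [])] for record in records]


def mean_curve(curves):
    """Average the i-th reward over all trials that reached episode i."""
    length = max((len(c) for c in curves), default=0)
    averaged = []
    for i in range(length):
        column = [c[i] for c in curves if len(c) > i]
        averaged.append(sum(column) / len(column))
    return averaged
```

For example, `mean_curve(load_intermediate_rewards("rewards.json"))` would give the reward curve averaged over trials, where `rewards.json` stands for whatever path you passed to `--filename`.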

If you have any questions, please contact us at [email protected] or submit an issue in this repository.
