We release the code and data for the following paper. If you use these datasets, or refer to its results, please cite:
Siqi Wu, Marian-Andrei Rizoiu, and Lexing Xie. Estimating Attention Flow in Online Video Networks. ACM Conference on Computer-Supported Cooperative Work and Social Computing (CSCW), 2019. [paper|slides|blog]
We provide three quickstart bash scripts:
Download and place data in the data directory, then uncompress them.
First run run_all_wrangling.sh
to create formatted data, then run run_all_temporal_analysis.sh
to conduct the temporal analysis or run_all_predictors.sh
to reproduce the results of prediction tasks.
Detailed usage and running time are documented in the corresponding python scripts.
Note the datasets are large, so the quickstart scripts will take up to 24 hours to finish. Check the estimated running time in each python script before you run the quickstart scripts.
All codes are developed and tested in Python 3.6.7, along with NumPy 1.14.5, matplotlib 3.0.3 and SciPy 1.2.1.
The data is hosted on Google Drive and Dataverse. See more details in this data description.