This is my Thesis in the direction of Visual Object Tracking.
SiamFusion architecture
-
Prerequisites: The project was built using python 3.6 and tested on Ubuntu 18.04 and 16.04. It was tested on a GTX 1080 Ti. Furthermore it requires PyTorch 4.1.
-
Download the GOT-10k Dataset in http://got-10k.aitestunion.com/downloads and extract it on the folder of your choice, in my case it is
/home/arbi/desktop/GOT-10k
(OBS: data reading is done in execution time, so if available extract the dataset in your SSD partition).
-
Download the ImageNet VID Dataset in http://bvisionweb1.cs.unc.edu/ILSVRC2017/download-videos-1p39.php and extract it on the folder of your choice (OBS: data reading is done in execution time, so if available extract the dataset in your SSD partition). You can get rid of the test part of the dataset, since it has no Annotations.
-
In config.py script
root_dir_for_GOT_10k
,root_dir_for_VID and
androot_dir_for_OTB
change to your directory.
root_dir_for_GOT_10k = '/home/arbi/desktop/GOT-10k' <-- change to your directory
root_dir_for_VID = '/home/arbi/desktop/VID' <-- change to your directory
root_dir_for_OTB = '/home/arbi/desktop/OTB2015' <-- change to your directory
- Run the train.py script:
python3 train.py
- Download pretrained
model_e31.pth
from Yandex Disk, and put the file undermodel/model_e31.pth
.
- Run the test.py script:
python3 test.py
OTB2015
Results on each epoch