Multi-GPU Training #1

LcenArthas · 2019-03-09T01:05:26Z

Hi,
Have you tried to run training on multiple gpus?

JKBox · 2019-03-10T16:14:05Z

Hi,
Have you tried to run training on multiple gpus?

Thanks to your reminder, I wrote the code with single gpu, I will change it to multiple gpus later.

LcenArthas · 2019-03-12T11:08:48Z

I tried, but failed TAT.....,but i found this： ultralytics/yolov3#121 . I tried to fix the code,but failed. I hope it can help u :)

LcenArthas · 2019-03-12T11:36:51Z

by the way. i have fix the code follow by that url, and it can run in the multiple gpus, but it sooooo slow. So i think i have made the wrong code

longxianlei · 2019-03-17T03:01:03Z

os.environ["CUDA_VISIBLE_DEVICES"] = "4,5,6,7"
if torch.cuda.device_count() > 1: model = nn.DataParallel(model, device_ids=[0, 1, 2, 3]) model.to(device).train()
I have 8 GPUs. I set 4 of my device visiable. Then i use the model to parallel to these GPUs.
but when i run the train.py.
inter_area = torch.min(box1, box2).prod(2) RuntimeError: Expected object of type torch.cuda.FloatTensor but found type torch.FloatTensor for argument #2 'other'
Is the code didn't support multi GPU training now.

JKBox · 2019-03-17T07:43:33Z

os.environ["CUDA_VISIBLE_DEVICES"] = "4,5,6,7"
if torch.cuda.device_count() > 1: model = nn.DataParallel(model, device_ids=[0, 1, 2, 3]) model.to(device).train()
I have 8 GPUs. I set 4 of my device visiable. Then i use the model to parallel to these GPUs.
but when i run the train.py.
inter_area = torch.min(box1, box2).prod(2) RuntimeError: Expected object of type torch.cuda.FloatTensor but found type torch.FloatTensor for argument #2 'other'
Is the code didn't support multi GPU training now.

yes, the code only support single GPU training currently, I'll fix it later

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Multi-GPU Training #1

Multi-GPU Training #1

LcenArthas commented Mar 9, 2019

JKBox commented Mar 10, 2019

LcenArthas commented Mar 12, 2019

LcenArthas commented Mar 12, 2019

longxianlei commented Mar 17, 2019

JKBox commented Mar 17, 2019

Multi-GPU Training #1

Multi-GPU Training #1

Comments

LcenArthas commented Mar 9, 2019

JKBox commented Mar 10, 2019

LcenArthas commented Mar 12, 2019

LcenArthas commented Mar 12, 2019

longxianlei commented Mar 17, 2019

JKBox commented Mar 17, 2019