Custom deep learning loop takes more memory than using trainNetwork()?

Question

0 votes

Hi,

I followed the instructions from the link below to create a custom training loop for a U-Net architecture.

https://www.mathworks.com/help//deeplearning/ug/train-network-using-custom-training-loop.html

With the same network architecture and the same "multi-gpu" setting (I have two RTX 2060 GPUs), I found that the custom training loop can handle a mini-batch size of at most 4, whereas the built-in trainNetwork() function can handle a mini-batch size of up to 16.

Is it normal for a custom training loop to use more GPU memory than trainNetwork()?

Thanks!

1 Comment

Sara Ahmed on 28 Oct 2020

Same here :(

Answer 1

Shashank Gupta on 28 Oct 2020

0 votes

Yes, this is expected behaviour. A custom training loop uses some extra memory, whereas the built-in trainNetwork function is heavily optimised. The more custom code the loop contains, the less efficient it tends to be, and the more GPU memory it uses. Nevertheless, you can optimise the custom training loop, but even then we cannot be sure that it is as well optimised as trainNetwork.

I hope this clears up some of your confusion.
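To compare the two approaches at the same mini-batch size, it can help to look at how much GPU memory is free before and after a step. The snippet below is only a rough sketch: the image size and batch size are made-up placeholders (not values from this thread), and the reading is approximate because MATLAB may pool or reuse GPU memory. It assumes Parallel Computing Toolbox and a supported GPU.

```matlab
% Sketch: report free GPU memory around a placeholder allocation.
% All sizes below are assumptions chosen for illustration only.
if canUseGPU
    g = gpuDevice;                      % currently selected GPU
    freeBefore = g.AvailableMemory;     % bytes free before the allocation

    % Placeholder workload: one mini-batch of 16 single-precision
    % 256-by-256 RGB images. Replace with one iteration of your loop.
    X = gpuArray(zeros(256, 256, 3, 16, "single"));

    wait(g);                            % let pending GPU work finish
    usedMB = (freeBefore - g.AvailableMemory) / 2^20;
    fprintf("Mini-batch used about %.1f MB of GPU memory\n", usedMB);
else
    disp("No supported GPU found; skipping the memory check.");
end
```

Running the same check around one iteration of the custom loop, for a few mini-batch sizes, should give a rough idea of where the memory goes.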

3 Comments

Shashank Gupta on 30 Oct 2020

Hey Qiao,

Have a look at this link; it might enable you to use parallel capabilities in the custom training loop.

Currently, there is no specific reference on optimising custom training loops, because it is hard to generalise and come up with a documented reference. These jobs are quite subjective and depend on what you want to implement. Nevertheless, some suggestions: look for dlarray-capable functions for fast computation, prefer built-in MATLAB functions over your own implementations, and write as little code as necessary.
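As an illustration of those suggestions, here is a minimal sketch of a custom loop that relies only on built-in dlarray-capable functions (forward, crossentropy, dlgradient, adamupdate). The tiny network, the random data, and the names modelLoss, numObs and lossValue are placeholders invented for this example; it is not the U-Net from the question and is not claimed to match what trainNetwork does internally.

```matlab
% Tiny stand-in network (assumption; the thread's U-Net is not shown here).
layers = [
    featureInputLayer(4)
    fullyConnectedLayer(3)
    softmaxLayer];
net = dlnetwork(layerGraph(layers));

% Random placeholder data: 8 observations, 4 features, 3 classes.
numObs = 8;
X = dlarray(rand(4, numObs, "single"), "CB");
labels = randi(3, 1, numObs);
T = zeros(3, numObs, "single");
T(sub2ind(size(T), labels, 1:numObs)) = 1;   % one-hot targets

avgGrad = [];
avgSqGrad = [];
for iteration = 1:5
    % dlfeval enables tracing so that dlgradient can differentiate the loss.
    [loss, gradients] = dlfeval(@modelLoss, net, X, T);

    % Built-in Adam update instead of a hand-written update rule.
    [net, avgGrad, avgSqGrad] = adamupdate(net, gradients, ...
        avgGrad, avgSqGrad, iteration);

    % Keep only a plain number for logging.
    lossValue = double(extractdata(loss));
    fprintf("Iteration %d: loss = %.4f\n", iteration, lossValue);
end

function [loss, gradients] = modelLoss(net, X, T)
    Y = forward(net, X);                          % built-in forward pass
    loss = crossentropy(Y, T);                    % built-in dlarray loss
    gradients = dlgradient(loss, net.Learnables); % gradients of learnables
end
```

The loop keeps only a scalar loss value between iterations and lets the built-in functions handle the rest, which is one way to follow the "as little code as necessary" advice.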

Qiao Hu on 31 Oct 2020

Thanks a bunch!
