arXiv:1907.05013 Abstract | arXiv Analytics

arXiv:1907.05013 [cs.LG]Abstract References Reviews Resources

Profiling based Out-of-core Hybrid Method for Large Neural Networks

Yuki Ito, Haruki Imai, Tung Le Duc, Yasushi Negishi, Kiyokuni Kawachiya, Ryo Matsumiya, Toshio Endo

Published 2019-07-11Version 1

GPUs are widely used to accelerate deep learning with NNs (NNs). On the other hand, since GPU memory capacity is limited, it is difficult to implement efficient programs that compute large NNs on GPU. To compute NNs exceeding GPU memory capacity, data-swapping method and recomputing method have been proposed in existing work. However, in these methods, performance overhead occurs due to data movement or increase of computation. In order to reduce the overhead, it is important to consider characteristics of each layer such as sizes and cost for recomputation. Based on this direction, we proposed Profiling based out-of-core Hybrid method (PoocH). PoocH determines target layers of swapping or recomputing based on runtime profiling. We implemented PoocH by extending a deep learning framework, Chainer, and we evaluated its performance. With PoocH, we successfully computed an NN requiring 50 GB memory on a single GPU with 16 GB memory. Compared with in-core cases, performance degradation was 38 \% on x86 machine and 28 \% on POWER9 machine.

Comments: 15 pages

Categories: cs.LG, cs.DC, cs.PF

Keywords: out-of-core hybrid method, large neural networks, pooch determines target layers, nns exceeding gpu memory capacity, gb memory

Related articles: Most relevant | Search more

arXiv:2402.15441 [cs.LG] (Published 2024-02-13, updated 2024-03-12)

Active Few-Shot Fine-Tuning

Jonas Hübotter, Bhavya Sukhija, Lenart Treven, Yarden As, Andreas Krause

arXiv:1112.3712 [cs.LG] (Published 2011-12-16)

Analysis and Extension of Arc-Cosine Kernels for Large Margin Classification

Youngmin Cho, Lawrence K. Saul

arXiv:2402.17641 [cs.LG] (Published 2024-02-27, updated 2024-06-06)

Variational Learning is Effective for Large Deep Networks

Yuesong Shen et al.

arXiv Analytics

arXiv:1907.05013 [cs.LG]Abstract References Reviews Resources

Profiling based Out-of-core Hybrid Method for Large Neural Networks

Links

Toolbox

arXiv:1907.05013 [cs.LG]AbstractReferencesReviewsResources

Profiling based Out-of-core Hybrid Method for Large Neural Networks

Links

Toolbox

arXiv:1907.05013 [cs.LG]Abstract References Reviews Resources