Skip to main content
SHARE
Publication

Wootz: a compiler-based framework for fast CNN pruning via composability...

by Hui Guan, Xipeng Shen, Seung-hwan Lim
Publication Type
Conference Paper
Journal Name
ACM SIGPLAN Conference on Programming Language Design and Implementation
Publication Date
Page Numbers
717 to 730
Volume
40
Issue
1
Conference Name
ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI 2019)
Conference Location
Phoenix, Arizona, United States of America
Conference Sponsor
ACM
Conference Date
-

Convolutional Neural Networks (CNN) are widely used for Deep Learning tasks. CNN pruning is an important method to adapt a large CNN model trained on general datasets to fit a more specialized task or a smaller device. The key challenge is on deciding which filters to remove in order to maximize the quality of the pruned networks while satisfying the constraints. It is time-consuming due to the enormous configuration space and the slowness of CNN training.

The problem has drawn many efforts from the machine learning field, which try to reduce the set of network configurations to explore. This work tackles the problem distinctively from a programming systems perspective, trying to speed up the evaluations of the remaining configurations through computation reuse via a compiler-based framework. We empirically uncover the existence of composability in the training of a collection of pruned CNN models, and point out the opportunities for computation reuse. We then propose composability-based CNN pruning, and design a compression-based algorithm to efficiently identify the set of CNN layers to pre-train for maximizing their reuse benefits in CNN pruning. We further develop a compiler-based framework named Wootz, which, for an arbitrary CNN, automatically generates code that builds a Teacher-Student scheme to materialize composability-based pruning. Experiments show that network pruning enabled by Wootz shortens the state-of-art pruning process by up to 186X while producing significantly better pruning results.