Abstract
Convolutional Neural Networks (CNN) have recently demonstrated effective performance in many Natural Language Processing tasks. In this study, we explore a novel approach for pruning a CNN's convolution filters using our new data-driven utility score. We have applied this technique to an information extraction task of classifying a dataset of cancer pathology reports by cancer type, a highly imbalanced dataset. Compared to standard CNN training, our new algorithm resulted in a nearly .07 increase in the micro-averaged F1-score and a strong .22 increase in the macro-averaged F1-score using a model with nearly a third fewer network weights. We show how directly utilizing a network's interpretation of data can result in strong performance gains, particularly with severely imbalanced datasets.