Convolutional neural network (CNN) becomes more and more popular as it has demonstrated great success in many visual content-oriented applications such as computer vision and image/video processing. Inevitably, massive computation resources and storage spaces are required for the hardware implementations of CNN. While “model compression” can significantly reduce parameters (e.g., weights) used in CNN so as corresponding computations by suitable pruning, this paper proposes a pruning approach based on statistical analysis of weight distribution. Experimental results show that substantial redundant weights of a demonstrated CNN were effectively removed through carefully tracing weight distributions for each layer. Meanwhile, saving weight is expected to save up to 30% of storage space and computation complexity.