TY - GEN
T1 - Fast and Robust Compression of Deep Convolutional Neural Networks
AU - Wen, Jia
AU - Yang, Liu
AU - Shen, Chenyang
N1 - Publisher Copyright:
© 2020, Springer Nature Switzerland AG.
PY - 2020
Y1 - 2020
N2 - Deep convolutional neural networks (CNNs) currently demonstrate state-of-the-art performance in several domains. However, commonly used CNN models require large amounts of memory and computing resources, posing challenges for training and deployment, especially on devices with limited computational resources. Inspired by recent advances in random tensor decomposition, we introduce a Hierarchical Framework for Fast and Robust Compression (HFFRC), which significantly reduces the number of parameters needed to represent a convolution layer via a fast low-rank Tucker decomposition algorithm while preserving its expressive power. By virtue of the randomized algorithm, the proposed compression framework is robust to noise in the parameters. In addition, it is a general framework into which any tensor decomposition method can be easily adopted. The efficiency and effectiveness of the proposed approach have been demonstrated via comprehensive experiments on the benchmark CIFAR-10 and CIFAR-100 image classification datasets.
AB - Deep convolutional neural networks (CNNs) currently demonstrate state-of-the-art performance in several domains. However, commonly used CNN models require large amounts of memory and computing resources, posing challenges for training and deployment, especially on devices with limited computational resources. Inspired by recent advances in random tensor decomposition, we introduce a Hierarchical Framework for Fast and Robust Compression (HFFRC), which significantly reduces the number of parameters needed to represent a convolution layer via a fast low-rank Tucker decomposition algorithm while preserving its expressive power. By virtue of the randomized algorithm, the proposed compression framework is robust to noise in the parameters. In addition, it is a general framework into which any tensor decomposition method can be easily adopted. The efficiency and effectiveness of the proposed approach have been demonstrated via comprehensive experiments on the benchmark CIFAR-10 and CIFAR-100 image classification datasets.
KW - Deep convolutional neural networks
KW - Model compression
KW - Random Tucker decomposition
UR - http://www.scopus.com/inward/record.url?scp=85094100660&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85094100660&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-61616-8_5
DO - 10.1007/978-3-030-61616-8_5
M3 - Conference contribution
AN - SCOPUS:85094100660
SN - 9783030616151
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 52
EP - 63
BT - Artificial Neural Networks and Machine Learning – ICANN 2020 – 29th International Conference on Artificial Neural Networks, Proceedings
A2 - Farkaš, Igor
A2 - Masulli, Paolo
A2 - Wermter, Stefan
PB - Springer Science and Business Media Deutschland GmbH
T2 - 29th International Conference on Artificial Neural Networks, ICANN 2020
Y2 - 15 September 2020 through 18 September 2020
ER -