A CNN Compression Methodology for Layer-Wise Rank Selection Considering Inter-Layer Interactions

Milad Kokhazadeh, Georgios Keramidas, Vasilios Kelefouras, Iakovos Stamoulis

Research output: Chapter in Book/Report/Conference proceeding › Conference proceedings published in a book › peer-review

Abstract

Convolutional Neural Networks (CNNs) achieve
state-of-the-art performance across various application domains
but are often resource-intensive, limiting their use on resource-constrained devices. Low-rank factorization (LRF) has emerged as
a promising technique to reduce the computational complexity and
memory footprint of CNNs, enabling efficient deployment without
significant performance loss. However, challenges still remain
in optimizing the rank selection problem, balancing memory
reduction and accuracy, and integrating LRF into the training
process of CNNs. In this paper, a novel and generic methodology
for layer-wise rank selection is presented, considering inter-layer
interactions. Our approach is compatible with any decomposition
method and does not require additional retraining. The proposed
methodology is evaluated on thirteen widely used CNN models,
significantly reducing model parameters and Floating-Point Operations (FLOPs). In particular, our approach achieves up to a
94.6% parameter reduction (82.3% on average) and up to 90.7%
FLOPs reduction (59.6% on average), with less than a 1.5% drop
in validation accuracy, demonstrating superior performance and
scalability compared to existing techniques.
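
As a rough illustration of what low-rank factorization does to a single layer, the sketch below approximates one weight matrix with a truncated SVD and reports the resulting parameter saving. It is a generic textbook example, not the rank-selection methodology of the paper: the function names (low_rank_factorize, param_reduction), the matrix shape, and the chosen rank are assumptions made for illustration only.

```python
import numpy as np

def low_rank_factorize(W, rank):
    """Approximate W (m x n) by A @ B using a truncated SVD.

    A has shape (m, rank) and B has shape (rank, n).
    Hypothetical helper for illustration; not taken from the paper.
    """
    U, s, Vt = np.linalg.svd(W, full_matrices=False)
    A = U[:, :rank] * s[:rank]  # fold the singular values into the left factor
    B = Vt[:rank, :]
    return A, B

def param_reduction(m, n, rank):
    """Fraction of parameters saved when m*n weights become rank*(m+n)."""
    return 1.0 - rank * (m + n) / (m * n)

# Assumed example: a random 256 x 512 weight matrix factorized at rank 32.
rng = np.random.default_rng(0)
W = rng.standard_normal((256, 512))
A, B = low_rank_factorize(W, rank=32)

rel_err = np.linalg.norm(W - A @ B) / np.linalg.norm(W)
print(A.shape, B.shape)
print(f"parameter reduction: {param_reduction(256, 512, 32):.1%}")
print(f"relative reconstruction error: {rel_err:.3f}")
```

A smaller rank saves more parameters but increases the reconstruction error. Choosing that rank per layer is the rank selection problem the abstract refers to. Trained CNN weights are typically much closer to low rank than the random matrix used here, so the error printed by this sketch is pessimistic.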
Original language: English
Title of host publication: DATE 2025 Conference
Publication status: Published - 2 Apr 2025
Event: DATE 25 (Design, Automation and Test in Europe) - Lyon, France
Duration: 31 Mar 2025 → 2 Apr 2025

Conference

Conference: DATE 25 (Design, Automation and Test in Europe)
Country/Territory: France
City: Lyon
Period: 31/03/25 → 2/04/25
