Optimizing Temporal Convolutional Network Inference on FPGA-Based Accelerators

Raffo L.; Meloni P.
2020-01-01

Abstract

Convolutional Neural Networks (CNNs) are extensively used in a wide range of applications, most commonly computer vision tasks such as image and video classification, recognition, and segmentation. Recent research results demonstrate that multi-layer (deep) networks involving one-dimensional convolutions and dilation can be effectively used for classification and segmentation of time series and sequences, as well as in tasks involving sequence modeling. These structures, commonly referred to as Temporal Convolutional Networks (TCNs), represent an extremely promising alternative to recurrent architectures, which are commonly used across a broad range of sequence modeling tasks. While FPGA-based inference accelerators for classic CNNs are widespread, the literature lacks a quantitative evaluation of their usability for inference on TCN models. In this paper we present such an evaluation, considering a CNN accelerator with specific features supporting TCN kernels as a reference and a set of state-of-the-art TCNs as a benchmark. Experimental results show that, during TCN execution, operational intensity can be critical for the overall performance. We propose a convolution scheduling based on batch processing that can boost efficiency up to 96% of theoretical peak performance. Overall we can achieve up to 111.8 GOPS/s and a power efficiency of 33.8 GOPS/s/W on an Ultrascale+ ZU3EG (up to 10× speedup and 3× power efficiency improvement with respect to a pure software implementation).
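The link the abstract draws between operational intensity and batch processing can be illustrated with a rough calculation. The sketch below is not the accelerator's actual scheduling or memory model. It assumes a simplified traffic model in which the weights of a 1D convolution layer are fetched once per batch while activations are moved once per sample. The function names, layer sizes, and the 2-byte value width are hypothetical and chosen only for illustration.

```python
def conv1d_ops(seq_len, c_in, c_out, kernel_size):
    """Operations for one 1D convolution layer on one sample.

    Each multiply-accumulate is counted as two operations. Dilation does not
    change this count when the output length equals seq_len.
    """
    return 2 * seq_len * c_in * c_out * kernel_size


def operational_intensity(batch, seq_len, c_in, c_out, kernel_size,
                          bytes_per_value=2):
    """Operations per byte of off-chip traffic under an ASSUMED model:
    weights are fetched once per batch, while input and output activations
    are transferred once per sample.
    """
    ops = batch * conv1d_ops(seq_len, c_in, c_out, kernel_size)
    weight_bytes = c_in * c_out * kernel_size * bytes_per_value
    activation_bytes = batch * seq_len * (c_in + c_out) * bytes_per_value
    return ops / (weight_bytes + activation_bytes)


if __name__ == "__main__":
    # Hypothetical layer: short sequence, 64 input and 64 output channels.
    for batch in (1, 4, 16, 64):
        oi = operational_intensity(batch, seq_len=32, c_in=64, c_out=64,
                                   kernel_size=3)
        print(f"batch={batch:3d}  ops/byte={oi:.1f}")
```

Under these assumptions, weight traffic is a large share of the total when sequences are short, so amortizing it over a batch raises the operations performed per byte transferred. The figures this sketch prints are illustrative and are not results from the paper.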
2020
English
10
3
9159637
348
361
14
Anonymous experts
international
scientific
embedded systems; FPGA; hardware accelerator; TCN; Temporal convolutional network
Carreras, M.; Deriu, G.; Raffo, L.; Benini, L.; Meloni, P.
1.1 Journal article
info:eu-repo/semantics/article
1 Journal contribution::1.1 Journal article
262
5
partially_open
Files in This Item:
Temporal_postprint_copertina.pdf

open access

Type: Author's Accepted Manuscript (AAM), post-print (version accepted by the publisher)
Size 935.35 kB
Format Adobe PDF
Editoriale_Optimizing_Temporal_Convolutional_Network_Inference_on_FPGA-Based_Accelerators.pdf

Archive managers only

Type: publisher's version
Size 3.95 MB
Format Adobe PDF

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
