Evaluating built-in ECC of FPGA on-chip memories for the mitigation of undervolting faults

dc.contributor
Universitat Politècnica de Catalunya. Doctorat en Arquitectura de Computadors
dc.contributor
Universitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors
dc.contributor
Barcelona Supercomputing Center
dc.contributor
Universitat Politècnica de Catalunya. CAP - Grup de Computació d'Altes Prestacions
dc.contributor.author
Salami, Behzad
dc.contributor.author
Unsal, Osman Sabri
dc.contributor.author
Cristal Kestelman, Adrián
dc.date.issued
2019
dc.identifier
Salami, B.; Unsal, O.; Cristal, A. Evaluating built-in ECC of FPGA on-chip memories for the mitigation of undervolting faults. A: Euromicro International Conference on Parallel, Distributed, and Network-Based Processing. "27th Euromicro International Conference on Parallel, Distributed and Network-Based Processing, PDP 2019: Pavia, Italy 13-15 February 2019: proceedings". Institute of Electrical and Electronics Engineers (IEEE), 2019, p. 242-246.
dc.identifier
978-1-7281-1644-0
dc.identifier
https://hdl.handle.net/2117/188788
dc.identifier
10.1109/EMPDP.2019.8671543
dc.description.abstract
Voltage underscaling below the nominal level is an effective solution for improving energy efficiency in digital circuits, e.g., Field Programmable Gate Arrays (FPGAs). However, further undervolting below a safe voltage level and without accompanying frequency scaling leads to timing related faults, potentially undermining the energy savings. Through experimental voltage underscaling studies on commercial FPGAs, we observed that the rate of these faults exponentially increases for on-chip memories, or Block RAMs (BRAMs). To mitigate these faults, we evaluated the efficiency of the built-in Error-Correction Code (ECC) and observed that more than 90 % of the faults are correctable and further 7 % are detectable (but not correctable). This efficiency is the result of the single-bit type of these faults, which are then effectively covered by the Single-Error Correction and Double-Error Detection (SECDED) design of the built-in ECC. Finally, motivated by the above experimental observations, we evaluated an FPGA-based Neural Network (NN) accelerator under low-voltage operations, while built-in ECC is leveraged to mitigate undervolting faults and thus, prevent NN significant accuracy loss. In consequence, we achieve 40 % of the BRAM power saving through undervolting below the minimum safe voltage level, with a negligible NN accuracy loss, thanks to the substantial fault coverage by the built-in ECC.
dc.description.abstract
The research leading to these results hasreceived funding from the European Union’s Horizon 2020 Programme under the LEGaTO Project (www.legato-project.eu),grantagreement n 780681.
dc.description.abstract
Peer Reviewed
dc.description.abstract
Postprint (author's final draft)
dc.format
5 p.
dc.format
application/pdf
dc.language
eng
dc.publisher
Institute of Electrical and Electronics Engineers (IEEE)
dc.relation
https://ieeexplore.ieee.org/document/8671543/
dc.relation
info:eu-repo/grantAgreement/EC/H2020/780681/EU/Low Energy Toolset for Heterogeneous Computing/LEGaTO
dc.rights
Open Access
dc.subject
Àrees temàtiques de la UPC::Enginyeria electrònica::Microelectrònica::Circuits integrats
dc.subject
Field programmable gate arrays
dc.subject
Error-correcting codes (Information theory)
dc.subject
Logic design
dc.subject
Circuit faults
dc.subject
Random access memory
dc.subject
Power demand
dc.subject
Artificial neural networks
dc.subject
Computer crashes
dc.subject
Matrius de portes programables per l'usuari
dc.subject
Codis correctors d'errors (Teoria de la informació)
dc.subject
Estructura lògica
dc.title
Evaluating built-in ECC of FPGA on-chip memories for the mitigation of undervolting faults
dc.type
Conference report


Ficheros en el ítem

FicherosTamañoFormatoVer

No hay ficheros asociados a este ítem.

Este ítem aparece en la(s) siguiente(s) colección(ones)

E-prints [73026]