Soft error characterization on scientific applications

Zuhal Ozturk, Haluk Rahmi Topcuoglu, Sanem Arslan, Mahmut Taylan Kandemir

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Decreasing transistor sizes, aggressive power optimization techniques and higher operation frequencies lead to increase error rates. While researchers addressed reliable computing, there is still lack of study for providing the fundamental understanding of error propagation. In this work, we characterize error propagation at software level by utilizing error propagation speed metric. It is validated on a set of commonly used iterative solvers, where the speed of error propagation is modeled for the different input-output pairs. Additionally, we study two different methods for slowing down the error propagation. Firstly, the effect of various algorithmic choices of sorting on error propagation profiles is examined whether such choices have an impact on error propagation profiles. Experimental results show that sorting algorithms differ in error propagation patterns. Secondly, different loop transformation techniques are considered for slowing down the error propagation speed. Specifically, while loop tiling causes a significant change in error propagation, the impact of loop unrolling is negligible for the given applications.

Original languageEnglish (US)
Title of host publicationProceedings - IEEE 16th International Conference on Dependable, Autonomic and Secure Computing, IEEE 16th International Conference on Pervasive Intelligence and Computing, IEEE 4th International Conference on Big Data Intelligence and Computing and IEEE 3rd Cyber Science and Technology Congress, DASC-PICom-DataCom-CyberSciTec 2018
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages592-599
Number of pages8
ISBN (Electronic)9781538675182
DOIs
StatePublished - Oct 26 2018
Event16th IEEE International Conference on Dependable, Autonomic and Secure Computing, IEEE 16th International Conference on Pervasive Intelligence and Computing, IEEE 4th International Conference on Big Data Intelligence and Computing and IEEE 3rd Cyber Science and Technology Congress, DASC-PICom-DataCom-CyberSciTec 2018 - Athens, Greece
Duration: Aug 12 2018Aug 15 2018

Publication series

NameProceedings - IEEE 16th International Conference on Dependable, Autonomic and Secure Computing, IEEE 16th International Conference on Pervasive Intelligence and Computing, IEEE 4th International Conference on Big Data Intelligence and Computing and IEEE 3rd Cyber Science and Technology Congress, DASC-PICom-DataCom-CyberSciTec 2018

Other

Other16th IEEE International Conference on Dependable, Autonomic and Secure Computing, IEEE 16th International Conference on Pervasive Intelligence and Computing, IEEE 4th International Conference on Big Data Intelligence and Computing and IEEE 3rd Cyber Science and Technology Congress, DASC-PICom-DataCom-CyberSciTec 2018
CountryGreece
CityAthens
Period8/12/188/15/18

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications
  • Information Systems
  • Artificial Intelligence
  • Information Systems and Management
  • Safety, Risk, Reliability and Quality
  • Control and Optimization

Fingerprint Dive into the research topics of 'Soft error characterization on scientific applications'. Together they form a unique fingerprint.

Cite this