Energy-aware computation duplication for improving reliability in embedded chip multiprocessors

G. Chen, Mahmut Kandemir, F. Li

Research output: Chapter in Book/Report/Conference proceedingConference contribution

8 Citations (Scopus)

Abstract

Compilers designed for current embedded systems must be capable of addressing multiple constraints such as low power, high performance, small memory footprint and form factor, and high reliability at the same time. In particular, optimizing for one constraint should be performed carefully, considering its impact on other constraints. Recent trends indicate that transient errors are becoming increasingly important in embedded systems. Focusing on an embedded chip multiprocessor and array-intensive applications, this paper demonstrates how reliability against transient errors can be improved without impacting execution time by utilizing idle processors for duplicating some of the computations of the active processors. It also shows how a balance between power savings and reliability improvement can be struck using a metric called the energy-delay-fallibility product. Our experimental results indicate that the "percentage of duplicated computations" is a useful high-level metric for studying the tradeoffs among performance, power, and reliability.

Original languageEnglish (US)
Title of host publicationProceedings of the ASP-DAC 2006
Subtitle of host publicationAsia and South Pacific Design Automation Conference 2006
Pages134-139
Number of pages6
StatePublished - Sep 19 2006
EventASP-DAC 2006: Asia and South Pacific Design Automation Conference 2006 - Yokohama, Japan
Duration: Jan 24 2006Jan 27 2006

Publication series

NameProceedings of the Asia and South Pacific Design Automation Conference, ASP-DAC
Volume2006

Other

OtherASP-DAC 2006: Asia and South Pacific Design Automation Conference 2006
CountryJapan
CityYokohama
Period1/24/061/27/06

Fingerprint

Embedded systems
Data storage equipment

All Science Journal Classification (ASJC) codes

  • Computer Science Applications
  • Computer Graphics and Computer-Aided Design
  • Electrical and Electronic Engineering

Cite this

Chen, G., Kandemir, M., & Li, F. (2006). Energy-aware computation duplication for improving reliability in embedded chip multiprocessors. In Proceedings of the ASP-DAC 2006: Asia and South Pacific Design Automation Conference 2006 (pp. 134-139). [1594671] (Proceedings of the Asia and South Pacific Design Automation Conference, ASP-DAC; Vol. 2006).
Chen, G. ; Kandemir, Mahmut ; Li, F. / Energy-aware computation duplication for improving reliability in embedded chip multiprocessors. Proceedings of the ASP-DAC 2006: Asia and South Pacific Design Automation Conference 2006. 2006. pp. 134-139 (Proceedings of the Asia and South Pacific Design Automation Conference, ASP-DAC).
@inproceedings{a374a77c10684958a0e27268fce38ef1,
title = "Energy-aware computation duplication for improving reliability in embedded chip multiprocessors",
abstract = "Compilers designed for current embedded systems must be capable of addressing multiple constraints such as low power, high performance, small memory footprint and form factor, and high reliability at the same time. In particular, optimizing for one constraint should be performed carefully, considering its impact on other constraints. Recent trends indicate that transient errors are becoming increasingly important in embedded systems. Focusing on an embedded chip multiprocessor and array-intensive applications, this paper demonstrates how reliability against transient errors can be improved without impacting execution time by utilizing idle processors for duplicating some of the computations of the active processors. It also shows how a balance between power savings and reliability improvement can be struck using a metric called the energy-delay-fallibility product. Our experimental results indicate that the {"}percentage of duplicated computations{"} is a useful high-level metric for studying the tradeoffs among performance, power, and reliability.",
author = "G. Chen and Mahmut Kandemir and F. Li",
year = "2006",
month = "9",
day = "19",
language = "English (US)",
isbn = "0780394518",
series = "Proceedings of the Asia and South Pacific Design Automation Conference, ASP-DAC",
pages = "134--139",
booktitle = "Proceedings of the ASP-DAC 2006",

}

Chen, G, Kandemir, M & Li, F 2006, Energy-aware computation duplication for improving reliability in embedded chip multiprocessors. in Proceedings of the ASP-DAC 2006: Asia and South Pacific Design Automation Conference 2006., 1594671, Proceedings of the Asia and South Pacific Design Automation Conference, ASP-DAC, vol. 2006, pp. 134-139, ASP-DAC 2006: Asia and South Pacific Design Automation Conference 2006, Yokohama, Japan, 1/24/06.

Energy-aware computation duplication for improving reliability in embedded chip multiprocessors. / Chen, G.; Kandemir, Mahmut; Li, F.

Proceedings of the ASP-DAC 2006: Asia and South Pacific Design Automation Conference 2006. 2006. p. 134-139 1594671 (Proceedings of the Asia and South Pacific Design Automation Conference, ASP-DAC; Vol. 2006).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

TY - GEN

T1 - Energy-aware computation duplication for improving reliability in embedded chip multiprocessors

AU - Chen, G.

AU - Kandemir, Mahmut

AU - Li, F.

PY - 2006/9/19

Y1 - 2006/9/19

N2 - Compilers designed for current embedded systems must be capable of addressing multiple constraints such as low power, high performance, small memory footprint and form factor, and high reliability at the same time. In particular, optimizing for one constraint should be performed carefully, considering its impact on other constraints. Recent trends indicate that transient errors are becoming increasingly important in embedded systems. Focusing on an embedded chip multiprocessor and array-intensive applications, this paper demonstrates how reliability against transient errors can be improved without impacting execution time by utilizing idle processors for duplicating some of the computations of the active processors. It also shows how a balance between power savings and reliability improvement can be struck using a metric called the energy-delay-fallibility product. Our experimental results indicate that the "percentage of duplicated computations" is a useful high-level metric for studying the tradeoffs among performance, power, and reliability.

AB - Compilers designed for current embedded systems must be capable of addressing multiple constraints such as low power, high performance, small memory footprint and form factor, and high reliability at the same time. In particular, optimizing for one constraint should be performed carefully, considering its impact on other constraints. Recent trends indicate that transient errors are becoming increasingly important in embedded systems. Focusing on an embedded chip multiprocessor and array-intensive applications, this paper demonstrates how reliability against transient errors can be improved without impacting execution time by utilizing idle processors for duplicating some of the computations of the active processors. It also shows how a balance between power savings and reliability improvement can be struck using a metric called the energy-delay-fallibility product. Our experimental results indicate that the "percentage of duplicated computations" is a useful high-level metric for studying the tradeoffs among performance, power, and reliability.

UR - http://www.scopus.com/inward/record.url?scp=33748600383&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=33748600383&partnerID=8YFLogxK

M3 - Conference contribution

SN - 0780394518

SN - 9780780394513

T3 - Proceedings of the Asia and South Pacific Design Automation Conference, ASP-DAC

SP - 134

EP - 139

BT - Proceedings of the ASP-DAC 2006

ER -

Chen G, Kandemir M, Li F. Energy-aware computation duplication for improving reliability in embedded chip multiprocessors. In Proceedings of the ASP-DAC 2006: Asia and South Pacific Design Automation Conference 2006. 2006. p. 134-139. 1594671. (Proceedings of the Asia and South Pacific Design Automation Conference, ASP-DAC).