Skip to main navigation Skip to search Skip to main content

A Practitioner’s Guide to Software-based Soft-Error Mitigation Using AN-Codes

  • Martin Hoffmann
  • , Peter Ulbrich
  • , Christian Dietrich
  • , Horst Schirmeier
  • , Daniel Lohmann
  • , Wolfgang Schroder-Preikschat

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Abstract

Arithmetic error coding schemes (AN codes) are a well known and effective technique for soft error mitigation. Although coding theory being a rich area of mathematics, their implementation seems to be fairly easy. However, compliance with the theory can be lost easily while moving towards an actual implementation - finally jeopardizing the aspired fault-tolerance characteristics. In this paper, we present our experiences and lessons learned from implementing AN codes in the Cored dependable voter. We focus on the challenges and pitfalls in the transition from maths to machine code for a binary computer from a systems perspective. Our results show, that practical misconceptions (such as the use of prime numbers) and architecture-dependent implementation glitches occur on every stage of this transition. We identify typical pitfalls and describe practical measures to find and resolve them. Our measures eliminate all remaining SDCs in the Cored voter, which is validated by an extensive fault-injection campaign that covers 100 percent of the fault space for 1-bit and 2-bit errors.

Original languageEnglish
Title of host publication2014 IEEE 15th International Symposium on High-Assurance Systems Engineering
Pages33-40
Number of pages8
ISBN (Electronic)978-1-4799-3466-9, 978-1-4799-3465-2
DOIs
Publication statusPublished - 6 Mar 2014
Externally publishedYes
Event2014 IEEE 15th International Symposium on High-Assurance Systems Engineering, HASE 2014 - Miami, FL, United States
Duration: 9 Jan 201411 Jan 2014

Conference

Conference2014 IEEE 15th International Symposium on High-Assurance Systems Engineering, HASE 2014
Country/TerritoryUnited States
CityMiami, FL
Period9 Jan 201411 Jan 2014

Keywords

  • AN code
  • Arithmetic error coding
  • Fault injection
  • Redundancy
  • Soft errors
  • Software-based fault tolerance

ASJC Scopus subject areas

  • Software

Cite this