System Reliability Theory. Marvin Rausand
reliability are closely connected. Reliability may in some respects be considered to be a quality characteristic.
Life cycle costing. The life cycle cost (LCC) may be split into three types: (i) capital expenditure (CAPEX), (ii) operational expenditure (OPEX), and (iii) risk expenditure (RISKEX). The main links to reliability are with types (ii) and (iii). The OPEX is influenced by how regular the function/service is and the cost of maintenance. The RISKEX covers the cost related to accidents, system failures, and insurance. LCC is also called total ownership cost.
Production assurance. Failures in a production system lead to downtime and reduced production. To assure a regular production, the production system must have a high reliability. Production assurance is treated in the international standard ISO 20815 and discussed in Chapter 6.
Warranty planning. A warranty is a formal commitment to deliver reliable items. If failures and malfunctions are detected during a specified warranty period, the supplier has to repair and/or compensate the failure. Unreliable items may incur a high cost for the supplier.
Systems engineering. Reliability is one of the most important quality attributes of many technical systems. Reliability assurance is therefore an important topic during the systems engineering process. This is especially the case within the nuclear power, the aviation, the aerospace, the car, and the process industries.
Environmental protection. Reliability studies are used to improve the design and operational availability of many types of environmental protection systems. Many industries have realized that a main part of the pollution from their plants is caused by production irregularities and that consequently the reliability of the plant is an important factor in order to reduce pollution. Environmental risk analyses are carried out according to the procedure shown in Figure 1.3.
Technology qualification. Many customers require the producer of technical items to verify that the item satisfies the agreed requirements. The verification is carried out by following a technology qualification program (TQP) based on analysis and testing. This is especially the case within the aerospace, defense, and petroleum industries (e.g. see DNV‐RP‐A203 2011).
Applications related to reliability are illustrated in Figure 1.4.
Figure 1.3 Main steps of risk analysis, with main methods. The methods covered in this book are marked with
Figure 1.4 Reliability as basis of other applications.
1.3 Basic Reliability Concepts
The main concept of this book is reliability as defined in Definition 1.1. The aim of this section is to discuss and clarify this definition and to define related terms, such as maintainability and maintenance, availability, quality, and dependability.
It is important that all main words are defined in an unambiguous way. We fully agree with Kaplan (1990) who states: “When the words are used sloppily, concepts become fuzzy, thinking is muddled, communication is ambiguous, and decisions and actions are suboptimal.”
1.3.1 Reliability
Definition 1.1 says that reliability expresses “the ability of an item to perform as required in a stated operating context and for a stated period of time.” We start by clarifying the main words in this definition.
1 Reliability is defined by using the word ability, which is not directly measurable. A quantitative evaluation of the item's ability to perform must therefore be based on one or more metrics, called reliability metrics. Several probabilistic reliability metrics are defined and discussed in Section 1.4.
2 Some authors use the word capability instead of ability in the definition of reliability and claim that the term “capability” is more embracing, covering both ability and capacity. Most dictionaries list ability and capability as synonyms. We prefer the word “ability” because this is the word most commonly used.
3 The statement perform as required means that the item must be able to perform one or more specified functions according to the performance criteria for these function(s). Functions and performance criteria are discussed in Section 2.5.
4 Many items can perform a high number of functions. To assess the reliability (e.g. of a car), we must specify the required function(s) that are considered.
5 To be reliable, the item must do more than meet an initial factory performance or quality specification – it must operate satisfactorily for a specified period of time in the actual operating context.
6 The stated period of time may be a delimited time period, such as a mission time, the time of ownership, and several more.
7 The time may be measured by many different time concepts, such as calendar time, time in operation, number of work cycles, and so on. For vehicles, the time is often measured as the number of kilometers driven. For items that are not operated continuously in the same mode, a more complicated time concept may be needed.
Inherent and Actual Reliability
It may be useful to qualify the reliability of an item by adding a word, such as inherent or actual. The inherent reliability is defined as follows:
Definition 1.3 (Inherent reliability)
The reliability of the item as designed and manufactured, which excludes effects of operation, environment, and support conditions other than those assumed and stated in the item requirements and specification.
The inherent reliability is therefore the reliability of a brand new item that will be used and maintained exactly according to the conditions described in the item specification document or implicitly assumed. The inherent reliability is sometimes called built reliability or built‐in reliability of the item.
The design and development team always attempts to adapt the item to the actual operating context, but it is difficult, if not impossible, to account for all the aspects in practical use. The actual reliability may consequently be different from the inherent reliability that was determined before the item was put into use. The actual reliability of an item is defined as follows:
Definition 1.4 (Actual reliability)
The reliability of the item in an actual operating context.
The actual reliability is sometimes called operational reliability or functional reliability.
Software Reliability
Software reliability is different from hardware reliability. Hardware items generally deteriorate due to wear or other mechanisms and failures occur as