Root Cause Failure Analysis. Trinath Sahoo
be listed. You should use category labels that make sense for the diagram you are creating.Identify as many causes or factors as possible and attach them as subbranches of the major branchesIshikawa Diagram ‐ Categorize Causes
5 Determine Deeper CausesEach cause on the chart is then analyzed further to determine if there is a more fundamental cause for that aspect. This can be done by asking the question, “Why does it happen?”This step can also be done for the deeper causes that are identified. Generally, you can stop going deeper when a cause is controlled one level of management removed from your group. Use your judgment to decide when to stop.Fishbone Chart ‐ Deeper Causes
6 Identify Root CausesThe final step for creating a fishbone diagram is to identify the root causes of the problem. This can be done in several ways…Look for causes that appear repeatedlySelect using group consensus methodsSelect based on frequency of occurrenceFishbone diagrams are an excellent way to explore and visually depict the causes of a problem. They enable the root causes of a problem to be determined. This will help you be more effective by focusing your actions on the true causes of a problem and not on its symptoms. It Encourages group participation, Uses an orderly, easy‐to‐read format to diagram cause and effect relationships.
Fault Tree Analysis
Fault tree analysis helps determine the root cause of failure of a system using Boolean logic to combine a series of lower level events. FTA is a deductive analysis depicting a visual path of failure. It is a top‐down analysis that helps determine the probability of occurrence for an undesirable event. The analysis creates a visual record showing the logical relationships between events and failures that lead to the undesirable event. It easily presents the results of your analysis and pinpoints weaknesses in the system.
The fault tree analysis (FTA) was first introduced by Bell Laboratories and is one of the most widely used methods in system reliability, maintainability and safety analysis. It is a deductive procedure used to determine the various combinations of hardware and software failures and human errors that could cause undesired events (referred to as top events) at the system level.
To do a comprehensive FTA, follow these steps:
1 Define the fault condition, and write down the top‐level failure.
2 Using technical information and professional judgments, determine the possible reasons for the failure to occur. Remember, these are level two elements because they fall just below the top‐level failure in the tree.
3 Continue to break down each element with additional gates to lower levels. Consider the relationships between the elements to help you decide whether to use an “and” or an “or” logic gate.
4 Finalize and review the complete diagram. The chain can only be terminated in a basic fault: human, hardware, or software.
5 If possible, evaluate the probability of occurrence for each of the lowest level elements and calculate the statistical probabilities from the bottom up.
Drawing Fault Trees: Gates and Events
Gate symbols represent results of interactions among contributing failure events and can vary among tools. Basic gates used to construct the Fault Tree can be seen below:
Gate Symbol | Name | Causal Relation |
|
OR | Output event occurs if any one of the input events occurs |
|
AND | Output event occurs if all input events occur |
|
BASIC | Basic event for which failure data is available. |
|
INTERMEDIATE EVENT | System or component event description |
|
TRANSFER | Indicates that this part of the fault tree is developed in a different part of the diagram or on a different page. |
FTA Gate Notes
Fault Tree probabilities can be computed by simple arithmetic only if basic events (component failures without lower level contributors) are independent. Independence is determined by ensuring the failure of one basic event has no effect on any other and groups of basic events cannot fail from common causes such as shock. For independent basic events with very small failure rates, typically found in electronic components, an AND gate output probability can be computed as the product of its input failure probabilities, and an OR gate output probability can be computed as the sum of its input failure probabilities.
Below is a Basic Fault Tree Analysis Example Structure:
The five basic steps to perform a Fault Tree Analysis are as follows:
1 Identify the Hazard
2 Obtain Understanding of the System Being Analyzed
3 Create the Fault Tree
4 Identify the Cut Sets
5 Mitigate the Risk
Top‐level event is called a Cut Set. There are many cut sets within the FTA. Each has an individual probability assigned to it. The paths related to the highest severity / highest probability combinations are identified and will require mitigation.
How to Undertake a Fault Tree Analysis?
Although the nature of the undesired event may be quite different, fault tree analysis has the same procedure for any types of undesired event. To do a comprehensive fault tree analysis, simply follow the process below:
1 Define and identify the fault condition (hazard) as precisely as possible based on the aspects such as the amount, duration, and related impacts.
2 Using technical skills and existing facility details to list and decide all the possible reasons for the failure occurrence.
3 Break down the tree from the top level according to the relationship between different components until you work down to the potential root cause. The structure of your fault tree analysis diagram should be based on the top, middle (subsystems), and the bottom (basic events, component failures) levels.
4 If your analysis involves the quantitative part, evaluate the probability of occurrence for each of the components and calculate the statistical probabilities for the whole tree.
5 Double‐check your overall fault tree analysis diagram and implement modifications to the process if necessary.
6 Collect data, evaluate your results in full details by using risk management, qualitative, and quantitative analysis to improve your system.