|
If you have Adobe Reader you can also download a .pdf version. 
Methods of Determining Safety Integrity Level (SIL) Requirements - Pros and Cons
by W G Gulland (4-sight Consulting)
1 Introduction
The concept of safety integrity levels (SILs) was introduced during the development of BS EN 61508 (BSI 2002) as a measure of
the quality or dependability of a system which has a safety function – a measure of the confidence with which the system can be
expected to perform that function. It is also used in BS IEC 61511(BSI 2003), the process sector specific application of BS EN 61508.
This paper discusses the application of 2 popular methods of determining SIL requirements – risk graph methods and layer of
protection analysis (LOPA) – to process industry installations. It identifies some of the advantages of both methods, but also
outlines some limitations, particularly of the risk graph method. It suggests criteria for identifying the situations where the use of these methods is appropriate.
2 Definitions of SILs
The standards recognise that safety functions can be required to operate in quite different ways. In particular they recognise that
many such functions are only called upon at a low frequency / have a low demand rate. Consider a car; examples of such functions are:
Anti-lock braking (ABS). (It depends on the driver, of course!).
Secondary restraint system (SRS) (air bags).
On the other hand there are functions which are in frequent or continuous use; examples of such functions are:
The fundamental question is how frequently will failures of either type of function lead to accidents. The answer is different for the
2 types:
For functions with a low demand rate, the accident rate is a combination of 2 parameters – i) the frequency of demands,
and ii) the probability the function fails on demand (PFD). In this case, therefore, the appropriate measure of performance
of the function is PFD, or its reciprocal, Risk Reduction Factor (RRF).
For functions which have a high demand rate or operate continuously, the accident rate is the failure rate, λ, which is the
appropriate measure of performance. An alternative measure is mean time to failure (MTTF) of the function. Provided failures are exponentially distributed, MTTF is the reciprocal of λ.
These performance measures are, of course, related. At its simplest, provided the function can be proof-tested at a frequency
which is greater than the demand rate, the relationship can be expressed as:
|
PFD
|
=
|
λT/2
|
or
|
=
|
T/(2 x MTTF), or
|
|
RRF
|
=
|
2/(λT)
|
or
|
=
|
(2 x MTTF)/T
|
where T is the proof-test interval. (Note that to significantly reduce the accident rate below the failure rate of the function, the test
frequency, 1/T, should be at least 2 and preferably ≥ 5 times the demand frequency.) They are, however, different quantities. PFD
is a probability – dimensionless; λ is a rate – dimension t-1. The standards, however, use the same term – SIL – for both these
measures, with the following definitions:
Table 1 - Definitions of SILs for Low Demand Mode from BS EN 61508
|
SIL
|
Range of Average PFD
|
Range of RRF[1]
|
|
4
|
10-5 ≤ PFD < 10-4
|
100,000 ≥ RRF > 10,000
|
|
3
|
10-4 ≤ PFD < 10-3
|
10,000 ≥ RRF > 1,000
|
|
2
|
10-3 ≤ PFD < 10-2
|
1,000 ≥ RRF > 100
|
|
1
|
10-2 ≤ PFD < 10-1
|
100 ≥ RRF > 10
|
|
Table 2 - Definitions of SILs for High Demand / Continuous Mode from BS EN 61508
|
SIL
|
Range of λ (failures per hour)
|
~ Range of MTTF (years)[2]
|
|
4
|
10-9 ≤ λ < 10-8
|
100,000 ≥ MTTF > 10,000
|
|
3
|
10-8 ≤ λ < 10-7
|
10,000 ≥ MTTF > 1,000
|
|
2
|
10-7 ≤ λ < 10-6
|
1,000 ≥ MTTF > 100
|
|
1
|
10-6 ≤ λ < 10-5
|
100 ≥ MTTF > 10
|
|
In low demand mode, SIL is a proxy for PFD; in high demand / continuous mode, SIL is a proxy for failure rate. (The boundary
between low demand mode and high demand mode is in essence set in the standards at one demand per year. This is
consistent with proof-test intervals of 3 to 6 months, which in many cases will be the shortest feasible interval.)
Now consider a function which protects against 2 different hazards, one of which occurs at a rate of 1 every 2 weeks, or 25 times
per year, i.e. a high demand rate, and the other at a rate of 1 in 10 years, i.e. a low demand rate. If the MTTF of the function is
50 years, it would qualify as achieving SIL1 for the high demand rate hazard. The high demands effectively proof-test the function
against the low demand rate hazard. All else being equal, the effective SIL for the second hazard is given by:
So what is the SIL achieved by the function? Clearly it is not unique, but depends on the hazard and in particular whether the
demand rate for the hazard implies low or high demand mode.
In the first case, the achievable SIL is intrinsic to the equipment; in the second case, although the intrinsic quality of the
equipment is important, the achievable SIL is also affected by the testing regime. This is important in the process industry
sector, where achievable SILs are liable to be dominated by the reliability of field equipment – process measurement instruments
and, particularly, final elements such as shutdown valves – which need to be regularly tested to achieve required SILs.
The differences between these definitions may be well understood by those who are dealing with the standards day-by-day, but
are potentially confusing to those who only use them intermittently.
3 Some Methods of Determining SIL Requirements
BS EN 61508 offers 3 methods of determining SIL requirements:
- Quantitative method.
- Risk graph, described in the standard as a qualitative method.
- Hazardous event severity matrix, also described as a qualitative method.
BS IEC 61511 offers:
Semi-quantitative method.
Safety layer matrix method, described as a semi-qualitative method.
Calibrated risk graph, described in the standard as a semi-qualitative method, but by some practitioners as a semi
-quantitative method.
Risk graph, described as a qualitative method.
Layer of protection analysis (LOPA). (Although the standard does not assign this method a position on the qualitative /
quantitative scale, it is weighted toward the quantitative end.)
Risk graphs and LOPA are popular methods for determining SIL requirements, particularly in the process industry sector. Their
advantages and disadvantages and range of applicability are the main topic of this paper.
4 Risk Graph Methods
Risk graph methods are widely used for reasons outlined below. A typical risk graph is shown in Figure 1.

Figure 1 - Typical Risk Graph
The parameters of the risk graph can be given qualitative descriptions, e.g.:
or quantitative descriptions, e.g.:
Table 3 - Typical Definitions of Risk Graph Parameters
|
Consequence
|
|
CA
|
Minor injury
|
|
CB
|
0.01 to 0.1 probable fatalities per event
|
|
CC
|
> 0.1 to 1.0 probable fatalities per event
|
|
CD
|
> 1 probable fatalities per event
|
|
Exposure
|
|
FA
|
< 10% of time
|
|
FB
|
≥ 10% of time
|
|
Avoidability / Unavoidability
|
|
PA
|
> 90% probability of avoiding hazard
|
< 10% probability hazard cannot be avoided
|
|
PB
|
≤ 90% probability of avoiding hazard
|
≥ 10% probability hazard cannot be avoided
|
|
Demand Rate
|
|
W1
|
< 1 in 30 years
|
|
W2
|
1 in > 3 to 30 years
|
|
W3
|
1 in > 0.3 to 3 years
|
|
The first definition begs the question “What does several mean?” In practice it is likely to be very difficult to assess SIL
requirements unless there is a set of agreed definitions of the parameter values, almost inevitably in terms of quantitative ranges.
These may or may not have been calibrated against the assessing organisation’s risk criteria, but the method then becomes
semi-quantitative (or is it semi-qualitative? It is certainly somewhere between the extremities of the qualitative / quantitative scale.)
Table 3 shows a typical set of definitions.
4.1 Benefits
Risk graph methods have the following advantages:
so that capital and maintenance expenditures can be targeted where they are most effective, and lifecycle costs can be
optimised.
4.2 The Problem of Range of Residual Risk
Consider the example: CC, FB, PB, W2 indicates a requirement for SIL3.
CC ≡ > 0.1 to 1 probable fatalities per event
FB ≡ ≥ 10% to 100% exposure
PB ≡ ≥ 10% to 100% probability that the hazard cannot be avoided
W2 ≡ 1 demand in > 3 to 30 years
SIL3 ≡ 10,000 ≥ RRF > 1,000
If all the parameters are at the geometric mean of their ranges:
|
Consequence
|
=
|
√(0.1 x 1.0) probable fatalities per event
|
|
|
=
|
0.32 probable fatalities per event
|
|
Exposure
|
=
|
√(10% x 100%)
|
=
|
32%
|
|
Unavoidability
|
=
|
√(10% x 100%)
|
=
|
32%
|
|
Demand rate
|
=
|
1 in √(3 x 30) years
|
|
|
=
|
1 in ~10 years
|
|
RRF
|
=
|
√(1,000 x 10,000)
|
=
|
3,200
|
(Note that geometric means are used because the scales of the risk graph parameters are essentially logarithmic.)
For the unprotected hazard:
|
Worst case risk
|
=
|
(1 x 100% x 100%) / 3 fatalities per year
|
|
|
=
|
1 fatality in ~3 years
|
|
Geometric mean risk
|
=
|
(0.32 x 32% x 32%) / 10 fatalities per year
|
|
|
=
|
1 fatality in ~300 years
|
|
Best case risk
|
=
|
(0.1 x 10% x 10%) / 30 fatalities per year
|
|
|
=
|
1 fatality in ~30,000 years
|
i.e. the unprotected risk has a range of 4 orders of magnitude.
With SIL3 protection:
|
Worst case residual risk
|
=
|
1 fatality in (~3 x 1,000) years
|
|
|
=
|
1 fatality in ~3,000 years
|
|
Geometric mean residual risk
|
=
|
1 fatality in (~300 x 3,200) years
|
|
|
=
|
1 fatality in ~1 million years
|
|
Best case residual risk
|
=
|
1 fatality in (~30,000 x 10,000) years
|
|
|
=
|
1 fatality in ~300 million years
|
i.e. the residual risk with protection has a range of 5 orders of magnitude.
Figure 2 shows the principle, based on the mean case.

Figure 2 - Risk Reduction Model from BS IEC 61511
A reasonable target for this single hazard might be 1 fatality in 100,000 years. In the worst case we achieve less risk reduction
than required by a factor of 30; in the mean case we achieve more risk reduction than required by a factor of 10; and in the best
case we achieve more risk reduction than required by a factor of 3,000. In practice, of course, it is most unlikely that all the
parameters will be at their extreme values, but on average the method must yield conservative results to avoid any significant
probability that the required risk reduction is under-estimated.
Ways of managing the inherent uncertainty in the range of residual risk, to produce a conservative outcome, include:
Calibrating the graph so that the mean residual risk is significantly below the target, as above.
Selecting the parameter values cautiously, i.e. by tending to select the more onerous range whenever there is any
uncertainty about which value is appropriate.
Restricting the use of the method to situations where the mean residual risk from any single hazard is only a very small
proportion of the overall total target risk. If there are a number of hazards protected by different systems or functions, the
total mean residual risk from these hazards should only be a small proportion of the overall total target risk. It is then very
likely that an under-estimate of the residual risk from one hazard will still be a small fraction of the overall target risk, and
will be compensated by an over-estimate for another hazard when the risks are aggregated.
This conservatism may incur a substantial financial penalty, particularly if higher SIL requirements are assessed.
4.3 Use in the Process Industries
Risk graphs are popular in the process industries for the assessment of the variety of trip functions – high and low pressure,
temperature, level and flow, etc – which are found in the average process plant. In this application domain, the benefits listed
above are relevant, and the criterion that there are a number of functions whose risks can be aggregated is usually satisfied.
Figure 3 shows a typical function. The objective is to assess the SIL requirement of the instrumented over-pressure trip function
(in the terminology of BS IEC 61511, a “safety instrumented function”, or SIF, implemented by a “safety instrumented system”, or
SIS). One issue which arises immediately, when applying a typical risk graph in a case such as this, is how to account for the
relief valve, which also protects the vessel from over-pressure. This is a common situation – a SIF backed up mechanical protection. The options are:
Assume it ALWAYS works
Assume it NEVER works
Something in-between

Figure 3 - High Pressure Trip Function
The first option was recommended in the UKOOA Guidelines (UKOOA 1999), but cannot be justified from failure rate data. The
second option is liable to lead to an over-estimate of the required SIL, and to incur a cost penalty, so cannot be recommended.
See Table 4 for the guidance provided by the standards.
An approach which has been found to work, and which accords with the standards is:
- Derive an overall risk reduction requirement (SIL) on the basis that there is no protection, i.e. before applying the SIF or
any mechanical protection.
- Take credit for the mechanical device, usually as equivalent to SIL2 for a relief valve (this is justified by available failure rate
data, and is also supported by BS IEC 61511, Part 3, Annex F)
- The required SIL for the SIF is the SIL determined in the first step minus 2 (or the equivalent SIL of the mechanical
protection).
The advantages of this approach are:
- It produces results which are generally consistent with conventional practice.
- It does not assume that mechanical devices are either perfect or useless.
- It recognises that SIFs require a SIL whenever the overall requirement exceeds the equivalent SIL of the mechanical device
(e.g. overall requirement = SIL3; relief valve ≡ SIL2; SIF requirement = SIL1).
Table 4 - Guidance from the Standards on Handling “Other Technology Safety Related Systems” with Risk Graphs
|
BS EN 61508
|
BS IEC 61511
|
|
“The purpose of the W factor is to estimate the frequency of the unwanted occurrence
taking place without the addition of any safety-related systems (E/E/PE or other technology) but including any external risk reduction facilities.”
(Part 5, Annex D – A qualitative method: risk graph)
(A relief valve is clearly an “other technology safety-related device”.)
|
“W - The number of times per year that the hazardous event would occur in the absence of
the safety instrumented function under consideration. This can be determined by considering all failures which can lead to the hazardous event and estimating
the overall rate of occurrence. Other protection layers should be included in the consideration.” (Part 3, Annex D – semi-qualitative,
calibrated risk graph) and:
“The purpose of the W factor is to estimate the frequency of the hazard taking place
without the addition of the SIS.” (Part 3, Annex D – semi-qualitative, calibrated risk graph) “The purpose of the W factor is to estimate the
frequency of the unwanted occurrence taking place without the addition of any safety instrumented systems (E/E/PE or other technology) but including any external
risk reduction facilities.” (Part 3, Annex E – qualitative, risk graph)
|
|
4.4 Calibration for Process Plants
Before a risk graph can be calibrated, it must first be decided whether the basis will be:
Individual risk (IR), usually of someone identified as the most exposed individual.
Group risk of an exposed population group, such as the workers on the plant or the members of the public on a nearby
housing estate.
Some combination of these 2 types of risk.
4.4.1 Based on Group Risk
Consider the risk graph and definitions developed above as they might be applied to the group risk of the workers on a given plant
. If we assume that on the plant there are 20 such functions, then, based on the geometric mean residual risk (1 in 1 million years), the total risk is 1 fatality in 50,000 years.
Compare this figure with published criteria for the acceptability of risks. The HSE have suggested that a risk of one 50 fatality
event in 5,000 years is intolerable (HSE Books 2001). They also make reference, in the context of risks from major industrial
installations, to “Major hazards aspects of the transport of dangerous substances” (HMSO 1991), and in particular to the F-N curves it contains (Figure 4).
The “50 fatality event in 5,000 years” criterion is on the “local scrutiny line”, and we may therefore deduce that 1 fatality in 100
years should be regarded as intolerable, while 1 in 10,000 years is on the boundary of “broadly acceptable”. Our target might
therefore be “less than 1 fatality in 1,000 years”. In this case the total risk from hazards protected by SIFs (1 in 50,000 years)
represent 2% of the overall risk target, which probably allows more than adequately for other hazards for which SIFs are not
relevant. We might therefore conclude that this risk graph is over-calibrated for the risk to the population group of workers on the
plant. However, we might choose to retain this additional element of conservatism to further compensate for the inherent uncertainties of the method.
To calculate the average IR from this calibration, let us estimate that there is a total of 50 persons regularly exposed to the
hazards (i.e. this is the total of all regular workers on all shifts). The risk of fatalities of 1 in 50,000 per year from hazards
protected by SIFs is spread across this population, so the average IR is 1 in 2.5 million (4E-7) per year.
Comparing this IR with published criteria from R2P2 (HSE Books 2001):
- Intolerable = 1 in 1,000 per year (for workers)
- Broadly acceptable = 1 in 1 million per year
Our overall target for IR might therefore be “less than 1 in 50,000 (2E-5) per year” for all hazards, so that the total risk from
hazards protected by SIFs again represents 2% of the target, so probably allows more than adequately for other hazards, and we
might conclude that the graph is also over-calibrated for average individual risk to the workers.
The C and W parameter ranges are available to adjust the calibration. (The F and P parameters have only 2 ranges each, and FA
and PA both imply reduction of risk by at least a factor of 10.) Typically, the ranges might be adjusted up or down by half an order of magnitude.

Figure 4 - F-N Curves from Major Hazards of Transport Study
The plant operating organisation may, of course, have its own risk criteria, which may be onerous than these criteria derived from
R2P2 and the Major hazards of transport study.
4.4.2 Based on Individual Risk to Most Exposed Person
To calibrate a risk graph for IR of the most exposed person it is necessary to identify who that person is, at least in terms of his
job and role on the plant. The values of the C parameter must be defined in terms of consequence to the individual, e.g.:
CA ≡ Minor injury
CB ≡ ~0.01 probability of death per event
CC ≡ ~0.1 probability of death per event
CD ≡ death almost certain
The values of the exposure parameter, F, must be defined in terms of the time he spends at work, e.g.:
FA ≡ exposed for < 10% of time spent at work
FB ≡ exposed for ≥ 10% of time spent at work
Recognising that this person only spends ~20% of his life at work, he is potentially at risk from only ~20% of the demands on the
SIF. Thus, again using CC, FB, PB and W2:
CC ≡ ~0.1 probability of death per event
FB ≡ exposed for ≥ 10% of working week or year
PB ≡ > 10% to 100% probability that the hazard cannot be avoided
W2 ≡ 1 demand in > 3 to 30 years
SIL3 ≡ 1,000 ≥ RRF > 10,000
for the unprotected hazard:
|
Worst case risk
|
=
|
20% x (0.1 x 100% x 100%) / 3 probability of death per year
|
|
|
=
|
1 in ~150 probability of death per year
|
|
Geometric mean risk
|
=
|
20% x (0.1 x 32% x 32%) / 10 probability of death per year
|
|
|
=
|
1 in ~4,700 probability of death per year
|
|
Best case risk
|
=
|
20% x (0.1 x 10% x 10%) / 30 probability of death per year
|
|
|
=
|
1 in ~150,000 probability of death per year
|
With SIL3 protection:
|
Worst case residual risk
|
=
|
1 in ~150,000 probability of death / year
|
|
Geometric mean residual risk
|
=
|
1 in ~15 million probability of death / year
|
|
Best case residual risk
|
=
|
1 in ~1.5 billion probability of death / year
|
If we estimate that this person is exposed to 10 hazards protected by SIFs (i.e. to half of the total of 20 assumed above), then,
based on the geometric mean residual risk, his total risk of death from all of them is 1 in 1.5 million per year. This is 3.3% of our
target of 1 in 50,000 per year IR for all hazards, which probably leaves more than adequate allowance for other hazards for which
SIFs are not relevant. We might therefore conclude that this risk graph also is over-calibrated for the risks to our hypothetical
most exposed individual, but we can choose to accept this additional element of conservatism. (Note that this is NOT the same
risk graph as the one considered above for group risk, because, although we have retained the form, we have used a different set of definitions for the parameters.)
The above definitions of the C parameter values do not lend themselves to adjustment, so in this case only the W parameter
ranges can be adjusted to re-calibrate the graph. We might for example change the W ranges to:
W1 ≡ < 1 demand in 10 years
W2 ≡ 1 demand in > 1 to 10 years
W3 ≡ 1 demand in ≤ 1 year
4.5 Typical Results
As one would expect, there is wide variation from installation to installation in the numbers of functions which are assessed as
requiring SIL ratings, but Table 5 shows figures which were assessed for a reasonably typical offshore gas platform.
Table 5 - Typical Results of SIL Assessment
|
SIL
|
Number of Functions
|
% of Total
|
|
4
|
0
|
0%
|
|
3
|
0
|
0%
|
|
2
|
| | |