|
If you have Adobe Reader you can also download a .pdf version. 
Can Technology Eliminate Human Error?
by A G Foord & W G Gulland, 4-sight Consulting
Abstract
This paper argues that it would not be possible to design technological systems to eliminate all human
errors during operation because people are involved in: specifying, designing, implementing, installing,
commissioning and maintaining systems as well as operating them. The paper illustrates this with examples
of incidents caused by human error and concludes that, even if systems can operate without human
intervention, there is still the possibility of human error at other phases of the lifecycle. Thus to improve
process safety it will be necessary to focus on behaviour and methods of working during all phases of the lifecycle so as to remove or reduce opportunities for human error.
Keywords
safety, human behaviour, methods of working, specification errors, design errors, installation errors, maintenance errors.
Introduction
Trevor Kletz (2004) chaired a half-day discussion at the Hazards XVIII (2004) conference in Manchester on “Human
Factors & Behaviour”. The speakers put forward two opposed views. Some described ways of changing behaviour
and others thought that it was more effective, particularly when trying to improve process safety, to change designs or
methods of working so as to remove or reduce opportunities for human error. All agreed that there is a need for both
approaches - the differences were ones of emphasis - but nevertheless there was a real disagreement on which approach was the more practicable and successful.
The disagreement reported implies that some people suggest focusing primarily on a design that eliminates human
errors during operation. This paper gives examples of incidents in a variety of industries to illustrate:
- the problems of focusing on designs when trying to improve process safety; and
- the need to consider the behaviour and methods of working of designers.
When no reference is cited, the information came from a private communication or our own experience.
Human error
The authors agree that, when trying to improve process safety, we need to consider all three approaches to reduce the
opportunities for human error:
ways of changing behaviour, and
designs, and
methods of working.
Any discussion of the value of focusing on designs needs to consider:
the various types of human error; and
the design process itself.
Types of human error
Figure 1. Types of Human Failures. Based on Figure 2 from HSG48 (1999).
Figure 1 based on Figure 2 from HSG48 (1999) provides one classification of types of human failure. Thus, we might
think that a design could aim to eliminate all seven of the categories of errors, mistakes and violations shown in Figure 1
during operation. A design would then not be accepted unless evaluated against all possible skill-based errors and
mistakes and violations during operation of the design. For example, Hazards XVIII (2004) mentions on page 663 an
“Application Guide on Human factors in Engineering Design” (Kariuki & Löwe 2004) - a guideline to assist engineers to
design a process facility that addresses the capability of the operator.
Design process
However, operation is only part of the lifecycle of any design and the guideline (Kariuki Löwe 2004) does not cover the
capability of the designer or the activities involved in design or any parts of the lifecycle other than operation and maintenance. Human error needs to be considered during:
specifying
designing
implementing
installing & commissioning
operating and
maintaining.
The book Out of Control (2003) summarises the analysis of 34 incidents in the process industry. On pages 44 & 45 it states:
“… a total of 56 causes were identified for the 34 incidents. This data has been grouped in Table 2, which gives the
percentage of the primary causes attributable to each lifecycle phase.”
Figure 2. Allocation of Primary Causes of Incidents to Project Lifecycle Phases.
Based on Figure 10 from Out of Control (2003).
Figure 2 (based on Figure 10 from Out of Control 2003) presents these figures in a pie chart. The summary in Out of
Control reveals that technology failures resulted in only a small proportion of the incidents. In the 34 incidents analysed, 44% had inadequate specification as their primary cause.
Other primary causes listed are:
20% changes after commissioning;
15% design and implementation;
15% operation and maintenance; and
6% installation and commissioning
In order to produce a design that eliminated all human errors during operation we would also need eliminate human
error from all the other phases of the lifecycle. Out of Control (2003) illustrates how difficult this is to achieve for control,
monitoring and protection systems. The following examples illustrate that similar problems apply to many designs, not just control, monitoring and protection systems.
1. Specification error example
The specification should include both the function required and the environment within which the function is to be
performed. Kletz (1999) states on page 158 “For example, a Hazan showed that the probability of a leak of toxic
material was acceptably low. At times small fragile packages containing toxic substances had to be moved but they
were conveyed in trolleys and kept in them. However, when the lift was out of order a package was carried downstairs and placed on a table. It slid off and the contents leaked.”
Software tools are available to check that explicit specifications are consistent, but it is much harder to ensure that
specifications are complete. It is difficult to imagine a technology that does not depend upon human behaviour to define
a specification, although technology could be used to warn that the environment is not consistent with the specification.
For example, monitoring systems are now available that can warn of abuse of packages during transportation.
2. Design errors
For example, the instructions for one of the first Apple portable computers stated "with the aid of a large hammer it is
possible to insert the battery the wrong way round!" Unfortunately the same care was not taken with the design of a light
guard on a guillotine using an edge mounted printed circuit board. The guard stopped the blade from operating if
anything interrupted the beams of light in front of the guillotine. After maintenance the printed circuit board was
reinstalled upside down. Interrupting the light beams then caused the blade to move, injuring an operator. This was not
fail-safe and is primarily a design error, not just a maintenance and testing error.
3. Implementation errors
A supplier provided 20 flanges, 19 mild steel and 1 stainless steel (as he did not have enough mild steel flanges). He
thought this was helpful as the stainless steel flanges were better than the required mild steel spec. He did not consider
that mild steel welding would have failed on the stainless steel flange. Fortunately the customer used 100% material
testing and the mistake was identified before welding commenced. Given a good specification, implementation errors
should be easier to detect, but design and implementation errors are still a significant cause of incidents.
4. Installation and commissioning errors
The low pressure trip failed at a power station. Examination revealed that the commissioning override had never been
removed. (This might be considered a design error as the override should have included a timer that limited how long the override could remain in place.)
A distillation column was upset every time it rained. Months were spent trying to analyse what was happening.
Eventually a small plastic protective plug was found in the vent port of the air supply regulator of the valve controlling the
heat input to the column; normally there was sufficient leakage around the plug for the regulator to work correctly, but
when rainwater got into the threads between the regulator and the plug it formed a seal. The plug should have been removed as part of the installation and commissioning process.
5. Operation errors despite design for safety
Some decades ago there was an unusual increase in the incidence of minor shunt road accidents. Initial analysis
showed the increase was occurring with more expensive cars, and further analysis showed these cars were fitted with
anti-lock braking systems (ABS). After questioning many drivers it eventually emerged that a significant number of
drivers were driving closer to the car in front, knowing that ABS would enable them to brake harder and more effectively.
This is now recognised as a possible effect of any car safety device: seat belts; air bags; active suspension; etc. Some
drivers may drive nearer the edge of the safe envelope assuming that the technology will protect against their driving
errors. “Risk compensation” (sometimes known as “risk homeostasis”) is the term now used to describe this behaviour.
A chemical plant had separate storage tanks for acid and caustic soda. Both were delivered by road tanker. One day
the supervisor challenged an operator who was leaving the control room with a large spanner. The operator explained
that he needed to change a coupling on the plant as it did not fit the tanker that had just arrived. The supervisor
explained that it was designed that way to prevent liquid being unloaded into the wrong tank. The design plus the
intervention of the supervisor prevented an accident arising from mixing acid and caustic soda.
6. Maintenance errors
Repair or replacement
1. Aircraft windscreen (Aviation Safety Network 1990)
The windscreen of a British Airways BAC One-Eleven 528FL passenger aircraft was replaced with bolts of which
84, out of a total of 90, were of smaller than specified diameter. The maintenance work looked complete, but on
10 June 1990 at altitude when the aircraft was pressurised, the windscreen blew out. The commander was sucked
halfway out of the windscreen aperture and was restrained by cabin crew whilst the co-pilot flew the aircraft to a safe landing at Southampton Airport.
Cleaning
1. Shutdown valve
A solenoid operated pneumatic valve (SOV) was a vital part of a shutdown system. The exhaust port of the SOV
was not covered while the surrounding area was cleaned by sand blasting. Sand entered the SOV and caused a
fail to danger. Fortunately this was discovered at the next trip (proof) test, so no injuries resulted.
2. Aircraft pressure ports (Job 1998)
The underside of an aircraft was to be cleaned so the static (reference) ports for both the air speed indicator and
the altimeter were covered with strong adhesive tape before cleaning began. The maintenance procedure
required that after completing the cleaning, the cleaner should remove the tape and the maintenance supervisor
should sign that he had checked that the tape had been removed. Despite these precautions, the aircraft took off
with both static ports still covered with tape. During the flight the pilot received the bizarre combination of
simultaneous warnings for both overspeed and stall (the stick shaker operated as part of the stall warning). The
pilot made the plane dive to avoid stalling but the altimeter read 9700 feet when in fact the plane was less than
1000 feet above the sea and the “too low terrain” warning sounded as well as the overspeed and stall warnings.
Sadly the plane crashed into the sea with the loss of all crew and passengers before the false alarms were recognised.
Conclusion
The examples in Out of Control (2003) and those above illustrate that human error may occur when specifying,
designing, implementing, installing, commissioning, operating, and maintaining systems. Even if technological systems
can operate without human intervention, there is still the possibility of human error at other phases of the lifecycle.
Design may reduce the possibilities for human error during operation, but only if human error can also be eliminated
during all the other phases of the lifecycle. Thus it will be necessary to focus on human behaviour and methods of
working during the whole lifecycle, not just operation. Focusing on “the design for operation” alone will not be enough.
References
Hazards XVIII (2004) IChemE Symposium Series No 150,: Process Safety - Sharing Best Practice, IChemE,
ISBN: 0852954603, pages 652-725
HSG48 (1999) Reducing error and influencing behaviour, HSE Books, ISBN 0-7176-2452-8, page 12
Job, M (1998) Air Disasters Volume 3, Aerospace Publications Pty, ISBN 1 875671 34 X
Kariuki, G. & Löwe, K. (2004) INCORPORATION OF HUMAN FACTORS IN THE DESIGN PROCESS, guide
prepared for PRSIM Focus Group 4, Institute for Plant and Process Technology, Process Safety and Plant
Technology, Technische Universität Berlin, Germany – [online]. [accessed January 2005]. Available from World Wide Web: http://www.prism-network.org/
Kletz, T. (1999) HAZOP and HAZAN Identifying and assessing process industry hazards, IChemE, ISBN 0-85295-421-2, page 158
Out of Control (2003), Second edition, HSE Books, ISBN 0-7176-2192-8, pages 44 & 45
|