|
Human Error Ron Graham |
|||||||||
|
Engineers are often treated to the following
explanation of a malfunction or failure:
"human error." When I hear this
explanation, I immediately want to hear more.
Which human? What error? When did it occur?
And why? How is that error to be prevented in
the future? I WANT THE ANSWERS, AND I WANT
'EM NOW!!!
Customers may want to target blame for failures, as would the general public; the responsible parties, on the other hand, want to diffuse the blame. Neither is satisfied by vague responses. It LOOKS to the observer like a shrug and an "oh well, I’m only human." When the public feels this way about a response, it wants a scapegoat; this is why NASA takes a great deal of heat for celebrated failures like that of the Mars Climate Orbiter (MCO).
Engineers focus (hopefully) on identifying such errors and ensuring that they can’t be repeated. We can make plenty of brand new errors without falling back into the same old ones; and sometimes we have to get rid of the old ones just to find out what the new ones ARE. 90% of traffic accidents are based on "human error." (Does this mean that of the drivers?) This assumption leads to the prospect of growth in the telematics industry (automatic information delivery from remote sources). But we don't yet know the point at which information delivery becomes a distraction in itself. We have cell phones, CD players and GPS; we see most states having laws against "distracted driving." But can we readily recognize who's distracted, when, and by what? Audio-visual map prompts have been shown in some studies to be safer than drivers looking at paper maps. (Is this supposed to be a surprise?) Does this mean that holographic GPS-driven maps are next, followed by adaptive cruise control? If we had the "Smart Driver," would we sit in the driver's seat and read the paper? Or take a nap? Astronauts don't trust information delivery well enough to have a totally-automated space rendezvous. Maybe they think what they're doing is more dangerous than driving a car; maybe they're just more sober and serious than the average driver. But would WE trust a "Smart Driver?"
The Roots of Error There are at least two reasons that we put up with the hassle of personal-use high-tech devices that we don’t understand:
But certain types of basic errors of operation are exactly the same whether you’re operating a hand-held appliance or a 777. Stanton refers to Norman’s "mode error" here: a device has multiple modes of operation, and a given action has different consequences in different modes. Mode error is an example of design for the convenience of the designer, mass-produced for all. Even the most noble of engineers can’t be expected on their own to design products that work as easily for everybody else as for themselves. The types of errors we make in doing simple tasks for ourselves, especially in using devices, are the same types of errors we make in complex system design and operation. And, these errors increase in likelihood when the work environment allows for it. Human error results from our own omission or commission, as seen above, but may also result from systemic problems. Eliminating error requires recognizing both individual and systemic causes. How individuals tend to err
How systems tend to allow individuals to err
Dodging Responsibility We will try to avoid taking responsibility for errors because of Karchmer’s Law of Performance Evaluation: ten "attaboys" equals one "aw, shit." An engineer who hasn’t accumulated a substantial number of "attaboys" by the time the error is committed may be out of a job. Again, the threat of losing a job or being thrown to the crowd when we make a mistake may not be the answer -- it may make us miserable without making us any better. Plus, if an engineer’s error causes a problem that propagates up the management chain, it’s very likely the engineer won’t be sacrificed before the managers! How different the engineering world is from the business world! When a business fails, the owners will often be able to find investors in their next venture easily -- investors think the owners won’t make the same mistakes the next time. And they’re probably right. Engineers won’t make the same mistakes the next time either, if the mistakes are large enough -- but will they land on their feet? We learn from watching NASA and its contractors absorb media criticism for failures that
This means that even as a member of the (outraged) public it does me no good to call for a scapegoat. I may not get one, and the product is probably going to improve without one. Anticipating Error Given what we know about ourselves; about the types of errors we commit; and about the environments in which errors occur -- is it possible to predict errors? We can’t know what’s inside others’ minds, but we can observe others’ behavior and the systems they work under. If you observe any of these warning signs, a critical error may be around the corner: Individuals
Systems
We tend to blame workers for having human characteristics, instead of finding ways to work in spite of them. It is true that workers should be able to focus, especially on critical tasks. But the warning signs above tell us it may be impossible to expect that even of the best workers. Systems must therefore be designed to compensate for as much of the above as possible. In this regard we can learn a lot from the stage. Consider: Redundancy. Almost all plays have an understudy for each key role. Does a critical worker require an understudy? If so, then train one. If there’s no money in the budget to do this, then borrow another worker from a non-critical task with its own budget. Is this unethical? Maybe -- but is it more ethical to keep all the staff budgets in their individual pots and risk a failure that haunts all your customers? If there are no other available workers to act as an understudy, draft a manager. There must be one in the organization that hasn’t forgotten the technical side altogether. Simplicity. Plays require the actors to memorize their lines, and enable the memorization by cues. Can this not be done for critical procedures as well? Not if the procedure manual is longer than the screenplay for a major motion picture. Make each critical procedure memorizable, within a single page if possible and written in plain language, and consider ways for the worker to receive a cue for the most critical steps. Honesty. Actors know their play could be a bomb. Workers must know what they face, including all possible consequences of their work. If they’re to have all the necessary confidence in their actions, they must know that others are confident as well. If they’re to be well-rested, they must have the chance to rest. If they’re to be focused, they must have the chance to collect themselves. If they’re not to panic, they must know that what they really need will be there when they need it. It seems clear that the stages in which errors are either most likely to occur or can do the most damage are the stages in which resources should be focused. This means that part of the budget -- whatever part is necessary -- must be targeted to ensure that the stage is clean and paid for, the props tagged and securely stored, the understudies recruited and trained, the scripts edited and revised, the costumes comfortable, and the champagne on ice. Even then there’s no guarantee the play will come off without a hitch, but we can say we did our best. Why Human Error Can't be Avoided There's a certain amount of human error that engineered systems often are intended to tolerate. One classic example involves proximity operations of the Space Shuttle, which are under astronaut control despite their complexity and danger. Astronauts are more comfortable with having their fate in their own hands than with a computer, no matter how powerful the computer.
If we look at any process as involving only controlled inputs plus effort leading to outputs, we miss the contributions of participants, physical constraints, and environment to the process.
...each of these forces us to re-learn how to become error-free. Tracy Kidder's classic, The Soul of a New Machine, reveals four types of human error that affect the development of a new computer, and how the engineers set out to overcome these error types:
References
Neville
Stanton’s web site has some interesting notes on Engineering
Psychology. Start there. (www.soton.ac.uk/~psyweb/sig/engpsy.html) |
|||||||||