PSM Safety Critical Equipment 1713620016
PSM Safety Critical Equipment 1713620016
• Introduction
• Who is BakerRisk ?
• The importance of Safety Critical Elements (SCEs) as protective barriers to prevent Major
  Accident Hazard (MAH) events.
• How to identify SCEs, and understand their different types and roles.
• Performance Standards for SCEs – defining what they must achieve
• Maintaining SCEs, and how to risk assess the impacts of their impairment.
• Questions
                                                                                              2
Presenters
                                                                      3
   BakerRisk – Why ?
 Why ?
 • To save lives and make the
   world a better, safer place to do
   business.
   https://www.bakerrisk.com/about-bakerrisk
   https://www.bakerrisk.com/egypt/
https://www.bakerrisk.com/about-bakerrisk
                                               4
BakerRisk – Background & Foreground
                                                                                  5
BakerRisk Organization
• BakerRisk – What We Do
 ►   Process Safety
 ►   Protective Structures/Structural Eng.
 ►   Blast Effects
 ►   Accident Investigations
 ►   BakerRisk Learning Center
 ►   Testing
 ►   Software
 ►   FORTRESS Protective Buildings
                                             6
Process Safety
                                                            • Process Safety Management
                       Process Safety                         •   Gap analysis
                       Management                             •   Management system development
                                                              •   Leadership coaching
        Quantitative
           Risk                          Leadership           •   Audit and review
        Assessment                      Development
                                                            • Identify hazards
                                                              •   Analyze fire, explosion and toxic threats to personnel,
                                                                  occupied buildings and critical equipment
                                                 Process      •   Assess hazards and operability issues
                                                 Hazard       •
     Facility                                                     Evaluate layers of protection
  Siting Study                                   Analysis
                       Process                              • Risk Assessment
                                                              •   Analyze the risks
                        Safety                                •   Quantify risk exposure, both on- and off-site
                                                                                                                            7
Our input to Process Safety Knowledge
Specialist research programs created   Authoring and contribution to books   Validation testing for industry
and managed by BakerRisk for over      on process safety topics:             standards on facility siting (API 752/
40 member companies since 1993                                               753/ 756)
• Explosion Research
• Pressure Testing
• Ammonia & Fertilizer
Joint Industry Research Programs CCPS Guidelines Book Series Standards Testing & Validation
                                                                                                                                 8
Where we work
 • Industry & Safety organizations : AIChE, API, ASTM, ASME, CCPS, EPSC, IChemE, NFPA, Energy Institute.
 • Conferences and working groups: Safety in Ammonia Plants symposia, IChemE Hazards, HazardEx, EGYPS, FABIG, GCPS, Qatar
   Process Safety Symposium, HF Alkyl Industry Forum, OPERA, Mary Kay O’Connor Safety Centre.
                                                                                                                            9
The importance of Safety Critical Elements (SCEs)
as barriers to prevent Major Accident Hazard (MAH)
events.
                                                                11
 Safety, or Safety Critical ?
  •   However, if the reverse occurs, then lives, businesses and the environment will be
      exposed to an unacceptable level of risk.
                                                                                                   12
 Definition
                                                                                                 13
 Other “Safety Critical” concepts
• Safety Critical Equipment is a term used at the asset tag level to identify components within a Safety Critical
  Element which are key to its operation, e.g., one of many gas detectors that is part of a gas detection SCE.
• Safety Critical System (SCS) refers to the collected protective system formed one or more SCEs, which may
  include administrative protections such as Safety Critical Procedures (SCP) (for example, safety permit system).
• Safety Critical Tasks (SCT) are human activities which play a major part in protecting against MAH events (for
  example, the testing of firefighting equipment).
  •   While the same concepts that apply to SCEs also apply to Safety Critical Procedures and Tasks, it must be remembered that these rely
      on people – and are therefore open to human limitations – often referred to as “human failings” or “human error”.
  •   For Safety Critical Tasks, it may be necessary to address these considerations using techniques such as Human Factors, or Safety
      Critical Task Analysis (SCTA). These techniques will be discussed in more detail during a future webinar.
• ECE – Environmentally Critical Elements – the same concept as SCE, but focussed on environmental
  protection.
                                                                                                                                         14
Layers of Protection
• Major Accident Hazards are so serious, and so significant, that we cannot risk them being
  caused by one single failure.
• A key principle of Process Safety Management is the use of “multiple, diverse layers of
  protection” (also known as “defence in depth”) to protect against MAH events.
• This approach provides many opportunities to try and prevent, or control, or mitigate the
  MAH event – and so manage the risks that they present to an acceptable level : As Low As
  Reasonably Practicable (ALARP).
                                                                                              15
Layers of Protection / Barriers
 The “Swiss Cheese” model for barriers
                                                                                                     18
The Importance of SCEs – Key Points
 3   Swiss Cheese
     Layers of Protection – Barriers – are not
                                                 4   Active or Passive ?
                                                     Passive SCEs do not require energy in
     perfect, and have weaknesses that must          order to operate. Active SCEs require
     be managed. To ensure that SCEs remain          energy and function on demand. A
     robust, assurance and verification checks       combination of these systems gives
     must be carried out                             “defense in depth” against MAH events
                                                                                                  19
How to identify SCEs, and their different types and
roles.
                                                                  Emergency
     Prevention     MAH        Detection   Control   Mitigation
                                                                   Response
                                                                              21
The Stages of a MAH event - Prevention
                                                                                         Emergency
    Prevention        MAH             Detection        Control        Mitigation
                                                                                          Response
                     Structural
                                                  Structural / Foundation Integrity. Support of facilities.
                      Integrity
   Prevention of      Process          Pressure & thermal relief devices, Process containment, Pressure vessels,
    the incident    Containment                        Piping and valve systems, Level control
                                                                                                                    22
The Stages of a MAH event - Detection
                                                                              Emergency
    Prevention      MAH           Detection      Control      Mitigation
                                                                               Response
                                                             Gas detection
                                                             Fire detection
                   Purpose : To
                                                          Toxic gas detection
   Detection of    detect that
                                                          Corrosion detection
   the incident   other systems
                                              Storage tank (high) level / Overfill detection
                   have failed
                                               Process control, alarms and annunciation
                                                        (Liquid) leak detection
                                                                                               23
The Stages of a MAH event - Control
                                                                             Emergency
     Prevention       MAH            Detection    Control    Mitigation
                                                                              Response
                                                                                                24
The Stages of a MAH event - Mitigation
                                                                                        Emergency
    Prevention        MAH                Detection      Control       Mitigation
                                                                                         Response
                                                                                                                 25
The Stages of a MAH event – Emergency Response
                                                                                  Emergency
    Prevention     MAH             Detection       Control      Mitigation
                                                                                   Response
                                                                                                     26
Categories of SCE
                                       Prevention
                                                    Note: Major Accident Hazards are “Low frequency, High
 2. Process Containment.                            consequence” events, often capable of causing multiple
 3. Ignition Control.                               fatalities, and generally have multiple causes.
                                       Mitigation
                                                    Such incidents are addressed by risk reduction measures
 6. Mitigation (Protection) Systems.                which address such hazards as manual handling, slips,
 7. Emergency Response.                             trips, noise etc. Such measures should not be considered
                                                    as SCEs (or SCSs).
 8. Lifesaving Equipment.
                                                                                                               27
 Identifying SCEs
•Safety Critical Elements can be identified by using :
• Checklists (from the eight categories previously
   shown)
•Safety studies, such as:
• Hazard Identification (HAZID) Studies
• Hazard and Operability (HAZOP) Studies
• Simultaneous Operation (SIMOPS) studies
• Layer of Protection Analysis (LOPA)
• Safety Integrity Level (SIL) determination
• Quantitative Risk Analysis (QRA)
•And
• Barrier Analysis (e.g. Bow Tie diagrams)
                                                                                                         29
The Stages of a MAH event
                                                                  Emergency
    Prevention     MAH        Detection   Control    Mitigation
                                                                   Response
                    MAH
    Prevention
                  The “Top”                 Mitigation Barriers
     Barriers
                    Event
                                                                              30
SCE Barrier Analysis
                          MAH        MAH
      Prevention
          Prevention Barriers
                       The “Top”   The “Top”           Mitigation
                                               Mitigation Barriers Barriers
       Barriers
                          Event      Event
    Prevention                                                         Mitigation
     Barriers                                                           Barriers
                                                                                             31
Bow Tie Diagram
                                               32
Criticality Ranking
                                   MAH
              SCE                                  Consequence           Redundancy
           Criticality       =   Functional
                                              *   of Failure score   *      score
                                 Role score
            1 to 7 - Low
          8 to 16 - Medium
                             =      1 to 4
                                              *         1 to 3
                                                                     *      1 to 3
17 to 36 - High
                                                                                         33
Definitions – Scoring system
                                                    Severity of Consequences   Description                      Consequence of
  MAH Management               Functional
                                                    of Failure of SCE                                           Failure Score
  Functional Role of SCE       role score
                                                                               Multiple fatalities and/or
  Prevention                         4              Disastrous                                                          3
                                                                               extensive plant damage.
  Detection                          3                                         Single fatality and/or many
  Control and mitigation             2              Catastrophic               serious injuries and/or                  2
                                                                               significant plant damage.
  Emergency response and                                                       Many injuries and/or local
                                     1              Major                                                               1
  life saving                                                                  plant damage.
                                                Functional
  Redundancy of SCE                                                      SCE Criticality Score   SCE Criticality Rank
                                                role score
  No other SCE that duplicates the full                                         17-36                    High
                                                     3
  functionality of the failed/unavailable SCE
  SCE design has provision for redundancy            2                           8-16                  Medium
  An alternative SCE can provide full
                                                     1                           1-7                     Low
  functionality of the failed/unavailable SCE
                                                                                                                                 34
Summary – Identifying SCEs
                                            35
Identification of SCEs – Key Points
 3   Criticality Ranking
     A criticality score for SCEs can be
                                                  4     Bow Ties
                                                        Bow Ties provide a clear, pictorial
     developed, based on their role in                  representation of what types of barriers in
     addressing a MAH event, the possible               place which is valuable for overall
     consequence of their failure, and an               understanding of SCEs, SCSs and their
     assessment of redundancy in their design           interactions
                                                                                                                 36
Performance Standards for SCEs – defining
what SCEs must achieve
The Performance Standards sets out what the SCE must achieve:
• The key properties that it must have, in order to remain effective at managing the
  MAH over time.
• The levels of performance that it must achieve.
• The assurance activities required to meet that level of performance.
• The verification activities that may be required.
                                                                                       38
The Performance Standard
The Performance Standards sets out what the SCE must achieve:
• The key properties that it must have, in order to remain effective at managing the
  MAH over time.
• The levels of performance that it must achieve.
• The assurance activities required to meet that level of performance.
• The verification activities that may be required.
                                                                                       39
What properties must a SCE have ?
                                                                                                       41
FARSI - Functionality
                                                                                    42
 FARSI - Availability
• Availability – The proportion of time that the SCE must be able to perform its function.
• This may be stated as, e.g., unavailability of [number] hours per year or in terms of mean
  time to repair (MTTR).
• Given their importance in preventing or mitigating a MAH event, most SCEs should always
  be available (high availability).
• Availability is affected by the need to maintain the equipment, either planned or
  unplanned, and so in order to give high availability many SCE systems will employ
  redundancy or use voting systems.
• If an SCE is subject to maintenance and so is out of use, an operational risk assessment (ORA) should be raised if this is
  for a significant period.
                                                                                                                          43
FARSI - Reliability
• Reliability – The likelihood of the SCE performing on demand (or, conversely, the
  likelihood of the SCE failing to perform on demand).
• Reliability may be described as the mean time between failure (MTBF).
                                                                                      44
FARSI - Survivability
• Survivability – The MAH event(s) that the SCE can survive and still perform its
  function.
• Any SCE which is mitigative must still be operating after the MAH event.
  Preventative SCEs may not require survivability.
• This may can be achieved by the design of the SCE itself (e.g. an explosion-proof
  unit) or by its layout within the plant.
                                                                                      45
FARSI - Interactions / Dependency
• Interactions (and Dependencies) – What other systems (usually also SCEs) are required to
  function at the same time, or directly depend upon.
• A common cause failure is where several SCEs are impaired due to one central failure
  mechanism – for example, loss of electrical power. (in the example of emergency lighting,
  this is why luminaries are specified as “self-powered”)
• In this situation, these SCEs are not independent and so should be considered as a
  combined system.
• A diverse selection of SCEs (passive and active) should reduce the risk of common mode
  failure.
                                                                                              46
Performance Standards – Key Points
1   Performance Standards
    To clearly define the performance required
                                                         2        Properties of a SCE - FARSI
                                                                  •    Functionality
    of a SCE :                                                    •    Availability
    • Key properties                                              •    Reliability
    • Levels of performance (including failure                    •    Survivability
        criteria)                                                 •    Interactions and Dependencies
    • Assurance requirements
    • Verification requirements
The Performance Standards sets out what the SCE must achieve:
• The key properties that it must have, in order to remain effective at managing the
  MAH over time.
• The levels of performance that it must achieve.
• The assurance activities required to meet that level of performance.
• The verification activities that may be required.
                                                                                       49
 Management Maintenance System for SCEs
The maintenance management system for SCEs should have the capability to :
• Ensure that SCEs continue to meet their performance specification FARSI criteria throughout their life.
• Record SCE inspection, maintenance, testing and repair history, along with passes or failures so that
  management Key Performance Indicators (KPIs) can be produced to monitor the system.
• Carry out the repair of failed SCEs, and initiate operational risk assessment for those having degraded
  performance.
  •   Manage SCE repairs to maximize availability (i.e. minimize Mean Time To Repair MTTR).
  •   Schedule SCE maintenance to maximize reliability (i.e. maximize Mean Time Between Failure MTBF).
                                                                                                            50
 Assurance of SCEs
• The assurance process should carry out inspection, maintenance and testing according to the
  Performance Standard, including checks on potential failure modes of SCEs.
• The assurance activities must be carried out at the scheduled frequency by competent (i.e. suitably
  qualified, experienced and authorized) people.
• For active SCEs, function on demand must be checked – and that the SCE is not left in an impaired
  condition.
• For passive SCEs, checks on their continued suitability against degradation, such as ageing, weathering
  or corrosion must be made.
  •   Where SCEs are found in an impaired condition and not immediately repaired, a suitable risk assessment must be
      carried out, and appropriate mitigations put in place if required.
                                                                                                                  51
 Verification of SCEs
• For each Performance Standard requirement, there should be at least one verification
  activity, its frequency, and how it is performed (review, examination, or witnessing)
• Verification is most efficient if carried out with a suitable degree of independence from the
  assurance process (e.g., third-party inspection of pressure vessels).
                                                                                                52
 Performance Analysis
                                                                                        53
Impairment
• If an SCE is found not to comply with the Performance Standard – namely, it fails during a
  critical function test – then this must be recorded in the maintenance management system,
  as either:
 • Failed or degraded, or
 • Failed or degraded and fixed – confirming that after repair, the SCE met the Performance
   Standard.
 • In either case, an investigation into the nature of the failure or degradation is required.
• If rapid repair is not possible, then an Operational Risk Assessment (ORA) must be carried
  out (see next slide).
                                                                                                 54
 Operational Risk Assessment (ORA)
                                                                                           55
ORA Team
• The team must have sufficient knowledge of the purpose of the SCE in protecting
  against MAH event(s), the consequences of operating without it in place, and
  interactions with other systems, both SCE, SCS and other.
• The minimum technical authorities to be involved in the ORA could be:
 •   Operational Management
 •   Process Safety
 •   Process Engineering / Technical Authorities
 •   Mechanical Engineering
     • (Other specialist disciplines may be required).
                                                                                    56
Risk Level – Key Questions to answer
 Does risk                      Unit shutdown
now exceed                        required ?
 allowable
   level ?
                                    Acceptable to implement
                                         mitigations ?
                                            Acceptable operational
   SCE                                           risk level ?
Impairment
  occurs
                                                   SCE return to
                                                       duty
                                                                     57
 Key considerations for the ORA
• It is critical to stress to all ORA participants that ORA is not simply a mechanism by which
  continued operation can be justified under any circumstances.
• Whilst guidance provided may not have immediately directed to shut down or limit
  operations, this may still be the eventual outcome of the ORA process.
• It is also particularly important to stress to the team that they are undertaking an ORA and
  not a Task Risk Assessment - so a Major Accident Hazard mindset is needed, rather than a
  personal injury mindset.
                                                                                                 58
 Information Required
                                                      59
 Example questions to ask in the ORA
                                                                                                                       60
 Visualizing the Information
• Bow Tie diagrams can help visualize the large amount of information required to carry out
  an effective ORA.
• So, consideration should be given to having prepared Bow Tie diagrams for each set of MAH
  hazards – see:
  • EGPC-PSM-GL-006 Major Accident Hazard Management Guideline and EGPC-PSM-GL-013
    Safety Case Appraisal Guideline.
• An example is given on the next slide for an impaired SCE level trip on a gasoline storage
  tank
                                                                                               61
Bow Tie example
Note that different colours can be used to indicate impaired or less effective barriers
                                                                                          62
Maintaining SCEs – Key Points
 1   Maintenance System
     Requirements                                     2   Assurance and Verification
                                                          Processes for assurance and verification;
     Ensuring that SCEs comply with the Performance       personnel competency; monitoring for SCE
     Standard; recording performance information          degradation; identification of impairment.
     and the generation of Key Performance                Continuous improvement.
     Indicators (KPIs)
                                                                                                       63
Safety, or Safety Critical ?
The identification and management of
Safety Critical Elements (SCE)
In summary….
• Presenter 1, Presenter 2, Presenter 3
• Safety Critical Elements are critical layers of protection against the consequences of Process
  Safety incidents such as Major Accident Hazards.
• Applying diverse types of SCEs, from “Prevention” to “Emergency Response” provides the
  most robust protection.
• Performance Standards must define SCE requirements, and failure criteria, and be followed
  without compromise.
• Assurance and verification are essential for management of SCEs to maintain performance.
• Operational Risk Assessments are essential if impaired SCEs cannot be repaired promptly.
                                                                                              65
QUESTIONS ?
BakerRisk.com
67