Multi‑site reference laboratories operate as a tightly coupled network, where continuity failure is rarely confined to one bench or one site. The most likely “systemic” failure modes cascade across specimen logistics → accessioning/triage → specialty testing → reporting and client communication, amplified by cross‑border air referral, strict turnaround SLAs, and the need to preserve chain‑of‑custody (toxicology and clinical‑trial flows). 

The most “inspection‑persuasive” continuity architecture for reference labs is built around four enforceable design decisions: 

  1. defining Minimum Viable Laboratory (MVL) tiers with pre‑approved triggers; 
  2. a lab‑specific Business Impact Analysis (BIA) that treats logistics, LIS/middleware, referral pathways, and interpretive reporting as critical processes; 
  3. a Test Prioritization Matrix that incorporates clinical criticality, stability window, SLA/penalty exposure, and referral options; and 
  4. an exercised, documented incident command and CAPA loop consistent with ISO 15189’s requirements for continuity planning, downtime plans, quality indicators, internal audits, and corrective action effectiveness.

Cyber disruption is not theoretical: in a major pathology cyber incident, NHS reported that services were fully restored only after months, underscoring that reference‑lab continuity must assume extended outages and manual operations, not just brief IT incidents.

The recommended implementation approach is a phased program over 12 months: complete BIA and MVL design, establish the eight continuity pillars (with SOP packs and fast‑track change control), execute a structured drill calendar (tabletop + functional), stand up dashboards tied to ISO 15189 quality indicators, and formalize mutual aid / send‑out agreements with defined validation and comparability requirements.

Standards and assurance baseline

Continuity as a management system is anchored in International Organization for Standardization ISO 22301, which defines a BCMS framework to plan, implement, operate, monitor, review, maintain, and continually improve documented continuity capability to protect against and recover from disruptive incidents.

Laboratory‑specific continuity obligations are made explicit in ISO 15189:2022, and College of American Pathologist including:

  1. downtime plans for information systems that affect lab activities, 
  2. continuity and emergency preparedness planning that identifies risks when activities are limited or unavailable, requires coordinated strategies using plans/procedures/technical measures, and requires periodic testing and exercising of response capability
  3. strong controls for specimen transport, acceptance exceptions, referral laboratories, comparability of results, record traceability, corrective action, and monitoring via quality indicators.
  4. specimen transport and tracking, chain‑of‑custody
  5. For operational reality, CAP publications also stress that procedures must remain accessible even during LIS/network downtime.

Operational preparedness in laboratories is addressed by Clinical and Laboratory Standards Institute guidance (eg, CLSI GP36 is the CLSI guideline “Planning for Laboratory Operations During a Disaster.”), which describes all‑hazards planning and sustaining operations through the entire laboratory “path of workflow” (pre‑examination, examination, post‑examination).

Business impact analysis and test prioritization

Definitions that must be used consistently

Reference laboratories frequently mix business continuity terms across IT, operations, and quality. To avoid misalignment, define the following in your BCMS/QMS glossary and enforce them in BIA documentation:

  • MTPD / MAO: the time until adverse impacts from not providing a service become unacceptable (terminology varies by region; treat MAO/MTPD consistently).
  • RTO: the time within which systems/functions must be recovered after an outage (including assessment, execution, and verification time).
  • RPO: the point in time to which data must be restored after an outage to enable operation on resumption.

These terms matter because ISO 15189 requires laboratories to plan downtime operations and also requires traceable amendment of records both are constrained by RTO/RPO decisions.

Tailored BIA approach for multi‑site referral laboratories

A reference lab BIA must treat the laboratory as a network rather than a single facility. ISO 15189 requires that risks across pre‑examination, examination, and post‑examination processes be identified and mitigated, and that the adequacy of sample transportation systems be periodically evaluated both imply a BIA scope that includes collection sites, couriers, hubs, and referral labs.

A practical BIA workflow for your environment is:

  1. Map critical “end‑to‑end service lines” (eg, microbiology culture + susceptibility + critical calls; histopath tissue workflow + reporting; HLA transplantation support; genomics/trials sample flows; toxicology with chain‑of‑custody).
  2. Decompose into processes that fail independently: specimen receipt/triage, accessioning, storage and stability preservation, test set‑up, analytic run, result review and release, interpretation and consultation, critical result notification, report amendment, referral shipment, client notification of delays, billing/claims where tied to trial SLAs.
  3. Identify dependencies per process (people/competence, facilities/utilities, LIS/middleware/interfaces, reagents/consumables, instruments, transport/air lanes, external providers). ISO 15189 explicitly covers externally provided products/services and referral laboratories and requires defined qualification/re‑evaluation criteria.
  4. Set MAO/MTPD, RTO, RPO by process, not by department. Ensure “short stability window” services (eg, many microbiology critical findings, unstable analytes, urgent transfusion‑support requirements) are modeled with logistics + LIS dependencies, not only bench time.
  5. Define the “MVL capability” that satisfies medical safety and contractual duties at reduced capacity, explicitly including manual operations and referral rerouting. ISO 15189 requires a coordinated continuity strategy with periodic testing/exercising.

Test prioritization matrix for multi‑discipline reference labs

Service agreements with users (including advising which activities are performed by referral labs), requires risk‑based patient care protections across workflows, and requires procedures for sample acceptance exceptions and result delay notifications. These requirements support an explicit, defensible prioritization matrix rather than ad‑hoc “who shouts loudest” prioritization.

Use a two‑layer design:

Layer A: Categories (policy)
– Critical to immediate clinical decision making (life/organ/limb).
– Time‑sensitive clinical (same day/next day).
– Routine clinical.
– Elective / research / “nice‑to‑have”.

Layer B: Scoring (operational) A suggested scoring model (adapt weights to your risk appetite):

  • Clinical criticality: 0–5
  • Stability window risk: 0–5 (higher score = more fragile)
  • SLA / penalty severity: 0–5
  • Referral options strength: 0–5 (higher score = fewer/weak options)
  • Public health / outbreak / transfusion dependency: 0–5
  • Chain-of-custody / trial compliance complexity: 0–5

Priority score = (2×Clinical) + (2×Stability) + SLA + (2×Referral weakness) + Public health + Compliance complexity.

This formula is a weighted prioritization matrix designed to rank health conditions, projects, or patients based on a combination of clinical severity, operational stability, and external risk factors. It heavily weights clinical severity and referral weaknesses, suggesting a focus on urgent, high-risk, and operationally vulnerable issues.

The table below is an exportable template with example test “clusters” relevant to your environment.

Test cluster (example grouping)Clinical criticality (0–5)Stability window (Short/Med/Long)SLA / penalty exposureReferral options (in-network / domestic / international)Compliance factors (trial/custody)Default MVL tierNotes / required comms
Transplant crossmatch / urgent HLA support5Short–MedHigh (clinical)Medium (specialty partners)HighTier 1Pre-agreed escalation; critical result process
Critical microbiology (blood cultures/CSF critical comms)5ShortHigh (patient safety)MediumMedTier 1Manual critical call logs if LIS down
Molecular urgent infectious disease pathway4–5ShortMedium–HighMediumMedTier 1Alternate method/site decision documented
Clinical trials central lab panels3–4VariableHigh (contractual)High (often international)HighTier 1–2Document deviations; preserve chain and data integrity
Histopath oncology priority cases4MedMediumMediumMedTier 2Remote diagnostic workflows must preserve traceability
Toxicology with forensic-style custody3–4MedMediumMediumVery highTier 2Chain-of-custody controls and restricted access
Neurochemistry routine panels2–3MedMediumStrongLowTier 2–3Batch and backlog management focus
Non-urgent genomics (non-immediate)2–3Long (often frozen DNA)Medium–HighStrong (international)HighTier 3Manage expectations; document delays by impact 

Minimum Viable Laboratory tiers and triggers

The laboratory requires to identify risks associated with emergency situations or conditions when activities are limited/unavailable and to maintain a coordinated continuity strategy, with periodic testing and exercising of capability.

A practical MVL model for multi‑site reference labs uses four tiers. Each tier defines what continues, what pauses, and what “must never be compromised” (identity, traceability, QC release rules, critical communications, confidentiality).

MVL tiers

Tier 0 — Life safety and specimen preservation (network stabilization)
Purpose: prevent patient harm caused by misidentification, stability loss, uncontrolled backlog, or unsafe operations; preserve irreplaceable specimens (tissues, transplant samples, trial samples).
Minimum functions: controlled receipt, traceability logs, cold chain integrity, safe storage, urgent escalation phone tree, courier exception routing, “hold and protect” approach.

Tier 1 — Critical clinical and transfusion‑linked services
Purpose: maintain the highest criticality services (transplant support, critical microbiology calls, urgent molecular, blood bank support interfaces as applicable).
Minimum functions: MVL test menu “critical only,” manual‑capable reporting and critical notifications, pre‑approved referral reroutes.

Tier 2 — High‑volume clinical operations at reduced capacity
Purpose: restore routine throughput with controlled backlogs; expand referral capacity; stabilize staffing and supply chain.
Minimum functions: extended hours if needed, backlog clearance plan, routine client updates, expanded send‑out lanes, formal exception reporting.

Tier 3 — Full service and recovery optimization
Purpose: full assay menu including elective/non‑urgent genomics and broader trial programs; transition from manual workarounds to normal operations; close CAPAs and update risk registers.

Clear activation triggers

Your triggers should be objective, pre‑approved, and aligned to BIA outputs. A defensible trigger set includes:

  • Information systems trigger: declare LIS/middleware downtime event when systems prevent traceable accessioning or results reporting; ISO 15189 requires planned downtime processes for such failures. 
  • Transport trigger: activate controlled receipt / reroute when transport delays threaten “time between collection and receipt” or required temperature intervals; ISO 15189 requires transport instructions addressing time and temperature and periodic evaluation of transport adequacy.
  • Patient safety trigger: halt examinations and withhold reports when there is risk of harm to patients (eg, uncontrolled QC, instrument out of limits, stability compromise without acceptable exception handling).
  • External provider trigger: activate mutual aid or alternate referral lab when primary provider cannot meet agreed requirements; ISO 15189 requires qualification, performance evaluation, and ensuring externally provided services meet requirements before use. 
  • Escalation trigger: notify users when results are delayed based on patient impact; this is an explicit ISO 15189 requirement and should be part of MVL tier transitions.

Eight continuity pillars with SOP elements, checklists, and fast‑track change control

The continuity planning requires coordinated strategies combining plans, procedures, and technical measures, plus personnel information/training and periodic exercising. The eight pillars below operationalize that requirement for reference labs.

Specimen logistics and stability preservation

This requires instructions for packaging, appropriate time between collection and receipt, maintaining temperature intervals, and periodic evaluation of transport adequacy; it also requires explicit handling of compromised clinically critical/irreplaceable samples with documented risk consideration and report annotation when accepted.

SOP elements (must exist before an incident): controlled receipt mode; stability‑based triage; cold chain excursion rules; “urgent specimen” labeling/routing; documentation for acceptance exceptions.

Activation checklist (first 60–120 minutes): confirm transport disruption scope; activate alternate courier routes; open controlled receipt; implement stability triage (test now / stabilize & store / redirect / reject); issue client notice for delays.

Alternate testing and multi‑site load balancing

This requires comparability procedures when different methods/equipment are used or testing occurs at different sites, with recording of acceptability, periodic review, and user notification of clinically significant differences. 

SOP elements: pre‑validated alternate methods; cross‑site routing maps; bridging/comparability protocol; interpretive consistency rules.

Minimum documentation: route decision record; comparability evidence; updated reference intervals/decision limits if affected; user notification when clinically significant differences exist.

LIS downtime and information continuity

This requires planned downtime processes for information systems failures that affect lab activities. WHO LQMS guidance details essential manual system controls: unique identifiers, standardized requisitions, logs and worksheets, checking processes, confidentiality protection, and data loss protection (backup for electronic, secure storage for paper). Accreditation requirements emphasize that procedures must remain available during LIS/network downtime, so downtime packs must include controlled copies of critical SOPs.

SOP elements: downtime declaration; manual accessioning and labeling controls; downtime worksheets; critical results call logs; result entry controls; reconciliation and amendment practice aligned to ISO 15189 record traceability.

Supply chain resilience

WHO inventory guidance frames uninterrupted availability of reagents and supplies as essential to laboratory operations and recommends establishing minimum stock for an appropriate time period, supported by documented forms/logs, receipt inspections, and expiry controls.

ISO 15189 includes specific expectations for reagent/consumable records (receipt condition, expiry, first use, out‑of‑service dates, acceptance records) and requires procedures to respond to manufacturer recall notices and investigate/report adverse incidents attributed to reagents/consumables.

SOP elements: critical item register; A/B/C criticality classification; dual sourcing; acceptance testing and lot release; recall response workflow; contingency pack‑outs for air referral.

People, competence, and fatigue management

Continuity fails when cross‑trained capability is absent. Accreditation requires competence requirements and authorization of personnel for specific lab activities; continuity plans must also take into account personnel needs and capabilities and require training.

SOP elements: minimum staffing model per MVL tier; cross‑training plan by discipline; on‑call rosters; fatigue controls during surge/outage; delegation-of-authority list.

Quality and patient safety controls under disruption

ISO 15189 requires risk‑based management of nonconforming work, including halting examinations and withholding reports when there is risk of harm to patients, and evaluating clinical significance/impact analysis.

SOP elements: “kill switches” (stop‑testing triggers); QC release rules unchanged; exception acceptance policy for irreplaceable samples; mandatory report comments for compromised samples.

Client communication and command structure

There must be a procedure to notify users when results are delayed, based on patient impact; WHO LQMS emphasizes effective and timely communication, including records for telephone urgent results.

SOP elements: incident command roster; single source of truth updates; standard client notices; escalation for blood bank/hospital critical clients; clinical trial deviation notifications; media/privacy controls when needed.

Compliance and ethics

ISO 15189’s general requirements include impartiality safeguards (not allowing commercial/financial pressures to compromise impartiality) and confidentiality controls for patient information; it also requires disclosure, where appropriate, of incidents that resulted or could have resulted in patient harm and records of mitigations. These requirements become stress‑tested during disruptions and must be wired into incident governance.

Change control form

ISO 15189 requires maintaining management system integrity when changes are planned/implemented and requires recorded decisions/actions on risks. This should include at minimum the following; 

FieldContent to capture (minimum)
Emergency change IDUnique identifier; date/time opened
Trigger / incident referenceWhat happened; MVL tier; affected sites
Change descriptionWhat will change (method, site, referral lab, reporting pathway, manual process)
Patient safety and clinical impactExplicit statement; “halt/withhold” criteria if triggered 
Risk assessment summaryHazards, likelihood, consequence, controls; residual risk; user communication need 
Validation / bridging requiredComparability plan (split samples), acceptance criteria, documentation 
Data integrity controlsHow traceability is preserved; RPO/RTO assumptions; record amendment plan 
ApprovalsIncident Commander, Quality/Risk Lead, Discipline Lead, Client Impact Lead
Communication planWho is informed (hospitals/blood bank/trials); timing; templates 
Effectiveness reviewMetrics to check; time window; CAPA if not effective 

Drills, metrics, and CAPA

Business continuity plans to be periodically tested, and response capability exercised where practicable. It also requires corrective action discipline: immediate control, cause analysis, corrective action, effectiveness review, and updating risks/opportunities.

Prioritized drill scenarios with objectives and success criteria

The scenarios below are prioritized for multi‑site referral networks supporting hospitals, blood banks, and trial programs, including international air referral.

ScenarioTypeCore objectiveSuccess criteria (examples)Primary evidence
LIS/middleware outage (multi‑site)FunctionalRun MVL Tier 0–1 with manual operations≥99.5% specimen traceability; zero critical mis-ID; reconciliation completed within defined windowDowntime logs; reconciliation report; amendment traceability
Ransomware / cyber isolationTabletop + functional (limited)Prove extended-duration continuity assumptionsMVL tier decisions within 60 min; controlled comms; procedure access maintainedCommand log; comms templates; procedure availability evidence 
International air-lane disruptionTabletopActivate alternate referral lanes and pack-outsReroute decisions documented; stability preserved; client notices issuedShipment manifests; temperature logs; client notices
Courier network failure + surge backlogFunctionalMaintain controlled receipt and stability triage≥95% samples triaged within X minutes; documented accept/reject exceptionsTriage log; rejection log; exception reports 
HVAC/cold room failureFunctionalPreserve specimen and reagent integrityTemperature excursion decision tree executed; quarantines documentedTemp records; disposition log; CAPA if needed 
Critical reagent stockoutTabletopDemonstrate A/B/C stock method + substitutionAlternate sourcing triggered; recall/incident response followedStock cards; recall response records 
Single-point instrument failure (specialty)FunctionalShift work across sites/referral labsComparability/bridging initiated; users advised if differences clinically significantComparability records; user notices 
Staff shortage (lab + courier)TabletopExecute staffing MVL modelCoverage meets Tier 1; fatigue controls documentedRosters; training/authorization evidence 
Chain-of-custody breach (tox/trial)TabletopPreserve defensibility and patient safetyImmediate containment; custody integrity decision documentedCustody incident report; secure storage proof 
Result correction surge post-incidentTabletopManage amendments safelyTraceable amendments: users notified; impact assessedAmendment log; revised report audit trail 

When nonconformities occur, the laboratory must control/correct, address consequences with focus on patient safety escalation, determine causes, assess recurrence risk, implement actions, evaluate effectiveness, update risks/opportunities, and change the management system if needed.

External dependencies, mutual aid

Mutual aid / send‑out agreement

Service agreements with users and requires that, where applicable, the user is advised of specific activities performed by referral laboratories and consultants; it also assigns responsibilities for ensuring results are delivered to the requester unless otherwise specified. It further requires qualification, selection, evaluation, and re‑evaluation of external providers and ensuring externally provided services conform to requirements before use.

A mutual aid / send‑out agreement for reference labs should include:

  • Scope: test menu covered, specimen types, volumes, exclusion criteria
  • Quality prerequisites: accreditation status, competence evidence, EQA/PT expectations, change notification rules
  • Pre‑examination requirements: packaging, time/temperature, accept/reject rules, urgent labels 
  • Critical results: escalation and turnaround for notifications 
  • Reporting: units, reference intervals, interpretive comments; requirement to identify examinations performed by the referral laboratory in the report, and manage amendments traceably 
  • Data exchange: secure transmission, retention, audit trails, confidentiality obligations
  • Continuity commitments: what happens if the partner fails (secondary partner, rerouting)
  • Liability and dispute resolution; turnaround penalties and force majeure logic
  • Chain-of-custody requirements where applicable (tox/trials)

Validation and bridging requirements when shifting sites or referral labs

When continuity triggers re-routing, the “method equivalence” question becomes both clinical and compliance critical. ISO 15189 requires comparability procedures when different methods/equipment are used or examinations are performed at different sites, including documenting acceptability and informing users of clinically significant differences. The documentations for a rerouted/referred test should include:

  • Method equivalence assessment: principle, specimen type, analytical range, interferences
  • Split-sample comparability study: clinically significant intervals; acceptance criteria; documentation and periodic review
  • Reference interval / decision limit impact assessment: update and user notification if affected
  • Report transparency: identification of referral lab performing the exam; clear disclaimers where residual risk exists
  • Exception governance: fast‑track change control record, including risk decisions and communications

Business continuity in a multi-site reference laboratory is not a separate “BCP folder” on a shared drive, it is a living system that protects patients, clinical decisions, transplant pathways, blood bank support, and clinical trial integrity when conditions are not normal.

The real measure of readiness is simple: Can we preserve identity, stability, traceability, and trusted reporting even when technology, logistics, or capacity fails? If the answer is yes, the laboratory has resilience. If not, continuity must be treated as a priority risk and quality agenda not an administrative requirement.

Continuity is built in advance, proven through drills, refined through CAPA, and sustained through culture. Every disruption is also a learning event an opportunity to strengthen the network, simplify workflows, and make the system safer than it was yesterday.