Adaptation & Resilience·11 min read··...

Critical infrastructure resilience KPIs by sector (with ranges)

Essential KPIs for Critical infrastructure resilience across sectors, with benchmark ranges from recent deployments and guidance on meaningful measurement versus vanity metrics.

Climate-related disruptions cost US infrastructure operators an estimated $150 billion annually, yet fewer than 20% of utilities and transport agencies use quantified resilience KPIs to guide capital planning. As extreme weather events intensify and interdependencies between energy, water, transportation, and communications systems deepen, the gap between organizations that measure resilience rigorously and those that treat it as a qualitative aspiration is widening. The KPIs an organization selects determine whether resilience investments produce measurable risk reduction or simply consume capital without accountability.

Why It Matters

Critical infrastructure resilience sits at the intersection of physical engineering, financial risk management, and regulatory compliance. FEMA's National Risk Index, the White House National Security Memorandum on Critical Infrastructure (NSM-22), and the EU's Critical Entities Resilience Directive (CER) all require operators to demonstrate measurable resilience capabilities. Utilities that lack quantified baselines cannot prioritize hardening investments across thousands of assets. Transport agencies without recovery-time benchmarks allocate emergency response budgets reactively rather than strategically. For insurers, the absence of standardized resilience metrics makes it difficult to price coverage accurately, creating adverse selection problems across portfolios.

The challenge is not whether to invest in resilience but how to measure the return on that investment. KPIs must reflect hazard exposure, asset vulnerability, system redundancy, recovery speed, and cascading failure potential. Without sector-specific benchmarks, infrastructure operators cannot compare their performance against peers, justify capital expenditures to regulators, or demonstrate progress to stakeholders over time.

Key Concepts

Resilience in the infrastructure context refers to the ability to anticipate, absorb, adapt to, and rapidly recover from disruptive events. Unlike reliability, which focuses on steady-state performance, resilience addresses low-probability, high-consequence scenarios including extreme weather, cyberattacks, and cascading failures across interdependent systems.

System Average Interruption Duration Index (SAIDI) measures the average outage duration per customer over a defined period, typically one year. SAIDI is the most widely used reliability metric for electric utilities and serves as a baseline for resilience benchmarking when filtered to major event days.

Recovery Time Objective (RTO) specifies the maximum acceptable time to restore a system or service after disruption. RTOs vary by asset criticality: hospitals and emergency services typically require RTOs under 4 hours, while secondary commercial loads may tolerate 24-72 hours.

Cascading failure analysis models how the failure of one infrastructure system propagates to dependent systems. Power grid failures, for instance, disable water pumping stations, telecommunications towers, and traffic management systems. Quantifying cascading failure pathways is essential for prioritizing cross-sector resilience investments.

KPI Benchmarks by Sector

KPISectorLow RangeMedianHigh RangeUnit
SAIDI (major event days)Electric utilities2004801,200minutes/customer/year
SAIDI (excluding major events)Electric utilities60110200minutes/customer/year
Recovery time (Category 3+ hurricane)Electric grid48168504hours to 90% restoration
Recovery time (flood event)Water/wastewater1248168hours to operational capacity
Asset hardening coverageElectric utilities5%15%40%% of exposed assets hardened
Backup power coverageTelecommunications4872hours of autonomous operation
Bridge structural deficiency rateTransportation3%7.5%15%% of bridge inventory
Pavement resilience scoreTransportation556885index (0-100)
Redundancy ratio (critical substations)Electric grid1.01.32.0N-1 or N-2 capability
Cybersecurity incident response timeAll sectors1624hours to containment
Flood protection levelWater infrastructure25-year100-year500-yearreturn period design standard
Emergency generator capacityHospitals/critical facilities4896168hours of fuel autonomy
Mutual aid response activationElectric utilities41236hours to first external crew
Climate risk assessment coverageAll sectors10%35%75%% of assets assessed

What's Working

Utility storm hardening programs with measurable outcomes. Florida Power & Light (FPL) invested over $4 billion in grid hardening between 2006 and 2025, including undergrounding 40% of its most outage-prone feeders, installing concrete poles to replace wooden ones, and deploying automated switches across 90% of its distribution network. During Hurricane Ian in 2022, FPL restored power to 95% of affected customers within 4 days compared to 11 days for comparable storm impacts in 2004. The utility tracks restoration speed, customer minutes interrupted, and hardened-asset failure rates as primary KPIs, demonstrating 65% fewer sustained outages on hardened circuits versus non-hardened ones.

Cross-sector resilience scoring frameworks. The American Society of Civil Engineers (ASCE) Infrastructure Report Card and the National Institute of Standards and Technology (NIST) Community Resilience Planning Guide both provide standardized assessment frameworks. New York City's Climate Resiliency Design Guidelines require all city capital projects exceeding $10 million to complete a climate risk screening and apply resilience measures. Since 2017, over 1,400 city projects have been assessed, with 78% incorporating at least one design modification to address flood, heat, or wind risks. The city publishes annual metrics on project-level resilience adoption rates and tracks which hazard types drive the most frequent design changes.

Digital twin and sensor-based monitoring for predictive resilience. Con Edison deployed over 10,000 sensors across its underground cable network in New York City to detect thermal anomalies, partial discharges, and moisture intrusion before failures occur. The system has reduced unplanned outages in monitored zones by 30% since 2020. Pacific Gas and Electric (PG&E) uses LiDAR and satellite imagery to assess wildfire risk across 25,000 miles of distribution lines, prioritizing hardening and vegetation management based on quantified ignition probability scores. These predictive approaches shift resilience measurement from lagging indicators (outage duration) to leading indicators (risk probability reduction).

What's Not Working

Inconsistent definitions of "major event" distort benchmarking. IEEE Standard 1366 provides a methodology for classifying major event days, but utilities apply it differently. Some utilities exclude all major events from their reported SAIDI, making year-over-year comparisons and peer benchmarking unreliable. A utility reporting 100 minutes of SAIDI excluding major events may actually experience 600+ minutes when storms are included. Regulators in several states have begun requiring dual reporting (with and without major events), but adoption is uneven. Without consistent event classification, investors and regulators cannot distinguish genuinely resilient utilities from those that benefit from favorable reporting conventions.

Underinvestment in interdependency modeling. Most infrastructure operators measure resilience within their own systems but fail to account for cascading dependencies. A 2024 Department of Energy study found that 72% of prolonged power outages triggered secondary failures in water, communications, or transportation systems, yet fewer than 10% of utility resilience plans explicitly model these cross-sector impacts. The 2021 Texas winter storm demonstrated the consequences: natural gas supply failures caused power plant shutdowns, which disabled water treatment facilities, creating a compounding crisis across three infrastructure sectors simultaneously. Current KPI frameworks rarely capture these feedback loops, leaving organizations blind to their most consequential vulnerabilities.

Climate risk assessments remain static snapshots. Many infrastructure operators conduct climate vulnerability assessments once and treat the results as fixed inputs for capital planning. However, climate projections evolve as models improve, exposure changes as development patterns shift, and asset conditions degrade over time. FEMA's National Risk Index is updated on a roughly biennial cycle, but most local assessments are conducted on 5-10 year intervals. This creates a mismatch between the dynamic nature of climate risk and the static nature of resilience planning. Organizations that treat resilience KPIs as periodic audit outputs rather than continuously updated metrics systematically underestimate their evolving exposure.

Key Players

Established Leaders

  • Florida Power & Light (FPL): Largest electric utility in Florida with over 5.8 million customer accounts. Operates one of the most extensive storm hardening programs in the US, with documented performance improvements across multiple hurricane seasons.
  • Con Edison: Serves 3.4 million electric customers in New York City and Westchester County. Pioneered underground network resilience monitoring and invested $1 billion in climate adaptation after Superstorm Sandy.
  • US Army Corps of Engineers: Federal agency responsible for flood risk management infrastructure across the US. Manages 715 dams, 14,700 miles of levees, and 25,000 miles of navigable waterways with quantified risk assessments.
  • National Institute of Standards and Technology (NIST): Developed the Community Resilience Planning Guide and the Cybersecurity Framework, both widely adopted by infrastructure operators for resilience measurement.

Emerging Startups

  • Urbint: AI platform predicting infrastructure failures for utilities, achieving 60-80% accuracy in identifying high-risk assets before failures occur. Used by over 50 utilities across North America.
  • ClimateAI: Provides climate-adjusted risk analytics for infrastructure and supply chains. Platform models physical climate risks at asset-level resolution across multiple scenarios and time horizons.
  • Rhombus Power: AI-driven grid resilience platform detecting anomalies and predicting equipment failures using sensor data and machine learning.
  • One Concern: Digital twin platform modeling cascading infrastructure failures across interdependent systems. Used by municipalities and utilities for scenario-based resilience planning.

Key Investors and Funders

  • Department of Energy Grid Resilience and Innovation Partnerships (GRIP): $10.5 billion federal program funding grid hardening and resilience projects across the US.
  • FEMA Building Resilient Infrastructure and Communities (BRIC): Federal grant program providing $1 billion annually for pre-disaster mitigation projects targeting critical infrastructure.
  • Breakthrough Energy Ventures: Investing in grid modernization and resilience technologies including advanced sensors, energy storage, and predictive analytics platforms.

Action Checklist

  1. Establish dual SAIDI reporting (with and without major events) to create an honest baseline of current resilience performance.
  2. Define recovery time objectives for each asset class by criticality tier, using the 4/24/72-hour framework aligned with emergency management standards.
  3. Conduct a climate vulnerability assessment covering at least 75% of critical assets, using downscaled climate projections for the 2040 and 2060 time horizons.
  4. Map interdependencies with adjacent infrastructure systems (power-water-communications-transport) and model at least three cascading failure scenarios annually.
  5. Deploy condition monitoring sensors on the top 20% highest-risk assets to shift from reactive to predictive resilience measurement.
  6. Benchmark asset hardening coverage against peer utilities and set annual targets for incremental hardening of exposed assets.
  7. Update climate risk assessments on a rolling 2-3 year cycle rather than treating them as one-time exercises.

FAQ

What is a good SAIDI target for a resilient electric utility? Leading utilities target SAIDI below 100 minutes excluding major events and below 400 minutes including major events. Top-quartile performers like Singapore Power and Tokyo Electric achieve SAIDI below 30 minutes excluding major events. In the US, the median is approximately 110 minutes excluding major events, but this varies significantly by geography and weather exposure. Utilities in hurricane-prone regions should benchmark against peers with similar hazard profiles rather than national averages.

How do I prioritize which assets to harden first? Use a risk-based framework combining hazard exposure (probability of disruption), asset vulnerability (likelihood of failure given the hazard), and consequence severity (number of customers affected, cascading impacts, public safety implications). Assets scoring in the top decile across all three dimensions should be prioritized. Florida Power & Light's approach of targeting the most outage-prone feeders first delivered the highest return on hardening investment, reducing sustained outages by 65% on treated circuits.

What is the difference between reliability and resilience metrics? Reliability metrics (SAIDI, SAIFI, CAIDI) measure steady-state performance under normal operating conditions and routine disturbances. Resilience metrics measure performance during and recovery from high-impact, low-frequency events such as hurricanes, ice storms, cyberattacks, or cascading failures. A utility can have excellent reliability scores while remaining highly vulnerable to major events. Comprehensive performance measurement requires both reliability and resilience KPIs tracked separately.

How much does a comprehensive resilience assessment cost? For a mid-sized utility or infrastructure operator, a system-wide climate vulnerability assessment typically costs $200,000-$800,000 depending on the number of assets, hazards evaluated, and modeling sophistication. Asset-level condition assessments using sensors and inspections add $500-$5,000 per asset. Cross-sector interdependency modeling runs $100,000-$500,000 per study. These costs are modest relative to the capital expenditures they inform, which typically range from tens of millions to billions of dollars.

Sources

  1. Federal Emergency Management Agency. "National Risk Index: Technical Documentation." FEMA, 2024.
  2. IEEE. "Standard 1366: Guide for Electric Power Distribution Reliability Indices." IEEE, 2022.
  3. National Institute of Standards and Technology. "Community Resilience Planning Guide, Volume 2." NIST Special Publication 1190, 2024.
  4. US Department of Energy. "Grid Resilience and Innovation Partnerships Program: 2024 Selections Report." DOE, 2024.
  5. American Society of Civil Engineers. "2025 Infrastructure Report Card." ASCE, 2025.
  6. New York City Mayor's Office of Climate and Environmental Justice. "Climate Resiliency Design Guidelines, Version 4.1." NYC, 2024.
  7. Florida Power & Light. "Storm Protection Plan: 2024 Annual Report." FPL, 2024.

Stay in the loop

Get monthly sustainability insights — no spam, just signal.

We respect your privacy. Unsubscribe anytime. Privacy Policy

Deep Dive

Deep dive: Critical infrastructure resilience — the fastest-moving subsegments to watch

An in-depth analysis of the most dynamic subsegments within Critical infrastructure resilience, tracking where momentum is building, capital is flowing, and breakthroughs are emerging.

Read →
Deep Dive

Deep dive: Critical infrastructure resilience — what's working, what's not, and what's next

What's working, what isn't, and what's next, with the trade-offs made explicit. Focus on KPIs that matter, benchmark ranges, and what 'good' looks like in practice.

Read →
Explainer

Explainer: Critical infrastructure resilience — a practical primer for teams that need to ship

A practical primer: key concepts, the decision checklist, and the core economics. Focus on KPIs that matter, benchmark ranges, and what 'good' looks like in practice.

Read →
Interview

Interview: The builder's playbook for Critical infrastructure resilience — hard-earned lessons

A practitioner conversation: what surprised them, what failed, and what they'd do differently. Focus on data quality, standards alignment, and how to avoid measurement theater.

Read →
Article

Myths vs. realities: Critical infrastructure resilience — what the evidence actually supports

Side-by-side analysis of common myths versus evidence-backed realities in Critical infrastructure resilience, helping practitioners distinguish credible claims from marketing noise.

Read →
Article

Trend watch: Critical infrastructure resilience in 2026 — signals, winners, and red flags

A forward-looking assessment of Critical infrastructure resilience trends in 2026, identifying the signals that matter, emerging winners, and red flags that practitioners should monitor.

Read →