Critical infrastructure resilience KPIs by sector (with ranges)
Essential KPIs for Critical infrastructure resilience across sectors, with benchmark ranges from recent deployments and guidance on meaningful measurement versus vanity metrics.
Start here
Climate-related disruptions cost US infrastructure operators an estimated $150 billion annually, yet fewer than 20% of utilities and transport agencies use quantified resilience KPIs to guide capital planning. As extreme weather events intensify and interdependencies between energy, water, transportation, and communications systems deepen, the gap between organizations that measure resilience rigorously and those that treat it as a qualitative aspiration is widening. The KPIs an organization selects determine whether resilience investments produce measurable risk reduction or simply consume capital without accountability.
Why It Matters
Critical infrastructure resilience sits at the intersection of physical engineering, financial risk management, and regulatory compliance. FEMA's National Risk Index, the White House National Security Memorandum on Critical Infrastructure (NSM-22), and the EU's Critical Entities Resilience Directive (CER) all require operators to demonstrate measurable resilience capabilities. Utilities that lack quantified baselines cannot prioritize hardening investments across thousands of assets. Transport agencies without recovery-time benchmarks allocate emergency response budgets reactively rather than strategically. For insurers, the absence of standardized resilience metrics makes it difficult to price coverage accurately, creating adverse selection problems across portfolios.
The challenge is not whether to invest in resilience but how to measure the return on that investment. KPIs must reflect hazard exposure, asset vulnerability, system redundancy, recovery speed, and cascading failure potential. Without sector-specific benchmarks, infrastructure operators cannot compare their performance against peers, justify capital expenditures to regulators, or demonstrate progress to stakeholders over time.
Key Concepts
Resilience in the infrastructure context refers to the ability to anticipate, absorb, adapt to, and rapidly recover from disruptive events. Unlike reliability, which focuses on steady-state performance, resilience addresses low-probability, high-consequence scenarios including extreme weather, cyberattacks, and cascading failures across interdependent systems.
System Average Interruption Duration Index (SAIDI) measures the average outage duration per customer over a defined period, typically one year. SAIDI is the most widely used reliability metric for electric utilities and serves as a baseline for resilience benchmarking when filtered to major event days.
Recovery Time Objective (RTO) specifies the maximum acceptable time to restore a system or service after disruption. RTOs vary by asset criticality: hospitals and emergency services typically require RTOs under 4 hours, while secondary commercial loads may tolerate 24-72 hours.
Cascading failure analysis models how the failure of one infrastructure system propagates to dependent systems. Power grid failures, for instance, disable water pumping stations, telecommunications towers, and traffic management systems. Quantifying cascading failure pathways is essential for prioritizing cross-sector resilience investments.
KPI Benchmarks by Sector
| KPI | Sector | Low Range | Median | High Range | Unit |
|---|---|---|---|---|---|
| SAIDI (major event days) | Electric utilities | 200 | 480 | 1,200 | minutes/customer/year |
| SAIDI (excluding major events) | Electric utilities | 60 | 110 | 200 | minutes/customer/year |
| Recovery time (Category 3+ hurricane) | Electric grid | 48 | 168 | 504 | hours to 90% restoration |
| Recovery time (flood event) | Water/wastewater | 12 | 48 | 168 | hours to operational capacity |
| Asset hardening coverage | Electric utilities | 5% | 15% | 40% | % of exposed assets hardened |
| Backup power coverage | Telecommunications | 4 | 8 | 72 | hours of autonomous operation |
| Bridge structural deficiency rate | Transportation | 3% | 7.5% | 15% | % of bridge inventory |
| Pavement resilience score | Transportation | 55 | 68 | 85 | index (0-100) |
| Redundancy ratio (critical substations) | Electric grid | 1.0 | 1.3 | 2.0 | N-1 or N-2 capability |
| Cybersecurity incident response time | All sectors | 1 | 6 | 24 | hours to containment |
| Flood protection level | Water infrastructure | 25-year | 100-year | 500-year | return period design standard |
| Emergency generator capacity | Hospitals/critical facilities | 48 | 96 | 168 | hours of fuel autonomy |
| Mutual aid response activation | Electric utilities | 4 | 12 | 36 | hours to first external crew |
| Climate risk assessment coverage | All sectors | 10% | 35% | 75% | % of assets assessed |
What's Working
Utility storm hardening programs with measurable outcomes. Florida Power & Light (FPL) invested over $4 billion in grid hardening between 2006 and 2025, including undergrounding 40% of its most outage-prone feeders, installing concrete poles to replace wooden ones, and deploying automated switches across 90% of its distribution network. During Hurricane Ian in 2022, FPL restored power to 95% of affected customers within 4 days compared to 11 days for comparable storm impacts in 2004. The utility tracks restoration speed, customer minutes interrupted, and hardened-asset failure rates as primary KPIs, demonstrating 65% fewer sustained outages on hardened circuits versus non-hardened ones.
Cross-sector resilience scoring frameworks. The American Society of Civil Engineers (ASCE) Infrastructure Report Card and the National Institute of Standards and Technology (NIST) Community Resilience Planning Guide both provide standardized assessment frameworks. New York City's Climate Resiliency Design Guidelines require all city capital projects exceeding $10 million to complete a climate risk screening and apply resilience measures. Since 2017, over 1,400 city projects have been assessed, with 78% incorporating at least one design modification to address flood, heat, or wind risks. The city publishes annual metrics on project-level resilience adoption rates and tracks which hazard types drive the most frequent design changes.
Digital twin and sensor-based monitoring for predictive resilience. Con Edison deployed over 10,000 sensors across its underground cable network in New York City to detect thermal anomalies, partial discharges, and moisture intrusion before failures occur. The system has reduced unplanned outages in monitored zones by 30% since 2020. Pacific Gas and Electric (PG&E) uses LiDAR and satellite imagery to assess wildfire risk across 25,000 miles of distribution lines, prioritizing hardening and vegetation management based on quantified ignition probability scores. These predictive approaches shift resilience measurement from lagging indicators (outage duration) to leading indicators (risk probability reduction).
What's Not Working
Inconsistent definitions of "major event" distort benchmarking. IEEE Standard 1366 provides a methodology for classifying major event days, but utilities apply it differently. Some utilities exclude all major events from their reported SAIDI, making year-over-year comparisons and peer benchmarking unreliable. A utility reporting 100 minutes of SAIDI excluding major events may actually experience 600+ minutes when storms are included. Regulators in several states have begun requiring dual reporting (with and without major events), but adoption is uneven. Without consistent event classification, investors and regulators cannot distinguish genuinely resilient utilities from those that benefit from favorable reporting conventions.
Underinvestment in interdependency modeling. Most infrastructure operators measure resilience within their own systems but fail to account for cascading dependencies. A 2024 Department of Energy study found that 72% of prolonged power outages triggered secondary failures in water, communications, or transportation systems, yet fewer than 10% of utility resilience plans explicitly model these cross-sector impacts. The 2021 Texas winter storm demonstrated the consequences: natural gas supply failures caused power plant shutdowns, which disabled water treatment facilities, creating a compounding crisis across three infrastructure sectors simultaneously. Current KPI frameworks rarely capture these feedback loops, leaving organizations blind to their most consequential vulnerabilities.
Climate risk assessments remain static snapshots. Many infrastructure operators conduct climate vulnerability assessments once and treat the results as fixed inputs for capital planning. However, climate projections evolve as models improve, exposure changes as development patterns shift, and asset conditions degrade over time. FEMA's National Risk Index is updated on a roughly biennial cycle, but most local assessments are conducted on 5-10 year intervals. This creates a mismatch between the dynamic nature of climate risk and the static nature of resilience planning. Organizations that treat resilience KPIs as periodic audit outputs rather than continuously updated metrics systematically underestimate their evolving exposure.
Key Players
Established Leaders
- Florida Power & Light (FPL): Largest electric utility in Florida with over 5.8 million customer accounts. Operates one of the most extensive storm hardening programs in the US, with documented performance improvements across multiple hurricane seasons.
- Con Edison: Serves 3.4 million electric customers in New York City and Westchester County. Pioneered underground network resilience monitoring and invested $1 billion in climate adaptation after Superstorm Sandy.
- US Army Corps of Engineers: Federal agency responsible for flood risk management infrastructure across the US. Manages 715 dams, 14,700 miles of levees, and 25,000 miles of navigable waterways with quantified risk assessments.
- National Institute of Standards and Technology (NIST): Developed the Community Resilience Planning Guide and the Cybersecurity Framework, both widely adopted by infrastructure operators for resilience measurement.
Emerging Startups
- Urbint: AI platform predicting infrastructure failures for utilities, achieving 60-80% accuracy in identifying high-risk assets before failures occur. Used by over 50 utilities across North America.
- ClimateAI: Provides climate-adjusted risk analytics for infrastructure and supply chains. Platform models physical climate risks at asset-level resolution across multiple scenarios and time horizons.
- Rhombus Power: AI-driven grid resilience platform detecting anomalies and predicting equipment failures using sensor data and machine learning.
- One Concern: Digital twin platform modeling cascading infrastructure failures across interdependent systems. Used by municipalities and utilities for scenario-based resilience planning.
Key Investors and Funders
- Department of Energy Grid Resilience and Innovation Partnerships (GRIP): $10.5 billion federal program funding grid hardening and resilience projects across the US.
- FEMA Building Resilient Infrastructure and Communities (BRIC): Federal grant program providing $1 billion annually for pre-disaster mitigation projects targeting critical infrastructure.
- Breakthrough Energy Ventures: Investing in grid modernization and resilience technologies including advanced sensors, energy storage, and predictive analytics platforms.
Action Checklist
- Establish dual SAIDI reporting (with and without major events) to create an honest baseline of current resilience performance.
- Define recovery time objectives for each asset class by criticality tier, using the 4/24/72-hour framework aligned with emergency management standards.
- Conduct a climate vulnerability assessment covering at least 75% of critical assets, using downscaled climate projections for the 2040 and 2060 time horizons.
- Map interdependencies with adjacent infrastructure systems (power-water-communications-transport) and model at least three cascading failure scenarios annually.
- Deploy condition monitoring sensors on the top 20% highest-risk assets to shift from reactive to predictive resilience measurement.
- Benchmark asset hardening coverage against peer utilities and set annual targets for incremental hardening of exposed assets.
- Update climate risk assessments on a rolling 2-3 year cycle rather than treating them as one-time exercises.
FAQ
What is a good SAIDI target for a resilient electric utility? Leading utilities target SAIDI below 100 minutes excluding major events and below 400 minutes including major events. Top-quartile performers like Singapore Power and Tokyo Electric achieve SAIDI below 30 minutes excluding major events. In the US, the median is approximately 110 minutes excluding major events, but this varies significantly by geography and weather exposure. Utilities in hurricane-prone regions should benchmark against peers with similar hazard profiles rather than national averages.
How do I prioritize which assets to harden first? Use a risk-based framework combining hazard exposure (probability of disruption), asset vulnerability (likelihood of failure given the hazard), and consequence severity (number of customers affected, cascading impacts, public safety implications). Assets scoring in the top decile across all three dimensions should be prioritized. Florida Power & Light's approach of targeting the most outage-prone feeders first delivered the highest return on hardening investment, reducing sustained outages by 65% on treated circuits.
What is the difference between reliability and resilience metrics? Reliability metrics (SAIDI, SAIFI, CAIDI) measure steady-state performance under normal operating conditions and routine disturbances. Resilience metrics measure performance during and recovery from high-impact, low-frequency events such as hurricanes, ice storms, cyberattacks, or cascading failures. A utility can have excellent reliability scores while remaining highly vulnerable to major events. Comprehensive performance measurement requires both reliability and resilience KPIs tracked separately.
How much does a comprehensive resilience assessment cost? For a mid-sized utility or infrastructure operator, a system-wide climate vulnerability assessment typically costs $200,000-$800,000 depending on the number of assets, hazards evaluated, and modeling sophistication. Asset-level condition assessments using sensors and inspections add $500-$5,000 per asset. Cross-sector interdependency modeling runs $100,000-$500,000 per study. These costs are modest relative to the capital expenditures they inform, which typically range from tens of millions to billions of dollars.
Sources
- Federal Emergency Management Agency. "National Risk Index: Technical Documentation." FEMA, 2024.
- IEEE. "Standard 1366: Guide for Electric Power Distribution Reliability Indices." IEEE, 2022.
- National Institute of Standards and Technology. "Community Resilience Planning Guide, Volume 2." NIST Special Publication 1190, 2024.
- US Department of Energy. "Grid Resilience and Innovation Partnerships Program: 2024 Selections Report." DOE, 2024.
- American Society of Civil Engineers. "2025 Infrastructure Report Card." ASCE, 2025.
- New York City Mayor's Office of Climate and Environmental Justice. "Climate Resiliency Design Guidelines, Version 4.1." NYC, 2024.
- Florida Power & Light. "Storm Protection Plan: 2024 Annual Report." FPL, 2024.
Stay in the loop
Get monthly sustainability insights — no spam, just signal.
We respect your privacy. Unsubscribe anytime. Privacy Policy
Explore more
View all in Critical infrastructure resilience →Deep dive: Critical infrastructure resilience — the fastest-moving subsegments to watch
An in-depth analysis of the most dynamic subsegments within Critical infrastructure resilience, tracking where momentum is building, capital is flowing, and breakthroughs are emerging.
Read →Deep DiveDeep dive: Critical infrastructure resilience — what's working, what's not, and what's next
What's working, what isn't, and what's next, with the trade-offs made explicit. Focus on KPIs that matter, benchmark ranges, and what 'good' looks like in practice.
Read →ExplainerExplainer: Critical infrastructure resilience — a practical primer for teams that need to ship
A practical primer: key concepts, the decision checklist, and the core economics. Focus on KPIs that matter, benchmark ranges, and what 'good' looks like in practice.
Read →InterviewInterview: The builder's playbook for Critical infrastructure resilience — hard-earned lessons
A practitioner conversation: what surprised them, what failed, and what they'd do differently. Focus on data quality, standards alignment, and how to avoid measurement theater.
Read →ArticleMyths vs. realities: Critical infrastructure resilience — what the evidence actually supports
Side-by-side analysis of common myths versus evidence-backed realities in Critical infrastructure resilience, helping practitioners distinguish credible claims from marketing noise.
Read →ArticleTrend watch: Critical infrastructure resilience in 2026 — signals, winners, and red flags
A forward-looking assessment of Critical infrastructure resilience trends in 2026, identifying the signals that matter, emerging winners, and red flags that practitioners should monitor.
Read →