Skip to content

Industry — Energy & Utilities

Scope: Power generation, transmission, distribution, oil & gas, water utilities, renewables. Critical infrastructure, regulated monopolies in many regions, heavy IoT presence, safety-critical OT environments.

Top scenarios

Scenario Pattern Latency Reference
Smart-grid telemetry Meter data + Eventhouse + dbt aggregations seconds Tutorial 05 — Streaming Lambda, Industries — Manufacturing
Asset performance management Sensor + ML + work-order integration minutes Example — IoT Streaming
Renewables forecasting (wind / solar) Weather + asset state + ML hours Example — ML Lifecycle
Outage prediction + restoration Real-time sensor + crew dispatch + customer comms minutes Use Case — Anomaly Detection
Demand response Real-time price signal + customer device control seconds Custom — see Energy patterns
Pipeline / leak detection (oil & gas) Acoustic + pressure + ML + alerting sub-second Industries — Manufacturing (similar OT/IT pattern)
Field worker GenAI (manuals, schematics) RAG + AI Search + mobile/offline seconds Tutorial 08 — RAG
Customer billing analytics Meter data + dbt + Power BI for utility customers daily Tutorial 11 — Data API Builder
Carbon accounting / ESG Multi-source emission data + reporting quarterly Tutorial 02 — Data Governance

Regulatory landscape

Framework Relevance
NERC CIP (North American electric) Mandatory for bulk-electric system; affects OT cyber, asset inventory, change mgmt
ISO 27019 (energy industry cyber) Sector adaptation of ISO 27002
NIS2 (EU critical sectors, 2024) Operational resilience + incident reporting for "essential entities"
TSA Pipeline Security Directive (US oil/gas) Cyber requirements post-Colonial Pipeline
GDPR (EU residential customer data) Compliance — GDPR
C2M2 (DOE cyber maturity) Voluntary but widely used self-assessment
State PUC reporting Per-state customer data + reliability reporting

Reference architecture variations

Smart grid + AMI ingest

flowchart LR
    subgraph Field[Field - OT/Edge]
        Meter[Smart meters<br/>millions of devices]
        Substation[Substation<br/>SCADA + RTUs]
        Sensor[Distribution sensors<br/>fault indicators]
    end

    subgraph Headend[AMI Head-End / SCADA - DMZ]
        AMI[AMI head-end system]
        SCADA[SCADA historian]
    end

    subgraph Cloud[Azure - IT]
        EH[Event Hubs<br/>+ Capture]
        ADX[Azure Data Explorer<br/>or Fabric Eventhouse]
        ADLS[(ADLS Delta)]
        AML[Azure ML]
        Purv[Purview]
    end

    Meter --> AMI
    Substation --> SCADA
    Sensor --> SCADA
    AMI --> EH
    SCADA --> EH
    EH --> ADX
    EH --> ADLS
    ADX --> AML
    ADLS -.scan.-> Purv

    style Field fill:#ffe4cc
    style Headend fill:#fff4cc
    style Cloud fill:#cce4ff

Same OT/IT separation principles as Manufacturing: one-way data flow, no cloud-to-PLC writes without functional safety review, NERC CIP scope explicitly bounded.

Why the standard CSA-in-a-Box pattern works for energy

  • Medallion + dbt = reproducible regulator reports (state PUCs, FERC, DOE)
  • Event Hubs + Fabric Eventhouse / ADX = purpose-built time-series for AMI (billions of meter reads/day)
  • Azure ML + MLflow = forecasting model lifecycle with versioning regulators care about
  • Purview + classifications = customer PII protection for residential billing data
  • Defender for IoT = OT cyber visibility (NERC CIP-007/008/010 evidence)

What's specific to energy

  • Time-series cardinality is extreme. Millions of meters × 15-minute reads + sub-second SCADA tags. Time-series database is not optional.
  • Regulator data residency matters more than in most industries. State PUCs may require customer data stay in-state. Plan region selection accordingly.
  • NERC CIP scope is sticky. Once a system is in CIP scope, getting it OUT requires demonstrating it has no impact on the BES. Design with explicit boundaries.
  • Renewables forecasting is the biggest ML opportunity. Weather + asset state + market price + battery state = high-value optimization. Model latency requirement is hours, not seconds.
  • Field worker mobile is underserved. Field workers spend hours looking up schematics, work orders, manuals. RAG over the asset corpus + offline mobile sync is high-impact.
  • Demand response is becoming real-time at scale (DERMS, VPPs). The platform pattern is similar to anomaly detection but with control feedback — extra rigor on safety + auditability.

Getting started

  1. Read Reference Architecture — Hub-Spoke and Data Flow
  2. Walk Tutorial 05 — Streaming Lambda end-to-end
  3. Adapt Example — IoT Streaming to your meter / SCADA tag inventory
  4. Add Fabric Eventhouse or Azure Data Explorer for time-series — see Patterns — Streaming & CDC
  5. If you're NERC-regulated: read Compliance — NIST 800-53 r5 (CIP maps closely) and engage your CIP compliance team before any cloud migration
  6. Pilot one forecasting model (renewables generation is a great starter) using Example -- ML Lifecycle as the template

Smart grid analytics pipeline

The AMI ingest diagram above shows how data enters the platform. The following diagram focuses on the analytics pipeline downstream -- how meter and sensor data becomes operational intelligence for grid management.

flowchart TB
    subgraph DataSources[Data in Lakehouse]
        AMIData[(AMI Meter Reads<br/>15-min intervals)]
        SCADAData[(SCADA Telemetry<br/>sub-second)]
        WeatherData[(Weather Forecasts<br/>hourly)]
        AssetData[(Asset Registry<br/>GIS + age + condition)]
    end

    subgraph AnalyticsEng[Analytics Engineering - dbt]
        LoadProfiles[Load Profiles<br/>customer segmentation]
        VoltageAnalytics[Voltage Analytics<br/>ANSI C84.1 compliance]
        OutageMetrics[SAIDI / SAIFI / CAIDI<br/>reliability indices]
        RevenueProtect[Revenue Protection<br/>theft detection features]
    end

    subgraph MLModels[ML Models - Azure ML]
        LoadForecast[Load Forecasting<br/>day-ahead + hour-ahead]
        OutagePred[Outage Prediction<br/>weather + asset risk]
        AssetHealth[Asset Health Scoring<br/>transformer + feeder]
        RenewableFcst[Renewables Forecast<br/>solar + wind generation]
    end

    subgraph Operations[Grid Operations]
        DMS[Distribution Mgmt<br/>System - DMS]
        OMS[Outage Mgmt<br/>System - OMS]
        DERMS[DERMS<br/>DER orchestration]
        PBI4[Power BI<br/>reliability + exec dashboards]
    end

    AMIData --> LoadProfiles
    AMIData --> VoltageAnalytics
    SCADAData --> OutageMetrics
    AMIData --> RevenueProtect
    LoadProfiles --> LoadForecast
    WeatherData --> LoadForecast
    WeatherData --> OutagePred
    AssetData --> OutagePred
    AssetData --> AssetHealth
    WeatherData --> RenewableFcst
    LoadForecast --> DMS
    OutagePred --> OMS
    AssetHealth --> PBI4
    RenewableFcst --> DERMS
    OutageMetrics --> PBI4

    style DataSources fill:#cce4ff
    style AnalyticsEng fill:#fff4cc
    style MLModels fill:#e4ccff
    style Operations fill:#ccffe4

Asset management

Transformer health scoring

Power transformers are the most expensive and hardest-to-replace assets on the grid. A health scoring model combines multiple data sources into a single index that drives capital planning and maintenance prioritization.

Data source Key features Refresh
Dissolved gas analysis (DGA) H2, CH4, C2H2, C2H4 concentrations; Duval triangle classification Monthly lab results
Oil quality Dielectric strength, moisture, acidity, interfacial tension Quarterly lab results
Loading history Peak load as % of nameplate, cumulative overload hours Daily from AMI/SCADA
Age and design Manufacture year, insulation class, cooling type, OEM Static from asset registry
Maintenance history Past repairs, bushing replacements, tap changer operations Event-driven from work-order system
Environmental Ambient temperature extremes, coastal salt exposure, flood zone GIS + weather

Compute the health index as a weighted composite or train a gradient-boosted model on historical failure data. Output a 1-100 score per transformer. Transformers scoring below threshold enter the capital replacement queue; those in the caution zone get accelerated inspection schedules.

Vegetation management

Vegetation contact is the leading cause of distribution outages. Analytics improves targeting of tree-trimming crews:

  • LiDAR + satellite imagery — identify encroachment zones where vegetation clearance is below minimum
  • Historical outage correlation — overlay vegetation-caused outage locations with circuit GIS
  • Growth-rate modeling — predict which circuits will reach critical clearance before the next trim cycle
  • Priority scoring — rank circuits by (failure probability x customer impact x critical facility exposure)

Store vegetation risk scores in the gold layer, join with the GIS circuit model, and surface in Power BI maps for vegetation management planners.

Outage prediction

Outage prediction models estimate the probability and severity of outages given weather forecasts and asset condition:

  1. Feature engineering — combine weather forecast (wind speed, ice accumulation, lightning density), asset health scores, vegetation risk, and historical outage rates per circuit
  2. Model — gradient-boosted classifier (LightGBM) predicting outage probability per circuit per 6-hour window
  3. Severity estimation — regression model predicting customers affected and estimated restoration time
  4. Operational use — pre-position crews, stage materials, prepare customer communications before the storm hits

Tip

Outage prediction models trained on your utility's historical data significantly outperform generic weather-severity models. Even two years of outage history correlated with weather gives a useful model. Retrain quarterly as asset condition changes.

NERC CIP compliance mapping

The North American Electric Reliability Corporation Critical Infrastructure Protection (NERC CIP) standards are mandatory for bulk electric system (BES) operators. The table below maps CIP standards to CSA-in-a-Box controls.

NERC CIP Standard Requirement CSA-in-a-Box Control
CIP-003 Security management controls Best Practices -- Data Governance, policy-as-code in IaC
CIP-004 Personnel and training Entra ID + PIM for access provisioning; access reviews via Identity Governance
CIP-005 Electronic security perimeters Hub-spoke VNet topology with NSGs, Azure Firewall, Private Endpoints; see Hub-Spoke
CIP-006 Physical security Out of scope for cloud platform (Azure datacenter physical security is covered by Azure compliance certifications)
CIP-007 System security management Defender for Cloud secure-score policies, patch management, endpoint protection
CIP-008 Incident reporting and response Sentinel SIEM + playbooks + runbooks
CIP-009 Recovery plans DR plan, backup/restore procedures, tested annually
CIP-010 Configuration change management IaC (Bicep/Terraform) + GitHub PR workflows + deployment gates
CIP-011 Information protection Purview sensitivity labels + classification, encryption at rest (platform-managed or CMK)
CIP-013 Supply chain risk management Azure vendor risk documentation, third-party component inventory, SBOM for custom code

Warning

NERC CIP scope determination (BES Cyber Systems and associated assets) should be done by your CIP compliance team, not your cloud architects. The analytics platform typically falls into "medium impact" or "low impact" BES Cyber Systems depending on how it connects to operational systems. Scope classification drives which CIP requirements apply. Get this determination in writing before designing the architecture.

Renewable integration

Solar and wind forecasting

Renewables forecasting is the highest-value ML use case for utilities with generation assets or DER programs. The pipeline:

  1. Weather data — ingest numerical weather prediction (NWP) from NOAA GFS/HRRR models (free) or commercial providers (more granular)
  2. Asset data — panel orientation, tilt, capacity, inverter specs (solar); hub height, rotor diameter, power curve (wind)
  3. Historical generation — actual generation vs forecast for model training
  4. Features — cloud cover, GHI (Global Horizontal Irradiance), wind speed at hub height, temperature, humidity, time-of-day
  5. Model — gradient-boosted trees for day-ahead; LSTM or transformer for intra-day; physical models (PVLib for solar) as baselines

Forecast horizons and uses:

Horizon Resolution Primary use
15-minute ahead 5-min Real-time dispatch, AGC
Hour-ahead 15-min Intra-day market trading
Day-ahead Hourly Day-ahead market bidding, unit commitment
Week-ahead Daily Maintenance scheduling, fuel procurement

Battery optimization

For utilities with battery energy storage systems (BESS), the optimization problem is: when to charge, when to discharge, and at what rate, given:

  • Wholesale energy prices (LMP — locational marginal pricing)
  • Renewables forecast (charge when excess solar, discharge when deficit)
  • Demand forecast (peak shaving reduces demand charges)
  • Battery degradation model (cycle count, depth of discharge, temperature affect battery life)

Implement as a mixed-integer linear program (MILP) or reinforcement learning agent. Score hourly, dispatch to DERMS/SCADA. Store optimization decisions and outcomes in gold for performance tracking.

Demand response analytics

Demand response (DR) programs incentivize customers to reduce consumption during peak periods. Analytics supports DR in several ways:

  • Customer targeting — identify customers with the most flexible load (high AC usage, EV charging, pool pumps) using AMI interval data
  • Baseline estimation — calculate the counterfactual (what would the customer have consumed without DR?) using CAISO/PJM 10-in-10 baseline or regression-based methods
  • Performance measurement — actual reduction vs baseline, program cost-effectiveness ($/kW reduced)
  • Optimization — which customers to call, when, and with what incentive level to achieve the target demand reduction at minimum cost

GIS integration

Geospatial analytics with Azure Maps

Utility data is inherently geospatial. Azure Maps provides the visualization and geocoding layer; the analytics lives in your medallion lakehouse.

Key integration patterns:

  • Asset visualization — plot transformers, feeders, substations, and meters on a map; color-code by health score, age, or loading
  • Outage mapping — real-time outage polygons from OMS data; overlay with weather radar; show estimated restoration times per area
  • Service territory analysis — customer density, revenue per square mile, infrastructure investment needs by area
  • Storm path overlay — project forecasted storm path onto the grid model; highlight at-risk circuits and pre-position crews

Field crew optimization

Field operations consume a significant portion of utility O&M budgets. Optimize crew dispatch using:

  • Route optimization — minimize travel time for scheduled maintenance using vehicle routing problem (VRP) solvers
  • Dynamic dispatch — during storms, reassign crews based on real-time outage priority, crew location, and skills match
  • Work-order clustering — group nearby maintenance tasks into efficient field trips
  • Parts inventory — predict which parts are needed based on asset health scores and failure modes; pre-stage in service vehicles

Surface crew assignments and optimized routes in a mobile-friendly Power BI report or a custom Power App with Azure Maps integration. See Tutorial 11 -- Data API Builder for serving the data layer.

Note

For real-time fleet tracking and dynamic dispatch, consider Azure Maps Route API + Azure Functions for the optimization logic, with results written back to gold for performance analytics. The historical crew efficiency data in gold feeds back into improving the dispatch model.

Carbon accounting and ESG

Utilities face increasing ESG (Environmental, Social, Governance) reporting requirements. Build carbon accounting into the platform from the start rather than bolting it on later.

Emissions data pipeline

Scope What to measure Data source Calculation
Scope 1 (direct) Fossil fuel combustion at owned plants Plant fuel consumption records, EPA CEMS data Fuel quantity x emission factor (EPA AP-42)
Scope 2 (indirect - purchased electricity) Grid electricity consumed at facilities Utility bills, interval meter data kWh consumed x grid emission factor (eGRID)
Scope 3 (value chain) Upstream fuel production, downstream customer usage Supplier data, customer usage estimates Activity data x lifecycle emission factors

Build as a dbt pipeline: bronze captures raw consumption and generation data, silver standardizes units and applies emission factors, gold produces reporting-ready tables for GHG Protocol, CDP, and SEC climate disclosure. Version your emission factor tables in git so calculations are reproducible year-over-year.

Renewable energy certificates (RECs)

Track REC generation, ownership transfers, and retirement in your gold layer. Each MWh of renewable generation produces one REC. Match RECs to customer programs (green tariffs, community solar) and corporate reporting needs. Integrate with registries (M-RETS, GATS, WREGIS) via API or batch extract.

EV integration analytics

Electric vehicle charging is the fastest-growing load category for many utilities. Analytics supports grid planning and customer programs.

  • Charging load profiles — identify when and where EV charging occurs using AMI data (detect EV charging signatures in household load curves)
  • Grid impact assessment — model the impact of EV adoption scenarios on transformer loading, feeder capacity, and substation capacity by neighborhood
  • Managed charging programs — analytics for time-of-use rate design and managed charging incentives that shift EV load to off-peak hours
  • Public charging infrastructure — site selection for utility-owned chargers based on traffic patterns, grid capacity, and underserved areas

Trade-offs

Give Get
ADX/Eventhouse for time-series (dedicated service) Sub-second queries on billions of meter reads, but additional service to manage
NERC CIP scope inclusion (cloud in BES scope) Full cloud analytics capability, but significant compliance overhead
Edge inference for outage detection Faster detection during comms outages, but model update complexity
Multi-region deployment (data residency per state) PUC compliance, but higher infrastructure cost and operational complexity
Granular AMI data (15-min intervals kept hot) Rich analytics capability, but significant storage cost at scale