Forecasting Power Outage Duration with Predictive Models

Author: Gideon Shaked (gshaked@umich.edu)

Table of contents
  1. Forecasting Power Outage Duration with Predictive Models
    1. Introduction
    2. Data Cleaning and Exploratory Data Analysis
      1. Data Cleaning
        1. Data Cleaning Process
        2. Cleaned Data
      2. Missing Values & Imputation
        1. Imputation Methods
        2. Numeric Imputation Visualization
      3. Exploratory Data Analysis
        1. Distribution of Outage Duration
        2. Outage Frequency and Duration Over Time
        3. Breakdown of Outage Causes
        4. Outages and Time-of-Day
        5. Number of Days Affected by Cause and Climate Region
        6. Number of Customers Affected by Cause and Climate Region
    3. Framing a Prediction Problem
      1. Prediction Problem
      2. Evaluation Metrics
        1. Root Mean Squared Error (RMSE)
        2. Coefficient of Determination
        3. Absolute Error
    4. Baseline Model
      1. Model Selection
      2. Feature Selection and Encoding
        1. Quantitative Features (9)
        2. Nominal Categorical Features (12)
        3. Target Variable
      3. Baseline Model Results
        1. Parameter Selection
        2. Performance Metrics
        3. Performance Visualization
        4. Baseline Model Overall Evaluation
    5. Final Model
      1. Engineered Features
      2. Final Model Results
        1. Parameter Selection
        2. Performance Metrics
        3. Performance Visualization
        4. Final Model Overall Evaluation
    6. Conclusion

Introduction

Power outages are costly disruptions that can stem from various causes, ranging from severe weather to technical failures. Understanding the relationships between the cause of an outage and its impact, both in terms of restoration time and number of customers affected, is vital for both policymakers and utility companies.

This project investigates how different aspects of an outage affect how long it takes for power to be restored.

The dataset spans major U.S. power outages from 2000 to 2016 and contains detailed outage characteristics including cause, duration, geographic metadata, and economic indicators.

  • Data Size: 1,535 outage events
  • Relevant Columns
    • outage.start.date: The date that the outage started.
    • outage.start.time: The time that the outage started.
    • u.s._state: Full name of the U.S. state where the outage occurred.
    • postal.code: State-level postal code abbreviation (e.g. MI, CA).
    • nerc.region: NERC (North American Electric Reliability Corporation) reliability region associated with the outage.
    • climate.region: Climate region classification based on geography (e.g. Southeast, Northwest).
    • anomaly.level: A numerical indicator of how abnormal the weather was relative to historical averages.
    • climate.category: Broader climate category classification used in the dataset.
    • hurricane.names: Name of the hurricane associated with the outage, if any.
    • cause.category: High-level classification of outage cause (e.g. Severe Weather, Equipment Failure).
    • cause.category.detail: More specific cause label (e.g. Lightning, Thunderstorm, Vandalism).
    • demand.loss.mw: The megawatts of electricity demand lost during the outage.
    • population: Total population in the affected area.
    • popden_urban: Urban population density (people per square mile).
    • popden_rural: Rural population density (people per square mile).
    • poppct_urban: Percentage of the local population living in urban areas.
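
As a point of reference, the sketch below shows one way the dataset could be loaded and previewed with pandas. The file path and format are hypothetical, and the lowercase column names assume the renaming step described in the data cleaning section.

```python
import pandas as pd

# Hypothetical export of the raw outage data; the published dataset is an
# Excel workbook, so this path and format are assumptions.
outages = pd.read_csv("outages.csv")

# Columns discussed above (names assume the lowercasing applied during cleaning).
relevant = [
    "outage.start.date", "outage.start.time", "u.s._state", "postal.code",
    "nerc.region", "climate.region", "anomaly.level", "climate.category",
    "hurricane.names", "cause.category", "cause.category.detail",
    "demand.loss.mw", "population", "popden_urban", "popden_rural",
    "poppct_urban",
]
print(outages.shape)              # roughly 1,535 outage events are expected
print(outages[relevant].head())   # preview the columns listed above
```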

Data Cleaning and Exploratory Data Analysis

Data Cleaning

Data Cleaning Process

The data cleaning process involved several key steps to prepare the outage dataset for analysis:

  1. First, all column names were standardized to lowercase to ensure consistency and simplify referencing.
  2. Next, the dataset’s separate date and time fields for the start and end of outages were combined and converted into proper datetime objects. This allowed for precise time-based calculations.
  3. Using these timestamps, the outage duration for each record was calculated by taking the difference between the restoration time and the start time. This duration was stored both as a full time interval and as a simplified count of days.

These transformations enabled accurate modeling and analysis of outage lengths and time-related patterns in the data.
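
A minimal pandas sketch of these steps, assuming the raw table is loaded in a DataFrame named df, might look like the following (the raw date and time fields are assumed to be text, and errors="coerce" is used to tolerate unparseable timestamps):

```python
import pandas as pd

# 1. Standardize column names to lowercase.
df.columns = df.columns.str.lower()

# 2. Combine the separate date and time fields into single datetime columns.
df["outage.start"] = pd.to_datetime(
    df["outage.start.date"].astype(str) + " " + df["outage.start.time"].astype(str),
    errors="coerce",
)
df["outage.end"] = pd.to_datetime(
    df["outage.restoration.date"].astype(str) + " " + df["outage.restoration.time"].astype(str),
    errors="coerce",
)

# 3. Compute the outage duration as a timedelta and as a whole-day count.
df["outage.duration"] = df["outage.end"] - df["outage.start"]
df["outage.duration.days"] = df["outage.duration"].dt.days
```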

Cleaned Data

Below are the first five rows of the cleaned data.

OBS year month u.s._state postal.code nerc.region climate.region anomaly.level climate.category outage.start.date outage.start.time outage.restoration.date outage.restoration.time cause.category cause.category.detail hurricane.names outage.duration demand.loss.mw customers.affected res.price com.price ind.price total.price res.sales com.sales ind.sales total.sales res.percen com.percen ind.percen res.customers com.customers ind.customers total.customers res.cust.pct com.cust.pct ind.cust.pct pc.realgsp.state pc.realgsp.usa pc.realgsp.rel pc.realgsp.change util.realgsp total.realgsp util.contri pi.util.ofusa population poppct_urban poppct_uc popden_urban popden_uc popden_rural areapct_urban areapct_uc pct_land pct_water_tot pct_water_inland outage.start outage.end outage.duration.days
1 2011 7 Minnesota MN MRO East North Central -0.3 normal Friday, July 1, 2011 5:00:00 PM Sunday, July 3, 2011 8:00:00 PM severe weather nan nan 2 days 03:00:00 nan 70000 11.6 9.18 6.81 9.28 2.33292e+06 2.11477e+06 2.11329e+06 6.56252e+06 35.5491 32.225 32.2024 2308736 276286 10673 2595696 88.9448 10.644 0.4112 51268 47586 1.07738 1.6 4802 274182 1.75139 2.2 5348119 73.27 15.28 2279 1700.5 18.2 2.14 0.6 91.5927 8.40733 5.47874 2011-07-01 17:00:00 2011-07-03 20:00:00 2
2 2014 5 Minnesota MN MRO East North Central -0.1 normal Sunday, May 11, 2014 6:38:00 PM Sunday, May 11, 2014 6:39:00 PM intentional attack vandalism nan 0 days 00:01:00 nan nan 12.12 9.71 6.49 9.28 1.58699e+06 1.80776e+06 1.88793e+06 5.28423e+06 30.0325 34.2104 35.7276 2345860 284978 9898 2640737 88.8335 10.7916 0.3748 53499 49091 1.08979 1.9 5226 291955 1.79 2.2 5457125 73.27 15.28 2279 1700.5 18.2 2.14 0.6 91.5927 8.40733 5.47874 2014-05-11 18:38:00 2014-05-11 18:39:00 0
3 2010 10 Minnesota MN MRO East North Central -1.5 cold Tuesday, October 26, 2010 8:00:00 PM Thursday, October 28, 2010 10:00:00 PM severe weather heavy wind nan 2 days 02:00:00 nan 70000 10.87 8.19 6.07 8.15 1.46729e+06 1.80168e+06 1.9513e+06 5.22212e+06 28.0977 34.501 37.366 2300291 276463 10150 2586905 88.9206 10.687 0.3924 50447 47287 1.06683 2.7 4571 267895 1.70627 2.1 5310903 73.27 15.28 2279 1700.5 18.2 2.14 0.6 91.5927 8.40733 5.47874 2010-10-26 20:00:00 2010-10-28 22:00:00 2
4 2012 6 Minnesota MN MRO East North Central -0.1 normal Tuesday, June 19, 2012 4:30:00 AM Wednesday, June 20, 2012 11:00:00 PM severe weather thunderstorm nan 1 days 18:30:00 nan 68200 11.79 9.25 6.71 9.19 1.85152e+06 1.94117e+06 1.99303e+06 5.78706e+06 31.9941 33.5433 34.4393 2317336 278466 11010 2606813 88.8954 10.6822 0.4224 51598 48156 1.07148 0.6 5364 277627 1.93209 2.2 5380443 73.27 15.28 2279 1700.5 18.2 2.14 0.6 91.5927 8.40733 5.47874 2012-06-19 04:30:00 2012-06-20 23:00:00 1
5 2015 7 Minnesota MN MRO East North Central 1.2 warm Saturday, July 18, 2015 2:00:00 AM Sunday, July 19, 2015 7:00:00 AM severe weather nan nan 1 days 05:00:00 250 250000 13.07 10.16 7.74 10.43 2.02888e+06 2.16161e+06 1.77794e+06 5.97034e+06 33.9826 36.2059 29.7795 2374674 289044 9812 2673531 88.8216 10.8113 0.367 54431 49844 1.09203 1.7 4873 292023 1.6687 2.2 5489594 73.27 15.28 2279 1700.5 18.2 2.14 0.6 91.5927 8.40733 5.47874 2015-07-18 02:00:00 2015-07-19 07:00:00 1

Missing Values & Imputation

Imputation Methods

For numeric features, we used mean imputation to fill in missing values. This method replaces each missing entry with the average of the observed values in that column. We chose this approach because it is simple, preserves each column’s mean and overall scale, and avoids discarding records with incomplete information. It keeps the dataset consistent without introducing artificial patterns or relying on external assumptions, although it does slightly shrink the variance of the imputed columns. Because the missing values appeared to be missing completely at random (MCAR), mean imputation was an appropriate and statistically sound choice.

For categorical features, we filled missing values with the placeholder label “missing”. This lets the absence of a value act as a meaningful category of its own, rather than forcing us to guess the correct label or drop the row. It also ensures that every categorical variable is complete and can be cleanly encoded without ambiguity. In effect, rather than imputing categorical features, we treated missingness itself as a category.
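
A compact sketch of both imputation strategies, assuming the cleaned DataFrame df and restricting the placeholder fill to object-dtype (string) columns:

```python
# Mean imputation for numeric columns.
numeric_cols = df.select_dtypes(include="number").columns
df[numeric_cols] = df[numeric_cols].fillna(df[numeric_cols].mean())

# Treat missingness as its own category for string-valued columns.
categorical_cols = df.select_dtypes(include="object").columns
df[categorical_cols] = df[categorical_cols].fillna("missing")
```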

Numeric Imputation Visualization

Below are the distributions of the features that required imputation, shown before and after imputation. The distribution of each feature changed very little, which further supports our choice of mean imputation.
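
A plotting sketch along these lines, using matplotlib and a hypothetical pair of DataFrames df_raw (before imputation) and df_imputed (after), could produce one of these before/after comparisons:

```python
import matplotlib.pyplot as plt

def plot_imputation_effect(before, after, name, bins=30):
    """Overlay a column's distribution before and after mean imputation."""
    fig, ax = plt.subplots()
    ax.hist(before.dropna(), bins=bins, alpha=0.5, label="before imputation")
    ax.hist(after, bins=bins, alpha=0.5, label="after imputation")
    ax.set_title(f"Distribution of {name}")
    ax.set_xlabel(name)
    ax.set_ylabel("frequency")
    ax.legend()
    return fig

# Example usage (hypothetical frames):
# plot_imputation_effect(df_raw["demand.loss.mw"], df_imputed["demand.loss.mw"],
#                        "demand.loss.mw")
```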

Exploratory Data Analysis

Distribution of Outage Duration

We began by visualizing the distribution of outage durations:

Outages typically last under 3 days, though long tails exist due to extreme events.

Outage Frequency and Duration Over Time

We also looked at how outages are distributed over time:

It seems that there was a spike in outages during 2011-2013. This was probably due to an increase in severe weather events during this time period.

Outage frequency peaks in summer, likely reflecting storm season and heavy air-conditioning demand, while outage duration spikes in winter, potentially due to ice storms and slower repairs.

Breakdown of Outage Causes

We examined cause-specific impacts as well:

Weather-related outages generally last longer than those caused by equipment failures or intentional attacks.

We also explored how outage causes vary over time and geography:

It looks like “public appeal” spikes as a cause during the summer months, which tracks with our earlier observation that summer outages are partly driven by air-conditioning demand overtaxing the grid (“public appeal” outages are those in which a utility asks the public to voluntarily conserve power).

Likewise, “public appeal” outages are frequent in the South and Southwest regions, which are the areas likely to be the hottest and thus most aggressively consuming electricity during summer.

It looks like this plot follows the earlier trend that we found, where severe weather events during 2011-2013 caused a spike in power outages across the board.

Outages and Time-of-Day

We visualized how time-of-day and work hours influence outage durations:

It looks like outages during off-hours tend to last longer, especially in certain climate zones.

Number of Days Affected by Cause and Climate Region

This table shows how many days each climate region was affected by different types of power outage causes. It provides insight into which causes are most frequent across regions. For example, severe weather is consistently a leading cause, especially in the Northeast, Southeast, and Central regions.

Climate Region | Equipment Failure | Fuel Supply Emergency | Intentional Attack | Islanding | Public Appeal | Severe Weather | System Operability Disruption
Central | 5 | 4 | 34 | 3 | 2 | 133 | 10
East North Central | 3 | 4 | 20 | 1 | 2 | 104 | 3
Northeast | 5 | 14 | 131 | 1 | 4 | 175 | 14
Northwest | 2 | 1 | 85 | 3 | 2 | 25 | 4
South | 9 | 4 | 28 | 2 | 42 | 106 | 27
Southeast | 4 | 0 | 9 | 0 | 5 | 116 | 16
Southwest | 5 | 1 | 61 | 1 | 1 | 10 | 9
West | 21 | 10 | 31 | 28 | 9 | 67 | 39
West North Central | 1 | 0 | 4 | 5 | 2 | 4 | 0

Number of Customers Affected by Cause and Climate Region

This table tracks the scale of impact by showing, for each climate region and cause, the number of customers affected (the fractional values suggest these are per-outage averages). It complements the first table by highlighting not just the frequency of outages but their magnitude. For instance, although equipment failures occurred less frequently in the West, they affected a large number of customers, showing the importance of evaluating not just how often events happen, but how severe they are in terms of customer impact.

Climate Region | Equipment Failure | Fuel Supply Emergency | Intentional Attack | Islanding | Public Appeal | Severe Weather | System Operability Disruption
Central | 87750 | 0 | 110.714 | 9666.67 | 0 | 148707 | 210450
East North Central | 0 | 0 | 660.111 | 0 | 7600 | 134972 | 759738
Northeast | 28575.8 | 0.5 | 1055.58 | 0 | 18600 | 169467 | 530759
Northwest | 46651.5 | 0 | 92.5926 | 0 | 4000 | 169284 | 35000
South | 62721.7 | 0 | 1042.83 | 14500 | 4917.64 | 223231 | 227102
Southeast | 145420 | 0 | 0 | 0 | 0 | 206523 | 75555.5
Southwest | 55666.7 | 0 | 327.423 | 35230 | 0 | 85138.4 | 135656
West | 198608 | 0 | 14060 | 5039.19 | 0 | 361041 | 152040
West North Central | 0 | 0 | 0 | 0 | 34500 | 74178 | 0
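
Both tables can be produced with a pandas pivot, sketched below. The aggregation functions are assumptions (a sum of days for the first table and a per-outage mean for the second), since the exact aggregation is not otherwise specified; the column names follow the cleaned dataset.

```python
# Days affected, by climate region and cause (aggregation is an assumption).
days_by_cause = df.pivot_table(
    index="climate.region",
    columns="cause.category",
    values="outage.duration.days",
    aggfunc="sum",
    fill_value=0,
)

# Customers affected per outage, by climate region and cause.
customers_by_cause = df.pivot_table(
    index="climate.region",
    columns="cause.category",
    values="customers.affected",
    aggfunc="mean",
    fill_value=0,
)
```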

Framing a Prediction Problem

Prediction Problem

We decided to frame a regression problem:

Given all of the data known at the time of the outage, predict the duration of the outage.

This problem has practical importance: better duration forecasts allow for smarter resource allocation and improved communication with customers.
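
Concretely, the features are restricted to information available when the outage starts, and the target is the outage duration. A sketch of this setup is shown below; the conversion of the timedelta target to hours, the exact set of dropped columns, and the 80/20 split are assumptions for illustration.

```python
from sklearn.model_selection import train_test_split

# Features known at the start of the outage; columns that leak the outcome
# (restoration time, computed duration) are dropped.
X = df.drop(columns=["outage.duration", "outage.duration.days", "outage.end",
                     "outage.restoration.date", "outage.restoration.time"])
y = df["outage.duration"].dt.total_seconds() / 3600  # duration in hours

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42
)
```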

Evaluation Metrics

Specifically, we predicted the target variable outage.duration. To assess model performance, we primarily used root mean squared error (RMSE) and the coefficient of determination, along with absolute error.

Root Mean Squared Error (RMSE)

RMSE is a widely used metric that measures the average magnitude of the prediction error, with larger errors penalized more heavily due to squaring. It is defined as:

\[\text{RMSE} = \sqrt{\frac{1}{n} \sum_{i=1}^n (\hat{y}_i - y_i)^2}\]
  • Advantages: RMSE is intuitive and retains the original units of the target variable, making it easy to interpret. It is particularly useful when large errors are undesirable, as it emphasizes them more than metrics like Mean Absolute Error (MAE).

Coefficient of Determination

The coefficient of determination measures how well the model explains the variability of the target variable:

\[R^2 = 1 - \frac{\sum_{i=1}^n (\hat{y}_i - y_i)^2}{\sum_{i=1}^n (y_i - \bar{y})^2}\]
  • Advantages: The coefficient of determination provides a normalized measure of fit, where 1 indicates perfect prediction, 0 indicates no more explanatory power than always predicting the mean, and negative values are possible for models that do worse than that baseline. It complements RMSE by showing how well the model captures overall variance rather than just minimizing error.

Together, RMSE and the coefficient of determination offer a comprehensive view of model performance, with one emphasizing prediction error magnitude, and the other explaining variance captured.

Absolute Error

Absolute Error measures the magnitude of the difference between the predicted and actual values for each individual prediction:

\[\text{Absolute Error}_i = |\hat{y}_i - y_i|\]
  • Advantages: It offers a direct, interpretable measure of model accuracy for each sample, without amplifying large errors. This makes it well-suited for visualizing performance over time or across conditions, especially when highlighting when and where predictions deviate most.

We visualize absolute error over time in order to see directly how model performance varies by year and by time of year.
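
All three metrics are straightforward to compute with numpy and scikit-learn; the helper below is a minimal sketch under that assumption.

```python
import numpy as np
from sklearn.metrics import mean_squared_error, r2_score

def evaluate(y_true, y_pred):
    """Return RMSE, R^2, and the per-sample absolute errors."""
    rmse = np.sqrt(mean_squared_error(y_true, y_pred))
    r2 = r2_score(y_true, y_pred)
    abs_err = np.abs(np.asarray(y_pred) - np.asarray(y_true))
    return rmse, r2, abs_err
```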

Baseline Model

Model Selection

We selected a Gradient Boosting Regressor for this task due to its strong performance on structured, tabular datasets and its ability to model complex nonlinear relationships. Gradient boosting builds an ensemble of decision trees in a sequential manner, where each new tree corrects the errors of the previous ones. This results in a highly flexible and powerful model capable of capturing subtle patterns in the data.

Given the mix of numeric, categorical, and engineered features (some with high cardinality and others with skewed distributions), gradient boosting offers robustness without requiring extensive feature scaling or transformation. It also handles interactions between variables well, which is important in a dataset where multiple factors (e.g., weather, population, time of day) may jointly influence outage duration.

Additionally, gradient boosting supports feature importance analysis, aiding interpretability and helping identify key drivers of model predictions.

Feature Selection and Encoding

We selected 21 features to predict the target variable, outage.duration.

Quantitative Features (9)

These features are continuous or count-based. Values were standardized using StandardScaler.

  • year
  • month
  • hour
  • demand.loss.mw
  • population
  • popden_urban
  • popden_rural
  • poppct_urban
  • anomaly.level

Nominal Categorical Features (12)

These features have no intrinsic order. They were encoded using OneHotEncoder.

  • season
  • day_period
  • is_work_hours
  • is_weekend
  • u.s._state
  • postal.code
  • nerc.region
  • climate.region
  • climate.category
  • hurricane.names
  • cause.category
  • cause.category.detail

Target Variable

  • outage.duration: The duration of the power outage in hours (regression target – quantitative).
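
Putting these pieces together, the preprocessing and model can be assembled into a single scikit-learn pipeline. The sketch below reflects the StandardScaler/OneHotEncoder encoding described above; handle_unknown="ignore" and the random_state are assumptions.

```python
from sklearn.compose import ColumnTransformer
from sklearn.ensemble import GradientBoostingRegressor
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder, StandardScaler

quantitative = ["year", "month", "hour", "demand.loss.mw", "population",
                "popden_urban", "popden_rural", "poppct_urban", "anomaly.level"]
nominal = ["season", "day_period", "is_work_hours", "is_weekend", "u.s._state",
           "postal.code", "nerc.region", "climate.region", "climate.category",
           "hurricane.names", "cause.category", "cause.category.detail"]

preprocessor = ColumnTransformer([
    ("num", StandardScaler(), quantitative),
    ("cat", OneHotEncoder(handle_unknown="ignore"), nominal),
])

model = Pipeline([
    ("preprocess", preprocessor),
    ("regressor", GradientBoostingRegressor(random_state=42)),
])
```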

Baseline Model Results

Parameter Selection

After performing a grid search with 5-fold cross-validation, the baseline model’s best parameters were:

  • n_estimators: 200
  • max_depth: 3
  • learning_rate: 0.05
  • min_samples_leaf: 2
  • min_samples_split: 2
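
A grid search of this kind can be run over the pipeline sketched earlier. The grid below is illustrative (it contains the reported best values, but the full grid actually searched is not specified), and X_train / y_train refer to the training split.

```python
from sklearn.model_selection import GridSearchCV

param_grid = {
    "regressor__n_estimators": [100, 200],
    "regressor__max_depth": [3, 5],
    "regressor__learning_rate": [0.05, 0.1],
    "regressor__min_samples_leaf": [2, 5],
    "regressor__min_samples_split": [2, 5],
}

search = GridSearchCV(
    model, param_grid, cv=5,
    scoring="neg_root_mean_squared_error", n_jobs=-1,
)
search.fit(X_train, y_train)
print(search.best_params_)
```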

Performance Metrics

  • RMSE: 103.01
  • Coefficient of determination: 0.37

Performance Visualization

The following plot shows the model performance for outage duration over time, along with the absolute error.
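
A plotting sketch for this kind of figure, assuming test-set start dates (test_dates), actual durations (y_test), and model predictions (predictions) aligned by index:

```python
import matplotlib.pyplot as plt
import numpy as np

order = np.argsort(test_dates.values)   # sort everything by outage start date
dates = test_dates.values[order]
actual = np.asarray(y_test)[order]
pred = np.asarray(predictions)[order]

fig, (ax1, ax2) = plt.subplots(2, 1, sharex=True, figsize=(10, 6))
ax1.plot(dates, actual, alpha=0.7, label="actual duration")
ax1.plot(dates, pred, alpha=0.7, label="predicted duration")
ax1.set_ylabel("outage duration (hours)")
ax1.legend()

ax2.plot(dates, np.abs(pred - actual), color="tab:red")
ax2.set_ylabel("absolute error (hours)")
ax2.set_xlabel("outage start date")
fig.tight_layout()
```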

Baseline Model Overall Evaluation

We can see from the plot above that the model tends to miss outlier outage events, which suggests that some factor influencing outage duration is not captured in our dataset. This is also supported by the low coefficient of determination, which indicates that much of the variance in actual outage duration is not explained by the model.

That said, the model performs reasonably well for the majority of outages, and we consider it a solid baseline.

Final Model

After evaluating several modeling options, we retained the Gradient Boosting Regressor in the final pipeline due to its superior performance on key evaluation metrics, particularly RMSE and coefficient of determination. Despite some limitations in capturing all variance (as reflected by a lower coefficient of determination), the model consistently delivered low absolute prediction error, indicating strong predictive utility.

Gradient boosting effectively handled the dataset’s mix of numeric and categorical features, including engineered variables with skewed distributions and nonlinear effects. Its ability to model complex interactions between features made it especially well-suited to this multifactorial problem.

Engineered Features

Feature Name | Description | Justification
year_offset | Relative year from a baseline (e.g., 2000) | Captures long-term trends while avoiding large raw year values that can distort learning.
had_hurricane | Binary flag for presence of a hurricane | Simplifies hurricane data and highlights its potential impact on outages.
cause_combined | Combined category of general and detailed cause | Encodes richer information while reducing redundancy between related categorical features.
popden_diff | Urban minus rural population density | Reflects demographic imbalance that may influence infrastructure vulnerability.
demand_loss_log | Log-transformed demand loss (in megawatts) | Mitigates skew and reduces sensitivity to large outliers in demand loss data.
is_peak_hour | Binary indicator for peak electricity usage hours | Helps model learn time-sensitive behavior, especially around demand and response prioritization.
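
These features can be derived directly from the cleaned columns. The sketch below is one plausible construction; the baseline year, the peak-hour window, and the use of the "missing" placeholder for absent hurricane names are assumptions.

```python
import numpy as np

df["year_offset"] = df["year"] - 2000
df["had_hurricane"] = (df["hurricane.names"] != "missing").astype(int)
df["cause_combined"] = df["cause.category"] + " | " + df["cause.category.detail"]
df["popden_diff"] = df["popden_urban"] - df["popden_rural"]
df["demand_loss_log"] = np.log1p(df["demand.loss.mw"])
df["is_peak_hour"] = df["hour"].between(16, 20).astype(int)  # assumed peak window
```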

Final Model Results

The same hyperparameter grid and preprocessing steps were used as in the baseline model.

Parameter Selection

After performing a grid search with 5-fold cross-validation, the final model’s best parameters were:

  • n_estimators: 200
  • max_depth: 3
  • learning_rate: 0.05
  • min_samples_leaf: 2
  • min_samples_split: 5 (changed from baseline model)

Performance Metrics

  • RMSE: 102.00
  • Coefficient of determination: 0.38

Performance Visualization

The following plot shows the final model’s performance for outage duration over time compared to the baseline model’s performance, along with the absolute error as before.

Final Model Overall Evaluation

The final model is only a slight improvement on the baseline model, with the coefficient of determination improving from 0.37 to 0.38 and the RMSE improving from 103 to 102.

That being said, we believe the engineered features make the model more robust and more likely to generalize to unseen data.

Conclusion

In this project, we set out to understand how the cause of a power outage affects its overall impact, specifically the number of customers affected and the time it takes to restore service. Through a combination of exploratory analysis and predictive modeling, we found that outages caused by severe weather tend to be more disruptive and longer-lasting than those from technical failures or other sources.

We trained a regression model to predict outage duration using only information that would be available at the time the outage begins. Our baseline model already showed reasonable performance, and after engineering a few additional features, such as whether a hurricane was involved and how urban the affected area was, we were able to reduce the prediction error further.

While there’s still plenty of room for improvement, this project shows that it’s possible to make meaningful predictions about outage duration using static metadata alone. That kind of insight could be valuable for improving communication with customers and planning emergency responses more effectively.