A New Paradigm for Medium-Range Severe Weather Forecasts: Probabilistic Random Forest–Based Predictions

Hill, Aaron J.; Schumacher, Russ S.; Jirak, Israel L.

doi:10.1175/waf-d-22-0143.1

2023

Details

Journal Title:

Weather and Forecasting
Personal Author:

Hill, Aaron J. ; Schumacher, Russ S. ; Jirak, Israel L.
NOAA Program & Office:

NWS (National Weather Service) ; NCEP (National Centers for Environmental Prediction) ; SPC (Storm Prediction Center)
Description:

Historical observations of severe weather and simulated severe weather environments (i.e., features) from the Global Ensemble Forecast System v12 (GEFSv12) Reforecast Dataset (GEFS/R) are used in conjunction to train and test random forest (RF) machine learning (ML) models to probabilistically forecast severe weather out to days 4–8. RFs are trained with ∼9 years of the GEFS/R and severe weather reports to establish statistical relationships. Feature engineering is briefly explored to examine alternative methods for gathering features around observed events, including simplifying features using spatial averaging and increasing the GEFS/R ensemble size with time lagging. Validated RF models are tested with ∼1.5 years of real-time forecast output from the operational GEFSv12 ensemble and are evaluated alongside expert human-generated outlooks from the Storm Prediction Center (SPC). Both RF-based forecasts and SPC outlooks are skillful with respect to climatology at days 4 and 5 with diminishing skill thereafter. The RF-based forecasts exhibit tendencies to slightly underforecast severe weather events, but they tend to be well-calibrated at lower probability thresholds. Spatially averaging predictors during RF training allows for prior-day thermodynamic and kinematic environments to generate skillful forecasts, while time lagging acts to expand the forecast areas, increasing resolution but decreasing overall skill. The results highlight the utility of ML-generated products to aid SPC forecast operations into the medium range.

Significance Statement

Medium-range severe weather forecasts generated from statistical models are explored here alongside operational forecasts from the Storm Prediction Center (SPC). Human forecasters at the SPC rely on traditional numerical weather prediction model output to make medium-range outlooks, and statistical products that mimic operational forecasts can be used as guidance tools for forecasters. The statistical models relate simulated severe weather environments from a global weather model to historical records of severe weather and perform noticeably better than human-generated outlooks at shorter lead times (e.g., days 4 and 5) and are capable of capturing the general location of severe weather events 8 days in advance. The results highlight the value in these data-driven methods in supporting operational forecasting.
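
The description states that forecasts are "skillful with respect to climatology" but does not name the verification metric on this page. One common way to express such skill for probabilistic forecasts is the Brier skill score (BSS), and the sketch below assumes that metric purely for illustration. The function names and the toy arrays are hypothetical and are not taken from the paper or its data.

```python
import numpy as np


def brier_score(prob, obs):
    """Mean squared difference between forecast probabilities and 0/1 outcomes."""
    prob = np.asarray(prob, dtype=float)
    obs = np.asarray(obs, dtype=float)
    return float(np.mean((prob - obs) ** 2))


def brier_skill_score(prob, obs, clim_prob):
    """BSS = 1 - BS_forecast / BS_reference, with climatology as the reference.

    Positive values mean the forecast beats climatology; 1 is a perfect forecast.
    """
    bs_ref = brier_score(clim_prob, obs)
    if bs_ref == 0.0:
        raise ValueError("Reference Brier score is zero; BSS is undefined.")
    return 1.0 - brier_score(prob, obs) / bs_ref


# Hypothetical toy sample: 1 = severe weather observed at a grid point, 0 = none.
obs = np.array([0, 0, 1, 0, 1, 0, 0, 0])
# Hypothetical forecast probabilities (for example, from a random forest).
forecast = np.array([0.05, 0.10, 0.60, 0.05, 0.45, 0.15, 0.05, 0.05])
# Assumption: a constant climatological probability equal to the sample event frequency.
climatology = np.full(obs.shape, obs.mean())

bss = brier_skill_score(forecast, obs, climatology)
```

In this convention a forecast identical to climatology scores exactly 0, so "diminishing skill" at longer lead times would appear as a BSS that shrinks toward zero.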
Keywords:

Atmospheric Science
Source:

Weather and Forecasting, 38(2), 251-272
DOI:

https://doi.org/10.1175/waf-d-22-0143.1
ISSN:

0882-8156 ; 1520-0434
Format:

PDF
Publisher:

American Meteorological Society
Document Type:

Journal Article
Funding:

Grant no. NA20OAR4590350
Rights Information:

Other
Compliance:

Library
Main Document Checksum:

urn:sha256:c70a4250ad977482692ca14c7dc82bb35e49b3b78a4fd05bebc014c9dfb110da
Download URL:

https://repository.library.noaa.gov/view/noaa/53429/noaa_53429_DS1.pdf
File Type:

[PDF - 5.82 MB]
