U.S. flag An official website of the United States government.
Official websites use .gov

A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS

A lock ( ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.

i

Observation Definitions and Their Implications in Machine Learning–Based Predictions of Excessive Rainfall



Select the Download button to view the document
Please click the download button to view the document.

Details

  • Journal Title:
    Weather and Forecasting
  • Personal Author:
  • NOAA Program & Office:
  • Description:
    The implications of definitions of excessive rainfall observations on machine learning model forecast skill are assessed using the Colorado State University Machine Learning Probabilities (CSU-MLP) forecast system. The CSU-MLP uses historical observations along with reforecasts from a global ensemble to train random forests to probabilistically predict excessive rainfall events. Here, random forest models are trained using two distinct rainfall datasets, one that is composed of fixed-frequency (FF) average recurrence intervals exceedances and flash flood reports and the other a compilation of flooding and rainfall proxies [Unified Flood Verification System (UFVS)]. Both models generate 1–3-day forecasts and are evaluated against a climatological baseline to characterize their overall skill as a function of lead time, season, and region. Model comparisons suggest that regional frequencies in excessive rainfall observations contribute to when and where the ML models issue forecasts and subsequently their skill and reliability. Additionally, the spatiotemporal distribution of observations has implications for ML model training requirements, notably, how long of an observational record is needed to obtain skillful forecasts. Experiments reveal that shorter-trained UFVS-based models can be as skillful as longer-trained FF-based models. In essence, the UFVS dataset exhibits a more robust characterization of excessive rainfall and impacts, and machine learning models trained on more representative datasets of meteorological hazards may not require as extensive training to generate skillful forecasts.
  • Source:
    Weather and Forecasting, 39(11), 1733-1750
  • DOI:
  • ISSN:
    0882-8156 ; 1520-0434
  • Format:
  • Publisher:
  • Document Type:
  • Funding:
  • Rights Information:
    Other
  • Compliance:
    Submitted
  • Main Document Checksum:
    urn:sha-512:5a8ecb149f9767d701add46ecbdf04f0a60cbd95b84723287b806c564eaf8ddcbb38d21994d18cab8b35a2b7144b8bd6005ba28130e3cd5959848947dfa16140
  • Download URL:
  • File Type:
    Filetype[PDF - 10.32 MB ]
ON THIS PAGE

The NOAA IR serves as an archival repository of NOAA-published products including scientific findings, journal articles, guidelines, recommendations, or other information authored or co-authored by NOAA or funded partners. As a repository, the NOAA IR retains documents in their original published format to ensure public access to scientific information.