The effects of imputing missing data on ensemble temperature forecasts

A major issue in developing post-processing methods for numerical weather prediction (NWP) forecasting systems is the need to obtain complete training datasets. Without a complete dataset, it can be difficult, if not impossible, to train and verify statistical post-processing techniques, including ensemble consensus forecasting schemes. In addition, when ensemble forecast data are missing, real-time use of the consensus forecast weighting scheme becomes difficult and the quality of the uncertainty information derived from the ensemble is reduced. To ameliorate these problems, this study analyzes the treatment of missing data in ensemble model temperature forecasts to determine which method of replacing the missing data produces the lowest mean absolute error (MAE) of consensus forecasts while preserving the ensemble calibration. The replacement methods examined include persistence, a Fourier fit to capture seasonal variability, ensemble member mean substitution, three-day mean deviation, and an artificial neural network (ANN). The analysis is performed on 48-hour temperature forecasts for ten locations in the Pacific Northwest. The methods are evaluated according to their effect on the forecast performance of two ensemble post-processing methods: an equal-weight consensus forecast and a ten-day performance-weighted window. The methods are also assessed using rank histograms to determine whether they preserve the calibration of the ensembles. For both post-processing techniques, all imputation methods except ensemble mean substitution produce mean absolute errors not significantly different from those of cases in which all ensemble members are available. However, the three-day mean deviation and ANN have rank histograms similar to that of the baseline of non-imputed cases (i.e., the ensembles are appropriately calibrated) for all locations, while persistence, ensemble mean, and Fourier substitution do not consistently produce appropriately calibrated ensembles. The three-day mean deviation has the advantage of being computationally efficient in a real-time forecasting environment.
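
The evaluation described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names and the sample values are hypothetical, missing members are assumed to be marked with `None`, and only the simplest of the listed methods (ensemble member mean substitution) is shown, scored with the MAE of an equal-weight consensus forecast and a rank histogram.

```python
from statistics import fmean


def impute_with_member_mean(forecasts):
    """Replace missing members (None) with the mean of the available members."""
    available = [f for f in forecasts if f is not None]
    if not available:
        raise ValueError("no ensemble members available")
    fill = fmean(available)
    return [fill if f is None else f for f in forecasts]


def equal_weight_consensus(forecasts):
    """Equal-weight consensus forecast: the plain average of all members."""
    return fmean(forecasts)


def mean_absolute_error(predictions, observations):
    """MAE between consensus forecasts and verifying observations."""
    return fmean([abs(p - o) for p, o in zip(predictions, observations)])


def rank_histogram(ensembles, observations):
    """Count where each observation ranks among its ensemble members.

    With n members there are n + 1 possible ranks; a roughly flat
    histogram suggests a calibrated ensemble.
    """
    n_members = len(ensembles[0])
    counts = [0] * (n_members + 1)
    for members, obs in zip(ensembles, observations):
        rank = sum(1 for m in members if m < obs)
        counts[rank] += 1
    return counts


# Hypothetical 4-member temperature forecasts (deg C) for three days.
raw = [
    [10.0, None, 12.0, 11.0],
    [8.0, 9.0, 10.0, None],
    [15.0, 14.0, 16.0, 17.0],
]
observed = [11.5, 8.5, 18.0]

imputed = [impute_with_member_mean(day) for day in raw]
consensus = [equal_weight_consensus(day) for day in imputed]
print("MAE:", mean_absolute_error(consensus, observed))
print("Rank histogram:", rank_histogram(imputed, observed))
```

In this sketch a substituted member always sits at the centre of the available members, so it leaves the equal-weight consensus unchanged while narrowing the ensemble spread. The rank histogram is the diagnostic that would expose such a change in calibration.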

Questions? Email Resource Support Contact:

opensky@ucar.edu
UCAR/NCAR - Library

Resource Type	publication
Temporal Range Begin	N/A
Temporal Range End	N/A
Temporal Resolution	N/A
Bounding Box North Lat	N/A
Bounding Box South Lat	N/A
Bounding Box West Long	N/A
Bounding Box East Long	N/A
Spatial Representation	N/A
Spatial Resolution	N/A
Related Links	N/A
Additional Information	N/A
Resource Format	PDF
Standardized Resource Format	PDF
Asset Size	N/A
Legal Constraints	Copyright 2011 Academy Publisher.
Access Constraints	None
Software Implementation Language	N/A

Resource Support Name	N/A
Resource Support Email	opensky@ucar.edu
Resource Support Organization	UCAR/NCAR - Library
Distributor	N/A
Metadata Contact Name	N/A
Metadata Contact Email	opensky@ucar.edu
Metadata Contact Organization	UCAR/NCAR - Library

Author	McCandless, T.; Haupt, Sue Ellen; Young, G.
Publisher	UCAR/NCAR - Library
Publication Date	2011-02-01T00:00:00
Digital Object Identifier (DOI)	Not Assigned
Alternate Identifier	N/A
Resource Version	N/A
Topic Category	geoscientificInformation
Progress	N/A
Metadata Date	2023-08-18T19:09:21.181841
Metadata Record Identifier	edu.ucar.opensky::articles:17538
Metadata Language	eng; USA
Suggested Citation	McCandless, T., Haupt, Sue Ellen, & Young, G. (2011). The effects of imputing missing data on ensemble temperature forecasts. UCAR/NCAR - Library. http://n2t.net/ark:/85065/d7b56m2k. Accessed 21 June 2025.

Harvest Source

ISO-19139 Metadata