Refactoring Data-Driven Model Selection Code for Improvements in Interpretability, Generality, and Computational Expense

Buchholz et al. used observations of total column carbon monoxide (CO) from the Measurements Of Pollution In The Troposphere (MOPITT) satellite instrument to build a record of monthly anomalies between 2001 and 2016, focusing on 7 biomass burning regions in the Southern Hemisphere and tropics. CO anomalies in each of the regions were modeled using climate indices for influential climate modes. A linear modeling approach was used, where de-trended, de-seasonalized, regionally aggregated CO measurements were taken as the response variable, and the climate index anomaly values (at various time lags) were taken as explanatory variables. Initial analyses were completed in MATLAB using serial algorithms carried out over non-functionalized scripts. We sought to refactor this codebase, with 3 specific improvement goals; first, to improve code interpretability in preparation for public release; second, to improve code generality, so that the techniques and code used in this application can be easily adapted for similar problems; and third, to utilize parallel computing to substantially speed up program executions. During the early phase of this refactoring, data structures and algorithms were selected to work with the parallel computing tools in the MATLAB Parallel Computing Toolbox. When the codebase was sufficiently developed, a series of parallel timing studies were performed to assess the extent of realizable time savings; in general, these savings were substantial.

To Access Resource:

Questions? Email Resource Support Contact:

    UCAR/NCAR - Library


Resource Type publication
Temporal Range Begin N/A
Temporal Range End N/A
Temporal Resolution N/A
Bounding Box North Lat N/A
Bounding Box South Lat N/A
Bounding Box West Long N/A
Bounding Box East Long N/A
Spatial Representation N/A
Spatial Resolution N/A
Related Links N/A
Additional Information N/A
Resource Format PDF
Standardized Resource Format PDF
Asset Size N/A
Legal Constraints

Copyright Author(s). This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

Access Constraints None
Software Implementation Language N/A

Resource Support Name N/A
Resource Support Email
Resource Support Organization UCAR/NCAR - Library
Distributor N/A
Metadata Contact Name N/A
Metadata Contact Email
Metadata Contact Organization UCAR/NCAR - Library

Author Simonson, Peter
Hammerling, Dorit
Publisher UCAR/NCAR - Library
Publication Date 2018-08-22T00:00:00
Digital Object Identifier (DOI) Not Assigned
Alternate Identifier N/A
Resource Version N/A
Topic Category geoscientificInformation
Progress N/A
Metadata Date 2023-08-18T18:06:43.351726
Metadata Record Identifier edu.ucar.opensky::technotes:564
Metadata Language eng; USA
Suggested Citation Simonson, Peter, Hammerling, Dorit. (2018). Refactoring Data-Driven Model Selection Code for Improvements in Interpretability, Generality, and Computational Expense. UCAR/NCAR - Library. Accessed 29 September 2023.

Harvest Source