Increasing the reproducibility and replicability of supervised AI/ML in the Earth systems science by leveraging social science methods

Artificial intelligence (AI) and machine learning (ML) pose a challenge for achieving science that is both reproducible and replicable. The challenge is compounded in supervised models that depend on manually labeled training data, as they introduce additional decision-making and processes that require thorough documentation and reporting. We address these limitations by providing an approach to hand labeling training data for supervised ML that integrates quantitative content analysis (QCA)-a method from social science research. The QCA approach provides a rigorous and well-documented hand labeling procedure to improve the replicability and reproducibility of supervised ML applications in Earth systems science (ESS), as well as the ability to evaluate them. Specifically, the approach requires (a) the articulation and documentation of the exact decision-making process used for assigning hand labels in a "codebook" and (b) an empirical evaluation of the reliability" of the hand labelers. In this paper, we outline the contributions of QCA to the field, along with an overview of the general approach. We then provide a case study to further demonstrate how this framework has and can be applied when developing supervised ML models for applications in ESS. With this approach, we provide an actionable path forward for addressing ethical considerations and goals outlined by recent AGU work on ML ethics in ESS.

To Access Resource:

Questions? Email Resource Support Contact:

  • opensky@ucar.edu
    UCAR/NCAR - Library

Resource Type publication
Temporal Range Begin N/A
Temporal Range End N/A
Temporal Resolution N/A
Bounding Box North Lat N/A
Bounding Box South Lat N/A
Bounding Box West Long N/A
Bounding Box East Long N/A
Spatial Representation N/A
Spatial Resolution N/A
Related Links

Related ConferencePaper #1 : Validation Methodology for Expert-Annotated Datasets: Event Annotation Case Study

Related Dataset #1 : Quantitative Content Analysis Data for Hand Labeling Road Surface Conditions in New York State Department of Transportation Camera Images

Related Other #1 : Trust and trustworthiness codebook for content analysis: An example from NWS Forecaster interviews about AI

Additional Information N/A
Resource Format PDF
Standardized Resource Format PDF
Asset Size N/A
Legal Constraints

Copyright author(s). This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.


Access Constraints None
Software Implementation Language N/A

Resource Support Name N/A
Resource Support Email opensky@ucar.edu
Resource Support Organization UCAR/NCAR - Library
Distributor N/A
Metadata Contact Name N/A
Metadata Contact Email opensky@ucar.edu
Metadata Contact Organization UCAR/NCAR - Library

Author Wirz, Christopher D.
Sutter, C.
Demuth, Julie L.
Mayer, Kirsten
Chapman, William
Cains, Mariana
Radford, Jacob
Przybylo, V.
Evans, A.
Martin, T.
Gaudet, L. C.
Sulia, K.
Bostrom, A.
Gagne, David John
Bassill, N.
Schumacher, Andrea
Thorncroft, C.
Publisher UCAR/NCAR - Library
Publication Date 2024-07-01T00:00:00
Digital Object Identifier (DOI) Not Assigned
Alternate Identifier N/A
Resource Version N/A
Topic Category geoscientificInformation
Progress N/A
Metadata Date 2025-07-10T20:00:48.677701
Metadata Record Identifier edu.ucar.opensky::articles:27339
Metadata Language eng; USA
Suggested Citation Wirz, Christopher D., Sutter, C., Demuth, Julie L., Mayer, Kirsten, Chapman, William, Cains, Mariana, Radford, Jacob, Przybylo, V., Evans, A., Martin, T., Gaudet, L. C., Sulia, K., Bostrom, A., Gagne, David John, Bassill, N., Schumacher, Andrea, Thorncroft, C.. (2024). Increasing the reproducibility and replicability of supervised AI/ML in the Earth systems science by leveraging social science methods. UCAR/NCAR - Library. https://n2t.org/ark:/85065/d7dv1q3w. Accessed 11 August 2025.

Harvest Source