Neural network emulation of the formation of organic aerosols based on the explicit GECKO‐A chemistry model

Secondary organic aerosols (SOA) are formed from oxidation of hundreds of volatile organic compounds (VOCs) emitted from anthropogenic and natural sources. Accurate predictions of this chemistry are key for air quality and climate studies due to the large contribution of organic aerosols to submicron aerosol mass. Currently, only explicit models, such as the Generator for Explicit Chemistry and Kinetics of Organics in the Atmosphere (GECKO-A), can fully represent the chemical processing of thousands of organic species. However, their extreme computational cost prohibits their use in current chemistry-climate models, which rely on simplified empirical parameterizations to predict SOA concentrations. This study demonstrates that machine learning can accurately emulate SOA formation from an explicit chemistry model with an approximate error of 2%-8%, up to five days for several precursors and for potentially up to one month for recurrent neural network models, and with 100 to 100,000 times speedup over GECKO-A, making it computationally useable in a chemistry-climate model. We generated the training data using thousands of GECKO-A box simulations sampled from a broad range of initial environmental conditions, and focused on three representative SOA precursors: the oxidation by OH of two anthropogenic (toluene, dodecane), and the oxidation by O-3 of one biogenic VOC (alpha-pinene). We compare several neural models and quantify their underlying uncertainty and robustness. These are promising results, suggesting that neural network models could be applied to predict SOA in chemistry-climate models, limited however to the range of environmental conditions that were considered in the training datasets.

To Access Resource:

Questions? Email Resource Support Contact:

  • opensky@ucar.edu
    UCAR/NCAR - Library

Resource Type publication
Temporal Range Begin N/A
Temporal Range End N/A
Temporal Resolution N/A
Bounding Box North Lat N/A
Bounding Box South Lat N/A
Bounding Box West Long N/A
Bounding Box East Long N/A
Spatial Representation N/A
Spatial Resolution N/A
Related Links

Related Dataset #1 : Data sets used in "Neural network emulation of the formation of organic aerosols based on the explicit GECKO-A chemistry model"

Additional Information N/A
Resource Format PDF
Standardized Resource Format PDF
Asset Size N/A
Legal Constraints

Copyright author(s). This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.


Access Constraints None
Software Implementation Language N/A

Resource Support Name N/A
Resource Support Email opensky@ucar.edu
Resource Support Organization UCAR/NCAR - Library
Distributor N/A
Metadata Contact Name N/A
Metadata Contact Email opensky@ucar.edu
Metadata Contact Organization UCAR/NCAR - Library

Author Schreck, John S.
Becker, Charles
Gagne, David John
Lawrence, Keely
Wang, Siyuan
Mouchel‐Vallon, Camille
Choi, Jinkyul
Hodzic, Alma
Publisher UCAR/NCAR - Library
Publication Date 2022-10-18T00:00:00
Digital Object Identifier (DOI) Not Assigned
Alternate Identifier N/A
Resource Version N/A
Topic Category geoscientificInformation
Progress N/A
Metadata Date 2023-08-18T18:20:04.088464
Metadata Record Identifier edu.ucar.opensky::articles:25818
Metadata Language eng; USA
Suggested Citation Schreck, John S., Becker, Charles, Gagne, David John, Lawrence, Keely, Wang, Siyuan, Mouchel‐Vallon, Camille, Choi, Jinkyul, Hodzic, Alma. (2022). Neural network emulation of the formation of organic aerosols based on the explicit GECKO‐A chemistry model. UCAR/NCAR - Library. http://n2t.net/ark:/85065/d7rf5ztp. Accessed 27 June 2025.

Harvest Source