Causal drivers of land‐atmosphere carbon fluxes from machine learning models and data

Interactions among atmospheric, root‐soil, and vegetation processes drive carbon dioxide fluxes ( Fc ) from land to atmosphere. Eddy covariance measurements are commonly used to measure Fc at sub‐daily timescales and validate process‐based and data‐driven models. However, these validations do not reveal process interactions, thresholds, and key differences in how models replicate them. We use information theory‐based measures to explore multivariate information flow pathways from forcing data to observed and modeled hourly Fc , using flux tower data sets in the Midwestern U.S. in intensively managed corn‐soybean landscapes. We compare multiple linear regressions, long‐short term memory, and random forests (RF), and evaluate how different model structures use information from combinations of sources to predict Fc . We extend a framework for model predictive and functional performance, which examines a suite of dependencies from all forcing variables to the observed or modeled target. Of the three model types, RF exhibited the highest functional and predictive performance, correctly capturing strong dependencies between radiation and temperature variables with Fc . Regionally trained models demonstrate lower predictive but higher functional performance compared to site‐specific models, suggesting superior reproduction of observed relationships at the expense of predictive accuracy. This study shows that some metrics of predictive performance encapsulate functional behaviors better than others, highlighting the need for multiple metrics of both types. This study improves our understanding of carbon fluxes in an intensively managed landscape, and more generally provides insight into how model structures and forcing variables translate to interactions that are well versus poorly captured in models.

To Access Resource:

Questions? Email Resource Support Contact:

  • opensky@ucar.edu
    UCAR/NCAR - Library

Resource Type publication
Temporal Range Begin N/A
Temporal Range End N/A
Temporal Resolution N/A
Bounding Box North Lat N/A
Bounding Box South Lat N/A
Bounding Box West Long N/A
Bounding Box East Long N/A
Spatial Representation N/A
Spatial Resolution N/A
Related Links

Related Preprint #1 : Sequence to Sequence Learning with Neural Networks

Additional Information N/A
Resource Format PDF
Standardized Resource Format PDF
Asset Size N/A
Legal Constraints

Copyright author(s). This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.


Access Constraints None
Software Implementation Language N/A

Resource Support Name N/A
Resource Support Email opensky@ucar.edu
Resource Support Organization UCAR/NCAR - Library
Distributor N/A
Metadata Contact Name N/A
Metadata Contact Email opensky@ucar.edu
Metadata Contact Organization UCAR/NCAR - Library

Author Mozhgan A. Farahani
Goodwell, A. E.
Publisher UCAR/NCAR - Library
Publication Date 2024-06-01T00:00:00
Digital Object Identifier (DOI) Not Assigned
Alternate Identifier N/A
Resource Version N/A
Topic Category geoscientificInformation
Progress N/A
Metadata Date 2025-07-10T20:01:33.522762
Metadata Record Identifier edu.ucar.opensky::articles:43279
Metadata Language eng; USA
Suggested Citation Mozhgan A. Farahani, Goodwell, A. E.. (2024). Causal drivers of land‐atmosphere carbon fluxes from machine learning models and data. UCAR/NCAR - Library. https://n2t.net/ark:/85065/d70z77qr. Accessed 02 August 2025.

Harvest Source