Copy this text
Estimates of Water-Column Nutrient Concentrations and Carbonate System Parameters in the Global Ocean: A Novel Approach Based on Neural Networks
A neural network-based method (CANYON: CArbonate system and Nutrients concentration from hYdrological properties and Oxygen using a Neural-network) was developed to estimate water-column biogeochemically relevant variables in the Global Ocean. These are the concentrations of 3 nutrients [nitrate (NO3−), phosphate (PO43−) and silicate (Si(OH)4)] and 4 carbonate system parameters [total alkalinity (AT), dissolved inorganic carbon (CT), pH (pHT) and partial pressure of CO2 (pCO2)], which are estimated from concurrent in situ measurements of temperature, salinity, hydrostatic pressure and oxygen (O2) together with sampling latitude, longitude and date. Seven neural-networks were developed using the GLODAPv2 database, which is largely representative of the diversity of open-ocean conditions, hence making CANYON potentially applicable to most oceanic environments. For each variable, CANYON was trained using 80 % randomly chosen data from the whole database (after eight 10° x 10° zones removed providing an “independent data-set” for additional validation), the remaining 20 % data were used for the neural-network test of validation. Overall, CANYON retrieved the variables with high accuracies (RMSE): 0.93 mol kg-1 (NO3−), 0.07 mol kg-1 (PO43-), 3.0 mol kg-1 (Si(OH)4), 0.019 (pHT), 7 mol kg-1 (AT), 10 mol kg-1 (CT) and 28 atm (pCO2). This was confirmed for the 8 independent zones not included in the training process. CANYON was also applied to the Hawaiian Time Series site to produce a 22-years long simulated time series for the above 7 variables. Comparison of modeled and measured data was also very satisfactory (RMSE in the order of magnitude of RMSE from validation test). CANYON is thus a promising method to derive distributions of key biogeochemical variables. It could be used for a variety of global and regional applications ranging from data quality control to the production of datasets of variables required for initialization and validation of biogeochemical models but difficult to obtain. In particular, combining the increased coverage of the global Biogeochemical-Argo program, where O2 is one of the core variables now very accurately measured, with the CANYON approach offers the fascinating perspective of obtaining large-scale estimates of key biogeochemical variables with unprecedented spatial and temporal resolutions.
Keyword(s)
neural network, nutrients, carbonate system, global ocean, GLODAPv2 database, profiling floats