FN Archimer Export Format PT J TI OceanSODA-MDB: a standardised surface ocean carbonate system dataset for model-data intercomparisons BT AF Land, Peter Edward Findlay, Helen S. Shutler, Jamie D. Piolle, Jean-Francois Sims, Richard Green, Hannah Kitidis, Vassilis Polukhin, Alexander Pipko, Irina I. AS 1:1;2:1;3:2;4:3;5:2;6:1,2;7:1;8:4;9:5; FF 1:;2:;3:;4:PDG-ODE-LOPS-SIAM;5:;6:;7:;8:;9:; C1 Plymouth Marine Laboratory, Prospect Place, West Hoe, Plymouth, PL1 3DH, UK University of Exeter, Centre for Geography and Environmental Science, Penryn, Cornwall. TR10 9FE IFREMER, Brest, France Shirshov Institute of Oceanology, 36, Nakhimovskiy prospect, Moscow, 117997, Russia V.I. Il’ichev Pacific Oceanological Institute FEB RAS, Vladivostok, 690041, Russia C2 PML, UK UNIV EXETER, UK IFREMER, FRANCE PP SHIRSHOV OCEANOL INST, RUSSIA RUSSIAN ACAD SCI, RUSSIA SI BREST SE PDG-ODE-LOPS-SIAM UM LOPS IN WOS Ifremer UMR DOAJ copubli-europe copubli-int-hors-europe IF 11.4 TC 4 UR https://archimer.ifremer.fr/doc/00787/89867/95341.pdf https://archimer.ifremer.fr/doc/00787/89867/100104.pdf LA English DT Article CR OISO - OCÉAN INDIEN SERVICE D'OBSERVATION AB In recent years, large datasets of in situ marine carbonate system parameters (partial pressure of CO2 (pCO2), total alkalinity, dissolved inorganic carbon and pH) have been collated, quality controlled and made publicly available. These carbonate system datasets have highly variable data density in both space and time, especially in the case of pCO2, which is routinely measured at high frequency using underway measuring systems. This variation in data density can create biases when the data are used, for example for algorithm assessment, favouring datasets or regions with high data density. A common way to overcome data density issues is to bin the data into cells of equal latitude and longitude extent. This leads to bins with spatial areas that are latitude and projection dependent (e. g. become smaller and more elongated as the poles are approached). Additionally, as bin boundaries are defined without reference to the spatial distribution of the data or to geographical features, data clusters may be divided sub-optimally (e. g. a bin covering a region with a strong gradient). To overcome these problems and to provide a tool for matching surface in situ data with satellite, model and climatological data, which often have very different spatiotemporal scales both from the in situ data and from each other, a methodology has been created to group in situ data into ‘regions of interest’: spatiotemporal cylinders consisting of circles on the Earth’s surface extending over a period of time. These regions of interest are optimally adjusted to contain as many in situ measurements as possible. All surface in situ measurements of the same parameter contained in a region of interest are collated, including estimated uncertainties and regional summary statistics. The same grouping is applied to each of the non-in situ datasets in turn, producing a dataset of coincident matchups that are consistent in space and time. About 35 million in situ data points were matched with data from five satellite sources and five model and re-analysis datasets to produce a global matchup dataset of carbonate system data, consisting of ~286,000 regions of interest spanning 54 years from 1957 to 2020. Each region of interest is 100 km in diameter and 10 days in duration. An example application, the reparameterisation of a global total alkalinity algorithm, is shown. This matchup dataset can be updated as and when in situ and other datasets are updated, and similar datasets at finer spatiotemporal scale can be constructed, for example to enable regional studies. The matchup dataset provides users with a large multiparameter carbonate system dataset containing data from different sources, in one consistent, collated and standardised format suitable for model-data intercomparisons and model evaluations. The OceanSODA-MDB data can be downloaded from https://doi.org/10.12770/0dc16d62-05f6-4bbe-9dc4-6d47825a5931 (Land and Piollé, 2022). PY 2023 PD FEB SO Earth System Science Data SN 1866-3508 PU Copernicus Publications VL 15 IS 2 UT 000940816100001 BP 921 EP 947 DI 10.5194/essd-15-921-2023 ID 89867 ER EF