FN Archimer Export Format PT J TI Data‐Driven Modeling of the Distribution of Diazotrophs in the Global Ocean BT AF Tang, Weiyi Cassar, Nicolas AS 1:1;2:1,2; FF 1:;2:; C1 Division of Earth and Ocean Sciences, Nicholas School of the EnvironmentDuke University Durham NC USA Laboratoire des Sciences de l'Environnement Marin, UMR 6539 UBO/CNRS/IRD/IFREMER Institut Universitaire Européen de la Mer Brest, France C2 UNIV DURHAM, USA UBO, FRANCE UM LEMAR IN WOS Cotutelle UMR copubli-int-hors-europe IF 4.497 TC 37 UR https://archimer.ifremer.fr/doc/00591/70322/68359.pdf https://archimer.ifremer.fr/doc/00591/70322/68360.pdf https://archimer.ifremer.fr/doc/00591/70322/68362.xlsx https://archimer.ifremer.fr/doc/00591/70322/68363.gif https://archimer.ifremer.fr/doc/00591/70322/68364.gif https://archimer.ifremer.fr/doc/00591/70322/68365.gif https://archimer.ifremer.fr/doc/00591/70322/68367.gif LA English DT Article DE ;diazotrophs;marine nitrogen fixation;meta-analysis;machine learning AB Diazotrophs play a critical role in the biogeochemical cycling of nitrogen, carbon, and other elements in the global ocean. Despite their well‐recognized role, the diversity, abundance, and distribution of diazotrophs in the world's ocean remain poorly characterized largely due to limited observations. Here we update the database of diazotroph nifH gene abundances and assess how environmental factors may regulate diazotrophs at the global scale. Our meta‐analysis more than doubles the number of observations in the previous database. Using linear and nonlinear regressions, we find that the abundances of Trichodesmium, UCYN‐A, UCYN‐B, and Richelia relate differently to temperature, light, and nutrients. We further apply a random forest algorithm to estimate the global distributions of these diazotrophic groups, identifying undersampled potential hot spots of diazotrophy in the South Atlantic and southern Indian Ocean, and in coastal waters. The distinct ecophysiologies of diazotrophs highlighted here argue for separate parameterizations of different diazotrophs in model simulations. Plain Language Summary Microbial communities drive the cycling of critical elements like carbon and nitrogen in the ocean. By converting N2 into more bioavailable nitrogen, diazotrophs alleviate nitrogen limitation and support primary production. Despite their importance, their distributions are poorly characterized in great part due to limited observations. Here we compile from the literature observations to update the global database of marine diazotrophs. We also assess how the abundance and distribution of different types of diazotrophs at the global scale relate to environmental factors, including temperature, depth, and nutrients. Finally, we use a random forest machine learning method to predict the distribution of different types of diazotrophs in the world's ocean. Our results highlight the need for observations over broader oceanic regimes and a more granular representation of diazotrophy in models. PY 2019 PD NOV SO Geophysical Research Letters SN 0094-8276 PU American Geophysical Union (AGU) VL 46 IS 21 UT 000496183000001 BP 12258 EP 12269 DI 10.1029/2019GL084376 ID 70322 ER EF