Transcriptomic dataset for Sardina pilchardus: Assembly, annotation, and expression of nine tissues

Type Article
Date 2021-12
Language English
Author(s) Langa JorgeORCID1, Huret MartinORCID2, Montes Iratxe1, Conklin DarrellORCID3, 4, Estonba Andone1
Affiliation(s) 1 : Department of Genetics, Physical Anthropology and Animal Physiology, Faculty of Science and Technology, University of the Basque Country, UPV/EHU, Leioa, Bizkaia 48940, Spain
2 : IFREMER, STH/LBH, B.P. 70, Plouzané 29280, France
3 : Department of Computer Science and Artificial Intelligence, Faculty of Computer Science, University of the Basque Country UPV/EHU, San Sebastián, Spain
4 : IKERBASQUE, Basque Foundation for Science, Bilbao, Spain
Source Data In Brief (2352-3409) (Elsevier BV), 2021-12 , Vol. 39 , P. 107583 (7p.)
DOI 10.1016/j.dib.2021.107583
WOS© Times Cited 1
Keyword(s) Sardina pilchardus, European sardine, Transcriptome assembly, Annotation, Expression, Tissue quantification, Pathway, Gene ontology
Abstract

European sardine or pilchard is a planktonic small pelagic fish present from the North Sea in Europe to the coast of Senegal in the North of Africa, and across the Mediterranean sea to the Black Sea. Ecologically, sardines are an intermediary link in the trophic network, preying on plankton and being predated by larger fishes, marine mammals, and seabirds. This species is of great nutritional and economic value as a cheap but rich source of protein and fat. It is either consumed directly by humans or fed as fishmeal for aquaculture and farm animals. Despite its importance in the food basket, little is known about the molecular mechanisms involved in protein and lipid synthesis in this species. We collected nine tissues of Sardina pilchardus and reconstructed the transcriptome. In all, 198,597 transcripts were obtained, from which 68,031 are protein-coding. Quality assessment of the transcriptome was performed by back-mapping reads to the transcriptome and by searching for Single Copy Orthologs. Additionally, Gene Ontology and KEGG annotations were retrieved for most of the protein-coding genes. Finally, each library was quantified in terms of Transcripts per Million to disclose their expression patterns.

Full Text
File Pages Size Access
Publisher's official version 7 1 MB Open access
Top of the page