Abstract
Several studies show the difficulty experienced for the reuse of the ever increasing amount of genomic data. Initiatives are being created to mitigate this concern; one of the most well-known is the FAIR Data Principles. Nonetheless, the related works are too generic and do not describe simultaneously and properly the human and machine perspectives of the FAIRness of databases. Hence, in order to bridge this gap, our paper introduces an approach named the Bio FAIR Evaluator Framework, a semiautomated tool aimed to analyze the FAIRness of genomic databases. Furthermore, we performed experiments that analyzed selected genomic databases according to two orthogonal and complementary perspectives (human and machine). The approach uses standardized FAIR metrics and generates recommendation reports to researchers indicating how to enhance the FAIRness of databases. Our findings, when compared with related works, show the feasibility of the approach, indicating that the current genomic databases are poorly compliant with FAIR Principles.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Paquetto, I., Ranfles, B., Borgman, C.: On the reuse of scientific data. Data Sci. J. 16, 8 (2017)
Hey, T., Trefethen, A.: The Data Deluge: an e-science Perspective.Ā Grid Computing: Making the Global Infrastructure a Reality, pp. 809ā824. Wiley-Blackwell (2003)
Bayat, A.: Bioinformatics: science, medicine, and the future: bioinformatics. Br. Med. J. 324(7344), 1018ā1022 (2002)
Cook, C., Bergman, M., Finn, R., Cochrane, G., et al.: The European bioinformatics institute in 2016: data growth and integration. Nucl. Acids Res. 44(D1), D20āD26 (2016)
GonƧalves, R., Musen, M.: The variable quality of metadata about biological samples used in biomedical experiments. Sci. Data 6, 190021 (2019)
Wilkinson, M., Dumontier, M., Aalbersberg, I., Apleton, G., Axt, M., et al.: The FAIR guiding principles for scientific data management and stewardship. Sci. Data 3, 160018 (2016)
Navarro, F., Mohsen, H., Yan, C., Li, S., Gu, M., Meyerson, W.: Genomics and data science: an application within an umbrella. Genome Biol. 20, 109 (2019)
Wallis, J.: Data producers courting data reusers: two cases from modeling communities. Int. J. Digit. Curation 9(1) (2014)
Stephens, Z., Lee, S., Faghri, F., Campbell, R., Zhai, C., Efron, M., et al.: Big data: astronomical or genomical. PLoS Biol. 13(7), e1002195 (2015)
Re3Data. https://www.re3data.org. Last accessed 06 June 2020
Bouadjenek, M., Verspoor, K., Zobel, Z.: Automated detection of records in biological sequence databases that are inconsistent with the literature. J. Biomed. Inform. 71, 229ā240 (2017)
Miron, L., GonƧalves, R., Musen, M.: Obstacles to the Reuse of Study Metadata in Clini-calTrials.gov. BioRxiv (2020)
GonƧalves, R., Connor, M., Romero, M., Graybeal, J., Musen, M.: Metadata in the biosample online repository are impaired by numerous anomalies. In: 1st Workshop on Enabling Open Semantic Science, pp. 39ā46 (2017)
Mons, B., Cameron, N., et al.: Cloudy, increasingly FAIR: revisiting the FAIR data guiding principles for the European open science cloud. Info. Serv. Use 37(1), 49ā56 (2017)
Wilkinson, M., Dumontier, M., Sansone, S., Santos, B., et al.: Evaluating FAIR maturity through a scalable, automated, community-governed framework. Sci. Data 6(174), 1ā12 (2019)
Dunning, A., Smaele, M., Bohmer, J.: Are the fair data principles fair? Int. J. Digit. Curation 12(2), 177ā195 (2017)
Bahim, C., Dekkers, M., Wyns, B.: Results of an Analysis of existing FAIR assessment tools. RDA (2019)
Acknowledgements
This study was financed in part by the National Council for Scientific and Technological Development (CNPq), Programa de EducaĆ§Ć£o Tutorial (PET) and CoordenaĆ§Ć£o de AperfeiƧoamento de Pessoal de NĆvel Superior ā Brasil (CAPES) ā Finance Code 001.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
Ā© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
FeijoĆ³, M.P.P., Jardim, R., da Cruz, S.M.S., Campos, M.L.M. (2020). Evaluating FAIRness of Genomic Databases. In: Grossmann, G., Ram, S. (eds) Advances in Conceptual Modeling. ER 2020. Lecture Notes in Computer Science(), vol 12584. Springer, Cham. https://doi.org/10.1007/978-3-030-65847-2_12
Download citation
DOI: https://doi.org/10.1007/978-3-030-65847-2_12
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-65846-5
Online ISBN: 978-3-030-65847-2
eBook Packages: Computer ScienceComputer Science (R0)