Sima, AC;
Dessimoz, C;
Stockinger, K;
Zahn-Zabal, M;
Mendes de Farias, T;
(2019)
A hands-on introduction to querying evolutionary relationships across multiple data sources using SPARQL [version 1; peer review: 1 approved, 2 approved with reservations].
F1000Research
, 8
, Article 1822. 10.12688/f1000research.21027.1.
Preview |
Text
Dessimoz_4b5718bd-8456-48f6-b82d-3dad922c2f3d_21027_-_tarcisio_mendes.pdf - Published Version Download (1MB) | Preview |
Abstract
The increasing use of Semantic Web technologies in the life sciences, in particular the use of the Resource Description Framework (RDF) and the RDF query language SPARQL, opens the path for novel integrative analyses, combining information from multiple sources. However, analyzing evolutionary data in RDF is not trivial, due to the steep learning curve required to understand both the data models adopted by different RDF data sources, as well as the SPARQL query language. In this article, we provide a hands-on introduction to querying evolutionary data across multiple sources that publish orthology information in RDF, namely: The Orthologous MAtrix (OMA), the European Bioinformatics Institute (EBI) RDF platform, the Database of Orthologous Groups (OrthoDB) and the Microbial Genome Database (MBGD). We present four protocols in increasing order of complexity. In these protocols, we demonstrate through SPARQL queries how to retrieve pairwise orthologs, homologous groups, and hierarchical orthologous groups. Finally, we show how orthology information in different sources can be compared, through the use of federated SPARQL queries.
Type: | Article |
---|---|
Title: | A hands-on introduction to querying evolutionary relationships across multiple data sources using SPARQL [version 1; peer review: 1 approved, 2 approved with reservations] |
Open access status: | An open access version is available from UCL Discovery |
DOI: | 10.12688/f1000research.21027.1 |
Publisher version: | https://doi.org/10.12688/f1000research.21027.1 |
Language: | English |
Additional information: | © 2019 Sima AC et al. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/). |
Keywords: | Orthology, Comparative Genomics, Sequence Homology, Resource Description Framework (RDF), SPARQL |
UCL classification: | UCL UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences > Div of Biosciences UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences > Div of Biosciences > Genetics, Evolution and Environment |
URI: | https://discovery.ucl.ac.uk/id/eprint/10085118 |
Archive Staff Only
View Item |