Multilevel comparative bioinformatics to investigate evolutionary relationships and specificities in gene annotations: an example for tomato and grapevine

Abstract

Background: “Omics” approaches may provide useful information for a deeper understanding of speciation events, diversification and function innovation. This can be achieved by investigating the molecular similarities at sequence level between species, allowing the definition of ortholog and paralog genes. However, the spreading of sequenced genome, often endowed with still preliminary annotations, requires suitable bioinformatics to be appropriately exploited in this framework. Results: We presented here a multilevel comparative approach to investigate on genome evolutionary relationships and peculiarities of two fleshy fruit species of relevant agronomic interest, Solanum lycopersicum (tomato) and Vitis vinifera (grapevine). We defined 17,823 orthology relationships between tomato and grapevine reference gene annotations. The resulting orthologs are associated with the detected paralogs in each species, permitting the definition of gene networks, useful to investigate the different relationships. The reconciliation of the compared collections in terms of an updating of the functional descriptions was also exploited. All the results were made accessible in ComParaLogs, a dedicated bioinformatics platform available at http://biosrv.cab.unina.it/comparalogs/gene/search. Conclusions: The aim of the work was to suggest a reliable approach to detect all similarities of gene loci between two species based on the integration of results from different levels of information, such as the gene, the transcript and the protein sequences, overcoming possible limits due to exclusive protein versus protein comparisons. This to define reliable ortholog and paralog genes, as well as species specific gene loci in the two species, overcoming limits due to the possible draft nature of preliminary gene annotations. Moreover, reconciled functional descriptions, as well as common or peculiar enzymatic classes and protein domains from tomato and grapevine, together with the definition of species-specific gene sets after the pairwise comparisons, contributed a comprehensive set of information useful to comparatively exploit the two species gene annotations and investigate on differences between species with climacteric and non-climacteric fruits. In addition, the definition of networks of ortholog genes and of associated paralogs, and the organization of web-based interfaces for the exploration of the results, defined a friendly computational bench-work in support of comparative analyses between two species.

Document Type

Article

Document version

Published version

Language

English

Pages

17

Publisher

BMC

Published in

BMC Bioinformatics

Grant Agreement Number

EC/FP7/289220/EU/Pollen thermotolerance and crop fertility/SPOT-ITN

EC/COST/FA1106/EU/Developmental mechanisms controlling fleshy fruit quality in tomato and grapevine/QualityFruit

Recommended citation

Ambrosino, Luca, Valentino Ruggieri, Hamed Bostan, Marco Miralto, Nicola Vitulo, Mohamed Zouine, and Amalia Barone et al. 2018. "Multilevel Comparative Bioinformatics To Investigate Evolutionary Relationships And Specificities In Gene Annotations: An Example For Tomato And Grapevine". BMC Bioinformatics 19 (S15). Springer Nature. doi:10.1186/s12859-018-2420-y.

Rights

Attribution 4.0 International

Attribution 4.0 International

This item appears in the following Collection(s)