Gene Tree Construction and Correction using SuperTree and Reconciliation

October 17, 2016 · Entered Twilight · IEEE/ACM Transactions on Computational Biology & Bioinformatics

TWILIGHT: Old Age
Predates the code-sharing era, a pioneer of its time

"Last commit was 9.0 years ago (≥5 year threshold)"

Evidence collected by the PWNC Scanner

Repo contents: LICENSE, SuperGeneTrees

Authors: Manuel Lafond, Cédric Chauve, Nadia El-Mabrouk, Aïda Ouangraoua
arXiv ID: 1610.05068
Category: cs.DS (Data Structures & Algorithms)
Citations: 11
Venue: IEEE/ACM Transactions on Computational Biology & Bioinformatics
Repository: https://github.com/UdeM-LBIT/SuGeT
Last Checked: 1 month ago
Abstract
The supertree problem, which asks for a tree displaying a set of consistent input trees, has been largely considered for the reconstruction of species trees. Here, we instead explore this framework for reconstructing a gene tree from a set of input gene trees on partial data. In this perspective, the phylogenetic tree for the species containing the genes of interest can be used to choose among the many possible compatible "supergenetrees", the most natural criterion being to minimize a reconciliation cost. We develop a variety of algorithmic solutions for the construction and correction of gene trees using the supertree framework. A dynamic programming supertree algorithm for constructing or correcting gene trees, exponential in the number of input trees, is first developed for the less constrained version of the problem. It is then adapted to gene trees with nodes labeled as duplication or speciation, the additional constraint being to preserve the orthology and paralogy relations between genes. Then, a quadratic-time algorithm is developed for efficiently correcting an initial gene tree while preserving a set of "trusted" subtrees, as well as the relative phylogenetic distance between them, for both labeled and unlabeled input trees. By applying these algorithms to the set of Ensembl gene trees, we show that this new correction framework is particularly useful for correcting weakly supported duplication nodes. The C++ source code for the algorithms and simulations described in the paper is available at https://github.com/UdeM-LBIT/SuGeT.
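To make the idea of choosing among candidate gene trees by reconciliation cost concrete, here is a minimal Python sketch. It is not the authors' C++ implementation or their dynamic programming algorithm. It assumes a simplified cost that counts only duplication nodes under the standard LCA mapping (losses are ignored), and the tree encoding (nested tuples), the function names, and the toy gene and species names are all hypothetical.

```python
def clusters(tree):
    """Return the leaf sets of all nodes of a nested-tuple tree, in post-order."""
    if isinstance(tree, str):
        return [frozenset([tree])]
    out, leaves = [], frozenset()
    for child in tree:
        sub = clusters(child)
        out.extend(sub)
        leaves |= sub[-1]  # the last entry is the child's own cluster
    out.append(leaves)
    return out


def lca(species_clusters, species_set):
    """Smallest species-tree cluster containing species_set, i.e. its LCA."""
    return min((c for c in species_clusters if species_set <= c), key=len)


def count_duplications(gene_tree, gene_to_species, species_clusters):
    """Return (LCA mapping of the root, number of duplication nodes)."""
    if isinstance(gene_tree, str):
        return frozenset([gene_to_species[gene_tree]]), 0
    child_maps, dups = [], 0
    for child in gene_tree:
        mapping, d = count_duplications(child, gene_to_species, species_clusters)
        child_maps.append(mapping)
        dups += d
    here = lca(species_clusters, frozenset().union(*child_maps))
    # A node is a duplication if it maps to the same species node as a child.
    if here in child_maps:
        dups += 1
    return here, dups


# Hypothetical toy data: species tree ((A,B),C) and four genes.
species_tree = (("A", "B"), "C")
gene_to_species = {"a1": "A", "a2": "A", "b1": "B", "c1": "C"}
sp_clusters = clusters(species_tree)

# Two candidate gene trees on the same gene set.
candidate_1 = ((("a1", "b1"), "a2"), "c1")
candidate_2 = ((("a1", "c1"), "b1"), "a2")

cost_1 = count_duplications(candidate_1, gene_to_species, sp_clusters)[1]
cost_2 = count_duplications(candidate_2, gene_to_species, sp_clusters)[1]
best = min([candidate_1, candidate_2],
           key=lambda t: count_duplications(t, gene_to_species, sp_clusters)[1])
```

With this toy input, the first candidate needs one duplication and the second needs two, so the first would be preferred under this simplified criterion. A fuller reconciliation cost would typically also account for gene losses.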