TY - GEN
T1 - Dynamic and parallel approaches to optimal evolutionary tree construction
AU - Bhattacharjee, Anupam
AU - Sultana, Kazi Zakia
AU - Shams, Zalia
PY - 2006/1/1
Y1 - 2006/1/1
N2 - Phylogenetic trees are commonly reconstructed based on hard optimization problems such as Maximum parsimony (MP) and Maximum likelihood (ML). Conventional MP heuristics for producing phylogenetic trees produce good solutions within reasonable time on small databases (up to a few thousand sequences) while ML heuristics are limited to smaller datasets (up to a few hundred sequences). However, since MP and ML are NP-hard, application of such approaches do not scale large datasets. In this paper, we present a promising divide-and-conquer technique, the TAZ method, to construct an evolutionary tree. The algorithm has been implemented and tested against five large biological datasets ranging from 5000-7000 sequences and dramatic speedup with significant improvement in accuracy (better than 94%), in comparison to existing approaches, has been obtained. Thus, high quality reconstruction can be obtained for large datasets by using this approach. Moreover, we present here another approach to construct the tree dynamically (when sequences come dynamically with partial information). Finally Combining the two approaches, we show parallel approaches to construct the tree when sequences are generated or obtained dynamically.
AB - Phylogenetic trees are commonly reconstructed based on hard optimization problems such as Maximum parsimony (MP) and Maximum likelihood (ML). Conventional MP heuristics for producing phylogenetic trees produce good solutions within reasonable time on small databases (up to a few thousand sequences) while ML heuristics are limited to smaller datasets (up to a few hundred sequences). However, since MP and ML are NP-hard, application of such approaches do not scale large datasets. In this paper, we present a promising divide-and-conquer technique, the TAZ method, to construct an evolutionary tree. The algorithm has been implemented and tested against five large biological datasets ranging from 5000-7000 sequences and dramatic speedup with significant improvement in accuracy (better than 94%), in comparison to existing approaches, has been obtained. Thus, high quality reconstruction can be obtained for large datasets by using this approach. Moreover, we present here another approach to construct the tree dynamically (when sequences come dynamically with partial information). Finally Combining the two approaches, we show parallel approaches to construct the tree when sequences are generated or obtained dynamically.
KW - Bioinformatics
KW - Evolutionary tree construction
KW - Parallel algorithm
KW - Shared memory model
UR - http://www.scopus.com/inward/record.url?scp=39049117347&partnerID=8YFLogxK
U2 - 10.1109/CCECE.2006.277582
DO - 10.1109/CCECE.2006.277582
M3 - Conference contribution
SN - 1424400384
SN - 9781424400386
T3 - Canadian Conference on Electrical and Computer Engineering
SP - 119
EP - 122
BT - 2006 Canadian Conference on Electrical and Computer Engineering, CCECE'06
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2006 Canadian Conference on Electrical and Computer Engineering, CCECE'06
Y2 - 7 May 2006 through 10 May 2006
ER -