TY - GEN

T1 - Algorithms and software for calculating and visualizing the cardinality of the grand bounding ball by means of intersection trees

AU - Barton, Robert

AU - Zhang, Sen

AU - Herbert, Katherine G.

PY - 2008

Y1 - 2008

N2 - While it is straightforward to calculate the bounding ball of a partially resolved phylogenetic tree, the calculation of the cardinality of the grand bounding ball for multiple partially resolved trees is very time-consuming due to the power set principle that governs the highly iterative calculation process. What can further complicate the calculation is that the practitioner has to possess domain knowledge in phylogenetic trees, subtree mining and discrete math at the same time. This paper presents the first efficient software to automate the process of calculating the cardinality of the grand bounding ball by combining a number of advanced data mining techniques and concepts including systematically growing intersection trees, which is specially defined for this work, level by level, cluster sets, Apriori, equivalence class, and set theory, and rooting unrooted trees etc. The algorithms are discussed in the the rooted tree scope first; then they are extended to the unrooted tree scope by using a novel directing method. In addition, a displayer component has been implemented to help visualize the complicated calculation process demanded by the problem. The software is expected to be useful in phylogenetic tree clustering research.

AB - While it is straightforward to calculate the bounding ball of a partially resolved phylogenetic tree, the calculation of the cardinality of the grand bounding ball for multiple partially resolved trees is very time-consuming due to the power set principle that governs the highly iterative calculation process. What can further complicate the calculation is that the practitioner has to possess domain knowledge in phylogenetic trees, subtree mining and discrete math at the same time. This paper presents the first efficient software to automate the process of calculating the cardinality of the grand bounding ball by combining a number of advanced data mining techniques and concepts including systematically growing intersection trees, which is specially defined for this work, level by level, cluster sets, Apriori, equivalence class, and set theory, and rooting unrooted trees etc. The algorithms are discussed in the the rooted tree scope first; then they are extended to the unrooted tree scope by using a novel directing method. In addition, a displayer component has been implemented to help visualize the complicated calculation process demanded by the problem. The software is expected to be useful in phylogenetic tree clustering research.

KW - Aprori

KW - Data mining

KW - Equivalence class

KW - Phylogenetic trees

KW - Software

UR - http://www.scopus.com/inward/record.url?scp=62649156842&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:62649156842

SN - 1601320620

SN - 9781601320629

T3 - Proceedings of the 2008 International Conference on Data Mining, DMIN 2008

SP - 529

EP - 535

BT - Proceedings of the 2008 International Conference on Data Mining, DMIN 2008

A2 - Stahlbock, R.

A2 - Crone, S.F.

A2 - Lessmann, S.

T2 - 2008 International Conference on Data Mining, DMIN 2008

Y2 - 14 July 2008 through 17 July 2008

ER -