Using Permutation Patterns for Content-Based Phylogeny

Md Enamul Karim²²,
Laxmi Parida²³ &
Arun Lakhotia²²

Part of the book series: Lecture Notes in Computer Science ((LNBI,volume 4146))

Included in the following conference series:

International Workshop on Pattern Recognition in Bioinformatics

695 Accesses
2 Citations

Abstract

When the same set of genes appear in different orders on the chromosomes, they form a permutation pattern. Permutation patterns have been used to identify potential haplogroups in mammalian data [8]. They also have been successfully used to detect phylogenetic relationships between computer viruses [9]. In this paper we explore the use of these patterns as a content similarity measure and use this in inferring phylogenies from genome rearrangement data in polynomial time. The method uses a function of the cardinality of the set of common maximal permutation patterns as a proxy for evolutionary “proximity” between genomes. We introduce Pi-logen, a phylogeny tool based on this method. We summarize results of feasibility study for this scheme on synthetic data by (1) content verification and (2) ancestor prediction. We also successfully infer phylogenies on series of synthetic data and on chloroplast gene order of Campanulaceae data.

Download to read the full chapter text

Chapter PDF

When and How the Perfect Phylogeny Model Explains Evolution

Exploring phylogenetic hypotheses via Gibbs sampling on evolutionary networks

Article Open access 11 November 2016

Genome-Wide Comparative Analysis of Phylogenetic Trees: The Prokaryotic Forest of Life

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Kececioglu, J., Sankoff, D.: Efficient bounds for oriented chromosome inversion distance. In: Crochemore, M., Gusfield, D. (eds.) CPM 1994. LNCS, vol. 807, pp. 307–325. Springer, Heidelberg (1994)
Google Scholar
Blanchette, M., Bourque, G., Sankoff, D.: Breakpoint phylogenies. In: Miyano, S., Takagi, T. (eds.) In Genome Informatics Workshop (GIW 1997), pp. 25–34. University Academy Press, Tokyo (1997)
Google Scholar
Bourque, G., Pevzner, P.A.: Genome-Scale Evolution: Reconstructing Gene Orders in the Ancestral Species. Genome Research 12(1), 26–36 (2002)
Google Scholar
Caprara, A.: Formulations and complexity of multiple sorting by reversals. In: Istrail, S., et al. (eds.) Proceedings of the Third Annual International Conference on Computational Molecular Biology RECOMB, pp. 84–93. ACM Press, Lyon, France (1999)
Chapter Google Scholar
Cosner, M.E., et al.: An Empirical Comparison of Phylogenetic Methods on Chloroplast Gene Order Data in Campanulaceae. In: Sankoff, D., Nadeau, J. (eds.) Comparative Genomics: Empirical and Analytical Approaches to Gene Order Dynamics, Map Alignment, and the Evolution of Gene Families, pp. 99–121. Kluwer Academic Publishers, Dordrecht (2000)
Google Scholar
Eres, R., Landau, G.M., Parida, L.: Combinatorial approach to automatic discovery of cluster patterns. In: Benson, G., Page, R.D.M. (eds.) WABI 2003. LNCS (LNBI), vol. 2812, pp. 139–150. Springer, Heidelberg (2003)
Chapter Google Scholar
Kauffman, L., Rousseeuw, P.: Finding Groups in Data: An Introduction to Cluster Analysis (1990)
Google Scholar
Landau, G.M., Parida, L., Weimann, O.: Using PQ Trees for Comparative Genomics. In: Apostolico, A., Crochemore, M., Park, K. (eds.) CPM 2005. LNCS, vol. 3537, pp. 128–143. Springer, Heidelberg (2005)
Chapter Google Scholar
Karim, M.E., Walenstein, A., Lakhotia, A., Parida, L.: Malware phylogeny generation using permutations of code. European Journal of Computer Virology 1, 1–11 (2005)
Article Google Scholar
Nadeau, J., Taylor, B.: Lengths of chromosomal segments conserved since divergence of man and mouse. Proc. Natl.Acad. Sci. PNAS 81, 814–818 (1984)
Article Google Scholar
Watterson, G., Ewens, W., Hall, T., Morgan, A.: The chromosome inversion problem. J. Theor. Biol. 99, 1–7 (1982)
Article Google Scholar
Peer, I., Shamir, R.: The median problems for breakpoints are NP-complete, Electronic Colloquium on Computational Complexity Technical Report 98-071 (1998), http://www.eccc.uni-trier.de/eccc
Hannenhalli, S., Pevzner, P.: Transforming cabbage into turnip (polynomial algorithm for sorting signed permutations by reversals). In: Proc. of the 27th Annual Symposium on Theory of Computing STOC, pp. 178–189 (1995)
Google Scholar
Sankoff, D.: Edit distance for genome comparison based on non-local operations. In: Apostolico, A., Galil, Z., Manber, U., Crochemore, M. (eds.) CPM 1992. LNCS, vol. 644, pp. 121–135. Springer, Heidelberg (1992)
Google Scholar
Berman, P., Hannenhalli, S.: Fast sorting by reversal. In: Hirschberg, D.S., Meyers, G. (eds.) CPM 1996. LNCS, vol. 1075, pp. 168–185. Springer, Heidelberg (1996)
Google Scholar
Kaplan, H., Shamir, R., Tarjan, R.: Faster and simpler algorithm for sorting signed permutations by reversals. In: Proc. of the 8th Annual ACM-SIAM Symposium on Discrete Algorithms SODA, pp. 344–351 (1997)
Google Scholar

Download references

Author information

Authors and Affiliations

Center for Advanced Computer Studies, University of Louisiana at Lafayette, USA
Md Enamul Karim & Arun Lakhotia
Computational Biology Center, IBM T J Watson Research Center, Yorktown Heights, USA
Laxmi Parida

Authors

Md Enamul Karim
View author publications
You can also search for this author in PubMed Google Scholar
Laxmi Parida
View author publications
You can also search for this author in PubMed Google Scholar
Arun Lakhotia
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Singapore-MIT Alliance, 50 Nanyang Avenue, N2-B2C-15, Singapore
Jagath C. Rajapakse
School of Computing, National University of Singapore, Singapore
Limsoon Wong
Computer Science and Engineering, The Penn State University, USA
Raj Acharya

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Karim, M.E., Parida, L., Lakhotia, A. (2006). Using Permutation Patterns for Content-Based Phylogeny. In: Rajapakse, J.C., Wong, L., Acharya, R. (eds) Pattern Recognition in Bioinformatics. PRIB 2006. Lecture Notes in Computer Science(), vol 4146. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11818564_13

Download citation

DOI: https://doi.org/10.1007/11818564_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-37446-6
Online ISBN: 978-3-540-37447-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

The International Association for Pattern Recognition (opens in a new tab)

Using Permutation Patterns for Content-Based Phylogeny

Abstract

Chapter PDF

Similar content being viewed by others

When and How the Perfect Phylogeny Model Explains Evolution

Exploring phylogenetic hypotheses via Gibbs sampling on evolutionary networks

Genome-Wide Comparative Analysis of Phylogenetic Trees: The Prokaryotic Forest of Life

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Societies and partnerships

Navigation

Using Permutation Patterns for Content-Based Phylogeny

Abstract

Chapter PDF

Similar content being viewed by others

When and How the Perfect Phylogeny Model Explains Evolution

Exploring phylogenetic hypotheses via Gibbs sampling on evolutionary networks

Genome-Wide Comparative Analysis of Phylogenetic Trees: The Prokaryotic Forest of Life

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Societies and partnerships

Search

Navigation