The Center for Education and Research in Information Assurance and Security (CERIAS)

The Center for Education and Research in
Information Assurance and Security (CERIAS)

Discovering Consensus Patterns in Biological Databases

Author

MY ElTabakh, WG Aref, M Ouzzani, MH Ali

Entry type

inbook

Abstract

Consensus patterns, like motifs and tandem repeats, are highly conserved patterns with very few substitutions where no gaps are allowed. In this paper, we present a progressive hierarchical clustering technique for discovering consensus patterns in biological databases over a certain length range. This technique can discover consensus patterns with various requirements by applying a post-processing phase. The progressive nature of the hierarchical clustering algorithm makes it scalable and efficient. Experiments to discover motifs and tandem repeats on real biological databases show significant performance gain over non-progressive clustering techniques.

Date

2006

Booktitle

Data Mining and Bioinformatics

Key alpha

Aref

Pages

170-184

Publisher

Springer Berlin / Heidelberg

Series

Lecture Notes in Computer Science

Volume

4316

Publication Date

2006-00-00

BibTex-formatted data

To refer to this entry, you may select and copy the text below and paste it into your BibTex document. Note that the text may not contain all macros that BibTex supports.