Issue February 2006

category image Volume 23
No. 4 (p 357-484)
February 2006
ISSN 0739-110

?Genomemark?: Detecting Word Periodicity in Biological Sequences (p. 457-464)

Identifying and predicting the structural characteristics of novel repeats throughout the genome can lend insight into biological function. Specific repeats are believed to have biological significance as a function of their distribution patterns. We have developed ?GenomeMark,? a computer program that detects and statistically analyzes candidate repeats. Specifically, ?GenomeMark? identifies the periodic distribution of unique words, calculating their χ2 and Z-score values. Using ?GenomeMark,? we identified novel sequence words present in tandem throughout genomes. We found that these sequences have remarkable spacer sequence distributions and many were genome specific, validating the genome signature theory. Further analysis confirmed that many of these sequences have a specific biological function. The program is available from the authors upon request and is freely available for non-commercial and academic entities.

Key words: Computational Biology, Genome Rearrangements, Genome Signature, Genomics, Markov Chain, and Sequence Repeats.

A. Fadiel1,*
K. D. Eichenbaum1
A. Hamza2,a

1Yale University School of Medicine
Yale Center for Research On Reproductive Biology
New Haven, CT 06511, USA
2Unite de Modelisation Moleculaire
Institut Pasteur de Tunis
13, Place Pasteur
1002 Tunis-Belvedere, Tunisia

aPresent address:
College of Pharmacy
University of Kentucky
Lexington, KY 40536, USA
*afadiel@yale.edu

Purchase Downloadable Full Text PDF of Articles

Corporate User

$100.00

University/Academic User

$50.00

Subscription is more cost effective than purchasing PDFs on-the-fly.  Click here for details.