Issue February 2006No. 4 (p 357-484) February 2006 ISSN 0739-110 ?Genomemark?: Detecting Word Periodicity in Biological Sequences (p. 457-464)Identifying and predicting the structural characteristics of novel repeats throughout the genome can lend insight into biological function. Specific repeats are believed to have biological significance as a function of their distribution patterns. We have developed ?GenomeMark,? a computer program that detects and statistically analyzes candidate repeats. Specifically, ?GenomeMark? identifies the periodic distribution of unique words, calculating their χ2 and Z-score values. Using ?GenomeMark,? we identified novel sequence words present in tandem throughout genomes. We found that these sequences have remarkable spacer sequence distributions and many were genome specific, validating the genome signature theory. Further analysis confirmed that many of these sequences have a specific biological function. The program is available from the authors upon request and is freely available for non-commercial and academic entities.
Key words: Computational Biology, Genome Rearrangements, Genome Signature, Genomics, Markov Chain, and Sequence Repeats. A. Fadiel1,* 1Yale University School of Medicine Subscription is more cost effective than purchasing PDFs on-the-fly. Click here for details. |