smyRNA: A Novel Ab Initio ncRNA Gene Finder
No Thumbnail Available
Date
2009
Advisor
Journal Title
Journal ISSN
Volume Title
Publisher
Public Library of Science (PLOS)
Abstract
Background
Non-coding RNAs (ncRNAs) have important functional roles in the cell: for example, they regulate gene expression by means of establishing stable joint structures with target mRNAs via complementary sequence motifs. Sequence motifs are also important determinants of the structure of ncRNAs. Although ncRNAs are abundant, discovering novel ncRNAs on genome sequences has proven to be a hard task; in particular past attempts for ab initio ncRNA search mostly failed with the exception of tools that can identify micro RNAs.
Methodology/Principal Findings
We present a very general ab initio ncRNA gene finder that exploits differential distributions of sequence motifs between ncRNAs and background genome sequences.
Conclusions/Significance
Our method, once trained on a set of ncRNAs from a given species, can be applied to a genome sequences of other organisms to find not only ncRNAs homologous to those in the training set but also others that potentially belong to novel (and perhaps unknown) ncRNA families. Availability: http://compbio.cs.sfu.ca/taverna/smyrna
Description
© 2009 Salari et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Keywords
non-coding RNA, genomics, RNA structure, sequence motif analysis, sequence alignment, multiple alignment calculation, gene prediction, genomic databases