TY - JOUR
T1 - MiPred: Classification of real and pseudo microRNA precursors using random forest prediction model with combined features
AU - Jiang, P.
AU - Wu, Haonan
AU - Wang, Wenkai
AU - Ma, Wei
AU - Sun, Xiao
AU - Lu, Zuhong
PY - 2007/7/1
Y1 - 2007/7/1
N2 - To distinguish the real pre-miRNAs from other hairpin sequences with similar stem-loops (pseudo pre-miRNAs), a hybrid feature which consists of local contiguous structure-sequence composition, minimum of free energy (MFE) of the secondary structure and P-value of randomization test is used. Besides, a novel machine-learning algorithm, random forest (RF), is introduced. The results suggest that our method predicts at 98.21% specificity and 95.09% sensitivity. When compared with the previous study, Triplet-SVM-classifier, our RF method was nearly 10% greater in total accuracy. Further analysis indicated that the improvement was due to both the combined features and the RF algorithm. The MiPred web server is available at http://www.bioinf.seu.edu.cn/miRNA/. Given a sequence, MiPred decides whether it is a pre-miRNA-like hairpin sequence or not. If the sequence is a pre-miRNA-like hairpin, the RF classifier will predict whether it is a real premiRNA or a pseudo one. © 2007 The Author(s).
AB - To distinguish the real pre-miRNAs from other hairpin sequences with similar stem-loops (pseudo pre-miRNAs), a hybrid feature which consists of local contiguous structure-sequence composition, minimum of free energy (MFE) of the secondary structure and P-value of randomization test is used. Besides, a novel machine-learning algorithm, random forest (RF), is introduced. The results suggest that our method predicts at 98.21% specificity and 95.09% sensitivity. When compared with the previous study, Triplet-SVM-classifier, our RF method was nearly 10% greater in total accuracy. Further analysis indicated that the improvement was due to both the combined features and the RF algorithm. The MiPred web server is available at http://www.bioinf.seu.edu.cn/miRNA/. Given a sequence, MiPred decides whether it is a pre-miRNA-like hairpin sequence or not. If the sequence is a pre-miRNA-like hairpin, the RF classifier will predict whether it is a real premiRNA or a pseudo one. © 2007 The Author(s).
UR - https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=34547596338&origin=inward
UR - https://www.scopus.com/inward/citedby.uri?partnerID=HzOxMe3b&scp=34547596338&origin=inward
U2 - 10.1093/nar/gkm368
DO - 10.1093/nar/gkm368
M3 - Article
C2 - 17553836
SN - 0305-1048
VL - 35
JO - Nucleic Acids Research
JF - Nucleic Acids Research
IS - SUPPL.2
ER -