YamiPred: A novel evolutionary method for predicting pre-miRNAs and selecting relevant features

Dimitrios A. Kleftogiannis, Konstantinos Theofilatos, Spiros Likothanassis, Seferina Mavroudi

Research output: Contribution to journalArticlepeer-review

18 Scopus citations

Abstract

MicroRNAs (miRNAs) are small non-coding RNAs, which play a significant role in gene regulation. Predicting miRNA genes is a challenging bioinformatics problem and existing experimental and computational methods fail to deal with it effectively. We developed YamiPred, an embedded classification method that combines the efficiency and robustness of Support Vector Machines (SVM) with Genetic Algorithms (GA) for feature selection and parameters optimization. YamiPred was tested in a new and realistic human dataset and was compared with state-of-the-art computational intelligence approaches and the prevalent SVM-based tools for miRNA prediction. Experimental results indicate that YamiPred outperforms existing approaches in terms of accuracy and of geometric mean of sensitivity and specificity. The embedded feature selection component selects a compact feature subset that contributes to the performance optimization. Further experimentation with this minimal feature subset has achieved very high classification performance and revealed the minimum number of samples required for developing a robust predictor. YamiPred also confirmed the important role of commonly used features such as entropy and enthalpy, and uncovered the significance of newly introduced features, such as %A-U aggregate nucleotide frequency and positional entropy. The best model trained on human data has successfully predicted pre-miRNAs to other organisms including the category of viruses.
Original languageEnglish (US)
Pages (from-to)1183-1192
Number of pages10
JournalIEEE/ACM Transactions on Computational Biology and Bioinformatics
Volume12
Issue number5
DOIs
StatePublished - Jan 23 2015

Fingerprint Dive into the research topics of 'YamiPred: A novel evolutionary method for predicting pre-miRNAs and selecting relevant features'. Together they form a unique fingerprint.

Cite this