Discovery of miRNA-Gene regulatory networks by using an integrative data-mining approach

Abstract

IntroductionMicroRNAs (miRNAs) represent the largest class of small non-coding RNAs (20-24 nucleotide long) acting as post-transcriptional regulators of many genes and playing a pivotal role in important biological processes, in almost all organisms and in a large number of human diseases. Computational approaches have been proven to be fundamental in the miRNA research for both gene-specific and large-scale predictions of miRNA targets, for the formulation of new functional hypothesis on their biological role, for gene network discovery and to guide experimental validations. However, their effectiveness is negatively affected by high uncertainty of miRNA gene target predictions and by the complexity of rules governing miRNA functional targeting, whose mechanisms still remain elusive. In order to improve predictions of miRNA targets and to support the elucidation of miRNA functional role in the context of gene regulatory networks, we have recently developed a new two-stepped computational approach based on: i) a semi-supervised ensemble-based classifier for the prediction of miRNA target interactions (MTIs) [1] and, ii) a biclustering algorithm (HOCCLUS2) for the prediction of miRNA-gene regulatory networks (MGRNs) [2]. Data produced are available at ComiRNet, a user-friendly web-based system providing efficient query, retrieval, export, visualization and analysis of predicted MTIs and MGRNs.MethodIn the first step, a semi-supervised ensemble-based classifier is learned from both experimentally validated interactions (positively labelled examples), extracted from miRTarBase [3], and miRNA gene target predictions (MTIs), returned from several prediction algorithms (unlabelled examples) and extracted from mirDIP [4]. This classifier acts as a meta-classifier of unlabelled examples. As a result of the first step, a unique (meta-)prediction score is available for all possible interactions. In the second step, these prediction scores are used to identify miRNA-gene regulatory networks (MGRNs) through the biclustering algorithm HOCCLUS2. HOCCLUS2 exploits the large set of produced predictions, with the associated probability, to extract a set of overlapping and hierarchically organized biclusters each one representing putative MGRNs. The construction of the hierarchy is performed by an iterative merging, considering both distance and density-based criteria. Extracted biclusters are also ranked on the basis of the p-values obtained by the Student's T-Test which compares intra- and inter- functional similarity of miRNA targets, computed on the basis of the gene classification provided in Gene Ontology (GO) [5]. The ComiRNet database relies on PostgreSQL DBMS, while the web-based platform is built through the Play 2.2 Java framework and the Cytoscape library. ResultsThe effectiveness of the computational approach has been validated on a number of alternative combinations of competitive algorithms for the first [1] and the second step [2]. B


Autore Pugliese

Tutti gli autori

  • G. Pio ; M. Ceci ; D. D'Elia ; D. Malerba

Titolo volume/Rivista

Non Disponibile


Anno di pubblicazione

2014

ISSN

Non Disponibile

ISBN

Non Disponibile


Numero di citazioni Wos

Nessuna citazione

Ultimo Aggiornamento Citazioni

Non Disponibile


Numero di citazioni Scopus

Non Disponibile

Ultimo Aggiornamento Citazioni

Non Disponibile


Settori ERC

Non Disponibile

Codici ASJC

Non Disponibile