ncPRO-seq: a tool for annotation and profiling of ncRNAs in sRNA-seq data

Bioinformatics. 2012 Dec 1;28(23):3147-9. doi: 10.1093/bioinformatics/bts587. Epub 2012 Oct 7.

Abstract

Summary: Non-coding RNA (ncRNA) PROfiling in small RNA (sRNA)-seq (ncPRO-seq) is a stand-alone, comprehensive and flexible ncRNA analysis pipeline. It can interrogate and perform detailed profiling analysis on sRNAs derived from annotated non-coding regions in miRBase, Rfam and RepeatMasker, as well as specific regions defined by users. The ncPRO-seq pipeline performs both gene-based and family-based analyses of sRNAs. It also has a module to identify regions significantly enriched with short reads, which cannot be classified under known ncRNA families, thus enabling the discovery of previously unknown ncRNA- or small interfering RNA (siRNA)-producing regions. The ncPRO-seq pipeline supports input read sequences in fastq, fasta and color space format, as well as alignment results in BAM format, meaning that sRNA raw data from the three current major platforms (Roche-454, Illumina-Solexa and Life technologies-SOLiD) can be analyzed with this pipeline. The ncPRO-seq pipeline can be used to analyze read and alignment data, based on any sequenced genome, including mammals and plants.

Availability: Source code, annotation files, manual and online version are available at http://ncpro.curie.fr/.

Contact: bioinfo.ncproseq@curie.fr or cciaudo@ethz.ch

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Base Sequence
  • Computational Biology / methods*
  • Internet
  • RNA, Small Interfering / genetics*
  • RNA, Untranslated / genetics*
  • Sequence Alignment
  • Sequence Analysis, RNA / methods*
  • Software*

Substances

  • RNA, Small Interfering
  • RNA, Untranslated