List of RNA-Seq bioinformatics tools

RNA-Seq[1][2][3] is a technique [4] that allows transcriptome studies based on next-generation sequencing technologies. This technique is largely dependent on bioinformatics tools developed to support the different steps of the process. Here are listed some of the principal tools commonly employed and links to some important web resources.

To follow an integrated guide about RNA-seq analysis, please see - github rnaseq_tutorial, Next Generation Sequencing (NGS)/RNA, RNA-seqlopedia, Hands-On Tutorial [5] or RNA-Seq Workflow. Also, important links are SEQanswers,Omictools, RNA-SeqBlog, Biostar, homolog.us and bioscholar.


Quality control, trimming, error correction and pre-processing of data

Quality assessment [6] is the first step of the bioinformatics pipeline of RNA-Seq. Often, is necessary to filter data, removing low quality sequences or bases (trimming), adapters, contaminations, overrepresented sequences or correcting errors to assure a coherent final result.

Quality control

Improving the Quality

Improvement of the RNA-Seq quality, correcting the bias is a complex subject.[15][16] Each RNA-Seq protocol introduces specific type of bias, each step of the process (such as the sequencing technology used) is susceptible to generate some sort of noise or type of error. Furthermore, even the specie under investigation and the biological context of the samples are able to influence the results and introduce some kind of bias. Many sources of bias were already reported – GC content and PCR enrichment,[17][18] rRNA depletion,[19] errors produced during sequencing,[20] priming of reverse transcription caused by random hexamers.[21]

Different tools were developed to attempt to solve each of the detected errors.

Trimming and adapters removal

Detection of chimeric reads

Recent sequencing technologies normally require DNA samples to be amplified via polymerase chain reaction (PCR). Amplification often generates chimeric elements (specially from ribosomal origin) - sequences formed from two or more original sequences joined together.

Error Correction

High-throughput sequencing errors characterization and their eventual correction.[28]

Bias Correction

Other tasks/Pre-processing data

Further tasks performed before alignment, namely paired-read mergers.

Alignment Tools

After control assessment, the first step of RNA-Seq analysis involves alignment (RNA-Seq alignment) of the sequenced reads to a reference genome (if available) or to a transcriptome database. See also [41] and List of sequence alignment software.

Short (Unspliced) aligners

Short aligners are able to align continuous reads (not containing gaps result of splicing) to a genome of reference. Basically, there are two types: 1) based on the Burrows-Wheeler transform method such as Bowtie and BWA, and 2) based on Seed-extend methods, Needleman-Wunsch or Smith-Waterman algorithms. The first group (Bowtie and BWA) is many times faster, however some tools of the second group, despite the time spent tend to be more sensitive, generating more reads correctly aligned. See a comparative study of short aligners - comparative study.

Spliced aligners

Many reads span exon-exon junctions and can not be aligned directly by Short aligners, thus specific aligners were necessary - Spliced aligners. Some Spliced aligners employ Short aligners to align firstly unspliced/continuous reads (exon-first approach), and after follow a different strategy to align the rest containing spliced regions - normally the reads are split into smaller segments and mapped independently. See also.[42]

Aligners based on known splice junctions (annotation-guided aligners)

In this case the detection of splice junctions is based on data available in databases about known junctions. This type of tools cannot identify new splice junctions. Some of this data comes from other expression methods like expressed sequence tags (EST).

De novo Splice Aligners

De novo Splice aligners allow the detection of new Splice junctions without need to previous annotated information (some of these tools present annotation as a suplementar option). See also De novo Splice Aligners.

De novo Splice Aligners that also use annotation optionally
Other Spliced Aligners

Evaluation of Alignment tools

Normalization, Quantitative analysis and Differential Expression

General Tools

These tools perform normalization and calculate the abundance of each gene expressed in a sample.[48] RPKM, FPKM and TPMs are some of the units employed to quantification of expression (RPKM-FPKM-TPMs video). Some software are also designed to study the variability of genetic expression between samples (differential expression). Quantitative and differential studies are largely determined by the quality of reads alignment and accuracy of isoforms reconstruction. Several studies are available comparing differential expression methods.[49][50]

Evaluation of quantification and differential expression

Multi-tool solutions

Workbench (analysis pipeline / integrated solutions)

Commercial Solutions

Open (free) Source Solutions

Alternative Splicing Analysis

General Tools

Intron Retention Analysis

Fusion genes/chimeras/translocation finders/structural variations

Genome arrangements result of diseases like cancer can produce aberrant genetic modifications like fusions or translocations. Identification of these modifications play important role in carcinogenesis studies.[59]

Copy Number Variation identification

Single Cell RNA-Seq

Single cell sequencing. Comparative analysis of single-cell RNA-sequencing methods.[61]

RNA-Seq simulators

These Simulators generate in silico reads and are useful tools to compare and test the efficiency of algorithms developed to handle RNA-Seq data. Moreover, some of them make possible to analyse and model RNA-Seq protocols.See also Genetic Simulation Resources and some discussion about simulation at Biostars.

Transcriptome assemblers

The transcriptome is the total population of RNAs expressed in one cell or group of cells, including non-coding and protein-coding RNAs. There are two types of approaches to assemble transcriptomes. Genome-guided methods use a reference genome (if possible a finished and high quality genome) as a template to align and assembling reads into transcripts. Genome-independent methods does not require a reference genome and are normally used when a genome is not available. In this case reads are assembled directly in transcripts. Some important comparative studies [71][72] were already published.

Genome-Guided assemblers

Genome-Independent (de novo) assemblers

Assembly evaluation tools

Co-expression networks

miRNA prediction and analysis

Visualization tools

Functional, Network & Pathway Analysis Tools

Further annotation tools for RNA-Seq data

RNA-Seq Databases

Single specie RNA-Seq databases

Webinars and Presentations

References

  1. Wang, Z., Gerstein, M., & Snyder, M. (2009). "RNA-Seq: a revolutionary tool for transcriptomics". Nature Reviews Genetics. 10 (1): 57–63. doi:10.1038/nrg2484. PMC 2949280Freely accessible. PMID 19015660.
  2. Kimberly R. Kukurba & Stephen B. Montgomery (2015). "RNA Sequencing and Analysis". Cold Spring Harb Protoc. 2015: pdb.top084970. doi:10.1101/pdb.top084970.
  3. Ana Conesa, Pedro Madrigal, Sonia Tarazona, David Gomez-Cabrero, Alejandra Cervera,Andrew McPherson, Michał Wojciech Szcześniak, Daniel J. Gaffney, Laura L. Elo, Xuegong Zhang and Ali Mortazavi (2016). "A survey of best practices for RNA-seq data analysis". Genome Biology. 17 (13). doi:10.1186/s13059-016-0881-8.
  4. "RNA Sequencing and analysis" (PDF). Canadian Bioinformatics Workshops. 2012.
  5. Verk, M., Hickman, R., Pieterse, C., Van Wees, S. (2013). "RNA-Seq: revelation of the messengers". Trends in Plant Science. 18 (4): 175–179. doi:10.1016/j.tplants.2013.02.001.
  6. Sheng Q, Vickers K, Zhao S, Wang J, Samuels DC, Koues O, Shyr Y, Guo Y (2016). "Multi-perspective quality control of Illumina RNA sequencing data analysis". Brief Funct Genomics: 1–11. doi:10.1093/bfgp/elw035. PMID 27687708.
  7. Sayols S & Klein H (2015). "dupRadar: Assessment of duplication rates in RNA-Seq datasets. R package version 1.1.0.".
  8. Matthew P.A. Davis, Stijn van Dongen, Cei Abreu-Goodger, Nenad Bartonicek and Anton J. Enright (2013). "Kraken: A set of tools for quality control and analysis of high-throughput sequence data". Methods. 63 (1): 41–49. doi:10.1016/j.ymeth.2013.06.027. PMID 23816787.
  9. S Anders; T P Pyl; W Huber (2015). "HTSeq — A Python framework to work with high-throughput sequencing data.". Bioinformatics. 31 (2): 166–169. doi:10.1093/bioinformatics/btu638.
  10. Feng,H., Zhang,X., Zhang,C. (2015). "mRIN for direct assessment of genome-wide and gene-specific mRNA integrity from large-scale RNA sequencing data". Nature Communications. 6 (7816). doi:10.1038/ncomms8816.
  11. Ewels, Philip; Magnusson, Måns; Lundin, Sverker; Käller, Max (2016-06-16). "MultiQC: summarize analysis results for multiple tools and samples in a single report". Bioinformatics: btw354. doi:10.1093/bioinformatics/btw354. ISSN 1367-4803. PMID 27312411.
  12. Deluca DS, Levin JZ, Sivachenko A, Fennell T, Nazaire MD, Williams C, Reich M, Winckler W, Getz G (2012). "RNA-SeQC: RNA-seq metrics for quality control and process optimization". Bioinformatics. 28 (11): 1530–1532. doi:10.1093/bioinformatics/bts196.
  13. Wang, L., Wang, S., & Li, W. (2012). "RSeQC: Quality Control of RNA-seq experiments.". Bioinformatics. 28 (16): 2184–2185. doi:10.1093/bioinformatics/bts356.
  14. Lassmann T, Hayashizaki Y, Daub CO (2010). "SAMStat: monitoring biases in next generation sequencing data.". Bioinformatics. 27 (1): 130–131. doi:10.1093/bioinformatics/btq614. PMID 21088025.
  15. Nicholas F Lahens, Ibrahim Halil Kavakli, Ray Zhang, Katharina Hayer, Michael B Black, Hannah Dueck, Angel Pizarro, Junhyong Kim, Rafael Irizarry, Russell S Thomas, Gregory R Grant and John B Hogenesch (2014). "IVT-seq reveals extreme bias in RNA sequencing". Genome Biology. 15 (6): R86. doi:10.1186/gb-2014-15-6-r86.
  16. Li S, Łabaj PP, Zumbo P, Sykacek P, Shi W, Shi L, Phan J, Wu PY, Wang M, Wang C, Thierry-Mieg D, Thierry-Mieg J, Kreil DP, Mason CE (2014). "Detecting and correcting systematic variation in large-scale RNA sequencing data.". Nature Biotechnology. 32: 888–895. doi:10.1038/nbt.3000. PMID 25150837.
  17. Benjamini Y, Speed TP (2014). "Summarizing and correcting the GC content bias in high-throughput sequencing .". Nucleic Acids Res. 40 (10): e72. doi:10.1093/nar/gks001. PMID 22323520.
  18. Aird D, Ross MG, Chen WS, Danielsson M, Fennell T, Russ C, Jaffe DB, Nusbaum C, Gnirke A (2011). "Analyzing and minimizing PCR amplification bias in Illumina sequencing libraries.". Genome Biol. 12 (2): 888–895. doi:10.1186/gb-2011-12-2-r18.
  19. Adiconis X, Borges-Rivera D, Satija R, DeLuca DS, Busby MA, Berlin AM, Sivachenko A, Thompson DA, Wysoker A, Fennell T, Gnirke A, Pochet N, Regev A, Levin JZ (2013). "Comparative analysis of RNA sequencing methods for degraded or low-input samples.". Nat Methods. 10 (7): 623–629. doi:10.1038/nmeth.2483. PMID 23685885.
  20. Nakamura K, Oshima T, Morimoto T, Ikeda S, Yoshikawa H, Shiwa Y, Ishikawa S, Linak MC, Hirai A, Takahashi H, Altaf-Ul-Amin M, Ogasawara N, Kanaya S (2011). "Sequence-specific error profile of Illumina sequencers.". Nucleic Acids Res. 39 (13): e90. doi:10.1093/nar/gkr344. PMID 21576222.
  21. Hansen KD, Brenner SE, Dudoit S (2010). "Biases in Illumina transcriptome sequencing caused by random hexamer priming.". Nucleic Acids Res. 38: e131. doi:10.1093/nar/gkq224. PMC 2896536Freely accessible. PMID 20395217.
  22. Smeds, Linnéa; Künstner, Axel; Donlin, Maureen J. (19 October 2011). "ConDeTri - A Content Dependent Read Trimmer for Illumina Data". PLoS ONE. 6 (10): e26314. doi:10.1371/journal.pone.0026314.
  23. Martin, Marcel (2 May 2011). "Cutadapt removes adapter sequences from high-throughput sequencing reads". EMBnet.journal. 17 (1): 10. doi:10.14806/ej.17.1.200.
  24. Spandow, O; Hellström, S; Schmidt, SH; De Paoli, Emanuale; Policriti, Alberto (2012). "ERNE-BS5: Aligning BS-treated Sequences by Multiple Hits on a 5-letters Alphabet". Proceedings of the ACM Conference on Bioinformatics, Computational Biology and Biomedicine. 12: 12–19. doi:10.1145/2382936.2382938.
  25. Schmieder, R.; Edwards, R. (28 January 2011). "Quality control and preprocessing of metagenomic datasets". Bioinformatics. 27 (6): 863–864. doi:10.1093/bioinformatics/btr026.
  26. Dlugosch KM, Lai Z, Bonin A, Hierro J, Rieseberg LH (2013). "Allele identification for transcriptome-based population genomics in the invasive plant Centaurea solstitialis.". G3. 3 (2): 359–367. doi:10.1534/g3.112.003871. PMC 3564996Freely accessible. PMID 23390612.
  27. Bolger, A. M.; Lohse, M.; Usadel, B. (1 April 2014). "Trimmomatic: a flexible trimmer for Illumina sequence data". Bioinformatics. 30 (15): 2114–2120. doi:10.1093/bioinformatics/btu170.
  28. David Laehnemann, Arndt Borkhardt & Alice Carolyn McHardy (2015). "Denoising DNA deep sequencing data—high-throughput sequencing errors and their correction". Briefings in Bioinformatics: 1–26. doi:10.1093/bib/bbv029. PMID 26026159.
  29. Quince C, Lanzen A, Davenport RJ, Turnbaugh PJ (2011). "Removing noise from pyrosequenced amplicons.". BMC Bioinformatics. 12 (38). doi:10.1186/1471-2105-12-38. PMC 3045300Freely accessible. PMID 21276213.
  30. Heo Y, Wu XL, Chen D, Ma J, Hwu WM (2014). "BLESS: bloom filter-based error correction solution for high-throughput sequencing reads.". Bioinformatics. 15 (30): 1354–1362. doi:10.1093/bioinformatics/btu030. PMID 24451628.
  31. Paul Greenfield; Konsta Duesing; Alexie Papanicolaou; Denis C. Bauer (2014). "Blue: correcting sequencing errors using consensus and context.". Bioinformatics. 30 (19): 2723–32. doi:10.1093/bioinformatics/btu368. PMID 24919879.
  32. Michael I Love; John B Hogenesch; Rafael A Irizarry (2015). "Modeling of RNA-seq fragment sequence bias reduces systematic errors in transcript abundance estimation". BioRXiv.
  33. Hansen KD, Irizarry RA, Wu Z (2012). "Removing technical variability in RNA-seq data using conditional quantile normalization.". Biostatistics. 13 (2): 204–16. doi:10.1093/biostatistics/kxr054. PMID 22285995.
  34. Risso D, Schwartz K, Sherlock G, Dudoit S (2011). "GC-Content Normalization for RNA-Seq Data.". BMC Bioinformatics. 12 (1): 480. doi:10.1186/1471-2105-12-480.
  35. Oliver Stegle; Leopold Parts; Matias Piipari; John Winn & Richard Durbin (2012). "Using probabilistic estimation of expression residuals (PEER) to obtain increased power and interpretability of gene expression analyses.". Nat Protoc. 7 (6): 500–507. doi:10.1038/nprot.2011.457. PMC 3398141Freely accessible. PMID 22343431.
  36. Risso D, Ngai J, Speed TP, Dudoit S (2014). "Normalization of RNA-seq data using factor analysis of control genes or samples". Nat. Biotechnol. 32 (9): 896–902. doi:10.1038/nbt.2931. PMID 25150836.
  37. Meacham, F., Boffelli, D., Dhahbi, J., Martin, D. I., Singer, M., & Pachter, L. (2011). "Identification and correction of systematic error in high-throughput sequence data". BMC Bioinformatics. 12 (1): 451. doi:10.1186/1471-2105-12-451.
  38. Liu B, Yuan J, Yiu SM, Li Z, Xie Y, Chen Y, Shi Y, Zhang H, Li Y, Lam TW, Luo R (2012). "COPE: an accurate k-mer-based pair-end reads connection tool to facilitate genome assembly.". Bioinformatics. 28 (22): 2870–4. doi:10.1093/bioinformatics/bts563. PMID 23044551.
  39. J. Zhang; K. Kobert; T. Flouri; A. Stamatakis (2013). "PEAR: A fast and accurate Illumina Paired-End read mergeR.". Bioinformatics. 30: 614–620. doi:10.1093/bioinformatics/btt593.
  40. Sébastien Rodrigue; Arne C. Materna; Sonia C. Timberlake; Matthew C. Blackburn; Rex R. Malmstrom; Eric J. Alm; Sallie W. Chisholm (2010). "Unlocking Short Read Sequencing for Metagenomics". PLoS ONE. 5 (7): e11840. doi:10.1371/journal.pone.0011840. PMC 2911387Freely accessible. PMID 20676378.
  41. Nuno A. Fonseca, Johan Rung, Alvis Brazma and John C. Marioni (2012). "Tools for mapping high-throughput sequencing data". Bioinformatics. 28 (24): 3169–3177. doi:10.1093/bioinformatics/bts605.
  42. Alamancos GP, Agirre E, Eyras E (2014). "Methods to study splicing from high-throughput RNA sequencing data.". Methods Mol Biol. 1126: 357–97. doi:10.1007/978-1-62703-980-2_26. PMID 24549677.
  43. Campagna D, Telatin A, Forcato C, Vitulo N, Valle G (2013). "PASS-bis: a bisulfite aligner suitable for whole methylome analysis of Illumina and SOLiD reads.". Bioinformatics. 29 (2): 268–70. doi:10.1093/bioinformatics/bts675. PMID 23162053.
  44. Jaegyoon Ahn & Xinshu Xiao (2015). "RASER: reads aligner for SNPs and editing sites of RNA". Bioinformatics: btv505. doi:10.1093/bioinformatics/btv505.
  45. 1 2 Yang Liao, Gordon K Smyth and Wei Shi (2013). "The Subread aligner: fast, accurate and scalable read mapping by seed-and-vote". Nucleic Acids Research. 41 (10): e108. doi:10.1093/nar/gkt214. PMID 23558742.
  46. http://bioinformatics.oxfordjournals.org/content/29/1/15.full
  47. Cole Trapnell, Lior Pachter and Steven Salzberg (2009). "TopHat: discovering splice junctions with RNA-Seq". Bioinformatics. 25 (9): 11051111. doi:10.1093/bioinformatics/btp120. PMC 2672628Freely accessible. PMID 19289445.
  48. Lior Pachter (2011). "Models for transcript quantification from RNA-Seq". arXiv:1104.3889Freely accessible.
  49. VANESSA M. KVAM; PENG LIU & YAQING SI (2012). "A COMPARISON OF STATISTICAL METHODS FOR DETECTING DIFFERENTIALLY EXPRESSED GENES FROM RNA-SEQ DATA". American Journal of Botany. 99 (2): 248–256. doi:10.3732/ajb.1100340.
  50. Marie-Agne's Dillies, Andrea Rau, Julie Aubert, Christelle Hennequet-Antier, and on behalf of The French StatOmique Consortium (2012). "A comprehensive evaluation of normalization methods for Illumina high-throughput RNA sequencing data analysis". Brief Bioinform. 14: 1–13. doi:10.1093/bib/bbs046. PMID 22988256.
  51. Zhijin Wu,corresponding author1 Bethany D Jenkins, Tatiana A Rynearson, Sonya T Dyhrman, Mak A Saito, Melissa Mercier , and LeAnn P Whitney (2010). "Empirical bayes analysis of sequencing-based transcriptional profiling without replicates.". BMC Bioinformatics. 11: 564. doi:10.1186/1471-2105-11-564. PMC 3098101Freely accessible. PMID 21080965.
  52. Cole Trapnell, Brian A Williams, Geo Pertea, Ali Mortazavi, Gordon Kwan, Marijke J van Baren, Steven L Salzberg, Barbara J Wold and Lior Pachter (2010). "Transcript assembly and abundance estimation from RNA-Seq reveals thousands of new transcripts and switching among isoforms". Nature Biotechnology. 28 (5): 511515. doi:10.1038/nbt.1621. PMC 3146043Freely accessible. PMID 20436464.
  53. Klambauer, G.; Unterthiner, T.; Hochreiter, S. (2013). "DEXUS: Identifying differential expression in RNA-Seq studies with unknown conditions". Nucleic Acids Research. 41 (21): e198. doi:10.1093/nar/gkt834. PMID 24049071.
  54. Feng J, Meyer CA, Wang Q, Liu JS, Liu XS, Zhang Y (2012). "GFOLD: a generalized fold change for ranking differentially expressed genes from RNA-seq data.". Bioinformatics. 28: 2782–2788. doi:10.1093/bioinformatics/bts515.
  55. Rauschenberger A, Jonker MA, van de Wiel MA, Menezes RX (2016). "Testing for association between RNA-Seq and high-dimensional data". BMC Bioinformatics. 17 (118). doi:10.1186/s12859-016-0961-5. PMC 4782413Freely accessible. PMID 26951498.
  56. Moulos, P; Hatzis, P (2015). "Systematic integration of RNA-Seq statistical algorithms for accurate detection of differential gene expression patterns". Nucleic Acids Research. 43 (4): e25. doi:10.1093/nar/gku1273. PMID 25452340.
  57. Kartashov, Andrey V., and Artem Barski. "BioWardrobe: an integrated platform for analysis of epigenomics and transcriptomics data." Genome Biology 16.1 (2015): 158. http://www.genomebiology.com/2015/16/1/158
  58. evin L, Bar-Yaacov D, Bouskila A, Chorev M, Carmel L, Mishmar D (2015). "LEMONS – A Tool for the Identification of Splice Junctions in Transcriptomes of Organisms Lacking Reference Genomes.". PLoS ONE. 10 (11): e0143329. doi:10.1371/journal.pone.0143329.
  59. Shailesh Kumar, Angie Duy Vo, Fujun Qin & Hui Li (2016). "Comparative assessment of methods for the fusion transcripts detection from RNA-Seq data". Scientific Reports. 6 (21587): 21597. doi:10.1038/srep21597.
  60. Jia, W; Qiu, K; He, M; Song, P; Zhou, Q; Zhou, F; Yu, Y; Zhu, D; Nickerson, ML; Wan, S; Liao, X; Zhu, X; Peng, S; Li, Y; Wang, J; Guo, G (2013). "SOAPfuse: an algorithm for identifying fusion transcripts from paired-end RNA-Seq data". Genome Biology. 14: R12. doi:10.1186/gb-2013-14-2-r12. PMID 23409703.
  61. Christoph Ziegenhain; Swati Parekh; Beate Vieth; Martha Smets; Heinrich Leonhardt; Ines Hellmann; Wolfgang Enard (2016). "Comparative analysis of single-cell RNA-sequencing methods.". bioRXiv. doi:10.1101/035758.
  62. Tamar Hashimshony; Florian Wagner; Noa Sher; Itai Yanai (2012). "CEL-Seq: Single-Cell RNA-Seq by Multiplexed Linear Amplification". Cell Reports. 2 (3): 666–673. doi:10.1016/j.celrep.2012.08.003. PMID 22939981.
  63. Evan Z. Macosko; Anindita Basu; Rahul Satija; James Nemesh; Karthik Shekhar; Melissa Goldman; Itay Tirosh; Allison R. Bialas; Nolan Kamitaki; Emily M. Martersteck; John J. Trombetta; David A. Weitz; Joshua R. Sanes; Alex K. Shalek; Aviv Regev; Steven A. McCarroll (2015). "Highly Parallel Genome-wide Expression Profiling of Individual Cells Using Nanoliter Droplets.". Cell. 161 (5): 1202–1214. doi:10.1016/j.cell.2015.05.002.
  64. Marco E, Karp RL, Guo G, Robson P, Hart AH, Trippa L, Yuan GC (2014). "Bifurcation analysis of single-cell gene expression data reveals epigenetic landscape". PNAS. 111: E5643–50. doi:10.1073/pnas.1408993111.
  65. Buettner F, Natarajan KN, Casale FP, Proserpio V, Scialdone A, Theis FJ, Teichmann SA, Marioni JC & Stegle O (2015). "Computational analysis of cell-to-cell heterogeneity in single-cell RNA-Sequencing data reveals hidden subpopulation of cells". Nature Biotechnology. 33: 155–160. doi:10.1038/nbt.3102.
  66. Minzhe Guo; Hui Wang; S. Steven Potter; Jeffrey A. Whitsett; Yan Xu (2015). "SINCERA: A Pipeline for Single-Cell RNA-Seq Profiling Analysis". PLoS Comput Biol. 11 (11): e1004575. doi:10.1371/journal.pcbi.1004575.
  67. Monzoorul Haque M, Tarini Shankar Ghosh, Nitin Kumar Singh and Sharmila S Mande (2011). "SPHINX - An algorithm for taxonomic binning of metagenomic sequences". Bioinformatics. 27: 22–30. doi:10.1093/bioinformatics/btq608.
  68. Stubbington, Michael JT; Lönnberg, Tapio; Proserpio, Valentina; Clare, Simon; Speak, Anneliese O; Dougan, Gordon; Teichmann, Sarah A. "T cell fate and clonality inference from single-cell transcriptomes". Nature Immunology. doi:10.1038/nmeth.3800.
  69. Eltahla, Auda A; Rizzetto, Simone; Pirozyan, Mehdi R; Betz-Stablein, Brigid D; Venturi, Vanessa; Kedzierska, Katherine; Lloyd, Andrew R; Bull, Rowena A; Luciani, Fabio. "Linking the T cell receptor to the single cell transcriptome in antigen-specific human T cells". Immunology and Cell Biology. doi:10.1038/icb.2016.16.
  70. Emma Pierson & Christopher Yau (2015). "ZIFA: Dimensionality reduction for zero-inflated single-cell gene expression analysis". Genome Biology. 16 (241). doi:10.1186/s13059-015-0805-z.
  71. Hayer, Katharina E.; Pizarro, Angel; Lahens, Nicholas F.; Hogenesch, John B.; Grant, Gregory R. (3 September 2015). "Benchmark analysis of algorithms for determining and quantifying full-length mRNA splice forms from RNA-seq data". Bioinformatics: btv488. doi:10.1093/bioinformatics/btv488.
  72. Steijger T, Abril JF, Engström PG, Kokocinski F, Hubbard TJ, Guigó R, Harrow J, Bertone P; RGASP Consortium. (2013). "Assessment of transcript reconstruction methods for RNA-seq". Nat Methods. 10 (12): 1177–84. doi:10.1038/nmeth.2714. PMID 24185837.
  73. Chang, Zheng; Li, Guojun; Liu, Juntao; Zhang, Yu; Ashby, Cody; Liu, Deli; Cramer, Carole L.; Huang, Xiuzhen (2015-02-11). "Bridger: a new framework for de novo transcriptome assembly using RNA-seq data". Genome Biology. 16 (1): 30. doi:10.1186/s13059-015-0596-2. ISSN 1465-6906. PMC 4342890Freely accessible. PMID 25723335.
  74. Zerbino DR, Birney E (2008). "Velvet: Algorithms for de novo short read assembly using de Bruijn graphs". Genome Research. 18 (5): 821829. doi:10.1101/gr.074492.107. PMC 2336801Freely accessible. PMID 18349386.
  75. Camelia Quek, Chol-hee Jung, Shayne A. Bellingham, Andrew Lonie and Andrew F. Hill (2015). "iSRAP - a one-touch research tool for rapid profiling of small RNA-seq data". Journal of Extracellular Vesicles. 4. doi:10.3402/jev.v4.29454.
  76. Schmid-Burgk JL, Hornung V (2015). "BrowserGenome.org: web-based RNA-seq data analysis and visualization". Nat Methods. 12 (11): 1001. doi:10.1038/nmeth.3615.
  77. Milne I, Stephen G, Bayer M, Cock PJ, Pritchard L, Cardle L, Shaw PD, Marshall D (2013). "Using Tablet for visual exploration of second-generation sequencing data". Briefings in Bioinformatics. 14 (2): 193–202. doi:10.1093/bib/bbs012. PMID 22445902.
  78. Weijun Luo, Michael S Friedman, Kerby Shedden, Kurt D Hankenson and Peter J Woolf (2009). "GAGE: generally applicable gene set enrichment for pathway analysis". BMC Bioinformatics. 10 (161): 17. doi:10.1186/1471-2105-10-161. PMC 2696452Freely accessible. PMID 19473525.
  79. Santhilal Subhash and Chandrasekhar Kanduri (2016). "GeneSCF: a real-time based functional enrichment tool with support for multiple organisms". BMC Bioinformatics. 17 (1): 365. doi:10.1186/s12859-016-1250-z. PMC 5020511Freely accessible. PMID 27618934.
  80. Rue-Albrecht K (2014). "Visualise microarray and RNAseq data using gene ontology annotations. R package version 1.4.1".
  81. Young MD, Wakefield MJ, Smyth GK, Oshlack A (2010). "Gene ontology analysis for RNA-seq: accounting for selection bias.". Genome Biology. 11 (2): 12. doi:10.1186/gb-2010-11-2-r14. PMC 2872874Freely accessible. PMID 20132535.
  82. Qing Xiong, Sayan Mukherjee & Terrence S. Furey (2014). "GSAASeqSP: A Toolset for Gene Set Association Analysis of RNA-Seq Data". SCIENTIFIC REPORTS. 4 (6347). doi:10.1038/srep06347.
  83. Sonja Hänzelmann, Robert Castelo & Justin Guinney (2013). "GSVA: gene set variation analysis for microarray and RNA-Seq data.". BMC Bioinformatics. 14 (17): 7. doi:10.1186/1471-2105-14-7.
  84. Yi-Hui Zhou (2015). "Pathway analysis for RNA-Seq data using a score-based approach". Biometrics. doi:10.1111/biom.12372.
  85. Ivana Ihnatova & Eva Budinska (2015). "ToPASeq: an R package for topology-based pathway analysis of microarray and RNA-Seq data". BMC Bioinformatics. 16 (350). doi:10.1186/s12859-015-0763-1.
  86. Van Bel M, Proost S, Van Neste C, Deforce D, Van de Peer Y, Vandepoele K (2013). "TRAPID, an efficient online tool for the functional and comparative analysis of de novo RNA-Seq transcriptomes.". Genome Biol. 14: R134. doi:10.1186/gb-2013-14-12-r134. PMID 24330842.
  87. Anne de Jong, Sjoerd van der Meulen, Oscar P. Kuipers and Jan Kok (2015). "T-REx: Transcriptome analysis webserver for RNA-seq Expression data". BMC Genomics. 16 (663). doi:10.1186/s12864-015-1834-4.
  88. Ye Zhang; Kenian Chen; Steven A Sloan; Mariko L Bennett; Anja R Scholze; Sean O'Keefe; Hemali P Phatnani; Paolo Guarnieri; Christine Caneda; Nadine Ruderisch; Shuyun Deng; Shane A Liddelow; Chaolin Zhang; Richard Daneman; Tom Maniatis; Ben A Barres; Jia Qian Wu (2014). "An RNA-Seq transcriptome and splicing database of glia, neurons, and vascular cells of the cerebral cortex.". Journal of Neuroscience. 34 (36): 11929–11947. doi:10.1523/JNEUROSCI.1860-14.2014.
  89. Wang, Y., Wu, N., Liu, J., Wu, Z., & Dong, D. (2015). "FusionCancer: a database of cancer fusion genes derived from RNA-seq data". Diagnostic Pathology. 10 (131): 39. doi:10.1186/s13000-015-0310-4. PMC 4517624Freely accessible. PMID 26215638.
  90. Krupp M; Marquardt JU; Sahin U; Galle PR; Castle J; Teufel A. (2012). "RNA-Seq Atlas--a reference database for gene expression profiling in normal tissue by next-generation sequencing". Bioinformatics. 28 (8): 1184–1185. doi:10.1093/bioinformatics/bts084. PMID 22345621.
This article is issued from Wikipedia - version of the 11/26/2016. The text is available under the Creative Commons Attribution/Share Alike but additional terms may apply for the media files.