Difference between revisions of "Trinity"

From HPC users
Jump to navigationJump to search
m (Reverted edits by Schwietzer (talk) to last revision by JBrunken)
Line 1: Line 1:
== Introduction ==
== Introduction ==
Theano is a Python library that allows you to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. Theano features:


== Installed version ==
Trinity, developed at the Broad Institute and the Hebrew University of Jerusalem, represents a novel method for the efficient and robust de novo reconstruction of transcriptomes from RNA-seq data. Trinity combines three independent software modules: '''Inchworm''', '''Chrysalis''', and '''Butterfly''', applied sequentially to process large volumes of RNA-seq reads. Trinity partitions the sequence data into many individual de Bruijn graphs, each representing the transcriptional complexity at a given gene or locus, and then processes each graph independently to extract full-length splicing isoforms and to tease apart transcripts derived from paralogous genes. Briefly, the process works like so:
 
*'''Inchworm''' assembles the RNA-seq data into the unique sequences of transcripts, often generating full-length transcripts for a dominant isoform, but then reports just the unique portions of alternatively spliced transcripts.


Currently there is one version installed:
*'''Chrysalis''' clusters the Inchworm contigs into clusters and constructs complete de Bruijn graphs for each cluster. Each cluster represents the full transcriptonal complexity for a given gene (or sets of genes that share sequences in common). Chrysalis then partitions the full read set among these disjoint graphs.
On '''hpc-env/6.4'''
*'''Theano/1.0.1-foss-2017b-Python-3.6.3'''


== Using Theano ==
*'''Butterfly''' then processes the individual graphs in parallel, tracing the paths that reads and pairs of reads take within the graph, ultimately reporting full-length transcripts for alternatively spliced isoforms, and teasing apart transcripts that corresponds to paralogous genes.


To use Theano, at first you have to load the corresponding module.
== Installed version ==
module load hpc-env/6.4
module load Theano


Afterwards, you have to start ''Python'' and import Theano:
There are currently two version of Trinity installed:
python
*'''2.2.0'''
from theano import *
*'''2.4.0'''


== Documentation ==
== Documentation ==


The full documentation can be found [http://deeplearning.net/software/theano/index.html here].
The full documentation can be found [https://github.com/trinityrnaseq/trinityrnaseq/wiki here].
To learn how to use Theanos, you could also take a look at the [http://deeplearning.net/software/theano/tutorial/index.html tutorial].

Revision as of 10:21, 7 November 2018

Introduction

Trinity, developed at the Broad Institute and the Hebrew University of Jerusalem, represents a novel method for the efficient and robust de novo reconstruction of transcriptomes from RNA-seq data. Trinity combines three independent software modules: Inchworm, Chrysalis, and Butterfly, applied sequentially to process large volumes of RNA-seq reads. Trinity partitions the sequence data into many individual de Bruijn graphs, each representing the transcriptional complexity at a given gene or locus, and then processes each graph independently to extract full-length splicing isoforms and to tease apart transcripts derived from paralogous genes. Briefly, the process works like so:

  • Inchworm assembles the RNA-seq data into the unique sequences of transcripts, often generating full-length transcripts for a dominant isoform, but then reports just the unique portions of alternatively spliced transcripts.
  • Chrysalis clusters the Inchworm contigs into clusters and constructs complete de Bruijn graphs for each cluster. Each cluster represents the full transcriptonal complexity for a given gene (or sets of genes that share sequences in common). Chrysalis then partitions the full read set among these disjoint graphs.
  • Butterfly then processes the individual graphs in parallel, tracing the paths that reads and pairs of reads take within the graph, ultimately reporting full-length transcripts for alternatively spliced isoforms, and teasing apart transcripts that corresponds to paralogous genes.

Installed version

There are currently two version of Trinity installed:

  • 2.2.0
  • 2.4.0

Documentation

The full documentation can be found here.