@@ -4,8 +4,8 @@ In order to build StringTie from this GitHub repository
44the following steps can be taken:
55
66```
7- git clone https://github.com/gpertea/stringtie
8- cd stringtie
7+ git clone https://github.com/mpertea/stringtie2
8+ cd stringtie2
99make release
1010```
1111
@@ -14,12 +14,34 @@ which is more suitable for debugging and runtime checking but which can be
1414significantly slower than the optimized version which is obtained by using
1515` make release ` .
1616
17+
18+ ### Installation of the super-reads module (optional)
19+
20+ This optional module can be used to de-novo assemble, align and pre-process
21+ RNA-Seq reads, preparing them to be used as "super-reads" by Stringtie.
22+
23+ Mode detailed information is provided in the SuperReads_RNA/README.md
24+ Quick installation instructions for this module (assuming the above Stringtie installation
25+ was completed):
26+
27+ ```
28+ cd SuperReads_RNA
29+ ./install.sh
30+ ```
31+
32+ #### Using super-reads with Stringtie
33+
34+ After running the super-reads module (see the SuperRead_RNA/README.md file for usage details), there
35+ is a BAM file which contains sorted alignment for both short reads and super-reads, called * ` sr_merge.bam ` * ,
36+ created in the selected output directory. This file can be directly given as the main input file
37+ to StringTie as described in the _ Running Stringtie_ section below.
38+
1739## Running StringTie
1840
1941Run stringtie from the command line like this:
20- '''
42+ ```
2143stringtie [options] <aligned_reads.bam>
22- '''
44+ ```
2345The main input of the program is a SAMTools BAM file with RNA-Seq mappings
2446sorted by genomic location (for example the accepted_hits.bam file produced
2547by TopHat).
@@ -84,7 +106,6 @@ the following options are available:
84106
85107## Input files
86108
87-
88109StringTie takes as input a binary SAM (BAM) file sorted by reference position.
89110This file contains spliced read alignments such as the ones produced by TopHat or HISAT2.
90111A text file in SAM format should be converted to BAM and sorted using the
@@ -103,10 +124,13 @@ records. For example HISAT2 should be run with the `--dta` option in order to ta
103124alignments this way. As explained above, the alignments in SAM format should be sorted and
104125preferrably converted to BAM.
105126
106- Optionally, a reference annotation file in GTF/GFF3 format can be provided to StringTie.
107- In this case, StringTie will check to see if the reference transcripts are expressed in the
108- RNA-Seq data, and for the ones that are expressed it will compute coverage and FPKM values.
109- Note that the reference transcripts need to be fully covered by reads in order to be included
110- in StringTie's output. Other transcripts assembled from the data by StringTie and not present
111- in the reference file will be printed as well ("novel" transcripts).
127+ Optionally, a reference annotation file in GTF/GFF3 format can be provided to StringTie
128+ using the ` -G ` option. In this case, StringTie will check to see if the reference transcripts
129+ are expressed in the RNA-Seq data, and for the ones that are expressed it will compute coverage
130+ and FPKM values.
131+ Note that the reference transcripts should be fully covered by reads in order to be included
132+ in StringTie's output with the original ID of the reference transcript shown in the
133+ _ ` reference_id ` _ GTF attribute in the output file . Other transcripts assembled from
134+ the input alignment data by StringTie and not present in the reference file will be
135+ printed as well ("novel" transcripts).
112136
0 commit comments