Skip to content

Commit d84cf1d

Browse files
committed
sync mpertea/stringtie2 changes
1 parent dedf082 commit d84cf1d

5 files changed

Lines changed: 42 additions & 18 deletions

File tree

README.md

Lines changed: 35 additions & 11 deletions
Original file line numberDiff line numberDiff line change
@@ -4,8 +4,8 @@ In order to build StringTie from this GitHub repository
44
the following steps can be taken:
55

66
```
7-
git clone https://github.com/gpertea/stringtie
8-
cd stringtie
7+
git clone https://github.com/mpertea/stringtie2
8+
cd stringtie2
99
make release
1010
```
1111

@@ -14,12 +14,34 @@ which is more suitable for debugging and runtime checking but which can be
1414
significantly slower than the optimized version which is obtained by using
1515
`make release`.
1616

17+
18+
### Installation of the super-reads module (optional)
19+
20+
This optional module can be used to de-novo assemble, align and pre-process
21+
RNA-Seq reads, preparing them to be used as "super-reads" by Stringtie.
22+
23+
Mode detailed information is provided in the SuperReads_RNA/README.md
24+
Quick installation instructions for this module (assuming the above Stringtie installation
25+
was completed):
26+
27+
```
28+
cd SuperReads_RNA
29+
./install.sh
30+
```
31+
32+
#### Using super-reads with Stringtie
33+
34+
After running the super-reads module (see the SuperRead_RNA/README.md file for usage details), there
35+
is a BAM file which contains sorted alignment for both short reads and super-reads, called *`sr_merge.bam`*,
36+
created in the selected output directory. This file can be directly given as the main input file
37+
to StringTie as described in the _Running Stringtie_ section below.
38+
1739
## Running StringTie
1840

1941
Run stringtie from the command line like this:
20-
'''
42+
```
2143
stringtie [options] <aligned_reads.bam>
22-
'''
44+
```
2345
The main input of the program is a SAMTools BAM file with RNA-Seq mappings
2446
sorted by genomic location (for example the accepted_hits.bam file produced
2547
by TopHat).
@@ -84,7 +106,6 @@ the following options are available:
84106

85107
## Input files
86108

87-
88109
StringTie takes as input a binary SAM (BAM) file sorted by reference position.
89110
This file contains spliced read alignments such as the ones produced by TopHat or HISAT2.
90111
A text file in SAM format should be converted to BAM and sorted using the
@@ -103,10 +124,13 @@ records. For example HISAT2 should be run with the `--dta` option in order to ta
103124
alignments this way. As explained above, the alignments in SAM format should be sorted and
104125
preferrably converted to BAM.
105126

106-
Optionally, a reference annotation file in GTF/GFF3 format can be provided to StringTie.
107-
In this case, StringTie will check to see if the reference transcripts are expressed in the
108-
RNA-Seq data, and for the ones that are expressed it will compute coverage and FPKM values.
109-
Note that the reference transcripts need to be fully covered by reads in order to be included
110-
in StringTie's output. Other transcripts assembled from the data by StringTie and not present
111-
in the reference file will be printed as well ("novel" transcripts).
127+
Optionally, a reference annotation file in GTF/GFF3 format can be provided to StringTie
128+
using the `-G` option. In this case, StringTie will check to see if the reference transcripts
129+
are expressed in the RNA-Seq data, and for the ones that are expressed it will compute coverage
130+
and FPKM values.
131+
Note that the reference transcripts should be fully covered by reads in order to be included
132+
in StringTie's output with the original ID of the reference transcript shown in the
133+
_`reference_id`_ GTF attribute in the output file . Other transcripts assembled from
134+
the input alignment data by StringTie and not present in the reference file will be
135+
printed as well ("novel" transcripts).
112136

prep_linux.sh

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -13,10 +13,10 @@ mkdir $linpack
1313
cd $srcpack
1414
make cleanall
1515
make static
16-
cp LICENSE README stringtie ../$linpack/
16+
cp LICENSE README.md run_tests.sh stringtie ../$linpack/
1717
cd ..
1818
tar cvfz $linpack.tar.gz $linpack
1919
ls -l $linpack.tar.gz
2020
echo "scp $linpack.tar.gz $srcpack.tar.gz salz:~/html/software/stringtie/dl/"
21-
echo "If you're on salzN machines you can also update the release# links:"
21+
echo "If you're on CCB servers you can also update the release# links:"
2222
echo "perl -i -pe 's/stringtie\-\d\.\d+\.\d+\w?\./stringtie-$ver./g' ~/html/software/stringtie/home.shtml"

prep_mac.sh

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -11,7 +11,7 @@ echo "-------------------"
1111
mkdir $macpack
1212
make clean
1313
make release
14-
cp -p LICENSE README stringtie $macpack/
14+
cp -p LICENSE README.md run_tests.sh stringtie $macpack/
1515
tar cvfz $macpack.tar.gz $macpack
1616
ls -l $macpack.tar.gz
1717
#echo "If you're on igmN machines you can also update the web files:"

prep_source.sh

Lines changed: 1 addition & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -13,13 +13,11 @@ cd samtools-0.1.18
1313
make clean
1414
cd ..
1515
libdir=stringtie-$ver/gclib/
16-
cp Makefile LICENSE README stringtie.cpp {rlink,tablemaker,tmerge}.{h,cpp} $pack/
17-
#sed 's|\.\./gclib|./gclib|' Makefile > $pack/Makefile
16+
cp Makefile LICENSE README.md run_tests.sh stringtie.cpp {rlink,tablemaker,tmerge}.{h,cpp} $pack/
1817
cp -r samtools-0.1.18 $pack/
1918
/bin/rm -rf $pack/samtools-0.1.18/.svn
2019
cp ./gclib/{GVec,GList,GHash,GIntHash}.hh $libdir
2120
cp ./gclib/GBitVec.h $libdir
2221
cp ./gclib/{GArgs,GBam,GBase,gdna,GStr,gff,codons,GFaSeqGet,GFastaIndex,proc_mem,GThreads}.{h,cpp} $libdir
2322
tar cvfz $pack.tar.gz $pack
2423
ls -l $pack.tar.gz
25-
#scp $pack.tar.gz igm3:~/src/

run_tests.sh

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -42,7 +42,9 @@ for i in ${!arrmsg[@]}; do
4242
echo "Error: file $fcmp does not exist! Re-download test data."
4343
exit 1
4444
fi
45-
echo "Test ${i}: ${arrmsg[$i]}"
45+
n=$i
46+
((n++))
47+
echo "Test ${n}: ${arrmsg[$i]}"
4648
fin=${arrins[$i]}.bam
4749
../stringtie ${arrparms[$i]} -o $fout $fin
4850
if [ ! -f $fout ]; then

0 commit comments

Comments
 (0)