You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Allo was written with the following imported packages. All other packages used are built into Python. Especially concerning Tensorflow, it is recommended to create a conda environment specific to this version as Allo may not function without it:
Using Allo requires a few pre-processing steps. In most ChIP pipelines, the default behavior of aligners is to assign multi-mapped reads to random locations within their mappings without retaining information on the other locations. Both Bowtie1/2 and BWA can be used for single-end. Unfortunately, BWA cannot be used for paired-end reads prior to Allo due to constraints in how it outputs multi-mapped reads. The following arguments should be used:
20
26
21
27
*Bowtie1*
@@ -46,7 +52,7 @@ Finally, the output of the aligners must be sorted by read name in order to use
allo ALIGNEROUTPUT_SORT.SAM -seq PAIRED_OR_SINGLE -o OUTPUTNAME -m MIXED_OR_NARROW_PEAKS
@@ -55,13 +61,19 @@ Allo also accepts BAM files as input. See other options below..
55
61
56
62
During each run, Allo will create temporary files as it allocates the data. UM files are reads designated as uniquely mapped (has to be parsed in Bowtie2 or BWA). MM files are unallocated multi-mapped reads. AL files are allocated reads. Checking the size of the AL files during the run will give you an estimate of how many reads have already been allocated at that time.
57
63
58
-
## Post-processing and tips
64
+
Very short test files are supplied to make sure Allo runs to completion on your machine. Imports can take a minute so be patient. Using the paired-end example:
65
+
66
+
```
67
+
allo testRunPE.sam -seq pe
68
+
```
69
+
70
+
### Post-processing and tips
59
71
Allo adds a ZA tag to every MMR that is allocated. For reads that are allocated to regions that all contain 0 UMRs (random assignment), a ZZ tag is used instead. This allows users to remove reads that only map to zero UMR regions if they wish. The value within either tag corresponds to the number of places a read/pair mapped to. In order to get only uniquely mapped reads, grep could be used with the -v option to exclude lines with ZA or ZZ tags. On the same note, awk can used to filter reads with a specific number of mapping locations (can also be done with the -max option within Allo). Outside of adding these tags, Allo does not change anything within the read alignment columns for allocated reads.
60
72
61
73
Tip: It is recommended to run Allo on both the control and target sequencing files in order to balance out background in the samples. We recommend running Allo using the --random argument on the control file. This generally results in higher confidence peaks.
62
74
63
75
64
-
## Options
76
+
###Options
65
77
| Argument | Options | Explanation |
66
78
| ------------- | ------------- | ------------- |
67
79
| -o | any string | Output file name |
@@ -79,8 +91,5 @@ Tip: It is recommended to run Allo on both the control and target sequencing fil
79
91
| --parser || Allo utility that produces separate files for unique and multi-mapped reads from a SAM or BAM file. Bowtie2 and BWA will output all alignments that meet the given threshold even if one alignment has the highest score. |
80
92
81
93
82
-
## Test files
83
-
Very short test files are supplied to make sure Allo runs to completion on your machine. Imports can take a minute so be patient.
84
-
85
94
## Contact information
86
95
Please contact Alexis Morrissey ([email protected]) with any questions or issues concerning Allo.
0 commit comments