You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
[](https://www.nextflow.io/)
8
8
9
-
The TronFlow variant normalization pipeline is part of a collection of computational workflows for tumor-normal pair
9
+
The TronFlow VCF postprocessing pipeline is part of a collection of computational workflows for tumor-normal pair
10
10
somatic variant calling.
11
+
These workflows are implemented in the Nextflow (Di Tommaso, 2017) framework.
11
12
12
13
Find the documentation here [](https://tronflow-docs.readthedocs.io/en/latest/?badge=latest)
13
-
14
14
15
-
This pipeline aims at normalizing variants represented in a VCF into the convened normal form as described in Tan 2015.
16
-
The variant normalization is based on the implementation in vt (Tan 2015) and bcftools (Danecek 2021).
17
-
The pipeline is implemented on the Nextflow (Di Tommaso 2017) framework.
15
+
This pipeline has several objectives:
16
+
* Variant filtering
17
+
* Variant normalization
18
+
* Technical annotations from different BAM files
19
+
* Functional annotations
20
+
21
+
## Variant filtering
22
+
23
+
Optionally, only variants with the value in the column `FILTER` matching the value of parameter `--filter` are kept.
24
+
If this parameter is not used not variants are filtered out. Multiple values can be passed separated by commas without spaces.
25
+
26
+
For instance, `--filter PASS,.` will keep variant having `FILTER=PASS` or `FILTER=.`, but remove all others.
27
+
28
+
29
+
## Variant normalization
30
+
31
+
The normalization step aims to represent variants into the convened normal form as described in Tan 2015.
32
+
The variant normalization is based on the implementation in vt (Tan, 2015) and bcftools (Danecek, 2021).
18
33
19
-
The pipeline consists of the following steps:
20
-
* Variant filtering (optional)
34
+
The normalization pipeline consists of the following steps:
21
35
* Decomposition of MNPs into atomic variants (ie: AC > TG is decomposed into two variants A>T and C>G) (optional).
22
36
* Decomposition of multiallelic variants into biallelic variants (ie: A > C,G is decomposed into two variants A > C and A > G)
23
37
* Trim redundant sequence and left align indels, indels in repetitive sequences can have multiple representations
Each patient can have any number of samples. Any sample can have any number of BAM files, annotations from the
232
+
different BAM files of the same sample will be provided with suffixes _1, _2, etc.
233
+
The aggregated vafator annotations on each sample will also be provided without a suffix.
181
234
182
-
183
235
184
236
## References
185
237
186
238
* Adrian Tan, Gonçalo R. Abecasis and Hyun Min Kang. Unified Representation of Genetic Variants. Bioinformatics (2015) 31(13): 2202-2204](http://bioinformatics.oxfordjournals.org/content/31/13/2202) and uses bcftools [Li, H. (2011). A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics (Oxford, England), 27(21), 2987–2993. 10.1093/bioinformatics/btr509
187
239
* Danecek P, Bonfield JK, Liddle J, Marshall J, Ohan V, Pollard MO, Whitwham A, Keane T, McCarthy SA, Davies RM, Li H. Twelve years of SAMtools and BCFtools. Gigascience. 2021 Feb 16;10(2):giab008. doi: 10.1093/gigascience/giab008. PMID: 33590861; PMCID: PMC7931819.
188
240
* Di Tommaso, P., Chatzou, M., Floden, E. W., Barja, P. P., Palumbo, E., & Notredame, C. (2017). Nextflow enables reproducible computational workflows. Nature Biotechnology, 35(4), 316–319. 10.1038/nbt.3820
241
+
* Cingolani P, Platts A, Wang le L, Coon M, Nguyen T, Wang L, Land SJ, Lu X, Ruden DM. (2012) A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3.". Fly (Austin). 2012 Apr-Jun;6(2):80-92. PMID: 22728672
0 commit comments