You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
[](https://www.nextflow.io/)
@@ -18,6 +18,78 @@ This pipeline has several objectives:
$ nextflow run tron-bioinformatics/tronflow-vcf-postprocessing --help
36
+
37
+
TronFlow VCF normalization v${VERSION}
38
+
39
+
Usage:
40
+
nextflow run main.nf --input_vcfs input_vcfs --reference reference.fasta
41
+
42
+
43
+
Input:
44
+
* --input_vcf: the path to a single VCF to normalize (not compatible with --input_files)
45
+
* --input_vcfs: the path to a tab-separated values file containing in each row the sample name and path to the VCF file (not compatible with --input_vcf)
46
+
The input file does not have header!
47
+
Example input file:
48
+
sample1 /path/to/your/file.vcf
49
+
sample2 /path/to/your/file2.vcf
50
+
* --reference: path to the FASTA genome reference (indexes expected *.fai, *.dict)
51
+
* --vcf-without-ad: indicate when the VCFs to normalize do not have the FORMAT/AD annotation
52
+
53
+
Optional input:
54
+
* --output: the folder where to publish output
55
+
* --skip_decompose_complex: flag indicating not to split complex variants (ie: MNVs and combinations of SNVs and indels)
56
+
* --filter: specify the filter to apply if any (e.g.: PASS), only variants with this value will be kept
57
+
* --input_bams: a tab-separated values file containing in each row the sample name, tumor and normal BAM files for annotation with Vafator
58
+
* --snpeff_organism: the SnpEff organism name (eg: hg19, hg38, GRCh37.75, GRCh38.99)
59
+
* --snpeff_datadir: the SnpEff data folder where the reference genomes were previously downloaded. Required if --snpeff_organism is provided
60
+
* --snpeff_args: additional SnpEff arguments
61
+
62
+
Output:
63
+
* Normalized VCF file
64
+
* Tab-separated values file with the absolute paths to the normalized VCF files, normalized_vcfs.txt
65
+
* Summary statistics before and after normalization
66
+
```
67
+
68
+
### Input tables
69
+
70
+
The table with VCF files expects two tab-separated columns without a header
$ nextflow run tron-bioinformatics/tronflow-variant-normalization --help
181
-
182
-
TronFlow VCF normalization v${VERSION}
183
-
184
-
Usage:
185
-
nextflow run main.nf --input_vcfs input_vcfs --reference reference.fasta
186
-
187
-
188
-
Input:
189
-
* --input_vcf: the path to a single VCF to normalize (not compatible with --input_files)
190
-
* --input_vcfs: the path to a tab-separated values file containing in each row the sample name and path to the VCF file (not compatible with --input_vcf)
191
-
The input file does not have header!
192
-
Example input file:
193
-
sample1 /path/to/your/file.vcf
194
-
sample2 /path/to/your/file2.vcf
195
-
* --reference: path to the FASTA genome reference (indexes expected *.fai, *.dict)
196
-
* --vcf-without-ad: indicate when the VCFs to normalize do not have the FORMAT/AD annotation
197
-
198
-
Optional input:
199
-
* --output: the folder where to publish output
200
-
* --skip_decompose_complex: flag indicating not to split complex variants (ie: MNVs and combinations of SNVs and indels)
201
-
* --filter: specify the filter to apply if any (e.g.: PASS), only variants with this value will be kept
202
-
* --input_bams: a tab-separated values file containing in each row the sample name, tumor and normal BAM files for annotation with Vafator
203
-
204
-
Output:
205
-
* Normalized VCF file
206
-
* Tab-separated values file with the absolute paths to the normalized VCF files, normalized_vcfs.txt
207
-
* Summary statistics before and after normalization
208
-
```
209
-
210
-
### Input tables
211
-
212
-
The table with VCF files expects two tab-separated columns without a header
Each patient can have any number of samples. Any sample can have any number of BAM files, annotations from the
232
-
different BAM files of the same sample will be provided with suffixes _1, _2, etc.
233
-
The aggregated vafator annotations on each sample will also be provided without a suffix.
234
-
235
-
236
238
## References
237
239
238
240
* Adrian Tan, Gonçalo R. Abecasis and Hyun Min Kang. Unified Representation of Genetic Variants. Bioinformatics (2015) 31(13): 2202-2204](http://bioinformatics.oxfordjournals.org/content/31/13/2202) and uses bcftools [Li, H. (2011). A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics (Oxford, England), 27(21), 2987–2993. 10.1093/bioinformatics/btr509
Copy file name to clipboardExpand all lines: nextflow.config
+3Lines changed: 3 additions & 0 deletions
Original file line number
Diff line number
Diff line change
@@ -63,6 +63,9 @@ Optional input:
63
63
* --filter: specify a comma-separated list of filters to apply (e.g.: PASS,.), only variants with these values will be kept. If not provided all varianst are kept
64
64
* --vcf-without-ad: indicate when the VCFs to normalize do not have the FORMAT/AD annotation
65
65
* --input_bams: a tab-separated values file containing in each row the sample name, tumor and normal BAM files for annotation with Vafator
66
+
* --snpeff_organism: the SnpEff organism name (eg: hg19, hg38, GRCh37.75, GRCh38.99)
67
+
* --snpeff_datadir: the SnpEff data folder where the reference genomes were previously downloaded. Required if --snpeff_organism is provided
0 commit comments