Skip to content

Commit eb6868a

Browse files
committed
document phasing
1 parent aff0446 commit eb6868a

2 files changed

Lines changed: 20 additions & 0 deletions

File tree

README.md

Lines changed: 19 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -268,6 +268,25 @@ These annotations are provided by VAFator (https://github.com/TRON-Bioinformatic
268268

269269
No technical annotations are performed if the parameter `--input_bams` is not passed.
270270

271+
## Phasing with WhatsHap
272+
273+
The phase of the mutations can be inferred from the reads pileup.
274+
This information is relevant to determine the impact of nearby mutations.
275+
WhatsHap adds the phasing information when possible to the VCF following the VCF specification for that purpose.
276+
Genotypes with phase information use the `|` instead of the `/` and `0|1` and `1|0` represent the two different phases.
277+
A homozygous mutations in phase is represented as `1|1`.
278+
Furthermore, because phasing may be incomplete, to define the different phased blocks the INFO/PS annotation used.
279+
All mutations belonging to the same phase block share the same PS number.
280+
281+
BAM files need to be provided to perform phasing with `--input_bams` and the phasing sample has to be determined with
282+
`--phasing_sample`.
283+
284+
This approach has some limitations:
285+
- We only support diploid genotyped VCF files. WhatsHap does support polyploid samples. But the particular case of
286+
somatic mutations is not supported.
287+
- The sample name chosen to perform the phasing has to be the same across all VCFs.
288+
Not to be mistaken with the patient name. Eg: only `normal` samples across all patients can be used for phasing
289+
271290
## Functional annotations
272291

273292
The functional annotations provide a biological context for every variant. Such as the overlapping genes or the effect

nextflow.config

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -87,6 +87,7 @@ Optional input:
8787
* --snpeff_memory: for some samples SnpEff may require to use more memory (default: 3g)
8888
* --mapping_quality: VAFator minimum mapping quality (default: 0)
8989
* --base_call_quality: VAFator minimum base call quality (default: 0)
90+
* --phasing-sample: enables phasing with whatshap and defines the sample BAM to phase the VCF (BAMs need to be provided)
9091
9192
Output:
9293
* Normalized VCF file

0 commit comments

Comments
 (0)