Variant analysis pipeline for accurate detection of genomic variants from transcriptome sequencing data

doi:10.1371/journal.pone.0216838

Fig 1.

Flow chart of the VAP workflow.

FastQ files are QC using FastQC, mapped using three aligners. BAM files are pre-processed by Picard and GATK, then merged, annotated and filtered to achieve high-confident SNPs.

More »

Expand

Table 1.

Criteria used in the VAP filtering workflow.

More »

Expand

Table 2.

Summary from the multiple aligners; read mapping statistics and variant calls.

More »

Expand

Fig 2.

Comparison of RNA-seq SNPs identified in the different mapping tools.

More »

Expand

Fig 3.

Comparison of RNA-seq SNPs found in either dbSNP or WGS.

More »

Expand

Fig 4.

The mutational profile of RNA-seq variants.

More »

Expand

Fig 5.

Comparison of SNPs identified as homozygous and heterozygous in RNA-seq.

More »

Expand

Table 3.

SNPs belonging to different annotation categories.

More »

Expand

Fig 6.

Overlap of SNPs found in coding regions from RNA-seq and WGS.

66% of the coding variants identified in WGS data were found in RNA-seq. However, the remaining WGS coding variants were not detected as a result of either: lack of expression/transcription (“no transcription”), the position was homozygous in RNA (“no variation”), “found but filtered” signifying that the position was detected but removed by one of our filtering steps, or “filtered” which indicates the position was heterozygous but filtered because it didn’t meet the default parameters for variant detection.

More »