- Generate the shell scripts for all samples:
shell> perl fastq2vcf.v9.pl -d dataTable.txt -c config.ini -e cluster.ini -o $PWD
Note: After running this step, you will generate a series of shell scripts for this run.
Run QC and Mapping:
shell> qsub QC_Mapping_sample1_RG1_SRR504483.sh
shell> qsub QC_Mapping_sample2_RG1_SRR504515.sh
shell> qsub QC_Mapping_sample3_RG1_SRR504516.sh
shell> qsub QC_Mapping_sample4_RG1_SRR504517.sh
shell> qsub QC_Mapping_sample5_RG1_SRR524806.sh
- When those scripts have finished executing, mark duplicates, realignments, quality recalibration and data compression:
shell> qsub PreCalling_sample1.sh
shell> qsub PreCalling_sample2.sh
shell> qsub PreCalling_sample3.sh
shell> qsub PreCalling_sample4.sh
shell> qsub PreCalling_sample5.sh
- When those scripts have finished executing, perform multi-sample variant calling: we use four variant callers to call variants.
shell> qsub variant.HaplotypeCaller.sh
shell> qsub variant.samtools.sh
shell> qsub variant.SNVer.sh
shell> qsub variant.UnifiedGenotyper.sh
Note: After running this step, you will get all the annotated variants for all samples from each caller.
- Finally, run variant summary:
shell> qsub variant.summary.sh
Note: After running this step, you will get the overlapped variant call set among 4 callers.