Genomic Selection Signature Pipeline using Hp Statistics

Published 2022-07-16
Platform Udemy
Number of Students 1
Price $29.99
Instructors
Dr. Rashid Saif
Subjects

Go to Udemy

Whole-Genome Pooled-Seq Data Analysis using Hp Method

In this course, audience will learn complete bioinformatics pipeline for detecting genomic selection signatures using pooled-seq whole-genome sequencing data with the help of Linux & R. Basics bash command-line and R scripting is being used for running the different steps involved in this pipeline. As there are many statistical methods for detecting positive selective sweeps in the the genome which entirely depends on the type of omics data generated. we used pooled-heterozygosity statistics (Hp) method which is robust in case of pooled-seq dataset with sliding window approach of 150kb.

Trimming of the sequences was performed with Trimmomatic software followed by indexing of the reference genome and mapping with BWA-MEM, sorting and mark duplication steps were performed with Picard tool, then SAMtools was used for making the pileup files, further, PoPoolation2 tools was used to generate the .rc files and to synchronize the files. Finally, in-housed Ruby Hp script was used to find the hitchhiking positive selection pressure in the subject genome. A brief commentary was also given on Hp adopted statistics, moreover, data was prepared and normalized to visualize it on R. Manhattan, qq density plots and histograms were generated and finally Tajima’s D statistics was also applied for analyzing the same data using this classical method of site frequency spectrum.   

Go to Udemy