Biological replicates are merged directly after filtering. Peak visualization is performed by a custom Gaussian smoother, which creates a Gaussian distribution around the midpoint of each read. Locations with more overlapping reads are thus represented by larger peaks.
The visualization output tracks are bedGraph files, which are converted into bigWig format using bedGraphToBigWig.
The final step of the pipeline is peak calling using SEACR version 1.3. The bedGraph files processed from both experimental and control experiments are needed to call peaks. In particular:
SEACR_1.3.sh <exp_bedgraph> <ctl_bedgraph> <norm> <stringency>
norm indicates the normalization experimental and control data and can be either "norm" (normalization occurs) or "non" (normalization does not occur). The default for this pipeline is "norm".
stringency specifies the threshold for peak calling. The default for this pipeline is relaxed, which compares the knee and peak of the curve, rather than stringent, which uses the peak of the curve only.
SEACR output is a bed file containing peaks. Each peak is detailed by its chromosome, start and end coordinates, total signal, and the location and value of the maximum signal within its coordinates.
Peak calls can be viewed (alongside bigwig tracks) using HiGlass.
SEACR is designed specifically for CUT&RUN experiments. Learn more about SEACR from the SEACR GitHub page.