There are a few key considerations when analyzing sequencing data generated from the xGen HS EGFR Pathway Amplicon Panel with unique molecular identifiers (UMIs).
The first 10 bases in front of Read 2 constitute a UMI. For these first ten bases we recommend trimming them with Trimmomatic (using the CROP option) to make an MID/UMI fastq file for use with the MID pipeline from the fgbio package (Fulcrum Genomics). Before aligning the reads, make sure that the 10 bp UMI (which contains random bases) has been trimmed from 5’ of Read 2.
Also, check that adapter trimming is enabled while setting up the sequencing run. Alternatively, adapter trimming can be performed bioinformatically before analysis.
xGen Custom Amplicon Panels are designed with overlapping amplicons to provide contiguous regions of coverage in a single-tube format. Synthetic primer sequences will be encountered both at the beginning and end of some reads which must be trimmed during analysis. This can be done using a publicly available tool called Primerclip. See our app note Primerclip—A Tool for Trimming Primer Sequences for detailed information.
For more advice, reach out to our Scientific Application Supports team.
Note: A target BED file is provided with purchase of the xGen HS EGFR Pathway Amplicon Panel or the xGen Custom Amplicon Panel.