Abstract
Next-generation sequencing (NGS) technology advancements continue to reduce the cost of high-throughput genome-wide genotyping for breeding and genetics research. Skim sequencing, which surveys the entire genome at low coverage, has become feasible for quantitative trait locus (QTL) mapping and genomic selection in various crops. However, the genome complexity of allopolyploid crops such as wheat (Triticum aestivum L.) still poses a significant challenge for genome-wide genotyping. Targeted sequencing of the protein-coding regions (i.e., exome) reduces sequencing costs compared to whole genome re-sequencing and can be used for marker discovery and genotyping. We developed a method called skim exome capture (SEC) that combines the strengths of these existing technologies and produces targeted genotyping data while decreasing the cost on a per-sample basis compared to traditional exome capture. Specifically, we fragmented genomic DNA using a tagmentation approach, then enriched those fragments for the low-copy genic portion of the genome using commercial wheat exome baits and multiplexed the sequencing at different levels to achieve desired coverage. We demonstrated that for a library of 48 samples, ∼7–8× target coverage was sufficient for high-quality variant detection. For higher multiplexing levels of 528 and 1056 samples per library, we achieved an average coverage of 0.76× and 0.32×, respectively. Combining these lower coverage SEC sequencing data with genotype imputation using a customized wheat practical haplotype graph database that we developed, we identified hundreds of thousands of high-quality genic variants across the genome. The SEC method can be used for high-resolution QTL mapping, genome-wide association studies, genomic selection, and other downstream applications.