Genome‐wide development of intra‐ and inter‐specific transferable SSR markers and construction of a dynamic web resource for yam molecular breeding: Y2MD

Abstract

Microsatellite markers are widely used in population genetics and breeding. Despite the economic significance of yams in developing countries, there is a paucity of microsatellite markers, and as of now, no comprehensive microsatellite marker database exists. In this study, we conducted genome-wide microsatellite marker development across four yam species, identified cross-species transferable markers, and designed an easy-to-use web portal for the yam researchers. The screening of Dioscorea alata, Dioscorea rotundata, Dioscorea dumetorum, and Dioscorea zingiberensis genomes resulted in 318,713, 322,501, 307,040, and 253,856 microsatellites, respectively. Mono-, di-, and tri-nucleotides were the most important types of repeats in the different species, and a total of 864,128 primer pairs were designed. Furthermore, we identified 1170 cross-species transferable microsatellite markers. Among them, 17 out of 18 randomly selected were experimentally validated with good discriminatory power, regardless of the species and ploidy levels. Ultimately, we created and deployed a dynamic Yam Microsatellite Markers Database (Y2MD) available at https://y2md.ucad.sn/. Y2MD is embedded with various useful tools such as JBrowse, Blast, insilicoPCR, and SSR Finder to facilitate the exploitation of microsatellite markers in yams. This study represents the first comprehensive microsatellite marker mining across several yam species and will contribute to advancing yam genetic research and marker-assisted breeding. The released user-friendly database constitutes a valuable platform for yam researchers.

Genome‐wide association mapping for field spot blotch resistance in South Asian spring wheat genotypes

Abstract

Spot blotch caused by Bipolaris sorokiniana ((Sacc.) Shoemaker) (teleomorph: Cochliobolus sativus [Ito and Kuribayashi] Drechsler ex Dastur) is an economically important disease of warm and humid regions. The present study focused on identifying resistant genotypes and single-nucleotide polymorphism (SNP) markers associated with spot blotch resistance in a panel of 174 bread spring wheat lines using field screening and genome-wide association mapping strategies. Field experiments were conducted in Agua Fria, Mexico, during the 2019–2020 and 2020–2021 cropping seasons. A wide range of phenotypic variation was observed among genotypes tested during both years. Twenty SNP markers showed significant association with spot blotch resistance on 15 chromosomes, namely, 1A, 1B, 2A, 2B, 2D, 3A, 3B, 4B, 4D, 5A, 5B, 6A, 6B, 7A, and 7B. Of these, two consistently significant SNPs on 5A, TA003225-0566 and TA003225-1427, may represent a new resistance quantitative trait loci. Further, in the proximity of Tsn1 on 5B, AX-94435238 was the most stable and consistent in both years. The identified genomic regions could be deployed to develop spot blotch-resistant genotypes, particularly in the spot blotch-vulnerable wheat growing areas.

Genomic‐wide identification and expression analysis of AP2/ERF transcription factors in Zanthoxylum armatum reveals the candidate genes for the biosynthesis of terpenoids

Abstract

Terpenoids are the main active components in the Zanthoxylum armatum leaves, which have extensive medicinal value. The Z. armatum leaf is the main by-product in the Z. armatum industry. However, the transcription factors involved in the biosynthesis of terpenoids are rarely reported. This study was performed to identify and classify the APETALA2/ethylene-responsive factor (AP2/ERF) gene family of Z. armatum. The chromosome distribution, gene structure, conserved motifs, and cis-acting elements of the promoter of the species were also comprehensively analyzed. A total of 214 ZaAP2/ERFs were identified. From the obtained transcriptome and terpenoid content data, four candidate ZaAP2/ERFs involved in the biosynthesis of terpenoids were selected via correlation and weighted gene co-expression network analysis. A phylogenetic tree was constructed using 13 AP2/ERFs related to the biosynthesis of terpenoids in other plants. ZaERF063 and ZaERF166 showed close evolutionary relationships with the ERFs in other plant species and shared a high AP2-domain sequence similarity with the two closest AP2/ERF proteins, namelySmERF8 from Salvia miltiorrhiza and AaERF4 from Artemisia annua. Further investigation into the effects of methyl jasmonate (MeJA) treatment on the content of terpenoids in Z. armatum leaves revealed that MeJA significantly induced the upregulation of ZaERF166 and led to a significant increase in the terpenoids content in Z. armatum leaves, indicating that ZaERF166 might be involved in the accumulation of terpenoids of Z. armatum. Results will be beneficial for the functional characterization of AP2/ERFs in Z. armatum and establishment of the theoretical foundation to increase the production of terpenoids via the manipulation of the regulatory elements and strengthen the development and utilization of Z. armatum leaves.

Impact of genotype‐calling methodologies on genome‐wide association and genomic prediction in polyploids

Abstract

Discovery and analysis of genetic variants underlying agriculturally important traits are key to molecular breeding of crops. Reduced representation approaches have provided cost-efficient genotyping using next-generation sequencing. However, accurate genotype calling from next-generation sequencing data is challenging, particularly in polyploid species due to their genome complexity. Recently developed Bayesian statistical methods implemented in available software packages, polyRAD, EBG, and updog, incorporate error rates and population parameters to accurately estimate allelic dosage across any ploidy. We used empirical and simulated data to evaluate the three Bayesian algorithms and demonstrated their impact on the power of genome-wide association study (GWAS) analysis and the accuracy of genomic prediction. We further incorporated uncertainty in allelic dosage estimation by testing continuous genotype calls and comparing their performance to discrete genotypes in GWAS and genomic prediction. We tested the genotype-calling methods using data from two autotetraploid species, Miscanthus sacchariflorus and Vaccinium corymbosum, and performed GWAS and genomic prediction. In the empirical study, the tested Bayesian genotype-calling algorithms differed in their downstream effects on GWAS and genomic prediction, with some showing advantages over others. Through subsequent simulation studies, we observed that at low read depth, polyRAD was advantageous in its effect on GWAS power and limit of false positives. Additionally, we found that continuous genotypes increased the accuracy of genomic prediction, by reducing genotyping error, particularly at low sequencing depth. Our results indicate that by using the Bayesian algorithm implemented in polyRAD and continuous genotypes, we can accurately and cost-efficiently implement GWAS and genomic prediction in polyploid crops.

Editorial Board

Editorial Board

On the cover: Rice grains depicting health benefits such as low glycemic index, pigmented rice, and low chalk, as well as long slender grains which capture the major market share. Read the introduction to the special issue “Grain Quality and Nutritional Genomics for Breeding Next Generation Crops” lead by guest editors Drs. Manish Pandey, Reyazul Rouf Mir, and Nese Sreenivasulu: https://doi.org/10.1002/tpg2.20396. Photo credit: Nese Sreenivasulu, IRRI.


Soybean genetics, genomics, and breeding for improving nutritional value and reducing antinutritional traits in food and feed

Abstract

Soybean [Glycine max (L.) Merr.] is a globally important crop due to its valuable seed composition, versatile feed, food, and industrial end-uses, and consistent genetic gain. Successful genetic gain in soybean has led to widespread adaptation and increased value for producers, processors, and consumers. Specific focus on the nutritional quality of soybean seed composition for food and feed has further elucidated genetic knowledge and bolstered breeding progress. Seed components are historical and current targets for soybean breeders seeking to improve nutritional quality of soybean. This article reviews genetic and genomic foundations for improvement of nutritionally important traits, such as protein and amino acids, oil and fatty acids, carbohydrates, and specific food-grade considerations; discusses the application of advanced breeding technology such as CRISPR/Cas9 in creating seed composition variations; and provides future directions and breeding recommendations regarding soybean seed composition traits.

A medium‐density genotyping platform for cultivated strawberry using DArTag technology

Abstract

Genomic prediction in breeding populations containing hundreds to thousands of parents and seedlings is prohibitively expensive with current high-density genetic marker platforms designed for strawberry. We developed mid-density panels of molecular inversion probes (MIPs) to be deployed with the “DArTag” marker platform to provide a low-cost, high-throughput genotyping solution for strawberry genomic prediction. In total, 7742 target single nucleotide polymorphism (SNP) regions were used to generate MIP assays that were tested with a screening panel of 376 octoploid Fragaria accessions. We evaluated the performance of DArTag assays based on genotype segregation, amplicon coverage, and their ability to produce subgenome-specific amplicon alignments to the FaRR1 assembly and subsequent alignment-based variant calls with strong concordance to DArT's alignment-free, count-based genotype reports. We used a combination of marker performance metrics and physical distribution in the FaRR1 assembly to select 3K and 5K production panels for genotyping of large strawberry populations. We show that the 3K and 5K DArTag panels are able to target and amplify homologous alleles within subgenomic sequences with low-amplification bias between reference and alternate alleles, supporting accurate genotype calling while producing marker genotypes that can be treated as functionally diploid for quantitative genetic analysis. The 3K and 5K target SNPs show high levels of polymorphism in diverse F. × ananassa germplasm and UC Davis cultivars, with mean pairwise diversity (π) estimates of 0.40 and 0.32 and mean heterozygous genotype frequencies of 0.35 and 0.33, respectively.

Mapping QTL for vernalization requirement identified adaptive divergence of the candidate gene Flowering Locus C in polyploid Camelina sativa

Abstract

Vernalization requirement is an integral component of flowering in winter-type plants. The availability of winter ecotypes among Camelina species facilitated the mapping of quantitative trait loci (QTL) for vernalization requirement in Camelina sativa. An inter and intraspecific crossing scheme between related Camelina species, where one spring and two different sources of winter-type habit were used, resulted in the development of two segregating populations. Linkage maps generated with sequence-based markers identified three QTLs associated with vernalization requirement in C. sativa; two from the interspecific (chromosomes 13 and 20) and one from the intraspecific cross (chromosome 8). Notably, the three loci were mapped to different homologous regions of the hexaploid C. sativa genome. All three QTLs were found in proximity to Flowering Locus C (FLC), variants of which have been reported to affect the vernalization requirement in plants. Temporal transcriptome analysis for winter-type Camelina alyssum demonstrated reduction in expression of FLC on chromosomes 13 and 20 during cold treatment, which would trigger flowering, since FLC would be expected to suppress floral initiation. FLC on chromosome 8 also showed reduced expression in the C. sativa ssp. pilosa winter parent upon cold treatment, but was expressed at very high levels across all time points in the spring-type C. sativa. The chromosome 8 copy carried a deletion in the spring-type line, which could impact its functionality. Contrary to previous reports, all three FLC loci can contribute to controlling the vernalization response in C. sativa and provide opportunities for manipulating this requirement in the crop.

Genomic analysis and characterization of new loci associated with seed protein and oil content in soybeans

Abstract

Breeding for increased protein without a reduction in oil content in soybeans [Glycine max (L.) Merr.] is a challenge for soybean breeders but an expected goal. Many efforts have been made to develop new soybean varieties with high yield in combination with desirable protein and/or oil traits. An elite line, R05-1415, was reported to be high yielding, high protein, and low oil. Several significant quantitative trait loci (QTL) for protein and oil were reported in this line, but many of them were unstable across environments or genetic backgrounds. Thus, a new study under multiple field environments using the Infinium BARCSoySNP6K BeadChips was conducted to detect and confirm stable genomic loci for these traits. Genetic analyses consistently detected a single major genomic locus conveying these two traits with remarkably high phenotypic variation explained (R 2), varying between 24.2% and 43.5%. This new genomic locus is located between 25.0 and 26.7 Mb, distant from the previously reported QTL and did not overlap with other commonly reported QTL and the recently cloned gene Glyma.20G085100. Homolog analysis indicated that this QTL did not result from the paracentric chromosome inversion with an adjacent genomic fragment that harbors the reported QTL. The pleiotropic effect of this QTL could be a challenge for improving protein and oil simultaneously; however, a further study of four candidate genes with significant expressions in the seed developmental stages coupled with haplotype analysis may be able to pinpoint causative genes. The functionality and roles of these genes can be determined and characterized, which lay a solid foundation for the improvement of protein and oil content in soybeans.

Genome‐wide identification and expression analysis of heat shock protein gene family in cassava

Abstract

Heat shock proteins are important molecular chaperones that are involved in plant growth and stress responses. However, members of the Hsp family have been poorly studied in cassava. In this study, 225 MeHsp genes were identified in the cassava genome, and their genetic structures exhibited relatively conserved features within each subfamily. The 225 MeHsp genes showed random chromosomal distribution, and at least 74 pairs of segmentally duplicated MeHsp genes. Eleven tandemly duplicated MeHsp genes were identified. Cis-element analysis revealed the importance of MeHsps in plant adaptations to the environment. The prediction of protein interactions suggested that MeHsp70-20 may play a critical regulatory role in the interactive network. Furthermore, the expression profiles of MeHsps in different tissues and cell subsets were analyzed using bulk transcriptomics and single-cell transcriptomic data. Several subfamily genes exhibited unique expression patterns in the transcriptome and were selected for detailed analysis of the single-cell transcriptome. Quantitative real-time polymerase chain reaction (qRT-PCR) revealed the expression patterns of these genes under temperature stress, further supporting the prediction of cis-acting elements. This study provides valuable information for understanding the functional characteristics of MeHsp genes and the evolutionary relationships between MeHsps.