Genomic and phenotypic characterization of finger millet indicates a complex diversification history

Abstract

Advances in sequencing technologies mean that insights into crop diversification can now be explored in crops beyond major staples. We use a genome assembly of finger millet, an allotetraploid orphan crop, to analyze DArTseq single nucleotide polymorphisms (SNPs) at the whole and sub-genome level. A set of 8778 SNPs and 13 agronomic traits was used to characterize a diverse panel of 423 landraces from Africa and Asia. Through principal component analysis (PCA) and discriminant analysis of principal components, four distinct groups of accessions were identified that coincided with the primary geographic regions of finger millet cultivation. Notably, East Africa, presumed to be the crop's origin, exhibited the lowest genetic diversity. The PCA of phenotypic data also revealed geographic differentiation, albeit with differing relationships among geographic areas than indicated with genomic data. Further exploration of the sub-genomes A and B using neighbor-joining trees revealed distinct features that provide supporting evidence for the complex evolutionary history of finger millet. Although genome-wide association study found only a limited number of significant marker-trait associations, a clustering approach based on the distribution of marker effects obtained from a ridge regression genomic model was employed to investigate trait complexity. This analysis uncovered two distinct clusters. Overall, the findings suggest that finger millet has undergone complex and context-specific diversification, indicative of a lengthy domestication history. These analyses provide insights for the future development of finger millet.

Genome scans capture key adaptation and historical hybridization signatures in tetraploid wheat

Abstract

Tetraploid wheats (Triticum turgidum L.), including durum wheat (T. turgidum ssp. durum (Desf.) Husn.), are important crops with high nutritional and cultural values. However, their production is constrained by sensitivity to environmental conditions. In search of adaptive genetic signatures tracing historical selection and hybridization events, we performed genome scans on two datasets: (1) Durum Global Diversity Panel comprising a total of 442 tetraploid wheat and wild progenitor accessions including durum landraces (n = 286), domesticated emmer (T. turgidum ssp. dicoccum (Schrank) Thell.; n = 103) and wild emmer (T. turgidum ssp. dicoccoides (Korn. ex Asch. & Graebn.) Thell.; n = 53) wheats genotyped using the 90K single nucleotide polymorphism (SNP) array, and (2) a second dataset comprising a total 121 accessions of nine T. turgidum subspecies including wild emmer genotyped with >100 M SNPs from whole-genome resequencing. The genome scan on the first dataset detected six outlier loci on chromosomes 1A, 1B, 3A (n = 2), 6A, and 7A. These loci harbored important genes for adaptation to abiotic stresses, phenological responses, such as seed dormancy, circadian clock, flowering time, and key yield-related traits, including pleiotropic genes, such as HAT1, KUODA1, CBL1, and ZFN1. The scan on the second dataset captured a highly differentiated region on chromosome 2B that shows significant differentiation between two groups: one group consists of Georgian (T. turgidum ssp. paleocolchicum A. Love & D. Love) and Persian (T. turgidum ssp. carthlicum (Nevski) A. Love & D. Love) wheat accessions, while the other group comprises all the remaining tetraploids including wild emmer. This is consistent with a previously reported introgression in this genomic region from T. timopheevii Zhuk. which naturally cohabit in the Georgian and neighboring areas. This region harbored several adaptive genes, including the thermomorphogenesis gene PIF4, which confers temperature-resilient disease resistance and regulates other biological processes. Genome scans can be used to fast-track germplasm housed in gene banks and in situ; which helps to identify environmentally resilient accessions for breeding and/or to prioritize them for conservation.

HaploCatcher: An R package for prediction of haplotypes

Abstract

Wheat (Triticum aestivum L.) is crucial to global food security but is often threatened by diseases, pests, and environmental stresses. Wheat-stem sawfly (Cephus cinctus Norton) poses a major threat to food security in the United States, and solid-stem varieties, which carry the stem-solidness locus (Sst1), are the main source of genetic resistance against sawfly. Marker-assisted selection uses molecular markers to identify lines possessing beneficial haplotypes, like that of the Sst1 locus. In this study, an R package titled “HaploCatcher” was developed to predict specific haplotypes of interest in genome-wide genotyped lines. A training population of 1056 lines genotyped for the Sst1 locus, known to confer stem solidness, and genome-wide markers was curated to make predictions of the Sst1 haplotypes for 292 lines from the Colorado State University wheat breeding program. Predicted Sst1 haplotypes were compared to marker-derived haplotypes. Our results indicated that the training set was substantially predictive, with kappa scores of 0.83 for k-nearest neighbors and 0.88 for random forest models. Forward validation on newly developed breeding lines demonstrated that a random forest model, trained on the total available training data, had comparable accuracy between forward and cross-validation. Estimated group means of lines classified by haplotypes from PCR-derived markers and predictive modeling did not significantly differ. The HaploCatcher package is freely available and may be utilized by breeding programs, using their own training populations, to predict haplotypes for whole-genome sequenced early generation material.

Genetic dissection of domestication traits in interspecific chickpea populations

Abstract

Chickpea (Cicer arietinum) is a pulse crop that provides an integral source of nutrition for human consumption. The close wild relatives Cicer reticulatum and Cicer echinospermum harbor untapped genetic diversity that can be exploited by chickpea breeders to improve domestic varieties. Knowledge of genomic loci that control important chickpea domestication traits will expedite the development of improved chickpea varieties derived from interspecific crosses. Therefore, we set out to identify genomic loci underlying key chickpea domestication traits by both association and quantitative trait locus (QTL) mapping using interspecific F2 populations. Diverse phenotypes were recorded for various agronomic traits. A total of 11 high-confidence markers were detected on chromosomes 1, 3, and 7 by both association and QTL mapping; these were associated with growth habit, flowering time, and seed traits. Furthermore, we identified candidate genes linked to these markers, which advanced our understanding of the genetic basis of domestication traits and validated known genes such as the FLOWERING LOCUS gene cluster that regulates flowering time. Collectively, this study has elucidated the genetic basis of chickpea domestication traits, which can facilitate the development of superior chickpea varieties.

Characterization of the CMS genetic regulation through comparative complete mitochondrial genome sequencing in Nicotiana tabacum

Abstract

Mitochondrial genomes (mitogenomes) of flowering plants vary greatly in structure and size, which can lead to frequent gene mutation, rearrangement, or recombination, then result in the cytoplasmic male sterile (CMS) mutants. In tobacco (Nicotiana tabacum), suaCMS lines are widely used in heterosis breeding; however, the related genetic regulations are not very clear. In this study, the cytological observation indicated that the pollen abortion of tobacco suaCMS(HD) occurred at the very early stage of the stamen primordia differentiation. In this study, the complete mitochondrial genomes of suaCMS(HD) and its maintainer HD were sequenced using the PacBio and Illumina Hiseq technology. The total length of the assembled mitogenomes of suaCMS(HD) and HD was 494,317 bp and 430,694 bp, respectively. Comparative analysis indicated that the expanded 64 K bases in suaCMS(HD) were mainly located in noncoding regions, and 23 and 21 big syntenic blocks (>5000 bp) were found in suaCMS(HD) and HD with a series of repeats. Electron transport chain-related genes were highly conserved in two mitogenomes, except five genes (ATP4, ATP6, COX2, CcmFC, and SDH3) with substantial substitutions. Three suaCMS(HD)-specific genes, orf261, orf291, and orf433, were screened. Sequence analysis and RT-PCR verification showed that they were unique to suaCMS(HD). Further gene location analysis and protein property prediction indicated that all the three genes were likely candidates for suaCMS(HD). This study provides new insight into understanding the suaCMS mechanism and is useful for improving tobacco breeding.

Accelerating genetic gains for quantitative resistance to verticillium wilt through predictive breeding in strawberry

Abstract

Verticillium wilt (VW), a devastating vascular wilt disease of strawberry (Fragaria × ananassa), has caused economic losses for nearly a century. This disease is caused by the soil-borne pathogen Verticillium dahliae, which occurs nearly worldwide and causes disease in numerous agriculturally important plants. The development of VW-resistant cultivars is critically important for the sustainability of strawberry production. We previously showed that a preponderance of the genetic resources (asexually propagated hybrid individuals) preserved in public germplasm collections were moderately to highly susceptible and that genetic gains for increased resistance to VW have been negligible over the last 60 years. To more fully understand the challenges associated with breeding for increased quantitative resistance to this pathogen, we developed and phenotyped a training population of hybrids (n=564$n = 564$) among elite parents with a wide range of resistance phenotypes. When these data were combined with training data from a population of elite and exotic hybrids (n=386$n = 386$), genomic prediction accuracies of 0.47–0.48 were achieved and were predicted to explain 70%–75% of the additive genetic variance for resistance. We concluded that breeding values for resistance to VW can be predicted with sufficient accuracy for effective genomic selection with routine updating of training populations.

Advances and opportunities in unraveling cold‐tolerance mechanisms in the world’s primary staple food crops

Abstract

Temperatures below or above optimal growth conditions are among the major stressors affecting productivity, end-use quality, and distribution of key staple crops including rice (Oryza sativa), wheat (Triticum aestivum), and maize (Zea mays L.). Among temperature stresses, cold stress induces cellular changes that cause oxidative stress and slowdown metabolism, limit growth, and ultimately reduce crop productivity. Perception of cold stress by plant cells leads to the activation of cold-responsive transcription factors and downstream genes, which ultimately impart cold tolerance. The response triggered in crops to cold stress includes gene expression/suppression, the accumulation of sugars upon chilling, and signaling molecules, among others. Much of the information on the effects of cold stress on perception, signal transduction, gene expression, and plant metabolism are available in the model plant Arabidopsis but somewhat lacking in major crops. Hence, a complete understanding of the molecular mechanisms by which staple crops respond to cold stress remain largely unknown. Here, we make an effort to elaborate on the molecular mechanisms employed in response to low-temperature stress. We summarize the effects of cold stress on the growth and development of these crops, the mechanism of cold perception, and the role of various sensors and transducers in cold signaling. We discuss the progress in cold tolerance research at the genome, transcriptome, proteome, and metabolome levels and highlight how these findings provide opportunities for designing cold-tolerant crops for the future.

Identification of sources of resistance to Striga gesnerioides Willd. Vatke among cowpea (Vigna unguiculata [L.] Walp) germplasm from Togo

Abstract

Striga gesnerioides is one of the major biotic constraints to cowpea (Vigna unguiculata [L.] Walp) production throughout West Africa. The best way to control and limit the spread of this weed remains genetic control, but recent works have shown that all cowpea varieties commonly grown in Togo are susceptible to S. gesnerioides. This study aims to investigate the resistance response of cowpea germplasm from Togo to S. gesnerioides. A field test was performed under natural infestation followed by a pots test under artificial infestation. Genotyping was carried out with three molecular markers SSR1, 61RM2 and C42-2B, known for their linkage to S. gesnerioides resistance genes. The results have revealed three cowpea accessions, TG20_66, TG20_82 and TG20_108, phenotypically resistant to S. gesnerioides. All three resistant accessions have shown the RSG3-301 resistance gene presence. Potential resistance gene sources to S. gesnerioides exist among cowpea accessions from Togo.

Rapid analysis of hydrogen cyanide in fresh cassava roots using NIRSand machine learning algorithms: Meeting end user demand for low cyanogenic cassava

Abstract

This study focuses on meeting end-users’ demand for cassava (Manihot esculenta Crantz) varieties with low cyanogenic potential (hydrogen cyanide potential [HCN]) by using near-infrared spectrometry (NIRS). This technology provides a fast, accurate, and reliable way to determine sample constituents with minimal sample preparation. The study aims to evaluate the effectiveness of machine learning (ML) algorithms such as logistic regression (LR), support vector machine (SVM), and partial least squares discriminant analysis (PLS-DA) in distinguishing between low and high HCN accessions. Low HCN accessions averagely scored 1–5.9, while high HCN accessions scored 6–9 on a 1–9 categorical scale. The researchers used 1164 root samples to test different NIRS prediction models and six spectral pretreatments. The wavelengths 961, 1165, 1403–1505, 1913–1981, and 2491 nm were influential in discrimination of low and high HCN accessions. Using selected wavelengths, LR achieved 100% classification accuracy and PLS-DA achieved 99% classification accuracy. Using the full spectrum, the best model for discriminating low and high HCN accessions was the PLS-DA combined with standard normal variate with second derivative, which produced an accuracy of 99.6%. The SVM and LR had moderate classification accuracies of 75% and 74%, respectively. This study demonstrates that NIRS coupled with ML algorithms can be used to identify low and high HCN accessions, which can help cassava breeding programs to select for low HCN accessions.