Identification of SSR markers linked to new fertility restoration trait in sorghum (Sorghum bicolor (L.) Moench) for A4 (maldandi) male sterile cytoplasm

Abstract

Identification of markers associated with fertility restoration (Rf) genes is essential because they can streamline the breeding of new CMS lines and production of commercial hybrid seeds. Therefore, in the present study, F2 populations (M31-2A × DSMR 8) was utilized to identify markers linked to Rf loci on maldandi (A4) cytoplasm through bulk segregant analysis (BSA). The F2 population was analysed for seed set percentage. Chi-square (χ2) analysis showed that the fertility restoration trait followed expected digenic ratio. By BSA, simple sequence repeats (SSRs) markers, namely, Xtxp 34 and Xtxp 69 located on chromosome 3 and SB 3956 and Xtxp 312 located on chromosome 7, showed clear polymorphism between two groups of fertile and sterile bulks. The genomic region harbouring Rf locus on chromosome 3 (2.61 Mbp) predicted to encode five pentatricopeptide repeat (PPR) genes whereas, on chromosome 7, the gene SORBI_3007G047400 predicted to encode MYB (myeloblastosis) domain containing proteins. These predicted genes could be the candidate for restoring fertility on A4 cytoplasm. This finding will be fundamental in the production and rapid selection of novel restorer lines.

Multivariate analysis of the effects of weather variables on white rust epidemics and yield reduction of mustard over multiple growing seasons

Multivariate analysis of the effects of weather variables on white rust epidemics and yield reduction of mustard over multiple growing seasons

Present work illustrates the relevance of weather variables in predicting multiple epidemiological variables, and of multiple disease variables as predictors of actual crop yield.


Abstract

Three shrinkage regression and two machine-learning approaches were evaluated to derive models for the prediction of epidemiological characteristics of white rust of mustard, using data from 112 epidemics in the field. Four epidemiological characteristics were considered: (a) crop age at first appearance of disease, (b) crop age at highest disease severity, (c) highest disease severity in a growing season and (d) area under disease progress curve (AUDPC), along with (e) crop yield to measure the effects of disease on crop performance. We developed models using weather indices to predict these variables using five different approaches: ANN, Elastic Net, LASSO, random forest and ridge regression. One model was developed for each sowing date corresponding to each dependent variable. Two hundred different models were developed. All models performed well at the calibration stage for most of the five variables at all sowing dates. However, at the validation stage, ANN-derived models outperformed (R 2val ~ 1.00, nRMSEV ~0.00 and MBEV ~0.00 in most cases) the three shrinkage regression-derived models in predicting all five variables. Predictions by random forest- and LASSO-derived models were acceptable for AUDPC and crop yield. Evaluation metrics (including R 2val, nRMSEV and MBEV) suggested that ENET- and ridge-derived models do not perform satisfactorily, whereas ANN-derived models yielded reliable results and thus generate robust predictions. The present work constitutes a systematic effort to compare modelling methods for disease and yield prediction and illustrates the relevance of weather variables in predicting multiple epidemiological variables, and of multiple disease variables as predictors of actual crop yield.

Genomic‐wide identification and expression analysis of AP2/ERF transcription factors in Zanthoxylum armatum reveals the candidate genes for the biosynthesis of terpenoids

Abstract

Terpenoids are the main active components in the Zanthoxylum armatum leaves, which have extensive medicinal value. The Z. armatum leaf is the main by-product in the Z. armatum industry. However, the transcription factors involved in the biosynthesis of terpenoids are rarely reported. This study was performed to identify and classify the APETALA2/ethylene-responsive factor (AP2/ERF) gene family of Z. armatum. The chromosome distribution, gene structure, conserved motifs, and cis-acting elements of the promoter of the species were also comprehensively analyzed. A total of 214 ZaAP2/ERFs were identified. From the obtained transcriptome and terpenoid content data, four candidate ZaAP2/ERFs involved in the biosynthesis of terpenoids were selected via correlation and weighted gene co-expression network analysis. A phylogenetic tree was constructed using 13 AP2/ERFs related to the biosynthesis of terpenoids in other plants. ZaERF063 and ZaERF166 showed close evolutionary relationships with the ERFs in other plant species and shared a high AP2-domain sequence similarity with the two closest AP2/ERF proteins, namelySmERF8 from Salvia miltiorrhiza and AaERF4 from Artemisia annua. Further investigation into the effects of methyl jasmonate (MeJA) treatment on the content of terpenoids in Z. armatum leaves revealed that MeJA significantly induced the upregulation of ZaERF166 and led to a significant increase in the terpenoids content in Z. armatum leaves, indicating that ZaERF166 might be involved in the accumulation of terpenoids of Z. armatum. Results will be beneficial for the functional characterization of AP2/ERFs in Z. armatum and establishment of the theoretical foundation to increase the production of terpenoids via the manipulation of the regulatory elements and strengthen the development and utilization of Z. armatum leaves.

Genome‐wide identification and expression analysis of heat shock protein gene family in cassava

Abstract

Heat shock proteins are important molecular chaperones that are involved in plant growth and stress responses. However, members of the Hsp family have been poorly studied in cassava. In this study, 225 MeHsp genes were identified in the cassava genome, and their genetic structures exhibited relatively conserved features within each subfamily. The 225 MeHsp genes showed random chromosomal distribution, and at least 74 pairs of segmentally duplicated MeHsp genes. Eleven tandemly duplicated MeHsp genes were identified. Cis-element analysis revealed the importance of MeHsps in plant adaptations to the environment. The prediction of protein interactions suggested that MeHsp70-20 may play a critical regulatory role in the interactive network. Furthermore, the expression profiles of MeHsps in different tissues and cell subsets were analyzed using bulk transcriptomics and single-cell transcriptomic data. Several subfamily genes exhibited unique expression patterns in the transcriptome and were selected for detailed analysis of the single-cell transcriptome. Quantitative real-time polymerase chain reaction (qRT-PCR) revealed the expression patterns of these genes under temperature stress, further supporting the prediction of cis-acting elements. This study provides valuable information for understanding the functional characteristics of MeHsp genes and the evolutionary relationships between MeHsps.

Genomic analysis and characterization of new loci associated with seed protein and oil content in soybeans

Abstract

Breeding for increased protein without a reduction in oil content in soybeans [Glycine max (L.) Merr.] is a challenge for soybean breeders but an expected goal. Many efforts have been made to develop new soybean varieties with high yield in combination with desirable protein and/or oil traits. An elite line, R05-1415, was reported to be high yielding, high protein, and low oil. Several significant quantitative trait loci (QTL) for protein and oil were reported in this line, but many of them were unstable across environments or genetic backgrounds. Thus, a new study under multiple field environments using the Infinium BARCSoySNP6K BeadChips was conducted to detect and confirm stable genomic loci for these traits. Genetic analyses consistently detected a single major genomic locus conveying these two traits with remarkably high phenotypic variation explained (R 2), varying between 24.2% and 43.5%. This new genomic locus is located between 25.0 and 26.7 Mb, distant from the previously reported QTL and did not overlap with other commonly reported QTL and the recently cloned gene Glyma.20G085100. Homolog analysis indicated that this QTL did not result from the paracentric chromosome inversion with an adjacent genomic fragment that harbors the reported QTL. The pleiotropic effect of this QTL could be a challenge for improving protein and oil simultaneously; however, a further study of four candidate genes with significant expressions in the seed developmental stages coupled with haplotype analysis may be able to pinpoint causative genes. The functionality and roles of these genes can be determined and characterized, which lay a solid foundation for the improvement of protein and oil content in soybeans.

Mapping QTL for vernalization requirement identified adaptive divergence of the candidate gene Flowering Locus C in polyploid Camelina sativa

Abstract

Vernalization requirement is an integral component of flowering in winter-type plants. The availability of winter ecotypes among Camelina species facilitated the mapping of quantitative trait loci (QTL) for vernalization requirement in Camelina sativa. An inter and intraspecific crossing scheme between related Camelina species, where one spring and two different sources of winter-type habit were used, resulted in the development of two segregating populations. Linkage maps generated with sequence-based markers identified three QTLs associated with vernalization requirement in C. sativa; two from the interspecific (chromosomes 13 and 20) and one from the intraspecific cross (chromosome 8). Notably, the three loci were mapped to different homologous regions of the hexaploid C. sativa genome. All three QTLs were found in proximity to Flowering Locus C (FLC), variants of which have been reported to affect the vernalization requirement in plants. Temporal transcriptome analysis for winter-type Camelina alyssum demonstrated reduction in expression of FLC on chromosomes 13 and 20 during cold treatment, which would trigger flowering, since FLC would be expected to suppress floral initiation. FLC on chromosome 8 also showed reduced expression in the C. sativa ssp. pilosa winter parent upon cold treatment, but was expressed at very high levels across all time points in the spring-type C. sativa. The chromosome 8 copy carried a deletion in the spring-type line, which could impact its functionality. Contrary to previous reports, all three FLC loci can contribute to controlling the vernalization response in C. sativa and provide opportunities for manipulating this requirement in the crop.

A medium‐density genotyping platform for cultivated strawberry using DArTag technology

Abstract

Genomic prediction in breeding populations containing hundreds to thousands of parents and seedlings is prohibitively expensive with current high-density genetic marker platforms designed for strawberry. We developed mid-density panels of molecular inversion probes (MIPs) to be deployed with the “DArTag” marker platform to provide a low-cost, high-throughput genotyping solution for strawberry genomic prediction. In total, 7742 target single nucleotide polymorphism (SNP) regions were used to generate MIP assays that were tested with a screening panel of 376 octoploid Fragaria accessions. We evaluated the performance of DArTag assays based on genotype segregation, amplicon coverage, and their ability to produce subgenome-specific amplicon alignments to the FaRR1 assembly and subsequent alignment-based variant calls with strong concordance to DArT's alignment-free, count-based genotype reports. We used a combination of marker performance metrics and physical distribution in the FaRR1 assembly to select 3K and 5K production panels for genotyping of large strawberry populations. We show that the 3K and 5K DArTag panels are able to target and amplify homologous alleles within subgenomic sequences with low-amplification bias between reference and alternate alleles, supporting accurate genotype calling while producing marker genotypes that can be treated as functionally diploid for quantitative genetic analysis. The 3K and 5K target SNPs show high levels of polymorphism in diverse F. × ananassa germplasm and UC Davis cultivars, with mean pairwise diversity (π) estimates of 0.40 and 0.32 and mean heterozygous genotype frequencies of 0.35 and 0.33, respectively.