Whole‐genome sequencing based discovery of candidate genes and diagnostic markers for seed weight in groundnut

Abstract

Seed weight in groundnut (Arachis hypogaea L.) has direct impact on yield as well as market price because of preference for bold seeds by consumers and industry, thereby making seed-size improvement as one of the most important objectives of groundnut breeding programs globally. Marker-based early generation selection can accelerate the process of breeding for developing large-seeded varieties. In this context, we deployed the quantitative trait locus-sequencing (QTL-seq) approach on a biparental mapping population (Chico × ICGV 02251) to identify candidate genes and develop markers for seed weight in groundnut. A total of 289.4–389.4 million reads sequencing data were generated from three libraries (ICGV 02251 and two extreme bulks) achieving 93.9–95.1% genome coverage and 8.34–9.29× average read depth. The analysis of sequencing data using QTL-seq pipeline identified five genomic regions (three on chromosome B06 and one each on chromosomes B08 and B09) for seed weight. Detailed analysis of above associated genomic regions detected 182 single-nucleotide polymorphisms (SNPs) in genic and intergenic regions, and 11 of these SNPs were nonsynonymous in the genomic regions of 10 candidate genes including Ulp proteases and BIG SEED locus genes. Kompetitive allele specific polymerase chain reaction (KASP) markers for 14 SNPs were developed, and four of these markers (snpAH0031, snpAH0033, snpAH0037, and snpAH0038) were successfully validated for deployment in breeding for large-seeded groundnut varieties.

The trans‐kingdom communication of noncoding RNAs in plant–environment interactions

Abstract

As conserved regulatory agents, noncoding RNAs (ncRNAs) have an important impact on many aspects of plant life, including growth, development, and environmental response. Noncoding RNAs can travel through not only plasmodesma and phloem but also intercellular barriers to regulate distinct processes. Increasing evidence shows that the intercellular trans-kingdom transmission of ncRNAs is able to modulate many important interactions between plants and other organisms, such as plant response to pathogen attack, the symbiosis between legume plants and rhizobia and the interactions with parasitic plants. In these interactions, plant ncRNAs are believed to be sorted into extracellular vesicles (EVs) or other nonvesicular vehicles to pass through cell barriers and trigger trans-kingdom RNA interference (RNAi) in recipient cells from different species. There is evidence that the features of extracellular RNAs and associated RNA-binding proteins (RBPs) play a role in defining the RNAs to retain in cell or secrete outside cells. Despite the few reports about RNA secretion pathway in plants, the export of extracellular ncRNAs is orchestrated by a series of pathways in plants. The identification and functional analysis of mobile small RNAs (sRNAs) are attracting increasing attention in recent years. In this review, we discuss recent advances in our understanding of the function, sorting, transport, and regulation of plant extracellular ncRNAs.

Insights into the roles of long noncoding RNAs in the communication between plants and the environment

Abstract

In addition to coding proteins, RNA molecules, especially long noncoding RNAs (lncRNAs), have well-established functions in regulating gene expression. The number of studies focused on the roles played by different types of lncRNAs in a variety of plant biological processes has markedly increased. These lncRNA roles involve plant vegetative and reproductive growth and responses to biotic and abiotic stresses. In this review, we examine the classification, mechanisms, and functions of lncRNAs and then emphasize the roles played by these lncRNAs in the communication between plants and the environment mainly with respect to the following environmental factors: temperature, light, water, salt stress, and nutrient deficiencies. We also discuss the consensus among researchers and the remaining challenges and underscore the exciting ways lncRNAs may affect the biology of plants.

Leveraging prior biological knowledge improves prediction of tocochromanols in maize grain

Abstract

With an essential role in human health, tocochromanols are mostly obtained by consuming seed oils; however, the vitamin E content of the most abundant tocochromanols in maize (Zea mays L.) grain is low. Several large-effect genes with cis-acting variants affecting messenger RNA (mRNA) expression are mostly responsible for tocochromanol variation in maize grain, with other relevant associated quantitative trait loci (QTL) yet to be fully resolved. Leveraging existing genomic and transcriptomic information for maize inbreds could improve prediction when selecting for higher vitamin E content. Here, we first evaluated a multikernel genomic best linear unbiased prediction (MK-GBLUP) approach for modeling known QTL in the prediction of nine tocochromanol grain phenotypes (12–21 QTL per trait) within and between two panels of 1,462 and 242 maize inbred lines. On average, MK-GBLUP models improved predictive abilities by 7.0–13.6% when compared with GBLUP. In a second approach with a subset of 545 lines from the larger panel, the highest average improvement in predictive ability relative to GBLUP was achieved with a multi-trait GBLUP model (15.4%) that had a tocochromanol phenotype and transcript abundances in developing grain for a few large-effect candidate causal genes (1–3 genes per trait) as multiple response variables. Taken together, our study illustrates the enhancement of prediction models when informed by existing biological knowledge pertaining to QTL and candidate causal genes.

Long noncoding RNA transcriptome analysis reveals novel lncRNAs in Morus alba ‘Yu‐711’ response to drought stress

Abstract

Drought stress has been a key environmental factor affecting plant growth and development. The plant genome is capable of producing long noncoding RNAs (lncRNAs). To better understand white mulberry (Morus alba L.) drought response mechanism, we conducted a comparative transcriptome study comparing two treatments: drought-stressed (EG) and well-watered (CK) plants. A total of 674 differentially expressed lncRNAs (DElncRNAs) were identified. In addition, 782 differentially expressed messenger RNAs (DEmRNAs) were identified. We conducted Gene Ontology (GO) and KEGG enrichment analyses focusing on the differential lncRNAs cis-target genes. The target genes of the DElncRNAs were most significantly involved in the biosynthesis of secondary metabolites. Gene regulatory networks of the target genes involving DElncRNAs–mRNAs–DEmRNAs and DElncRNA–miRNA–DEmRNA were constructed. In the DElncRNAs–DEmRNAs network, 30 DEmRNAs involved in the biosynthesis of secondary metabolites are collocated with 46 DElncRNAs. The interaction between DElncRNAs and candidate genes was identified using LncTar. In summary, quantitative real-time polymerase chain reaction (qRT-PCR) validated nine candidate genes and seven target lncRNAs including those identified by LncTar. We predicted that the DElncRNAs–DEmRNAs might recruit microRNAs (miRNAs) to interact with gene regulatory networks under the drought stress response in mulberry. The findings will contribute to our understanding of the regulatory functions of lncRNAs under drought stress and will shed new light on the mulberry–drought stress interactions.

Genome‐wide identification of R2R3‐MYB family genes and gene response to stress in ginger

Abstract

Ginger (Zingiber officinale Roscoe) is an important plant used worldwide for medicine and food. The R2R3-MYB transcription factor (TF) family has essential roles in plant growth, development, and stresses resistance, and the number of genes in the family varies greatly among different types of plants. However, genome-wide discovery of ZoMYBs and gene responses to stresses have not been reported in ginger. Therefore, genome-wide analysis of R2R3-MYB genes in ginger was conducted in this study. Protein phylogenetic relations and conserved motifs and chromosome localization and duplication, structure, and cis-regulatory elements were analyzed. In addition, the expression patterns of selected genes were analyzed under two different stresses. A total of 299 candidate ZoMYB genes were discovered in ginger. Based on groupings of R2R3-MYB genes in the model plant Arabidopsis thaliana (L.) Heynh., ZoMYBs were divided into eight groups. Genes were distributed across 22 chromosomes at uneven densities. In gene duplication analysis, 120 segmental duplications were identified in the ginger genome. Gene expression patterns of 10 ZoMYBs in leaves of ginger under abscisic acid (ABA) and low-temperature stress treatments were different. The results will help to determine the exact roles of ZoMYBs in anti-stress responses in ginger.

Genetic factors underlying anaerobic germination in rice: Genome‐wide association study and transcriptomic analysis

Abstract

The success of rice (Oryza sativa L.) germination and survival under submerged conditions is mainly determined by the rapid growth of the coleoptile to reach the water surface. Previous reports have shown the presence of genetic variability within rice accessions in the levels of flooding tolerance during germination or anaerobic germination (AG). Although many studies have focused on the physiological mechanisms of oxygen stress, few studies have explored the breadth of natural variation in AG tolerance-related traits in rice. In this study, we evaluated the coleoptile lengths of a geographically diverse rice panel of 241 accessions, including global accessions along with elite breeding lines and released cultivars from the United States, under the normal and flooded conditions in laboratory and greenhouse environments. A genome-wide association study (GWAS) was performed using a 7K single-nucleotide polymorphism (SNP) array and the phenotypic data of normal coleoptile length, flooded coleoptile length, flooding tolerance index, and survival at 14 d after seeding (DAS). Out of the 30 significant GWAS quantitative trait loci (QTL) regions identified, 14 colocalized with previously identified candidate genes of AG tolerance, whereas 16 were potentially novel. Two rice accessions showing contrasting phenotypic responses to AG stress were selected for the transcriptomics study. The combined approach of GWAS and transcriptomics analysis identified 77 potential candidate genes related to AG tolerance. The findings of our study may assist rice improvement programs in developing rice cultivars with robust tolerance under flooding stress during germination and the early seedling stage.

ASRpro: A machine‐learning computational model for identifying proteins associated with multiple abiotic stress in plants

Abstract

One of the thrust areas of research in plant breeding is to develop crop cultivars with enhanced tolerance to abiotic stresses. Thus, identifying abiotic stress-responsive genes (SRGs) and proteins is important for plant breeding research. However, identifying such genes via established genetic approaches is laborious and resource intensive. Although transcriptome profiling has remained a reliable method of SRG identification, it is species specific. Additionally, identifying multistress responsive genes using gene expression studies is cumbersome. Thus, endorsing the need to develop a computational method for identifying the genes associated with different abiotic stresses. In this work, we aimed to develop a computational model for identifying genes responsive to six abiotic stresses: cold, drought, heat, light, oxidative, and salt. The predictions were performed using support vector machine (SVM), random forest, adaptive boosting (ADB), and extreme gradient boosting (XGB), where the autocross covariance (ACC) and K-mer compositional features were used as input. With ACC, K-mer, and ACC + K-mer compositional features, the overall accuracy of ∼60–77, ∼75–86, and ∼61–78% were respectively obtained using the SVM algorithm with fivefold cross-validation. The SVM also achieved higher accuracy than the other three algorithms. The proposed model was also assessed with an independent dataset and obtained an accuracy consistent with cross-validation. The proposed model is the first of its kind and is expected to serve the requirement of experimental biologists; however, the prediction accuracy was modest. Given its importance for the research community, the online prediction application, ASRpro, is made freely available (https://iasri-sg.icar.gov.in/asrpro/) for predicting abiotic SRGs and proteins.

Changes in epigenetic features in legumes under abiotic stresses

Abstract

Legume crops are rich in nutritional value for human and livestock consumption. With global climate change, developing stress-resilient crops is crucial for ensuring global food security. Because of their nitrogen-fixing ability, legumes are also important for sustainable agriculture. Various abiotic stresses, such as salt, drought, and elevated temperatures, are known to adversely affect legume production. The responses of plants to abiotic stresses involve complicated cellular processes including stress hormone signaling, metabolic adjustments, and transcriptional regulations. Epigenetic mechanisms play a key role in regulating gene expressions at both transcriptional and posttranscriptional levels. Increasing evidence suggests the importance of epigenetic regulations of abiotic stress responses in legumes, and recent investigations have extended the scope to the epigenomic level using next-generation sequencing technologies. In this review, the current knowledge on the involvement of epigenetic features, including DNA methylation, histone modification, and noncoding RNAs, in abiotic stress responses in legumes is summarized and discussed. Since most of the available information focuses on a single aspect of these epigenetic features, integrative analyses involving omics data in multiple layers are needed for a better understanding of the dynamic chromatin statuses and their roles in transcriptional regulation. The inheritability of epigenetic modifications should also be assessed in future studies for their applications in improving stress tolerance in legumes through the stable epigenetic optimization of gene expressions.