Skip to main content
Advertisement

Main menu

  • Home
  • Articles
    • Newest Articles
    • Current Issue
    • Methods & Resources
    • Author Interviews
    • Archive
    • Subjects
  • Collections
  • Submit
    • Submit a Manuscript
    • Author Guidelines
    • License, Copyright, Fee
    • FAQ
    • Why submit
  • About
    • About Us
    • Editors & Staff
    • Board Members
    • Licensing and Reuse
    • Reviewer Guidelines
    • Privacy Policy
    • Advertise
    • Contact Us
    • LSA LLC
  • Alerts
  • Other Publications
    • EMBO Press
    • The EMBO Journal
    • EMBO reports
    • EMBO Molecular Medicine
    • Molecular Systems Biology
    • Rockefeller University Press
    • Journal of Cell Biology
    • Journal of Experimental Medicine
    • Journal of General Physiology
    • Journal of Human Immunity
    • Cold Spring Harbor Laboratory Press
    • Genes & Development
    • Genome Research

User menu

  • My alerts

Search

  • Advanced search
Life Science Alliance
  • Other Publications
    • EMBO Press
    • The EMBO Journal
    • EMBO reports
    • EMBO Molecular Medicine
    • Molecular Systems Biology
    • Rockefeller University Press
    • Journal of Cell Biology
    • Journal of Experimental Medicine
    • Journal of General Physiology
    • Journal of Human Immunity
    • Cold Spring Harbor Laboratory Press
    • Genes & Development
    • Genome Research
  • My alerts
Life Science Alliance

Advanced Search

  • Home
  • Articles
    • Newest Articles
    • Current Issue
    • Methods & Resources
    • Author Interviews
    • Archive
    • Subjects
  • Collections
  • Submit
    • Submit a Manuscript
    • Author Guidelines
    • License, Copyright, Fee
    • FAQ
    • Why submit
  • About
    • About Us
    • Editors & Staff
    • Board Members
    • Licensing and Reuse
    • Reviewer Guidelines
    • Privacy Policy
    • Advertise
    • Contact Us
    • LSA LLC
  • Alerts
  • Follow LSA on Bluesky
  • Follow lsa Template on Twitter
Research Article
Transparent Process
Open Access

Metabolome-based prediction of yield heterosis contributes to the breeding of elite rice

View ORCID ProfileZhiwu Dan, Yunping Chen, Weibo Zhao, Qiong Wang, View ORCID ProfileWenchao Huang  Correspondence email
Zhiwu Dan
State Key Laboratory of Hybrid Rice, Key Laboratory for Research and Utilization of Heterosis in Indica Rice, The Yangtze River Valley Hybrid Rice Collaboration & Innovation Center, College of Life Sciences, Wuhan University, Wuhan, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Zhiwu Dan
Yunping Chen
State Key Laboratory of Hybrid Rice, Key Laboratory for Research and Utilization of Heterosis in Indica Rice, The Yangtze River Valley Hybrid Rice Collaboration & Innovation Center, College of Life Sciences, Wuhan University, Wuhan, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Weibo Zhao
State Key Laboratory of Hybrid Rice, Key Laboratory for Research and Utilization of Heterosis in Indica Rice, The Yangtze River Valley Hybrid Rice Collaboration & Innovation Center, College of Life Sciences, Wuhan University, Wuhan, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Qiong Wang
State Key Laboratory of Hybrid Rice, Key Laboratory for Research and Utilization of Heterosis in Indica Rice, The Yangtze River Valley Hybrid Rice Collaboration & Innovation Center, College of Life Sciences, Wuhan University, Wuhan, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Wenchao Huang
State Key Laboratory of Hybrid Rice, Key Laboratory for Research and Utilization of Heterosis in Indica Rice, The Yangtze River Valley Hybrid Rice Collaboration & Innovation Center, College of Life Sciences, Wuhan University, Wuhan, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Wenchao Huang
  • For correspondence: wenchaoh@whu.edu.cn
Published 13 December 2019. DOI: 10.26508/lsa.201900551
  • Article
  • Figures & Data
  • Info
  • Metrics
  • Reviewer Comments
  • PDF
Loading

Abstract

Improvement of the breeding efficiencies of heterotic crops adaptive to different conditions can mitigate the food shortage crisis due to overpopulation and climate change. To date, diverse molecular markers have been used to guide field phenotypic selection, whereas accurate predictions of complex heterotic traits are rarely reported. Here, we present a practical metabolome-based strategy for predicting yield heterosis in rice. The dissection of population structure based on untargeted metabolite profiles as the initial critical step in multivariate modeling performed better than the screening of predictive variables. Then the assessment of each predictive variable’s contribution to predictive models according to all latent factors was more precise than the conventional first one. Metabolites belonging to specific pathways were closely associated with yield heterosis, and the up-regulation of galactose metabolism promoted robust yield heterosis in hybrids under different growth conditions. Our study demonstrates that metabolome-based predictive models with correctly dissected population structure and screened predictive variables can facilitate accurate predictions of yield heterosis and have great potential for establishing molecular marker–based precision breeding programs.

Introduction

Hybrid rice has a higher yield than conventional rice and has contributed greatly to boosting global food production over the past four decades (Cheng et al, 2007). However, to feed 9.1 billion people by 2050 (FAO, 2009), the rate at which the annual yield of staple crops increases must be more than double (Hickey et al, 2017). Worse yet, severe global yield losses of rice, maize, and soybean are plausible under dramatic climate change (Zhao et al, 2016, 2017). Therefore, the breeding of new heterotic varieties adaptive to different growth conditions is a promising option for ensuring global food security.

Elite hybrid rice, such as the wild abortive and Honglian series (Luo et al, 2013; Huang et al, 2015), are perfect combinations of sterile and restorer lines for generating yield heterosis through successful breeding (Chen et al, 2014). The yield potential of a new combination is unknown before field hybridization and phenotyping, which are labor-intensive and time-consuming tasks. To improve breeding efficiency, biomarkers, including DNA, RNA, proteins, and metabolites, that can reflect parental genetic differences have been adopted for crop improvement. The results of marker-based performance prediction in maize, rice, wheat, sorghum, and potato demonstrate that various factors, including genetic relatedness (Zhao et al, 2015; Yu et al, 2016), environmental differences (Li et al, 2018; Sprenger et al, 2018), predictive variable number (Technow et al, 2014; de Abreu et al, 2017; Sprenger et al, 2018), feature selection method (Xu et al, 2014; de Abreu et al, 2017), and population structure and size (Riedelsheimer et al, 2012a; Xu et al, 2014; Zhao et al, 2015), can affect the power of predictive models. In addition, the absence of high-quality reference genomes and the genome complexity of polyploids also hinder the establishment of highly efficient molecular breeding programs (Rasheed et al, 2017).

The metabolome is a bridge between the genome and phenome, and metabolite levels have close and comprehensive connections with polygenic complex traits (Schauer et al, 2006; Riedelsheimer et al, 2012a, 2012b). Metabolic prediction of agronomic traits has been preliminarily explored in maize and rice (Dan et al, 2016; Xu et al, 2016; de Abreu et al, 2017), and metabolic analytes detected in young rice seedlings can predict grain weight in hybrids from different populations (Dan et al, 2019). However, the accurate prediction of yield heterosis remains a challenge because the mechanisms of yield heterosis are highly complex and the corresponding component traits (grain weight, grain number per panicle, tiller number, and seed setting rate) are polygenic. Moreover, the performance of complex quantitative traits such as yield and disease resistance is determined by the genotype, environment, and their interaction (Thomas, 2010; Scheres et al, 2017; Li et al, 2018). Here, we provide a practical metabolome-based strategy for predicting yield heterosis in rice that has wide applicability and can contribute to the breeding of climate-resilient hybrid crops, especially those of orphan varieties without reference genomes or polyploidy.

Results

Determination of predictive variables for yield heterosis

To build predictive models for rice hybrids under different growth conditions, we measured the grain yield of two F1 hybrid populations (Pop2012 and Pop2015) and calculated better-parent heterosis for yield per plant (BPH-YPP) for subsequent analysis (Table S1). The YPPs of parents from the two populations were significantly different (Fig 1A and Table S2), and hybrid YPP and BPH-YPP also differed significantly between populations (Fig 1B and Table S2). Then, we performed untargeted liquid chromatography–mass spectrometry (LC-MS) analysis of 15-d-old seedlings of the parents of both populations, and a total of 3,746 metabolic analytes were detected in the parents (Table S3). A dendrogram of parents from Pop2012 based on parental metabolite profiles showed a clustering trend similar to that based on indica–japonica–specific insertion and deletion DNA markers (Figs 1C and S1) (Dan et al, 2016). Moreover, principal component analysis (PCA) (Jolliffe, 1986) of the metabolome of the parents from both populations revealed little overlaps between populations (Fig 1D).

Table S1 YPP of parents and hybrids.

Table S2 Independent-samples t test to YPP of parents, YPP, and BPH-YPP of hybrids across populations.

Table S3 Metabolite files of parents and hybrids.

Figure 1.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 1. Determination of predictive variables for BPH-YPP.

(A) YPP of parents from Pop2012 and Pop2015. ***P = 2.7 × 10−4, two-tailed, independent-samples t test. (B) YPP and BPH-YPP of hybrids from Pop2012 and Pop2015. For YPP, ***P = 2.1 × 10−19, two-tailed, independent-samples t test. For BPH-YPP, ***P = 5.2 × 10−7, two-tailed, independent-samples t test. (C) Dendrogram of the 18 cultivars from Pop2012 based on the 3,746 detected metabolic analytes. (D) Three-dimensional distribution of parents from Pop2012 and Pop2015 based on PCA of the 3,746 detected analytes. (E, F) Distributions of three pairs of reciprocal hybrids and corresponding parents based on PCA of 3,746 metabolic analytes. The ratios of (E), means of, and differences in (F) parental metabolite levels were calculated and compared to hybrid metabolite profiles. Slashes indicate crosses between two parents, and female and male parents are on the left and right of the slash, respectively. Shadows with color are 95% confidence regions. (G) Changes in the number of latent factors and adjusted R-square values. PLS regression analysis was performed between the BPH-YPP of hybrids from Pop2012 and the means of/differences in parental metabolite profiles. No more than 17 latent factors were extracted in both analyses. (H) Three-dimensional distribution of hybrids from Pop2012 and Pop2015 based on PCA of the 3,746 predictive variables.

Figure S1.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure S1. Dendrogram of the 18 cultivars from Pop2012 based on 34 pairs of indica–japonica–specific InDel DNA markers.

Insertion and deletion DNA markers were used to assess genetic differences among the 18 parents from Pop2012, and the polymorphism of 34 pairs of markers was used to construct the dendrogram.

To determine predictive variables for yield heterosis, we then performed metabolite profiling of seedlings of three pairs of reciprocal F1 hybrids and investigated the relationship between hybrid metabolite profiles and corresponding transformed parental metabolite levels (Table S3). The means of, differences in, and ratios of parental metabolite levels were calculated for each hybrid, and the ratios of parental metabolite levels were highly distinct from the hybrid metabolite profiles according to PCA score plots (Fig 1E and Table S4). In contrast, the means of and differences in parental metabolite levels showed closer connections, and the parental means displayed the distributions most similar to the hybrid metabolite profiles (Fig 1F and Table S4), suggesting that mean parental metabolite levels are appropriate for representing hybrid metabolite profiles.

Table S4 Correlations between hybrid metabolite levels and corresponding transformed parental metabolite levels.

Subsequently, the means of and differences in all detected analytes were calculated between every pair of parents for the two populations and assigned as predictive variables to corresponding hybrids. Partial least squares (PLS) regression (Wold, 1982), which can model quantitative and multivariate complex relationships, was applied to the predictive variables and BPH-YPP for hybrids from Pop2012. No more than 17 latent factors were extracted, and the adjusted R-square values of the means used as predictive variables were much larger than those of the differences (Fig 1G). Furthermore, PCA of the mean metabolite levels of the parents of hybrids demonstrated distant genetic relationships between populations (Fig 1H), suggesting that the materials were suitable for reliable cross-validation (Wray et al, 2013). In summary, these results show that the means of parental metabolite levels are suitable predictive variables for yield heterosis and the two populations used in this study are suitable for exploring predictive models for genetically distant hybrids under different growth conditions.

Prediction of BPH for grain yield across environments

To predict BPH-YPP for Pop2015 based on Pop2012, parameters of predictive analytes and the constant in Fig 1G (three latent factors with an adjusted R-square value of 0.3647) were used to construct an equation for predicting yield heterosis. The predictability of BPH-YPP was 0.61 for Pop2012 (Fig 2A; P = 5.7 × 10−31) but only 0.24 for Pop2015 (Fig 2B; P = 0.01). Then, quantiles of BPH-YPP for Pop2012 were calculated to divide the hybrids into high- and low-BPH-YPP subgroups to increase predictability by grouping appropriate predictive variables (Fig 2C). The metabolome-based PLS-discriminant analysis (PLS-DA) (Barker et al, 2003) grouped hybrids with high- and low BPH-YPP separately (Fig S2). Next, the top 1,000 predictive variables were chosen for modeling based on the variable importance in projection (VIP) of component 1 (VIP > 1.0198; Table S5). Unexpectedly, although the predictability of BPH-YPP for Pop2012 was as high as 0.61 (Fig 2D), the predictability of BPH-YPP for Pop2015 was only 0.17 (Fig 2E; P = 0.09). We obtained even lower predictabilities of BPH-YPP for Pop2015 by changing the number of predictive variables (Fig 2F and G; VIP > 1.4631, 500 predictive analytes) and the numbers of high- and low-BPH-YPP hybrids from Pop2012 for screening predictive variables (Figs S3–S5).

Table S5 VIP values of components in PLS-DA of 72 high- and 72 low-BPH-YPP hybrids.

Figure 2.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 2. Prediction of BPH-YPP with predictive variables selected from PLS-DA.

(A) Prediction of BPH-YPP for Pop2012 with 3,746 predictive variables (PVs). (B) Prediction of BPH-YPP for Pop2015 based on the predictive model constructed in Fig 2A. (C) Box plot of BPH-YPP for Pop2012. Hybrids were divided into low- and high-BPH-YPP subgroups according to the 25th and 75th percentiles. (D) Prediction of BPH-YPP for Pop2012 with the top 1,000 predictive variables based on PLS-DA of 72 high- and 72 low-BPH-YPP hybrids. (E) Prediction of BPH-YPP for Pop2015 based on the predictive model constructed in Fig 2D. (F) Prediction of BPH-YPP for Pop2012 with the top 500 predictive variables based on PLS-DA of 72 high- and 72 low-BPH-YPP hybrids. (G) Prediction of BPH-YPP for Pop2015 based on the predictive model constructed in Fig 2F. The dotted lines are the fit lines.

Figure S2.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure S2. PCA and PLS-DA of 72 high- and 72 low-BPH-YPP hybrids from Pop2012.

(A) PCA and PLS-DA score plots of 72 high- and 72 low-BPH-YPP hybrids based on the 3,746 predictive analytes. (B) Cross-validation of PLS-DA. (C) Permutation testing of PLS-DA. BPH-YPP, better-parent heterosis for yield per plant; PC, principal component; PCA, principal component analysis; PLS-DA, partial least squares-discriminant analysis.

Figure S3.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure S3. PCA and PLS-DA of 100 high- and 100 low-BPH-YPP hybrids from Pop2012.

(A) PCA and PLS-DA score plots of 100 high- and 100 low-BPH-YPP hybrids based on the 3,746 predictive analytes. (B) Cross-validation of PLS-DA. (C) Permutation testing of PLS-DA. BPH-YPP, better-parent heterosis for yield per plant; PC, principal component; PCA, principal component analysis; PLS-DA, partial least squares-discriminant analysis.

Figure S4.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure S4. PCA and PLS-DA of 50 high- and 50 low-BPH-YPP hybrids from Pop2012.

(A) PCA and PLS-DA score plots of 50 high- and 50 low-BPH-YPP hybrids based on the 3,746 predictive analytes. (B) Cross-validation of PLS-DA. (C) Permutation testing of PLS-DA. BPH-YPP, better-parent heterosis for yield per plant; PC, principal component; PCA, principal component analysis; PLS-DA, partial least squares-discriminant analysis.

Figure S5.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure S5. Correlations between the observed and predicted BPH-YPP based on varied numbers of high- and low-BPH-YPP hybrids.

(A) Prediction of BPH-YPP for Pop2012 with the top 1,000 predictive variables based on PLS-DA of 100 high- and 100 low-BPH-YPP hybrids. (B) Prediction of BPH-YPP for Pop2015 based on the predictive model constructed in Fig S5A. (C) Prediction of BPH-YPP for Pop2012 with the top 1,000 predictive variables based on PLS-DA of 50 high- and 50 low-BPH-YPP hybrids. (D) Prediction of BPH-YPP for Pop2015 based on the predictive model constructed in Fig S5C. The dotted lines are the fit lines. BPH-YPP, better-parent heterosis for yield per plant; PV, predictive variable.

Next, PCA was performed on both populations based on the top 1,000 predictive variables selected as shown in Fig 2D (Fig S6), and the principal component 1 scores were ordered to identify core hybrids (Tables S6 and S7). Finally, one of every three hybrids from Pop2012 (herein named the 1/3N set) and one of every two hybrids from Pop2015 (1/2N set) were combined as the training set (1/3N + 1/2N set) to predict the remaining hybrids. However, the predictability for Pop2015 was only 0.16 (Fig 3A; P = 0.26). Clues from the slight predictability changes in Pop2012, as shown in Fig 2D (r = 0.61) and Fig 3A (r = 0.51), indicated that the low predictability of BPH-YPP for Pop2015 might have been caused by improper selection of core hybrids.

Table S6 Scores of PCA of Pop2012 with the top 1,000 predictive variables.

Table S7 Scores of PCA of Pop2015 with the top 1,000 predictive variables.

Figure S6.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure S6. Distributions of core and noncore hybrids from both populations based on PCA of different numbers of predictive variables.

(A) Distribution of core and noncore hybrids from Pop2012 based on PCA of the top 1,000 predictive variables used in Fig 2D. (B) Distribution of core and noncore hybrids from Pop2012 based on PCA of 3,746 predictive variables. (C) Distribution of core and noncore hybrids from Pop2015 based on PCA of the top 1,000 predictive variables used in Fig 2D. (D) Distribution of core and noncore hybrids from Pop2015 based on PCA of 3,746 predictive variables. PC is short for principal component.

Figure 3.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 3. Dissection of population structure increases predictabilities.

(A) Prediction of BPH-YPP for Pop2012 and Pop2015 with the top 1,000 predictive variables used in Fig 2D and population structure determined by the top 1,000 predictive variables. (B) Prediction of BPH-YPP for Pop2012 and Pop2015 with the top 1,000 predictive variables used in Fig 2D and population structure determined by 3,746 predictive variables. (C) Prediction of BPH-YPP for Pop2012 and Pop2015 with 3,746 predictive variables and population structure determined by 3,746 predictive variables. The dotted lines are the fit lines.

The proportions of variance explained by the first two principal components were smaller than those shown in Fig S6A and C (Fig S6B and D and Tables S8 and S9), indicating that the PCA of all 3,746 predictive variables rather than the top 1,000 predictive variables helped dissect population structure. The predictability of BPH-YPP for Pop2015 increased to 0.34 when the top 1,000 predictive variables were used (Fig 3B; P = 0.01). Next, the performance of “noncore” hybrids from both populations was predicted with all 3,746 predictive variables. A correlation analysis between the observed and predicted BPH-YPPs indicated that the predictability of BPH-YPP for Pop2015 was further improved (Fig 3C; r = 0.44, P = 0.001). Therefore, for predicting component traits (e.g., grain weight) (Dan et al, 2019), the dissection of population structures as the initial step in building predictive models is better than the screening of predictive variables.

Table S8 Scores of PCA of Pop2012 with the 3,746 predictive variables.

Table S9 Scores of PCA of Pop2015 with the top 3,746 predictive variables.

Optimization and validation of the predictive model

To further increase the predictability of BPH-YPP, we filtered low-contribution or unrelated predictive variables among the 3,746 analytes based on VIP values of the first latent factor (Table S10). As shown in Fig S7A, the predictabilities of BPH-YPP for Pop2015 varied with changes in the number of predictive variables. Then, the mean VIP values of the first three latent factors were calculated (Table S10), and the predictabilities did not increase (Fig S7B). Unexpectedly, the predictabilities of BPH-YPP for Pop2015 increased when all seven latent factors were used (Fig 4A), peaking at 0.58 with 1,400 predictive variables (P = 5.0 × 10−6; Fig 4B and Table S11). Thus, hybrids from both populations were predicted with predictabilities close to 0.6, although the ranges of yield heterosis differed greatly between the two populations. Moreover, no improvement in predictability was observed after another 400 predictive variables were removed from the 1,400 correlated predictive analytes (Fig S8A and B; r = 0.55, P = 1.9 × 10−5).

Table S10 VIP values of latent factors in PLS of core hybrids with 3,746 predictive variables.

Table S11 PLS of core hybrids with 1,400 predictive variables.

Figure S7.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure S7. Changes of predictabilities with varied numbers of predictive variables when different latent factors were used for feature selection.

(A) Predictabilities of BPH-YPP with different numbers of predictive variables selected according to VIP values of the first latent factor. (B) Predictabilities of BPH-YPP with different numbers of predictive variables selected according to mean VIP values of the first three latent factors.

Figure 4.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 4. Feature selection for BPH-YPP.

(A) Predictabilities of BPH-YPP with different numbers of predictive variables selected according to mean VIP values of all seven latent factors. (B) Prediction of BPH-YPP for Pop2012 and Pop2015 with the top 1,400 predictive variables selected according to the mean VIP values of the seven latent factors. (C) Differential analytes between 72 high- and 72 low-BPH-YPP hybrids from Pop2012. Independent-samples t test, equal group variance assumed, adjusted P-value cutoff: 0.05. (D) Comparison of the 1,400 predictive variables used in Fig 4B (PLS) and differential analytes identified in Fig 4C (t tests). (E) Differential metabolic pathways between high- and low-BPH-YPP hybrids from Pop2012. (F) Differential metabolic pathways between high- and low-BPH-YPP hybrids from Pop2015. (G) Enriched metabolic pathways between high- and low-BPH-YPP hybrids from Pop2012 based on dysregulated metabolic pathway analysis with MetDNA. (H) Comparison of the average levels of metabolites involved in galactose metabolism between high- and low-BPH-YPP hybrids from Pop2012. ***P = 2.7 × 10−5, two-tailed, independent-samples t test.

Figure S8.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure S8. Optimization of the predictive model by removing redundant predictive variables.

(A) Pearson correlations between the 1,400 predictive variables shown in Fig 4B. (B) Prediction of BPH-YPP for Pop2012 and Pop2015 with the top 1,000 predictive variables selected according to the mean VIP values of the nine latent factors. BPH-YPP, better-parent heterosis for yield per plant; PV, predictive variable.

We next performed t tests for high- and low-BPH-YPP hybrids from Pop2012 according to the results shown in Fig 2C. A total of 1,311 analytes were significantly different between the two subgroups (Fig 4C and Table S12), and most of them overlapped with the 1,400 predictive variables (Fig 4D). The results from MetaboAnalyst (Xia et al, 2011) indicated that metabolic pathways, including that for galactose metabolism, were enriched for both populations (Fig 4E and F, Tables S13 and S14, Figs S9, and S10), which was consistent with findings in previous reports (Schauer et al, 2006; Obata et al, 2015; Wen et al, 2015). In addition, galactose metabolism was detected when dysregulated metabolic pathway analysis was performed on high- and low-BPH-YPP hybrids from Pop2012 with MetDNA (Shen et al, 2019) (Fig 4G and Table S15). The average levels of metabolites involved in galactose metabolism in high-BPH-YPP hybrids were higher than those in low-BPH-YPP hybrids (Fig 4H, Tables S16, and S17), and significant differences in metabolites involved in galactose metabolism were detected between the two subgroups (Table S18). These results show that the 1,400 analytes identified in this study are predictive of BPH-YPP and that the up-regulation of galactose metabolism is closely associated with high degrees of yield heterosis.

Table S12 t tests on 72 high- and 72 low-BPH-YPP hybrids from Pop2012.

Table S13 Metabolite set enrichment analysis of differential analytes from Pop2012.

Table S14 Metabolite set enrichment analysis of differential analytes from Pop2015.

Table S15 Dysregulated metabolic pathway analysis of high- and low-BPH-YPP hybrids from Pop2012.

Table S16 Metabolic levels of dysregulated metabolic pathways between high- and low-BPH-YPP hybrids from Pop2012.

Table S17 Independent-samples t test to six enriched metabolic pathways for high- and low-BPH-YPP hybrids from Pop2012.

Table S18 Metabolic levels of annotated metabolites through MetDNA.

Figure S9.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure S9. Metabolite set enrichment analysis of differential analytes from Pop2012.

(A) Enriched metabolic pathways analyzed with differential analytes from Pop2012. A total of 1,311 analytes were different between the 72 high- and 72 low-BPH-YPP hybrids from Pop2012 and they were used for enrichment analysis with the module “MS peaks to pathways” on MetaboAnalyst. The metabolites belonging to 11 enriched metabolic pathways are marked with red (positive) and blue (negative) according to their normalized enrichment scores. (B) The average levels of metabolites involved in cysteine and methionine metabolism between high- and low-BPH-YPP hybrids. The average levels of metabolites involved in cysteine and methionine metabolism in each hybrid were based on the results of dysregulated metabolic pathways provided in Table S16. ***P = 7.5 × 10−4. (C, D, E, F) The average levels of metabolites involved in the citrate cycle (C; ***P = 1.8 × 10−8), carbon fixation in photosynthetic organisms (D; ***P = 1.9 × 10−4), nicotinate and nicotinamide metabolism (E; ***P = 0.003), and glyoxylate and dicarboxylate metabolism (F; ***P = 1.6 × 10−3) between high- and low-BPH-YPP hybrids. P-values in Fig S9B–F are from two-tailed, independent-samples t tests. BPH-YPP, better-parent heterosis for yield per plant.

Figure S10.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure S10. Metabolite set enrichment analysis of differential metabolites from Pop2015.

A total of 108 analytes were different between the 26 high- and 26 low-BPH-YPP hybrids from Pop2015 and they were used for enrichment analysis with the module “MS peaks to pathways” on MetaboAnalyst. The metabolites belonging to five enriched metabolic pathways are marked with red according to the normalized enrichment scores (positive values).

Next, we substituted the metabolome of Yuetai A (the female parent of Pop2015) with that of Yuetai B (the maintainer line) in the calculation of predictive variables for Pop2015 to increase predictability. However, the predictabilities of BPH-YPP for Pop2015 decreased after switching to the maintainer lines (Fig S11A). Moreover, regarding the effects of changes in training set population size on predictability (Windhausen et al, 2012; Riedelsheimer et al, 2013), half (1/2N) and a quarter (1/4N) of the hybrids from Pop2012 were combined with half (1/2N) of the hybrids from Pop2015 to check for predictability variations. As shown in Fig S11B, the predictabilities of BPH-YPP for Pop2015 decreased in both the 1/2N + 1/2N and 1/4N + 1/2N sets. Thus, the 1/3N + 1/2N training set was the best combination of population sizes for the prediction of yield heterosis in this study.

Figure S11.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure S11. Influences from the maintainer line and population sizes on predictabilities.

(A) Predictabilities of BPH-YPP with metabolite profiles of the maintainer line (Yuetai B) as the female parent for calculating average values of parental metabolite levels. (B) Predictabilities of BPH-YPP with 1/2 N + 1/2 N and 1/4 N + 1/2 N (indicated with dotted lines) set as the training set, respectively.

To further check the stability of the predictive model across conditions, 41 hybrids that were mainly reciprocal and had significant differences between reciprocals in 2012 were phenotyped again in 2015 (Table S19). Because of environmental changes, the BPH-YPPs of the 41 hybrids exhibited significant differences between the 2 yr (Fig 5A and Table S19). Based on the predictive model established in Fig 4B and Table S11, parameters of the 1,400 predictive variables and the constant were used to predict BPH-YPP for the 41 hybrids. Surprisingly, the correlation coefficient between the observed and predicted values of BPH-YPP was as high as 0.62 (Fig 5B; P = 1.4 × 10−5), which further confirmed the accurate prediction of yield heterosis for hybrids under different growth conditions.

Table S19 Paired-samples t test to 18 reciprocal hybrids in 2012 and 41 hybrids in both years.

Figure 5.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 5. Validation of the predictive model.

(A) BPH-YPP of reciprocal hybrids phenotyped in 2012 and 2015. ***P = 4.5 × 10−4, two-tailed, paired-samples t test. (B) Prediction of BPH-YPP for 41 hybrids planted in 2015 according to the predictive model built in Fig 4B. The dotted line is the fit line.

Precision Breeding 1.0 for climate-resilient superior hybrid rice

Finally, we present a schematic diagram of Precision Breeding 1.0 for guiding the breeding of heterotic and climate-resilient rice based on metabolomics. At least two hybrid populations are used to build predictive models. The first population (Pop1) is the main population, and a half-diallel cross design is recommended. The parental inbred lines should be sufficiently representative; widely planted conventional rice accessions and core accessions from the 3K Rice Genomes Project (Wang et al, 2018) are candidate parents. For parents of another hybrid population (Pop2), either homozygous (conventional, localized, and wild cultivars) or heterozygous (outcrossed and recombinant inbred lines) lines can be the sterile and restorer lines, which provides an advantage in using a wide range of germplasms independently of the degree of heterozygosity and availability of a reference genome. Parental seedlings of both populations were cultivated under uniform conditions. On approximately the 15th day, seedlings were harvested for untargeted LC-MS analysis (Fig S12A). Next, untargeted parental metabolite profiles were obtained, and the mean relative abundances of all detected analytes in every pair of parents were assigned as predictive variables to corresponding hybrids (Fig S12B). Population structure was then dissected by PCA of all predictive variables (Fig S12C). Core hybrids selected from both populations according to principal component 1 scores were combined as a training set, and phenotypic data of core hybrids were collected in field trials (Fig S12D). Subsequently, PLS regression between hybrid performance and corresponding predictive variables was applied to build a predictive model. After filtering low-contribution or unrelated variables according to the mean VIP values of all latent factors, the predictive model was established for both populations (Fig S12E). Then, the performance of “noncore” hybrids was predicted based on parameters of predictive variables (Fig S12F). Finally, hybridizations between parents with predicted good-performance hybrids were conducted, and the candidate hybrids were field validated to determine the elite hybrid combinations (Fig S12G).

Figure S12.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure S12. Schematic diagram of precision breeding 1.0.

(A, B, C, D, E, F, G) Two hybrid populations were selected as part of the untargeted metabolome-based prediction strategy. Seedlings of parents from both populations (Pop1 and Pop2) were grown under uniform conditions (A). After performing LC-MS analysis of ∼15-d-old seedlings, parental metabolite data were obtained (B), and the mean relative abundances of all detected analytes between every pair of parents were calculated as predictive variables for corresponding hybrids. PCA of hybrids from both populations with all predictive variables was performed to dissect population structure (C). Core hybrids selected from both populations (marked in blue) according to principal component 1 scores were used as a training set. The hybrids in the training set were field phenotyped under corresponding conditions. The PLS regression analysis was used to find connections between predictive variables and hybrid performance for the training set (D). The average VIP values of all latent factors were used to remove low-contribution or unrelated predictive analytes. Next, a predictive model was established according to the parameter of each variable and the constant (E). Finally, the phenotypes of candidate hybrids were predicted (F), and hybrids with predicted high degree of heterosis were field phenotyped to determine the elite combinations (G).

Discussion

The prediction of agronomic traits for single-cross hybrids based on parental differences can transform redundant conventional breeding programs into targeted hybridization tasks, which can significantly reduce the amount of breeding work and improve breeding efficiencies. Here, we found that the average values of parental metabolite levels showed close relationships with hybrid metabolite profiles (Fig 1F), and the means of parental metabolic analytes were predictive of yield heterosis in rice (Fig 1G). A total of 1,400 analytes, some of which are involved in galactose metabolism, were used in the predictive model of BPH-YPP (Figs 4B and E–G). Other metabolic pathways with up- or down-regulated metabolic levels were also found to be associated with yield heterosis in the analyses (Tables S13–17 and Fig S9), suggesting that an optimal balance between these metabolic pathways may contribute to the formation of yield heterosis (Dan et al, 2015). The newly comprehensive metabolite profiling analysis of hybrids from high- and low-BPH-YPP subgroups will provide more direct proof for the selection of predictive analytes.

For precision breeding of hybrid rice, further research is required to optimize the strategy proposed here. To ensure the stability of predictive models, a larger number of parents, representing collections of rice core germplasms, need to be included in the main population (Pop1). More comprehensive untargeted metabolite profiles (larger numbers of detected peaks) which are obtained in positive ionization mode or the combination of negative and positive ionization data may also influence predictabilities. Notably, the ranges of heterosis that are predicted from models are always much smaller than those observed experimentally (Figs 2B–4B, S5, and S8B); we speculate that this may be caused by unknown factors that can lead to phenotypic variances, such as dynamic environments or interactions between genotype and the environment (Riedelsheimer et al, 2012a; Yu et al, 2016; Li et al, 2018). The breeding of elite hybrid rice involves desirable values for multiple polygenic traits, including heading date, plant height, grain quality, disease, and stress resistance, not just yield. Moreover, determining the effects of training and validation set size on predictability requires more investigation (Windhausen et al, 2012; Riedelsheimer et al, 2013; Schulthess et al, 2017). Hence, robust versions of metabolome-based prediction for multiple polygenic traits remain urgently needed along with extensive and in-depth cooperation among global universities and breeding institutes.

Metabolome-based prediction of agronomic traits has benefited from rapid advancements in metabolite profiling technologies (Blazenovic et al, 2019; Shen et al, 2019). Crop breeders can obtain high-throughput metabolic data without considering the accessibility of reference genomes and ploidy in just a few days, independent of the season. Metabolic markers have demonstrated application potential for both qualitative and quantitative traits in maize (de Abreu et al, 2017; Westhues et al, 2017; Schrag et al, 2018), wheat (Zhao et al, 2015), potato (Steinfath et al, 2010; Sprenger et al, 2018), barley (Heuberger et al, 2014), and Miscanthus (Maddison et al, 2017). Therefore, the untargeted metabolome-based prediction strategy, which has wide accessibility, may be further used to breed climate-resilient major and orphan crops, perennial trees, and animals (Hickey et al, 2017) and even perform personalized/localized disease diagnosis for humans (Wray et al, 2013).

Materials and Methods

Plant materials

Details of the plant materials have been presented previously (Dan et al, 2019). Briefly, two hybrid populations were used to explore predictive models of heterosis for rice grain yield. For one population in 2012, 18 indica–intermediate type–japonica cultivars, some of which have been planted on more than six million hectares in China (including Liaoxing 1 and Wuyunjing 8), were used as the parents with a complete diallel cross design. For another population, recombinant inbred lines (F5) from single- or multiple-cross and backcross hybrids were test-crossed with a Honglian-type cytoplasmic male-sterile line (Yuetai A). Plants of the two populations were grown separately at different planting densities in 2012 and 2015. In addition, 41 reciprocal hybrids and corresponding parents that were chosen from Pop2012 were planted again under the same conditions as the hybrids in 2015. A randomized block design with three replicates was used for each genotype in each environment. Ten seedlings were planted for each replicate, and the middle five plants of each row were harvested to measure the YPP. Phenotypic data of parents and hybrids are provided in Table S1. The relative BPH-YPP was calculated with the following equation: BPH-YPP = (F1 − PH)/PH, where PH and F1 are the YPP of the high-value parent and hybrid, respectively.

Young rice seedlings used for LC-MS/MS analysis were prepared by a previously reported method (Dan et al, 2019). More than 30 seeds of three pairs of reciprocal hybrids, 18 cultivars, 107 recombinant inbred lines, and Yuetai A were submerged in water at 30°C for 48 h and then transferred to an incubator for germination at 30°C for 24 h. Ten seedlings at the same developmental stage were transplanted into plastic pots with a spacing of 2 × 2 cm. Two biological replicates were used for each genotype. Seedlings were grown at 30°C in a light incubator set to 70% humidity with a 16-h light/8-h dark photoperiod. On the 15th day, ∼10 seedlings without parts close to soil (∼0.5 cm) and roots were harvested and placed in liquid nitrogen after being washed with ddH2O three times. Two biological replicates of the three pairs of reciprocal hybrids and 18 parents were treated as two independent experimental samples, and two replicates of the 107 recombinant inbred lines, and Yuetai A were combined as a single experimental sample for each genotype. All tissues of each experimental sample were ground into homogenized powders with liquid nitrogen, and 80 mg of powder was transferred into 2-ml EP tubes. After 1 ml of precooled extraction liquid (methanol/acetonitrile/water, 2/2/1, vol/vol/vol) was added with the Agilent Bravo automated liquid handling system (Agilent Technologies), the samples were vortex mixed for 60 s. Ten microliters of all samples were pooled as a quality control sample. Two 30-min ultrasonic treatments were applied, and the tubes were kept at −20°C for 60 min. Then, the samples were centrifuged for 15 min at 14,000g at 4°C. The supernatants were dried in a vacuum concentrator, and 100 μl of acetonitrile (acetonitrile/water, 1/1, vol/vol) was added to each tube. After vortex mixing, the samples were centrifuged for 15 min at 14,000g at 4°C, and the supernatants were used for subsequent LC-MS analysis.

UHPLC-Q-TOF MS/MS for untargeted metabolomics analysis

Samples were separated on a Waters (Milford) reversed-phase T3 column (ACQUITY UPLC HSS T3, 2.1 × 100 mm column containing 1.8-μm particles) using the Agilent 1290 Infinity LC system (Agilent Technologies). After the injection of 2 μl of the sample, the T3 column was washed with solvent A (0.5 mM ammonium fluoride; Sigma-Aldrich) at a flow rate of 0.3 ml/min at 25°C. The LC gradient elution began with 1% solvent B (acetonitrile; Merck) for 1 min, linearly increased to 100% B in 7 min, was maintained for 2 min at 100% B, was reduced to 1% B in 0.1 min, and was maintained for another 1.9 min at 1% solvent B for re-equilibration. Experimental samples were analyzed in a random order at 4°C, and a quality control sample was injected every nine experimental samples to check the stability of the LC-MS system.

MS data of all samples were collected with a quadrupole time-of-flight mass spectrometer (Agilent 6550 iFunnel QTOF; Agilent Technologies) in negative electron spray ionization mode. The temperature and flow rate of the drying gas were 250°C and 16 L/min, respectively. The temperature and flow rate of the sheath gas were 400°C and 12 L/min, respectively. The pressure of the nebulizer gas was 20 pounds per square inch (psi). The capillary voltage, nozzle voltage, and fragment voltage were 3,000, 0, and 175 V, respectively. The TOF mass spectra data were collected from 60 to 1,200 kD. The acquisition rate was 4 spectra/s and the cycle time was 250 ms.

Subsequently, another quadrupole time-of-flight mass spectrometer (TripleTOF 6600, AB Sciex) was used to acquire tandem mass spectrometry (MS/MS) data of the quality control sample. The parameters of ion source gas1, ion source gas2, and the curtain gas were 40, 80, and 30 psi, respectively. The source temperature was 650°C, and the floating ion spray voltage was −4,500 V. The mass range for the TOF MS scan was 60–1,200 kD, and the quality control sample was injected twice to increase the data acquisition rate with the mass range divided into four sequential windows: 60–200, 190–400, 390–600, and 590–1,200 kD and 200–600, 590–750, 740–900, and 890–1,200 kD. The MS accumulation time was 200 ms/spectra and the tandem mass spectra were collected in an information-dependent manner in high-sensitivity mode. The declustering potential was −60 V, and the collision energy was 35 ± 15 eV. Ten candidate ions were monitored per cycle using an isotope isolation width of 4.0 kD, and the MS/MS accumulation time was 50 ms.

Quantification of metabolomics data

The msconvert function in ProteoWizard (Chambers et al, 2012) was used to convert raw MS files (.d) to the mzXML format. Then, these files were uploaded to the XCMS (Gowda et al, 2014) for data processing, including feature detection, retention time correction, and alignment with the method described previously (Jia et al, 2018). The MS/MS data were matched against the in-house standard spectral library (Wang et al, 2016) and lipid MS/MS spectral library (Tu et al, 2017), and metabolites were identified with accurate mass (<25 ppm). To align the MS and MS/MS data, m/z errors less than 15 ppm and retention time errors fewer than 20 s were applied. To check the reliability of the metabolomics data, a PCA was performed on both the experimental and quality control samples before the data normalization procedure. The location of the quality control sample in the three-dimensional histogram and plots of the top five principal components indicated that the obtained metabolite profiles were accurate and appropriate for subsequent analyses (Fig S13). Normalization of the relative abundances of the 3,746 detected analytes was performed with MetaboAnalyst 4.0 (Xia & Wishart, 2011). Samples with two biological replicates were averaged first. Sample normalization (normalization by sum), data transformation (none), and data scaling (autoscaling) were performed with MetaboAnalyst 4.0 to make the samples and features suitable for statistical analyses. We transformed parental metabolite levels to obtain the means, differences, and ratios of parental metabolite files for each hybrid with equations FMeans = (P1 + P2)/2, HDifferences = P1 − P2, and FRatios = P1/P2, where F, P1, and P2 are the metabolic levels of the hybrid, female parent, and male parent, respectively. Some of the metabolic data were reported in a previous study (Dan et al, 2019), and the complete metabolic data corresponding to all parents and hybrids used for analyses are provided in Table S3, which was prepared according to recommendations (Fernie et al, 2011).

Figure S13.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure S13. PCA score plots of quality control and experimental samples.

To check the accuracy of the obtained metabolomics data, a PCA was performed on both the quality control and experimental samples based on the unnormalized relative abundance of all detected analytes. (A, B) Score plots of the three-dimensional (A) and the top five principal components (B) showed the tight distribution of the quality control sample during the whole experimental process. Red, green, and blue colors represent the quality control (QC), samples for Pop2012, and samples from Pop2015, respectively.

Determination of population structure and selection of predictive analytes

The 3,746 detected analytes and 34 pairs of InDel markers were used to assess the genetic differences of parents of Pop2012. Dendrograms were drawn based on the relative abundances of detected analytes and genotypes of each marker with the Statistical Analysis module (with Euclidean and Ward as the distance measure and clustering algorithm, respectively) on MetaboAnalyst. PCA was performed on the metabolome of parents of both populations. The mean relative abundances of the 3,746 detected analytes in every pair of parents were calculated as predictive variables for corresponding hybrids. PCA was performed to demonstrate the population distances. Scores for the first principal component were reordered from high to low, and hybrids were chosen every two, three, or four intervals as the core hybrids.

In the exploration of predictive models for BPH-YPP, hybrids from Pop2012 were divided into two subgroups based on low- and high-BPH-YPP values. PCA and PLS-DA were performed on hybrids in the low- and high-BPH-YPP subgroups to investigate their metabolic relationships. Analytes were screened according to VIP values from the results of PLS-DA, and analytes with low VIP values were removed from the models. Subsequently, core hybrids from both populations were combined as a training set, and PLS was applied to bridge the predictive variables and hybrid performance. The BPH-YPP of the hybrids and the predictive variables were dependent and independent variables, respectively. The number of latent factors was selected when the largest adjusted R-square value of cumulative Y variance emerged. The mean VIP values of all latent factors were used to screen predictive analytes. Variables with large contributions in the model were retained for modeling, and the number of predictive analytes was adjusted to achieve high predictability.

Pathway mapping of metabolic analytes

Differential analytes with m/z, P-values, and t scores from t tests comparing high- and low-BPH-YPP hybrids were mapped to metabolic pathways through the module “MS Peaks to Pathways” on MetaboAnalyst (Xia & Wishart, 2011). The mass accuracy was 5.0 ppm. The analytical mode was negative, and the data format was three-column according to the data preparation instructions. GSEA was the algorithm, and the Oryza sativa japonica (Japanese rice) pathway library was selected for enrichment analysis. Dysregulated network analysis was performed with MetDNA (Shen et al, 2019). The tandem mass spectra data with a mass range of 60–1,200 kD collected from the quality control sample were uploaded for metabolite identification. Hybrids with high- and low BPH-YPPs were grouped into the control and case groups, respectively. The ionization polarity was negative, and reversed-phase LC was used. The mass spectrometer instrument type was Sciex TripleTOF, and the collision energy was 35 ± 15 eV. t test was selected as the statistical method. The library of Arabidopsis thaliana (thale cress) was chosen for enrichment analysis. The cutoff P-value was 0.05, and the P-values were corrected with a false discovery rate in the analysis.

Statistical analysis

Independent-samples t test (two-tailed) was performed to compare differences of YPP for parents and hybrids, the BPH-YPP for hybrids from both populations, and the average levels of metabolites in pathways between high- and low-BPH-YPP hybrids with SPSS (IBM SPSS Statistics for Windows, Version 20.0; IBM Corp.). Levene’s test was used to determine the equality of variances. Paired-samples t test (two-tailed) was used to compare differences of BPH-YPP between reciprocal hybrids that were selected from population 2012 and hybrids that were phenotyped in both years. PLS and bivariate correlation analyses were implemented with SPSS. PCA and PLS-DA were conducted with the “Statistical Analysis” module on MetaboAnalyst (Xia & Wishart, 2011). The correlation coefficient of Pearson correlation (two-tailed) between the observed and predicted BPH-YPP values was treated as predictability.

Data access

Raw metabolomics files generated in this study have been deposited to the metabolomics database MetaboLights (www.ebi.ac.uk/metabolights) under the study identifier MTBLS742.

Acknowledgements

We thank members of the 3134 laboratory for the assistance of field experiments and valuable discussions. This research was supported by the National Key Research & Development Program of China (Grant No 2017YFD0100400), National Natural Science Foundation of China (Grants 31771746 and 31801439), National Rice Industry Technology System (Grant No CARS-01-07), and the China Postdoctoral Science Foundation.

Author Contributions

  • Z Dan: conceptualization, resources, software, funding acquisition, investigation, methodology, and writing—original draft, review, and editing.

  • Y Chen: data curation, software, investigation, and methodology.

  • W Zhao: data curation and investigation.

  • Q Wang: data curation and investigation.

  • W Huang: conceptualization, resources, data curation, supervision, funding acquisition, investigation, and writing—original draft, review, and editing.

Conflict of Interest Statement

The authors declare that they have no conflict of interest.

  • Received September 15, 2019.
  • Revision received November 24, 2019.
  • Accepted November 26, 2019.
  • © 2019 Dan et al.
Creative Commons logoCreative Commons logohttps://creativecommons.org/licenses/by/4.0/

This article is available under a Creative Commons License (Attribution 4.0 International, as described at https://creativecommons.org/licenses/by/4.0/).

References

  1. ↵
    1. Barker M,
    2. Rayens W
    (2003) Partial least squares for discrimination. J Chemometr 17: 166–173. doi:10.1002/cem.785
    OpenUrlCrossRef
  2. ↵
    1. Blazenovic I,
    2. Kind T,
    3. Sa MR,
    4. Ji J,
    5. Vaniya A,
    6. Wancewicz B,
    7. Roberts BS,
    8. Torbasinovic H,
    9. Lee T,
    10. Mehta SS, et al.
    (2019) Structure annotation of all mass spectra in untargeted metabolomics. Anal Chem 91: 2155–2162. doi:10.1021/acs.analchem.8b04698
    OpenUrlCrossRef
  3. ↵
    1. Chambers MC,
    2. MacLean B,
    3. Burke R,
    4. Amode D,
    5. Ruderman DL,
    6. Neumann S,
    7. Gatto L,
    8. Fischer B,
    9. Pratt B,
    10. Egertson J, et al.
    (2012) A cross-platform toolkit for mass spectrometry and proteomics. Nat Biotechnol 30: 918–920. doi:10.1038/nbt.2377
    OpenUrlCrossRefPubMed
  4. ↵
    1. Chen L,
    2. Liu YG
    (2014) Male sterility and fertility restoration in crops. Annu Rev Plant Biol 65: 579–606. doi:10.1146/annurev-arplant-050213-040119
    OpenUrlCrossRefPubMed
  5. ↵
    1. Cheng SH,
    2. Zhuang JY,
    3. Fan YY,
    4. Du JH,
    5. Cao LY
    (2007) Progress in research and development on hybrid rice: A super-domesticate in China. Ann Bot 100: 959–966. doi:10.1093/aob/mcm121
    OpenUrlCrossRefPubMed
  6. ↵
    1. Dan Z,
    2. Chen Y,
    3. Xu Y,
    4. Huang J,
    5. Huang J,
    6. Hu J,
    7. Yao G,
    8. Zhu Y,
    9. Huang W
    (2019) A metabolome-based core hybridisation strategy for the prediction of rice grain weight across environments. Plant Biotechnol J 17: 906–913. doi:10.1111/pbi.13024
    OpenUrlCrossRef
  7. ↵
    1. Dan Z,
    2. Hu J,
    3. Zhou W,
    4. Yao G,
    5. Zhu R,
    6. Huang W,
    7. Zhu Y
    (2015) Hierarchical additive effects on heterosis in rice (Oryza sativa L.). Front Plant Sci 6: 738. doi:10.3389/fpls.2015.00738
    OpenUrlCrossRef
  8. ↵
    1. Dan Z,
    2. Hu J,
    3. Zhou W,
    4. Yao G,
    5. Zhu R,
    6. Zhu Y,
    7. Huang W
    (2016) Metabolic prediction of important agronomic traits in hybrid rice (Oryza sativa L.). Sci Rep 6: 21732. doi:10.1038/srep21732
    OpenUrlCrossRef
  9. ↵
    1. de Abreu ELF,
    2. Westhues M,
    3. Cuadros-Inostroza Á,
    4. Willmitzer L,
    5. Melchinger AE,
    6. Nikoloski Z
    (2017) Metabolic robustness in young roots underpins a predictive model of maize hybrid performance in the field. Plant J 90: 319–329. doi:10.1111/tpj.13495
    OpenUrlCrossRef
  10. ↵
    1. FAO
    (2009) How to Feed the World in 2050. Rome: Food and Agriculture Organization of the United Nations.
  11. ↵
    1. Fernie AR,
    2. Aharoni A,
    3. Willmitzer L,
    4. Stitt M,
    5. Tohge T,
    6. Kopka J,
    7. Carroll AJ,
    8. Saito K,
    9. Fraser PD,
    10. DeLuca V
    (2011) Recommendations for reporting metabolite data. Plant Cell 23: 2477–2482. doi:10.1105/tpc.111.086272
    OpenUrlFREE Full Text
  12. ↵
    1. Gowda H,
    2. Ivanisevic J,
    3. Johnson CH,
    4. Kurczy ME,
    5. Benton HP,
    6. Rinehart D,
    7. Nguyen T,
    8. Ray J,
    9. Kuehl J,
    10. Arevalo B, et al.
    (2014) Interactive XCMS online: Simplifying advanced metabolomic data processing and subsequent statistical analyses. Anal Chem 86: 6931–6939. doi:10.1021/ac500734c
    OpenUrlCrossRefPubMed
  13. ↵
    1. Heuberger AL,
    2. Broeckling CD,
    3. Kirkpatrick KR,
    4. Prenni JE
    (2014) Application of nontargeted metabolite profiling to discover novel markers of quality traits in an advanced population of malting barley. Plant Biotechnol J 12: 147–160. doi:10.1111/pbi.12122
    OpenUrlCrossRef
  14. ↵
    1. Hickey JM,
    2. Chiurugwi T,
    3. Mackay I,
    4. Powell W,
    5. Eggen A,
    6. Kilian A,
    7. Jones C,
    8. Canales C,
    9. Grattapaglia D,
    10. Bassi F, et al.
    (2017) Genomic prediction unifies animal and plant breeding programs to form platforms for biological discovery. Nat Genet 49: 1297–1303. doi:10.1038/ng.3920
    OpenUrlCrossRefPubMed
  15. ↵
    1. Huang W,
    2. Yu C,
    3. Hu J,
    4. Wang L,
    5. Dan Z,
    6. Zhou W,
    7. He C,
    8. Zeng Y,
    9. Yao G,
    10. Qi J, et al.
    (2015) Pentatricopeptide-repeat family protein RF6 functions with hexokinase 6 to rescue rice cytoplasmic male sterility. Proc Natl Acad Sci U S A 112: 14984–14989. doi:10.1073/pnas.1511748112
    OpenUrlAbstract/FREE Full Text
  16. ↵
    1. Jia H,
    2. Shen X,
    3. Guan Y,
    4. Xu M,
    5. Tu J,
    6. Mo M,
    7. Xie L,
    8. Yuan J,
    9. Zhang Z,
    10. Cai S, et al.
    (2018) Predicting the pathological response to neoadjuvant chemoradiation using untargeted metabolomics in locally advanced rectal cancer. Radiother Oncol 128: 548–556. doi:10.1016/j.radonc.2018.06.022
    OpenUrlCrossRef
  17. ↵
    1. Jolliffe IT
    (1986) Principal component analysis and factor analysis Springer Series in Statistics. New York: Springer: 115–128.
  18. ↵
    1. Li X,
    2. Guo T,
    3. Mu Q,
    4. Li X,
    5. Yu J
    (2018) Genomic and environmental determinants and their interplay underlying phenotypic plasticity. Proc Natl Acad Sci U S A 115: 6679–6684. doi:10.1073/pnas.1718326115
    OpenUrlAbstract/FREE Full Text
  19. ↵
    1. Luo D,
    2. Xu H,
    3. Liu Z,
    4. Guo J,
    5. Li H,
    6. Chen L,
    7. Fang C,
    8. Zhang Q,
    9. Bai M,
    10. Yao N, et al.
    (2013) A detrimental mitochondrial-nuclear interaction causes cytoplasmic male sterility in rice. Nat Genet 45: 573–577. doi:10.1038/ng.2570
    OpenUrlCrossRefPubMed
  20. ↵
    1. Maddison AL,
    2. Camargo-Rodriguez A,
    3. Scott IM,
    4. Jones CM,
    5. Elias DMO,
    6. Hawkins S,
    7. Massey A,
    8. Clifton-Brown J,
    9. McNamara NP,
    10. Donnison IS, et al.
    (2017) Predicting future biomass yield in Miscanthus using the carbohydrate metabolic profile as a biomarker. Glob Change Biol Bioenergy 9: 1264–1278. doi:10.1111/gcbb.12418
    OpenUrlCrossRef
  21. ↵
    1. Obata T,
    2. Witt S,
    3. Lisec J,
    4. Palacios-Rojas N,
    5. Florez-Sarasa I,
    6. Yousfi S,
    7. Araus JL,
    8. Cairns JE,
    9. Fernie AR
    (2015) Metabolite profiles of maize leaves in drought, heat, and combined stress field trials reveal the relationship between metabolism and grain yield. Plant Physiol 169: 2665–2683. doi:10.1104/pp.15.01164
    OpenUrlAbstract/FREE Full Text
  22. ↵
    1. Rasheed A,
    2. Hao Y,
    3. Xia X,
    4. Khan A,
    5. Xu Y,
    6. Varshney RK,
    7. He Z
    (2017) Crop breeding chips and genotyping platforms: Progress, challenges, and perspectives. Mol Plant 10: 1047–1064. doi:10.1016/j.molp.2017.06.008
    OpenUrlCrossRef
  23. ↵
    1. Riedelsheimer C,
    2. Czedik-Eysenberg A,
    3. Grieder C,
    4. Lisec J,
    5. Technow F,
    6. Sulpice R,
    7. Altmann T,
    8. Stitt M,
    9. Willmitzer L,
    10. Melchinger AE
    (2012a) Genomic and metabolic prediction of complex heterotic traits in hybrid maize. Nat Genet 44: 217–220. doi:10.1038/ng.1033
    OpenUrlCrossRefPubMed
  24. ↵
    1. Riedelsheimer C,
    2. Endelman JB,
    3. Stange M,
    4. Sorrells ME,
    5. Jannink JL,
    6. Melchinger AE
    (2013) Genomic predictability of interconnected biparental maize populations. Genetics 194: 493–503. doi:10.1534/genetics.113.150227
    OpenUrlAbstract/FREE Full Text
  25. ↵
    1. Riedelsheimer C,
    2. Lisec J,
    3. Czedik-Eysenberg A,
    4. Sulpice R,
    5. Flis A,
    6. Grieder C,
    7. Altmann T,
    8. Stitt M,
    9. Willmitzer L,
    10. Melchinger AE
    (2012b) Genome-wide association mapping of leaf metabolic profiles for dissecting complex traits in maize. Proc Natl Acad Sci U S A 109: 8872–8877. doi:10.1073/pnas.1120813109
    OpenUrlAbstract/FREE Full Text
  26. ↵
    1. Schauer N,
    2. Semel Y,
    3. Roessner U,
    4. Gur A,
    5. Balbo I,
    6. Carrari F,
    7. Pleban T,
    8. Perez-Melis A,
    9. Bruedigam C,
    10. Kopka J, et al.
    (2006) Comprehensive metabolic profiling and phenotyping of interspecific introgression lines for tomato improvement. Nat Biotechnol 24: 447–454. doi:10.1038/nbt1192
    OpenUrlCrossRefPubMed
  27. ↵
    1. Scheres B,
    2. van der Putten WH
    (2017) The plant perceptron connects environment to development. Nature 543: 337–345. doi:10.1038/nature22010
    OpenUrlCrossRef
  28. ↵
    1. Schrag T,
    2. Westhues M,
    3. Schipprack W,
    4. Seifert F,
    5. Thiemann A,
    6. Scholten S,
    7. Melchinger AE
    (2018) Beyond genomic prediction: Combining different types of omics data can improve prediction of hybrid performance in maize. Genetics 208: 1373–1385. doi:10.1534/genetics.117.300374
    OpenUrlAbstract/FREE Full Text
  29. ↵
    1. Varshney R,
    2. Roorkiwal M,
    3. Sorrells M
    1. Schulthess AW,
    2. Zhao Y,
    3. Reif JC
    (2017) Genomic selection in hybrid breeding. In Genomic Selection for Crop Improvement, Varshney R, Roorkiwal M, Sorrells M (eds). Cham: Springer.
  30. ↵
    1. Shen X,
    2. Wang R,
    3. Xiong X,
    4. Yin Y,
    5. Cai Y,
    6. Ma Z,
    7. Liu N,
    8. Zhu ZJ
    (2019) Metabolic reaction network-based recursive metabolite annotation for untargeted metabolomics. Nat Commun 10: 1516. doi:10.1038/s41467-019-09550-x
    OpenUrlCrossRef
  31. ↵
    1. Sprenger H,
    2. Erban A,
    3. Seddig S,
    4. Rudack K,
    5. Thalhammer A,
    6. Le MQ,
    7. Walther D,
    8. Zuther E,
    9. Köhl KI,
    10. Kopka J, et al.
    (2018) Metabolite and transcript markers for the prediction of potato drought tolerance. Plant Biotechnol J 16: 939–950. doi:10.1111/pbi.12840
    OpenUrlCrossRef
  32. ↵
    1. Steinfath M,
    2. Strehmel N,
    3. Peters R,
    4. Schauer N,
    5. Groth D,
    6. Hummel J,
    7. Steup M,
    8. Selbig J,
    9. Kopka J,
    10. Geigenberger P, et al.
    (2010) Discovering plant metabolic biomarkers for phenotype prediction using an untargeted approach. Plant Biotechnol J 8: 900–911. doi:10.1111/j.1467-7652.2010.00516.x
    OpenUrlCrossRefPubMed
  33. ↵
    1. 2
    2. Technow F,
    3. Schrag TA,
    4. Schipprack W,
    5. Bauer E,
    6. Simianer H,
    7. Melchinger AE
    (2014) Genome properties and prospects of genomic prediction of hybrid performance in a breeding program of maize. Genetics 197: 1343–1355. doi:10.1534/genetics.114.165860
    OpenUrlAbstract/FREE Full Text
  34. ↵
    1. Thomas D
    (2010) Gene—Environment-wide association studies: Emerging approaches. Nature 11: 259–272. doi:10.1038/nrg2764
    OpenUrlCrossRefPubMed
  35. ↵
    1. Tu J,
    2. Yin Y,
    3. Xu M,
    4. Wang R,
    5. Zhu ZJ
    (2017) Absolute quantitative lipidomics reveals lipidome-wide alterations in aging brain. Metabolomics 14: 5. doi:10.1007/s11306-017-1304-x
    OpenUrlCrossRef
  36. ↵
    1. Wang J,
    2. Zhang T,
    3. Shen X,
    4. Liu J,
    5. Zhao D,
    6. Sun Y,
    7. Wang L,
    8. Liu Y,
    9. Gong X,
    10. Liu Y, et al.
    (2016) Serum metabolomics for early diagnosis of esophageal squamous cell carcinoma by UHPLC-QTOF/MS. Metabolomics 12: 116. doi:10.1007/s11306-016-1050-5
    OpenUrlCrossRef
  37. ↵
    1. Wang W,
    2. Mauleon R,
    3. Hu Z,
    4. Chebotarov D,
    5. Tai S,
    6. Wu Z,
    7. Li M,
    8. Zheng T,
    9. Fuentes RR,
    10. Zhang F, et al.
    (2018) Genomic variation in 3,010 diverse accessions of Asian cultivated rice. Nature 557: 43–49. doi:10.1038/s41586-018-0063-9
    OpenUrlCrossRef
  38. ↵
    1. Wen W,
    2. Li K,
    3. Alseekh S,
    4. Omranian N,
    5. Zhao L,
    6. Zhou Y,
    7. Xiao Y,
    8. Jin M,
    9. Yang N,
    10. Liu H, et al.
    (2015) Genetic determinants of the network of primary metabolism and their relationships to plant performance in a maize recombinant inbred line population. Plant Cell 27: 1839–1856. doi:10.1105/tpc.15.00208
    OpenUrlAbstract/FREE Full Text
  39. ↵
    1. Westhues M,
    2. Schrag TA,
    3. Heuer C,
    4. Thaller G,
    5. Utz HF,
    6. Schipprack W,
    7. Thiemann A,
    8. Seifert F,
    9. Ehret A,
    10. Schlereth A, et al.
    (2017) Omics-based hybrid prediction in maize. Theor Appl Genet 130: 1927–1939. doi:10.1007/s00122-017-2934-0
    OpenUrlCrossRef
  40. ↵
    1. Windhausen VS,
    2. Atlin GN,
    3. Hickey JM,
    4. Crossa J,
    5. Jannink JL,
    6. Sorrells ME,
    7. Raman B,
    8. Cairns JE,
    9. Tarekegne A,
    10. Semagn K, et al.
    (2012) Effectiveness of genomic prediction of maize hybrid performance in different breeding populations and environments. G3 2: 1427–1436. doi:10.1534/g3.112.003699
    OpenUrlCrossRef
  41. ↵
    1. Jöreskog KG,
    2. Wold H
    1. Wold H
    (1982) Soft modeling: The basic design and some extensions. In Systems under Indirect Observations, Part II, Jöreskog KG, Wold H (eds), Vol. 2, pp 1–54. Amsterdam: North-Holland.
    OpenUrl
  42. ↵
    1. Wray NR,
    2. Yang J,
    3. Hayes BJ,
    4. Price AL,
    5. Goddard ME,
    6. Visscher PM
    (2013) Pitfalls of predicting complex traits from SNPs. Nat Rev Genet 14: 507–515. doi:10.1038/nrg3457
    OpenUrlCrossRefPubMed
  43. ↵
    1. Xia J,
    2. Wishart DS
    (2011) Web-based inference of biological patterns, functions and pathways from metabolomic data using MetaboAnalyst. Nat Protoc 6: 743–760. doi:10.1038/nprot.2011.319
    OpenUrlCrossRefPubMed
  44. ↵
    1. Xu S,
    2. Xu Y,
    3. Gong L,
    4. Zhang Q
    (2016) Metabolomic prediction of yield in hybrid rice. Plant J 88: 219–227. doi:10.1111/tpj.13242
    OpenUrlCrossRef
  45. ↵
    1. Xu S,
    2. Zhu D,
    3. Zhang Q
    (2014) Predicting hybrid performance in rice using genomic best linear unbiased prediction. Proc Natl Acad Sci U S A 111: 12456–12461. doi:10.1073/pnas.1413750111
    OpenUrlAbstract/FREE Full Text
  46. ↵
    1. Yu X,
    2. Li X,
    3. Guo T,
    4. Zhu C,
    5. Wu Y,
    6. Mitchell SE,
    7. Roozeboom KL,
    8. Wang D,
    9. Wang ML,
    10. Pederson GA, et al.
    (2016) Genomic prediction contributing to a promising global strategy to turbocharge gene banks. Nat Plants 2: 16150. doi:10.1038/nplants.2016.150
    OpenUrlCrossRef
  47. ↵
    1. Zhao C,
    2. Liu B,
    3. Piao S,
    4. Wang X,
    5. Lobell D,
    6. Huang Y,
    7. Huang M,
    8. Yao Y,
    9. Bassu S,
    10. Ciais P, et al.
    (2017) Temperature increase reduces global yields of major crops in four independent estimates. Proc Natl Acad Sci U S A 114: 9326–9331. doi:10.1073/pnas.1701762114
    OpenUrlAbstract/FREE Full Text
  48. ↵
    1. Zhao C,
    2. Piao S,
    3. Wang X,
    4. Huang Y,
    5. Ciais P,
    6. Elliott J,
    7. Huang M,
    8. Janssens IA,
    9. Li T,
    10. Lian X, et al.
    (2016) Plausible rice yield losses under future climate warming. Nat Plants 3: 16202. doi:10.1038/nplants.2016.202
    OpenUrlCrossRef
  49. ↵
    1. Zhao Y,
    2. Li Z,
    3. Liu G,
    4. Jiang Y,
    5. Maurer HP,
    6. Würschum T,
    7. Mock HP,
    8. Matros A,
    9. Ebmeyer E,
    10. Schachschneider R, et al.
    (2015) Genome-based establishment of a high-yielding heterotic pattern for hybrid wheat breeding. Proc Natl Acad Sci U S A 112: 15624–15629.doi:10.1073/pnas.1514547112
    OpenUrlAbstract/FREE Full Text
PreviousNext
Back to top
Download PDF
Email Article

Thank you for your interest in spreading the word on Life Science Alliance.

NOTE: We only request your email address so that the person you are recommending the page to knows that you wanted them to see it, and that it is not junk mail. We do not capture any email address.

Enter multiple addresses on separate lines or separate them with commas.
Metabolome-based prediction of yield heterosis contributes to the breeding of elite rice
(Your Name) has sent you a message from Life Science Alliance
(Your Name) thought you would like to see the Life Science Alliance web site.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Citation Tools
Metabolic prediction of yield heterosis
Zhiwu Dan, Yunping Chen, Weibo Zhao, Qiong Wang, Wenchao Huang
Life Science Alliance Dec 2019, 3 (1) e201900551; DOI: 10.26508/lsa.201900551

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
Share
Metabolic prediction of yield heterosis
Zhiwu Dan, Yunping Chen, Weibo Zhao, Qiong Wang, Wenchao Huang
Life Science Alliance Dec 2019, 3 (1) e201900551; DOI: 10.26508/lsa.201900551
Twitter logo Facebook logo Mendeley logo
  • Tweet Widget
Issue Cover

In this Issue

Volume 3, No. 1
January 2020
  • Table of Contents
  • Cover (PDF)
  • About the Cover
  • Masthead (PDF)
Advertisement

Jump to section

  • Article
    • Abstract
    • Introduction
    • Results
    • Discussion
    • Materials and Methods
    • Acknowledgements
    • References
  • Figures & Data
  • Info
  • Metrics
  • Reviewer Comments
  • PDF

Subjects

  • Genomics & Functional Genomics
  • Plant Science
  • Systems & Computational Biology

Related Articles

  • No related articles found.

Cited By...

  • No citing articles found.
  • Google Scholar

More in this TOC Section

  • The roles of interleukin 34 in rat macrophage function
  • A novel NRF2 activator in Alport syndrome mice
  • hnRNP A1 binds a novel ESS element inducing CFTR missplicing
Show more Research Article

Similar Articles

EMBO Press LogoRockefeller University Press LogoCold Spring Harbor Logo

Content

  • Home
  • Newest Articles
  • Current Issue
  • Archive
  • Subject Collections

For Authors

  • Submit a Manuscript
  • Author Guidelines
  • License, copyright, Fee

Other Services

  • Alerts
  • Bluesky
  • X/Twitter
  • RSS Feeds

More Information

  • Editors & Staff
  • Reviewer Guidelines
  • Feedback
  • Licensing and Reuse
  • Privacy Policy

ISSN: 2575-1077
© 2025 Life Science Alliance LLC

Life Science Alliance is registered as a trademark in the U.S. Patent and Trade Mark Office and in the European Union Intellectual Property Office.