GGE双标图的信息比校正原理与应用*——以长江流域棉花品种生态区划分为例

Principles and applications of information ratio adjustment of GGE biplot— A case study of cotton mega-environment investigation in the Yangtze River Valley

  • 摘要: GGE双标图方法在农作物品种区域试验中被广泛地应用于品种评价、环境评价和品种生态区划分的统计分析和图形直观展示, 但GGE双标图分析只能局限于前两个主成分, 不能根据信息比准则恰当地取舍主成分数, 因而无法保证对数据的最优拟合效果。本研究以长江流域国家棉花区域试验数据为例, 选择信息比IR≥ 1的主成分对GGE双标图模型进行校正, 通过试验环境主成分得分的欧氏距离矩阵的聚类分析, 校正通过双标图分析的品种生态区划分方案。结果表明, GGE双标图恰当拟合试验数据的比例仅为28.6%, 在68.6%的试验中拟合不足, 并在2.9%的试验中拟合过度。信息比校正的GGE(IR-GGE)模型总体拟合度提高了8.7%, 而在GGE双标图拟合不足或拟合过度的试验中校正了12.2%的失拟度。GGE双标图模型的离优度系数为15.9%, 对区域试验的总体模拟效果较好, 仍可以展示基因型与环境互作的基本模式; 但IR-GGE模型的拟合度更高, 分析结果也更可靠。GGE双标图模型和IR-GGE模型对棉花品种生态区划分的总体架构相似, 都将南襄盆地和四川盆地棉区划分为特定生态区, 但在长江中下游棉区的划分细节上存在较大差异。IR-GGE模型的生态区划分方案与地理区域和生态特征更加吻合, 实用性更强。本研究为GGE双标图的信息比校正研究和应用提供了范例, 是对GGE双标图应用的重要补充, 在基于GGE双标图的农作物品种区域试验数据分析和利用等方面具有重要的理论意义和应用价值。

     

    Abstract: The GGE (genotype main effect plus genotype by environment interaction effect) biplot is the most powerful statistical and visual display tool available for cultivar evaluation, environmental evaluation and mega-envrironment investigation. The versatility of GGE biplot in displaying cultivar stability and high yielding, identifying ideal cultivars and test environments, evaluating the representativeness and discrimination ability of test sites, and differentiating mega- environments have attracted extensive application in analyzing regional trials of many crops. Nevertheless, few reports have focused on the potential loss of fit of GGE biplot models and model adjustment using information ratio (IR). IR is the product of percent variation explained by each principal component and the minor value of degree of freedom of genotypes along with the number of test locatons in GGE biplot analysis of datasets in regional crop trials. As principal component with IR ≥ 1 has useful information, it is a sufficient and necessary condition of GGE model with appropriate goodness of fit to cover all data analysis. In fact, the goodness of fit of GGE biplot models is restricted to the sum of percent variation explained by the first two principal components (PC1 and PC2), rather than the suitable principal components determined by IR. Thus although GGE biplot model is an efficient graphical display of data structure, it is not so efficient to guarantee optimal fitting of effects. Using the GGE biplot method and collected datasets in national regional cotton (Gossypium hirsutum L.) trials in the Yangtze River Valley in 2000?2012, this study showed the effects of IR adjustments of GGE models in maga-environment investigation. The scores of principal components for IR ≥ 1 in 35 groups of regional cotton trials were used to calculate Euclidean distance matrix among the test environments. Also a hierachical cluster analysis was implemented to outline the scheme of differentiation of the mega-environment. A corresponding analysis was also carried out using fixed first two principal components of GGE biplot analysis for the purpose of mutual comparision between GGE biplot model and IR-adjusted GGE model for mega-environment investigation efficiency. The results showed that while only 28.6% was appropriately fitted by GGE biplot model, 68.6% was under-fitted and 2.9% over-fitted in 35 groups of regional cotton trials. The IR-adjusted GGE model enhanced the goodness of fit by 8.7% and reduced loss of fit by 12.2% for under- and over-fitted GGE biplot model trials. Compared with IR-adjusted model, the superiority index of GGE biplot model was 15.9%. This indicated that GGE biplot model performed satisfactorily in depicting the overall pattern of genotype by environment interaction of regional cotton trials. However, the IR-adjusted GGE model was more reliable and had a more precise goodness of fit. The first hierachical mega-environment differentiation by the IR-adjusted GGE model was the same as that of GGE biplot model in terms of identifying cotton planting regions in Nan-Xiang Basin and Sichuan Basin as particular ecological regions. However, they were significantly different in terms of subregion divisions in the middle and lower reaches of the Yangtze River Valley. The mega-environment division scheme based on the IR-adjusted GGE model was of more practical in terms of geographical and ecological factor representation. Thus the study demonstrated an excellent example of the principles and application effects of GGE biplot adjusted with IR. This served as a significant supplement and improvement to GGE biplot application. It also provided the scientific basis and guidline for the application of GGE biplot in mega-environment investigation.

     

/

返回文章
返回