# Factor-analytic models for genotype × environment type problems and structured covariance matrices

- 8.8k Downloads
- 42 Citations

## Abstract

### Background

Analysis of data on genotypes with different expression in different environments is a classic problem in quantitative genetics. A review of models for data with genotype × environment interactions and related problems is given, linking early, analysis of variance based formulations to their modern, mixed model counterparts.

### Results

It is shown that models developed for the analysis of multi-environment trials in plant breeding are directly applicable in animal breeding. In particular, the 'additive main effect, multiplicative interaction' models accommodate heterogeneity of variance and are characterised by a factor-analytic covariance structure. While this can be implemented in mixed models by imposing such structure on the genetic covariance matrix in a standard, multi-trait model, an equivalent model is obtained by fitting the common and specific factors genetic separately. Properties of the mixed model equations for alternative implementations of factor-analytic models are discussed, and extensions to structured modelling of covariance matrices for multi-trait, multi-environment scenarios are described.

### Conclusion

Factor analytic models provide a natural framework for modelling genotype × environment interaction type problems. Mixed model analyses fitting such models are likely to see increasing use due to the parsimonious description of covariance structures available, the scope for direct interpretation of factors as well as computational advantages.

### Keywords

Cholesky Factor Variance Component Estimation Lower Triangular Matrix Genetic Covariance Additive Main Effect## Introduction

It has long been recognised that expression of genotypes is altered by environmental conditions. This can result in differences in variability as well as different ranking of genotypes in different environments. Classic analyses of such genotype by environment interaction (G × E) modelled G × E effects in just that manner: as an interaction effect in a two-way classification with genotypes and environments as main effects, in an analysis of variance (ANOVA). Assuming genotypes and interaction effects are random, such basic model generally implies a constant variance of G × E effects and, for more than two environments, a uniform genetic correlation across all environments. Often, this is too restrictive and a number of other models and methods have been developed, both in animal and plant breeding applications; see, for instance, Freeman [1] for a review of early approaches, Cameron [2] for an outline of more modern methods, and James [3] for a recent exposé.

Falconer [4] perceived that treating performance of genotypes in different environments as different, correlated traits provides an alternative way to model G × E effects. As individuals are general limited to a single environment, this relies on the availability of close relatives in the other environments to create genetic links. This approach allows for a more flexible covariance structure which can account for both scale and rank interactions. Generally, the resulting, multi-trait genetic covariance matrix is treated as 'unstructured' which, for *q* environments, comprises *q*(*q* + 1)/2 distinct covariances. At the other extreme, the 'compound symmetry' structure implied by the two-way ANOVA with interaction involves two parameters, the genetic variance and the variance due to G × E effects. When there are many environments, estimation of an unstructured covariance matrix can be infeasible. Hence, there has been considerable interest in fitting a structure to the covariance matrix which is flexible enough to accommodate heterogeneity of variances and some differences in genetic correlations between environments, but, at the same time, is parsimonious enough to allow estimation of the parameters involved with reasonable accuracy. Recently, interest has focused on structures which utilise the leading principal components of a covariance matrix, as it has become understood that such structures can be fitted directly within the mixed model framework commonly employed for estimation and prediction in quantitative genetic analyses [5, 6]. This encompasses both reduced rank and factor-analytic (FA) models. Added impetus for the use of FA models has come from plant breeding applications, especially the analysis of variety trials carried out in a range of locations. There has been increasing use of mixed model methodology in this field, for both the estimation of (co)variance components and the prediction of genetic merit for varietal selection, e.g. [7, 8, 9, 10, 11]. This has been stimulated by the recognition that analyses fitting a factor analytic (FA) structure for genotype effects provide the mixed model equivalent to previous, ANOVA based models such as the 'additive main effects, multiplicative interaction' (AMMI) model or regression type models such as the Finlay-Wilkinson model [12]; see Smith *et al*. [13] or Piepho *et al*. [14] for detailed reviews.

A particular G × E problem in livestock improvement is that of international genetic evaluation. For dairy cattle, 'multiple-trait across country evaluation' (MACE) of dairy sires is well established. Loosely described, this utilises a type of adjusted daughter average instead of individual observations, as suggested by Schaeffer [15]. With a considerable number of participating countries, various approaches for a structured parameterisation of the matrices of genetic correlations between countries have been examined, including those fitting reduced rank covariance matrices [16, 17, 18] or an approximate FA structure [19, 20]. Few other applications have been reported even though maximum likelihood estimation of genetic covariance matrices with a FA structure has been considered early on in other areas [21].

This paper presents a review of FA models and examines their implementation in the standard, linear mixed model framework. Particular focus is on the utility of FA model for genotype × environment type problems, considering scenarios where the genetic covariance matrix is adequately represented by a FA or reduced rank structure.

## The factor-analytic model

### ANOVA based models for G × E problems

A natural formulation for a G × E problem is in form of a two-way classification with interaction. Let *y*_{ ijk }denote the *k*-th record for the *i*-th genotype in the *j*-th environment, *g*_{ i }and *e*_{ j }the additive effects of genotype *i* and environment *j*, *ge*_{ ij }the respective interaction effect, *μ* the overall mean and ϵ_{ ijk }the residual error term. This gives model

*y*_{ ijk }= *μ* + *g*_{ i }+ *e*_{ j }+ *ge*_{ ij }+ ϵ_{ ijk } (1)

Separation of the interaction component *ge*_{ ij }from the error ϵ_{ ijk }requires repeated records per G × E subclass. Assume we have a 'full' two-way table of G × E effects, i.e. that for *G* genotypes and *E* environments, there are *GE* terms *ge*_{ ij }. This implies that fitting an interaction not only involves a substantial number of additional terms, but can also account for a large proportion of the total degrees of freedom available. Hence, there has been long standing interest in identifying the sources of non-additivity, dating back as far as Tukey [22], and in more parsimonious modelling of the interaction effects.

In addition to reducing the number of effects fitted, structural models can afford an insight into the nature of G × E effects. A bewildering number of alternatives for such models, as used in the analysis of plant breeding trials are catalogued by van Eeuwijk [23, 24]. A widely used model, attributed to Finlay and Wilkinson [12], involves a regression on the environmental effect, i.e.

*y*_{ ijk }= *μ* + *g*_{ i }+ (1 + *β*_{ i }) *e*_{ j }+ ϵ_{ ijk } (2)

with *β*_{ i }the regression coefficient for the *i*-th genotype. The environmental effect *e*_{ j }may be estimated from the data or be comprised of an external, environmental covariable.

with *u*_{ ri }and *v*_{ rj }the *r*-th genetic and environmental score, and *λ*_{ r }the corresponding weight. The number of factors to describe the interaction, *R*, can be at most *G* - 1 or *E* - 1, whichever is the smaller. In practice, *R* is generally chosen much smaller. Parts of the interaction terms not accounted for by the *R* factors fitted are then included in the residual in (3), ${\u03f5}_{ijk}^{\ast}$.

A convenient way to determine the scores and weights in (3) is via a singular value decomposition of the matrix formed by the two-way table of G × E effects. This combines the features of ANOVA and factor (or principal component) analysis, and has thus been referred to as FANOVA [25]. Examples of applications, together with discussions on related problems such as tests of significance, partitioning of degrees of freedom, interpretation of factor scores and unbalanced data are given by various authors, e.g. [25, 26, 27, 28, 29].

Let **H**, of size *G* × *E* represent the two-way table of G × E effects. Applying a singular value decomposition then yields

**H** = **UΛV'** (4)

with **Λ** = Diag {*λ*_{ r }} the diagonal matrix of eigenvalues, and **U** = {*u*_{ ri }} and **V** = {*v*_{ rj }} the matrices of left and right singular vectors of **H**. **U** is obtained as the matrix of eigenvectors of **HH'** and **V** as that of **H'H**. In the simplest case, the elements of **H** may be estimated as means for individual G × E effects. Other suggestions, in particular for unbalanced scenarios, have been to adjust the G × E cell means for the least-squares estimates of overall mean, genotype and environment effects [25, 26].

For such scores, the model given in (3) thus, in essence, describes the interaction terms by considering the *R* leading principal components of **H** only. The resulting model has become known as AMMI model, standing for 'additive main effects, multiplicative interaction' [29, 30]. An alternative classification in use is that of a bi-linear or bi-additive model [31]. In some instances, one or both of the main effects are not fitted and the principal component analysis is performed on the combined effects rather than the interaction alone. Some authors refer to such variations of AMMI models as shifted multiplicative models [13, 32]. Initial applications of FANOVA or AMMI models considered fixed effects scenarios. Treating environments and interactions as random, Piepho [33] modelled data from plant cultivar trials using the multiplicative models described above, and showed that such models yield a covariance matrix between observations of the same form as that obtained when imposing a factor-analytic structure [34], i.e. given by **ΓΓ**' + **Ψ** with the number of columns of **Γ** equal to the number of factors considered and **Ψ** a diagonal matrix. Smith *et al*. [8] presented a corresponding case with genotypes as random and environments considered to be fixed effects.

### Factor analysis

Loosely speaking, factor analysis is concerned with identifying the common factors which give rise to correlations between variables. This involves fitting a latent variable model. In contrast, principal component analysis aims at identifying factors which explain a maximum amount of variation, and does not imply any underlying model. Let **w** denote a vector of *q* random variables with covariance matrix **Σ**. We then model **w** as

**w** = * μ*+

**Γc**+

**s**(5)

with * μ* a vector of means,

**c**, of length

*m*, the vector of common factors,

**s**, of length

*q*, the vector of residuals or specific effects, and

**Γ**, of size

*q*×

*m*, the so-called matrix of factor loadings. In the most common form of factor analysis, the columns of

**Γ**are orthogonal, i.e. ${{\gamma}^{\prime}}_{i}$

*γ*

_{ j }= 0 for

*i*≠

*j*and

*γ*

_{ i }the

*i*-th column of

**Γ**. Hence, the elements of

**c**are uncorrelated. Moreover, the common factors are assumed to have unit variance, i.e. Var (

**c**) =

**I**. Columns

*γ*

_{ i }are determined as the corresponding eigenvectors of

**Σ**, scaled by the square root of the respective eigenvalues. However,

**Γ**is not unique and is often subject to an orthogonal transformation to obtain factor loadings which are more interpretable than those derived from the eigenvectors. Finally, the specific effects are assumed to be independently distributed with heterogeneous variances

*ψ*

_{ i }, and

**c**and

**s**are assumed to be uncorrelated. This gives covariance matrix of

**w**under the FA model

Var (**w**) = **Σ**_{ FA }= **Γ Γ**' + **Ψ** (6)

with **Ψ** = Diag {*ψ*_{ i }} the diagonal matrix of specific variances. This implies that all covariances between the levels of **w** are due to the common factors, while the specific factors account for the additional variance of individual elements of **w**. For *m* common factors, this describes the *q*(*q* + 1)/2 elements of **Σ**_{ FA }through *p* = *q* + *mq* - *m*(*m*-1)/2 parameters, consisting of *q* specific variances *ψ*_{ i }and *m*(2*q* - *m* + 1)/2 elements of **Γ**, with the remaining *m*(*m* - 1)/2 elements of **Γ** determined by the orthogonality constraints. For small *m*, a FA model provides a parsimonious way to model the covariances among a considerable number of variables. As *p* can not exceed the number of parameters in the unstructured case, *q*(*q* + 1)/2, the number of common factors that can be fitted is restricted. If all specific variances *ψ*_{ i }are non-zero, the minimum number of traits for which imposing a FA structure yields a reduction in the number of parameters is *q* = 4. A FA structure for the variance of **w** is most appropriate if all the *q* traits involved are relatively evenly correlated. In this case, a small number of factors generally suffices to model the covariances among the elements of **w**. The FA model includes many of the commonly employed covariance models for G × E problems as special cases. The simplest scenario is the 'compound symmetry' structure, i.e. Σ = *σ*^{2}**11'** + *ψ* **I**, which is a FA model with a single common factor and **Γ** = *σ* **1** (where **1** denotes a vector with all elements equal to unity) and equal specific variances *ψ* for all variables. Jennrich and Schluchter [34] proposed a FA structure as an option to model the covariances between repeated records, and typical examples where this is appropriate are the 'same' measurements taken in different circumstances, e.g. different time points for longitudinal data, different locations for G × E problems, or different backgrounds in analyses of QTL or gene expression. In contrast to most random regression type 'reaction norm' models which are often invoked for such analyses, the FA approach does not require a continuous 'control' variable and does not imply smooth changes in the trait.

## Mixed model formulation

### Multi-trait model

Consider the linear mixed model

**y** = **X** * β*+

**Zu**+

**e**(7)

**y**the vector of observations for

*q*traits,

*,*

**β****u**and

**e**vectors of fixed effects, random effects and residuals, and

**X**and

**Z**the design matrices pertaining to

*and*

**β****u**. For simplicity, assume

**u**represents additive genetic effects only for

*N*individuals, with covariance matrix Var (

**u**) =

**Σ**⊗

**A**and

**A**the numerator relationship matrix (NRM). Further, let Var (

**e**) =

**R**. The corresponding mixed model equations (MME) for a standard, multivariate (MUV) analysis are then

### 'Extended' factor analytic model

The multi-trait framework (8) does not require any assumptions about **Σ** other than that it has full rank *q*. If **Σ** is represented by a FA structure (**Σ** = **ΓΓ**' + **Ψ**), however, an equivalent model to (7) is obtained by fitting the common and specific factors separately [5],

**y** = **X** * β*+

**Z**(

**I**

_{ N }⊗

**Γ**)

**c**+

**Zs**+

**e**=

**X**

*+*

**β****Z*c**+

**Zs**+

**e**(9)

**c**, of length

*mN*, and

**s**, of length

*qN*, the vectors of common and specific factors, respectively. The corresponding MME are

Note that **Z*** is considerably denser than **Z**, containing *m* coefficients *γ*_{ ij }in each row compared to a single element of unity in **Z**. While (10) comprises an additional *mN* equations, the part of the coefficient matrix for random effects is much sparser than for the MUV model, as each element of **A**^{-1} contributes only *m* + *q* non-zero elements, compared to *q*^{2} in (8). With **Ψ** diagonal, $\widehat{s}$ can have a number of zero elements if there are 'missing' records: the element for trait *j* and individual *i* is non-zero only if individual *i* or one of its relatives has a record for trait *j*.

In some contexts, the FA model shown in (9) is referred to as 'extended FA' (XFA) model to distinguish it from the equivalent, multivariate model imposing a FA structure on **Σ** (7). For REML estimation of covariance matrices imposing a FA structure, Thompson *et al*. [5] showed that the sparsity of the MME for the XFA model (10) reduced computational requirements dramatically compared to an implementation utilising the standard multi-trait model (8).

### Reduced rank model

A reduced rank model is, in essence, a FA model where specific effects are assumed absent, i.e. **Ψ** = **0**. This is the model proposed by Kirkpatrick and Meyer [6] for parsimonious estimation of genetic covariance matrices. One of the main attractions of the reduced rank model is that it provides a mixed model formulation which allows for genetic covariance matrices that are not of full rank, i.e. it alleviates the need for approximating a reduced rank matrix by a full rank one as required to in the standard MUV implementation (8).

In addition, it can result in computational advantages. Assuming **Σ** can be modelled through the first *m* principal components, the MME have less equations than for the corresponding MUV model. Furthermore, the same arguments for increased sparsity of the coefficient matrix apply as given above for the XFA model. This implies that for *m* = *q*, this parameterisation provides an equivalent model (with **Σ** of full rank) to the standard multi-trait model which not only has a sparser coefficient matrix but also involves random effects which are less correlated. This can reduce both the time per iterate and the number of iterates, in particular in genetic evaluation applications relying on indirect solution schemes. Equally, it may provide some computational advantages for analyses involving a direct solution of the MME.

In the following, we refer to such models as PC models, to describe both reduced (*m* <*q*) and full (*m* = *q*) rank FA models without specific effects.

### Factor rotation

As emphasized above, **Γ** is not unique and, for *m* factors, *m*(*m* - 1)/2 of the *mq* elements are given by orthogonality constraints. Hence, **Γ** is frequently subject to an orthogonal rotation, i.e. we can replace **Γ** by **Γ*** = **ΓT** for an arbitrary orthogonal matrix **T** without altering the matrix **Σ**_{ FA }modelled, as **ΓΓ**' = **Γ***(**Γ***)' if **TT'** = **I**. Most commonly, this is done for ease of interpretation – widely used, for instance, in social science applications. However, such transformation can also be utilised to reduce computational requirements, or to provide a parameterisation better suited to variance component estimation.

For *m* = *q* and **Ψ** = **0**, **Γ** is a matrix square root of **Σ**. Let **L** denote the Cholesky factor of **Σ**, i.e. **Σ** = **LL'** with **L** a lower triangular matrix. The Cholesky factor **L** is an alternative matrix square root of **Σ** and, moreover, can be obtained by rotating **Γ**: For **Γ** = **EΛ**^{1/2}, with **E** the matrix of eigenvectors of **Σ** and **Λ** the corresponding, diagonal matrix of eigenvalues, it can be shown that **L** = **EΛ**^{1/2}**T'**, with **T** the orthogonal matrix of right singular vectors of **L** [[35], p.232]. This implies that we can replace **Γ** in FA models with the *q × m* matrix consisting of the first *m* columns of the Cholesky factor, **L**_{ m }. For variance component estimation, this substitution is useful as the number of non-zero elements of **L**_{ m }is equal to the number of parameters to be estimated, e.g. [8], and as the Cholesky parameterisation is known to improve convergence rates in maximum likelihood estimation.

The triangular nature of **L** can also be advantageous in genetic evaluation, in particular for G × E scenarios where individuals have records in a single location only: As elements above the diagonal are zero, replacing **Γ** with **L**, the rows of **Z*** are less dense than for a **Γ** with all elements non-zero. Let ${{\ell}^{\prime}}_{j}$ denote the *j*-th row of **L**. Assuming the Cholesky factorisation has been carried out sequentially, elements *j* + 1 to *m* of ${{\ell}^{\prime}}_{j}$ are zero. For an individual with a record in location *j*, vector ${{\ell}^{\prime}}_{j}$ represents the coefficients in the respective row of the design matrix **Z***. If the individual has a record for a single trait (or environment) only, the contribution to **Z*'R**^{-1}**Z*** is ${\sigma}_{j}^{-2}{\ell}_{j}{{\ell}^{\prime}}_{j}$, with ${\sigma}_{j}^{2}$ the residual variance pertaining to *j*. It is readily seen that only the block consisting of the first *j* rows and columns of ${\ell}_{j}{{\ell}^{\prime}}_{j}$ is non-zero. Hence, the corresponding *m × m* diagonal block in the coefficient matrix corresponding to the common factors **c** has a known sparsity structure, consisting of a dense block, comprising the first *j* rows and columns, and the remaining *m* - *j* rows and columns with all off-diagonal elements equal to zero. For instance, for *j* = 1 there are no off-diagonal elements, for *j* = 2 only the first and second row and column are linked by a non-zero off-diagonal element, and only for *j* = *q* are all *m*^{2} elements in the diagonal block non-zero. This is readily exploited in both iterative and direct solution schemes. Moreover, for applications with greatly differing numbers of records in different environments, it suggests that numbering environments in decreasing order of the number of records can markedly reduce computational requirements.

### Transforming solutions

**Σ**=

**ΓΓ**' +

**Ψ**, the standard MUV and the XFA implementation are directly equivalent. In addition, the PC model considering all

*q*factors, i.e. decomposing

**Σ**=

**PP'**(with

**P**=

**E(Λ**+

**E'ΨE)**

^{1/2}the matrix of scaled eigenvectors of

**Σ**or a rotated form thereof), provides a third equivalent model. Hence, solutions for effects in the model can be obtained for one model and are readily transformed to those from another. From (7) and (9),

*et al*. [8], we can obtain solutions for the common and specific factors from those in a standard MUV model

Corresponding formulae apply for implementations replacing **Γ** by a rotated matrix **Γ*** such as **L** and non-equivalent, reduced rank models. Similarly, if estimates of genetic effects for principal components are of interest but a rotated form of **Γ** has been used for ease of computation, these are readily obtained by applying a 'backwards' rotation.

### Example

For the MUV model, each diagonal block for animals has one element contributed from the data, while each element in the NRM inverse contributes 16 coefficients, resulting in dense diagonal blocks for all animals and a substantial number of off-diagonal elements. This gives 806 non-zero off-diagonal elements or 12.54% filled elements in one triangle (diagonal + off-diagonal) of the symmetric coefficient matrix. The pattern changes substantially when switching to the equivalent PC model fitting all four factors. With factors uncorrelated, each element of the NRM inverse contributes only 4 elements. However, the trade-off is that the design matrix for animal effects is denser, so that there are more contributions from the data part of the MME, i.e. **Z*'R**^{-1}**Z***. For an implementation with all elements of **Γ**, the matrix of factor loadings, non-zero this would contribute a dense diagonal block for each animal. However, rotating **Γ** so that elements above the diagonal are zero, this applies only to animals with records in country 4, while the dense blocks for animals with records in other countries are smaller. This is the scenario depicted in part (b) of Figure 1, with 330 non-zero off-diagonal elements in one triangle of the coefficient matrix and a proportion of fill of 6.46%.

Fitting a XFA model, the MME are augmented by the equations for common factors (shown in part (c) of Figure 1 as the part of the equations with a light gray background, again with separation lines between sires and progeny), but sparser yet again. With a single record per individual, there are contributions from the data to only one diagonal element for specific factors, and corresponding off-diagonal elements linking this effect to the corresponding common factors. For this parameterisation, there are 246 non-zero off-diagonal elements and the corresponding fill proportion is 4.20%.

## Multi-trait, multi-environment models

In a more general scenario, we may have multiple traits recorded in each environment. We could then apply the FA decomposition to the complete, multi-trait and multi-environment genetic covariance matrix. This may be necessary if the traits recorded in different locations are quite diverse (but still similar enough to warrant some FA modelling). In other cases, the same traits are of interest in all locations and their covariance matrices may be sufficiently similar across environments that we can utilise the resulting pattern in modelling the joint matrix more parsimoniously.

Most studies on simultaneous modelling of several covariance matrices consider the case of independent groups. Let **Σ**_{ ii }denote the covariance matrix for the *i*-th group. Simple models suggested include proportionality of matrices, i.e. **Σ**_{ ii }= *f*_{ i }**Σ**_{11} (for *i* > 1) with *f*_{ i }the scale factor for group *i*, and the same correlation structure but different variances in different groups, i.e. **Σ**_{ ii }= **S**_{ i }**RS**_{ i }with **S**_{ i }the diagonal matrix of standard deviations for the *i*-th group and **R** the common correlation matrix [36]. Other approaches are based on the spectral decomposition of the matrices. Flury [37] proposed to model similar covariance matrices through common eigenvectors and specific eigenvalues, i.e. **Σ**_{ ii }= **EΛ**_{ i }**E'** with **Λ**_{ i }the matrix of eigenvalues for the *i*-th group and **E** the matrix of common eigenvectors. Later generalisations allowed for partial communality, common subspaces or partial sphericity [38, 39] and dependent random vectors [40]. The 'common principal component' approach and resulting hierarchy of models have seen considerable use in the comparison of covariance matrices in evolutionary biology; see Houle *et al*. [41] for a discussion. Pourahmadi *et al*. [42] described a corresponding framework based on the Cholesky decomposition.

Considering traits measured at different stages of development, Klingenberg *et al*. [43] modelled all submatrices of a patterned covariance matrix through common principal components, and emphasized not only that, with rearrangement, this resulted in a block-diagonal covariance matrix of the principal components, but also that further structure (such as reduced rank) could be imposed on this matrix. For *t* traits measured in each of *q* locations, we have a genetic covariance matrix **Σ** with *qt*(*qt* + 1)/2 distinct elements. A FA structure could be imposed to this matrix as a whole, as described above. For *m* factors, this would involve *m*(2*qt - m* + 1)/2 + *qt* parameters. Assume in the following that traits are ordered within locations, so that **Σ** has *q*^{2} submatrices **Σ**_{ ij }of size *t × t* which give the covariances among the *t* traits measured in locations *i* and *j*. It is then conceivable that the covariance pattern among traits across locations is sufficiently similar so that **Σ**_{ ij }= **M**_{ i }**D**_{ ij }**M**_{ j }' with **M**_{ i }the unitary, lower triangular matrix arising from the generalised Cholesky decomposition of Σ_{ ii }(Σ_{ ii }= **M**_{ i }**D**_{ ii }${{M}^{\prime}}_{i}$ with all diagonal elements of **M**_{ i }equal to unity) and **D**_{ ij }= Diag {${\delta}_{k}^{ij}$}. This implies that pre- and post-multiplication of **Σ** with the inverse of **M** = Diag {**M**_{ i }} and its transpose simultaneously diagonalises all *q*^{2} submatrices **Σ**_{ ij }.

Let **D** = {**D**_{ ij }}, i.e. **Σ** = **MDM'**. **D** is ordered according to traits within environments. It is readily seen that by rearranging the rows and columns of **D** according to environments within traits, we obtain a matrix **D*** which is block-diagonal with *t* blocks ${D}_{k}^{\ast}=\left\{{\delta}_{k}^{ij}\right\}$, of size *q × q*. We can then impose a FA structure on each block in the same way as for the single trait case. Assume ${D}_{k}^{\ast}={L}_{k}^{\ast}{L}_{k}^{{\ast}^{\prime}}+{\Psi}_{k}^{\ast}$, with ${L}_{k}^{\ast}$ the matrix consisting of the first *m*_{ k }columns of the Cholesky factor of ${D}_{k}^{\ast}$. If we fit a full rank PC model for all ${D}_{k}^{\ast}$, i.e *m*_{ k }= *q* and ${\Psi}_{k}^{\ast}$ = **0** (*k* = 1, *t*), and assume all matrices **M**_{ i }are different, **Σ** is described by *p* = *tq*(*t* + *q* + 2)/2 parameters. If less factors are considered or matrices **M**_{ i }have some common elements, this is reduced further. For instance, matrices **M**_{ i }may be the same for some environments, or matrices ${D}_{k}^{\ast}$ may be proportional to each other.

In certain cases, **Σ** is 'separable', i.e. we are able to decompose **Σ** into the direct product of a *t × t* matrix **Σ**_{ T }, which summarises the covariances between traits, and a *q × q* matrix **Σ**_{ Q }which gives the pattern of correlations between locations and accounts for differences in variability, **Σ** = **Σ**_{ Q }⊗ **Σ**_{ T }. If a FA structure for **Σ**_{ Q }is appropriate, this becomes **Σ** = **Γ**_{ Q }**Γ**'_{ Q }⊗ **Σ**_{ T }+ **Ψ**_{ Q }⊗ **Σ**_{ T }, reducing the number of parameters to describe **Σ** to *p* = (*t*(*t* + 1) + *m*(2*q - m* + 1))/2 + *q*, or *p* = (*t*(*t* + 1) + *m*(2*q - m* + 1))/2 if **Ψ**_{ Q }= **0**. Smith *et al*. [11] considered such structure in variance component estimation for sugar cane data. Again, there is further scope to reduce the number of parameters if **Σ**_{ T }can be structured as well.

Clearly, being able to impose some common structure on the submatrices of **Σ** can yield a very parsimonious description of the dispersion structure for multi-trait, multi-environment problems, and this is important for variance component estimation. In terms of solving the MME in genetic evaluation, however, differences depend on the solution scheme employed. Say we are considering a FA model using the Cholesky transform, applied to the unstructured *qt × qt* matrix **Σ**, and assume that we are fitting a full rank PC model with *m* = *qt*. We would then have an equivalent linear model (see (9) with **Z*** = **Z** (**I** ⊗ **Q**) and **Q** the Cholesky factor of **Σ**. **Q** is a dense, lower triangular matrix. Hence contributions to the diagonal block of **Z*'R**^{-1}**Z*** for an animal with records in country *j* would consist of a dense block comprising rows and columns 1 to *jt*. This would be the same if the structure considered above were applicable. However, **Q** would not be dense, but each *t × t* submatrix in the lower triangle would also be a lower triangular matrix. For a solution scheme setting up the MME once and holding them in core, for instance, there would be relatively little advantage of having **Q** with such structure, but for an 'iteration on data' scheme, computational advantages could be substantial.

## Estimation and model selection

Emphasis in this review has been on modelling and prediction, assuming that the genetic covariance matrix has a FA structure. Closely related are the prerequisite tasks of estimation and model selection, i.e. determining how many factors are required. There is substantial body of literature dealing with these topics, and this section is thus restricted to selected pertinent comments.

Most analyses of covariance structures have involved a two-step procedure, first estimating a complete, unstructured covariance matrix and then examining its factors. More recently, direct estimation enforcing a FA structure has been proposed and suitable algorithms for both restricted maximum likelihood (REML) [5, 6, 44, 45] and Bayesian estimation [46] have been described, and mixed model software packages available, such as ASReml [47] or WOMBAT [48], readily accommodate such analyses.

The underlying concept is that only the most important principal components or common factors need to be estimated, while those explaining little variation can be ignored with negligible loss of information. This reduces the number of parameters to be estimated and thus sampling errors. Provided any bias due to the factors that are ignored is relatively small, this is also expected to reduce mean square errors [6].

Furthermore, eliminating unnecessary parameters is likely to make estimation more stable and efficient. For instance, omitting factors with corresponding eigenvalues close to zero reduces problems associated with estimates at the boundary of the parameter space, and can thus improve convergence rates in iterative estimation schemes.

While highly appealing, recent work has identified some unexpected bias in REML estimates of the leading factors in PC models when too few factors are fitted [49]. Briefly, estimation can 'pick up' a wrong subset of factors. Say we fit *m* factors. We would then expect our estimates to reflect the first *m* principal components and any bias in the estimate of **Σ** to be due solely due to factors *m* + 1 to *q* ignored. However, under certain conditions, one (or more) of the *m* estimated components can represent one (or some) of the lower ranking factors (with smaller eigenvalues) instead. If this is the case, an analysis fitting *m* + 1 factors typically yields an estimate of the *m*-th eigenvalue which is larger than that from the analysis fitting *m* factors, and the trace of the estimated covariance matrix is increased by more than the value of the additional (*m* + 1-th) eigenvalue estimated. Another indicator is a large angle between the estimates of the *m*-th eigenvector from the two analyses (the dot product of two normalised vectors gives the cosine of the angle between them): if one of the analyses picked up the wrong direction, this is expected to be orthogonal to the true direction, i.e. we expect it to be close to 90°; see Meyer and Kirkpatrick [49] for details. This inconsistency in estimators implies that we need to choose *m* sufficiently large so that all important factors are included, to ensure that we estimate the leading factors correctly. Paradoxically, this can necessitate the inclusion of some factors with negligible eigenvalues. These can omitted subsequently when using the estimated covariance matrix in a genetic evaluation scheme, i.e. the optimal number of factor to be fitted for estimation and prediction is not necessarily the same. The latter could be determined, for instance, based on selection index calculations and the impact of omitting factors with small eigenvalues on the expected accuracy of evaluation [50].

A number of test criteria to determine the rank of a matrix are available in the literature. Simulation studies examining their utility, however, generally have yielded not very consistent results, both between different tests and in the ability to find the correct dimension (see [49] for references). With mixed model based estimation, model selection based on the log likelihood, information criteria or Bayes factors are an obvious choice. Likelihood ratio tests (LRT) allowing for the fact that testing an eigenvalue for being different from zero involves a one-sided test at the boundary of the parameter space have been described [51, 52]. Amemiya and Anderson [53] examined likelihood based goodness-of-fit tests for FA models. Akaike [54] showed that his information criterion (AIC), derived in the context of regression models, was also suitable for FA model selection. However, limited simulation studies in a genetic context have found rank selection based on LRT or AIC to be only moderately successful, with substantial underestimates of the true rank for smaller samples for some constellations of population parameters [49, 55]. Future work is needed to examine reliability of model selection for FA models and in more detail.

## Discussion

Mixed model analyses fitting FA models are likely to see increasing use in the future, as higher dimensional analyses considering more than a few traits are becoming more common. This is due to the parsimonious description of covariance structures available, the scope for direct interpretation of factors as well as computational advantages. FA models are most advantageous if all covariances between traits can be attributed to a small number of factors.

Focus in this review has been on modelling of the genetic covariance matrix. Corresponding structures may be applicable for covariance matrices due to other random effects. For scenarios where each individual has records in a single environment only, the residual covariance matrix (**R**) is (block-)diagonal. If there are non-zero residual covariances, we may wish to impose a structure on **R** as well. Simultaneous modelling of several matrices, however, should be carried out judiciously, in particular for variance component estimation: Imposing a structure on the genetic covariance matrix can lead to partitioning of some genetic covariances into the residual part. If the structure imposed on the latter then is too restrictive, problematic estimates for the former may result; see [56] for a cautionary example.

In the context of G × E interactions, separation of genetic effects into common and specific factors is highly appealing, as these factors have an interpretation in their own right. As reviewed above, such models – either ANOVA based or, more recently, employing mixed model methodology – have long been used in the analysis of data from plant breeding trials, and are directly applicable to corresponding problems in animal breeding. For international genetic evaluation, predicted values for common genetic effects of an animal, for instance, could provide global breeding values for that individual. Furthermore, inspection of predictions for the corresponding specific effects could directly reveal its sensitivity to environmental conditions: Similar values for all locations may indicate a good 'all-rounder' while values which are highly variable or are of opposite signs may suggest strong G × E interaction effects.

There has been long standing interest in the use of transformations or reparameterisations of various forms to ease the computational burden imposed by large scale genetic evaluation or variance component estimation problems. Earlier, transformations were mostly applied directly to the data, which limited their applicability. In particular, the so-called canonical transformation was found to be extremely useful for multivariate analyses, as it allowed multivariate analyses to be broken into a series of corresponding, univariate analyses. However, this required equal design matrices for all traits and did not allow for additional random effects; see, for instance, Jensen and Mao [57] for a review. Hence, sophisticated schemes have been developed to augment the data and to extend the range of applications [58, 59]. In contrast, FA models involve a reparameterisation of the model, i.e. 'transformations' are applied at the effects level. Thus different design matrices, missing observations or multiple random effects are not an issue. However, the same underlying principles are utilised: computing requirements are reduced by transforming previously correlated effects to be independent and increasing the sparsity of the corresponding MME. Clearly, applicability of FA models depends on the covariance structure among traits or locations being adequately represented by such models. Few studies in animal breeding have addressed this question. Considering genetic correlations for dairy production in 18 countries, Leclerc *et al*. [19] recommended a FA model with 5 common factors, with an average, absolute deviation in genetic correlations from the unstructured case of 0.014. FA models are often been advocated for their parsimony: for problems of relatively high dimensions, reduced sampling variances due to a greatly reduced number of parameters can easily outweigh small biases due to enforcing such structure but, as emphasized above, we need to ensure that the set of factors fitted includes all important factors.

## Conclusion

Factor analytic models, which separate genetic effects into common and specific components, provide a natural framework for modelling G × E interaction and related problems. Moreover, they can substantially reduce computational requirements of mixed model analyses compared to standard multivariate models, both in variance component estimation and genetic evaluation schemes.

## Notes

## Supplementary material

### References

- 1.Freeman GH: Statistical methods for the analysis of genotype-environment interactions. Heredity. 1973, 31 (3): 339-354. 10.1038/hdy.1973.90.CrossRefPubMedGoogle Scholar
- 2.Cameron ND: Methodologies for estimation of genotype with environment interaction. Livest Prod Sci. 1993, 35 (3–4): 237-249. 10.1016/0301-6226(93)90095-Y.CrossRefGoogle Scholar
- 3.James JW: Genotype by environment interaction in farm animals. Adaptation and fitness in animal populations – Evolutionary and breeding perspectives on genetic resource management. Edited by: van der Werf JHJ, Graser HU, Frankham R, Gondro C. 2009, Springer Verlag, 151-167.CrossRefGoogle Scholar
- 4.Falconer DS: The problem of environment and selection. Am Nat. 1952, 86: 293-298. 10.1086/281736.CrossRefGoogle Scholar
- 5.Thompson R, Cullis BR, Smith AB, Gilmour AR: A sparse implementation of the Average Information algorithm for factor analytic and reduced rank variance models. Austr New Zeal J Stat. 2003, 45: 445-459. 10.1111/1467-842X.00297.CrossRefGoogle Scholar
- 6.Kirkpatrick M, Meyer K: Direct estimation of genetic principal components: Simplified analysis of complex phenotypes. Genetics. 2004, 168: 2295-2306. 10.1534/genetics.104.029181.PubMedCentralCrossRefPubMedGoogle Scholar
- 7.Piepho HP: Empirical best linear unbiased prediction in cultivar trials using factor-analytic variance-covariance structures. Theor Appl Genet. 1998, 97: 105-201. 10.1007/s001220050885.CrossRefGoogle Scholar
- 8.Smith AB, Cullis BR, Thompson R: Analysing variety by environment data using multiplicative mixed models and adjustments for spatial field trends. Biometrics. 2001, 57: 1138-1147. 10.1111/j.0006-341X.2001.01138.x.CrossRefPubMedGoogle Scholar
- 9.Costa e Silva J, Potts BM, Dutkowski GW: Genotype by environment interaction for growth of Eucalyptus globulus in Australia. Tree Genetics & Genomes. 2006, 2: 61-75. 10.1007/s11295-005-0025-x.CrossRefGoogle Scholar
- 10.Kelly AM, Smith AB, Eccleston JA, Cullis BR: The accuracy of varietal selection using factor analytic models for multi-environment plant breeding trials. Crop Sci. 2007, 47 (3): 1063-1070. 10.2135/cropsci2006.08.0540.CrossRefGoogle Scholar
- 11.Smith AB, Stringer JK, Wei X, Cullis BR: Varietal selection for perennial crops where data relate to multiple harvests from a series of field trials. Euphytica. 2007, 157 (1–2): 253-266. 10.1007/s10681-007-9418-2.CrossRefGoogle Scholar
- 12.Finlay KW, Wilkinson GN: The analysis of adaptation in a plant breeding programme. Austr J Agric Res. 1963, 14 (6): 742-754. 10.1071/AR9630742.CrossRefGoogle Scholar
- 13.Smith AB, Cullis BR, Thompson R: The analysis of crop cultivar breeding and evaluation trials: an overview of current mixed model approaches. J Agric Sci. 2005, 143: 449-462. 10.1017/S0021859605005587.CrossRefGoogle Scholar
- 14.Piepho HP, Möhring J, Melchinger AE, Büchse A: BLUP for phenotypic selection in plant breeding and variety testing. Euphytica. 2008, 161 (1–2): 209-228. 10.1007/s10681-007-9449-8.CrossRefGoogle Scholar
- 15.Schaeffer LR: Multiple-country comparison of dairy sires. J Dairy Sci. 1994, 77 (9): 2671-2678.CrossRefPubMedGoogle Scholar
- 16.Mäntysaari EA: Multiple-trait across-country evaluations using singular (co) variance matrix and random regression model. Interbull Bull. 2004, 32: 70-74.Google Scholar
- 17.Tarres J, Liu Z, Ducrocq V, Reinhardt F, Reents R: Data transformation for rank reduction in multi-trait MACE model for international bull comparison. Genet Select Evol. 2008, 40 (3): 295-308. 10.1051/gse:2008004.Google Scholar
- 18.Tyrisevä AM, Lidauer M, Ducrocq V, Back P, Fikse WF, Mäntysaari EA: Principal Component Approach in describing the across country genetic correlations. Interbull Bull. 2008, 38: 142-145.Google Scholar
- 19.Leclerc H, Fikse WF, Ducrocq V: Principal components and factorial approaches for estimating genetic correlations in international sire evaluation. J Dairy Sci. 2005, 88 (9): 3306-3315.CrossRefPubMedGoogle Scholar
- 20.Schneider MdP, Fikse WF: Principal Components Analysis for Conformation Traits in International Sire Evaluations. Interbull Bull. 2007, 37: 107-110.Google Scholar
- 21.Martin NG, Eaves LJ: The genetical analysis of covariance structure. Heredity. 1977, 38: 79-95. 10.1038/hdy.1977.9.CrossRefPubMedGoogle Scholar
- 22.Tukey JW: One degree of freedom for non-additivity. Biometrics. 1949, 5 (3): 232-242. 10.2307/3001938.CrossRefGoogle Scholar
- 23.van Eeuwijk FA: Linear and bilinear models for the analysis of multi-environment trials: I. An inventory of models. Euphytica. 1995, 84: 1-7. 10.1007/BF01677551.CrossRefGoogle Scholar
- 24.van Eeuwijk FA, Denis JB, Kang MS: Incorporating additional information on genotypes and environments in models for two-way genotype by environment tables. Genotype-by-Environment Interaction. Edited by: Kang MS, Gauch HG. 1996, Boca Raton: CRC Press, 15-50.CrossRefGoogle Scholar
- 25.Gollob HF: A statistical model which combines features of factor analytic and analysis of variance techniques. Psychometrika. 1968, 33: 73-115. 10.1007/BF02289676.CrossRefPubMedGoogle Scholar
- 26.Mandel J: A new analysis of variance model for non-additive data. Technometrics. 1971, 13: 1-18. 10.2307/1267072.CrossRefGoogle Scholar
- 27.Gabriel KR: Least Squares Approximation of Matrices by Additive and Multiplicative Models. J Roy Stat Soc B. 1978, 40 (2): 186-196.Google Scholar
- 28.Snee RD: Nonadditivity in a Two-Way Classification: Is It Interaction or Nonhomogeneous Variance?. J Amer Stat Ass. 1982, 77 (379): 515-519. 10.2307/2287704.CrossRefGoogle Scholar
- 29.Gauch H: Model selection and validation for yield trials with interaction. Biometrics. 1988, 44 (3): 705-715. 10.2307/2531585.CrossRefGoogle Scholar
- 30.Zobel RW, Wright MJ, Gauch HG: Statistical analysis of a yield trial. Agronomy J. 1988, 80 (3): 388-393.CrossRefGoogle Scholar
- 31.Denis J, Gower J: Biadditive models. Biometrics. 1994, International Biometric Society, 50 (1): 310-311. [http://www.jstor.org/stable/2533227]
- 32.Seyedsadr M, Cornelius P: Shifted multiplicative models for nonadditive two-way tables. Comm Stat -Simul Comp. 1992, 21 (3): 807-832. 10.1080/03610919208813051.CrossRefGoogle Scholar
- 33.Piepho HP: Analyzing genotype-environment data by mixed models with multiplicative terms. Biometrics. 1997, 53: 761-766. 10.2307/2533976.CrossRefGoogle Scholar
- 34.Jennrich RI, Schluchter MD: Unbalanced repeated-measures models with structured covariance matrices. Biometrics. 1986, 42: 805-820. 10.2307/2530695.CrossRefPubMedGoogle Scholar
- 35.Harville DA: Matrix Algebra from a Statistician's Perspective. 1997, New York: Springer VerlagCrossRefGoogle Scholar
- 36.Manly BF, Rayner JCW: The comparison of sample covariance matrices using likelihood ratio tests. Biometrika. 1987, 74: 841-847. 10.1093/biomet/74.4.841.CrossRefGoogle Scholar
- 37.Flury BN: Common principal components in K groups. J Amer Stat Ass. 1984, 79: 892-898. 10.2307/2288721.Google Scholar
- 38.Flury BK: Two generalizations of the common principal component model. Biometrika. 1987, 74: 59-69. 10.1093/biomet/74.1.59.CrossRefGoogle Scholar
- 39.Boik RJ: Spectral models for covariance matrices. Biometrika. 2002, 89: 159-182. 10.1093/biomet/89.1.159.CrossRefGoogle Scholar
- 40.Neuenschwander BE, Flury BD: Common principal components for dependent random vectors. J Multiv Anal. 2000, 75: 163-183. 10.1006/jmva.2000.1908.CrossRefGoogle Scholar
- 41.Houle D, Mezey J, Galpern P: Interpretation of the results of common principal components analyses. Evolution. 2002, 56 (3): 433-440.CrossRefPubMedGoogle Scholar
- 42.Pourahmadi M, Daniels MJ, Park T: Simultaneous modelling of the Cholesky decomposition of several covariance matrices. J Multiv Anal. 2007, 98 (3): 569-587. 10.1016/j.jmva.2005.11.002.CrossRefGoogle Scholar
- 43.Klingenberg CP, Neuenschwander BE, Flury BD: Ontogeny and individual variation: Analysis of patterned covariance matrices with common principal components. Syst Biol. 1996, 45: 135-150. 10.2307/2413611.CrossRefGoogle Scholar
- 44.Meyer K, Kirkpatrick M: Restricted maximum likelihood estimation of genetic principal components and smoothed covariance matrices. Genet Select Evol. 2005, 37: 1-30. 10.1051/gse:2004034.CrossRefGoogle Scholar
- 45.Meyer K: Parameter expansion for estimation of reduced rank covariance matrices. Genet Select Evol. 2008, 40: 3-24. 10.1051/gse:2007032.Google Scholar
- 46.Los Campos G, Gianola D: Factor analysis models for structuring covariance matrices of additive genetic effects: a Bayesian implementation. Genet Select Evol. 2007, 39 (5): 481-494. 10.1051/gse:20070016.CrossRefGoogle Scholar
- 47.Gilmour A, Gogel B, Cullis BR, Thompson R: ASReml User Guide Release 2.0. 2006, Hemel Hempstead, HP1 1ES, U.K.: VSN International LtdGoogle Scholar
- 48.Meyer K: WOMBAT: a tool for mixed model analyses in quantitative genetics by restricted maximum likelihood (REML). J Zhejiang Univ Sci B. 2007, 8 (11): 815-821. 10.1631/jzus.2007.B0815.PubMedCentralCrossRefPubMedGoogle Scholar
- 49.Meyer K, Kirkpatrick M: Perils of parsimony: Properties of reduced rank estimates of genetic covariances. Genetics. 2008, 180 (2): 1153-1166. 10.1534/genetics.108.090159.PubMedCentralCrossRefPubMedGoogle Scholar
- 50.Meyer K: Multivariate analyses of carcass traits for Angus cattle fitting reduced rank and factor-analytic models. J Anim Breed Genet. 2007, 124: 50-64. 10.1111/j.1439-0388.2007.00637.x.CrossRefPubMedGoogle Scholar
- 51.Amemiya Y, Anderson TW, Lewis PAW: Percentage points for a test of rank in multivariate components of variance. Biometrika. 1990, 77 (3): 637-641. 10.1093/biomet/77.3.637.CrossRefGoogle Scholar
- 52.Kuriki S: One-Sided Test for the Equality of Two Covariance Matrices. Ann Stat. 1993, 21 (3): 1379-1384. 10.1214/aos/1176349263.CrossRefGoogle Scholar
- 53.Amemiya Y, Anderson TW: Asymptotic chi-square tests for a large class of factor analysis models. Annals of Statistics. 1990, 18 (3): 1453-1463. 10.1214/aos/1176347760.CrossRefGoogle Scholar
- 54.Akaike H: Factor analysis and AIC. Psychometrika. 1987, 52: 317-332. 10.1007/BF02294359.CrossRefGoogle Scholar
- 55.Hine E, Blows MW: Determining the effective dimensionality of the genetic variance-covariance matrix. Genetics. 2006, 173 (2): 1135-1144. 10.1534/genetics.105.054627.PubMedCentralCrossRefPubMedGoogle Scholar
- 56.Jaffrézic F, White IMS, Thompson R, Visscher PM: Contrasting models for lactation curve analysis. J Dairy Sci. 2002, 85 (4): 968-975.CrossRefPubMedGoogle Scholar
- 57.Jensen J, Mao IL: Transformation algorithms in analysis of single trait and multitrait models with equal design matrices and one random factor per trait : a review. J Anim Sci. 1988, 66: 2750-2761. [http://jas.fass.org/cgi/content/abstract/66/11/2750]Google Scholar
- 58.Ducrocq V, Besbes B: Solution of multiple trait animal models with missing data on some traits. J Anim Breed Genet. 1993, 110: 81-92.CrossRefPubMedGoogle Scholar
- 59.Ducrocq V, Chapuis H: Generalising the use of the canonical transformation for the solution of multivariate mixed model equations. Genet Select Evol. 1997, 29: 205-224. 10.1051/gse:19970207.CrossRefGoogle Scholar

## Copyright information

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.