There are default values for most of these codes, which are adequate for most simpler analyses, and analyses of small to moderately sized data sets. Hence these options are predominantly of interest to users which want to fit complicated models or analyse large data sets, and thus need to tweak some of the options for ordering of equations, parameterisations, maximisation strategies and convergence criteria to get the best possible performance from WOMBAT.
In contrast to the basic options above, several of these options can be followed by one
or more, optional numerical arguments. If such arguments are given, they must follow
immediately (no spaces), and multiple options need to be separated by comma(s).
Arguments must be given sequentially, i.e. if we want to set argument
, all
preceding arguments (
) are required as well. If no arguments
are given, the corresponding variables are set to their default values (see
A.1).
|
Table 5.2: ‘Advanced’ run options for WOMBAT
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
The order in which the equations in the mixed model are processed can have a dramatic impact on the computational requirements of REML analyses. Hence, WOMBAT attempts to reorder the equations, selecting a default ordering strategy based on the number of equations; see A.1 for details. This can often be improved upon by selecting the strategy used explicitly.
Option --mmd specifies ordering using the multiple minimum degree procedure.
Option --amd selects an approximate minimum degree ordering [1].
Option --metis selects an ordering using a multilevel nested dissection procedure. Up to four optional arguments modifying the behaviour of this subroutine can be specified.
and
. As a rule, the higher this number, the better the quality of
ordering tends to be. However, the time required for ordering increases
substantially with this number, and in some cases intermediate values
(between 3 and 9) have been found to work best. The manual for
METIS recommends values up to
. However, Meyer [17] found values
as high as
or
to be advantageous for large analyses. By default,
WOMBAT uses a value of
for analyses with more than
and
up to
equations, and a value of
for larger models.
means all rows with more than
15% (factor divided by 10) elements than average. The default used by
WOMBAT is 0.
for random,
for heavy and
for standard edge matching.
WOMBAT uses a default of
.
which, paradoxically (to
agree with the METIS convention), invokes two-sided refinement. This
is deemed to produce somewhat better orderings. An option of
here
selects one-sided refinement, which can be faster.In addition, the option --metisall is available. This causes WOMBAT to cycle
through the number of graph separators from
to
, considering density factors
of
and
, as well as random and standard edge matching (48 combinations).
Warning : This can be quite time-consuming !
To obtain the ordering, WOMBAT assigns a large matrix to store information on the non-zero elements in the mixed model matrix. The default size chosen is based on the number of equations and traits analysed, and is meant to be generous enough to be sufficient for a wide range of cases. In some instances, however, this can result in WOMBAT trying to allocate arrays which exceed the amount of RAM available or accessible (some of the 32-bit versions are restricted to 2GB) and thus failing to run while, in reality, much less space is required for the analysis concerned. Hence, the run time option --choozhz is supplied: This cases WOMBAT to pause, write out the default settings, and read in (from the terminal) an alternative value specified by the user.
WOMBAT can be told to use a particular algorithm to locate the maximum
of the likelihood function. In the following, let
(must be an INTEGER
number) denote the maximum number of iterates to be carried out by an
algorithm, and let
denote the convergence criterion to be used. Unless stated
otherwise,
represents the threshold for changes in log likelihood between
subsequent iterates, i.e. convergence is assumed to be reached if this is less than
.
Option --aireml specifies a ‘straight’ AI algorithm. It can be followed by values for
and
. Valid forms are --aireml, --aireml
and --aireml
but not
--aireml
.
EXAMPLE:
–-aireml30,0.001
limits the number of iterates carried out to 30, and stops
estimation when the change in log likelihood is less than
.
By default, the AI algorithms enforces an increase in log likelihood in each iterate. This can be switched off using the option --nostrict. This can improve convergence in some cases.
In this case, a modification of the AI matrix can result in better performance of the
AI algorithm. Four procedures have been implemented in WOMBAT. These are
selected by specifying --modaim
, with
selecting modification by
setting all eigenvalues to a minimum value (
), by adding a diagonal
matrix (
, the default), or through a modified (
) [25, 26] or
partial (
) [8] Cholesky decomposition of the AI matrix; see A.5 for
details.
Option --emalg selects estimation using a standard EM algorithm. As for
--aireml, this option can be followed by
or
to specify the maximum
number of iterates allowed and the convergence criterion.
Option --pxem then selects estimation using the PX-EM algorithm. As above, this
can be followed by
or
.
Option --pxai specifies a hybrid algorithm consisting of a few initial rounds of
PX-EM, followed by AI REML. This is the default for full rank estimation (unless
-c is specified without any explicit definition of the algorithm to be used). This
option can have up to three sequential arguments,
denoting the number of
PX-EM iterates, and
and
as above.
EXAMPLE: --pxem6,50,0.001
defines 6 PX-EM iterates, followed by up to 50 AI iterates and
a convergence criterion of
for the log likelihood.
Option --emai is like --pxai except that the initial iterates are carried out using the standard EM algorithm. This is the default for reduced rank estimation. Optional arguments are as for --pxai.
Option --cycle, followed by an optional argument
, is used in conjunction with
--pxai or --emai. If given, it will repeat the number of (PX-)EM and AI iterates
specified by these options
times. If
is omitted, the number of cycles is set to
. This option is useful for reduced rank analyses, where cycling algorithms
appears to be beneficial.
Finally, WOMBAT incorporates three choices for a derivative-free algorithm. While
little used, these can be usefully to check for convergence in ‘tough’ problems. Option
--simplex selects the simplex or polytope algorithm due to Nelder and
Mead [20], as previously implemented in DFREML. This option can have three
arguments,
and
as above (but with the convergence criterion
describing the maximum variance among the log likelihood values in the polytope
allowed at convergence), and
the initial step size. Option --powell invokes
maximisation using Powell [22]’s method of conjugate directions, again ported
from DFREML and with optional parameters
,
(as for --aireml) and
.
Not really a REML algorithm: Option --like1 selects a single likelihood evaluation
for the current starting values, or, if combined with -c, the currently best point as
contained in BestPoint. When combined with -v, the individual components of
are printed to the standard output (screen).
By default, WOMBAT reparameterises to the elements of the Cholesky factor of the covariance matrices to be estimated (except for full rank analyses using the PX-EM or EM algorithm, which estimate the covariance components directly).
As a rule, the Cholesky factorisation is carried out pivoting on the largest diagonal element. This can be switched off with the --noreord option.
Reparameterisation to the elements of the Cholesky factors removes constraints on the parameter space. Strictly speaking, however, it leaves the constraint that the diagonal elements should be non-negative. This can be removed by transforming the diagonal elements to logarithmic scale. This is selected using the option --logdia, which applies the transformation to all covariance matrices. Conversely, the option --nologd prevents WOMBAT from carrying out this transformation. If neither option is set (the default) and WOMBAT encounters small diagonal elements in any covariance matrices to be estimated during iterations, it will switch to taking logarithmic values of the diagonal elements for the respective matrices only.
By default, WOMBAT ‘prunes’ the pedigree information supplied, i.e. eliminates any uninformative individuals (without records and links to only one other individual), before calculating the inverse of the numerator relationship matrix. Exceptions are prediction runs (option --blup) and models which fit correlated additive genetic effects. In some instances, however, it may be desirable not to ‘prune’ pedigrees. Pruning can be switched off through the run time option --noprune.
For sire model analyses, the default is not to carry out any pruning or pedigree reduction. The option --redped is provided to switch on pedigree reduction for sire model analyses, i.e. elimination of all individuals without a link to an individual in the data from the pedigree file.
Generally, WOMBAT does not require any interactive input. An exception is a ‘pause’ after a warning message on a constellation of options which is new or can be problematic. These programmed pauses can be switched off using the run time option --batch.
By default, WOMBAT uses the algorithm of Tier [27] to compute inbreeding coefficients before setting up the inverse of the numerator relationship matrix between individuals. This algorithm is implemented with a hard-coded limit of 23 generations. An alternative algorithm, due to Quaas [24], which does not have this limitation but is a lot slower, can be selected using the run time option --quaas.