This vignette details the different covariance structures available in clustTMB.
| Covariance | Notation | No..of.Parameters | Data.requirements |
|---|---|---|---|
| Spatial GMRF | gmrf | 2 | spatial coordinates |
| AR(1) | ar1 | 2 | unit spaced levels |
| Rank Reduction | rr(random = H) | JH - (H(H-1))/2 | |
| Spatial Rank Reduction | rr(spatial = H) | 1 + JH - (H(H-1))/2 | spatial coordinates |
clustTMB fits spatial random effects using a Gaussian Markov Random Field (GMRF). The precision matrix, \(Q\), of the GMRF is the inverse of a Matern covariance function and takes two parameters: 1) \(\kappa\), which is the spatial decay parameter and a scaled function of the spatial range, \(\phi = \sqrt{8}/\kappa\), the distance at which two locations are considered independent; and 2) \(\tau\), which is a function of \(\kappa\) and the marginal spatial variance \(\sigma^{2}\):
\[\tau = \frac{1}{2\sqrt{\pi}\kappa\sigma}.\] The precision matrix is approximated following the SPDE-FEM approach [@Lindgren2011], where a constrained Delaunay triangulation network is used to discretize the spatial extent in order to determine a GMRF for a set of irregularly spaced locations, i$.
\[\omega_{i} \sim GMRF(Q[\kappa, \tau])\]
Prior to fitting a spatial cluster model with clustTMB, users need to set up the constrained Delaunay Triangulation network using the R package, fmesher. This package provides a CRAN distributed collection of mesh functions developed for the package, R-INLA. For guidance on setting up an appropriate mesh, see Triangulation details and examples and Tools for mesh assessment from
In this example, the following mesh specifications were used:
loc <- meuse[, 1:2]
Bnd <- fmesher::fm_nonconvex_hull(as.matrix(loc), convex = 200)
meuse.mesh <- fmesher::fm_mesh_2d(as.matrix(loc),
max.edge = c(300, 1000),
boundary = Bnd
)Coordinates are converted to a spatial point dataframe and read into the clustTMB model, along with the mesh, using the spatial.list argument. The gating formula is specified using the gmrf() command:
Loc <- sf::st_as_sf(loc, coords = c("x", "y"))
mod <- clustTMB(
response = meuse[, 3:6],
family = lognormal(link = "identity"),
gatingformula = ~ gmrf(0 + 1 | loc),
G = 4, covariance.structure = "VVV",
spatial.list = list(loc = Loc, mesh = meuse.mesh)
)## Warning: the 'nobars' function has moved to the reformulas package. Please update your imports, or ask an upstream package maintainer to do so.
## This warning is displayed once per session.
## intercept removed from gatingformula
## when random effects specified
## spatial projection is turned off. Need to provide locations in projection.list$grid.df for spatial predictions
Models are optimized with nlminb(), model results can be viewed with nlminb commands:
## betag betag betag betad betad betad betad
## 0.1684272 0.5589378 0.1889232 2.0157791 4.3160890 5.4259833 6.7095849
## betad betad betad betad betad betad betad
## 1.0164096 3.6119051 5.2215837 6.2274886 0.1353933 3.1482191 4.2137136
## betad betad betad betad betad theta theta
## 5.2614042 -1.4361507 3.1132988 4.2118618 5.1996602 -1.2100791 -2.9055432
## theta theta theta theta theta theta theta
## -1.2794726 -1.2502149 -2.5718253 -3.1310740 -2.2406065 -2.3780280 -1.8212753
## theta theta theta theta theta theta theta
## -4.0603289 -2.6424671 -3.0432333 -2.4648456 -3.3381134 -2.7804359 -2.6686126
## ln_kappag
## -5.9298671
## [1] 2318.831
When random effects, \(\mathbb{u}\), are specified in the gating network, the probability of cluster membership \(\pi_{i,g}\) for observation \(i\) is fit using multinomial regression:
\[ \begin{align} \mathbb{\eta}_{,g} &= X\mathbb{\beta}_{,g} + \mathbb{u}_{,g} \\ \mathbb{\pi}_{,g} &= \frac{ exp(\mathbb{\eta}_{,g})}{\sum^{G}_{g=1}exp(\mathbb{\eta}_{,g})} \end{align} \]