Issue with checkConsistency when coding a probabilistic PCA #24

balglave · 2024-04-10T15:33:35Z

I want to code a probabilistic PCA in RTMB and when I run the function checkconsistency it looks that I have huge biases.
Here is the code. If anyone can explain me what is happending, I'd be interested.

library(MASS)
library(RTMB)


n_ind <- 125 # number f individuals
n_var <- 5 # number of variables
Y <- mvrnorm(n_ind,mu = rep(0,n_var),Sigma = diag(1,n_var))

data <- list(Y = Y)

parameters <- list(b = rep(0,n_var)) # define intercepts
maps_param <- list()

parameters[["E"]] <- diag(1,nrow = n_var,ncol = n_var)
m2 <- diag(1:n_var,nrow = n_var,ncol = n_var) # make it a diagonal matrix
m2[which(m2==0)] <- NA
maps_param[["E"]] <- as.factor(m2)

parameters[["Lambda"]] <- matrix(runif(ncol(Y)*n_var,0,2),nrow = ncol(Y),ncol = n_var)
parameters[["Lambda"]][!lower.tri(parameters[["Lambda"]],diag = T)] <- 0 # make it triangular inferior

map_lambda <- matrix(1:(ncol(Y)*n_var),ncol(Y),n_var) # Constraints on loadings
lower.tri(map_lambda)
map_lambda[!lower.tri(map_lambda,diag = T)] <- NA
maps_param[["Lambda"]] <- as.factor(map_lambda)

parameters[["delta"]] <- matrix(rnorm(nrow(Y)*n_var,0,1),nrow = nrow(Y),ncol = n_var)
random_vec <- "delta"

f <- function(parms) {
  
  nll <- 0
  
  Y <- data$Y

  b <- parms$b
  L <- parms$L
  E <- parms$E
  Lambda <- parms$Lambda
  delta <- parms$delta

  mu <- matrix(0,nrow = nrow(Y),ncol = ncol(Y)) # Trend/intercept
  for(i in 1:nrow(mu)){
    
    nll <- nll - sum(RTMB::dmvnorm(x=delta[i,], mu=rep(0,n_var), diag(1,ncol(delta)), TRUE))
    mu[i,] <- b + Lambda %*% delta[i,] # PCA
    
  }
  
  nll = nll - sum(RTMB::dmvnorm(Y, mu, E, TRUE))
  
  REPORT(Lambda)
  
  return(nll)
  
}

obj <- MakeADFun(f, parameters,map = maps_param,random = random_vec)

opt = nlminb( start=obj$par, objective=obj$fn, gradient=obj$gr)
rep <- sdreport(obj)
report <- obj$report()

checkConsistency(obj)

The text was updated successfully, but these errors were encountered:

kaskr · 2024-04-11T09:37:11Z

To have all the automated stuff working (simulation, checking, etc) you should currently (to be relaxed) follow the introduction vignette fairly closely. I.e. re-code the data/parameter fetching by replacing

  Y <- data$Y
  b <- parms$b
  L <- parms$L
  E <- parms$E
  Lambda <- parms$Lambda
  delta <- parms$delta

by

  getAll(data, parms, warn=FALSE)
  Y <- OBS(Y)

By making this change it should almost work, except you hit an edge case here because RTMB simply redirects mvnorm simulation to MASS::mvrnorm which doesn't allow matrix mu ~~(to be fixed in RTMB)~~ (EDIT: The RTMB documentation states that mu must be a vector, so an error should be thrown):

nll = nll - sum(RTMB::dmvnorm(Y, mu, E, TRUE))

Simple fix is to be replace by

nll = nll - sum(RTMB::dmvnorm(Y - mu, 0, E, TRUE))

BTW, the consistency check should point you to the fact that your model is over-parameterized...

balglave · 2024-04-11T18:44:24Z

@kaskr It worked thanks !

Do you have any suggestions on how I could avoid over-parameterization?

kaskr added a commit that referenced this issue Apr 17, 2024

dmvnorm / dgmrf: mu must be a vector corresponding to Sigma / Q #24

c1cea00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Issue with checkConsistency when coding a probabilistic PCA #24

Issue with checkConsistency when coding a probabilistic PCA #24

balglave commented Apr 10, 2024 •

edited

Loading

kaskr commented Apr 11, 2024 •

edited

Loading

balglave commented Apr 11, 2024

Issue with checkConsistency when coding a probabilistic PCA #24

Issue with checkConsistency when coding a probabilistic PCA #24

Comments

balglave commented Apr 10, 2024 • edited Loading

kaskr commented Apr 11, 2024 • edited Loading

balglave commented Apr 11, 2024

balglave commented Apr 10, 2024 •

edited

Loading

kaskr commented Apr 11, 2024 •

edited

Loading