The kyphosis data in the data file has already been used in the Bayesian model selection example in the MCMC notes. Here we do a frequentist analysis. The R statements below do a logistic regression (with the default logit link function).
The regression coefficient for the predictor Age, which is 0.010929 with a reported standard error of 0.006416, does not appear to be statistically significant (P = 0.08849).
But that standard error is derived from Fisher information and relies on the validity of the usual asymptotics of maximum likelihood.
What if n isn't large enough? What should we really think?
Here's a parametric bootstrap calculation of this P-value.
p.hat <- predict(out.little, type = "response")
If we left off the type = "response" we would get linear predictor values rather than expected values, which is not what we want.
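The relation between the two kinds of predicted values can be seen in a small sketch. The data here are simulated stand-ins (the actual kyphosis data file is not reproduced), and it is assumed, as the name suggests, that out.little is the fit of the null model with Age left out.

```r
set.seed(42)

# simulated stand-in for the kyphosis data
n <- 81
Age <- sample(1:200, n, replace = TRUE)
Number <- sample(2:10, n, replace = TRUE)
Start <- sample(1:18, n, replace = TRUE)
present <- rbinom(n, 1, plogis(-2 + 0.01 * Age))

# null model (Age left out), as assumed above
out.little <- glm(present ~ Number + Start, family = binomial)

p.hat <- predict(out.little, type = "response")   # fitted probabilities
eta.hat <- predict(out.little)                    # linear predictor (logit scale)

# the two are related by the inverse logit link
all.equal(as.vector(p.hat), as.vector(plogis(eta.hat)))
```

The default type = "link" gives values of the linear predictor, which plogis maps back to the probability scale.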
The simulation itself is quite trivial:
present.star <- as.numeric(runif(n) < p.hat)
Another way to do the same thing would be
present.star <- rbinom(n, 1, p.hat)
since each component of present.star is Bernoulli with its own success probability p.hat[i].
Other probability models are harder, but the principle is the same. Simulate the null model with estimated values of the parameters (in the null model) plugged in.
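Putting the pieces together, the whole calculation can be sketched as follows. This is only a sketch: the data are simulated stand-ins for the kyphosis data, nboot is kept small, and the names (out.little, z.star, p.val) follow the notes, with out.little assumed to be the null-model fit with Age left out.

```r
set.seed(42)

# simulated stand-in for the kyphosis data
n <- 81
Age <- sample(1:200, n, replace = TRUE)
Number <- sample(2:10, n, replace = TRUE)
Start <- sample(1:18, n, replace = TRUE)
present <- rbinom(n, 1, plogis(-2 + 0.01 * Age))

# full model: gives the observed z statistic for Age
out.big <- glm(present ~ Age + Number + Start, family = binomial)
z.hat <- abs(summary(out.big)$coefficients["Age", "z value"])

# null model (Age left out): gives the probabilities we simulate from
out.little <- glm(present ~ Number + Start, family = binomial)
p.hat <- predict(out.little, type = "response")

nboot <- 99
z.star <- rep(NA_real_, nboot)
for (i in 1:nboot) {
    # simulate the null model with estimated parameters plugged in
    present.star <- as.numeric(runif(n) < p.hat)
    out.star <- glm(present.star ~ Age + Number + Start, family = binomial)
    z.star[i] <- summary(out.star)$coefficients["Age", "z value"]
}

# two-tailed bootstrap P-value
p.val <- (sum(abs(z.star) >= z.hat, na.rm = TRUE) +
          sum(is.na(z.star)) + 1) / (nboot + 1)
p.val
```

The "+ 1" in numerator and denominator counts the observed data as one more realization of the null distribution.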
Because the Monte Carlo error was too big to tell any difference between the parametric bootstrap P-value and the asymptotic P-value, we do another bootstrap with a bigger bootstrap sample size.
Since it takes way too long to do in Rweb, the results are in the file (the input file was parm.R).
The plot made by this run (and turned into GIF format by the same magic that Rweb uses) is shown below.
With the bigger bootstrap sample size, the glm function starts generating warnings about nonconvergence. This is not really the fault of the programmers of the glm function.
The whole GLM community has always been extremely lackadaisical about convergence issues and existence of maximum likelihood estimates. It is a flaw of the glm function that it does not return an error code. Just printing a warning but doing nothing that the user's code can respond to is extremely frustrating. Programmer hostile!
While I was ranting about this, Murali Haran told me about the try function, which does allow one to respond to errors.
So turn the warnings into errors. That's what
options(warn = 2)
does. And then catch the errors with the try function. That's what
out.star <- try(glm(present.star ~ Age + Number + Start, family = binomial))
if (! inherits(out.star, "try-error")) {
    coef.star <- summary(out.star)$coefficients
    z.star[i] <- coef.star["Age", "z value"]
}
does.
The try function just does nothing if there is no error (its result is the result of its argument). If there is an error, then the result will inherit from class "try-error" (and any result of the argument of the try function is lost). Thus we can do what we would ordinarily do if inherits(out.star, "try-error") is FALSE.
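Here is a minimal self-contained illustration of the options(warn = 2) plus try combination. The log(-1) call is just a stand-in for a glm fit that produces a nonconvergence warning.

```r
options(warn = 2)                        # promote warnings to errors
result <- try(log(-1), silent = TRUE)    # log(-1) normally only warns ("NaNs produced")
inherits(result, "try-error")            # TRUE: the warning is now catchable
options(warn = 0)                        # restore the default behavior
```

Without options(warn = 2), the same call would return NaN with a warning and inherits(result, "try-error") would be FALSE.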
Otherwise, we do nothing and z.star[i] is unchanged from its initial value, which is NA.

That leaves the question of what to do with the NA values.
Since these reflect programmer brain damage (and ultimately theoretician
brain damage), we can hardly count NA
values as evidence in
favor of the hypothesis we favor. Thus we should count them in favor of
the null hypothesis for a usual test and in favor of the alternative for
a goodness of fit test. Here we count them in favor of the null,
that is, we include them in the numerator of the bootstrap P-value
p.val <- (sum(abs(z.star) >= z.hat, na.rm = TRUE) + sum(is.na(z.star)) + 1) / (nboot + 1)
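As a toy illustration of the counting (with made-up numbers, not the actual bootstrap output): with three exceedances and two NA values among nine bootstrap statistics,

```r
z.hat <- 1.7                                               # pretend observed |z|
z.star <- c(2.1, -0.3, NA, 1.9, 0.4, NA, -2.5, 0.8, 1.2)   # made-up bootstrap z's
nboot <- length(z.star)
p.val <- (sum(abs(z.star) >= z.hat, na.rm = TRUE) +
          sum(is.na(z.star)) + 1) / (nboot + 1)
p.val   # (3 exceedances + 2 NAs + 1) / 10 = 0.6
```

the two NA fits are counted as if they had exceeded z.hat, which can only make the P-value larger (more conservative).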
The bootstrap quantiles go bad right around theoretical quantile plus or minus 2, which is the 0.025 level one-tailed or the 0.05 level two-tailed.
So we would need the parametric bootstrap for this calculation if the
results were more statistically significant!
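The kind of comparison behind that statement can be sketched as follows. Here z.star is a standard normal stand-in, not the actual bootstrap output, so only the mechanics of comparing bootstrap quantiles to theoretical normal quantiles are illustrated.

```r
set.seed(42)
z.star <- rnorm(1000)   # stand-in for the bootstrap z statistics
probs <- c(0.01, 0.025, 0.05, 0.5, 0.95, 0.975, 0.99)
tab <- cbind(theoretical = qnorm(probs),
             bootstrap = quantile(z.star, probs, na.rm = TRUE))
round(tab, 3)
```

With the real bootstrap output, the two columns would agree in the middle of the distribution and diverge in the tails, beyond plus or minus 2.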