Statistics 8701 (Geyer, Spring 2003) Parametric Bootstrap

Contents

The Parametric Bootstrap

Fitting a Logistic Regression Model

The kyphosis data in the data file

http://www.stat.umn.edu/geyer/8701/parm/kyphosis.txt

has already been used in the Bayesian model selection example in the MCMC notes. Here we do a frequentist analysis. The R statements below do a logistic regression (with the default logit link function).

External Data Entry

Enter a dataset URL :

Bootstrapping It

The regression coefficient for the predictor Age, which is 0.010929 with a reported standard error of 0.006416, does not appear to be statistically significant (P = 0.08849).

But that standard error is derived from Fisher information and relies on the validity of the usual asymptotics of maximum likelihood. What if n isn't large enough? What should we really think?

Here's a parametric bootstrap calculation of this P-value.

External Data Entry

Enter a dataset URL :

Comments

More Bootstrapping

Because the Monte Carlo error was too big to tell any difference between the parametric bootstrap P-value and the asymptotic P-value, we do another bootstrap with a bigger bootstrap sample size.

Since it takes way too long to do in Rweb, the results are in the file

http://www.stat.umn.edu/geyer/8701/parm/parm.Rout

(the input file was parm.R).

The plot made by this run (and turned into GIF format by the same magic that Rweb uses) is shown below.

plot produced by
<code>qqnorm(z.star)</code> command in R run, looks good out to about
plus or minus 2 and then shows <code>z.star</code> is more light tailed
than normal

Comments