next up previous
Next: Missing Data in Point Up: Missing Data and Edge Previous: Monte Carlo Likelihood with

Comparison of MCL and MCEM

The gradient of (1.46) is

  multline1135

where the empirical expectation operator tex2html_wrap_inline4381 for the distribution with unnormalized density tex2html_wrap_inline2495 using samples having unnormalized density tex2html_wrap_inline4385 is defined by (1.34). MCL solves equation (1.47) to determine the MCMLE tex2html_wrap_inline3939 .

MCEM uses the same formula, but much less efficiently. It only uses the special case where in tex2html_wrap_inline4381 we set tex2html_wrap_inline4391 the current iterate of the MCEM algorithm. This simplifies the formulas by eliminating the importance weights. But the cost of this simplification is that (1.47) becomes

displaymath4393

and only the tex2html_wrap_inline2411 's inside the integrands are considered variables. The tex2html_wrap_inline2411 's subscripting the expectation operators should also be variable, but EM leaves them fixed. The result is that MCEM takes hundreds of iterations, each involving the collection of two Monte Carlo samples to do what MCL does with no iteration as long as tex2html_wrap_inline3965 is close enough to the exact MLE tex2html_wrap_inline3941 so that the importance weights behave. The advantages of MCL methods over MCEM are

The advantages of MCEM are weak But there are no real advantages of MCEM over MCL. The best that can be said for it is that it might be reasonable to do a few MCEM iterations to get close to MLE before switching to MCL. However, it must compete with stochastic approximation in that role, and it is not clear that MCEM has any advantages over stochastic approximation in providing crude estimates.


next up previous
Next: Missing Data in Point Up: Missing Data and Edge Previous: Monte Carlo Likelihood with

Charles Geyer
Fri Jul 5 15:26:21 CDT 1996