Method of moment matching to obtain an initial guess of the MLE, as in Minka (2000).
initMoM(D)
D
matrix (JxK) of counts; each row is a sample from a MN distribution with K categories
Hessian