[1] Dempster A. P., Laird N. M., Rubin D. B.: 
Maximum likelihood from incomplete data via the EM algorithm. J. Roy. Statist. Soc. ser. B 39 (1977), 1–38 
MR 0501537 | 
Zbl 0364.62022 
[2] Grim J.: 
On numerical evaluation of maximum–likelihood estimates for finite mixtures of distributions. Kybernetika 18 (1982), 3, 173–190 
MR 0680154 | 
Zbl 0489.62028 
[3] Grim J.: Maximum likelihood design of layered neural networks. In: IEEE Proceedings of the 13th International Conference on Pattern Recognition, IEEE Press 1996, pp. 85–89
[4] Grim J.: Design of multilayer neural networks by information preserving transforms. In: Proc. 3rd Systems Science European Congress (E. Pessa, M. B. Penna and A. Montesanto, eds.), Edizzioni Kappa, Roma 1996, pp. 977–982
[5] Jacobs R. A., Jordan M. I., Nowlan S. J., Hinton G. E.: 
Adaptive mixtures of local experts. Neural Comp. 3 (1991), 79–87 
DOI 10.1162/neco.1991.3.1.79 
[6] Jordan M. I., Jacobs R. A.: 
Hierarchical mixtures of experts and the EM algorithm. Neural Comp. 6 (1994), 181–214 
DOI 10.1162/neco.1994.6.2.181 
[7] Chen, Ke, Xie, Dahong, Chi, Huisheng: 
A modified HME architecture for text–dependent speaker identification. IEEE Trans. Neural Networks 7 (1996), 1309–1313 
DOI 10.1109/72.536325 
[8] Ramamurti V., Ghosh J.: Structural adaptation in mixtures of experts. In: IEEE Proceedings of the 13th International Conference on Pattern Recognition, IEEE Press, 1996, pp. 704–708
[9] Titterington D. M., Smith A. F. M., Makov U. E.: 
Statistical Analysis of Finite Mixture Distributions. John Wiley & Sons, Chichester – Singapore – New York 1985 
MR 0838090 | 
Zbl 0646.62013 
[10] Vajda I.: 
Theory of Statistical Inference and Information. Kluwer, Boston 1992 
Zbl 0711.62002 
[12] Xu L., Jordan M. I.: 
On convergence properties of the EM algorithm for Gaussian mixtures. Neural Comp. 8 (1996), 129–151 
DOI 10.1162/neco.1996.8.1.129 
[13] Xu L., Jordan M. I., Hinton G. E.: A modified gating network for the mixtures of experts architecture. In: Proc. WCNN’94, San Diego 1994, Vol. 2, pp. 405–410