ABSTRACT
"Inference" problems arise in statistical physics, computer vision, error-correcting coding theory, and AI. We explain the principles behind the belief propagation (BP) algorithm, which is an efficient way to solve inference problems based on passing local messages. We develop a unified approach, with examples, notation, and graphical models borrowed from the relevant disciplines.We explain the close connection between the BP algorithm and the Bethe approximation of statistical physics. In particular, we show that BP can only converge to a fixed point that is also a stationary point of the Bethe approximation to the free energy. This result helps explaining the successes of the BP algorithm and enables connections to be made with variational approaches to approximate inference.The connection of BP with the Bethe approximation also suggests a way to construct new message-passing algorithms based on improvements to Bethe's approximation introduced Kikuchi and others. The new generalized belief propagation (GBP) algorithms are significantly more accurate than ordinary BP for some problems. We illustrate how to construct GBP algorithms with a detailed example.
- Aji, S. M., and McEliece, R. J. (2000). The generalized distributive law. In IEEE Transactions on Information Theory 46, 325-343. Google Scholar
Digital Library
- Aji, S. M., and McEliece, R. J. (2001). The generalized distributive law and free energy minimization. Proceedings of the 39th Annual Allerton Conference on Communication, Control, and Computing. Forthcoming.Google Scholar
- Baxter, R. I. (1982). Exactly Solved Models in Statistical Mechanics. New York: Academic Press.Google Scholar
- Berrou, C., A. Glavieux, and P. Thitimajshima (1993). Near Shannon limit error-correcting coding and decoding: Turbo-codes. In Proceedings of the IEEE International Conference on Communications, Geneva, 1064-1070.Google Scholar
Cross Ref
- Boykov, Y., O. Veksler, and R. Zabih (2001). Fast approximate energy minimization via graph cuts. In IEEE Transactions in Pattern Analysis and Machine Intelligence 23(11), 1222-1239. Google Scholar
Digital Library
- Cover, T. M., and J. A. Thomas (1991). Elements of Information Theory. New York: John Wiley & Sons. Google Scholar
Digital Library
- Cowell, R. (1998). Introduction to inference for Bayesian networks. In Learning in Graphical Models, ed. M. Jordan. Cambridge, MA: MIT Press. Google Scholar
Digital Library
- Freeman, W. T., E. C. Pasztor, and O. T. Canrtichael (2000). Learning low-level vision. International Journal of Computer Vision 40(1), 25-47. Google Scholar
Digital Library
- Frey, B. J. (1998). Graphical Models for Machine Learning and Digital Communication. Cambridge, MA: MIT Press. Google Scholar
Digital Library
- Frey, B. J., and D. J. C. Mackay (1998). A revolution: Belief propagation in graphs with cycles. In Advances in Neural Information Processing Systems, Vol. 10, ed. M. Jordan, M. S. Kearns, and S. A. Solla. Cambridge, MA: MIT Press. Google Scholar
Digital Library
- Gallager, R. G. (1963). Low-Density Parity, Check Codes. Cambridge, MA: MIT Press.Google Scholar
- Gallager, R. G. (1968). Information Theory and Reliable Communication. New York: John Wiley & Sons. Google Scholar
Digital Library
- Geman, S., and D. Geman (1984). Stochastic Relaxation, Gibbs distributions, and the Bayesian restoration of images. IEEE Transactions in Pattern Analysis and Machine Intelligence 6(6), 721-741.Google Scholar
Digital Library
- Jaakkola, T. (2000). Tutorial on variational approximation methods. Available online at www.ai.mit.edu/people/tommi/papers.html.Google Scholar
- Jensen, F. (1996). An Introduction to Bayesian Networks. Berlin: Springer-Verlag. Google Scholar
Digital Library
- Jordan, M. I., Z. Ghahramani, T. Jaakkola, and L. Saul(1998). An introduction to variational methods for graphical models. In Learning in Graphical Models, ed. M Jordan. Cambridge, MA: MIT Press. Google Scholar
Digital Library
- Kikuchi, R. (1951). A theory of cooperative phenomena. Phys. Rev. 81,988-1003.Google Scholar
- Kikuchi, R. (1994). Special issue in honor of R. Kikuchi. Progr. Theor. Phys. Suppl., 115.Google Scholar
- Kschischang, F. R., B. J. Frey, and H.-A. Loeliger (2001). Factor graphs and the sum-product algorithm. IEEE Transactions on Information Theory 47, 498-519. Google Scholar
Digital Library
- Lauritzen, S. L., and D. J. Spiegelhalter (1988). Local computations with probabilities on graphical structures and their application to expert systems (with discussion). Journal of the Royal Statistical Society, Series B 50, 157-224.Google Scholar
- Mackay, D. J. C. (1999). Good error-correcting codes based on very sparse matrices. IEEE Transactions Information Theory 45, 399-431 Google Scholar
Digital Library
- McEliece, R. J., D. J. C. MacKay, and J. F. Cheng (1998). Turbo decoding as an instance of Pearl's "belief propagation" algorithm. In IEEE Journal on Selected Areas in Communications 16(2), 140-152. Google Scholar
Digital Library
- Mezard, M., G. Parisi, and M. A. Virasoro (1987). Spin Glass Theory and Beyond. Singapore: World Scientific.Google Scholar
- Minka, T.P. (2001). Expectation propagation for approximate Bayesian inference. In Proceedings of the Annual Conference on Uncertainty in Artificial Intelligence, Seattle, WA, 362-369. Google Scholar
Digital Library
- Murphy, K., Y. Weiss, and M. Jordan (1999). Loopy belief propagation for approximate inference: An empirical study. In Proceedings of the Annual Conference on Uncertainty in Artificial Intelligence, Stockholm, 467-475. Google Scholar
Digital Library
- Pearl, J. (1988). Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. San Francisco: Morgan Kaufmann Publishers. Google Scholar
Digital Library
- Shafer G. R., and Shenoy P. P. (1990). Probability propagation. Annals of Mathematics and Artifical Intelligence 2, 327-352.Google Scholar
Cross Ref
- Tanner, R. M. (1981). A recursive approach to low complexity codes. In IEEE Transaction on Information Theory IT-27, 533-547.Google Scholar
- Wainright, M. J., T. Jaakkola, and A. S. Willsky (2001). Tree-based reparameterization framework for approximate estimation in graphs with cycles. LIDS Technical Report P-2510. Available online at ssg.mit.edu/group/mjwain/mjwain.shtml.Google Scholar
- Welling, M., and Y. W. Teh (2001). Belief optimization for binary networks: A stable alternative to loopy belief propagation. In Proceedings of the Annual Conference on Uncertainty in Artificial Intelligence, Seattle, WA, 554-561. Google Scholar
Digital Library
- Yedidia, J. S. (2001). An idiosyncratic journey beyond mean field theory. In Advance Mean Field Methods-Theory and Practice, ed. D. Saad and M. Opper. Cambridge, MA: MIT Press.Google Scholar
- Yedidia, J. S., W. T. Freeman, and Y. Weiss (2001a). Bethe free energies, Kikuchi approximations, and belief propagation algorithms. Available online at www.merl.com/reports/TR2001-16/index. html.Google Scholar
- Yedidia, J. S., W. T. Freeman, and Y. Weiss (2001b). Characterizing belief propagation and its generalizations. Available online at www.merl.com/reports/TR2001-15/index.html.Google Scholar
- Yuille, A. L. (2001). A double-loop algorithm to minimize the Bethe and Kikuchi free energies. Unpublished.Google Scholar
Index Terms
Understanding belief propagation and its generalizations
Recommendations
Loopy belief propagation for approximate inference: an empirical study
UAI'99: Proceedings of the Fifteenth conference on Uncertainty in artificial intelligenceRecently, researchers have demonstrated that "loopy belief propagation" -- the use of Pearl's polytree algorithm in a Bayesian network with loops -- can perform well in the context of error-correcting codes. The most dramatic instance of this is the ...
Generalized belief propagation
NIPS'00: Proceedings of the 13th International Conference on Neural Information Processing SystemsBelief propagation (BP) was only supposed to work for treelike networks but works surprisingly well in many applications involving networks with loops, including turbo codes. However, there has been little understanding of the algorithm or the nature of ...
Fractional belief propagation
NIPS'02: Proceedings of the 15th International Conference on Neural Information Processing SystemsWe consider loopy belief propagation for approximate inference in probabilistic graphical models. A limitation of the standard algorithm is that clique marginals are computed as if there were no loops in the graph. To overcome this limitation, we ...





Comments