Abstract
Abstract--In the last few years, several new algorithms based on graph cuts have been developed to solve energy minimization problems in computer vision. Each of these techniques constructs a graph such that the minimum cut on the graph also minimizes the energy. Yet, because these graph constructions are complex and highly specific to a particular energy function, graph cuts have seen limited application to date. In this paper, we give a characterization of the energy functions that can be minimized by graph cuts. Our results are restricted to functions of binary variables. However, our work generalizes many previous constructions and is easily applicable to vision problems that involve large numbers of labels, such as stereo, motion, image restoration, and scene reconstruction. We give a precise characterization of what energy functions can be minimized using graph cuts, among the energy functions that can be written as a sum of terms containing three or fewer binary variables. We also provide a general-purpose construction to minimize such an energy function. Finally, we give a necessary condition for any energy function of binary variables to be minimized by graph cuts. Researchers who are considering the use of graph cuts to optimize a particular energy function can use our results to determine if this is possible and then follow our construction to create the appropriate graph. A software implementation is freely available.
- R.K. Ahuja T.L. Magnanti and J.B. Orlin, Network Flows: Theory, Algorithms, and Applications. Prentice Hall, 1993.]] Google Scholar
Digital Library
- A. Amini T. Weymouth and R. Jain, “Using Dynamic Programming for Solving Variational Problems in Vision,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 12, no. 9, pp. 855-867, Sept. 1990.]] Google Scholar
Digital Library
- S. Barnard, “Stochastic Stereo Matching Over Scale,” Int'l J. Computer Vision, vol. 3, no. 1, pp. 17-32, 1989.]]Google Scholar
Cross Ref
- S. Birchfield and C. Tomasi, “Multiway Cut for Stereo and Motion with Slanted Surfaces,” Proc. Int'l Conf. Computer Vision, pp. 489-495, 1999.]]Google Scholar
Cross Ref
- Y. Boykov and M.-P. Jolly, “Interactive Organ Segmentation Using Graph Cuts,” Proc. Medical Image Computing and Computer-Assisted Intervention, pp. 276-286, 2000.]] Google Scholar
Digital Library
- Y. Boykov and M.-P. Jolly, “Interactive Graph Cuts for Optimal Boundary and Region Segmentation of Objects in N-D Images,” Proc. Int'l Conf. Computer Vision, pp. 105-112, 2001.]]Google Scholar
Cross Ref
- Y. Boykov and V. Kolmogorov, “An Experimental Comparison of Min-Cut/Max-Flow Algorithms for Energy Minimization in Computer Vision,” Proc. Int'l Workshop Energy Minimization Methods in Computer Vision and Pattern Recognition, Lecture Notes in Computer Science, pp. 359-374, Springer-Verlag, Sept. 2001.]] Google Scholar
Digital Library
- Y. Boykov and V. Kolmogorov, “Computing Geodesics and Minimal Surfaces via Graph Cuts,” Proc. Int'l Conf. Computer Vision, pp. 26-33, 2003.]] Google Scholar
Digital Library
- Y. Boykov O. Veksler and R. Zabih, “Markov Random Fields with Efficient Approximations,” Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 648-655, 1998.]] Google Scholar
Digital Library
- Y. Boykov O. Veksler and R. Zabih, “Fast Approximate Energy Minimization via Graph Cuts,” Proc. IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 23, no. 11, pp. 1222-1239, Nov. 2001.]] Google Scholar
Digital Library
- W.H. Cunningham, “Minimum Cuts, Modular Functions, and Matroid Polyhedra,” Networks, vol. 15, pp. 205-215, 1985.]]Google Scholar
Cross Ref
- E. Dahlhaus D.S. Johnson C.H. Papadimitriou P.D. Seymour and M. Yannakakis, “The Complexity of Multiway Cuts,” Proc. ACM Symp. Theory of Computing, pp. 241-251, 1992.]] Google Scholar
Digital Library
- J. Dias and J. Leitao, “The ZπM Algorithm: A Method for Interferometric Image Reconstruction in SAR/SAS” IEEE Trans. Image Processing, vol. 11, no. 4, pp. 408-422, Apr. 2002.]]Google Scholar
Digital Library
- L. Ford and D. Fulkerson, Flows in Networks. Princeton Univ. Press, 1962.]]Google Scholar
- S. Fujishige, “ Submodular Functions and Optimization,” vol. 47, Annals of Discrete Math., North Holland, 1990.]]Google Scholar
- S. Geman and D. Geman, “Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 6, pp. 721-741, 1984.]]Google Scholar
Digital Library
- A. Goldberg and R. Tarjan, “A New Approach to the Maximum Flow Problem,” J. ACM, vol. 35, no. 4, pp. 921-940, Oct. 1988.]] Google Scholar
Digital Library
- D. Greig B. Porteous and A. Seheult, “Exact Maximum A Posteriori Estimation for Binary Images,” J. Royal Statistical Soc., Series B, vol. 51, no. 2, pp. 271-279, 1989.]]Google Scholar
- M. Grötschel L. Lovasz and A. Schrijver, Geometric Algorithms and Combinatorial Optimization. Springer-Verlag, 1988.]]Google Scholar
- H. Ishikawa and D. Geiger, “Occlusions, Discontinuities, and Epipolar Lines in Stereo,” Proc. European Conf. Computer Vision, pp. 232-248, 1998.]] Google Scholar
Digital Library
- H. Ishikawa and D. Geiger, “Segmentation by Grouping Junctions,” Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 125-131, 1998.]] Google Scholar
Digital Library
- H. Ishikawa, “Exact Optimization for Markov Random Fields with Convex Priors,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 25, no. 10, pp. 1333-1336, Oct. 2003.]]Google Scholar
Digital Library
- S. Iwata L. Fleischer and S. Fujishige, “A Combinatorial, Strongly Polynomial Algorithm for Minimizing Submodular Functions,” J. ACM, vol. 48, no. 4, pp. 761-777, July 2001.]] Google Scholar
Digital Library
- J. Kim V. Kolmogorov and R. Zabih, “Visual Correspondence Using Energy Minimization and Mutual Information,” Proc. Int'l Conf. Computer Vision, pp. 1033-1040, 2003.]] Google Scholar
Digital Library
- J. Kim and R. Zabih, “Automatic Segmentation of Contrast-Enhanced Image Sequences,” Proc. Int'l Conf. Computer Vision, pp. 502-509, 2003.]] Google Scholar
Digital Library
- J. Kim J. Fisher A. Tsai C. Wible A. Willsky and W. Wells, “Incorporating Spatial Priors into an Information Theoretic Approach for FMRI Data Analysis,” Proc. Medical Image Computing and Computer-Assisted Intervention, pp. 62-71, 2000.]] Google Scholar
Digital Library
- V. Kolmogorov and R. Zabih, “Visual Correspondence with Occlusions Using Graph Cuts,” Proc. Int'l Conf. Computer Vision, pp. 508-515, 2001.]]Google Scholar
Cross Ref
- V. Kolmogorov and R. Zabih, “Multi-Camera Scene Reconstruction via Graph Cuts,” Proc. European Conf. Computer Vision, vol. 3, pp. 82-96, 2002.]] Google Scholar
Digital Library
- V. Kwatra A. Schödl I. Essa G. Turk and A. Bobick, “Graphcut Textures: Image and Video Synthesis Using Graph Cuts,” ACM Trans. Graphics, Proc. SIGGRAPH 2003, July 2003.]] Google Scholar
Digital Library
- C.-H. Lee D. Lee and M. Kim, “Optimal Task Assignment in Linear Array Networks,” IEEE Trans. Computers, vol. 41, no. 7, pp. 877-880, July 1992.]] Google Scholar
Digital Library
- S. Li, Markov Random Field Modeling in Computer Vision. Springer-Verlag, 1995.]] Google Scholar
Digital Library
- M.H. Lin, “Surfaces with Occlusions from Layered Stereo,” PhD thesis, Stanford Univ., Dec. 2002.]]Google Scholar
- I. Milis, “Task Assignment in Distributed Systems Using Network Flow Methods,” Proc. Combinatorics and Computer Science, Lecture Notes in Computer Science, pp. 396-405, Springer-Verlag, 1996.]] Google Scholar
Digital Library
- T. Poggio V. Torre and C. Koch, “Computational Vision and Regularization Theory,” Nature, vol. 317, pp. 314-319, 1985.]] Google Scholar
Digital Library
- S. Roy, “Stereo without Epipolar Lines: A Maximum Flow Formulation,” Int'l J. Computer Vision, vol. 1, no. 2, pp. 1-15, 1999.]] Google Scholar
Digital Library
- S. Roy and I. Cox, “A Maximum-Flow Formulation of the n-Camera Stereo Correspondence Problem,” Proc. Int'l Conf. Computer Vision, 1998.]] Google Scholar
Digital Library
- D. Scharstein and R. Szeliski, “A Taxonomy and Evaluation of Dense Two-Frame Stereo Correspondence Algorithms,” Int'l J. Computer Vision, vol. 47, pp. 7-42, Apr. 2002.]] Google Scholar
Digital Library
- A. Schrijver, “A Combinatorial Algorithm Minimizing Submodular Functions in Strongly Polynomial Time,” J. Combinatorial Theory, vol. B 80, pp. 346-355, 2000.]] Google Scholar
Digital Library
- D. Snow P. Viola and R. Zabih, “Exact Voxel Occupancy with Graph Cuts,” Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 345-352, 2000.]]Google Scholar
- H.S. Stone, “Multiprocessor Scheduling with the Aid of Network Flow Algorithms,” IEEE Trans. Software Eng., pp. 85-93, 1977.]]Google Scholar
Digital Library
- R. Szeliski and R. Zabih, “An Experimental Comparison of Stereo Algorithms,” Proc. Vision Algorithms: Theory and Practice, Lecture Notes in Computer Science, B. Triggs, A. Zisserman, and R. Szeliski, eds.,vol. 1883, pp. 1-19, Springer-Verlag, Sept. 1999.]] Google Scholar
Digital Library
Index Terms
What Energy Functions Can Be Minimizedvia Graph Cuts?
Recommendations
Flow equivalent trees in undirected node-edge-capacitated planar graphs
Given an edge-capacitated undirected graph G = (V, E, C) with edge capacity c:E ↦ R+, n = |V|, an s-t edge cut C of G is a minimal subset of edges whose removal from G will separate s from t in the resulting graph, and the capacity sum of the edges in C ...
What Energy Functions Can Be Minimized via Graph Cuts?
ECCV '02: Proceedings of the 7th European Conference on Computer Vision-Part IIIIn the last few years, several new algorithms based on graph cuts have been developed to solve energy minimization problems in computer vision. Each of these techniques constructs a graph such that the minimum cut on the graph also minimizes the energy. ...




Comments