skip to main content
10.1145/3474085.3475385acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Yes, "Attention Is All You Need", for Exemplar based Colorization

Published: 17 October 2021 Publication History

Abstract

Conventional exemplar based image colorization tends to transfer colors from reference image only to grayscale image based on the semantic correspondence between them. But their practical capabilities are limited when semantic correspondence can hardly be found. To overcome this issue, additional information, such as colors from the database is normally introduced. However, it's a great challenge to consider color information from reference image and database simultaneously because there lacks a unified framework to model different color information and the multi-modal ambiguity in database cannot be removed easily. Also, it is difficult to fuse different color information effectively. Thus, a general attention based colorization framework is proposed in this work, where the color histogram of reference image is adopted as a prior to eliminate the ambiguity in database. Moreover, a sparse loss is designed to guarantee the success of information fusion. Both qualitative and quantitative experimental results show that the proposed approach achieves better colorization performance compared with the state-of-the-art methods on public databases with different quality metrics.

Supplementary Material

ZIP File (mfp1280aux.zip)
More samples of colorization results and the detailed structure of our sub-module.

References

[1]
Hyojin Bahng, Seungjoo Yoo, Wonwoong Cho, David Keetae Park, Ziming Wu, Xiaojuan Ma, and Jaegul Choo. 2018. Coloring with Words: Guiding Image Colorization Through Text-based Palette Generation. In Computer Vision -- ECCV 2018. 431--447.
[2]
A. Bugeau, V. Ta, and N. Papadakis. 2014. Variational Exemplar-Based Image Colorization. IEEE Transactions on Image Processing 23, 1 (2014), 298--307.
[3]
Guillaume Charpiat, Matthias Hofmann, and Bernhard Schölkopf. 2008. Automatic Image Colorization Via Multimodal Predictions. In Computer Vision -- ECCV 2008. 126--139.
[4]
Z. Cheng, Q. Yang, and B. Sheng. 2015. Deep Colorization. In 2015 IEEE International Conference on Computer Vision (ICCV). 415--423.
[5]
Z. Cheng, Q. Yang, and B. Sheng. 2017. Colorization Using Neural Network Ensemble. IEEE Transactions on Image Processing 26, 11 (2017), 5491--5505.
[6]
Alex Yong-Sang Chia, Shaojie Zhuo, Raj Kumar Gupta, Yu-Wing Tai, Siu-Yeung Cho, Ping Tan, and Stephen Lin. 2011. Semantic Colorization with Internet Images. ACM Trans. Graph. 30, 6 (2011), 1--8.
[7]
J. Deng, W. Dong, R. Socher, L. Li, Kai Li, and Li Fei-Fei. 2009. ImageNet: A large-scale hierarchical image database. In 2009 IEEE Conference on Computer Vision and Pattern Recognition. 248--255.
[8]
A. Deshpande, J. Rock, and D. Forsyth. 2015. Learning Large-Scale Automatic Image Colorization. In 2015 IEEE International Conference on Computer Vision (ICCV). 567--575.
[9]
Raj Kumar Gupta, Alex Yong-Sang Chia, Deepu Rajan, Ee Sin Ng, and Huang Zhiyong. 2012. Image Colorization Using Similar Images. In Proceedings of the 20th ACM International Conference on Multimedia (MM '12). 369--378.
[10]
Mingming He, Dongdong Chen, Jing Liao, Pedro V. Sander, and Lu Yuan. 2018. Deep Exemplar-Based Colorization. ACM Trans. Graph. 37, 4, Article 47 (2018).
[11]
Simao Herdade, Armin Kappeler, Kofi Boakye, and Joao Soares. 2019. Image Captioning: Transforming Objects into Words. In Advances in Neural Information Processing Systems 32. 11137--11147.
[12]
Vlad Hosu, Bastian Goldlucke, and Dietmar Saupe. 2019. Effective Aesthetics Prediction With Multi-Level Spatially Pooled Features. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 9375--9383.
[13]
Yi-Chin Huang, Yi-Shin Tung, Jun-Cheng Chen, Sung-Wen Wang, and Ja-Ling Wu. 2005. An Adaptive Edge Detection Based Colorization Algorithm and Its Applications. In Proceedings of the 13th Annual ACM International Conference on Multimedia. 351--354.
[14]
Revital Irony, Daniel Cohen-Or, and Dani Lischinski. 2005. Colorization by Example. In Eurographics Symposium on Rendering.
[15]
Phillip Isola, Jun-Yan Zhu, Tinghui Zhou, and Alexei A. Efros. 2017. Image-To-Image Translation With Conditional Adversarial Networks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 1125--1134.
[16]
Justin Johnson, Alexandre Alahi, and Li Fei-Fei. 2016. Perceptual Losses for Real-Time Style Transfer and Super-Resolution. In Computer Vision -- ECCV 2016. 694--711.
[17]
S. H. Kang and R. March. 2007. Variational Models for Image Colorization via Chromaticity and Brightness Decomposition. IEEE Transactions on Image Processing 16, 9 (2007), 2251--2261.
[18]
Gustav Larsson, Michael Maire, and Gregory Shakhnarovich. 2016. Learning Representations for Automatic Colorization. In Computer Vision -- ECCV 2016. 577--593.
[19]
Junsoo Lee, Eungyeup Kim, Yunsung Lee, Dongjun Kim, Jaehyuk Chang, and Jaegul Choo. 2020. Reference-Based Sketch Image Colorization Using Augmented- Self Reference and Dense Semantic Correspondence. In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 5801--5810.
[20]
Anat Levin, Dani Lischinski, and Yair Weiss. 2004. Colorization Using Optimization. ACM Trans. Graph. 23, 3 (2004), 689--694.
[21]
Peng Lu, Jinbei Yu, Xujun Peng, Zhaoran Zhao, and Xiaojie Wang. 2020. Gray2ColorNet: Transfer More Colors from Reference Image. In Proceedings of the 28th ACM International Conference on Multimedia. 3210--1218.
[22]
Qing Luan, Fang Wen, Daniel Cohen-Or, Lin Liang, Ying-Qing Xu, and HeungYeung Shum. 2007. Natural Image Colorization. In Proceedings of the 18th Eurographics Conference on Rendering Techniques. 309--320.
[23]
X. Mao, Q. Li, H. Xie, R. Y. K. Lau, Z. Wang, and S. P. Smolley. 2017. Least Squares Generative Adversarial Networks. In 2017 IEEE International Conference on Computer Vision (ICCV). 2813--2821.
[24]
Yuji Morimoto, Yuichi Taguchi, and Takeshi Naemura. 2009. Automatic Colorization of Grayscale Images Using Multiple Images on the Web. In SIGGRAPH '09: Posters.
[25]
Yingge Qu, Tien-Tsin Wong, and Pheng-Ann Heng. 2006. Manga Colorization. In ACM SIGGRAPH 2006 Papers. 1214--1220.
[26]
Patsorn Sangkloy, Jingwan Lu, Chen Fang, Fisher Yu, and James Hays. 2017. Scribbler: Controlling Deep Image Synthesis With Sketch and Color. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 5400--5409.
[27]
Karen Simonyan and Andrew Zisserman. 2015. Very Deep Convolutional Networks for Large-Scale Image Recognition. In 3rd International Conference on Learning Representations, ICLR.
[28]
Jheng-Wei Su, Hung-Kuo Chu, and Jia-Bin Huang. 2020. Instance-Aware Image Colorization. In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 7968--7977.
[29]
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, and Illia Polosukhin. 2017. Attention is All you Need. In Advances in Neural Information Processing Systems 30. 5998--6008.
[30]
Chufeng Xiao, Chu Han, Zhuming Zhang, Jing Qin, Tien-Tsin Wong, Guoqiang Han, and Shengfeng He. 2020. Example-Based Colourization Via Dense Encoding Pyramids. Computer Graphics Forum 39, 1 (2020), 20--33.
[31]
Zhongyou Xu, Tingting Wang, Faming Fang, Yun Sheng, and Guixu Zhang. 2020. Stylization-Based Architecture for Fast Deep Exemplar Colorization. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 9363--9372.
[32]
Yu-Wing Tai, Jiaya Jia, and Chi-Keung Tang. 2005. Local color transfer via probabilistic segmentation by expectation-maximization. In 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), Vol. 1. 747--754 vol. 1.
[33]
Bo Zhang, Mingming He, Jing Liao, Pedro V. Sander, Lu Yuan, Amine Bermak, and Dong Chen. 2019. Deep Exemplar-Based Video Colorization. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 8052--8061.
[34]
H. Zhang, T. Xu, H. Li, S. Zhang, X. Wang, X. Huang, and D. N. Metaxas. 2019. StackGAN++: Realistic Image Synthesis with Stacked Generative Adversarial Networks. IEEE Transactions on Pattern Analysis and Machine Intelligence 41, 8 (2019), 1947--1962.
[35]
Richard Zhang, Phillip Isola, and Alexei A. Efros. 2016. Colorful Image Colorization. In Computer Vision -- ECCV 2016. 649--666.
[36]
Richard Zhang, Jun-Yan Zhu, Phillip Isola, Xinyang Geng, Angela S. Lin, Tianhe Yu, and Alexei A. Efros. 2017. Real-Time User-Guided Image Colorization with Learned Deep Priors. ACM Trans. Graph. 36, 4, Article 119 (2017).

Cited By

View all
  • (2024)Towards Photorealistic Video Colorization via Gated Color-Guided Image Diffusion ModelsProceedings of the 32nd ACM International Conference on Multimedia10.1145/3664647.3681356(10891-10900)Online publication date: 28-Oct-2024
  • (2024)Versatile Vision Foundation Model for Image and Video ColorizationACM SIGGRAPH 2024 Conference Papers10.1145/3641519.3657509(1-11)Online publication date: 13-Jul-2024
  • (2024)Latent-Guided Exemplar-Based Image Re-Colorization2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)10.1109/WACV57701.2024.00420(4238-4247)Online publication date: 3-Jan-2024
  • Show More Cited By

Index Terms

  1. Yes, "Attention Is All You Need", for Exemplar based Colorization

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    MM '21: Proceedings of the 29th ACM International Conference on Multimedia
    October 2021
    5796 pages
    ISBN:9781450386517
    DOI:10.1145/3474085
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 17 October 2021

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. GAN
    2. colorization
    3. image understanding

    Qualifiers

    • Research-article

    Conference

    MM '21
    Sponsor:
    MM '21: ACM Multimedia Conference
    October 20 - 24, 2021
    Virtual Event, China

    Acceptance Rates

    Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)187
    • Downloads (Last 6 weeks)27
    Reflects downloads up to 21 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Towards Photorealistic Video Colorization via Gated Color-Guided Image Diffusion ModelsProceedings of the 32nd ACM International Conference on Multimedia10.1145/3664647.3681356(10891-10900)Online publication date: 28-Oct-2024
    • (2024)Versatile Vision Foundation Model for Image and Video ColorizationACM SIGGRAPH 2024 Conference Papers10.1145/3641519.3657509(1-11)Online publication date: 13-Jul-2024
    • (2024)Latent-Guided Exemplar-Based Image Re-Colorization2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)10.1109/WACV57701.2024.00420(4238-4247)Online publication date: 3-Jan-2024
    • (2024)Real-Time User-guided Adaptive Colorization with Vision Transformer2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)10.1109/WACV57701.2024.00054(473-482)Online publication date: 3-Jan-2024
    • (2024)Automatic Controllable Colorization via Imagination2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)10.1109/CVPR52733.2024.00252(2609-2619)Online publication date: 16-Jun-2024
    • (2024)PSANetIET Computer Vision10.1049/cvi2.1229118:7(922-934)Online publication date: 31-Oct-2024
    • (2024)3D human pose estimation method based on multi-constrained dilated convolutionsMultimedia Systems10.1007/s00530-024-01441-630:5Online publication date: 14-Aug-2024
    • (2023)Brighten-and-Colorize: A Decoupled Network for Customized Low-Light Image EnhancementProceedings of the 31st ACM International Conference on Multimedia10.1145/3581783.3611907(8356-8366)Online publication date: 26-Oct-2023
    • (2023)iColoriT: Towards Propagating Local Hints to the Right Region in Interactive Colorization by Leveraging Vision Transformer2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)10.1109/WACV56688.2023.00183(1787-1796)Online publication date: Jan-2023
    • (2023)Unsupervised Deep Exemplar Colorization via Pyramid Dual Non-Local AttentionIEEE Transactions on Image Processing10.1109/TIP.2023.329377732(4114-4127)Online publication date: 1-Jan-2023
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media