Dataset Information

Hint-Based Image Colorization Based on Hierarchical Vision Transformer.


ABSTRACT: Hint-based image colorization is an image-to-image translation task that aims to create a full-color image from an input luminance image when a small set of color values for some pixels is given as hints. Although traditional deep-learning-based methods have been proposed in the literature, they are based on convolutional neural networks (CNNs), which have strong spatial locality due to the convolution operations. This often causes non-trivial visual artifacts in the colorization results, such as false colors and color bleeding. To overcome this limitation, this study proposes a vision-transformer-based colorization network. The proposed hint-based colorization network has a hierarchical vision transformer architecture in the form of an encoder-decoder structure built from transformer blocks. Because the transformer blocks can learn rich long-range dependencies, the proposed method can achieve visually plausible colorization results even with a small number of color hints. Verification experiments show that the proposed transformer model outperforms conventional CNN-based models. In addition, we qualitatively analyze the effect of the transformer model's long-range dependency on hint-based image colorization.
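
The abstract describes the task input as a luminance image plus color values for a few pixels. As a rough illustration only, the sketch below assembles such an input under assumptions that are not stated in this record: a Lab-style split into a luminance channel and two chroma channels, plus a binary mask marking the hinted pixels. The function name `build_hint_input`, the channel layout, and the random choice of hint locations are all hypothetical and are not taken from the authors' implementation.

```python
import random


def build_hint_input(luminance, ab_truth, num_hints, seed=0):
    """Build a per-pixel input (L, a_hint, b_hint, mask) from sparse color hints.

    Hypothetical helper for illustration; the channel layout is an assumption.
    luminance: H x W nested list of floats (luminance channel).
    ab_truth:  H x W nested list of (a, b) chroma tuples for the color image.
    num_hints: number of pixels whose chroma is revealed as a hint.
    """
    height, width = len(luminance), len(luminance[0])
    rng = random.Random(seed)

    # Pick hint locations at random (assumption: real hints would come from a user).
    all_pixels = [(r, c) for r in range(height) for c in range(width)]
    hint_pixels = set(rng.sample(all_pixels, num_hints))

    rows = []
    for r in range(height):
        row = []
        for c in range(width):
            if (r, c) in hint_pixels:
                a, b = ab_truth[r][c]
                row.append((luminance[r][c], a, b, 1.0))  # mask = 1: chroma known
            else:
                row.append((luminance[r][c], 0.0, 0.0, 0.0))  # mask = 0: chroma hidden
        rows.append(row)
    return rows


# Toy 2 x 2 example with a single hinted pixel.
luminance = [[50.0, 60.0], [70.0, 80.0]]
ab_truth = [[(10.0, -5.0), (0.0, 0.0)], [(20.0, 15.0), (-8.0, 4.0)]]
hint_input = build_hint_input(luminance, ab_truth, num_hints=1)
```

A colorization network would then be trained to predict the full chroma channels from this four-channel input. The explicit mask channel lets a model tell a hinted pixel whose chroma is zero apart from a pixel with no hint at all.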

SUBMITTER: Lee S 

PROVIDER: S-EPMC9570951 | biostudies-literature | 2022 Sep

REPOSITORIES: biostudies-literature

Publications

Hint-Based Image Colorization Based on Hierarchical Vision Transformer.

Lee Subin S, Jung Yong Ju YJ

Sensors (Basel, Switzerland) 2022-09-29 19


Similar Datasets

| S-EPMC9839963 | biostudies-literature
| S-EPMC8631650 | biostudies-literature
| S-EPMC10773825 | biostudies-literature
| S-EPMC11691071 | biostudies-literature
| S-EPMC10247807 | biostudies-literature
| S-EPMC9044334 | biostudies-literature
| S-EPMC8691725 | biostudies-literature
| S-EPMC9024011 | biostudies-literature
| S-EPMC8583247 | biostudies-literature
| S-EPMC11161220 | biostudies-literature