In the paper "Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network" by Christian Ledig et al., the distance between images (used in the loss function) is computed on feature maps extracted from the VGG19 network. The two variants used in the paper are (a bit confusingly) called VGG22 and VGG54.

What are these feature maps?

What do the designations "22" and "54" mean?

Stephen Rauch
Lafayette

1 Answer

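Reading the paper, it seems they define VGG54 as the loss given by the Euclidean distance between the $\phi_{5,4}$ feature maps of the generated (super-resolved) image and of the high-resolution reference image, both extracted with the VGG19 network. Here $\phi_{i,j}$ is defined as "the feature map obtained by the j-th convolution (after activation) and before the i-th max-pooling layer within the VGG19 network". So "54" stands for $i=5$, $j=4$ (the fourth convolution before the fifth max-pooling layer), and VGG22 is the same loss computed on $\phi_{2,2}$, a shallower feature map that captures lower-level image features.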
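To make the $\phi_{i,j}$ indexing concrete, here is a minimal sketch that walks the standard VGG19 layer configuration and locates the activation that follows the j-th convolution before the i-th max-pooling layer. The function name `phi_layer_index` is hypothetical, and the returned position assumes the network is stored as a flat sequence in which every convolution is immediately followed by one ReLU (conv, ReLU, ..., pool); a framework that arranges its layers differently would need different indices.

```python
# VGG19 convolutional layout: numbers are conv output channels, "M" is max-pooling.
VGG19_CFG = [64, 64, "M", 128, 128, "M", 256, 256, 256, 256, "M",
             512, 512, 512, 512, "M", 512, 512, 512, 512, "M"]


def phi_layer_index(i, j, cfg=VGG19_CFG):
    """Return the 0-based position of the ReLU that follows the j-th
    convolution before the i-th max-pooling layer.

    Assumption (not from the paper): layers are flattened as
    conv, ReLU, conv, ReLU, ..., pool, with one ReLU per convolution.
    """
    index = 0          # position in the flattened conv/ReLU/pool sequence
    block = 1          # block i is the run of convs ending at the i-th max-pool
    conv_in_block = 0  # how many convs of the current block have been seen
    for entry in cfg:
        if entry == "M":
            index += 1          # the pooling layer occupies one slot
            block += 1
            conv_in_block = 0
        else:
            conv_in_block += 1
            if block == i and conv_in_block == j:
                return index + 1  # +1 steps from the conv to its ReLU
            index += 2          # skip this conv and its ReLU
    raise ValueError(f"VGG19 has no convolution j={j} in block i={i}")


print(phi_layer_index(2, 2))  # VGG22 -> 8
print(phi_layer_index(5, 4))  # VGG54 -> 35
```

Under that layout assumption, the feature extractor for VGG22 would be the first 9 layers of the sequence and the one for VGG54 the first 36 layers; the loss is then the Euclidean distance between the two resulting feature maps.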

Stephen Rauch