Supplementary material for ICCV submission 667:

Understanding deep features with computer generated imagery

Experiment on ETH-80: qualitative results

The images presented on this page are similar to those presented in figure 2 of the paper. They correspond to the PCA embeddings of AlexNet features (first column), the style feature of our decomposition (second column), and the rotation feature of our decomposition (third and fourth columns). Since the ETH-80 dataset provides only 10 instances per category, we show the projections of all the views for both style (second column) and orientation (third column). Note that this is different from the second and third columns of the paper, which show the marginalized features. The first column is similar to the first column of figure 2 in the paper, and the fourth column to columns 3 and 4 of figure 2.
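For readers who want to reproduce embeddings of this kind, the following is a minimal sketch, not the exact pipeline used in the paper. It assumes torchvision's pretrained AlexNet, standard ImageNet preprocessing, and a 2-component PCA from scikit-learn; the helper names extract_features and pca_embedding are illustrative, and only the layer names (pool5, fc6, fc7) follow the paper.

# Sketch: per-layer AlexNet feature extraction and PCA projection.
# The choice of torchvision weights, preprocessing, and 2-D PCA are
# assumptions for illustration, not the paper's exact setup.
import torch
import torchvision.models as models
import torchvision.transforms as T
from sklearn.decomposition import PCA
from PIL import Image

alexnet = models.alexnet(weights=models.AlexNet_Weights.IMAGENET1K_V1).eval()

preprocess = T.Compose([
    T.Resize(256), T.CenterCrop(224), T.ToTensor(),
    T.Normalize(mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225]),
])

def extract_features(img_paths, layer="pool5"):
    """Return an (N, D) matrix of flattened AlexNet activations."""
    feats = []
    with torch.no_grad():
        for p in img_paths:
            x = preprocess(Image.open(p).convert("RGB")).unsqueeze(0)
            h = alexnet.features(x)              # conv stack, ends at pool5
            h = alexnet.avgpool(h).flatten(1)    # (1, 9216)
            if layer in ("fc6", "fc7"):
                h = alexnet.classifier[2](alexnet.classifier[1](h))  # fc6 + ReLU
            if layer == "fc7":
                h = alexnet.classifier[5](alexnet.classifier[4](h))  # fc7 + ReLU
            feats.append(h.squeeze(0))
    return torch.stack(feats).numpy()

def pca_embedding(features, n_components=2):
    """Project the features onto their first principal components for plotting."""
    return PCA(n_components=n_components).fit_transform(features)

Each 2-D point in such an embedding corresponds to one (instance, view) image; coloring the points by instance or by viewpoint gives plots analogous to the columns below.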

While the smaller number of views and instances compared to the experiments presented in the paper makes these embeddings slightly harder to interpret, one can still notice that the spaces associated with style and viewpoint are distinct and meaningful. It is also clear that the viewpoint is encoded differently across categories, and that fc6 is more invariant to left/right flips. Interestingly, one can also note that color seems to be an important factor in differentiating the instances.
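The flip observation can also be checked quantitatively. Below is a minimal sketch, assuming a hypothetical feature_fn that maps a PIL image to a 1-D feature vector (e.g. a wrapper around the extraction sketch above); it simply measures the cosine similarity between each view and its mirror image, which is an illustration rather than the protocol used in the paper.

# Sketch: quantify left/right flip invariance of a given layer as the
# mean cosine similarity between each view and its horizontal mirror.
import numpy as np
from PIL import Image, ImageOps

def flip_invariance(img_paths, feature_fn):
    """feature_fn: PIL image -> 1-D numpy feature vector (assumed helper)."""
    sims = []
    for p in img_paths:
        img = Image.open(p).convert("RGB")
        f = feature_fn(img)
        g = feature_fn(ImageOps.mirror(img))
        sims.append(float(np.dot(f, g) /
                          (np.linalg.norm(f) * np.linalg.norm(g) + 1e-12)))
    return float(np.mean(sims))

A layer that is more invariant to left/right flips should yield a higher mean similarity under this measure.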

Please click on any image to see it in full resolution.

[Image grid: for each category (apples, cars, cows, cups, dogs, horses, pears, tomatoes) and each layer (pool5, fc6, fc7), four embedding plots are shown, corresponding to the columns Global, Style, Rotation, and Rotation marginal.]