Table 2. Impact of different colour maps on the effectiveness of an ImageNet pre-trained ResNet as an audio feature extractor on the DCASE 2018 domestic activity classification task. All results are macro-average F1 scores.

From: Towards cross-modal pre-training and learning tempo-spatial characteristics for audio recognition with convolutional and recurrent neural networks

Network   Pre-training   Colour map   Devel   Test
ResNet    ImageNet       Cividis      82.6    80.2
ResNet    ImageNet       Gray         82.2    79.4
ResNet    ImageNet       Hot          82.0    79.8
ResNet    ImageNet       Magma        81.9    80.3
ResNet    ImageNet       Viridis      81.2    79.9
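
The caption reports macro-average F1, that is, the unweighted mean of the per-class F1 scores, so every class counts equally regardless of how many samples it has. The following is a minimal sketch of that metric in plain Python, not the authors' evaluation code. The function name `macro_f1` and the example labels are hypothetical and are not data from the paper. The table values appear to be on a 0 to 100 scale, so the sketch multiplies by 100 when printing.

```python
from collections import Counter


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 over all classes seen in y_true or y_pred."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("y_true and y_pred must be non-empty and of equal length")

    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1   # correct prediction for this class
        else:
            fp[pred] += 1    # predicted class received a false positive
            fn[truth] += 1   # true class received a false negative

    labels = sorted(set(y_true) | set(y_pred))
    per_class = []
    for label in labels:
        # F1 = 2*TP / (2*TP + FP + FN); defined as 0 when the denominator is 0
        denom = 2 * tp[label] + fp[label] + fn[label]
        per_class.append(2 * tp[label] / denom if denom else 0.0)
    return sum(per_class) / len(per_class)


# Hypothetical labels for illustration only (not results from the paper)
y_true = ["cooking", "cooking", "eating", "absence", "absence", "absence"]
y_pred = ["cooking", "eating", "eating", "absence", "absence", "cooking"]
score = macro_f1(y_true, y_pred)
print(f"macro-average F1 = {100 * score:.1f}")
```

Because the mean is unweighted, a colour map that helps a rare class moves this score as much as one that helps a frequent class, which is worth keeping in mind when comparing the small differences between rows.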