Skip to main content

Table 3 Confusion matrix for Euclidean distance when 10 nearest neighbors were retrieved. The values in the matrix are the percentage of the signals retrieved from each category (rows) when the example was from the certain category (columns).

From: Audio Query by Example Using Similarity Measures between Probability Density Functions of Features

  Inside a car In a restaurant Road Jazz Drums Popular Classical Humming Singing Whistling Speaker1 Speaker2 Speaker3 Speaker4 Speaker5 Speaker6 Speaker7
Inside a car 99.5 1.2 4.7 0 0 0 0 0 0 0 0 0 0 0 0 0 0
In a restaurant 0 98.8 2.6 0 0 0 0 0 0 0 0 0 0 0 0 0 0
Road 0.2 0 92.4 0 0 0 0 0 0 0 0 0 0 0 0 0 0
Jazz 0.1 0 0 90.2 0 11.6 5.9 0.2 0.2 0.4 0 0 0 0 0 0 0
Drums 0 0 0 0 93.6 0.1 0 0 0 0 0 0 0 0 0 0 0
Popular 0 0 0.3 8.4 0.2 87.8 13.5 0 0.2 0.9 0 0 0 0 0 0 0
Classical 0 0 0 0.5 0 0.5 78.0 0 0 0.4 0 0 0 0 0 0 0
Humming 0 0 0 0.4 0 0 0.6 90.8 4.0 0.5 0 0 0 0 0 0 0
Singing 0 0 0 0.2 1.1 0 0.4 3.7 93.5 0.4 0 0 0 0 0 0 0
Whistling 0 0 0 0.1 0.7 0 0.4 0 0.5 97.9 0 0 0 0 0 0 0
Speaker1 0 0 0 0 0 0 0.8 0.4 0 0 100 0 0 0 0 0 0
Speaker2 0 0 0 0 0 0 0 0 0 0 0 99.9 2.1 0 0 0 0
Speaker3 0 0 0 0 0 0 0 0 0 0 0 0.1 97.7 0 0 0.3 0
Speaker4 0.2 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0
Speaker5 0 0 0 0 0 0 0 0.3 1.8 0 0 0 0 0 100 0 0
Speaker6 0 0 0 0 0 0 0 0 0 0 0 0 0.2 0 0 99.7 0
Speaker7 0 0 0 0 4.5 0 0.4 0 0 1.3 0 0 0 0 0 0 100