
Table 3 Confusion matrix for Euclidean distance when the 10 nearest neighbors were retrieved. Each value is the percentage of signals retrieved from a category (rows) when the query example was from a given category (columns).

From: Audio Query by Example Using Similarity Measures between Probability Density Functions of Features

 

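| | Inside a car | In a restaurant | Road | Jazz | Drums | Popular | Classical | Humming | Singing | Whistling | Speaker1 | Speaker2 | Speaker3 | Speaker4 | Speaker5 | Speaker6 | Speaker7 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Inside a car | 99.5 | 1.2 | 4.7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| In a restaurant | 0 | 98.8 | 2.6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Road | 0.2 | 0 | 92.4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Jazz | 0.1 | 0 | 0 | 90.2 | 0 | 11.6 | 5.9 | 0.2 | 0.2 | 0.4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Drums | 0 | 0 | 0 | 0 | 93.6 | 0.1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Popular | 0 | 0 | 0.3 | 8.4 | 0.2 | 87.8 | 13.5 | 0 | 0.2 | 0.9 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Classical | 0 | 0 | 0 | 0.5 | 0 | 0.5 | 78.0 | 0 | 0 | 0.4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Humming | 0 | 0 | 0 | 0.4 | 0 | 0 | 0.6 | 90.8 | 4.0 | 0.5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Singing | 0 | 0 | 0 | 0.2 | 1.1 | 0 | 0.4 | 3.7 | 93.5 | 0.4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Whistling | 0 | 0 | 0 | 0.1 | 0.7 | 0 | 0.4 | 0 | 0.5 | 97.9 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Speaker1 | 0 | 0 | 0 | 0 | 0 | 0 | 0.8 | 0.4 | 0 | 0 | 100 | 0 | 0 | 0 | 0 | 0 | 0 |
| Speaker2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 99.9 | 2.1 | 0 | 0 | 0 | 0 |
| Speaker3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.1 | 97.7 | 0 | 0 | 0.3 | 0 |
| Speaker4 | 0.2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 | 0 | 0 | 0 |
| Speaker5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.3 | 1.8 | 0 | 0 | 0 | 0 | 0 | 100 | 0 | 0 |
| Speaker6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.2 | 0 | 0 | 99.7 | 0 |
| Speaker7 | 0 | 0 | 0 | 0 | 4.5 | 0 | 0.4 | 0 | 0 | 1.3 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |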
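
As an illustration of how a matrix with this layout can be computed, the following is a minimal sketch and not the authors' implementation. It assumes each signal is represented by a plain feature vector, whereas the article measures distances between probability density functions of features. The function name `retrieval_confusion` and the toy data are hypothetical. Every signal is used once as the query, its k nearest neighbors under Euclidean distance are retrieved, and the counts are normalised so that each column sums to 100.

```python
import math


def retrieval_confusion(features, labels, k):
    """Return (classes, matrix) for k-nearest-neighbor query by example.

    matrix[r][c] is the percentage of retrieved signals that belong to
    class r when the query example belongs to class c.
    Assumption: each signal is a plain feature vector (a toy stand-in).
    """
    classes = sorted(set(labels))
    index = {name: i for i, name in enumerate(classes)}
    n = len(classes)
    counts = [[0] * n for _ in range(n)]

    for q, (query_vec, query_label) in enumerate(zip(features, labels)):
        # Rank every other signal by Euclidean distance to the query.
        others = [i for i in range(len(features)) if i != q]
        others.sort(key=lambda i: math.dist(query_vec, features[i]))
        # Count the class of each of the k retrieved signals.
        for i in others[:k]:
            counts[index[labels[i]]][index[query_label]] += 1

    # Normalise each column (query category) to percentages.
    matrix = [[0.0] * n for _ in range(n)]
    for c in range(n):
        total = sum(counts[r][c] for r in range(n))
        for r in range(n):
            matrix[r][c] = 100.0 * counts[r][c] / total if total else 0.0
    return classes, matrix


# Toy data: two well-separated categories with three signals each.
features = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1),
            (5.0, 5.0), (5.1, 5.0), (5.0, 5.1)]
labels = ["car", "car", "car", "road", "road", "road"]

classes, matrix = retrieval_confusion(features, labels, k=2)
for name, row in zip(classes, matrix):
    print(name, row)
```

With this normalisation, a diagonal entry is the share of retrieved signals that match the query's category, and the off-diagonal entries in a column show which other categories were retrieved instead.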