
Table 6 Performance of various approaches for the full task of room-localized SAD on DIRHA-sim (left) and DIRHA-real (right)

From: Room-localized speech activity detection in multi-microphone smart homes

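| Approach | Method | DIRHA-sim Recall | DIRHA-sim Precision | DIRHA-sim F-score | DIRHA-real Recall | DIRHA-real Precision | DIRHA-real F-score |
|---|---|---|---|---|---|---|---|
| Single-stage | Best RI | 92.22 | 19.49 | 32.18 | 92.71 | 16.25 | 27.66 |
| | MFCC/GMM | 89.87 | 41.60 | 56.87 | 88.02 | 57.06 | 69.24 |
| | Sohn’s | 73.17 | 17.33 | 28.02 | 73.40 | 17.71 | 28.53 |
| Two-stage baselines | MFCC/GMM | 72.07 | 61.08 | 66.12 | 78.94 | 76.87 | 77.89 |
| | Sohn’s | 43.14 | 21.96 | 29.11 | 46.39 | 22.26 | 30.08 |
| Proposed | Seg (R=5) | 82.16 | 77.35 | 79.68 | 88.27 | 89.30 | 88.78 |
| | Win (R=5) | 83.09 | 78.96 | 80.98 | 86.51 | 88.87 | 87.68 |
| | Win (R=4) | 84.65 | 86.10 | 85.37 | 86.51 | 94.03 | 90.11 |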
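
The F-score columns can be cross-checked against the recall and precision columns. The sketch below assumes the F-score is the balanced harmonic mean of recall and precision (F1); the table does not state this, but the tabulated values appear consistent with it. The function name `f_score` and the choice of rows are illustrative only, and the numbers are copied from the table.

```python
def f_score(recall: float, precision: float) -> float:
    """Balanced F-score (harmonic mean of recall and precision).

    Assumption: this is the F-score definition behind the table.
    Inputs and output share the same scale as the table's values.
    """
    if recall + precision == 0:
        return 0.0
    return 2 * recall * precision / (recall + precision)


# (recall, precision, reported F-score), copied from the table
rows = {
    "Best RI, DIRHA-sim": (92.22, 19.49, 32.18),
    "Win (R=4), DIRHA-sim": (84.65, 86.10, 85.37),
    "Win (R=4), DIRHA-real": (86.51, 94.03, 90.11),
}

for name, (recall, precision, reported) in rows.items():
    computed = f_score(recall, precision)
    print(f"{name}: computed {computed:.2f}, reported {reported:.2f}")
```

Under this definition a high recall cannot compensate for a low precision, which is why Best RI has the highest recall on DIRHA-sim (92.22) but an F-score of only 32.18.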