
Table 2 Room-independent SAD results on the DIRHA-sim (left) and DIRHA-real (right) test sets, further discussed in Section 8.1

From: Room-localized speech activity detection in multi-microphone smart homes

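| Method | DIRHA-sim Recall, GMM | DIRHA-sim Recall, HMM | DIRHA-sim Precision, GMM | DIRHA-sim Precision, HMM | DIRHA-sim F-score, GMM | DIRHA-sim F-score, HMM | DIRHA-real Recall, GMM | DIRHA-real Recall, HMM | DIRHA-real Precision, GMM | DIRHA-real Precision, HMM | DIRHA-real F-score, GMM | DIRHA-real F-score, HMM |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Oracle-best | 96.94 | 94.67 | 94.01 | 96.82 | 95.45 | 95.73 | 93.01 | 95.49 | 95.91 | 96.46 | 94.44 | 95.97 |
| Channel avg. | 87.86 | 82.26 | 76.64 | 83.13 | 81.82 | 82.69 | 65.56 | 71.57 | 89.47 | 87.42 | 75.37 | 78.34 |
| Best act.-SNR | 94.56 | 92.36 | 83.85 | 87.95 | 88.88 | 90.10 | 88.77 | 90.33 | 88.95 | 86.87 | 88.86 | 88.57 |
| Best est.-SNR | 96.60 | 93.63 | 66.56 | 73.54 | 78.81 | 82.38 | 92.43 | 93.41 | 74.38 | 74.02 | 82.43 | 82.59 |
| Sohn’s | 81.22 | | 58.91 | | 68.29 | | 78.05 | | 61.51 | | 68.80 | |
| Decision fusion “u-sum” | 94.39 | 91.08 | 83.60 | 90.97 | 88.67 | 91.01 | 74.76 | 89.11 | 96.54 | 91.70 | 84.26 | 90.39 |
| Decision fusion “w-sum” | 95.00 | 91.78 | 83.57 | 91.82 | 88.92 | 91.80 | 76.87 | 87.37 | 96.58 | 93.37 | 85.67 | 90.27 |
| Decision fusion “u-max” | 74.17 | 82.51 | 75.28 | 73.69 | 74.72 | 77.85 | 45.66 | 68.40 | 97.21 | 95.01 | 62.14 | 79.54 |
| Decision fusion “w-max” | 95.44 | 95.53 | 82.34 | 87.16 | 88.41 | 91.15 | 79.76 | 89.66 | 95.77 | 88.70 | 87.03 | 89.18 |
| Decision fusion “u-vote” | 92.55 | 88.92 | 84.18 | 92.24 | 88.16 | 90.55 | 69.12 | 83.39 | 96.61 | 95.02 | 80.58 | 88.82 |
| Decision fusion “w-vote” | 91.37 | 91.83 | 87.39 | 90.40 | 89.34 | 91.11 | 74.76 | 85.03 | 96.54 | 94.82 | 84.26 | 89.66 |

The Sohn’s row lists a single value per metric, because the source table gives no separate GMM and HMM figures for this method.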
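As a reading aid for the table above, the following minimal sketch assumes that the F-score is the harmonic mean of recall and precision. The table itself does not state this, but the assumption agrees with the Oracle-best and Sohn’s rows to two decimals. The Channel avg. row appears to average per-channel scores, so it need not satisfy the same identity. The function name `f_score` is hypothetical and not taken from the article.

```python
def f_score(recall: float, precision: float) -> float:
    """Harmonic mean of recall and precision (both given in the same unit).

    Assumption: this is the usual F1 definition; the table does not
    spell out which F-score variant was used.
    """
    if recall + precision == 0:
        return 0.0
    return 2 * recall * precision / (recall + precision)


# Oracle-best, DIRHA-sim, GMM: recall 96.94, precision 94.01
oracle_sim_gmm = f_score(96.94, 94.01)
# Sohn's, DIRHA-sim: recall 81.22, precision 58.91
sohn_sim = f_score(81.22, 58.91)

print(round(oracle_sim_gmm, 2))  # table reports 95.45
print(round(sohn_sim, 2))        # table reports 68.29
```

The same check can be applied to any other row to confirm that a recall and precision pair has been read from the correct columns.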