Skip to main content

Advertisement

Table 2 MAVIR database characteristics

From: ALBAYZIN 2016 spoken term detection evaluation: an international open competitive evaluation in Spanish

File ID Data #word occ. dur. (min) #spk. p.563 Ave. MOS
Mavir-02 train 13432 74.51 7 (7 ma.) 2.69
Mavir-03 dev 6681 38.18 2 (1 ma. 1 fe.) 2.83
Mavir-06 train 4332 29.15 3 (2 ma. 1 fe.) 2.89
Mavir-07 dev 3831 21.78 2 (2 ma.) 3.26
Mavir-08 train 3356 18.90 1 (1 ma.) 3.13
Mavir-09 train 11179 70.05 1 (1 ma.) 2.39
Mavir-12 train 11168 67.66 1 (1 ma.) 2.32
Mavir-04 test 9310 57.36 4 (3 ma. 1 fe.) 2.85
Mavir-11 test 3130 20.33 1 (1 ma.) 2.46
Mavir-13 test 7837 43.61 1 (1 ma.) 2.48
ALL train 43467 260.27 13 (12 ma. 1 fe.) -
ALL dev 10512 59.96 4 (3 ma. 1 fe.) -
ALL test 20277 121.3 6 (5 ma. 1 fe.) -
  1. ‘train’ stands for training, ‘dev’ for development, ‘occ.’ stands for occurrences, ‘dur.’ stands for duration, ‘min’ stands for minutes, ‘spk.’ stands for speakers, ‘ma.’ stands for male, ‘fe.’ stands for female, and ‘Ave.’ stands for average