From: Unsupervised domain adaptation for lip reading based on cross-modal knowledge distillation
# Utterance/word
Model
250
500
Baseline
48.21
54.62
Proposed
50.06 (86.65)
55.07 (90.51)