From: Speaker-adaptive-trainable Boltzmann machine and its application to non-parallel voice conversion
Euclidean dist.
Cosine sim.
H=16
0.660
0.526
H=64
0.869
0.337