Skip to main content

Table 2 Reconstruction error of audio source separation using magnitude spectrograms as input

From: Learning long-term filter banks for audio source separation and audio scene classification

Init

Method

Re_toep

Re_inv

  

M/V =0.1

M/V =1

M/V =10

M/V =0.1

M/V =1

M/V =10

–

Null [47]

2.58

0.99

0.033

2.58

0.99

0.033

–

CNN-1layer [22]

2.83

0.96

0.047

2.83

0.96

0.047

–

GaussLTFB

2.49

0.94

0.037

2.60

0.95

0.034

Random

FullLTFB

2.77

1.12

0.080

2.85

1.03

0.043

Identity

FullLTFB

2.50

0.94

0.037

2.82

0.95

0.034

  1. M/V represents the energy ratio between music and voice