Skip to main content

Table 12 Processing time required by our best performing system to process a 1-hour long audio using a CPU and GPU bases setup

From: Multiclass audio segmentation based on recurrent neural networks for broadcast domain data

 Feat extractionInferenceTotal timeRTF
CPU1min 28s28s1min 56s0.032
GPU 2s1min 30s0.025