Fig. 1From: Automated audio captioning: an overview of recent progress and new challengesOverview of an encoder-decoder-based AAC system, where the input is the waveform of an audio clip and the output is a natural language sentence describing the content of the input audio clipBack to article page