Audio captioning

Developing a transformer model that generates natural language descriptions of audio clips using the recently released Clotho dataset, the first audio caption dataset with captions collected without accompanying video.

Avatar
Alisa Liu

Undergraduate researcher at Northwestern interested in NLP and computer processing of audio & music

Related