Wijngaard, G., Formisano, E., Giordano, B. L., & Dumontier, M. (2023). ACES: Evaluating Automated Audio Captioning Models on the Semantics of Sounds. In 31st European Signal Processing Conference, EUSIPCO 2023 - Proceedings (pp. 770-774). IEEE. https://doi.org/10.23919/EUSIPCO58844.2023.10289793