Exploring the Role of Audio in Video Captioning

Publication
2024 Conference on Computer Vision and Pattern Recognition, 7th Multimodal Learning and Applications Workshop