Learning to Segment Referred Objects from Narrated Egocentric Videos

Publication
2024 Conference on Computer Vision and Pattern Recognition