2024-CVPR Putting the Object Back into Video Object Segmentation
motivation: Recent works on VOS employ bottom-up pixel-level memory reading, which struggles due to matching noise, especially in the presence of distractors, resulting in lower performance in more challe