Inspired by 90s edutainment, Final Fantasy, renaissance paintings and editorial illustrators, Louie Zong believes that sitting in the intersection between the past and present is the key to making ...
Abstract: Contrastive Language-Image Pre-training (CLIP) learns robust visual models through language supervision, making it a crucial visual encoding technique for various applications. However, CLIP ...