Open-Vocabulary Object Detection with CLIP