# TODO - How to train encoder with the opposite modality frozen encoder? - How to generate high level prompt? - Why add the category-agnostic vector as global prompt?