![](assets/MoMA/img-240506141200231.png)
# 方法：
1. Multimodal Generative Image-feature Decoder
2. Self-Attention Feature Transfer
3. Multimodal Generative Learning and Diffusion Learning