![](assets/MoMA/img-240506141200231.png) # 方法: 1. Multimodal Generative Image-feature Decoder 2. Self-Attention Feature Transfer 3. Multimodal Generative Learning and Diffusion Learning