Multimodal Large Language Models
多模态大语言模型(Multimodal Large Language Models)
#
VIT,CLIP
#