Tags: Encoders, mathbf, Language, text, objects, MLLM, Large, COST, perception. From: https://blog.csdn.net/qq_63585949/article/details/143628497