OLA-VLM: Elevating Perception in Multimodal LLMs with Auxiliary Embedding DistillationDec 2024·Jitesh Jain,Zhengyuan Yang,Humphrey Shi,Jianfeng Gao,Jianwei Yang· 0 min readGo to Project Site Preprint PDF Cite Code ProjectAbstractTBDTypePreprintPublicationUnder ReviewLast updated on Dec 2024Under Review AuthorsJitesh JainPh.D. Student VCoder: Versatile Vision Encoders for Multimodal Large Language Models Dec 2023 →