Home
Publications
Blogs
Books

Projects
Experience
Teaching
- Learn JavaScript
- Learn Python
Projects

OLA-VLM: Elevating Perception in Multimodal LLMs with Auxiliary Embedding Distillation

Dec 2024·

Jitesh Jain

Jitesh Jain

,

Zhengyuan Yang

,

Humphrey Shi

,

Jianfeng Gao

,

Jianwei Yang

· 0 min read

Go to Project Site Preprint PDF Cite Code Project

Abstract

TBD

Type

Publication

Under Review

Last updated on Dec 2024

Jitesh Jain

Authors

Ph.D. Student

VCoder: Versatile Vision Encoders for Multimodal Large Language Models Dec 2023 →

© {2024} Jitesh Jain

Published with Hugo Blox Builder — the free, open source website builder that empowers creators.