Jitesh Jain

Ph.D. Student

I am a second-year Ph.D. student in the School of Interactive Computing at Georgia Tech, advised by Professor Humphrey Shi. I completed my Bachelor's in Computer Science and Engineering in 2023 at IIT Roorkee.

In the past, I have interned at Microsoft Research, Redmond (Summer 2024, with Dr. Jianwei Yang) and Picsart AI Research (Summer 2021-22, with Dr. Humphrey Shi).

My current research interests revolve around multimodal systems. Presently, I am looking at developing Agent Models leveraging LLMs and principles from cognitive neuroscience. I am also interested in representation learning, efficiency, and various real-world applications of multimodal systems.

My recent work focuses on analyzing and improving the visual perception ability of Multimodal Large Language Models [OLA-VLM, VCoder], building on my experience developing models for dense prediction tasks [OneFormer, SeMask].

Reach out if you are interested in my research or would like to discuss any ideas. If you are a self-motivated researcher looking for guidance on one of your projects, feel free to drop me an email with a brief description of your (manifested) research project.

I am seeking internship opportunities starting in Summer 2025. If you have any openings, please reach out to me!

Professional Life Happenings

  • [December 2024]: Check out OLA-VLM, the result of my internship at Microsoft Research, Redmond! 🚀
  • [May 2024]: Excited to start Summer Internship at Microsoft Research, Redmond! 🧑‍💻
  • [February 2024]: VCoder is accepted to CVPR 2024! See you in Seattle! 🥂
  • [August 2023]: SeMask is accepted to NIVT Workshop at ICCV 2023! 🥂
  • [July 2023]: Graduated from IIT Roorkee with a Bachelor's in Computer Science and Engineering! 🎓
  • [June 2023]: I will be joining Georgia Tech as a Ph.D. student in Computer Science in Fall 2023! 🥂
  • [February 2023]: OneFormer is accepted to CVPR 2023! 🥂
Featured Publications
Blogs

Ascending the Research Trail

In case you missed it, I shared my experience as a rookie undergrad researcher in a previous blog: Riding the Noisy Research Track. Since then, I have grown from a rookie into a more mature beginner researcher with a better outlook on the bigger picture in research (thinking beyond publishing a paper), owing to my close collaboration with SHI Labs over the last couple of years.

Summer Diaries: Intern Diary of an Undergrad DL Researcher

This summer, I worked as a remote research intern at SHI Lab @ University of Oregon (UO) and Picsart AI Research (PAIR). I joined the SHI Lab in November 2020, during my second year, to work with Professor Humphrey Shi and continued my work there over the summer. I joined PAIR in June 2021.

The Contemporary Overthinking Problem

Since the onset of the COVID-19 pandemic and the lockdowns and stay-and-work-from-home situations that followed, the concept of overthinking has gained popularity. Moreover, now and then, I find Gen-Z people (and even me sometimes) replying to “What you doing?

Riding the Noisy Research Track

Alright, people! This article shares my experience and learnings from the last eight months as an undergrad researcher. For those reading one of my blogs for the first time, I am a CSE undergrad (about to enter my 3rd year) working as a Research Intern at SHI Lab @ UO and Picsart.