Jitesh Jain

Ph.D. Student

I am a third-year Ph.D. student in the School of Interactive Computing at Georgia Tech, advised by Humphrey Shi. I completed my Bachelor's in Computer Science and Engineering in 2023 at IIT Roorkee.

I have interned at Allen AI (Summer 2025, with Christopher Clark), Microsoft Research, Redmond (Summer 2024, with Jianwei Yang) and Picsart AI Research (Summer 2021-22, with Humphrey Shi).

My research trajectory so far can be broadly classified into three areas:

  • I began my research career by developing models for dense pixel-level prediction tasks [OneFormer, SeMask].
  • Building on this foundation, I worked on analyzing and improving the visual perception capabilities of Multimodal Large Language Models [VisPer-LM, VCoder].
  • Presently, my work focuses on developing agent systems for long-horizon tasks in real-world settings, leveraging reinforcement learning to have models learn effective multi-turn reasoning [SAGE, AUGUSTUS].

Reach out if you are interested in my research or would like to discuss any ideas. If you are a self-motivated researcher looking for guidance on one of your projects, or you just want to chat about my experience, feel free to drop me an email.

Professional Life Happenings

  • [December 2025]: Check out SAGE, the result of my internship at Allen AI! 🚀
  • [September 2025]: VisPer-LM (f.k.a. OLA-VLM) is accepted to NeurIPS 2025! See you in San Diego! 🥂
  • [May 2025]: Started my Research Internship in the PRIOR team at Allen AI. HMU if you are in Seattle! 🥂
  • [April 2025]: Passed my Ph.D. Qualifying Exam! 🥂
  • [December 2024]: Check out my new blog: what's a good metric for PhD Students? 📜
Featured Publications
Blogs

what's a good metric for PhD Students?

So, I started my PhD in CS at Georgia Tech in Fall 2023. It's only been around 16 months, but it feels more like several years in deep learning research trends.

Ascending the Research Trail

In case you missed it, I shared my experience as a rookie undergrad researcher in a previous blog: Riding the Noisy Research Track. Since then, I have grown from a rookie researcher into a more mature beginner researcher with a better outlook on the bigger picture in research (thinking beyond publishing a paper) owing to my close collaboration with the SHI Labs in the last couple of years.

Summer Diaries: Intern Diary of an Undergrad DL Researcher

This summer, I worked as a remote research intern at SHI Lab @ University of Oregon (UO) and Picsart AI Research (PAIR). I joined the SHI Lab to work with Professor Humphrey Shi in November 2020, during my second year, and continued my work there over the summer. I joined PAIR in June 2021.

The Contemporary Overthinking Problem

Since the onset of the COVID-19 pandemic and the lockdowns and stay-and-work-from-home situations that followed, the concept of Overthinking has gained popularity. Moreover, now and then, I find Gen Z people (and even me sometimes) replying to “What you doing?