Rohan Choudhury

prof_pic.jpg

I’m a final-year PhD student at Carnegie Mellon University’s Robotics Institute, advised by Kris Kitani and László Jeni. My research broadly focuses on making visual models more efficient at understanding and generating visual content. My work is supported by the NSF GRFP Fellowship.

I’m currently also a Student Researcher at ByteDance Seed, collaborating with Peter Lin and Lu Jiang on accelerating video generation. I previously interned at Meta FAIR, working with Jing Huang on efficient video understanding.

Before my PhD, I was a software engineer at Nuro, developing trajectory forecasting models for self-driving vehicles, and earned my bachelor’s degree from Caltech, where I explored multi-agent reinforcement learning with Yisong Yue.

Outside of research, I enjoy running, weightlifting, watching sports, and listening to electronic music.

news

Jun 01, 2025 Honored to be named an Outstanding Reviewer for CVPR 2025!
Mar 01, 2025 Excited to start as a Student Researcher at ByteDance Seed!
Oct 18, 2024 Our work RLT was accepted to NeurIPS 2024 as a spotlight!

selected papers (full list)

  1. rlt_scrat.gif
    Rohan Choudhury, Guanglei Zhu, Sihan Liu, and 3 more authors
    NeurIPS, 2024 (spotlight, top 3%)
  2. ECCV
    Rohan Choudhury, Koichiro Niinuma, Kris M. Kitani, and 1 more author
    ECCV, 2024
  3. ICCV
    Rohan Choudhury, Kris M. Kitani, and László A. Jeni
    ICCV, 2023