Rohan C. Choudhury
I’m a 4th-year PhD student in the Robotics Institute at Carnegie Mellon University, advised by Kris Kitani and László Jeni. I’m supported in part by an NSF GRFP Fellowship.
I’m broadly interested in making video understanding faster and more efficient. In particular, I work on enabling vision algorithms to continuously perceive the world in high resolution and at 30+ FPS.
During my graduate studies, I’ve spent time as a research scientist intern at Meta FAIR (with Jing Huang).Before starting my PhD, I was a software engineer at Nuro, where I worked on trajectory forecasting models for self-driving vehicles. Even before that, I graduated from Caltech. When I’m not working on research, I enjoy running, lifting weights, and listening to electronic music.
news
Oct 18, 2024 | RLT was accepted to NeurIPS 2024 as a spotlight paper! |
---|---|
Jul 14, 2024 | Our paper Video Question Answering with Procedural Programs was accepted to ECCV 2024! |
selected papers
- NeurIPSDon’t Look Twice: Faster Video Tranformers with Run-Length TokenizationNeurIPS, 2024