Balaji Darur
I am an Undergraduate Researcher at the Katha-AI Group, CVIT Lab, IIIT Hyderabad, advised by Prof. Makarand Tapaswi. I am pursuing a B.Tech in Computer Science and an MS in Computational Linguistics by Research.
My research lies at the intersection of Computer Vision and Natural Language Processing, with a focus on story and video understanding. I have worked on problems involving multimodal reasoning, entity coreference in video, and audio-visual perception.
news
| Date | Update |
|---|---|
| Jun 2026 | Attending the MSR Academic Summit at Microsoft Research Lab on Jun 9-10, 2026! |
| Apr 2026 | Paper accepted at CVPR 2026 Findings! One Identity, Many Roles: Multimodal Entity Coreference for Enhanced Video Situation Recognition, with Amanmeet Garg and Makarand Tapaswi. |
| Jan 2026 | Started working at Precog Lab, IIIT Hyderabad, on improving model explainability using Concept Bottleneck models with grounded VLMs. |
| Oct 2025 | Paper accepted as an Oral at EMNLP 2025 (Short Paper, Main Track)! Visual-Aware Speech Recognition for Noisy Scenarios. |
| May 2025 | Started a research internship at Adobe Research, India in Bengaluru, working on style transfer for design documents. |