Balaji Darur

IIIT Hyderabad · Resume

balu_profile_image.jpg

I am an Undergraduate Researcher at the Katha-AI Group, CVIT Lab, IIIT Hyderabad, advised by Prof. Makarand Tapaswi. I am pursuing a B.Tech in Computer Science and an MS in Computational Linguistics by Research.

My research lies at the intersection of Computer Vision and Natural Language Processing, with a focus on story and video understanding. I have worked in problems involving multimodal reasoning, entity coreference in video, and audio-visual perception.

news

Jun 2026 Attending the MSR Academic Summit at Microsoft Research Lab on Jun 9-10, 2026!
Apr 2026 Paper accepted at CVPR 2026 Findings! One Identity, Many Roles: Multimodal Entity Coreference for Enhanced Video Situation Recognition, with Amanmeet Garg and Makarand Tapaswi.
Jan 2026 Started working at Precog Lab, IIIT Hyderabad, on improving model explainability using Concept Bottleneck models with grounded VLMs.
Oct 2025 Paper accepted as an Oral at EMNLP 2025 (Short Paper, Main Track)! Visual-Aware Speech Recognition for Noisy Scenarios.
May 2025 Started a research internship at Adobe Research, India in Bengaluru, working on style transfer for design documents.

selected publications

  1. C2
    cinemec_cvpr_2026.png
    One Identity, Many Roles: Multimodal Entity Coreference for Enhanced Video Situation Recognition
    Balaji Darur, Amanmeet Garg, and Makarand Tapaswi
    In Findings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2026
  2. C1
    vis_aware_emnlp_2025.png
    Visual-Aware Speech Recognition for Noisy Scenarios
    Balaji Darur and Karan Singla
    In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2025
    Oral presentation, Short paper Main Track