Bhavika Devnani

I’m a PhD student at Georgia Tech, advised by Judy Hoffman and James Hays. I’ve been lucky to explore different parts of machine learning through industry research roles at Apple AI/ML Research, Quora, and LinkedIn.

I’m interested in multimodal learning (vision, language, audio) and in making these models efficient, practical, and scalable.

Bhavika Devnani
News
[May 2025] Interning with Apple AI/ML for the summer.
[Oct 2024] ELSA accepted at NeurIPS 2024!
[Aug 2024] Started a PhD at Georgia Tech with Dr Judy Hoffman.
[Dec 2022] Started working at Apple AI/ML Research - led by Dr Samy Bengio.
[Oct 2022] ZSON accepted at NeurIPS 2022!
[Nov 2022] BiSA wins best paper at NeurIPS 2022 - Vision Transformers Workshop!
[Oct 2022] BiSA accepted at NeurIPS 2022 - Vision Transformers Workshop!
Research
ELSA Paper
ELSA: Learning Spatially-Aware Language and Audio Embeddings

Bhavika Devnani, Skyler Seto, Zakaria Aldeneh, Alessandro Toso, Yelena Menyaylenko, Barry-John Theobald, Jonathan Sheaffer, Miguel Sarabia

Accepted at NeurIPS 2024

Generated dataset and trained a model to align 3D spatial audio with open vocabulary captions.

ZSON Paper
ZSON: Zero-Shot Object-Goal Navigation using Multimodal Goal Embeddings

Arjun Majumdar*, Gunjan Aggarwal*, Bhavika Devnani, Judy Hoffman, Dhruv Batra

Accepted at NeurIPS 2022

CLIP enables Zero-Shot Object-Goal Navigation by learning multimodal goal embeddings.

BiSA Paper
Bi-Directional Self-Attention for Vision Transformers

George Stoica, Taylor Hearn, Bhavika Devnani, Judy Hoffman

Accepted at NeurIPS 2022, Vision Transformers Workshop - Best Paper

Refined sources based on surrounding context by inverting self-attention.