About Me

I am Anuj Diwan, a third year PhD student in the Computer Science Department at the University of Texas at Austin. I am fortunate to be co-advised by Prof. David Harwath and Prof. Eunsol Choi. I am part of the broader UT NLP group. My research interests are in the fields of Speech and Natural Language Processing. My current research focuses on multilingual speech generation and speech translation.

I received my B.Tech (with Honors) degree in Computer Science and a Minor degree in Statistics from IIT Bombay in 2021, where I had a wonderful time working with Prof. Preethi Jyothi and Prof. Sunita Sarawagi.

I have also spent some time interning at Google DeepMind (Summer 2023, with Yu Zhang and Ankur Bapna), Meta AI (Summer 2022, with Abdelrahman Mohamed, Wei-Ning Hsu and Ching-Feng Yeh), Adobe Research India (Summer 2020) and UCLouvain (Summer 2019).

In my spare time, I enjoy reading, quizzing, solving wordgames, and watching the latest movies and TV shows.

Interests
  • Speech Recognition
  • Natural Language Processing
  • Artificial Intelligence
  • Machine Learning
Education
  • PhD in Computer Science, 2021-present

    University of Texas at Austin

  • B.Tech in Computer Science and Engineering with Honours, 2017-2021

    Indian Institute of Technology Bombay

  • Minor in Statistics, 2017-2021

    Indian Institute of Technology Bombay

CV

Curriculum Vitae (Last updated Jan 2024)

CV

Publications

(2023). Unit-based Speech-to-Speech Translation Without Parallel Data. Preprint.

PDF

(2022). Continual Learning for On-Device Speech Recognition using Disentangled Conformers. ICASSP 2023.

PDF Cite Poster

(2022). Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality. EMNLP 2022.

PDF Cite Code Slides

(2022). Zero-shot Video Moment Retrieval With Off-the-Shelf Models. TL4NLP@NeurIPS 2022.

PDF Cite Poster

(2021). Reduce and Reconstruct: ASR for Low-Resource Phonetic Languages. Interspeech 2021.

PDF Cite Slides

(2021). Low Resource ASR: The surprising effectiveness of High Resource Transliteration. Interspeech 2021.

PDF Cite Slides

(2021). Multilingual and code-switching ASR challenges for low resource Indian languages. Interspeech 2021.

PDF Cite

Experience

 
 
 
 
 
Student Researcher, Summer 2023
May 2023 – Dec 2023 Mountain View, CA
 
 
 
 
 
AI Research Intern, Summer 2022
May 2022 – Dec 2022 Seattle, WA
 
 
 
 
 
Research Intern, Summer 2020
Apr 2020 – Jul 2020 Bangalore, India
 
 
 
 
 
Research Intern, Summer 2019
May 2019 – Jul 2019 Louvain-la-Neuve, Belgium