About Me

I am Anuj Diwan, a fourth year PhD student in the Computer Science Department at the University of Texas at Austin. I am fortunate to be co-advised by Prof. David Harwath and Prof. Eunsol Choi. I am part of the broader UT NLP group. My research interests are in the fields of Speech and Natural Language Processing. My current research focuses on stylistic and multilingual speech generation.

I received my B.Tech (with Honors) degree in Computer Science and a Minor degree in Statistics from IIT Bombay in 2021, where I had a wonderful time working with Prof. Preethi Jyothi and Prof. Sunita Sarawagi.

I have also spent some time interning at Google DeepMind (Summer 2023, with Yu Zhang and Ankur Bapna), Meta AI (Summer 2022, with Abdelrahman Mohamed, Wei-Ning Hsu and Ching-Feng Yeh), Adobe Research India (Summer 2020) and UCLouvain (Summer 2019).

In my spare time, I enjoy reading, quizzing, solving wordgames, and watching the latest movies and TV shows.

Interests
  • Speech Recognition
  • Natural Language Processing
  • Artificial Intelligence
  • Machine Learning
Education
  • PhD in Computer Science, 2021-present

    University of Texas at Austin

  • B.Tech in Computer Science and Engineering with Honours, 2017-2021

    Indian Institute of Technology Bombay

  • Minor in Statistics, 2017-2021

    Indian Institute of Technology Bombay

CV

Curriculum Vitae (Last updated Jan 2024)

CV

Publications

(2023). Textless Speech-to-Speech Translation With Limited Parallel Data. EMNLP 2024 Findings.

PDF

(2022). Continual Learning for On-Device Speech Recognition using Disentangled Conformers. ICASSP 2023.

PDF Cite Poster

(2022). Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality. EMNLP 2022.

PDF Cite Code Slides

(2022). Zero-shot Video Moment Retrieval With Off-the-Shelf Models. TL4NLP@NeurIPS 2022.

PDF Cite Poster

(2021). Reduce and Reconstruct: ASR for Low-Resource Phonetic Languages. Interspeech 2021.

PDF Cite Slides

(2021). Low Resource ASR: The surprising effectiveness of High Resource Transliteration. Interspeech 2021.

PDF Cite Slides

(2021). Multilingual and code-switching ASR challenges for low resource Indian languages. Interspeech 2021.

PDF Cite

Experience

 
 
 
 
 
Student Researcher, Summer 2023
May 2023 – Dec 2023 Mountain View, CA
 
 
 
 
 
AI Research Intern, Summer 2022
May 2022 – Dec 2022 Seattle, WA
 
 
 
 
 
Research Intern, Summer 2020
Apr 2020 – Jul 2020 Bangalore, India
 
 
 
 
 
Research Intern, Summer 2019
May 2019 – Jul 2019 Louvain-la-Neuve, Belgium