Currently, I am a Senior Research Scientist at Salesken AI where I work on a number of NLP downstream tasks like text-to-text translation, semantic similarity, emotion recognition, sentiment analysis, and text summarization. Previously, I was a pre-doctoral Research Fellow at Microsoft Research India, where I was advised by Sunayana Sitaram (Senior Research at Microsoft Research India). At Microsoft, I worked on improving automatic speech recognition systems for Indian languages. Overall, my research interest lies in NLP and automatic speech recognition.

In the past, I was an intern at the Indian Institute of Science, Bangalore, India (IISc, Bangalore) where I was advised by Prof. Partha Pratim Talukdar. Here, I got an opportunity to explore the use knowledge bases to solve the problem of visual question answering.

I graduated with a B.Tech in Computer Science and MS in Computational Natural Science from IIIT Hyderabad in 2019. For more details, check my CV or hit me up on my email.

Publications

Curriculum Learning for Adapting Models on Code-Mixed Data in Emotion Recognition
Dheeraj Agrawal, Sanket Sanjay Shah, Neeru Dubey, Shubham Sharma, Bharatram Natarajan, Suvro Banerjee and Ashish Kumar
Under Review | Coling'22
pdf| abstract| cite| video

Learning not to Discriminate: Task Agnostic Learning for Improving Monolingual and Code-switched Speech Recognition
Gurunath Reddy, Sanket Shah, Basil Abraham, Vikas Joshi, Sunayana Sitaram
WSTCSMC 2020 | First Workshop on Speech Technologies for Code-switching in Multilingual Communities 2020
pdf| abstract| cite| video

Learning to Recognize Code-switched Speech Without Forgetting Monolingual Speech Recognition
Sanket Shah, Basil Abraham, Gurunath Reddy M, Sunayana Sitaram, Vikas Joshi
arXiv'20
pdf| abstract| cite

Using monolingual speech recognition for spoken term detection in code-switched hindi-english speech
Sanket Shah, Sunayana Sitaram
ICDMW'19 | 2019 International Conference on Data Mining Workshops (ICDMW)
pdf| abstract| cite

CoSSAT: Code-Switched Speech Annotation Tool
Sanket Shah, Pratik Joshi, Sebastin Santy, Sunayana Sitaram
AnnoNLP@EMNLP'19 | Empirical Methods in Natural Language Processing
pdf| abstract| cite| slides

Kvqa: Knowledge-aware visual question answering
Sanket Shah*, Anand Mishra*, Naganand Yadati, Partha Pratim Talukdar
AAAI'19 | 33rd Annual Conference on Association for the Advancement of Artificial Intelligence
pdf| abstract| cite| website

Cross-lingual and Multilingual Spoken Term Detection for Low-Resource Indian Languages
Sanket Shah, Satarupa Guha, Simran Khanuja, Sunayana Sitaram
arXiv'20
pdf| abstract| cite

Unsung Challenges of Building and Deploying Language Technologies for Low Resource Language Communities
Pratik Joshi, Christain Barnes, Sebastin Santy, Simran Khanuja, Sanket Shah, Anirudh Srinivasan, Satwik Bhattamishra, Sunayana Sitaram, Monojit Choudhury and Kalika Bali
ICON'19 | International Conference on Natural Language Processing
pdf| abstract| cite

First Workshop on Speech Processing for Code-switching in Multilingual Communities: Shared Task on Code-switched Spoken Language Identification
Sanket Shah, Sunayana Sitaram, Rupeshkumar Mehta
WSTCSMC 2020 | First Workshop on Speech Technologies for Code-switching in Multilingual Communities 2020
pdf| abstract| cite|

IIIT Hyderabad
2014 - 2019
Indian Institute of Science
F2018
Microsoft Research
2019 - Present
This page has been accessed at least several times since 16th August 2020.