I am an assistant professor in the Paul G. Allen School of Computer Science & Engineering at the University of Washington. I'm also an adjunct professor at the Language Technologies Institute at CMU. I work on Natural Language Processing, a subfield of computer science focusing on computational processing of human languages.
I am particularly interested in hybrid solutions at the intersection of machine learning and theoretical or social linguistics, i.e., solutions that combine interesting learning/modeling methods and insights about human languages or about people speaking these languages.
Much of my research group's work focuses on NLP for social good, multilingual NLP, and language generation. This research is motivated by a unified goal: to extend the capabilities of human language technology beyond individual populations and across language boundaries, thereby enabling NLP for diverse and disadvantaged users, the users that need it most.
Here are my CV and Google Scholar page.
Previously, I was an assistant professor in the Language Technologies Institute, School of Computer Science at Carnegie Mellon University, and before that a postdoc in the Stanford NLP Group. I got my PhD from CMU.
Teaching
Algorithms for NLP (undergraduate IITP course; co-teaching with David Mortensen)
Publications
Controlled Analyses of Social Biases in Wikipedia Bios. Proc. TheWebConf'22. PDF
SimVLM: Simple Visual Language Model Pretraining with Weak Supervision. Proc. ICLR'22. PDF
Controlled Text Generation as Continuous Optimization with Multiple Constraints. Proc. NeurIPS'21. PDF
SelfExplain: A Self-Explaining Architecture for Neural Text Classifiers. Proc. EMNLP'21. PDF
Evaluating the Morphosyntactic Well-formedness of Generated Texts. Proc. EMNLP'21. PDF
Influence Tuning: Demoting Spurious Correlations via Instance Attribution and Instance-Driven Updates. Proc. Findings of EMNLP'21. PDF
Detecting Community Sensitive Norm Violations in Online Conversations. Proc. Findings of EMNLP'21. PDF
Efficient Test Time Adapter Ensembling for Low-resource Language Varieties. Proc. Findings of EMNLP'21. PDF
Simple and Efficient ways to Improve REALM. Proc. MRQA'21. PDF
Improving the Diversity of Unsupervised Paraphrasing with Embedding Outputs. Proc. MRL'21. PDF
Improving Span Representation for Domain-adapted Coreference Resolution. Proc. CRAC'21. PDF
A Survey of Race, Racism, and Anti-Racism in NLP. Proc. ACL'21. PDF
Machine Translation into Low-resource Language Varieties. Proc. ACL'21. PDF
Synthesizing Adversarial Negative Responses for Robust Response Ranking and Evaluation. Proc. Findings of ACL'21. PDF
Understanding Factuality in Abstractive Summarization with FRANK: A Benchmark for Factuality Metrics. Proc. NAACL'21. PDF
Controlling Dialogue Generation with Semantic Exemplars. Proc. NAACL'21. PDF
DialoGraph: Incorporating Interpretable Strategy-Graph Networks into Negotiation Dialogues. Proc. ICLR'21. PDF
Gradient Vaccine: Investigating and Improving Multi-task Optimization in Massively Multilingual Models. (Spotlight) Proc. ICLR'21. PDF
StructSum: Incorporating Latent and Explicit Sentence Dependencies for Single Document Summarization. Proc. EACL'21. PDF
Ranking Transfer Languages with Pragmatically-Motivated Features for Multilingual Sentiment Analysis. Proc. EACL'21. PDF
Multilingual Contextual Affective Analysis of LGBT People Portrayals in Wikipedia. Proc. ICWSM'21. PDF
An Exploration of Data Augmentation Techniques for Improving English to Tigrinya Translation. Proc. AfricaNLP'21. PDF
Unsupervised Discovery of Implicit Gender Bias. Proc. EMNLP'20. PDF
On Negative Interference in Multilingual Models: Findings and A Meta-Learning Treatment. Proc. EMNLP'20. PDF
Fortifying Toxic Speech Detectors Against Veiled Toxicity. Proc. EMNLP'20. PDF
Automatic Extraction of Rules Governing Morphological Agreement. Proc. EMNLP'20. PDF
Understanding Linguistic Accommodation in Code-Switched Human-Machine Dialogues. Proc. CoNLL'20. PDF
LTIatCMU at SemEval-2020 Task 11: Incorporating Multi-Level Features for Multi-Granular Propaganda Span Identification. Proc. SemEval'20. PDF
A Computational Analysis of Polarization on Indian and Pakistani Social Media. (Nominated for best paper award) Proc. SocInfo'20. PDF
A framework for the computational linguistic analysis of dehumanization. Frontiers in Artificial Intelligence. PDF
Demoting Racial Bias in Hate Speech Detection. Proc. SocialNLP'20. PDF
A Deep Reinforced Model for Cross-Lingual Summarization with Bilingual Semantic Similarity Reward. Proc. WNGT'20. PDF
Explaining Black Box Predictions and Unveiling Data Artifacts through Influence Functions. Proc. ACL'20. PDF
Balancing Training for Multilingual Neural Machine Translation. Proc. ACL'20. PDF
Stress and Burnout in Open Source: Toward Finding, Understanding, and Mitigating Unhealthy Interactions. Proc. of International Conference on Software Engineering -- New Ideas Track (ICSE-NIER). PDF
Augmenting Non-Collaborative Dialog Systems with Explicit Semantic and Strategic Dialog History. Proc. ICLR'20. PDF
What Code-Switching Strategies are Effective in Dialog Systems? Proc. SCiL'20. PDF
Where New Words Are Born: Distributional Semantic Analysis of Neologisms and Their Semantic Neighborhoods. Proc. SCiL'20. PDF
Topics to Avoid: Demoting Latent Confounds in Text Classification. Proc. EMNLP'19. PDF
Finding Microaggressions in the Wild: A Case for Locating Elusive Phenomena in Social Media Posts. Proc. EMNLP'19. PDF
Learning to Generate Word- and Phrase-Embeddings for Efficient Phrase-Based Neural Machine Translation. Proc. WNGT'19. PDF
A Margin-based Loss with Synthetic Negative Samples for Continuous-output Machine Translation. Proc. WNGT'19. PDF
A Dynamic Strategy Coach for Effective Negotiation. Proc. SIGdial'19. PDF
Entity-Centric Contextual Affective Analysis. Proc. ACL'19. PDF
CMU-01 at the SIGMORPHON 2019 Shared Task on Crosslinguality and Context in Morphology. Proc. SIGMORPHON'19. PDF (Interpretability Prize)
Quantifying Social Biases in Contextual Word Representations. Proc. of Workshop on Gender Bias for NLP. PDF
Contextual Affective Analysis: A Case Study of People Portrayals in Online #MeToo Stories. Proc. ICWSM'19. PDF
Black is to Criminal as Caucasian is to Police: Detecting and Removing Multiclass Bias in Word Embeddings. Proc. NAACL'19. PDF
Von Mises-Fisher Loss for Training Sequence to Sequence Models with Continuous Outputs. Proc. ICLR'19. PDF
Framing and Agenda-setting in Russian News: a Computational Analysis of Intricate Political Strategies. Proc. EMNLP'18. PDF
Style Transfer Through Back-Translation. Proc. ACL'18. PDF
Native Language Cognate Effects on Second Language Lexical Choice. Transactions of the Association for Computational Linguistics (TACL). 2018. PDF DATA
RtGender: A Corpus for Studying Differential Responses to Gender. Proc. LREC'18. PDF DATA
Incorporating Dialectal Variability for Socially Equitable Language Identification. Proc. ACL'17. PDF CODE
Writer Profiling Without the Writer's Text. Proc. SocInfo'17. PDF
Linguistic Knowledge in Data-Driven Natural Language Processing. PhD thesis, September 2016. PDF
Learning the Curriculum with Bayesian Optimization for Task-Specific Word Representation Learning. Proc. ACL'16. PDF
Correlation-based Intrinsic Evaluation of Word Vector Representations. In RepEval'16. PDF CODE
Problems With Evaluation of Word Embeddings Using Word Similarity Tasks. In RepEval'16. PDF
Polyglot Neural Language Models: Case Study in Cross-Lingual Phonetic Representation Learning. Proc. NAACL'16. PDF
Morphological Inflection Generation Using Character Sequence to Sequence Learning. Proc. NAACL'16. PDF
Massively Multilingual Word Embeddings. arXiv preprint. PDF
Cross-Lingual Bridges with Models of Lexical Borrowing. Journal of Artificial Intelligence Research (JAIR). 2016. PDF
Evaluation of Word Vector Representations by Subspace Alignment. In Proc. EMNLP'15. PDF CODE
Not All Contexts Are Created Equal: Better Word Representations with Variable Attention. In Proc. EMNLP'15. PDF
Lexicon Stratification for Translating Out-of-Vocabulary Words. In Proc. ACL'15. PDF
Sparse Overcomplete Word Vector Representations. In Proc. ACL'15. PDF
A Bottom Up Approach to Category Mapping and Meaning Change. In Proc. NetWordS'15. PDF
Constraint-Based Models of Lexical Borrowing. In Proc. NAACL'15. PDF
Identification of Multi-word Expressions by Combining Multiple Linguistic Information Sources. Computational Linguistics, 40(2):449-468, 2014. PDF
Metaphor Detection with Cross-Lingual Model Transfer. In Proc. ACL'14. PDF CODE DATA
Augmenting Translation Models with Simulated Acoustic Confusions for Improved Spoken Language Translation. In Proc. EACL'14. PDF
Augmenting English Adjective Senses with Supersenses. In Proc. LREC'14. PDF CODE DATA
Unified Annotation Scheme for the Semantic/Pragmatic Components of Definiteness. In Proc. LREC'14. PDF DATA
Automatic Classification of Communicative Functions of Definiteness. In Proc. COLING'14. PDF
The CMU Machine Translation Systems at WMT 2014. In Proc. WMT'14. PDF
Generating English Determiners in Phrase-Based Translation with Synthetic Translation Options. In Proc. WMT'13. PDF
The CMU Machine Translation Systems at WMT 2013: Syntax, Synthetic Translation Options, and Pseudo-References. In Proc. WMT'13. PDF
Identifying the L1 of non-native writers: the CMU-Haifa system. In Proc. the 8th Workshop on Innovative Use of NLP for Building Educational Applications, 2013. PDF
Cross-Lingual Metaphor Detection Using Common Semantic Features. In Proc. Meta4NLP Workshop, 2013. PDF
Identification and Modeling of Word Fragments in Spontaneous Speech. In Proc. ICASSP'13. PDF
Extraction of Multi-word Expressions from Small Parallel Corpora. In Natural Language Engineering 18(4):549-573, 2012. PDF
Identification of Multi-word Expressions by Combining Multiple Linguistic Information Sources. In Proc. EMNLP'11. PDF
Extraction of Multi-word Expressions from Small Parallel Corpora. University of Haifa M.Sc. thesis, September 2010. PDF
Extraction of Multi-word Expressions from Small Parallel Corpora. In Proc. COLING'10. PDF
Automatic Acquisition of Parallel Corpora from Websites with Dynamic Content. In Proc. LREC'10. PDF