Much of my research group's work focuses on understanding and advancing large language models, AI ethics, multilingual learning, and machine learning for NLP. This research is motivated by a unified goal: to extend the capabilities of human language technology beyond individual populations and across language and culture boundaries, thereby enabling NLP for all users.
Here are my CV and Google Scholar page.
Previously, I was an assistant professor in the Language Technologies Institute, School of Computer Science at Carnegie Mellon University, and before that a postdoc in the Stanford NLP Group. I got my PhD from CMU. Honors include the NSF CAREER award, Sloan fellowship, Okawa research award, best/outstanding paper awards and runner-ups, and several industry research faculty awards.
Teaching
Algorithms for NLP (undergraduate IITP course; co-teaching with David Mortensen)
Algorithms for NLP (undergraduate IITP course; co-teaching with David Mortensen)
P3Sum: Preserving Author's Perspective in News Summarization with Diffusion Language Models. PDF
Proc. NAACL.Extracting Lexical Features from Dialects via Interpretable Dialect Classifiers. PDF
Proc. NAACL.BUFFET: Benchmarking Large Language Models for Few-shot Cross-lingual Transfer. PDF
Proc. NAACL.Trusting Your Evidence: Hallucinate Less with Context-aware Decoding. PDF
Proc. NAACL.SemStamp: A Semantic Watermark with Paraphrastic Robustness for Text Generation. PDF
Proc. NAACL.LatticeGen: Hiding Generated Text in a Lattice for Privacy-Aware Large Language Model Generation on Cloud. PDF
Proc. NAACL Findings.KGQuiz: Evaluating the Generalization of Encoded Knowledge in Large Language Models. (Oral) PDF
Proc. WebConf.Gen-Z: Generative Zero-Shot Text Classification with Contextualized Label Descriptions. PDF
Proc. ICLR.Knowledge Card: Filling LLMs' Knowledge Gaps with Plug-in Specialized Language Models. (Oral) PDF
Proc. ICLR.Can LLMs Keep a Secret? Testing Privacy Implications of Language Models via Contextual Integrity Theory. (Spotlight) PDF
Proc. ICLR.Quantifying Language Models' Sensitivity to Spurious Features in Prompt Design or: How I Learned to Start Worrying about Prompt Formatting. PDF
Proc. ICLR.Can Language Models Solve Graph Problems in Natural Language? (Spotlight paper) PDF
Proc. NeurIPS.MatFormer: Nested Transformer for Elastic Inference. (Best paper award) PDF
Proc. ENLSP @ NeurIPS 2023.Do All Languages Cost the Same? Tokenization in the Era of Commercial Language Models. PDF
Proc. EMNLP.FactKB: Generalizable Factuality Evaluation using Language Models Enhanced with Factual Knowledge. PDF
Proc. EMNLP.GlobalBench: A Benchmark for Global Progress in Natural Language Processing. PDF
Proc. EMNLP.BotPercent: Estimating Twitter Bot Populations from Groups to Crowds. PDF
Proc. EMNLP Findings.Toward Human Readable Prompt Tuning: Kubrick's The Shining is a good movie, and a good prompt too? PDF
Proc. EMNLP Findings.On the Zero-Shot Generalization of Machine-Generated Text Detectors. PDF
Proc. EMNLP Findings.TalkUp: A Novel Dataset Paving the Way for Understanding Empowering Language. PDF
Proc. EMNLP Findings.LEXPLAIN: Improving Model Explanations via Lexicon Supervision. PDF
Proc. StarSEM.Minding Language Models' (Lack of) Theory of Mind: A Plug-and-Play Multi-Character Belief Tracker. (Outstanding paper award) PDF
Proc. ACL.From Pretraining Data to Language Models to Downstream Tasks: Tracking the Trails of Political Biases Leading to Unfair NLP Models. (Best paper award) PDF
Proc. ACL.SSD-LM: Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control. PDF
Proc. ACL.KALM: Knowledge-Aware Integration of Local, Document, and Global Contexts for Long Document Understanding. PDF
Proc. ACL.Understanding In-Context Learning via Supportive Pretraining Data. PDF
Proc. ACL.On the Blind Spots of Model-Based Evaluation Metrics for Text Generation. PDF
Proc. ACLExamining Risks of Racial Biases in NLP Tools for Child Protective Services. PDF
Proc. FAccT.Language Generation Models Can Cause Harm: So What Can We Do About It? An Actionable Survey. PDF
Proc. EACL.Unsupervised Keyphrase Extraction via Interpretable Neural Networks. PDF
Proc. EACL.An Analysis of Emotions and the Prominence of Positivity in #BlackLivesMatter Tweets. PDF
Proc. PNAS.Correcting Diverse Factual Errors in Abstractive Summarization via Post-Editing and Language Model Infilling. PDF
Proc. EMNLP.Referee: Reference-Free Sentence Summarization with Sharper Controllability through Symbolic Knowledge Distillation. PDF
Proc. EMNLP.Gradient-based Constrained Sampling from Language Models. PDF
Proc. EMNLP.Gendered Mental Health Stigma in Masked Language Models. PDF
Proc. EMNLP.Challenges and Opportunities in Information Manipulation Detection: An Examination of Wartime Russian Media. PDF
Proc. Findings of EMNLP.Threat Scenarios and Best Practices to Detect Neural Fake News. PDF
Proc. COLING.Speaker Information Can Guide Models to Better Inductive Biases: A Case Study On Predicting Code-Switching. PDF
Proc. ACL'22.Controlled Analyses of Social Biases in Wikipedia Bios. (Wikimedia Foundation research award of the year) PDF
Proc. TheWebConf'22.SimVLM: Simple Visual Language Model Pretraining with Weak Supervision. PDF
Proc. ICLR'22.Controlled Text Generation as Continuous Optimization with Multiple Constraints. PDF
Proc. NeurIPS'21.SelfExplain: A Self-Explaining Architecture for Neural Text Classifiers. PDF
Proc. EMNLP'21.Evaluating the Morphosyntactic Well-formedness of Generated Texts. PDF
Proc. EMNLP'21.Influence Tuning: Demoting Spurious Correlations via Instance Attribution and Instance-Driven Updates. PDF
Proc. Findings of EMNLP'21.Detecting Community Sensitive Norm Violations in Online Conversations. PDF
Proc. Findings of EMNLP'21.Efficient Test Time Adapter Ensembling for Low-resource Language Varieties. PDF
Proc. Findings of EMNLP'21.Simple and Efficient ways to Improve REALM. PDF
Proc. MRQA'21.Improving the Diversity of Unsupervised Paraphrasing with Embedding Outputs. PDF
Proc. MRL'21.Improving Span Representation for Domain-adapted Coreference Resolution. PDF
Proc. CRAC'21.A Survey of Race, Racism, and Anti-Racism in NLP. PDF
Proc. ACL'21.Machine Translation into Low-resource Language Varieties. PDF
Proc. ACL'21.Synthesizing Adversarial Negative Responses for Robust Response Ranking and Evaluation. PDF
Proc. Findings of ACL'21.Understanding Factuality in Abstractive Summarization with FRANK: A Benchmark for Factuality Metrics. PDF
Proc. NAACL'21.Controlling Dialogue Generation with Semantic Exemplars. PDF
Proc. NAACL'21.DialoGraph: Incorporating Interpretable Strategy-Graph Networks into Negotiation Dialogues. PDF
Proc. ICLR'21.Gradient Vaccine: Investigating and Improving Multi-task Optimization in Massively Multilingual Models. (Spotlight paper) PDF
Proc. ICLR'21.StructSum: Incorporating Latent and Explicit Sentence Dependencies for Single Document Summarization. PDF
Proc. EACL'21.Ranking Transfer Languages with Pragmatically-Motivated Features for Multilingual Sentiment Analysis. PDF
Proc. EACL'21.Multilingual Contextual Affective Analysis of LGBT People Portrayals in Wikipedia. PDF
Proc. ICWSM'21.An Exploration of Data Augmentation Techniques for Improving English to Tigrinya Translation. PDF
Proc. AfricaNLP'21.Unsupervised Discovery of Implicit Gender Bias. PDF
Proc. EMNLP'20.On Negative Interference in Multilingual Models: Findings and A Meta-Learning Treatment. PDF
Proc. EMNLP'20.Fortifying Toxic Speech Detectors Against Veiled Toxicity. PDF
Proc. EMNLP'20.Automatic Extraction of Rules Governing Morphological Agreement. PDF
Proc. EMNLP'20.Understanding Linguistic Accommodation in Code-Switched Human-Machine Dialogues. PDF
Proc. CoNLL'20.LTIatCMU at SemEval-2020 Task 11: Incorporating Multi-Level Features for Multi-Granular Propaganda Span Identification. PDF
Proc. SemEval'20.A Computational Analysis of Polarization on Indian and Pakistani Social Media. (Best paper runner-up) PDF
Proc. SocInfo'20.A framework for the computational linguistic analysis of dehumanization. PDF
Frontiers in Artificial Intelligence.Demoting Racial Bias in Hate Speech Detection. PDF
Proc. SocialNLP'20.A Deep Reinforced Model for Cross-Lingual Summarization with Bilingual Semantic Similarity Reward. PDF
Proc. WNGT'20.Explaining Black Box Predictions and Unveiling Data Artifacts through Influence Functions PDF
Proc. ACL'20.Balancing Training for Multilingual Neural Machine Translation PDF
Proc. ACL'20.Stress and Burnout in Open Source: Toward Finding, Understanding, and Mitigating Unhealthy Interactions PDF
Proc. of International Conference on Software Engineering -- New Ideas Track (ICSE-NIER).Augmenting Non-Collaborative Dialog Systems with Explicit Semantic and Strategic Dialog History PDF
Proc. ICLR'20.What Code-Switching Strategies are Effective in Dialog Systems? PDF
Proc. SCiL'20.Where New Words Are Born: Distributional Semantic Analysis of Neologisms and Their Semantic Neighborhoods PDF
Proc. SCiL'20.Topics to Avoid: Demoting Latent Confounds in Text Classification PDF
Proc. EMNLP'19.Finding Microaggressions in the Wild: A Case for Locating Elusive Phenomena in Social Media PostsPDF
Proc. EMNLP'19.Learning to Generate Word- and Phrase-Embeddings for Efficient Phrase-Based Neural Machine Translation PDF
Proc. WNGT'19.A Margin-based Loss with Synthetic Negative Samples for Continuous-output Machine Translation PDF
Proc. WNGT'19.A Dynamic Strategy Coach for Effective Negotiation PDF
Proc. SIGdial'19.Entity-Centric Contextual Affective AnalysisPDF
Proc. ACL'19.CMU-01 at the SIGMORPHON 2019 Shared Task on Crosslinguality and Context in Morphology. (Interpretability Prize)PDF
Proc. SIGMORPHON'19.Quantifying Social Biases in Contextual Word RepresentationsPDF
Proc. of Workshop on Gender Bias for NLP.Contextual Affective Analysis: A Case Study of People Portrayals in Online #MeToo StoriesPDF
Proc. ICWSM'19.Black is to Criminal as Caucasian is to Police: Detecting and Removing Multiclass Bias in Word EmbeddingsPDF
Proc. NAACL'19.Von Mises-Fisher Loss for Training Sequence to Sequence Models with Continuous OutputsPDF
Proc. ICLR'19.Framing and Agenda-setting in Russian News: a Computational Analysis of Intricate Political StrategiesPDF
Proc. EMNLP'18.Style Transfer Through Back-TranslationPDF
Proc. ACL'18.Native Language Cognate Effects on Second Language Lexical ChoicePDF DATA
Proceedings of the Transactions of Association for Computational Linguistics (TACL). 2018.RtGender: A Corpus for Studying Differential Responses to GenderPDF DATA
Proc. LREC'18.Incorporating Dialectal Variability for Socially Equitable Language IdentificationPDF CODE
Proc. ACL'17.Writer Profiling Without the Writer's TextPDF
Proc. SocInfo'17.Linguistic Knowledge in Data-Driven Natural Language ProcessingPDF
PhD thesis, September 2016.Learning the Curriculum with Bayesian Optimization for Task-Specific Word Representation LearningPDF
Proc. ACL'16.Correlation-based Intrinsic Evaluation of Word Vector RepresentationsPDF CODE
In RepEval'16.Problems With Evaluation of Word Embeddings Using Word Similarity TasksPDF
In RepEval'16.Polyglot Neural Language Models: Case Study in Cross-Lingual Phonetic Representation LearningPDF
Proc. NAACL'16.Morphological Inflection Generation Using Character Sequence to Sequence LearningPDF
Proc. NAACL'16.Massively Multilingual Word Embeddings PDF
arXiv preprintCross-Lingual Bridges with Models of Lexical Borrowing.PDF
Journal of Artificial Intelligence Research (JAIR). 2016.Evaluation of Word Vector Representations by Subspace Alignment.PDF CODE
In Proc. EMNLP'15.Not All Contexts Are Created Equal: Better Word Representations with Variable Attention.PDF
In Proc. EMNLP'15.Lexicon Stratification for Translating Out-of-Vocabulary Words.PDF
In Proc. ACL'15.Sparse Overcomplete Word Vector Representations.PDF
In Proc. ACL'15.A Bottom Up Approach to Category Mapping and Meaning Change.PDF
In Proc. NetWordS'15.Constraint-Based Models of Lexical Borrowing.PDF
In Proc. NAACL'15.Identification of Multi-word Expressions by Combining Multiple Linguistic Information Sources.PDF
Computational Linguistics, 40(2):449-468, 2014.Metaphor Detection with Cross-Lingual Model Transfer.PDF CODE DATA
In Proc. ACL'14.Augmenting Translation Models with Simulated Acoustic Confusions for Improved Spoken Language Translation.PDF
In Proc. EACL'14.Augmenting English Adjective Senses with Supersenses.PDF CODE DATA
In Proc. LREC'14.Unified Annotation Scheme for the Semantic/Pragmatic Components of Definiteness.PDF DATA
In Proc. LREC'14.Automatic Classification of Communicative Functions of Definiteness.PDF
In Proc. COLING'14.The CMU Machine Translation Systems at WMT 2014.PDF
In Proc. WMT'14.Generating English Determiners in Phrase-Based Translation with Synthetic Translation Options.PDF
In Proc. WMT'13.The CMU Machine Translation Systems at WMT 2013: Syntax, Synthetic Translation Options, and Pseudo-References.PDF
In Proc. WMT'13.Identifying the L1 of non-native writers: the CMU-Haifa system.PDF
In Proc. the 8th Workshop on Innovative Use of NLP for Building Educational Applications, 2013.Cross-Lingual Metaphor Detection Using Common Semantic Features.PDF
In Proc. Meta4NLP Workshop, 2013.Identification and Modeling of Word Fragments in Spontaneous Speech.PDF
In Proc. ICASSP'13.Extraction of Multi-word Expressions from Small Parallel Corpora.PDF
In Natural Language Engineering 18(4):549-573, 2012.Identification of Multi-word Expressions by Combining Multiple Linguistic Information Sources.PDF
In Proc. EMNLP'11.Extraction of Multi-word Expressions from Small Parallel Corpora.PDF
University of Haifa M.Sc. thesis, September 2010.Extraction of Multi-word Expressions from Small Parallel Corpora.PDF
In Proc. COLING'10.Automatic Acquisition of Parallel Corpora from Websites with Dynamic Content.PDF
In Proc. LREC'10.