Hello! I am a fifth-year PhD student in computer science at the University of Washington, advised by Yejin Choi and Noah Smith. My research area is natural language processing, with interests particularly in tokenization, decoding-time algorithms, and data creation. I am grateful to be supported by the NSF Graduate Research Fellowship and OpenAI SuperAlignment Fellowship.

Previously I was an undergraduate at Northwestern University where I majored in computer science and math. There, I was very fortunate to be mentored by Professor Doug Downey, Professor Bryan Pardo, and Dr. Prem Seetharaman.

Education

PhD student, 2020 - present

University of Washington
BA in Computer Science, Mathematics, 2020

Northwestern University

Publications

Data Mixture Inference: What do BPE Tokenizers Reveal about their Training Data?.
Jonathan Hayase*, Alisa Liu*, Yejin Choi, Sewoong Oh, Noah A. Smith. NeurIPS 2024.

Paper BibTeX Code

Tuning Language Models by Proxy.
Alisa Liu, Xiaochuang Han, Yizhong Wang, Yulia Tsvetkov, Yejin Choi, Noah A. Smith. COLM 2024 (Spotlight 🌟, top 7%).

Paper BibTeX Code

We're Afraid Language Models Aren't Modeling Ambiguity.
Alisa Liu, Zhaofeng Wu, Julian Michael, Alane Suhr, Peter West, Alexander Koller, Swabha Swayamdipta, Noah A. Smith, Yejin Choi. EMNLP 2023.

Paper BibTeX Code Dataset

That was the last straw, we need more: Are Translation Systems Sensitive to Disambiguating Context?.
Jaechan Lee, Alisa Liu, Orevaoghene Ahia, Hila Gonen, Noah A. Smith. EMNLP Findings 2023.

Paper BibTeX Code

Inverse Scaling: When Bigger Isn't Better.
Ian R. McKenzie, 18 others, Alisa Liu, Jiacheng Liu, Tom Tseng, Tomasz Korbak, Najoung Kim, Samuel R. Bowman, Ethan Perez. TMLR 2023 (Featured 🌟).

Paper BibTeX Code

How Language Model Hallucinations Can Snowball.
Muru Zhang, Ofir Press, William Merrill, Alisa Liu, Noah A. Smith. ICML 2024.

Paper BibTeX Code

Self-Instruct: Aligning Language Models with Self-Generated Instructions.
Yizhong Wang, Yeganeh Kordi, Swaroop Mishra, Alisa Liu, Noah A. Smith, Daniel Khashabi, Hannaneh Hajishirzi. ACL 2023.

Paper BibTeX Code

Detoxifying Text with MaRCo: Controllable Revision with Experts and Anti-Experts.
Skyler Hallinan, Alisa Liu, Yejin Choi, Maarten Sap. ACL 2023.

Paper BibTeX

WANLI: Worker and AI Collaboration for Natural Language Inference Dataset Creation.
Alisa Liu, Swabha Swayamdipta, Noah A. Smith, Yejin Choi. EMNLP Findings 2022.

Paper BibTeX Code Dataset Poster Slides Demo News

Generated Knowledge Prompting for Commonsense Reasoning.
Jiacheng Liu, Alisa Liu, Ximing Lu, Sean Welleck, Peter West, Ronan Le Bras, Yejin Choi, Hannaneh Hajishirzi. ACL 2022.

Paper BibTeX Code

DExperts: Decoding-Time Controlled Text Generation with Experts and Anti-Experts.
Alisa Liu, Maarten Sap, Ximing Lu, Swabha Swayamdipta, Chandra Bhagavatula, Noah A. Smith, Yejin Choi. ACL 2021.

Paper BibTeX Code Slides News

See all publications