Alisa Liu
Alisa Liu
Home
Publications
evaluation
We're Afraid Language Models Aren't Modeling Ambiguity
We build a benchmark to evaluate LM understanding of ambiguity, which is an intrinsic feature of language, and find that the task remains extremely challenging, including for GPT-4
Alisa Liu
,
Zhaofeng Wu
,
Julian Michael
,
Alane Suhr
,
Peter West
,
Alexander Koller
,
Swabha Swayamdipta
,
Noah A. Smith
,
Yejin Choi
Paper
BibTeX
Code
Dataset
That was the last straw, we need more: Are Translation Systems Sensitive to Disambiguating Context?
The translation of ambiguous text presents a challenge for translation systems, as it requires using the surrounding context to …
Jaechan Lee
,
Alisa Liu
,
Orevaoghene Ahia
,
Hila Gonen
,
Noah A. Smith
Paper
BibTeX
Code
Inverse Scaling: When Bigger Isn't Better
Work on scaling laws has found that large language models (LMs) show predictable improvements to overall loss with increased scale …
Ian R. McKenzie
,
18 others
,
Alisa Liu
,
Jiacheng Liu
,
Tom Tseng
,
Tomasz Korbak
,
Najoung Kim
,
Samuel R. Bowman
,
Ethan Perez
Paper
BibTeX
Code
BibTeX
×