Home
Projects
People
Publications
Coding Aperitivo
Reading Group
Alumni
Join us
Contact
hate speech
HONEST: Measuring Hurtful Sentence Completion in Language Models
Language models have revolutionized the field of NLP. However, language models capture and proliferate hurtful stereotypes, especially in text generation. Our results show that **4.3% of the time, language models complete a sentence with a hurtful …
«
Cite
×