Publications
Divine LLaMAs: Bias, Stereotypes, Stigmatization, and Emotion Representation of Religion in Large Language Models
September, 2024
Emotions play important epistemological and cognitive roles in our lives, revealing our values and guiding our actions. Previous work …
Flor Miriam Plaza-del-Arco
,
Amanda Cercas Curry
,
Susanna Paoli
,
Alba Curry
,
Dirk Hovy
PDF
Cite
Project
Political Compass or Spinning Arrow? Towards More Meaningful Evaluations for Values and Opinions in Large Language Models
August, 2024
Much recent work seeks to evaluate values and opinions in large language models (LLMs) using multiple-choice surveys and …
Paul Röttger
,
Valentin Hofmann
,
Valentina Pyatkin
,
Musashi Hinck
,
Hannah Rose Kirk
,
Hinrich Schuetze
,
Dirk Hovy
PDF
Cite
Project
My Answer is C: First-Token Probabilities Do Not Match Text Answers in Instruction-Tuned Language Models
August, 2024
The open-ended nature of language generation makes the evaluation of autoregressive large language models (LLMs) challenging. One …
Xinpeng Wang
,
Bolei Ma
,
Chengzhi Hu
,
Leon Weber-Genzel
,
Paul Röttger
,
Frauke Kreuter
,
Dirk Hovy
,
Barbara Plank
PDF
Cite
Project
Compromesso! Italian Many-Shot Jailbreaks Undermine the Safety of Large Language Models
August, 2024
As diverse linguistic communities and users adopt large language models (LLMs), assessing their safety across languages becomes …
Fabio Pernisi
,
Dirk Hovy
,
Paul Röttger
PDF
Cite
Project
XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models
July, 2024
Without proper safeguards, large language models will readily follow malicious instructions and generate toxic content. This risk …
Paul Röttger
,
Hannah Rose Kirk
,
Bertie Vidgen
,
Giuseppe Attanasio
,
Federico Bianchi
,
Dirk Hovy
PDF
Cite
Project
DADIT: A Dataset for Demographic Classification of Italian Twitter Users and a Comparison of Prediction Methods
May, 2024
Social scientists increasingly use demographically stratified social media data to study the attitudes, beliefs, and behavior of the …
Lorenzo Lupo
,
Paul Bose
,
Mahyar Habibi
,
Dirk Hovy
,
Carlo Schwarz
PDF
Cite
Project
Beyond Flesch-Kincaid: Prompt-based Metrics Improve Difficulty Classification of Educational Texts
May, 2024
Using large language models (LLMs) for educational applications like dialogue-based teaching is a hot topic. Effective teaching, …
Donya Rooein
,
Paul Röttger
,
Anastassia Shaitarova
,
Dirk Hovy
PDF
Cite
Project
Safety-Tuned LLaMAs: Lessons From Improving the Safety of Large Language Models that Follow Instructions
May, 2024
Training large language models to follow instructions makes them perform better on a wide range of tasks, generally becoming more …
Federico Bianchi
,
Mirac Suzgun
,
Giuseppe Attanasio
,
Paul Röttger
,
Dan Jurafsky
,
Tatsunori Hashimoto
,
James Zou
PDF
Cite
Project
SafetyPrompts: a Systematic Review of Open Datasets for Evaluating and Improving Large Language Model Safety
April, 2024
The last two years have seen a rapid growth in concerns around the safety of large language models (LLMs). Researchers and …
Paul Röttger
,
Fabio Pernisi
,
Bertie Vidgen
,
Dirk Hovy
PDF
Cite
Project
Emotion Analysis in NLP: Trends, Gaps and Roadmap for Future Directions
March, 2024
Emotions are a central aspect of communication. Consequently, emotion analysis (EA) is a rapidly growing field in natural language …
Flor Miriam Plaza-del-Arco
,
Alba Curry
,
Amanda Cercas Curry
,
Dirk Hovy
PDF
Cite
Project
Conversations as a Source for Teaching Scientific Concepts at Different Education Levels
March, 2024
Open conversations are one of the most engaging forms of teaching. However, creating those conversations in educational software is a …
Donya Rooein
,
Dirk Hovy
PDF
Cite
Project
Angry Men, Sad Women: Large Language Models Reflect Gendered Stereotypes in Emotion Attribution
March, 2024
Large language models (LLMs) reflect societal norms and biases, especially about gender. While societal biases and stereotypes have …
Flor Miriam Plaza-del-Arco
,
Amanda Cercas Curry
,
Alba Curry
,
Gavin Abercrombie
,
Dirk Hovy
PDF
Cite
Project
Subjective isms? On the Danger of Conflating Hate and Offence in Abusive Language Detection
March, 2024
Natural language processing research has begun to embrace the notion of annotator subjectivity, motivated by variations in labelling. …
Amanda Cercas Curry
,
Gavin Abercrombie
,
Zeerak Talat
PDF
Cite
Project
Impoverished Language Technology: The Lack of (Social) Class in NLP
March, 2024
Since Labov’s (1964) foundational work on the social stratification of language, linguistics has dedicated concerted efforts …
Amanda Cercas Curry
,
Zeerak Talat
,
Dirk Hovy
PDF
Cite
Project
Classist Tools: Social Class Correlates with Performance in NLP
March, 2024
Since the foundational work of William Labov on the social stratification of language (Labov, 1964), linguistics has made concentrated …
Amanda Cercas Curry
,
Giuseppe Attanasio
,
Zeerak Talat
,
Dirk Hovy
PDF
Cite
Project
A Tale of Pronouns: Interpretability Informs Gender Bias Mitigation for Fairer Instruction-Tuned Machine Translation
December, 2023
Recent instruction fine-tuned models can solve multiple NLP tasks when prompted to do so, with machine translation (MT) being a …
Giuseppe Attanasio
,
Flor Miriam Plaza-del-Arco
,
Debora Nozza
,
Anne Lauscher
PDF
Cite
Project
Mirages. On Anthropomorphism in Dialogue Systems
December, 2023
Automated dialogue or conversational systems are anthropomorphised by developers and personified by users. While a degree of …
Gavin Abercrombie
,
Amanda Cercas Curry
,
Tanvi Dinkar
,
Verena Rieser
,
Zeerak Talat
PDF
Cite
Project
The Empty Signifier Problem: Towards Clearer Paradigms for Operationalising 'Alignment' in Large Language Models
November, 2023
In this paper, we address the concept of ‘alignment’ in large language models (LLMs) through the lens of post-structuralist …
Hannah Rose Kirk
,
Bertie Vidgen
,
Paul Röttger
,
Scott A. Hale
PDF
Cite
Project
SimpleSafetyTests: a Test Suite for Identifying Critical Safety Risks in Large Language Models
November, 2023
The past year has seen rapid acceleration in the development of large language models (LLMs). For many tasks, there is now a wide range …
Bertie Vidgen
,
Hannah Rose Kirk
,
Rebecca Qian
,
Nino Scherrer
,
Anand Kannappan
,
Scott A. Hale
,
Paul Röttger
PDF
Cite
Project
The Past, Present and Better Future of Feedback Learning in Large Language Models for Subjective Human Preferences and Values
October, 2023
Human feedback is increasingly used to steer the behaviours of Large Language Models (LLMs). However, it is unclear how to collect and …
Hannah Rose Kirk
,
Andrew M. Bean
,
Bertie Vidgen
,
Paul Röttger
,
Scott A. Hale
PDF
Cite
Project
Wisdom of Instruction-Tuned Language Model Crowds. Exploring Model Label Variation
July, 2023
Large Language Models (LLMs) exhibit remarkable text classification capabilities, excelling in zero- and few-shot learning (ZSL and …
Flor Miriam Plaza-del-Arco
,
Debora Nozza
,
Dirk Hovy
PDF
Cite
Project
What about “em”? How Commercial Machine Translation Fails to Handle (Neo-)Pronouns
July, 2023
As 3rd-person pronoun usage shifts to include novel forms, e.g., neopronouns, we need more research on identity-inclusive NLP. …
Anne Lauscher
,
Debora Nozza
,
Ehm Miltersen
,
Archie Crowley
,
Dirk Hovy
PDF
Cite
Project
The State of Profanity Obfuscation in Natural Language Processing Scientific Publications
July, 2023
Work on hate speech has made considering rude and harmful examples in scientific publications inevitable. This situation raises various …
Debora Nozza
,
Dirk Hovy
PDF
Cite
Code
Project
The Ecological Fallacy in Annotation: Modeling Human Label Variation goes beyond Sociodemographics
July, 2023
Many NLP tasks exhibit human label variation, where different annotators give different labels to the same texts. This variation is …
Matthias Orlikowski
,
Paul Röttger
,
Philipp Cimiano
,
Dirk Hovy
PDF
Cite
Project
Temporal and Second Language Influence on Intra-Annotator Agreement and Stability in Hate Speech Labelling
July, 2023
Much work in natural language processing (NLP) relies on human annotation. The majority of this implicitly assumes that annotator’s …
Gavin Abercrombie
,
Dirk Hovy
,
Vinodkumar Prabhakaran
PDF
Cite
Project
Respectful or Toxic? Using Zero-Shot Learning with Language Models to Detect Hate Speech
July, 2023
Hate speech detection faces two significant challenges: 1) the limited availability of labeled data and 2) the high variability of hate …
Flor Miriam Plaza-del-Arco
,
Debora Nozza
,
Dirk Hovy
PDF
Cite
Project
MilaNLP at SemEval-2023 Task 10: Ensembling Domain-Adapted and Regularized Pretrained Language Models for Robust Sexism Detection
July, 2023
We present the system proposed by the MilaNLP team for the Explainable Detection of Online Sexism (EDOS) shared task. We propose an …
Amanda Cercas Curry
,
Giuseppe Attanasio
,
Debora Nozza
,
Dirk Hovy
PDF
Cite
Code
Project
A Multi-dimensional study on Bias in Vision-Language models
July, 2023
In recent years, joint Vision-Language (VL) models have increased in popularity and capability. Very few studies have attempted to …
Gabriele Ruggeri
,
Debora Nozza
PDF
Cite
Leveraging Social Interactions to Detect Misinformation on Social Media
June, 2023
Detecting misinformation threads is crucial to guarantee a healthy environment on social media. We address the problem using the data …
Tommaso Fornaciari
,
Luca Luceri
,
Emilio Ferrara
,
Dirk Hovy
PDF
Cite
Computer says “No”: The Case Against Empathetic Conversational AI
June, 2023
Emotions are an integral part of human cognition and they guide not only our understanding of the world but also our actions within it. …
Alba Curry
,
Amanda Cercas Curry
PDF
Cite
Project
A Cross-Lingual Study of Homotransphobia on Twitter
May, 2023
We present a cross-lingual study of homotransphobia on Twitter, examining the prevalence and forms of homotransphobic content in tweets …
Davide Locatelli
,
Greta Damo
,
Debora Nozza
PDF
Cite
Easily Accessible Text-to-Image Generation Amplifies Demographic Stereotypes at Large Scale
May, 2023
Machine learning models are now able to convert user-written text descriptions into naturalistic images. These models are available to …
Federico Bianchi
,
Pratyusha Kalluri
,
Esin Durmus
,
Faisal Ladhak
,
Myra Cheng
,
Debora Nozza
,
Tatsunori Hashimoto
,
Dan Jurafsky
,
James Zou
,
Aylin Caliskan
PDF
Cite
Proceedings of the First Workshop on Cross-Cultural Considerations in NLP (C3NLP)
May, 2023
Natural Language Processing has seen impressive gains in recent years. This research includes the demonstration by NLP models to have …
Sunipa Dev
,
Vinodkumar Prabhakaran
,
David Adelani
,
Dirk Hovy
,
Luciana Benotti
PDF
Cite
ferret: a Framework for Benchmarking Explainers on Transformers
May, 2023
As Transformers are increasingly relied upon to solve complex NLP problems, there is an increased need for their decisions to be …
Giuseppe Attanasio
,
Eliana Pastor
,
Chiara Di Bonaventura
,
Debora Nozza
PDF
Cite
Code
Can Demographic Factors Improve Text Classification? Revisiting Demographic Adaptation in the Age of Transformers
May, 2023
Demographic factors (e.g., gender or age) shape our language. Previous work showed that incorporating demographic factors can …
Chia-Chien Hung
,
Anne Lauscher
,
Dirk Hovy
,
Simone Paolo Ponzetto
,
Goran Glavaš
PDF
Cite
Project
Know Your Audience: Do LLMs Adapt to Different Age and Education Levels?
April, 2023
Large language models (LLMs) offer a range of new possibilities, including adapting the text to different audiences and their reading …
Donya Rooein
,
Amanda Cercas Curry
,
Dirk Hovy
PDF
Cite
Project
Beyond Digital 'Echo Chambers': The Role of Viewpoint Diversity in Political Discussion
February, 2023
Increasingly taking place in online spaces, modern political conversations are typically perceived to be unproductively …
Rishav Hada
,
Amir Ebrahimi Fard
,
Sarah Shugars
,
Federico Bianchi
,
Patricia Rossini
,
Dirk Hovy
,
Rebekah Tromble
,
Nava Tintarev
PDF
Cite
Project
Viewpoint: Artificial Intelligence Accidents Waiting to Happen?
January, 2023
Artificial Intelligence (AI) is at a crucial point in its development: stable enough to be used in production systems, and increasingly …
Federico Bianchi
,
Amanda Cercas Curry
,
Dirk Hovy
PDF
Cite
Project
Twitter-Demographer: A Flow-based Tool to Enrich Twitter Data
December, 2022
Twitter data have become essential to Natural Language Processing (NLP) and social science research, driving various scientific …
Federico Bianchi
,
Vincenzo Cutrona
,
Dirk Hovy
PDF
Cite
Code
Project
It's Not Just Hate: A Multi-Dimensional Perspective on Detecting Harmful Speech Online
December, 2022
Well-annotated data is a prerequisite for good Natural Language Processing models. Too often, though, annotation decisions are governed …
Federico Bianchi
,
Stefanie Hills
,
Patricia Rossini
,
Dirk Hovy
,
Rebekah Tromble
,
Nava Tintarev
PDF
Cite
Code
Project
SocioProbe: What, When, and Where Language Models Learn about Sociodemographics
December, 2022
Pre-trained language models (PLMs) have outperformed other NLP models on a wide range of tasks. Opting for a more thorough …
Anne Lauscher
,
Federico Bianchi
,
Samuel R. Bowman
,
Dirk Hovy
PDF
Cite
Project
Bridging Fairness and Environmental Sustainability in Natural Language Processing
December, 2022
Fairness and environmental impact are important research directions for the sustainable development of artificial intelligence. …
Marius Hessenthaler
,
Emma Strubell
,
Dirk Hovy
,
Anne Lauscher
PDF
Cite
Project
Measuring Harmful Representations in Scandinavian Language Models
December, 2022
Scandinavian countries are perceived as role-models when it comes to gender equality. With the advent of pre-trained language models …
Samia Touileb
,
Debora Nozza
PDF
Cite
Data-Efficient Strategies for Expanding Hate Speech Detection into Under-Resourced Languages
October, 2022
Hate speech is a global phenomenon, but most hate speech datasets so far focus on English-language content. This hinders the …
Paul Röttger
,
Debora Nozza
,
Federico Bianchi
,
Dirk Hovy
PDF
Cite
Project
Is It Worth the (Environmental) Cost? Limited Evidence for the Benefits of Diachronic Continuous Training
October, 2022
Language is constantly changing and evolving, leaving language models to quickly become outdated, both factually and linguistically. …
Giuseppe Attanasio
,
Debora Nozza
,
Federico Bianchi
,
Dirk Hovy
PDF
Cite
Project
Welcome to the Modern World of Pronouns: Identity-Inclusive Natural Language Processing beyond Gender
October, 2022
The world of pronouns is changing – from a closed word class with few members to an open set of terms to reflect identities. However, …
Anne Lauscher
,
Archie Crowley
,
Dirk Hovy
PDF
Cite
Project
Guiding the Release of Safer E2E Conversational AI through Value Sensitive Design
September, 2022
Over the last several years, end-to-end neural conversational agents have vastly improved their ability to carry unrestricted, …
A. Stevie Bergman
,
Gavin Abercrombie
,
Shannon Spruit
,
Dirk Hovy
,
Emily Dinan
,
Y-Lan Boureau
,
Verena Rieser
PDF
Cite
Project
Multilingual HateCheck: Functional Tests for Multilingual Hate Speech Detection Models
July, 2022
Hate speech detection models are typically evaluated on held-out test sets. However, this risks painting an incomplete and potentially …
Paul Röttger
,
Haitham Seelawi
,
Debora Nozza
,
Zeerak Talat
,
Bertie Vidgen
PDF
Cite
Code
HATE-ITA: Hate Speech Detection in Italian Social Media Text
July, 2022
Online hate speech is a dangerous phenomenon that can (and should) be promptly counteracted properly. While Natural Language Processing …
Debora Nozza
,
Federico Bianchi
,
Giuseppe Attanasio
PDF
Cite
Code
Poster
Slides
Hard and Soft Evaluation of NLP models with BOOtSTrap SAmpling - BooStSa
May, 2022
Natural Language Processing (NLP)’s applied nature makes it necessary to select the most effective and robust models. Producing …
Tommaso Fornaciari
,
Alexandra Uma
,
Massimo Poesio
,
Dirk Hovy
PDF
Cite
Code
Project
MilaNLP at SemEval-2022 Task 5: Using Perceiver IO for Detecting Misogynous Memes with Text and Image Modalities
April, 2022
In this paper, we describe the system proposed by the MilaNLP team for the Multimedia Automatic Misogyny Identification (MAMI) …
Giuseppe Attanasio
,
Debora Nozza
,
Federico Bianchi
PDF
Cite
Code
Video
Language Invariant Properties in Natural Language Processing
April, 2022
Meaning is context-dependent, but many properties of language (should) remain the same even if we transform the context. For example, …
Federico Bianchi
,
Debora Nozza
,
Dirk Hovy
PDF
Cite
Code
XLM-EMO: Multilingual Emotion Prediction in Social Media Text
April, 2022
Detecting emotion in text allows social and computational scientists to study how people behave and react to online events. However, …
Federico Bianchi
,
Debora Nozza
,
Dirk Hovy
PDF
Cite
Code
Project
Two Contrasting Data Annotation Paradigms for Subjective NLP Tasks
April, 2022
Labelled data is the foundation of most natural language processing tasks. However, labelling data is difficult and there often are …
Paul Röttger
,
Bertie Vidgen
,
Dirk Hovy
,
Janet B. Pierrehumbert
PDF
Cite
Project
Pipelines for Social Bias Testing of Large Language Models
April, 2022
The maturity level of language models is now at a stage in which many companies rely on them to solve various tasks. However, while …
Debora Nozza
,
Federico Bianchi
,
Dirk Hovy
PDF
Cite
Project
Poster
Measuring Harmful Sentence Completion in Language Models for LGBTQIA+ Individuals
April, 2022
Current language technology is ubiquitous and directly influences individuals' lives worldwide. Given the recent trend in AI on …
Debora Nozza
,
Federico Bianchi
,
Anne Lauscher
,
Dirk Hovy
PDF
Cite
Code
Project
Benchmarking Post-Hoc Interpretability Approaches for Transformer-based Misogyny Detection
April, 2022
Transformer-based Natural Language Processing models have become the standard for hate speech detection. However, the unconscious use …
Giuseppe Attanasio
,
Debora Nozza
,
Eliana Pastor
,
Dirk Hovy
PDF
Cite
Code
Project
Video
Fair and Argumentative Language Modeling for Computational Argumentation
April, 2022
Although much work in NLP has focused on measuring and mitigating stereotypical bias in semantic spaces, research addressing bias in …
Carolin Holtermann
,
Anne Lauscher
,
Simone Paolo Ponzetto
PDF
Cite
Project
DS-TOD: Efficient Domain Specialization for Task Oriented Dialog
April, 2022
Recent work has shown that self-supervised dialog-specific pretraining on large conversational datasets yields substantial gains over …
Chia-Chien Hung
,
Anne Lauscher
,
Simone Paolo Ponzetto
,
Goran Glavaš
PDF
Cite
SAFETYKIT: First Aid for Measuring Safety in Open-domain Conversational Systems
March, 2022
The social impact of natural language processing and its applications has received increasing attention. In this position paper, we …
Emily Dinan
,
Gavin Abercrombie
,
A. Stevie Bergman
,
Shannon Spruit
,
Dirk Hovy
,
Y-Lan Boureau
,
Verena Rieser
PDF
Cite
Project
Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists
March, 2022
Natural Language Processing (NLP) models risk overfitting to specific terms in the training data, thereby reducing their performance, …
Giuseppe Attanasio
,
Debora Nozza
,
Dirk Hovy
,
Elena Baralis
PDF
Cite
Code
Project
Video
Text Analysis in Python for Social Scientists – Prediction and Classification
January, 2022
Text contains a wealth of information about a wide variety of sociocultural constructs. Automated prediction methods can infer …
Dirk Hovy
PDF
Cite
Learning from Disagreement: A Survey
December, 2021
Many tasks in Natural Language Processing (NLP) and Computer Vision (CV) offer evidence that humans disagree, from objective tasks such …
Alexandra N Uma
,
Tommaso Fornaciari
,
Dirk Hovy
,
Silviu Paun
,
Barbara Plank
,
Massimo Poesio
PDF
Cite
Pre-training is a Hot Topic: Contextualized Document Embeddings Improve Topic Coherence
August, 2021
Topic models extract groups of words from documents, whose interpretation as a topic hopefully allows for a better understanding of the …
Federico Bianchi
,
Silvia Terragni
,
Dirk Hovy
PDF
Cite
Code
On the Gap between Adoption and Understanding in NLP
August, 2021
There are some issues with current research trends in NLP that can hamper the free development of scientific research. We identify five …
Federico Bianchi
,
Dirk Hovy
PDF
Cite
Five sources of bias in natural language processing
August, 2021
Recently, there has been an increased interest in demographically grounded bias in natural language processing (NLP) applications. Much …
Dirk Hovy
,
Shrimai Prabhumoye
PDF
Cite
Project
Exposing the limits of Zero-shot Cross-lingual Hate Speech Detection
August, 2021
Reducing and counter-acting hate speech on Social Media is a significant concern. Most of the proposed automatic methods are conducted …
Debora Nozza
PDF
Cite
Project
Poster
Slides
'We will Reduce Taxes' - Identifying Election Pledges with Language Models
August, 2021
In an election campaign, political parties pledge to implement various projects–should they be elected. But do they follow …
Tommaso Fornaciari
,
Dirk Hovy
,
Elin Naurin
,
Julia Runeson
,
Robert Thomson
,
Pankaj Adhikari
PDF
Cite
Project
The Importance of Modeling Social Factors of Language: Theory and Practice
June, 2021
Natural language processing (NLP) applications are now more powerful and ubiquitous than ever before. With rapidly developing (neural) …
Dirk Hovy
,
Diyi Yang
PDF
Cite
Project
HONEST: Measuring Hurtful Sentence Completion in Language Models
June, 2021
Language models have revolutionized the field of NLP. However, language models capture and proliferate hurtful stereotypes, especially …
Debora Nozza
,
Federico Bianchi
,
Dirk Hovy
PDF
Cite
Code
Project
Poster
Slides
Blog Post
Language in a (Search) Box: Grounding Language Learning in Real-World Human-Machine Interaction
June, 2021
We investigate grounded language learning through real-world data, by modelling teacher-learner dynamics through the natural interactions occurring between users and search engines.
Federico Bianchi
,
Ciro Greco
,
Jacopo Tagliabue
PDF
Cite
Tweet
MilaNLP @ WASSA: Does BERT Feel Sad When You Cry?
May, 2021
The paper describes the MilaNLP team’s submission (Bocconi University, Milan) in the WASSA 2021 Shared Task on Empathy Detection and …
Tommaso Fornaciari
,
Federico Bianchi
,
Debora Nozza
,
Dirk Hovy
PDF
Cite
FEEL-IT: Emotion and Sentiment Classification for the Italian Language
May, 2021
Sentiment analysis is a common task to understand people’s reactions online. Still, we often need more nuanced information: is …
Federico Bianchi
,
Debora Nozza
,
Dirk Hovy
PDF
Cite
Code
Beyond Black & White: Leveraging Annotator Disagreement via Soft-Label Multi-Task Learning
May, 2021
Supervised learning assumes that a ground truth label exists. However, the reliability of this ground truth depends on human …
Tommaso Fornaciari
,
Alexandra Uma
,
Silviu Paun
,
Barbara Plank
,
Dirk Hovy
,
Massimo Poesio
PDF
Cite
Video
Universal Joy: A Data Set and Results for Classifying Emotions Across Languages
April, 2021
While emotions are universal aspects of human psychology, they are expressed differently across different languages and cultures. We …
Sotiris Lamprinidis
,
Federico Bianchi
,
Daniel Hardt
,
Dirk Hovy
PDF
Cite
Code
BERTective: Language Models and Contextual Information for Deception Detection
April, 2021
Spotting a lie is challenging but has an enormous potential impact on security as well as private and public safety. Several NLP …
Tommaso Fornaciari
,
Federico Bianchi
,
Dirk Hovy
,
Massimo Poesio
PDF
Cite
Code
Dataset
Cross-lingual Contextualized Topic Models with Zero-shot Learning
March, 2021
We introduce a novel topic modeling method that can make use of contextualized embeddings (e.g., BERT) to do zero-shot cross-lingual topic modeling.
Federico Bianchi
,
Silvia Terragni
,
Dirk Hovy
,
Debora Nozza
,
Elisabetta Fersini
PDF
Cite
Code
Slides
Blog Post
Text Analysis in Python for Social Scientists – Discovery and Exploration
December, 2020
Text is everywhere, and it is a fantastic resource for social scientists. However, because it is so abundant, and because language is …
Dirk Hovy
PDF
Cite
“You Sound Just Like Your Father” Commercial Machine Translation Systems Include Stylistic Biases
July, 2020
The main goal of machine translation has been to convey the correct content. Stylistic considerations have been at best secondary. We …
Dirk Hovy
,
Federico Bianchi
,
Tommaso Fornaciari
PDF
Cite
Video
Predictive Biases in Natural Language Processing Models: A Conceptual Framework and Overview
July, 2020
An increasing number of natural language processing papers address the effect of bias on predictions, introducing mitigation techniques …
Deven Santosh Shah
,
H. Andrew Schwartz
,
Dirk Hovy
PDF
Cite
Video
Visualizing Regional Language Variation Across Europe on Twitter
March, 2020
Geotagged Twitter data allows us to investigate correlations of geographic language variation, both at an interlingual and intralingual …
Dirk Hovy
,
Afshin Rahimi
,
Timothy Baldwin
,
Julian Brooke
Cite
DOI
What the [MASK]? Making Sense of Language-Specific BERT Models
March, 2020
Recently, Natural Language Processing (NLP) has witnessed impressive progress in many areas, due to the advent of novel, pretrained …
Debora Nozza
,
Federico Bianchi
,
Dirk Hovy
PDF
Cite
Code
Project
Source Document
Helpful or Hierarchical? Predicting the Communicative Strategies of Chat Participants, and their Impact on Success
March, 2020
When interacting with each other, we motivate, advise, inform, show love or power towards our peers. However, the way we interact may …
Farzana Rashid
,
Tommaso Fornaciari
,
Dirk Hovy
,
Eduardo Blanco
,
Fernando Vega-Redondo
PDF
Cite
Fake opinion detection: how similar are crowdsourced datasets to real data?
January, 2020
Identifying deceptive online reviews is a challenging task for Natural Language Processing (NLP). Collecting corpora for the task is …
Tommaso Fornaciari
,
Leticia Cagnina
,
Paolo Rosso
,
Massimo Poesio
PDF
Cite
DOI
A Case for Soft Loss Functions
January, 2020
Recently, Peterson et al. provided evidence of the benefits of using probabilistic soft labels generated from crowd annotations for …
Alexandra Uma
,
Tommaso Fornaciari
,
Dirk Hovy
,
Silviu Paun
,
Barbara Plank
,
Massimo Poesio
PDF
Cite
Identifying Linguistic Areas for Geolocation
November, 2019
Geolocating social media posts relies on the assumption that language carries sufficient geographic information. However, locations are …
Tommaso Fornaciari
,
Dirk Hovy
PDF
Cite
Hey Siri. Ok Google. Alexa: A topic modeling of user reviews for smart speakers
November, 2019
User reviews provide a significant source of information for companies to understand their market and audience. In order to discover …
Hanh Nguyen
,
Dirk Hovy
PDF
Cite
Geolocation with Attention-Based Multitask Learning Models
November, 2019
Geolocation, predicting the location of a post based on text and other information, has a huge potential for several social media …
Tommaso Fornaciari
,
Dirk Hovy
PDF
Cite
Dense Node Representation for Geolocation
November, 2019
Prior research has shown that geolocation can be substantially improved by including user network information. While effective, it …
Tommaso Fornaciari
,
Dirk Hovy
PDF
Cite
Women’s Syntactic Resilience and Men’s Grammatical Luck: Gender-Bias in Part-of-Speech Tagging and Dependency Parsing
July, 2019
Several linguistic studies have shown the prevalence of various lexical and grammatical patterns in texts authored by a person of a …
Aparna Garimella
,
Carmen Banea
,
Dirk Hovy
,
Rada Mihalcea
PDF
Cite
Peer networks and entrepreneurship: A Pan-African RCT
January, 2019
Can large-scale peer interaction foster entrepreneurship and innovation? We conducted an RCT involving almost 5,000 entrepreneurs from …
Fernando Vega-Redondo
,
Paolo Pin
,
Diego Ubfal
,
Cristiana Benedetti-Fasil
,
Charles Brummitt
,
Gaia Rubera
,
Dirk Hovy
,
Tommaso Fornaciari
PDF
Cite
Increasing In-Class Similarity by Retrofitting Embeddings with Demographic Information
November, 2018
Dirk Hovy
,
Tommaso Fornaciari
PDF
Cite
Predicting News Headline Popularity with Syntactic and Semantic Knowledge Using Multi-Task Learning
October, 2018
Newspapers need to attract readers with headlines, anticipating their readers’ preferences. These preferences rely on topical, …
Sotiris Lamprinidis
,
Daniel Hardt
,
Dirk Hovy
PDF
Cite
Comparing Bayesian Models of Annotation
October, 2018
The analysis of crowdsourced annotations in natural language processing is concerned with identifying (1) gold standard labels, (2) …
Silviu Paun
,
Bob Carpenter
,
Jon Chamberlain
,
Dirk Hovy
,
Udo Kruschwitz
,
Massimo Poesio
PDF
Cite
Capturing Regional Variation with Distributed Place Representations and Geographic Retrofitting
October, 2018
Dialects are one of the main drivers of language variation, a major challenge for natural language processing tools. In most languages, …
Dirk Hovy
,
Christoph Purschke
PDF
Cite
The Social and the Neural Network: How to Make Natural Language Processing about People again
June, 2018
Over the years, natural language processing has increasingly focused on tasks that can be solved by statistical models, but ignored the …
Dirk Hovy
PDF
Cite