Human Feedback
The Past, Present and Better Future of Feedback Learning in Large Language Models for Subjective Human Preferences and Values
Human feedback is increasingly used to steer the behaviours of Large Language Models (LLMs). However, it is unclear how to collect and incorporate feedback in a way that is efficient, effective and unbiased, especially for highly subjective human …