Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
behnamoh
58 days ago
|
parent
|
context
|
favorite
| on:
Sycophancy is the first LLM "dark pattern"
Exactly. Even this paper shows how model creativity significantly drops and the models experience mode collapse like we saw in GANs, but the companies keep using RLHF...
https://arxiv.org/abs/2406.05587
nomel
58 days ago
[–]
A nice talk about a researcher's experience/benchmarks with raw GPT-4, before and after RLHF:
https://www.youtube.com/watch?v=qbIk7-JPB2c
behnamoh
58 days ago
|
parent
[–]
Yup, I remember that! Microsoft removed that part of the paper.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search:
https://arxiv.org/abs/2406.05587