Hacker News
nomel | 24 days ago | on: Sycophancy is the first LLM "dark pattern"
The "alignment tax".
behnamoh | 24 days ago
Exactly. Even this paper shows that model creativity drops significantly and that the models experience mode collapse, like we saw in GANs, yet the companies keep using RLHF...
https://arxiv.org/abs/2406.05587
nomel | 24 days ago
A nice talk about a researcher's experience/benchmarks with raw GPT-4, before and after RLHF:
https://www.youtube.com/watch?v=qbIk7-JPB2c
behnamoh | 24 days ago
Yup, I remember that! Microsoft removed that part of the paper.