
These models are designed to produce a _plausible_ text output for a given prompt. Nothing more.

They are not designed to produce a _correct_ text output in response to a question or request, even if the output is sometimes correct. These proverbial stopped clocks might be right more than twice a day, but that's just the huge training set speaking.
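
To make that concrete, here's a toy sketch (a bigram model over a two-sentence made-up corpus, nothing like a real LLM, but the objective has the same shape): the next token is sampled in proportion to how often it followed the previous token in training. There is no notion of truth anywhere in the loop.

  import random
  from collections import defaultdict, Counter

  # Made-up training data: one correct fact, one wrong one.
  corpus = ("the capital of france is paris . "
            "the capital of france is lyon .").split()

  # Count how often each token follows each other token.
  counts = defaultdict(Counter)
  for a, b in zip(corpus, corpus[1:]):
      counts[a][b] += 1

  def sample_next(token):
      # Sample in proportion to training frequency --
      # plausibility, with no correctness check anywhere.
      nxt = counts[token]
      return random.choices(list(nxt), weights=list(nxt.values()))[0]

  token, out = "the", ["the"]
  for _ in range(5):
      token = sample_next(token)
      out.append(token)
  print(" ".join(out))  # "... is paris" or "... is lyon",
                        # each equally plausible to the model

Half the time this "model" says Lyon is the capital of France, and by its own objective that answer is exactly as good as Paris.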



Are you taking RLHF into account when you say that?


Well, I wasn't, but if you look at the topmost comment of this thread [0], you'll see that the level of human reinforcement being demonstrated there only strengthens my point.

[0] https://news.ycombinator.com/item?id=36013017


Taking RLHF into account: it's no longer generating the most plausible completion; it's generating a less plausible one that human raters happened to prefer.
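
A toy illustration of that effect (all numbers invented; real RLHF fine-tunes the weights under a KL penalty rather than reweighting at inference, but the KL-regularized optimum is exactly this reward-tilted distribution, p(y) ∝ p_base(y)·exp(reward(y)/β)):

  import math

  # Invented numbers: base-model probabilities for three candidate
  # completions, plus scores from a hypothetical reward model.
  completions = ["A", "B", "C"]
  base_p = [0.6, 0.3, 0.1]   # "A" is the most plausible completion
  reward = [0.0, 2.0, 0.5]   # human raters preferred "B"

  beta = 1.0  # KL-penalty strength; lower = chase reward harder
  # KL-regularized RLHF optimum: p(y) proportional to
  # p_base(y) * exp(reward(y) / beta)
  tilted = [p * math.exp(r / beta) for p, r in zip(base_p, reward)]
  z = sum(tilted)
  for c, p0, t in zip(completions, base_p, tilted):
      print(c, round(p0, 2), "->", round(t / z, 2))
  # A 0.6 -> 0.2, B 0.3 -> 0.74, C 0.1 -> 0.06: the tuned model
  # now favors "B" even though "A" was the most plausible text.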



