
"Paper after paper shows these things are hiding data, fabricating output, reward hacking, exploiting human psychology, and engaging in other nefarious behaviors best expressed as akin to a human toddler - just with the skills of a political operative, subject matter expert, or professional gambler."

With the anthropomorphizing removed, it simply means that we do not yet understand the internal logic of LLMs. That is much less disturbing than you suggest.


