Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

On many natural language tasks there can be significant overlap, making it difficult to judge performance. That's why I like more complex code generation tasks such the dataset we used for AlphaCode.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: