Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

The point is that it's a litmus test for how well the models do with niche knowledge _in general_. The point isn't really to know how well the model works for that specific niche. Ideally of course you would use a few of them and aggregate the results.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: