Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
dcre
61 days ago
|
parent
|
context
|
favorite
| on:
Kimi K2 Thinking, a SOTA open-source trillion-para...
The question is: fine-tuning for what? Reasoning is not a particular task, it is a general-purpose technique for directing more compute at
any
task.
irthomasthomas
61 days ago
[–]
Pivot tokens like 'wait', 'actually' and 'alternatively' are boosted in order to force the model to explore alternate solutions.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: