I don't know if I feel cheated, but it seems a little unmanageable. How is this supposed to scale? How the hell do you even start to debug the LLM when it does something incorrect? It's not like you can attach a debugger to English.
The "vibe" I'm getting is that of a junior developer who solves problems by tacking on an ever-increasing amount of code, rather than going back and fixing underlying design flaws.
See it as a temporary workaround, and assume each instruction will also lead to additional training data that tries to achieve the same behavior directly in the next model.
It comes down to solving this: given instruction X, figure out how to change the training data such that X is obeyed and no other side effects appear. Given the amount of training data and the complexities involved in training, I don't think there is a clear way to do it.
I'm slightly less sceptical that they can do it, but we presumably agree that changing the prompt is far faster, so you change the prompt first. The prompt then effectively serves, in part, as documentation of issues to chip away at while working on the next iterations of the underlying models.