If I understand it correctly, this seems to be more than just quantizing: the models are apparently trained in this format as well. So it's possible that the many layers adjust themselves in a way that "cancels out" the inaccuracies of the lower bit count.
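
For what it's worth, that intuition matches how quantization-aware training is usually done: the forward pass uses the rounded low-bit weights, while gradients flow through as if no rounding happened (a straight-through estimator), so the optimizer settles on weights where the rounding error hurts the loss least. A minimal PyTorch sketch of that general idea (my own illustration, not the actual training code of these models; the `fake_quantize` helper, the `bits` parameter, and the mean-absolute-value scale are all assumptions):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

def fake_quantize(w: torch.Tensor, bits: int = 2) -> torch.Tensor:
    """Round weights to a low-bit grid in the forward pass, but let
    gradients pass through unchanged (straight-through estimator)."""
    qmax = 2 ** (bits - 1) - 1                  # e.g. 1 for 2-bit signed
    scale = w.abs().mean().clamp(min=1e-8)      # per-tensor scale (assumed)
    w_q = (w / scale).round().clamp(-qmax - 1, qmax) * scale
    # Forward sees the quantized value; backward sees the identity,
    # because the (w_q - w) term is detached from the graph.
    return w + (w_q - w).detach()

class QuantLinear(nn.Linear):
    """Linear layer whose weights are quantized on the fly during training."""
    def forward(self, x):
        return F.linear(x, fake_quantize(self.weight), self.bias)
```

Because the loss is computed on the quantized weights at every training step, the full-precision "shadow" weights drift toward values whose rounded versions still work well together, which is the "cancelling out" effect across layers.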