Local text models discussion

soulnull@burggit.moe · edit-2 1 year ago

Local text models discussion

awoo@burggit.moe · 1 year ago

“B” refer to the size of parameters of the model in the billion but people has started referring them to “bits”.

The bigger the number, the smarter the model will be but the size and RAM, VRAM requirement rises accordingly.

LongerDonger@burggit.moe · 11 months ago

“B” refer to the size of parameters of the model in the billion but people has started referring them to “bits”.

This is not entirely correct. B does stand for “Billion” parameters, but bits are a different thing. You can have, for example, a 4-bit 13B model or an 8-bit 3B model. They don’t correlate at all.