How to move from a continuation model to a chat model? #497
antonkratz
started this conversation in
General
Replies: 1 comment 1 reply
-
It's actually very simple conceptually, you just swap out the dataset and continue training for a little bit. From one that looks like internet documents to one that looks like conversations. There's a number of these available, e.g. https://github.com/LAION-AI/Open-Assistant is one from a while back. This approach is "SFT" (Supervised Finetuning) and gets you a long way. You can then get another ~10% with RLHF (more complicated), or DPO (not complicated, works almost as well). llm.c will probably get around to it, but the basics have to be super solid, and they are not yet. |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I do not understand, conceptually, how to move from a completion model to a chat model. What @karpathy describes in #481 is, if I understand correctly, similar to BLOOM, is that right? I.e. it is a model that given text, will continue that text until some stop token is reached, is that correct? But how to get from this, or something like BLOOM, to something like ChatGPT? It seems like two almost completely different tasks! One is continuation... the other task is response. Could someone please point me in the right direction how to get from a BLOOM type continuation model to a ChatGPT type model? Again, this is about implementing it on top of #481 but more importantly my question is conceptual.
Beta Was this translation helpful? Give feedback.
All reactions