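I read your team's paper carefully, and it is very nice work! I have three questions I'd like to ask.
Questions about the pre-training process:
Q1: Why does the first stage start with one pass over the high-quality pre-training data and only then train on a mix of high-quality and low-quality data? What is the purpose of doing it this way?
Q2: After the first stage described above, the model is trained on the high-quality data again, which means the high-quality data is seen for 3 epochs over the whole process. Is that too much training? What is the intuition behind this design?
Q3: I tried the chat model your team has deployed, and it currently seems to refuse to answer in Chinese. How is this refusal for unsupported languages implemented, and why is it needed?
Instruction: 你好 ("Hello")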
SeaLLMs:Sorry, the language you have asked is currently not supported. If you have questions in other supported languages, I'll be glad to help. Please also consider clearing the chat box for a better experience.