仓库源文站点原文

Prompt

Prompt is the most direct way to influence response, tips for good prompt:

The purpose of prompt in post-training is building best reasoning architecture in response, training could optimize other detailed contents in response

One shot learning

Experience / Conclusion

Training process

The purpose of training is to ensure performance on test dataset increase in stable trend and range.

Multi-stage training

The purpose of dataset is to provide information to model to learn, in the late stages, model already know more than before, more extra information should be sent to model. So, in the late stage, we should increase information diversity

How to increase information diversity:

Reference