* feat: improve data training for models up to 7B parameters. * docs: training considerations for small models to the documentation