overtraining

2 Articles
Over-training large language models may make them harder to fine-tune
Tech

Over-training large language models may make them harder to fine-tune

Language models with extensive pre-training can exhibit catastrophic overtraining, where the performance of post-trained models degrades as the pre-training stage is extended. Credit:...

‘Catastrophic overtraining’ could harm large language AI models that are trained on more data for the sake of training
Tech

‘Catastrophic overtraining’ could harm large language AI models that are trained on more data for the sake of training

Researchers from top US universities warn extending pre-training can be detrimental to performance Too much pre-training can deliver worse performance due to something...