GlossaryTraining datasets
Training datasets
A training dataset is a dataset of examples used for learning, that is to fit the parameters of, for instance, a classifier. Most approaches that search through data for empirical relationships tend to overfit the data, meaning that they can identify apparent relationships in the training data that do not hold in general.
Training datasets are a key component of any successful machine learning project, as they can have a significant impact on the performance of your models.