Beruflich Dokumente
Kultur Dokumente
W2
Over fitting
Over fitting occurs when a statistical model describes random error or noise instead of the underlying relationship. Over fitting generally occurs when a model is excessively complex, such as having too many parameters relative to the number of observations. A model which has been over fit will generally have poor predictive performance, as it can exaggerate minor fluctuations in the data. However, especially in cases where learning was performed too long or where training examples are rare, the learner may adjust to very specific random features of the training data, that have no causal relation to the target function. In this process of over fitting, the performance on the training examples still increases while the performance on unseen data becomes worse.
Over fitting
Modified strategy E.g. pruning (simplifying a description) Prepruning: stops at a simple description before search proceeds to an overly complex one Postpruning: generates a complex description first and simplifies it afterwards