Uploaded on Oct 17, 2022
PPT on Evaluation Models
Evaluation Models
Click to edit Master title style EVALUATION MODEL1 S MCloicdke lt oE veadliuta Mtiaosnter title style Model Evaluation is an integral part of the model development process. It helps to find the best model that represents our data and how well the chosen model will work in the future. 2 Source: www.saedsayad.com 2 AClBicOkU tTo Meoddite Ml Eavstaelur attitiolen style Evaluating model performance with the data used for training is not acceptable in data science because it can easily generate overoptimistic and overfitted models. 3 Source: www.saedsayad.com 3 CMlEicTkH tOoD eSd OitF M EaVsAteLrU tAitTleIN sGt yMleODELS There are two methods of evaluating models in data science, Hold-Out and Cross-Validation. To avoid overfitting, both methods use a test set (not seen by the model) to evaluate model performance. 4 Source: www.saedsayad.com 4 TClriacikn itnog e sdeit Master title style Training set is a subset of the dataset used to build predictive models. 5 Source: www.saedsayad.com 5 CVlailcikd attoi oend iste Mt aster title style Validation set is a subset of the dataset used to assess the performance of model built in the training phase. It provides a test platform for fine tuning model's parameters and selecting the best-performing model. Not all modeling algorithms need a validation set. 6 Source: www.saedsayad.com 6 CTleicstk steot edit Master title style Test set or unseen examples is a subset of the dataset to assess the likely future performance of a model. If a model fit to the training set much better than it fits the test set, overfitting is probably the cause. 7 Source: www.saedsayad.com 7 Clriocsks -tVoa elidiat tMioanster title style When only a limited amount of data is available, to achieve an unbiased estimate of the model performance we use k-fold cross-validation. In k-fold cross-validation, we divide the data into k subsets of equal size. 8 Source: www.saedsayad.com 8 Cloinckfu tsoio end Mit aMtraisxter title style A confusion matrix shows the number of correct and incorrect predictions made by the classification model compared to the actual outcomes (target value) in the data. 9 Source: www.saedsayad.com 9 Cliacsks itfoic aetdiiotn M Eavsatelura ttiitolen style Classifiers are commonly evaluated using either a numeric metric, such as accuracy, or a graphical representation of performance, such as a receiver operating characteristic (ROC) curve 10 Source: www.saedsayad.com 10 CRleicgrke tsos ieodni tE vMaalustaetrio tnitle style Regression is a type of Machine learning which helps in finding the relationship between independent and dependent variable. 11 Source: www.saedsayad.com 11
Comments