Training Program on Data Mining / Predictive Analytics
Technical Aspects of Data Mining / Predictive Analytics
- How does one evaluate the quality of a predictive model? Training data vs. validation data. Concept of cross-validation.
- What is regression analysis?
- What
is the taxonomy of regression models? Parametric vs. non-parametric,
linear vs. non-linear, robust vs. non-robust regression.
- What is a generalization error? Overfitting vs. underfitting.
- What is the tradeoff between bias and variance?
- What is logistic regression?
- What is a confusion matrix? Misclassification error vs. misclassification cost.
- What is a lift chart? Lift chart vs. gain chart.
- What is multicollinearity?
- What is the difference between statistical regression and data mining regression?
- What is data mining jargon? Statistical jargon vs. data mining jargon.
- What transformations are useful for continuous variables?
- What transformations are useful for categorical variables?
- What is Cluster Analysis? Customer segmentation vs. cluster analysis.
- Supervised training vs. un-supervised training. Classification vs.cluster analysis.
- What is the relationship between Theory and Cluster Analysis?
- What is the boosting? Weak learner vs. strong learner?
- What is machine learning?
- What is a tree-based model? CART vs. CHAID.
- What is Random Forest ?
- What is TreeNet/TreeBoost?
- What are Multivariate Adaptive Regression Splines (MARS)?
- What is the Support Vector Machine (SVM)?
- What is Association/Market Basket Analysis?
- What is the Genetic Algorithm?
- What is a Neural Net? What is Neural Net architecture? What is the taxonomy of Neural Net models?
- What are data preparation headaches?
- What
is the relationship between the number of variables and the number of
unknown parameter estimates and number of observations?
- What are the major data mining tasks?
- Which data mining tool is the best for Risk /Customer/Sales/Marketing Analytics?
- What are the golden rules of data mining?
- What is credit scoring?
- What is a scorecard?