This document summarizes the steps taken to apply machine learning models to classify iris flower species and predict an advertising target variable. It loads datasets, preprocesses data, builds random forest models, tunes hyperparameters, evaluates performance using cross-validation, and analyzes variable importance. Key steps include data splitting, feature selection, resampling, model building, pruning, threshold optimization, and comparing results.
2. Prepare your R session
setwd("~/Google_Drive/github/R/R_Recap/session_02")
library(dplyr)
library(ggplot2)
How to apply models in general
Understanding the problem
Understanding the data
Preparing the data
Split in Train & Test
Predicting variable selection
Resampling
Model
Tuning of the model
Evaluation
=> ITERATIVE PROCESS! # Iris dataset {.smaller}