Validating Classification Models

Features: Vendor, location, time, distance from last transaction
Labels: Chargebacks on previous transactions

Train/Test Split: Randomly remove 20% of the examples to evaluate the model's performance.

df_train = df.sample(frac=0.8)
df_test = df.drop(df_train.index)

df_train, df_test = train_test_split(df, test_size=0.2)

Categorical Supervised Machine Learning Algorithms