@rkdgusrn1212
Created May 31, 2022 17:50
Ordinal Encode Categorical Column
import pandas as pd
from sklearn.preprocessing import OrdinalEncoder
# Categorical columns
object_cols = [col for col in train_data.columns if train_data[col].dtype == "object"]
# Columns whose set of values in valid_data is a subset of the values in train_data; this is a necessary condition for reusing an encoder fit on train_data.
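good_label_cols = [col for col in object_cols if set(valid_data[col]).issubset(set(train_data[col]))]
# Encode into copies so the original DataFrames stay unchanged
label_X_train = train_data.copy()
label_X_valid = valid_data.copy()
ordinal_encoder = OrdinalEncoder()
# Fit on the training data only, then reuse the fitted mapping for the validation data
label_X_train[good_label_cols] = ordinal_encoder.fit_transform(train_data[good_label_cols])
label_X_valid[good_label_cols] = ordinal_encoder.transform(valid_data[good_label_cols])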
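
The snippet above assumes `train_data` and `valid_data` already exist. As a rough, self-contained sketch of the same subset check, the example below builds two small toy DataFrames (hypothetical data, not part of the original gist) in which one column contains a category that is unseen in training. Dropping the columns that fail the check (`bad_label_cols`) is an assumption made here for illustration; the original snippet does not say what to do with them.

```python
import pandas as pd
from sklearn.preprocessing import OrdinalEncoder

# Hypothetical toy data for illustration only.
train_data = pd.DataFrame({
    "color": ["red", "green", "blue", "green"],
    "size": ["S", "M", "L", "M"],
    "price": [1.0, 2.0, 3.0, 2.5],
}).astype({"color": "object", "size": "object"})
valid_data = pd.DataFrame({
    "color": ["blue", "red"],
    "size": ["M", "XL"],  # "XL" never appears in train_data
    "price": [2.0, 4.0],
}).astype({"color": "object", "size": "object"})

# Categorical columns
object_cols = [col for col in train_data.columns if train_data[col].dtype == "object"]

# Columns whose validation values all appear in the training data
good_label_cols = [
    col for col in object_cols
    if set(valid_data[col]).issubset(set(train_data[col]))
]
# Columns with at least one category unseen during training
bad_label_cols = sorted(set(object_cols) - set(good_label_cols))

# Assumption: drop the columns that cannot be encoded safely
label_X_train = train_data.drop(columns=bad_label_cols)
label_X_valid = valid_data.drop(columns=bad_label_cols)

# Fit on the training data, then apply the same mapping to the validation data
ordinal_encoder = OrdinalEncoder()
label_X_train[good_label_cols] = ordinal_encoder.fit_transform(train_data[good_label_cols])
label_X_valid[good_label_cols] = ordinal_encoder.transform(valid_data[good_label_cols])

print(good_label_cols, bad_label_cols)
print(label_X_valid)
```

Here `size` fails the check because `"XL"` occurs only in the validation data, so only `color` is encoded. An alternative to dropping such columns, in scikit-learn versions that support it, is `OrdinalEncoder(handle_unknown="use_encoded_value", unknown_value=-1)`, which maps unseen categories to a sentinel value instead of raising an error.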