Skip to content

Instantly share code, notes, and snippets.

@lucassmacedo
Last active March 3, 2020 13:06
Show Gist options
  • Save lucassmacedo/48d380f26a8b7cb1619bd9fc62a6c060 to your computer and use it in GitHub Desktop.
Save lucassmacedo/48d380f26a8b7cb1619bd9fc62a6c060 to your computer and use it in GitHub Desktop.
Missing Values Python
train_male = train_dataset[train_dataset['Sex'] == 'male']['Age'].median()
train_female = train_dataset[train_dataset['Sex'] == 'female']['Age'].median()
train_dataset.loc[train_dataset['Sex'] == 'male', ['Age']] = train_male;
train_dataset.loc[train_dataset['Sex'] == 'female', ['Age']] = train_female;
# Keep test dataset in sync
test_male = train_dataset[train_dataset['Sex'] == 'male']['Age'].median()
test_female = train_dataset[train_dataset['Sex'] == 'female']['Age'].median()
test_dataset.loc[test_dataset['Sex'] == 'male', ['Age']] = test_male;
test_dataset.loc[test_dataset['Sex'] == 'female', ['Age']] = test_female;
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment