ylogx/xgboost_incremental.ipynb

Last active July 18, 2025 20:17

XGBoost Incremental Learning

xgboost_incremental.ipynb

marymlucas commented Jul 14, 2023 (edited)

Disregard, I figured it out. I was using handle_unknown='ignore' in OneHotEncoder, but one of the features has too few samples of a particular category, hence the mismatch.

Thank you for this gist. How can we implement this in a pipeline?

I am unable to test on the Boston dataset as it has been removed from sklearn, but on a different dataset I get a mismatch in the number of columns. Even though I use the same pipeline, the saved model seems to have one fewer feature than the new training data, and I am unable to figure out why.
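
For readers hitting the same column mismatch, here is a minimal sketch of one way it can arise. It assumes the encoder is refit on a batch in which a rare category happens to be absent; the comment does not confirm that this was the exact cause. The toy `batch_1` and `batch_2` arrays and the colour categories are made up for illustration. Passing an explicit `categories` list to `OneHotEncoder` is one possible way to keep the encoded width stable across batches.

```python
import numpy as np
from sklearn.preprocessing import OneHotEncoder

# Hypothetical toy batches: the rare category "blue" is absent from batch_2.
batch_1 = np.array([["red"], ["green"], ["red"], ["blue"]], dtype=object)
batch_2 = np.array([["red"], ["green"], ["green"]], dtype=object)

# Refitting the encoder on each batch yields a different number of columns,
# because the categories are inferred from whatever data is seen during fit.
width_1 = OneHotEncoder(handle_unknown="ignore").fit_transform(batch_1).shape[1]
width_2 = OneHotEncoder(handle_unknown="ignore").fit_transform(batch_2).shape[1]

# Declaring the categories up front fixes the output width, even when a
# category does not occur in the batch being encoded.
fixed = OneHotEncoder(categories=[["blue", "green", "red"]], handle_unknown="ignore")
fixed_width_1 = fixed.fit_transform(batch_1).shape[1]
fixed_width_2 = fixed.fit_transform(batch_2).shape[1]

print(width_1, width_2, fixed_width_1, fixed_width_2)
```

An alternative with the same effect is to fit the encoder once on the first batch, persist it together with the model, and only call `transform` on later batches.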

Jason2Brownlee commented May 25, 2024

Great example!

Few people know that xgboost is able to perform incremental learning by adding boosting rounds.
