This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| >> Regression: | |
| dataset: [diamonds], target: [price], by_col: [color], prod(dim(df)): [539400] | |
| model time memory mse R2 | |
| 1: svmRadial 80.8 16.6 MB 403177.8 0.974 | |
| 2: xgb 1.0 4.1 MB 1201304.2 0.924 | |
| 3: glm 0.5 19.9 MB 1585278.7 0.899 | |
| 4: mean_dummy 0.0 0 15767646.5 0.000 | |
| .... |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| # all xgb on every layer were runned with: eta = 0.1, nrounds = 200, lambda = 1 | |
| # glmnet: alpha = 1, nlambda = 100, standardize = T (shifted to mean and scaled) | |
| # ranger: mtry = 8, num.trees = 200 | |
| # glm: standardize = T | |
| >> Regression: | |
| dataset: [diamonds], target: [price], by_col: [color], prod(dim(df)): [539400] | |
| 1-layer models: [ glm_by ] | |
| model time memory mse R2 |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| the main answer: https://github.com/catboost/catboost/issues/247 | |
| another dirty way | |
| Install both 2015 and 2017: http://landinghub.visualstudio.com/visual-cpp-build-tools | |
| A lot of options are in the end of the article https://tech.yandex.com/catboost/doc/dg/concepts/r-installation-docpage/ | |
| Components to pick: https://github.com/catboost/catboost/issues/30#issuecomment-316545310 | |
| Check and edit PATH if needed to include python https://github.com/catboost/catboost/issues/3#issuecomment-316547561 | |
| devtools::find_rtools(); | |
| devtools::install_github('catboost/catboost', subdir = 'catboost/R-package') |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| from https://github.com/bwilbertz/RLightGBM | |
| in git shell: | |
| git clone --recursive https://github.com/bwilbertz/RLightGBM.git | |
| cd RLightGBM | |
| R CMD build --no-build-vignettes pkg/RLightGBM | |
| in RStudio for Windows 10: | |
| install.packages("RLightGBM/RLightGBM_0.1.tar.gz", type = "source", repos = NULL) |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| # please note, the first 2 tests are on the same data! | |
| > devtools::test(filter='Ignor') | |
| Loading stackatto | |
| Testing stackatto | |
| >> Ignoring columns: | |
| dataset [Ionosphere], target [V26], by_col [], n_cells [12K], seed [9171] | |
| layer1 models: [ kknn svmLinear svmRadial glm glmnet rf ranger catboost lgbm xgb ] | |
| Ignored: [ V30 V16 ] |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| #' Edits a function | |
| #' @param fun a function to change | |
| #' @param pattern a string of the code lines to change. Not a regex | |
| #' @param replacement a string of the new lines of code | |
| edit_fun <- function(fun, pattern, replacement) { | |
| stopifnot(length(pattern) == 1 && length(replacement) == 1) | |
| align_spaces <- function(x) gsub(' +', ' ', gsub('\n', ' \n', x)) | |
| deparsed_func <- align_spaces(paste(deparse(fun), collapse = '\n')) | |
| pattern <- align_spaces(pattern) | |
| stopifnot(grepl(pattern, deparsed_func, fixed = T)) |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Edit security group according to https://spark.rstudio.com/examples-emr.html | |
| Pay attention to specification of the actual IP in inbound rules for the security group SSH source. | |
| It is possible to pick source "anywhere" but is not desirable |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| sudo su | |
| sudo su - hadoop | |
| add administrative policies to aws entities (EMR_DefaultRole, EMR_EC2_DefaultRole) |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| devtools::find_rtools() # solution | |
| devtools::install_github('thomasp85/lime') |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| mlr::makeLearner("classif.ranger") | |
| # > Assertion on 'choices' failed. Must be of length >= 1, but has length 0. | |
| # Solution | |
| library(mlr) |