Last active
January 27, 2020 14:16
Criteo dataset example
Great code, thanks!
So Dask can help in both cases: reading the original RTB Criteo file or the libsvm format.
Just one short question:
In your code above, does load_svmlight_file read any libsvm-format file, or only the specific svmlight format?
Thanks again for taking care of this.
Yes. Free as in beer (i.e., it doesn't cost money) and free as in speech (the source is freely available).
Yes. Dask-ML is a wrapper around scikit-learn, which has a function for reading libsvm-format files: https://scikit-learn.org/stable/modules/generated/sklearn.datasets.load_svmlight_file.html. It'd be pretty simple to wrap that function with Dask:
This code is untested.
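A minimal sketch of one way that wrapping could look, using `dask.delayed` so each file is only read when the result is computed. The helper name `load_svmlight_lazy` is hypothetical (it is not part of Dask-ML or scikit-learn), and the two tiny synthetic files are stand-ins for real libsvm-format parts of a larger dataset:

```python
import os
import tempfile

import dask
import numpy as np
import scipy.sparse as sp
from sklearn.datasets import dump_svmlight_file, load_svmlight_file


def load_svmlight_lazy(paths, n_features):
    """Build one lazy (X, y) pair per file; no file is read until computed.

    Hypothetical helper for illustration, not an existing Dask-ML function.
    """
    # nout=2 tells Dask that each call returns a 2-tuple (X, y).
    lazy_load = dask.delayed(load_svmlight_file, nout=2)
    # n_features is passed explicitly so every part has the same number of
    # columns, even if a file does not contain the highest feature index.
    return [lazy_load(path, n_features=n_features) for path in paths]


# Demo: write two small synthetic files in svmlight / libsvm format.
rng = np.random.default_rng(0)
tmpdir = tempfile.mkdtemp()
paths = []
for i in range(2):
    X_part = rng.random((5, 4))
    y_part = rng.integers(0, 2, size=5)
    path = os.path.join(tmpdir, f"part-{i}.svm")
    dump_svmlight_file(X_part, y_part, path)
    paths.append(path)

parts = load_svmlight_lazy(paths, n_features=4)  # nothing is read yet
loaded = dask.compute(*parts)  # files are read here, one task per file

# Each part comes back as a SciPy sparse matrix plus a label array.
X = sp.vstack([X_i for X_i, _ in loaded]).tocsr()
y = np.concatenate([y_i for _, y_i in loaded])
print(X.shape, y.shape)
```

This stacks everything in memory at the end, which is fine for a demo but probably not for the full Criteo data; there you would more likely keep the parts lazy and feed them to the model one chunk at a time.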