Skip to content

Instantly share code, notes, and snippets.

View drorata's full-sized avatar

Dror Atariah drorata

View GitHub Profile
class cap-shape cap-surface cap-color bruises odor gill-attachment gill-spacing gill-size gill-color stalk-shape stalk-root stalk-surface-above-ring stalk-surface-below-ring stalk-color-above-ring stalk-color-below-ring veil-type veil-color ring-number ring-type spore-print-color population habitat
p x s n t p f c n k e e s s w w p w o p k s u
e x s y t a f c b k e c s s w w p w o p n n g
e b s w t l f c b n e c s s w w p w o p n n m
p x y w t p f c n n e e s s w w p w o p k s u
e x s g f n f w b k t e s s w w p w o e n a g
e x y y t a f c b n e c s s w w p w o p k n g
e b s w t a f c b g e c s s w w p w o p k n m
e b y w t l f c b n e c s s w w p w o p n s m
p x y w t p f c n p e e s s w w p w o p k v g
@drorata
drorata / notebook.ipynb
Created May 10, 2017 11:45
Retrieving lists from a CSV file using pandas
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
@drorata
drorata / foo.ipynb
Last active May 16, 2017 06:03
Example of scatter plots using Bokeh
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
@drorata
drorata / example.sql
Last active May 23, 2017 07:52
Ratio when using group by
-- Compute the ratios of each type of status from the overall count
-- After re-learning it over and over again, it is time to document this useful snippet.
-- I hope my future-self will find it.
SELECT status,
c,
c / cast((sum(c) over ()) AS float) AS ratio
FROM
(SELECT status,
count(*) AS c
@drorata
drorata / foo.ipynb
Created May 26, 2017 09:29
Comparing numpy arrays and pandas data frames
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
@drorata
drorata / foo.ipynb
Created June 2, 2017 10:46
Apply vs. row-iteration
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
@drorata
drorata / foo.ipynb
Created June 16, 2017 08:40
Minimal example how to apply different preprocessing steps to different features
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
@drorata
drorata / StandartizeFloatCols.py
Last active June 23, 2017 08:40
Applying transformations on subset of columns
import pandas as pd
import sklearn
from sklearn.preprocessing import StandardScaler
class GetDummiesCatCols(sklearn.base.BaseEstimator, sklearn.base.TransformerMixin):
"""Replace `cols` with their dummies (One Hot Encoding).
`cols` should be a list of column names holding categorical data.
Furthermore, this class streamlines the implementation of one hot encoding

Hello World

@drorata
drorata / sign-counts.sql
Created July 6, 2017 08:07
Count positives and negatives of a value
--- A query that count the number of positive/zero/negatives of the column 'value'
--- and group it by the year-month
WITH t AS
( SELECT value,
YEAR,
MONTH
FROM my_table AS l
)
SELECT YEAR,
MONTH,