- Calculate the percentage of missing values in each column and sort them in descending order.
- Missing values and outliers are not problems to be fixed! They are facts.
- During EDA you must not “fix” them because you have to deal with your data and problem as it is.
- If you see missing values, just report them.
- Identify and understand your target variable.
- Understand the type of the target variable: binary, categorical, or numeric.
- Examine the distribution of the target variable.
- For a binary variable (which needs to be converted into 0s and 1s if it is in string format), the mean (a proportion of 1s) is simply used.
- For a categorical variable, value counts are used.
- For a numeric variable, a histogram or a pandas' describe table is used.
The proposal you’re about to read is not just a proposal. We have a working implementation of almost everything we discussed here. We encourage you to checkout and build our branch: our fork, with the relevant branch selected. Building and using the implementation will give you a better understanding of what using it as a developer is like.
Our implementation ended up differing from the proposal on some minor points. As our last action item before making a PR, we’re writing documentation on what we did. While I loathe pointing to tests in lieu of documentation, they will be helpful until we complete writing docs: the unit tests.
This repo also contains a bundled version of npm that has a new command, asset
. You can read the documentation for and goals of that comma
// we have a geoJSON file (in this case for Guinea) on the web: | |
var guinea = 'https://raw.githubusercontent.com/johan/world.geo.json/master/countries/GIN.geo.json' | |
// and we have a function that does something with the geoJSON file: | |
var handleGeoJSON = function (data) { | |
// data is the JSON parsed into a JS object | |
console.log(data) | |
} | |
// using jquery (you must have loaded jQuery first) - http://api.jquery.com/jquery.getjson |
Kyle Simpson (@getify) wrote a very thoughtful post decrying optimize-js, which is a tool I wrote that exploits known optimizations in JavaScript engines to make JS bundles parse faster (especially minified bundles, due to what could be reasonably described as a bug in Uglify).
Kyle lays out a good case, but I tend to disagree with nearly all his points. So here's my rebuttal.
A complete list of RxJS 5 operators with easy to understand explanations and runnable examples.
console.log('Hello LabJack') | |
var ljn = require('labjack-nodejs'); | |
var NanoTimer = require('nanotimer') | |
var fs = require('fs') | |
var now = require('performance-now') | |
var path = require('path') | |
var mkdirp = require('mkdirp').mkdirp |
# create docker-machines and use dvm to switch client binaries. | |
docker-machine create \ | |
--driver=virtualbox \ | |
--virtualbox-boot2docker-url=https://github.com/boot2docker/boot2docker/releases/download/v1.10.2/boot2docker.iso \ | |
1.10.2-dev | |
docker-machine create \ | |
--driver=virtualbox \ | |
--virtualbox-boot2docker-url=https://github.com/boot2docker/boot2docker/releases/download/v1.9.1/boot2docker.iso \ |
// hyperscript for jQuery | |
// create nested HTML elements with a DSL | |
// used to create reusable, interactive HTML components | |
// | |
// based on the many implentations out there like | |
// https://github.com/dominictarr/hyperscript | |
// https://github.com/Matt-Esch/virtual-dom/tree/master/virtual-hyperscript | |
// and Elm https://github.com/evancz/elm-html | |
var $h = function(element, properties, content) { | |
var $component = $('<' + element + '>'); |
#!/bin/bash | |
# Assumes a pre-built carina cluster, e.g.: | |
# | |
# carina create --wait shipyard \ | |
# && carina credentials shipyard \ | |
# && eval `carina env shipyard` | |
# |
class Helpers { | |
currentHP(db) { return db.PlayerInfo.CurrHP } | |
noCurrentHPGain(db) { return db.PlayerInfo.CurrentHPGain == 0.0 } | |
radiation(db) { return db.PlayerInfo.TotalDamages[5].Value } | |
aidItems(db) { return db.Inventory['48'] } | |
radiationMoreThan(rads) { | |
return db => { | |
this.radiation(db) > rads; | |
}; |