- Calculate the percentage of missing values in each column and sort them in descending order.
- Missing values and outliers are not problems to be fixed! They are facts.
- During EDA you must not “fix” them because you have to deal with your data and problem as it is.
- If you see missing values, just report them.
- Identify and understand your target variable.
- Understand the type of the target variable: binary, categorical, or numeric.
- Examine the distribution of the target variable.
- For a binary variable (which needs to be converted into 0s and 1s if it is in string format), the mean (a proportion of 1s) is simply used.
- For a categorical variable, value counts are used.
- For a numeric variable, a histogram or a pandas' describe table is used.
The proposal you’re about to read is not just a proposal. We have a working implementation of almost everything we discussed here. We encourage you to checkout and build our branch: our fork, with the relevant branch selected. Building and using the implementation will give you a better understanding of what using it as a developer is like.
Our implementation ended up differing from the proposal on some minor points. As our last action item before making a PR, we’re writing documentation on what we did. While I loathe pointing to tests in lieu of documentation, they will be helpful until we complete writing docs: the unit tests.
This repo also contains a bundled version of npm that has a new command, asset. You can read the documentation for and goals of that comma
| // we have a geoJSON file (in this case for Guinea) on the web: | |
| var guinea = 'https://raw.githubusercontent.com/johan/world.geo.json/master/countries/GIN.geo.json' | |
| // and we have a function that does something with the geoJSON file: | |
| var handleGeoJSON = function (data) { | |
| // data is the JSON parsed into a JS object | |
| console.log(data) | |
| } | |
| // using jquery (you must have loaded jQuery first) - http://api.jquery.com/jquery.getjson |
Kyle Simpson (@getify) wrote a very thoughtful post decrying optimize-js, which is a tool I wrote that exploits known optimizations in JavaScript engines to make JS bundles parse faster (especially minified bundles, due to what could be reasonably described as a bug in Uglify).
Kyle lays out a good case, but I tend to disagree with nearly all his points. So here's my rebuttal.
A complete list of RxJS 5 operators with easy to understand explanations and runnable examples.
| console.log('Hello LabJack') | |
| var ljn = require('labjack-nodejs'); | |
| var NanoTimer = require('nanotimer') | |
| var fs = require('fs') | |
| var now = require('performance-now') | |
| var path = require('path') | |
| var mkdirp = require('mkdirp').mkdirp |
| # create docker-machines and use dvm to switch client binaries. | |
| docker-machine create \ | |
| --driver=virtualbox \ | |
| --virtualbox-boot2docker-url=https://github.com/boot2docker/boot2docker/releases/download/v1.10.2/boot2docker.iso \ | |
| 1.10.2-dev | |
| docker-machine create \ | |
| --driver=virtualbox \ | |
| --virtualbox-boot2docker-url=https://github.com/boot2docker/boot2docker/releases/download/v1.9.1/boot2docker.iso \ |
| // hyperscript for jQuery | |
| // create nested HTML elements with a DSL | |
| // used to create reusable, interactive HTML components | |
| // | |
| // based on the many implentations out there like | |
| // https://github.com/dominictarr/hyperscript | |
| // https://github.com/Matt-Esch/virtual-dom/tree/master/virtual-hyperscript | |
| // and Elm https://github.com/evancz/elm-html | |
| var $h = function(element, properties, content) { | |
| var $component = $('<' + element + '>'); |
| #!/bin/bash | |
| # Assumes a pre-built carina cluster, e.g.: | |
| # | |
| # carina create --wait shipyard \ | |
| # && carina credentials shipyard \ | |
| # && eval `carina env shipyard` | |
| # |
| class Helpers { | |
| currentHP(db) { return db.PlayerInfo.CurrHP } | |
| noCurrentHPGain(db) { return db.PlayerInfo.CurrentHPGain == 0.0 } | |
| radiation(db) { return db.PlayerInfo.TotalDamages[5].Value } | |
| aidItems(db) { return db.Inventory['48'] } | |
| radiationMoreThan(rads) { | |
| return db => { | |
| this.radiation(db) > rads; | |
| }; |