foebu

Apache Spark installation + ipython/jupyter notebook integration guide for macOS

Tested with Apache Spark 2.1.0, Python 2.7.13 and Java 1.8.0_112

For older versions of Spark and ipython, please, see also previous version of text.

Install Java Development Kit

Let's make an Asthma Choropleth using open data, ogr2ogr, folium and pandas (and indirectly leaflet.js, d3.js, GeoJSON, open street maps and moar) and talk about using gist and blocks..

get UK Asthma data http://customer.instantatlas.com/INHALE/dataviews/ for this example I use dataView12_17.csv

get some CCG map boundaries

wget 'https://geoportal.statistics.gov.uk/Docs/Boundaries/Clinical_commissioning_groups_(Eng)_Apr_2013_Boundaries_(Generalised_Clipped).zip'

make CCG map boundaries into GeoJSON format

This Gist has been moved to https://github.com/lbgists/audio-spectrum-matplotlib.

	#!/bin/bash
	set -x -e

	# install pip & tmux
	sudo yum -y install python27-pip tmux

	# install rmate
	sudo gem install rmate

	# setup variables

	#!/bin/bash

	# Script for installing tmux on systems where you don't have root access.
	# tmux will be installed in $HOME/local/bin.
	# It's assumed that wget and a C/C++ compiler are installed.

	# exit on error
	set -e

	TMUX_VERSION=1.8

	//
	// Regular Expression for URL validation
	//
	// Author: Diego Perini
	// Created: 2010/12/05
	// Updated: 2018/09/12
	// License: MIT
	//
	// Copyright (c) 2010-2018 Diego Perini (http://www.iport.it)
	//

	#See: http://daringfireball.net/2010/07/improved_regex_for_matching_urls
	import re, urllib

	GRUBER_URLINTEXT_PAT = re.compile(ur'(?i)\b((?:https?://\|www\d{0,3}[.]\|[a-z0-9.\-]+[.][a-z]{2,4}/)(?:[^\s()<>]+\|\(([^\s()<>]+\|(\([^\s()<>]+\)))\))+(?:\(([^\s()<>]+\|(\([^\s()<>]+\)))\)\|[^\s`!()\[\]{};:\'".,<>?\xab\xbb\u201c\u201d\u2018\u2019]))')

	for line in urllib.urlopen("http://daringfireball.net/misc/2010/07/url-matching-regex-test-data.text"):
	print [ mgroups[0] for mgroups in GRUBER_URLINTEXT_PAT.findall(line) ]