- input data from the National Practitioner Data Bank (NPDB) here
- example output here
- note the apparent steep decline in medical malpractice cases at the end of the series - this is only because the 2015 data is incomplete!
Takes only about 10 seconds on a modest 2GB RAM machine:
cat NPDB1501.DAT |\
# get year field
cut -c 14-17 |\
# give it a header - this is a fixed-width (character-position) DAT file with no header row
sed '1 i\year' |\
# get the counts of each unique year
sortfreq |\
# swap column positions - chartpipe wants counts to go second
col_swap 1 2 |\
# sort by year - notice this sort keeps header in place
sortkh "-k1 -n" |\
# send TSV over to chartpipe to make a D3 line chart!
chartpipe --format=tsv --type=line
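
If the helper commands above are not installed, the counting part of the pipeline can be roughly approximated with standard tools. This is only a sketch: it assumes `sortfreq` counts occurrences of each unique value, and that `col_swap` and `sortkh` do what the comments above describe. `year_counts` and the `count` header are hypothetical names picked for illustration, not part of the original toolkit.

```shell
# Hypothetical helper: print year<TAB>count rows from a fixed-width DAT file.
# Assumes characters 14-17 of every line hold the year, as in the pipeline above.
year_counts() {
  # write the header first so it never takes part in the sort
  printf 'year\tcount\n'
  # pull out the year, count each unique value, put the year first, sort by year
  cut -c 14-17 "$1" |
    sort |
    uniq -c |
    awk '{ print $2 "\t" $1 }' |
    sort -k1,1n
}
```

The result can then be piped on as before, for example `year_counts NPDB1501.DAT | chartpipe --format=tsv --type=line`.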