Skip to content

Instantly share code, notes, and snippets.

@timwis
Created March 23, 2016 22:15
Show Gist options
  • Save timwis/74c99c6344d44b0612bb to your computer and use it in GitHub Desktop.
Save timwis/74c99c6344d44b0612bb to your computer and use it in GitHub Desktop.
1. PARCEL
<type 'unicode'>
Nulls: False
Unique values: 579912
Max length: 9
2. LOC
<type 'unicode'>
Nulls: False
Unique values: 547180
5 most frequent values:
2401 PENNSYLVANIA AVE: 790
1420 LOCUST ST: 595
2018-32 WALNUT ST: 578
224-30 W RITTENHOUSE SQ: 544
2001 HAMILTON ST: 541
Max length: 25
3. OWNER1
<type 'unicode'>
Nulls: True
Unique values: 432838
5 most frequent values:
CITY OF PHILA: 5684
PHILA HOUSING AUTHORITY: 5330
REDEVELOPMENT AUTHORITY: 2255
CITY OF PHILADELPHIA: 1007
NEIGHBORHOOD RESTORATIONS: 555
Max length: 25
4. OWNER2
<type 'unicode'>
Nulls: True
Unique values: 151398
5 most frequent values:
OF PHILADELPHIA: 1513
DEPT OF PUBLIC PROP: 1031
DEPT OF PUBLC PROP: 934
OF PHILA: 801
DEPT PUB PROP: 720
Max length: 25
5. CENSUS
<type 'int'>
Nulls: False
Min: 0
Max: 801
Sum: 107002989
Mean: 184.515907586
Median: 178.0
Standard Deviation: 110.371542766
Unique values: 370
5 most frequent values:
169: 5845
362: 5713
41: 5504
39: 4903
363: 4815
6. ZIP
<type 'unicode'>
Nulls: True
Unique values: 43854
5 most frequent values:
191500000: 2313
191380000: 2096
191190000: 1856
191200000: 1790
191210000: 1339
Max length: 9
7. WD_GEO
<type 'unicode'>
Nulls: True
Unique values: 66
5 most frequent values:
39: 17458
40: 17392
21: 17341
05: 17321
58: 16079
Max length: 4
8. ST_CD
<type 'int'>
Nulls: False
Min: 11020
Max: 89790
Sum: 31425320779
Mean: 54189.8094521
Median: 54840.0
Standard Deviation: 24560.3028592
Unique values: 3399
5 most frequent values:
51380: 2982
73960: 2668
22820: 2420
34960: 2174
87830: 2118
9. HOUSE_NO
<type 'int'>
Nulls: False
Min: 1
Max: 73308
Sum: 1870969556
Mean: 3226.29908676
Median: 2422.0
Standard Deviation: 3051.22109655
Unique values: 12413
5 most frequent values:
2401: 956
2101: 929
1100: 886
2018: 847
1420: 829
10. SUFF
<type 'unicode'>
Nulls: True
Unique values: 7
5 most frequent values:
R: 2117
2: 530
A: 151
S: 55
L: 33
Max length: 4
11. UNIT
<type 'unicode'>
Nulls: True
Unique values: 8697
5 most frequent values:
A: 1856
B: 1582
1: 545
2: 533
C: 513
Max length: 7
12. EXT
<type 'int'>
Nulls: True
Min: 2
Max: 99
Sum: 799135
Mean: 29.9346344022
Median: 29.0
Standard Deviation: 17.7203653364
Unique values: 98
5 most frequent values:
30: 1142
32: 936
18: 932
36: 921
10: 833
13. RCD_DT
<type 'unicode'>
Nulls: True
Unique values: 21689
5 most frequent values:
1943-01-01: 1326
2005-12-30: 403
1997-12-30: 352
2007-09-24: 337
1998-12-23: 336
Max length: 10
14. SALE_DATE
<type 'unicode'>
Nulls: True
Unique values: 23754
5 most frequent values:
1943-01-01: 1326
2015-12-09: 429
2005-12-21: 428
1985-12-02: 352
2007-09-20: 329
Max length: 10
15. SALE_PR
<type 'int'>
Nulls: False
Min: 0
Max: 948729100
Sum: 155473852358
Mean: 268099.043231
Median: 28000.0
Standard Deviation: 11248431.5384
Unique values: 20028
5 most frequent values:
1: 178187
3: 8276
30000: 3993
25000: 3960
50000: 3941
16. UNF
<type 'unicode'>
Nulls: True
Values: 2, *, U
17. ASSMT_DT
<type 'unicode'>
Nulls: True
Unique values: 39
5 most frequent values:
2016-03: 484815
2013-01: 75890
2014-03: 3275
2013-03: 2632
2013-02: 1436
Max length: 10
18. MV_DT
<type 'unicode'>
Nulls: True
Unique values: 159
5 most frequent values:
2013-01: 393521
2015-03: 125278
2016-03: 21310
2014-03: 8343
2013-03: 7135
Max length: 7
19. MV
<type 'int'>
Nulls: False
Min: 0
Max: 690706200
Sum: 134287503636
Mean: 231565.31273
Median: 110300.0
Standard Deviation: 2678110.26388
Unique values: 15165
5 most frequent values:
0: 1697
22500: 1649
15000: 1175
57500: 840
45000: 826
20. TX_LND
<type 'int'>
Nulls: False
Min: 0
Max: 104027000
Sum: 21472397285
Mean: 37026.9925178
Median: 13164.0
Standard Deviation: 285428.691019
Unique values: 83510
5 most frequent values:
0: 29312
1500: 1009
2700: 984
3000: 912
12800: 818
21. TX_BLDG
<type 'int'>
Nulls: False
Min: 0
Max: 216929600
Sum: 67420982638
Mean: 116260.71307
Median: 69786.0
Standard Deviation: 1114845.32502
Unique values: 162907
5 most frequent values:
0: 75657
13500: 468
27000: 421
66390: 279
96390: 274
22. XMPT_LND
<type 'int'>
Nulls: False
Min: 0
Max: 688030200
Sum: 11118066045
Mean: 19171.9882413
Median: 0.0
Standard Deviation: 1601280.91601
Unique values: 11292
5 most frequent values:
0: 548624
2700: 251
2800: 235
2600: 214
2400: 197
23. XMPT_BLDG
<type 'int'>
Nulls: False
Min: 0
Max: 403136200
Sum: 34276057669
Mean: 59105.6189025
Median: 0.0
Standard Deviation: 1555494.53246
Unique values: 24419
5 most frequent values:
0: 320667
30000: 199325
15000: 1780
27000: 599
22500: 461
24. CAT_CD
<type 'int'>
Nulls: True
Min: 1
Max: 6
Sum: 940427
Mean: 1.62167470526
Median: 1
Standard Deviation: 1.45078554658
Unique values: 6
5 most frequent values:
1: 459358
6: 45735
2: 41159
3: 14654
4: 14646
25. BLDG_CD
<type 'unicode'>
Nulls: False
Unique values: 813
5 most frequent values:
O30: 178825
R30: 100567
SR: 37937
O50: 28168
K30: 15752
Max length: 5
26. ZONE
<type 'unicode'>
Nulls: True
Unique values: 52
5 most frequent values:
RSA5: 245337
RM1: 141292
RSA3: 64553
CMX2: 25923
RSA2: 12998
Max length: 5
27. SITE_TYP
<type 'unicode'>
Nulls: True
Unique values: 8
5 most frequent values:
A: 274001
B: 39629
D: 661
C: 406
E: 231
Max length: 4
28. FRT
<type 'int'>
Nulls: False
Min: 0
Max: 432026008
Sum: 706182695
Mean: 1217.74113141
Median: 16.0
Standard Deviation: 588045.431919
Unique values: 1323
5 most frequent values:
15: 108552
16: 106471
14: 83039
0: 34490
18: 30951
29. DPT
<type 'int'>
Nulls: False
Min: 0
Max: 3571920
Sum: 66671506
Mean: 114.968315882
Median: 77.0
Standard Deviation: 6267.90813943
Unique values: 1311
5 most frequent values:
100: 36152
0: 34531
90: 20794
60: 17378
70: 17284
30. SHP
<type 'unicode'>
Nulls: True
Unique values: 7
5 most frequent values:
E: 533091
A: 37157
B: 6315
C: 1408
D: 66
Max length: 4
31. TOT_AREA
<type 'int'>
Nulls: False
Min: 0
Max: 207694080
Sum: 3158803683
Mean: 5447.03969395
Median: 1280.0
Standard Deviation: 361518.76386
Unique values: 22665
5 most frequent values:
0: 34312
700: 5401
960: 4203
1600: 3942
900: 3928
32. TOP
<type 'unicode'>
Nulls: True
Unique values: 7
5 most frequent values:
F: 504960
A: 30670
E: 4877
B: 252
C: 190
Max length: 4
33. GRG_TYP
<type 'unicode'>
Nulls: True
Unique values: 7
5 most frequent values:
0: 351236
A: 147597
F: 23942
C: 19253
B: 9398
Max length: 4
34. GRG_SP
<type 'int'>
Nulls: False
Min: 0
Max: 95
Sum: 199545
Mean: 0.344095311013
Median: 0.0
Standard Deviation: 0.919003229754
Unique values: 66
5 most frequent values:
0: 405125
1: 159937
2: 13349
3: 599
4: 320
35. OFF_ST
<type 'int'>
Nulls: False
Min: 0
Max: 99
Sum: 89691
Mean: 0.154663121301
Median: 0.0
Standard Deviation: 1.6987566742
Unique values: 94
5 most frequent values:
0: 542832
1: 28797
2: 3919
3: 809
4: 781
36. VIEW
<type 'unicode'>
Nulls: True
Unique values: 9
5 most frequent values:
I: 525442
A: 15542
C: 7478
0: 5193
D: 4100
Max length: 4
37. OTR_BLDG
<type 'unicode'>
Nulls: True
Values: Y, 0, N
38. STORIES
<type 'int'>
Nulls: False
Min: 0
Max: 61
Sum: 907980
Mean: 1.56572031619
Median: 2.0
Standard Deviation: 1.562963718
Unique values: 58
5 most frequent values:
2: 334526
0: 168136
3: 56600
1: 13780
4: 4351
39. GEN_CONST
<type 'unicode'>
Nulls: True
Unique values: 11
5 most frequent values:
A: 446308
B: 28969
E: 11679
C: 8459
F: 8270
Max length: 4
40. TYP_DWELL
<type 'unicode'>
Nulls: True
Values: A, B, 2, C, D
41. DT_EXT_COND
<type 'unicode'>
Nulls: True
Unique values: 2913
5 most frequent values:
2000-01-01: 7463
2014-11-13: 6047
2012-03-14: 5443
2012-05-22: 4935
2012-04-10: 4368
Max length: 10
42. EXT_COND
<type 'int'>
Nulls: True
Min: 0
Max: 7
Sum: 2106330
Mean: 3.64498935747
Median: 4.0
Standard Deviation: 1.27374640358
Unique values: 7
5 most frequent values:
4: 440871
0: 46372
2: 30369
5: 25640
3: 20987
43. QLT_GRD
<type 'int'>
Nulls: True
Min: 1
Max: 6
Sum: 77515
Mean: 3.6223655311
Median: 4
Standard Deviation: 0.579464449647
Unique values: 6
5 most frequent values:
4: 12334
3: 8611
6: 296
5: 90
2: 52
44. YR_BUILT
<type 'unicode'>
Nulls: True
Unique values: 263
5 most frequent values:
1925: 118002
1920: 83089
1950: 44660
1915: 34710
1940: 24022
Max length: 4
45. EST_YR_BUILT
<type 'unicode'>
Nulls: True
Values: Y, 0, N
46. NO_RM
<type 'int'>
Nulls: False
Min: 0
Max: 89
Sum: 2127574
Mean: 3.66878767813
Median: 6.0
Standard Deviation: 3.09683784221
Unique values: 45
5 most frequent values:
6: 267437
0: 231955
7: 41982
4: 12692
5: 9820
47. NO_BD
<type 'int'>
Nulls: False
Min: 0
Max: 93
Sum: 1125377
Mean: 1.94059960822
Median: 3.0
Standard Deviation: 1.65969525583
Unique values: 53
5 most frequent values:
3: 278831
0: 217726
4: 46219
2: 28581
5: 4195
48. NO_BATH
<type 'int'>
Nulls: False
Min: 0
Max: 90
Sum: 410289
Mean: 0.707502172743
Median: 1.0
Standard Deviation: 0.710882298044
Unique values: 32
5 most frequent values:
1: 326745
0: 216748
2: 29225
3: 5525
4: 1190
49. BASMT
<type 'unicode'>
Nulls: True
Unique values: 11
5 most frequent values:
D: 125881
F: 68351
H: 64659
C: 17738
0: 10363
Max length: 4
50. FIRE
<type 'int'>
Nulls: False
Min: 0
Max: 8
Sum: 16940
Mean: 0.0292113286154
Median: 0.0
Standard Deviation: 0.226752503551
Unique values: 7
5 most frequent values:
0: 566993
1: 10583
2: 1335
3: 589
5: 256
51. TYP_HEAT
<type 'unicode'>
Nulls: True
Unique values: 8
5 most frequent values:
H: 134613
A: 72021
B: 55247
G: 4056
C: 2575
Max length: 4
52. FUEL
<type 'unicode'>
Nulls: True
Unique values: 7
5 most frequent values:
A: 5375
C: 263
B: 246
G: 30
E: 12
Max length: 4
53. CNT_AIR
<type 'unicode'>
Nulls: True
Values: Y, 0, 4, N
54. INT_COND
<type 'int'>
Nulls: True
Min: 0
Max: 7
Sum: 2106897
Mean: 3.65000034648
Median: 4.0
Standard Deviation: 1.26993911007
Unique values: 7
5 most frequent values:
4: 441810
0: 45555
2: 31538
5: 25384
3: 19124
55. UTLY
<type 'unicode'>
Nulls: True
Values: A, C, B, D
56. SEW
<type 'unicode'>
Nulls: True
Values: Y, 0, N
57. SEP_UTS
<type 'unicode'>
Nulls: True
Values: A, 0, C, B, N
58. TOT_LIV_AREA
<type 'int'>
Nulls: False
Min: 0
Max: 2500000
Sum: 1285041717
Mean: 2215.92537661
Median: 1224.0
Standard Deviation: 16601.9202367
Unique values: 13506
5 most frequent values:
0: 46718
1200: 10470
1120: 9768
1152: 7836
1260: 6988
59. BK_PG
<type 'unicode'>
Nulls: True
Unique values: 458327
5 most frequent values:
0340344: 319
1347625: 316
0888093: 293
1778196: 239
2355371: 202
Max length: 7
60. REG_NO
<type 'unicode'>
Nulls: True
Unique values: 564787
5 most frequent values:
001S181793: 414
8S2 377: 407
008N020026: 289
33S5 ETC: 260
1S12 S/O 1: 216
Max length: 15
61. CROSS_REF
<type 'unicode'>
Nulls: True
Unique values: 82900
5 most frequent values:
881124650: 587
881070500: 504
881131330: 495
054082400: 428
881023500: 405
Max length: 9
Row count: 579912
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment