Validation artifact for the ORG donor build used in policyengine-us-data.
Files:
org_validation_2024.ipynb: executed notebook
Key annual results:
- donor rows:
119,237 - weighted mean hourly wage:
$35.17 - weighted hourly wage quantiles:
p10 $14.40,p50 $25.00,p90 $62.50,p99 $222.37 - weighted
is_paid_hourly:55.32% - weighted
is_union_member_or_covered:11.01%
State union-rate pattern also looks directionally right:
- low states: NC
2.96%, SD3.88%, SC4.11%, GA4.31%, AR4.40% - high states: HI
26.71%, NY21.67%, AK19.27%, WA17.64%, NJ17.59%
Main caveat:
- the high hourly-wage tail is still worth watching; values above
$100/hrare largely driven by reconstructing hourly wage as weekly earnings divided by hours when no direct hourly rate is reported.