@MaxGhenis
Last active March 31, 2026 12:01
ORG validation for policyengine-us-data PR

ORG Validation: 2024 Annual Donor Build

Validation artifact for the ORG donor build used in policyengine-us-data.

Files:

  • org_validation_2024.ipynb: executed notebook

Key annual results:

  • donor rows: 119,237
  • weighted mean hourly wage: $35.17
  • weighted hourly wage quantiles: p10 $14.40, p50 $25.00, p90 $62.50, p99 $222.37
  • weighted is_paid_hourly: 55.32%
  • weighted is_union_member_or_covered: 11.01%
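For readers who want to spot-check figures like these, here is a minimal sketch of weighted means, quantiles, and shares in plain Python. It is not the notebook's code: the function names and toy data are hypothetical, and the quantile rule used here (the smallest value at which the cumulative weight share reaches q) is only one of several conventions, so results can differ slightly from other implementations.

```python
def weighted_mean(values, weights):
    """Weighted average; also gives a weighted share when values are 0/1."""
    total = sum(weights)
    return sum(v * w for v, w in zip(values, weights)) / total


def weighted_quantile(values, weights, q):
    """Smallest value whose cumulative weight share reaches q (0 < q <= 1)."""
    pairs = sorted(zip(values, weights))
    total = sum(w for _, w in pairs)
    cumulative = 0.0
    for value, weight in pairs:
        cumulative += weight
        if cumulative >= q * total:
            return value
    return pairs[-1][0]


# Toy data for illustration only, not taken from the donor file.
wages = [12.0, 20.0, 25.0, 40.0, 150.0]
weights = [2.0, 3.0, 3.0, 1.5, 0.5]
paid_hourly = [1, 1, 0, 0, 0]  # hypothetical stand-in for is_paid_hourly

mean_wage = weighted_mean(wages, weights)
median_wage = weighted_quantile(wages, weights, 0.5)
p90_wage = weighted_quantile(wages, weights, 0.9)
hourly_share = weighted_mean(paid_hourly, weights)
```

Binary flags such as is_paid_hourly reduce to a weighted mean of 0/1 values, which is why one helper covers both the wage mean and the percentage rows.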

The state-level union-rate pattern also looks directionally right:

  • low states: NC 2.96%, SD 3.88%, SC 4.11%, GA 4.31%, AR 4.40%
  • high states: HI 26.71%, NY 21.67%, AK 19.27%, WA 17.64%, NJ 17.59%
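A state ranking like the one above can be reproduced with a grouped weighted rate. The sketch below assumes each donor row reduces to a (state, covered flag, weight) tuple; that layout, the function name, and the toy rows are hypothetical and are not taken from the notebook.

```python
from collections import defaultdict


def weighted_rate_by_group(rows):
    """Weighted share of flagged rows per group, from (group, flag, weight) tuples."""
    flagged = defaultdict(float)
    total = defaultdict(float)
    for group, flag, weight in rows:
        total[group] += weight
        if flag:
            flagged[group] += weight
    return {g: flagged[g] / total[g] for g in total if total[g] > 0}


# Toy rows for illustration only.
rows = [
    ("NC", False, 900.0),
    ("NC", True, 100.0),
    ("HI", True, 300.0),
    ("HI", False, 700.0),
    ("NY", True, 200.0),
    ("NY", False, 800.0),
]

rates = weighted_rate_by_group(rows)
ranked = sorted(rates, key=rates.get)  # lowest union rate first
```

Sorting by rate and reading both ends of the list gives the low-state and high-state groups.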

Main caveat:

  • the high hourly-wage tail is still worth watching; values above $100/hr come largely from records with no directly reported hourly rate, where the hourly wage is reconstructed as weekly earnings divided by hours.
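The caveat can be checked by tagging each wage with its source and measuring how much of the weighted mass above $100/hr comes from reconstructed values. This is a rough sketch under assumed field names (reported_hourly_rate, weekly_earnings, usual_weekly_hours, weight), all of which are hypothetical; the actual build may use different variables and edits.

```python
def hourly_wage(record):
    """Return (wage, source): the reported rate if present, else earnings / hours."""
    rate = record.get("reported_hourly_rate")
    if rate is not None:
        return rate, "reported"
    hours = record.get("usual_weekly_hours")
    earnings = record.get("weekly_earnings")
    if not hours or earnings is None:
        return None, "missing"  # zero or absent hours: wage cannot be derived
    return earnings / hours, "derived"


def high_tail_share_by_source(records, threshold=100.0):
    """Weighted share of wages above threshold, split by wage source."""
    mass = {}
    for record in records:
        wage, source = hourly_wage(record)
        if wage is not None and wage > threshold:
            mass[source] = mass.get(source, 0.0) + record["weight"]
    total = sum(mass.values())
    return {source: m / total for source, m in mass.items()} if total else {}


# Toy records for illustration only.
records = [
    {"reported_hourly_rate": 22.0, "weekly_earnings": 880.0,
     "usual_weekly_hours": 40, "weight": 1500.0},
    {"reported_hourly_rate": None, "weekly_earnings": 2500.0,
     "usual_weekly_hours": 10, "weight": 800.0},   # derived: $250/hr
    {"reported_hourly_rate": None, "weekly_earnings": 1600.0,
     "usual_weekly_hours": 40, "weight": 1200.0},  # derived: $40/hr
    {"reported_hourly_rate": 120.0, "weekly_earnings": 4800.0,
     "usual_weekly_hours": 40, "weight": 200.0},
    {"reported_hourly_rate": None, "weekly_earnings": 900.0,
     "usual_weekly_hours": 0, "weight": 500.0},    # cannot be derived
]

shares = high_tail_share_by_source(records)
```

Low reported hours combined with ordinary weekly earnings produce very high derived wages, as in the second toy record, which is one way a tail like this can arise.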