Skip to content

Instantly share code, notes, and snippets.

@mnarayan
Last active November 18, 2025 19:34
Show Gist options
  • Select an option

  • Save mnarayan/e2041c25f50c02a0171535e7274a1ee6 to your computer and use it in GitHub Desktop.

Select an option

Save mnarayan/e2041c25f50c02a0171535e7274a1ee6 to your computer and use it in GitHub Desktop.
NIH Reporter Summary of Biomarker Funding

Summary of Annual NIH Spending on projects about "biomarkers"

Fiscal Year Projects Total Funding Sub Projects Sub Project Funding
2004 1527 $537,186,990 439 $33,041,900
2005 1712 $619,361,134 240 $47,405,035
2006 1618 $647,500,503 321 $71,519,867
2007 551 $274,341,084 204 $39,386,589
2008 736 $357,470,326 233 $54,909,641
2009 3573 $1,679,909,537 1058 $269,653,399
2010 3686 $1,802,420,968 1150 $302,892,042
2011 3808 $1,833,426,793 1192 $312,094,955
2012 4521 $2,497,554,095 1277 $343,086,077
2013 5448 $3,006,613,342 1412 $381,960,225
2014 5929 $3,334,174,978 1473 $428,501,503
2015 6390 $3,639,693,098 1540 $469,152,140
2016 7251 $4,265,378,408 1646 $460,274,229
2017 7680 $4,756,130,009 1807 $531,677,085
2018 8312 $4,983,699,531 2706 $592,967,778
2019 8926 $5,632,362,623 2156 $695,429,562
2020 9481 $6,206,190,895 2145 $763,167,256
2021 9606 $6,656,636,807 2082 $803,386,191
2022 9884 $6,972,313,997 2094 $912,263,118
2023 10225 $7,253,283,985 2109 $942,468,159
2024 6041 $4,083,222,348 1045 $421,077,409
Total 116905 $71,038,871,451 28329 $8,876,314,160
We can make this file beautiful and searchable if this error is corrected: It looks like row 24 should actually have 5 columns, instead of 1 in line 23.
"Fiscal Year","Projects","Total Funding","Sub Projects","Sub Project Funding"
"2004","1527","$537,186,990","439","$33,041,900"
"2005","1712","$619,361,134","240","$47,405,035"
"2006","1618","$647,500,503","321","$71,519,867"
"2007","551","$274,341,084","204","$39,386,589"
"2008","736","$357,470,326","233","$54,909,641"
"2009","3573","$1,679,909,537","1058","$269,653,399"
"2010","3686","$1,802,420,968","1150","$302,892,042"
"2011","3808","$1,833,426,793","1192","$312,094,955"
"2012","4521","$2,497,554,095","1277","$343,086,077"
"2013","5448","$3,006,613,342","1412","$381,960,225"
"2014","5929","$3,334,174,978","1473","$428,501,503"
"2015","6390","$3,639,693,098","1540","$469,152,140"
"2016","7251","$4,265,378,408","1646","$460,274,229"
"2017","7680","$4,756,130,009","1807","$531,677,085"
"2018","8312","$4,983,699,531","2706","$592,967,778"
"2019","8926","$5,632,362,623","2156","$695,429,562"
"2020","9481","$6,206,190,895","2145","$763,167,256"
"2021","9606","$6,656,636,807","2082","$803,386,191"
"2022","9884","$6,972,313,997","2094","$912,263,118"
"2023","10225","$7,253,283,985","2109","$942,468,159"
"2024","6041","$4,083,222,348","1045","$421,077,409"
"Total","116905","$71,038,871,451","28329","$8,876,314,160"
""
""
@mnarayan
Copy link
Author

NIH Project Exporter for 2024 shows the following on re-analysis for projects that use terms in either the project title or project terms.

Projects: 6,936
Total Funding: $4.19B
Average per Project: $604,505

Comparison to Expanded Terms:

Term Set Projects Total Funding Avg/Project
Core (4 terms) 6,936 $4.19B $604,505
Expanded (10 terms) 23,252 $13.5B $580,661

The core term set captures 30% of projects and 31% of funding compared to the expanded set. This suggests the core terms are much more restrictive and focused on explicit biomarker research, while the expanded set includes broader categories like omics and imaging studies.

The --term-set flag is now implemented in scripts/filter_biomarker_projects.py:

  • --term-set core: Uses 4 high-confidence terms (biomarker, clinical marker, surrogate endpoint, imaging marker)
  • --term-set expanded: Uses 10 terms including digital biomarker, endophenotype, genetic marker, clinical+omics, clinical+imaging, intermediate outcome

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment