Skip to content

Instantly share code, notes, and snippets.

View jatrost's full-sized avatar

Jason Trost jatrost

View GitHub Profile
@jatrost
jatrost / Extract Text from Podcasts and Youtube.md
Last active November 2, 2024 02:42
Various scripts / tools for extracting text from podcasts and youtube videos

Setup

pip install -r requirements.txt

Configuration

Add your podcast URLs to download_podcasts.py

@jatrost
jatrost / README.md
Last active July 12, 2020 15:20
Hacky DMARC parse validator using opendmarc's parser. Derived from opendmarc-1.3.2/libopendmarc/tests/test_dmarc_parse.c

dmarc_parse_validator

Running this hacky program

wget https://sourceforge.net/projects/opendmarc/files/latest/download
tar zxvf opendmarc-1.3.2.tar.gz 
cd opendmarc-1.3.2/
./configure && make
cd libopendmarc/
domain azure_networks_size azure_network_labels domain_alexa_rank
pico.com 327680 [] 846370.0
globalenglish.com 65536 [] 162102.0
asprs.org 65536 [] 527470.0
palador.com 65536 ['Azure '] 605307.0
companycasuals.com 32772 ['Azure '] 302877.0
nti.nl 32772 ['Azure '] 402995.0
ncoi.nl 32772 ['Azure '] 480688.0
sanmar.com 32772 ['Azure '] 51652.0
whoi.edu 8192 [] 31387.0
We can't make this file beautiful and searchable because it's too large.
domain,aws_networks_size,aws_network_labels,domain_alexa_rank
eleconomista.com.mx,524481,"['EC2 (us-west-2)', 'EC2 (us-east-1)', 'EC2 (us-west-1)']",19348.0
elempresario.mx,524480,"['EC2 (us-east-1)', 'EC2 (us-west-1)']",356551.0
dynamic.ooo,197638,"['AMAZON (us-east-1)', 'EC2 (eu-central-1)', 'EC2 (us-east-1)', 'EC2 (eu-west-3)']",234341.0
kfc.com.sg,148487,"['EC2 (us-east-1)', 'AMAZON (us-east-1)', 'EC2 (ap-southeast-1)']",292883.0
hawaii-arukikata.com,132096,"['EC2 (ap-northeast-1)', 'AMAZON (us-east-1)']",41560.0
telefilm.ca,131076,"['EC2 (ca-central-1)', 'EC2 (sa-east-1)', 'EC2 (us-east-1)']",330027.0
groupondev.com,131074,"['EC2 (eu-west-1)', 'EC2 (us-east-1)']",163954.0
rounds.com,131072,[],345183.0
ifgoiano.edu.br,131072,['EC2 (eu-central-1)'],128773.0
We can make this file beautiful and searchable if this error is corrected: Unclosed quoted field in line 8.
domain,gcp_networks_size,gcp_network_labels,domain_alexa_rank
yext.com,1921536,"['Google Cloud (us-east1)', 'Google Cloud (europe-west6)', 'Google Cloud (europe-west1)', 'Google Cloud (us-west4)', 'Google Cloud (us-central1)', 'Google Cloud (asia-northeast1)']",77965.0
kiwitaxi.com,1899520,"['Google Cloud (us-east1)', 'Google Cloud (global)', 'Google Cloud (europe-west1)', 'Google Cloud (europe-west6)', 'Google Cloud (us-west4)', 'Google Cloud (us-central1)', 'Google Cloud (asia-northeast1)']",342429.0
yadayadamarketing.com,722,['Google Cloud (us-west1)'],614803.0
lauradoyle.org,722,['Google Cloud (us-west1)'],215697.0
amymyersmd.com,535,"['Google Cloud (us-east1)', 'Google Cloud (northamerica-northeast1)', 'Google Cloud (us-west1)', 'Google Cloud (us-central1)']",52695.0
fitfatherproject.com,534,"['Google Cloud (us-east1)', 'Google Cloud (northamerica-northeast1)', 'Google Cloud (us-west1)', 'Google Cloud (us-central1)']",97402.0
harklinikken.com,534,"['Google Cloud (us-east1)', 'Google Cloud (northamerica-n
domain gcp_networks_size gcp_network_labels includes
urbandictionary.com 22 ['Google Cloud (us-east1)', 'Google Cloud (northamerica-northeast1)', 'Google Cloud (us-central1)'] ['spf.messagingengine.com', 'servers.mcsv.net', 'helpscoutemail.com', 'shops.shopify.com']
crunchyroll.com 22 ['Google Cloud (us-east1)', 'Google Cloud (northamerica-northeast1)', 'Google Cloud (us-central1)'] ['servers.mcsv.net', 'amazonses.com', '_spf.google.com', 'sendgrid.net', 'spf.mandrillapp.com', 'stspg-customer.com', 'shops.shopify.com']
9gag.com 22 ['Google Cloud (us-east1)', 'Google Cloud (northamerica-northeast1)', 'Google Cloud (us-central1)'] ['servers.mcsv.net', 'spf2.jotform.com', '_spf.google.com', 'spf1.jotform.com', 'sendgrid.net', 'shops.shopify.com']
bellesa.co 22 ['Google Cloud (us-east1)', 'Google Cloud (northamerica-northeast1)', 'Google Cloud (us-central1)'] ['servers.mcsv.net', 'spf.protection.outlook.com', 'shops.shopify.com']
We can make this file beautiful and searchable if this error is corrected: Unclosed quoted field in line 6.
domain,gcp_networks_size,gcp_network_labels,includes
sealedair.com,22,"['Google Cloud (us-east1)', 'Google Cloud (northamerica-northeast1)', 'Google Cloud (us-central1)']","['spf.protection.outlook.com', 'shops.shopify.com', 'sendgrid.net']"
mattel.com,22,"['Google Cloud (us-east1)', 'Google Cloud (northamerica-northeast1)', 'Google Cloud (us-central1)']","['spf_hrc.mattel.com', 'spf.protection.outlook.com', 'o365.connector.mattel.com', 'spf.mandrillapp.com', '_spf.centercode.com', 'shops.shopify.com']"
leggett.com,22,"['Google Cloud (us-east1)', 'Google Cloud (northamerica-northeast1)', 'Google Cloud (us-central1)']","['spf-jpmorgan.leggett.com', 'spf-0003de01.pphosted.com', 'spf-lp.leggett.com', 'spfc1.leggett.com', 'spf-sf.leggett.com', 'shops.shopify.com']"
sands.com,22,"['Google Cloud (us-east1)', 'Google Cloud (northamerica-northeast1)', 'Google Cloud (us-central1)']","['spf5.sands.com', 'spf4.sands.com', 'spf2.sands.com', 'mktomail.com', 'spf1.sands.com', 'spf3.sands.com']"
unfi.com,22,"['Google Cloud
We can make this file beautiful and searchable if this error is corrected: Unclosed quoted field in line 8.
domain,aws_networks_size,aws_network_labels,includes
primevideo.com,19795,"['AMAZON (eu-west-1)', 'AMAZON (us-west-2)', 'AMAZON (us-east-1)']","['amazonses.com', 'spf2.amazon.com', 'spf1.amazon.com']"
imdb.com,19795,"['AMAZON (eu-west-1)', 'AMAZON (us-west-2)', 'AMAZON (us-east-1)']",['amazon.com']
stackexchange.com,1926,"['EC2 (us-west-2)', 'EC2 (us-east-1)']",['_spf1.stackexchange.com']
stackoverflow.com,1926,"['EC2 (us-west-2)', 'EC2 (us-east-1)']",['_spf1.stackoverflow.com']
booking.com,1555,"['EC2 (eu-west-1)', 'EC2 (us-east-1)', 'AMAZON (us-east-1)']","['mail.zendesk.com', 'mailgun.org', '_spf.messagegears.net', 'spf.logicalware.com', 'sendgrid.net', '_spf.booking.com', 'spf-0032a203.pphosted.com']"
hotels.com,1541,"['EC2 (eu-west-1)', 'EC2 (us-east-1)', 'AMAZON (us-east-1)', 'EC2 (ap-southeast-1)']","['_spf.egencia.com', '_spf.expedia.com', 'musvc.com', '_spf.messagegears.net', 'amazonses.com']"
zoom.us,1153,"['EC2 (us-west-2)', 'EC2 (us-east-1)', 'AMAZON (us-east-1)']","['servers.mcsv.net', 'amazonses
We can make this file beautiful and searchable if this error is corrected: Unclosed quoted field in line 8.
domain,aws_networks_size,aws_network_labels,includes
amazon.com,19795,"['AMAZON (eu-west-1)', 'AMAZON (us-west-2)', 'AMAZON (us-east-1)']","['amazonses.com', 'spf2.amazon.com', 'spf1.amazon.com']"
wesco.com,1926,"['EC2 (eu-west-1)', 'EC2 (us-west-2)', 'EC2 (us-east-1)']","['quickbase.com', 'act-on.net', 'spf.protection.outlook.com', '594404.spf05.hubspotemail.net']"
acuitybrands.com,1925,['EC2 (us-east-1)'],"['2327376.spf05.hubspotemail.net', 'mail.zendesk.com', 'et._spf.pardot.com', 'spf.protection.outlook.com', 'email.prnewswire.com']"
lansingtradegroup.com,1924,['EC2 (us-east-1)'],['spf1.lansingtradegroup.com']
anixter.com,1924,['EC2 (us-east-1)'],"['spf_c.oraclecloud.com', 'spf.protection.outlook.com', 'mktomail.com', 'spf.mandrillapp.com', '410209.spf07.hubspotemail.net']"
ametek.com,1536,"['EC2 (eu-west-1)', 'EC2 (us-east-1)', 'AMAZON (us-east-1)']","['spf4.ametek.com', 'et._spf.pardot.com', 'spf3.ametek.com', 'spf1.ametek.com', 'spf2.ametek.com']"
bms.com,1082,"['EC2 (us-west-2)', 'EC2 (ap-southeast-1)
domain azure_networks_size azure_network_labels includes
microsoft.com 17 ['Azure '] ['_spf1-meo.microsoft.com', 'spf-a.hotmail.com', '_spf-a.microsoft.com', '_spf-c.microsoft.com', '_spf-b.microsoft.com', '_spf-ssg-a.microsoft.com']