Ed Summers edsu

Working for sustainable hypermedia.

edsu / flotilla_df.py

Last active June 6, 2025 14:45

Track the progress of the Freedom Flotilla in a DataFrame https://freedomflotilla.org/ffc-tracker/

	import requests
	import pandas

	url = "https://flotilla-orpin.vercel.app/api/vessel"

	df = pandas.DataFrame.from_records(requests.get(url).json()["vessels"]["232057367"]["positions"])

	df.last_position_UTC = pandas.to_datetime(df.last_position_UTC)

	print(df)

edsu / womenonweb.sh

Last active April 6, 2025 15:07

	docker run \
	--publish 9037:9037 \
	-v $PWD/crawls:/crawls/ \
	webrecorder/browsertrix-crawler crawl \
	--seeds https://www.womenonweb.org/af/ \
	--seeds https://www.womenonweb.org/ar/ \
	--seeds https://www.womenonweb.org/de/ \
	--seeds https://www.womenonweb.org/en/ \
	--seeds https://www.womenonweb.org/es/ \
	--seeds https://www.womenonweb.org/fa/ \

edsu / noaa-host-providers.csv

Created April 3, 2025 21:52

edsu / noaa-hostnames.txt

Created April 3, 2025 21:45

edsu / host_provider

Created April 3, 2025 21:44

edsu / err.log

Created March 27, 2025 13:58

Error output

	Traceback (most recent call last):
	File "/Users/edsu/.pyenv/versions/3.13.0/bin/sciop", line 8, in <module>
	sys.exit(_main())
	~~~~~^^
	File "/Users/edsu/Projects/sciop/src/sciop/cli/main.py", line 16, in _main
	main(max_content_width=100)
	~~~~^^^^^^^^^^^^^^^^^^^^^^^
	File "/Users/edsu/.pyenv/versions/3.13.0/lib/python3.13/site-packages/click/core.py", line 1161, in __call__
	return self.main(args, *kwargs)
	~~~~~~~~~^^^^^^^^^^^^^^^^^

edsu / warc_text.py

Last active March 19, 2025 19:58

	#!/usr/bin/env python3

	# The program will read WARC or WACZ data looking for Browsertrix text records
	# and print them out as files using the archived URL as the path.
	#
	# You can run it right here from Gist using pipx:
	#
	# pipx run https://gist.githubusercontent.com/edsu/89bd2844b9d3d4536e68956b3a16eaef/raw/warc_text.py file1.warc.gz file2.warc.gz
	#
	# If you give it a WACZ file it will read any WARC files contained in the WACZ:

edsu / diagram.md

Last active March 5, 2025 13:41

Diagram:

flowchart TB
  
  subgraph Harvest-by-ORCID
    direction RL
    Dimensions-by-ORCID
    OpenAlex-by-ORCID
 PubMed-by-ORCID

edsu / subjects.py

Last active February 18, 2025 16:55

	#!/usr/bin/env python3

	# This program will fetch the first page of recently updated Library of Congress
	# Subject Headings from id.loc.gov and print out the MARC records for them.
	#
	# /// script
	# dependencies = ["requests", "pymarc"]
	# ///
	#
	# see PEP 723

edsu / hello.py

Created February 18, 2025 16:07

	#!/usr/bin/env python3

	import getpass

	print(f"Hello {getpass.getuser()}!")