benmarwick’s gists

benmarwick / tektite-maps-sea-vietnam.R

Created October 16, 2021 06:33

Tektite maps for southeast Asia & Vietnam


	library(tidyverse)
	library(sf)
	library(googlesheets4)

	# get our data from google sheets, we need to:
	# - 'publish to web'
	# - adjust sharing settings to share with anyone
	# I've done these, now, so it should just work:
	my_key <- "1xUqRGnb9kwBi128cERiHkV5w4uREwl-mqAVf0jmQhhg"

benmarwick / plotting-archaeology-papers-with-R-code.R

Last active June 23, 2021 18:49

plotting archaeology papers with R code, from https://github.com/benmarwick/ctv-archaeology/blob/master/README.md

	# also at https://gist.github.com/benmarwick/f11ae49ab9afde0071b133012ff76cbc

	ctv <- "https://raw.githubusercontent.com/benmarwick/ctv-archaeology/master/README.md"

	library(tidyverse)
	library(glue)

	archy_ctv_readme <- readLines(ctv)

	# get just the articles

benmarwick / viralarchive.Rmd

Created June 1, 2021 16:51

Object recognition in Images in #viralarchive tweets



	I used the Python library GetOldTweets3 to get the tweets because the rtweet package cannot get tweets older than 6-9 days. Details about this Python library are here: https://github.com/Mottl/GetOldTweets3

	I used this line in the shell to get tweets using the #viralarchive hashtag:

	```{bash, engine.opts="-l", eval = F}
	GetOldTweets3 --querysearch 'viralarchive' --maxtweets 10000
	```

benmarwick / ggplot-to-jpg-set-dpi.R

Created December 19, 2020 08:16

How to save a ggplot as a JPG file with specific dimensions and a high dpi

	# How to save a ggplot as a JPG file with a
	# specific dpi and dimensions, for example,
	# because a publisher requires it

	library(ggplot2)

	p <-
	ggplot(mtcars) +
	aes(mpg,
	disp) +

benmarwick / dsm-course-checking.R

Created July 21, 2020 00:45


	library(tidyverse)
	library(rvest)

	# what quarter and year do we want to check for the availability of DSM courses?
	qrt_year <- "AUT2020"

	# this is the URL to our canonical list of DSM courses
	webpage <- "https://www.washington.edu/uaa/advising/single-pages/data-science-minor/"

benmarwick / .github-workflows-main.yml

Last active November 7, 2022 23:22

GitHub workflow to render all Rmd files in a GitHub repo, e.g. for testing student assignments

	# from https://github.com/cboettig/compendium/blob/master/.github/workflows/main.yml
	on: [push]

	name: render all R Markdown documents

	jobs:
	render:
	name: render all R Markdown documents
	runs-on: macOS-latest
	steps:

benmarwick / inspect-rmd-diff-from-last-commit.R

Last active May 23, 2025 03:30

GitHub doesn't show rich diffs for Rmd files. That can make collaborative writing tough. Here's how to see rich diffs of two commits of a single R Markdown document on a GitHub repo or local Git repo

	# another method
	# remotes::install_github("ropenscilabs/reviewer")
	browseURL(reviewer::diff_rmd("analysis/paper/paper.qmd",
	# this gets the sha of the previous commit
	git2r::commits(n=2)[[2]]$sha)$raw)

benmarwick / Tehrani_Collard_2002.R

Created December 30, 2019 21:32

Replicating Tehrani & Collard 2002


	# paste from http://www.ceacb.ucl.ac.uk/ceacb_files/misc/Tehrani_Collard_2002.pdf
	# edit to get each textile design on one line
	raw_input <-
	c("Ersari 1 0 1 0 1 1 0 1 0 1 1 0 1 1 100000011010000001011001111101011000011111100110111011011011011011110000000 0
	Saryk 1 0 1 0 1 1 1 1 1 1 1 0 1 0 100000011010111101010100001111100000010101010110011011100000001011100000000 0
	Salor 1 0 1 1 0 1 0 1 0 1 0 1 0 0 011110110101000001011000001011110111010101000100111101111101001101100000101 0
	PSDP Tekke 1 1 0 0 0 1 0 1 1 0 1 0 0 0 011000011111111111011010001010100000010101101101000000000000000000001101000 0
	SDP Tekke 1 1 0 0 0 1 0 1 1 0 0 1 0 0 000011111001110011111010000000000010100001100000000000001100101010001011111 1
	Yomut 0 0 0 0 0 1 0 1 1 0 0 0 1 1 000000000000101100000001100000011000010100000000000000000000000000000000000 0")

benmarwick / gist:cffb82fb6d9192354d495a2e0ded4e55

Last active September 13, 2019 00:25

Plot the population of Han Chinese in Taiwan from Wikipedia


	library(tidyverse)


	pop_han_in_tw <- tribble(
	~year, ~population,
	1684, 120000,
	1764, 666210,
	1782, 912920,
	1811, 1944737,

benmarwick / inspect-rmd-diff-from-last-commit.R

Last active February 22, 2023 20:43

How to compare the current commit of an RMarkdown file with the most recent commit. Useful when we merge a pull request and want to check again how it has changed the document

	# reviewer pkg comes from https://ropenscilabs.github.io/reviewer/index.html
	# remotes::install_github("ropenscilabs/reviewer")

	browseURL(reviewer::diff_rmd("path/to/my-document.Rmd",
	# this gets the sha of the previous commit
	git2r::commits(n=2)[[2]]$sha)$raw)
	# result will open in a web browser

Ben Marwick benmarwick