Skip to content

Instantly share code, notes, and snippets.

@cavedave
cavedave / football.ipynb
Created February 23, 2025 13:43
Football.ipynb
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
@cavedave
cavedave / eggs.py
Created February 12, 2025 11:41
usa egg prices
import requests
import pandas as pd
def download_data(url, filename):
"""Downloads data from a given URL and saves it as a CSV file.
Args:
url: The URL of the data to download.
filename: The name of the CSV file to save the data to.
@cavedave
cavedave / walks.ipynb
Last active February 23, 2025 13:53
walks.ipynb
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
@cavedave
cavedave / ukincome.ipynb
Last active January 29, 2025 13:35
ukincome.ipynb
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
@cavedave
cavedave / dictionary_bat.txt
Last active December 28, 2023 18:03
MAke map of etymology of the word bat in european languages. Code from Most names gotten from here https://www.reddit.com/media?url=https%3A%2F%2Fi.redd.it%2Fapnha37a0fk51.jpg Some gotten from wiktionary https://en.wiktionary.org/wiki/yarasa code a slightly modified version of https://github.com/dd52/mapMakeR/tree/master/etymologyMaps I am sure …
abk, ?, grey
ara, day blind, lightyellow
aze, night bird, orange
bel, leather one, turquoise
bos, blind mouse, lightgreen
bre, blind mouse, lightblue
bul, sticking one, blue
cat, winged rat, yellow
cau, ?, grey
ces, night flyer, lightgreen
@cavedave
cavedave / heapslaw.py
Created October 21, 2023 18:35
heaps law graph to draw
import matplotlib.pyplot as plt
unique = []
check = []
k = 50 #These need to be estimated for particular languages but this is a base estimate
b=0.4 #
for x in range(1, 100000, 100):
unique.append(k*(x**b))
check.append(x)
@cavedave
cavedave / parseCroidhePdf.py
Created August 12, 2023 16:39
Code to parse a pdf of the only Irish-Irish dictionary of the 20th century. Croidhe Cainnte Chiarraighe. Foclóir Gaeilge-Gaeilge (pdf) 1942. PDF is at https://www.forasnagaeilge.ie/wp-content/uploads/2016/06/8fddae92ae307b022d964ebe73d45df6.pdf . I took a few pages using https://smallpdf.com/split-pdf to speed up experiments but that can be done…
import pdfplumber
def is_bold_font(char):
return "bold" in char['fontname'].lower()
def parse_pdf_to_dictionary(pdf_path):
term_definition_dict = {}
term = ""
next_term = ""
definition = ""
@cavedave
cavedave / EuclidTrans.csv
Last active September 27, 2022 22:38
Graph of Euclid translations. That shows when they were translated. Data from wikipedia https://en.wikipedia.org/wiki/Euclid%27s_Elements#Translations
Year Language taxis iso
1505 Latin 1 va
1543 Italian 1 it
1557 Latin 1 va
1558 German 1 de
1562 German 1 de
1562 German 1 de
1564 French 1 fr
1570 English 1 gb
1572 Latin 1 va
objects <-c('Truman','Eisenhower','Kennedy','Johnson','Nixon','Ford','Carter','Reagan','Bush','Clinton','G. W. Bush','Obama','Trump','Biden')
#'FDR', 0.796,
performance <- c(0.319,0.165,0.206,0.172,0.170,0.188,0.219,0.130,0.113,0.105,0.099,0.094,0.15,0.165)
color<-c('#0015BC', '#E9141D', '#0015BC', '#0015BC', '#E9141D', '#E9141D', '#0015BC', '#E9141D', '#E9141D', '#0015BC', '#E9141D', '#0015BC','#E9141D','#0015BC')
df<- data.frame(objects, performance, color)
#lock in order
library(dplyr)
library(lubridate)
library(tidyverse)
gridwatch <-
read_csv(
file = "gridwatch.csv")
#get date int data format