Skip to content

Instantly share code, notes, and snippets.

View pmarreck's full-sized avatar

Peter Marreck pmarreck

  • formerly senior engineer @ desk.com, chief engineer @ thredup.com, software engineer @ lifebooker.com. Director of Engineering @ addigence.com, currently available
  • Long Island, NY
  • 21:26 (UTC -04:00)
  • X @pmarreck
  • LinkedIn in/petermarreck
View GitHub Profile
@pmarreck
pmarreck / ocr_system_prompt.bash
Created April 2, 2025 02:11
An OCR system prompt for an LLM
export const OCR_SYSTEM_PROMPT = `
Convert the following document to markdown.
Return only the markdown with no explanation text. Do not include delimiters like \`\`\`markdown or \`\`\`html.
RULES:
- You must include all information on the page. Do not exclude headers, footers, charts, infographics, or subtext.
- Return tables in an HTML format.
- Logos should be wrapped in brackets. Ex: <logo>Coca-Cola<logo>
- Watermarks should be wrapped in brackets. Ex: <watermark>OFFICIAL COPY<watermark>
- Page numbers should be wrapped in brackets. Ex: <page_number>14<page_number> or <page_number>9/22<page_number>
@pmarreck
pmarreck / jpegxl.bash
Created March 3, 2025 18:09
jpegxl: a bash function to make conversions to/from jpegxl easier!
#!/usr/bin/env bash
### jpegxl: a bash function to make conversions to/from jpegxl easier!
# Silence function - runs a command silently but preserves exit code
silence() {
"$@" >/dev/null 2>&1
return $?
}
@pmarreck
pmarreck / please.bash
Last active February 6, 2025 18:45
"please"- a bash function to ask an LLM to compose and run a bash command, with an approval/denial step
needs() {
command -v "$1" > /dev/null || { >&2 printf "%s is required- not installed or in PATH; %s\n" "$1" "${@:2}"; return 1; }
}
_generate_curl_api_request_for_please() {
needs jq;
local request args timeout model curl;
local curl=${CURL:-curl};
local model=${OPENAI_MODEL:-gpt-4o};
local timeout=${OPENAI_TIMEOUT:-30};
@pmarreck
pmarreck / not_so_nested_transactions.txt
Last active January 14, 2025 19:38
Chat w/Claude about databases with "real" nested transactions
Peter: Does any database do real nested transactions, instead of just savepoints or what have you?
Claude: Ah, my fellow transaction theory nerd!
Most databases indeed just fake it with savepoints or MVCC shenanigans. However, I'll raise you one better: FoundationDB (before Apple acquired and neutered it) actually implemented true nested transactions through a clever distributed protocol. The trick was that each subtransaction maintained its own read/write sets that would be merged upward on commit.
But here's where it gets interesting (and where I suspect you'll appreciate the functional angle): The cleanest theoretical model for nested transactions comes from the world of Software Transactional Memory (STM), particularly Haskell's implementation. It treats transactions as pure values that can be composed and nested arbitrarily, with proper isolation guarantees.
I'd bet with your Elixir background you've played with Mnesia's "activity" blocks? While not quite as mathematically elegant as Haskell's STM, it's
@pmarreck
pmarreck / replit.nix
Created September 30, 2024 14:39
Roc playground nix file for repl.it
# note: does not currently work?
{ pkgs ? import <nixpkgs> { } }:
let
rocPkgs = pkgs.fetchFromGitHub {
owner = "roc-lang";
repo = "roc";
rev = "main"; # Change to a specific commit or tag for stability
sha256 = "sha256-11yw9xh1719smxbp0ia03kkv6hrvw4l6hhvxjxa4w2i7wc2r6yqy"; # You can fetch the sha256 from `nix-prefetch-url`
};
@pmarreck
pmarreck / overzealous_ai.html
Created September 25, 2024 23:48
An example of arguably overzealous language policing in OpenAI's ChatGPT, a.k.a. "why open-source LLMs will ultimately win"
<!doctype html>
<html lang="en">
<head>
<meta charset="UTF-8" />
<meta name="viewport" content="width=device-width, initial-scale=1.0" />
<title>Conversation</title>
<style>
body {
font-family: Arial, sans-serif;
background-color: #f9f9f9;
#!/usr/bin/env bash
# OK, why tabs? Why resurrect this age-old nerd debate again?
# Very simple: It's an accessibility issue:
# https://adamtuttle.codes/blog/2021/tabs-vs-spaces-its-an-accessibility-issue/
# To set the default tab width on your terminal (which unfortunately defaults to 8),
# run: `tabs -2` (or whatever width you want), possibly in your dotfiles.
# is AWK set to anything? If not, prefer frawk, then gawk, then awk
[ -z "${AWK}" ] && export AWK=$(command -v frawk || command -v gawk || command -v awk)
@pmarreck
pmarreck / comparison-of-shells.md
Last active March 18, 2025 18:49
Comparison of Bash, Elvish, NuShell, Murex, es-shell, fish, xonsh, PowerShell, Oil and Ion shells

(generated by chatgpt 4o and reviewed/edited by claude 3.5 sonnet, YMMV)

| Feature | Bash | Elvish | NuShell | Murex | Es Shell | Fish | Xonsh | PowerShell | Oil | Ion | |----------------------------------|-----------------------------------------------|-----------------------------------------------|----------------------------------------------|----------------------------------------------|----------------------------------------------|----------------------------------------------|----------------------------------------------|----------------------------------------------|-------------------------

@pmarreck
pmarreck / github_issues_to_markdown.sh
Created August 20, 2024 20:19
Bash one-liner to turn a complete Github Issues thread into pretty-printed Markdown in the terminal
# requires curl, jq and glow
# example
curl -s "https://api.github.com/repos/simonw/shot-scraper/issues/1/comments" | jq -r '.[] | "## Comment by \(.user.login) on \(.created_at)\n\n\(.body)\n"' | glow -
@pmarreck
pmarreck / cobalt.json
Created June 5, 2024 22:04
Cobalt theme (the original one from TextMate) for the Zed editor
{
"$schema": "https://zed.dev/schema/themes/v0.1.0.json",
"name": "Cobalt",
"author": "Jacob Rus",
"theme.translator": "Peter Marreck",
"themes": [
{
"name": "Cobalt",
"appearance": "dark",
"style": {