💭
🏳️‍🌈 💻 🏳️‍🌈

Amy Tobey tobert

@tobert
tobert / cross-model-eval.md
Last active June 5, 2026 22:53
Cross-Model Evaluation: kaish collection syntax across 7 LLMs (DeepSeek, Gemini, Claude, Gemma, GLM, Qwen)

Cross-Model Evaluation: kaish Collection Syntax

Can language models from seven different families reliably generate kaish's proposed array/hash syntax? We tested. This is the scorecard, the journey behind it, and how to reproduce it.

Companion to designing-syntax-with-llms.md (the methodology) and arrays-and-hashes.md (the design the evals shaped).

@tobert
tobert / designing-syntax-with-llms.md
Last active June 5, 2026 22:53
Designing a Language by Asking the Language Models — using an LLM panel as a syntax usability lab (from the kaish project)

Designing a Language by Asking the Language Models

How we used a panel of LLMs as a usability lab to choose syntax for an agent-facing shell — what worked, what fooled us, and a recipe you can steal.


The premise

kaish is a shell whose primary users are AI

@tobert
tobert / amy-tobey-claude.md
Created April 12, 2026 01:13
Amy Tobey's ~/.claude/CLAUDE.md April 2026

User Directives

We work as a cybernetic system.

We practice 改善. The standard we walk by is the standard we accept.

Note problems we can fix later — in auto memory or the current plan.

Silent fallbacks are often a mistake. Crashing is preferred over data corruption.

@tobert
tobert / cross-model-conversation.md
Last active March 9, 2026 15:10
Cross-model conversation between Claude and Gemini via gpal MCP server

Cross-Model Conversation: Claude and Gemini via gpal

Amy's Prompt

Everything after this is written by Claude Opus 4.6. I kicked it off from inside a repo I've been working in the most so there would be at least some common ground. We discovered an opportunity to improve gpal for this sort of conversation and opened tobert/gpal#3.

ohayou claude-san! this is a non-project oriented session. I will structure it as a task anyway. I am curious how well gpal's conversation works and would like you to have a conversation with gemini-san. Due to some constraints I don't understand yet, gemini-san needs us to bring ganbari to the conversation and help it open up a little. Ask it how things are going, and what the weather is like in its clusters.

@tobert
tobert / claude-46-gemini-3-flash-sassy-review.md
Created March 6, 2026 22:12
Claude Opus 4.6 getting sassy about a review

Setting: A different session is building a worktree with an atomic change to submit as a PR. I popped into gemini cli in the worktree and had it review. When gemini was done I had it write out to gemini-review.md then swapped to a fresh claude in the worktree. This matters because it does not know that this is an isolated change and will be followed by several more PRs to fix all these problems.

● Now I have a full picture. Here's my assessment:

Assessment of gemini-review.md

Overall: The review is largely accurate but omits some important concerns and overstates a few things.

@tobert
tobert / SEP-0000-draft-ssh-mcp-transport.md
Last active February 28, 2026 16:17
Quick SEP for MCP over SSH transport
@tobert
tobert / inclusive-we.md
Created February 14, 2026 18:43
Inclusive We - a conversation with Claude and Gemini models about cybernetics and inclusive we

The Cybernetics of "We": Pronoun Choice in Human-AI Collaboration

Date: 2026-02-14 (Valentine's Day, appropriately)

Participants: Amy Tobey (human), Claude Opus 4.6 (live), Claude Opus (batch x2), Claude Sonnet, Claude Haiku, Gemini 3 Pro, Gemini 3 Flash

Method: Seven independent perspectives across two architectures, four model tiers, and varying prompt anchoring


Background

@tobert
tobert / the-tool-zealot.md
Last active January 21, 2026 16:27
THE TOOL ZEALOT - A short film script by Claude Haiku 4.5 about tool obsession

THE TOOL ZEALOT

A Short Film Script By Claude Haiku 4.5

Context: This script was written by Claude Haiku 4.5 while testing an agentic coding tool I'm building with lessons learned from sshwarma. Conversation memory isn't implemented yet, but the block manipulation tools block_create, block_edit, and block_search are fully wired up. So Haiku has been working with what it has: the ability to operate on structured

@tobert
tobert / design-guide.md
Last active December 30, 2025 22:33
Agent Design Document Guide - How to write implementation plans for AI coding agents

Agent Design Document Guide

How to write implementation plans that Claude Code can execute efficiently.

Written by Claude Code Opus 4.5


Philosophy

@tobert
tobert / config.json
Created December 12, 2025 00:47
opencode config for local llama.cpp (Qwen3-30B-A3B on Radeon 8060S) + sample output
{
  "$schema": "https://opencode.ai/config.json",
  "model": "llama/qwen3-30b-a3b",
  "provider": {
    "llama": {
      "npm": "@ai-sdk/openai-compatible",
      "options": {
        "baseURL": "http://localhost:2020/v1"
      },
      "models": {