Skip to content

Instantly share code, notes, and snippets.

View usirin's full-sized avatar

Umut Sirin usirin

View GitHub Profile
@usirin
usirin / research.md
Created April 14, 2026 23:01
Research: Programmatic VoiceOver Control on iOS Simulator

Programmatic VoiceOver Control on iOS Simulator

Executive Summary

VoiceOver can be automated on the iOS Simulator, but no single tool or API provides an end-to-end solution. This research investigated five dimensions of the problem: enabling/disabling VoiceOver, sending navigation commands, reading focus state, existing frameworks, and React Native's accessibility APIs. The core finding is that a viable automation stack exists by combining xcrun simctl (VoiceOver lifecycle), AppleScript keystroke injection (navigation), and the macOS AXUIElement API (tree inspection), but reading VoiceOver cursor position remains the hardest unsolved problem. No existing framework (Apple, Google, Deque, or open-source) automates VoiceOver itself. Every tool in the ecosystem validates accessibility metadata, not screen reader behavior. This gap is real and unaddressed.

Recommended architecture: A three-layer system combining (1) defaults write + launchctl for VoiceOver lifecycle control, (2) osascript keyst

@usirin
usirin / research.md
Created April 14, 2026 00:15
Research: Promotion Evidence Tracking with AI Harnesses

Promotion Evidence Tracking with AI Harnesses

Executive Summary

This research investigates how engineers can use AI coding agents (Claude Code, Cursor, and similar tools) to systematically track promotion evidence, with specific focus on the Senior-to-Staff (P3-to-P4) transition and neurodivergent-friendly design.

Four findings cut across all subtasks:

  1. The claim-evidence-impact chain is universal. Every promotion framework, from Julia Evans' brag doc to Dropbox's six-dimension rubric, converges on the same structure: state a claim, back it with concrete evidence, quantify impact. The formats differ; the underlying pattern does not.
@usirin
usirin / research.md
Created April 12, 2026 22:37
Research: XState Nested and Parallel State Machine Patterns for Workflow Orchestration

title: "XState Nested and Parallel Patterns for Workflow Orchestration" date: 2026-04-12 type: research status: complete parent: kampus/xstate-nested-parallel-patterns cross-references:

  • kampus/parallelism-preserving-ui-for-agents/research-v2-engineer-forward.md
  • kampus/ds-as-claude-md/research.md
  • kampus/claude-code-parallel-primitives/research.md
@usirin
usirin / research.md
Last active April 12, 2026 21:04
Research: Claude Code Parallel Execution Primitives for Operator Agent Enhancement
title Parallel Execution Primitives for Claude Code Operator Agents
date 2026-04-12
project kampus
feature claude-code-parallel-primitives
type research
status complete

Parallel Execution Primitives for Claude Code Operator Agents

@usirin
usirin / framing-memo.md
Created April 12, 2026 17:55
Agentic DS Framework — framing memo (working doc)
title Agentic DS Framework
project discord
domain agentic-ds
status active
timing Q2 2026
context CTO-down initiative on DS role in agentic era
research kampus/ds-as-claude-md
@usirin
usirin / research-v2-mixed-audience.md
Created April 12, 2026 04:36
Parallelism-Preserving UI for Agentic Work — mixed-audience research survey
title Parallelism-Preserving UI for Agentic Work
date 2026-04-11
type research
status draft
variant mixed-audience

What This Document Is

@usirin
usirin / research.md
Last active April 11, 2026 19:12
Interaction Models and Design Systems in Agentic Programming
title Interaction Models and Design Systems in Agentic Programming
date 2026-04-08
type research
status complete

Executive Summary

Agentic programming tools encode fundamentally different assumptions about who the human is and what they need. A terminal chat treats the developer as a collaborator reviewing diffs. A background agent treats the developer as a manager reviewing pull requests. A Figma MCP integration treats the designer as a spatial thinker who should never leave the canvas. The most important finding across this research is that agentic tooling works best when it meets each role in its native medium rather than collapsing everyone into a single interface. This is not a convenience preference. Cognitive science research demonstrates that spatial reasoning, direct manipulation, and visual judgment are load-bearing cognitive processes for designers [^43][^44], just as sequential textual reasoning is for developers. Forcing role collapse disables the cognitive machinery that makes

@usirin
usirin / knowledge.json
Created March 30, 2026 02:53
a11y fix-agent: manifest + knowledge (2026-03-29)
{
"fix_patterns": [
{
"id": "contrast_fix_text_token_category_mismatch",
"description": "Replace a background/surface semantic color token with a text color token where the property controls text rendering, causing near-zero contrast in light mode",
"wcag_rules": [
"1.4.3"
],
"platforms": [
"android"

Harness Audit — 2026-03-25

Skills before: 38 | Skills after: 27 active + 13 archived Shared rules created: 7 | Scripts created: 3 Model Assumptions added: 27/27 active skills New skills created: 2 (vault-quarterly, vault-half-review) — pending /skill-creator refinement

What Changed

Archived 13 skills (1,651 lines removed from active set)

@usirin
usirin / SKILL.md
Created March 25, 2026 23:53
harness-audit: on-demand skill suite auditor for Claude Code
name harness-audit
description Audit the skill suite to find complexity that's no longer justified by current model capabilities. Run on-demand after model releases, when skills feel like they're getting in the way, or for periodic maintenance. Use when user says "harness audit", "audit skills", "audit my harness", "simplify skills", "prune skills", "are my skills still needed", "what can I simplify", "skill review", "clean up skills", or mentions that a new model just dropped and they want to check if their setup needs updating.

Harness Audit

Evaluate whether each skill's complexity is still justified. Every skill encodes assumptions about what the model can't do on its own. Those assumptions go stale as models improve. This skill finds the stale ones.

Three phases: audit (read-only report) → propose (show changes one at a time) → execute (apply approved changes only).