Skip to content

Instantly share code, notes, and snippets.

View belisarius222's full-sized avatar

Ted Blackman belisarius222

  • Massachusetts
  • 05:57 (UTC -04:00)
  • X @rovnys
View GitHub Profile
@belisarius222
belisarius222 / 2026-05-28-arda-poc-status-report.html
Created May 28, 2026 18:00
ARDA POC Status Report #pagedrop
<!doctype html>
<html lang="en" data-theme="dark">
<head>
<meta charset="utf-8">
<meta name="viewport" content="width=device-width, initial-scale=1">
<title>ARDA POC Status Report</title>
<link rel="stylesheet" href="https://cdn.jsdelivr.net/npm/@picocss/pico@2/css/pico.min.css">
<style>
:root {
color-scheme: dark;
<!DOCTYPE html>
<html lang="en">
<head>
<meta charset="UTF-8" />
<meta name="viewport" content="width=device-width, initial-scale=1.0" />
<title>End-to-End Hierarchical Memory — A system design for billion-token attention</title>
<link rel="preconnect" href="https://fonts.googleapis.com" />
<link rel="preconnect" href="https://fonts.gstatic.com" crossorigin />
<link href="https://fonts.googleapis.com/css2?family=Instrument+Serif:ital@0;1&family=Newsreader:ital,opsz,wght@0,6..72,300..700;1,6..72,300..700&family=JetBrains+Mono:ital,wght@0,400;0,500;0,700;1,400&display=swap" rel="stylesheet" />
@belisarius222
belisarius222 / imposter_training_next_pagedrop.html
Last active May 4, 2026 18:18
IMPOSTER training next-step recommendation
<!doctype html>
<html lang="en">
<head>
<meta charset="utf-8">
<meta name="viewport" content="width=device-width, initial-scale=1">
<title>IMPOSTER Training: The Next Phase</title>
<meta name="description" content="A paper-faithful plan for the next IMPOSTER training phase, grounded in the Apr 2026 experiment results and the original IMPOSTER thesis.">
<style>
:root {
color-scheme: light;
@belisarius222
belisarius222 / pi-kurtz
Created April 9, 2026 18:13
Chat with imposter-72b on kurtz (Qwen2.5-72B + LoRA via vLLM)
#!/usr/bin/env bash
# Chat with imposter-72b on kurtz (Qwen2.5-72B + LoRA via vLLM)
# Sets up SSH tunnel automatically, tears it down on exit
set -euo pipefail
LOCAL_PORT=8000
TUNNEL_PID=""
cleanup() {
@belisarius222
belisarius222 / imposter-results.html
Last active April 27, 2026 13:03
IMPOSTER training results - Qwen chat-template from-scratch run (Apr 25, 2026); supersedes Apr 9 results. #pagedrop
<!doctype html>
<html lang="en">
<head>
<meta charset="utf-8">
<meta name="viewport" content="width=device-width, initial-scale=1">
<title>IMPOSTER: chat-template run results &amp; span rubric</title>
<link rel="preconnect" href="https://fonts.googleapis.com">
<link rel="preconnect" href="https://fonts.gstatic.com" crossorigin>
<link href="https://fonts.googleapis.com/css2?family=Fraunces:opsz,wght@9..144,600;9..144,800&family=IBM+Plex+Mono:wght@400;600&family=Source+Serif+4:opsz,wght@8..60,400;8..60,600&display=swap" rel="stylesheet">
<style>
<!DOCTYPE html>
<html lang="en" data-theme="dark">
<head>
<meta charset="UTF-8">
<meta name="viewport" content="width=device-width, initial-scale=1.0">
<title>Staged-Polymorphic Omega System — Training Report</title>
<link rel="stylesheet" href="https://cdn.jsdelivr.net/npm/@picocss/pico@2/css/pico.min.css">
<style>
:root {
--pico-font-size: 16px;
@belisarius222
belisarius222 / example-paragraphs.md
Last active April 3, 2026 23:23
volta-loop example in paragraph format

volta-loop

main(experiment_name, git_repo)

LABEL: implement

let commit_hash = AWAIT FORK implement-experiment experiment_name fail_path?.

let setup_ok = AWAIT FORK setup-experiment commit_hash.

@belisarius222
belisarius222 / void-cron-prompt.html
Last active March 26, 2026 18:19
VOID Maintenance Cron Prompt #pagedrop
<!DOCTYPE html>
<html><head><meta charset="utf-8"><title>VOID Maintenance Cron Prompt</title>
<style>
body { font-family: system-ui, sans-serif; max-width: 800px; margin: 40px auto; padding: 0 20px; background: #0d1117; color: #c9d1d9; }
h1 { color: #58a6ff; border-bottom: 1px solid #30363d; padding-bottom: 8px; }
h2 { color: #79c0ff; margin-top: 24px; }
pre { background: #161b22; border: 1px solid #30363d; border-radius: 6px; padding: 16px; overflow-x: auto; font-size: 13px; line-height: 1.5; }
.key { background: #1f6feb22; border-left: 3px solid #1f6feb; padding: 8px 12px; margin: 8px 0; border-radius: 0 6px 6px 0; }
.warn { background: #da363422; border-left: 3px solid #da3634; padding: 8px 12px; margin: 8px 0; border-radius: 0 6px 6px 0; }
.new { background: #23863622; border-left: 3px solid #238636; padding: 8px 12px; margin: 8px 0; border-radius: 0 6px 6px 0; }
<!DOCTYPE html>
<html><head><meta charset="utf-8"><title>VOID Recursive Forecaster — Results</title>
<style>
body { font-family: system-ui, -apple-system, sans-serif; max-width: 900px; margin: 40px auto; padding: 0 20px; background: #0d1117; color: #c9d1d9; line-height: 1.6; }
h1 { color: #58a6ff; border-bottom: 1px solid #30363d; padding-bottom: 12px; }
h2 { color: #79c0ff; margin-top: 32px; }
h3 { color: #d2a8ff; margin-top: 24px; }
table { border-collapse: collapse; width: 100%; margin: 16px 0; }
th, td { border: 1px solid #30363d; padding: 8px 12px; text-align: left; }
th { background: #161b22; color: #79c0ff; }
@belisarius222
belisarius222 / attnres-results.md
Created March 21, 2026 03:45
AttnRes: Attention Over the Residual Stream — Experimental Results (2026-03-20)

AttnRes: Attention Over the Residual Stream

Overview

AttnRes replaces the standard residual connection in transformers with a depth attention mechanism — instead of simply adding each layer's output to a running sum, the model attends over previous layer outputs to decide what information to carry forward.

Standard transformers use x = x + layer(x) at every layer. AttnRes variants replace this with a learned attention operation across the depth axis: "which previous layers' outputs should I attend to when constructing the input to this layer?"

All experiments use a GPT-2-style decoder-only transformer trained on FineWeb-Edu (10B tokens), with RoPE, SwiGLU, and RMSNorm.