Skip to content

Instantly share code, notes, and snippets.

@Mistobaan
Mistobaan / README.md
Created March 21, 2026 01:07
Vibe Coded Script to check the length of tokenized documents

OpenAI golf parameter

This is a vibe coded script by codex to check the average document length after tokenization on the pre-tokenized dataset of parameter golf.

Full train-split summary from the current cached dataset

documents: 6,292,940
tokens_accounted_for: 8,000,000,000
@Mistobaan
Mistobaan / sharp-training.ipynb
Created February 4, 2026 04:35
sharp-training.ipynb
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
@Mistobaan
Mistobaan / sharp-training.ipynb
Created February 4, 2026 04:23
sharp-training.ipynb
Sorry, this is too big to display.
@Mistobaan
Mistobaan / sharp-training.ipynb
Created February 4, 2026 03:10
sharp-training.ipynb
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
uv tool run opik harbor trials start -p ./examples/tasks/k8s-hello-world/ \
        --agent "terminus-2" \
        --agent-kwarg "api_base=$OPENAI_API_BASE" \
  --model "hosted_vllm/$MODEL" \
  --environment-type "docker"

OPIK: Opik tracking enabled for Harbor
Starting trial: k8s-hello-world__wv9bPao
Task: k8s-hello-world
# # Serverless OpenThinker Agent v1 with vLLM and Modal
# This example shows how to serve Hugging Face's
# [open-thoughts/OpenThinker-Agent-v1](https://huggingface.co/open-thoughts/OpenThinker-Agent-v1)
# with vLLM on Modal.
# OpenThinker-Agent-v1 is an agentic model post-trained from Qwen/Qwen3-8B via supervised fine-tuning and reinforcement learning,
# and it excels on agent benchmarks such as Terminal-Bench 2.0 and SWE-Bench.
# We include the same Modal best practices as our other inference demos:
# - GPU snapshots for faster cold starts
@Mistobaan
Mistobaan / setup_8xH100.sh
Created September 22, 2025 02:23
Setup 8xH100 on lambdalabs
#!/bin/bash
# Check if the argument is provided
if [ -z "$1" ]; then
echo "Usage: $0 <path_storage>"
exit 1
fi
PATH_STORAGE=$1
<!DOCTYPE html>
<html xmlns="http://www.w3.org/1999/xhtml" lang xml:lang>
<head>
<meta charset="utf-8" />
<meta name="generator" content="nbh" />
<meta name="viewport" content="width=device-width, initial-scale=1.0, user-scalable=yes" />
<title>Notebook HTML Renderer</title>
<!-- KaTeX -->
<script src="https://cdnjs.cloudflare.com/ajax/libs/KaTeX/0.16.9/katex.min.js"
@Mistobaan
Mistobaan / setup_gb200.sh
Last active July 16, 2025 07:53
Lambda Labs setup for Arm64
# install node
# Download and install nvm:
curl -o- https://raw.githubusercontent.com/nvm-sh/nvm/v0.40.3/install.sh | bash
# in lieu of restarting the shell
\. "$HOME/.nvm/nvm.sh"
# Download and install Node.js:
nvm install 22
@Mistobaan
Mistobaan / finance_model.py
Last active January 11, 2025 21:14
Financial Model to train a small Math reasoning model
import math
def compute_cost(num_nodes, gpus_per_node, cost_per_gpu_hour, days):
"""
Computes the cost of running a certain setup (num_nodes, gpus_per_node)
for a specified number of days at a particular cost_per_gpu_hour.
:param num_nodes: Number of nodes
:param gpus_per_node: Number of GPUs in each node
:param cost_per_gpu_hour: Cost per GPU-hour in USD