Andrews Cordolino Sobral andrewssobral

List of terminal-based developer tools that deliver value in 30 seconds

miniogre: from source code to reproducible environments, in seconds
- pip install miniogre
- Go to project folder
- Run miniogre run
freeze: generate images of code and terminal output
- brew install charmbracelet/tap/freeze
- freeze --execute "ls -ltra"
uv: An extremely fast Python package installer and resolver, written in Rust

X11 forwarding on macOS and docker

A quick guide on how to set up X11 forwarding on macOS when using Docker containers that require a DISPLAY. Works on both Intel and M1 Macs!

This guide was tested on:

macOS Catalina 10.15.4
Docker Desktop 2.2.0.5 (43884) - stable release
XQuartz 2.7.11 (xorg-server 1.18.4)
MacBook Pro (Intel)

How to Crack Sublime Text 3.2.2 Build 3211 with Hex Editor (Windows | Without License) ↓

Download & Install Sublime Text 3.2.2 Build 3211
Visit https://hexed.it/
Open file select sublime_text.exe
Offset 0x8545: Original 84 -> 85
Offset 0x08FF19: Original 75 -> EB
Offset 0x1932C7: Original 75 -> 74 (remove UNREGISTERED in title bar, so no need to use a license)

High-Performance Matrix Multiplication

This is a short post that explains how to write a high-performance matrix multiplication program on modern processors. In this tutorial I will use a single core of the Skylake-client CPU with AVX2, but the principles in this post also apply to other processors with different instruction sets (such as AVX512).
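As a rough illustration of one principle commonly used in fast matrix multiplication, the sketch below compares a plain triple loop with a tiled (blocked) version that works on small sub-blocks so the operands stay in cache. It is a minimal pure-Python sketch, not the post's AVX2 kernel: the function names and the `tile` parameter are assumptions made for illustration, and the loop structure only shows the access pattern, not the performance.

```python
def matmul_naive(a, b):
    """Reference triple loop: c[i][j] = sum over p of a[i][p] * b[p][j]."""
    m, k, n = len(a), len(b), len(b[0])
    c = [[0.0] * n for _ in range(m)]
    for i in range(m):
        for j in range(n):
            for p in range(k):
                c[i][j] += a[i][p] * b[p][j]
    return c


def matmul_tiled(a, b, tile=2):
    """Same product, computed tile by tile (tile size is a hypothetical knob)."""
    m, k, n = len(a), len(b), len(b[0])
    c = [[0.0] * n for _ in range(m)]
    # Outer loops pick a tile of the output and a slice of the shared dimension.
    for i0 in range(0, m, tile):
        for j0 in range(0, n, tile):
            for p0 in range(0, k, tile):
                # Inner loops touch only the current tiles of a, b and c.
                for i in range(i0, min(i0 + tile, m)):
                    for j in range(j0, min(j0 + tile, n)):
                        acc = c[i][j]
                        for p in range(p0, min(p0 + tile, k)):
                            acc += a[i][p] * b[p][j]
                        c[i][j] = acc
    return c


a = [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0]]
b = [[7.0, 8.0], [9.0, 10.0], [11.0, 12.0]]
print(matmul_tiled(a, b) == matmul_naive(a, b))  # True
```

In a compiled implementation the innermost tile would typically be sized to fit the vector registers and the outer tiles sized to fit the caches; those sizes depend on the processor and are not specified here.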

Intro

Matrix multiplication is a mathematical operation that defines the product of

	# Requires:
	# pip install pyobjc-framework-Metal
	import numpy as np
	import Metal

	# Get the default GPU device
	device = Metal.MTLCreateSystemDefaultDevice()

	# Make a command queue to encode command buffers to
	command_queue = device.newCommandQueue()

	######################### Preamble ###########################################
	SHELL := bash
	.ONESHELL:
	.SHELLFLAGS := -eu -o pipefail -c
	.DELETE_ON_ERROR:
	.SECONDEXPANSION:
	.EXTRA_PREREQS := $(MAKEFILE_LIST)
	MAKEFLAGS += --warn-undefined-variables
	MAKEFLAGS += --no-builtin-rules
	MAKEFLAGS += -j$(shell nproc)

	# This is a modified version of TRL's `SFTTrainer` example (https://github.com/huggingface/trl/blob/main/examples/scripts/sft_trainer.py),
	# adapted to run with DeepSpeed ZeRO-3 and Mistral-7B-V1.0. The settings below were run on 1 node of 8 x A100 (80GB) GPUs.
	#
	# Usage:
	# - Install the latest transformers & accelerate versions: `pip install -U transformers accelerate`
	# - Install deepspeed: `pip install deepspeed==0.9.5`
	# - Install TRL from main: pip install git+https://github.com/huggingface/trl.git
	# - Clone the repo: git clone https://github.com/huggingface/trl.git
	# - Copy this Gist into trl/examples/scripts
	# - Run from root of trl repo with: accelerate launch --config_file=examples/accelerate_configs/deepspeed_zero3.yaml --gradient_accumulation_steps 8 examples/scripts/sft_trainer.py

	On an Orin NX 16G the memory was too low to compile and the SWAP file had to be increased.

	In /etc/systemd/nvzramconfig.sh, change:

	```
	# Calculate memory to use for zram (1/2 of ram)
	totalmem=`LC_ALL=C free | grep -e "^Mem:" | sed -e 's/^Mem: //' -e 's/ .*//'`
	mem=$((("${totalmem}" / 2 / "${NRDEVICES}") * 1024))
	```
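The arithmetic in the snippet above can be checked with a small Python sketch. It assumes `totalmem` is the total memory in KiB (the default unit printed by `free`) and that `NRDEVICES` is the number of zram devices; the function name, the `divisor` parameter, and the sample numbers are hypothetical and only illustrate how the per-device size is derived.

```python
def zram_bytes_per_device(total_mem_kib, nr_devices, divisor=2):
    """Mirror the shell expression ((totalmem / divisor / NRDEVICES) * 1024).

    Shell arithmetic is integer-only, so floor division is used here too.
    The divisor of 2 corresponds to the "1/2 of ram" comment in the script.
    """
    return (total_mem_kib // divisor // nr_devices) * 1024


# Hypothetical board: 16,000,000 KiB of RAM split across 8 zram devices.
print(zram_bytes_per_device(16_000_000, 8))  # 1024000000
```

A smaller divisor gives each device a larger share of RAM, which is the kind of adjustment the note above refers to.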

	# Authors: Mathieu Blondel, Vlad Niculae
	# License: BSD 3 clause

	import numpy as np


	def _gen_pairs(gen, max_iter, max_inner, random_state, verbose):
	    rng = np.random.RandomState(random_state)

	    # if tuple, interpret as randn