Skip to content

Instantly share code, notes, and snippets.

View mohamed's full-sized avatar

Mohamed A. Bamakhrama mohamed

View GitHub Profile
#!/usr/bin/env bash
# journal.sh
# ==========
#
# One daily text file to rule them all.
#
# Copyright: 2022 Tyler Cipriani <[email protected]
# License: GPLv3
set -euo pipefail
@mohamed
mohamed / Matrix.md
Created May 29, 2018 09:25 — forked from nadavrot/Matrix.md
Efficient matrix multiplication

High-Performance Matrix Multiplication

This is a short post that explains how to write a high-performance matrix multiplication program on modern processors. In this tutorial I will use a single core of the Skylake-client CPU with AVX2, but the principles in this post also apply to other processors with different instruction sets (such as AVX512).

Intro

Matrix multiplication is a mathematical operation that defines the product of