Jing Zhang vinjn

@madebyollin
madebyollin / automatic_profiling_markers.py
Created February 27, 2024 02:57
Add human-readable profiling markers to a pytorch module
def add_profiling_markers(model):
    """Monkey-patch profiling markers into an nn.Module.

    Args:
        model: an nn.Module
    Effect:
        all model.named_modules() forward calls get wrapped in their
        own profiling scope, making traces easier to understand.
    """
@mcarilli
mcarilli / nsight.sh
Last active November 12, 2025 19:13
Favorite nsight systems profiling commands for Pytorch scripts
# This isn't meant to be run as a bash script; I named it with ".sh" for syntax highlighting.
# https://developer.nvidia.com/nsight-systems
# https://docs.nvidia.com/nsight-systems/profiling/index.html
# My preferred nsys (command line executable used to create profiles) commands
#
# In your script, write
# torch.cuda.nvtx.range_push("region name")
# ...
@banyudu
banyudu / free-docker-space-in-wsl2.blog.md
Last active December 15, 2025 10:10
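Freeing Docker disk space in WSL2

Freeing Docker disk space in WSL2

After Docker has been in use for a while, many intermediate images are left behind and take up disk space.

Cleaning up Docker images is normally quite simple; a single command does it: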

@gavinb
gavinb / cuda_device_link_error.md
Last active June 30, 2025 06:06
CUDA link error with CMake
@TengdaHan
TengdaHan / ddp_notes.md
Last active December 4, 2025 16:26
Multi-node-training on slurm with PyTorch

Multi-node-training on slurm with PyTorch

What's this?

  • A short note on how to start multi-node training on the Slurm scheduler with PyTorch.
  • Especially useful when the scheduler is so busy that you cannot get multiple GPUs allocated, or when you need more than 4 GPUs for a single job.
  • Requirement: you must use PyTorch DistributedDataParallel (DDP) for this purpose.
  • Warning: you might need to refactor your own code.
  • Warning: you might be secretly condemned by your colleagues for using too many GPUs.
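The note's own setup is not shown in this preview. As a minimal sketch of one common arrangement, assuming the job is launched with `srun` using one task per GPU, each process can derive its DDP rank information from the environment variables Slurm sets for every task. The helper name `slurm_ddp_env` is hypothetical.

```python
import os


def slurm_ddp_env(environ=os.environ):
    """Read per-task rank information from Slurm's environment (illustrative).

    Assumes one Slurm task per GPU, started via srun.
    """
    return {
        # Global index of this task across all nodes.
        "rank": int(environ["SLURM_PROCID"]),
        # Total number of tasks in the job step.
        "world_size": int(environ["SLURM_NTASKS"]),
        # Index of this task on its own node, usable as the GPU index.
        "local_rank": int(environ["SLURM_LOCALID"]),
    }
```

These values would then typically be passed to `torch.distributed.init_process_group` as `rank` and `world_size`, with `local_rank` selecting the CUDA device; the rendezvous address and port still have to be provided separately.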
@schmmd
schmmd / slurm-gpu-stats.py
Created November 20, 2019 19:08
Slurm GPU Stats
#!/usr/bin/env python
import os
import re
import subprocess
import socket
import sys
def pids_of_jid(jid):
    result = subprocess.run(["sstat", "-p", "--format=PID", "-j", jid, "--noheader"], stdout=subprocess.PIPE)
@codewings
codewings / __init__.py
Last active August 16, 2023 07:18
Intel GPA plugin used for asset dumping
import json
import os
import plugin_api
#
def IsUsefulInput(viewtype):
    return viewtype == "CBV" or viewtype == "IBV" or viewtype == "VBV" or viewtype == "SRV"
def DumpBufferByDrawcall(folder, call, res, id, texture):
    if not os.path.exists(folder):
commit 6fac225de0575b18550f9421196f90a2279c44af
Author: Tiago Koji Castro Shibata <[email protected]>
Date: Fri Dec 28 23:13:14 2018 -0200
Fix compilation with latest OpenCV
C APIs were removed and must be replaced with C++ calls
diff --git a/Makefile b/Makefile
index 63e15e6..c148d4b 100644