Skip to content

Instantly share code, notes, and snippets.

@williamFalcon
Created December 1, 2019 12:14
Show Gist options
  • Save williamFalcon/e75a4ecdbcb8573f3659deaabc8d7944 to your computer and use it in GitHub Desktop.
Save williamFalcon/e75a4ecdbcb8573f3659deaabc8d7944 to your computer and use it in GitHub Desktop.
#!/bin/bash -l
# SLURM SUBMIT SCRIPT
#SBATCH --nodes=4
#SBATCH --gres=gpu:4
#SBATCH --ntasks-per-node=4
#SBATCH --mem=0
#SBATCH --time=0-02:00:00
# activate conda env
source activate $1
# -------------------------
# debugging flags (optional)
export NCCL_DEBUG=INFO
export PYTHONFAULTHANDLER=1
# on your cluster you might need these:
# set the network interface
# export NCCL_SOCKET_IFNAME=^docker0,lo
# might need the latest cuda
# module load NCCL/2.4.7-1-cuda.10.0
# -------------------------
# run script from above
srun python3 mnist_example.py
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment