Skip to content

Instantly share code, notes, and snippets.

View DaisukeMiyamoto's full-sized avatar

Daisuke Miyamoto DaisukeMiyamoto

  • Tokyo, Japan
View GitHub Profile
@DaisukeMiyamoto
DaisukeMiyamoto / Userdata.txt
Created June 24, 2020 22:58
Amazon EC2 Userdata for initializing instance store at launch
<powershell>
C:\ProgramData\Amazon\EC2-Windows\Launch\Scripts\InitializeDisks.ps1
</powershell>

Tutorial: AWS Batch setup for Neuron

Introduction

AWS Batch is a fully managed batch job scheduler service on AWS. It could easily manage large scale job queueing and execution. This tutorial shows a way to use Inferentia within a job on AWS Batch.

Steps Overview:

@DaisukeMiyamoto
DaisukeMiyamoto / Dockerfile.neuron-container
Last active February 17, 2023 00:34
AWS Inferentia example for AWS Batch with NEURON SDK
# Example neuron-container dockerfile. by midaisuk
# To build:
# docker build -t neuron-container .
# Prepare application:
# before launch the docker image, you need to prepare some files based on the document and upload to S3 bucket.
# https://github.com/aws/aws-neuron-sdk/blob/master/docs/tensorflow-neuron/tutorial-compile-infer.md
# resnet50_neuron.zip
# infer_resnet50.py

numa settings of AWS instances

c5.18xlarge

$ numactl -H
available: 2 nodes (0-1)
node 0 cpus: 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53
node 0 size: 70341 MB
@DaisukeMiyamoto
DaisukeMiyamoto / run_lammps_on_aws_parallelcluster.md
Last active April 19, 2020 11:32
Run LAMMPS on AWS ParallelCluster

LAMMPS on AWS ParallelCluster

set up parallelcluster

set up parallelcluster with following conditions

  • OS: CentOS
  • Scheduler: SGE
@DaisukeMiyamoto
DaisukeMiyamoto / parallelcluster_efa_lustre.config
Last active April 6, 2020 03:26
config file for AWS ParallelCluster for EFA and FSx for Lustre setting
[aws]
aws_region_name = us-east-1
[global]
update_check = true
sanity_check = true
cluster_template = benchmark_efa
[aliases]
ssh = ssh {CFN_USER}@{MASTER_IP} {ARGS} -i ~/.ssh/<KEYNAME> -o StrictHostKeyChecking=no
@DaisukeMiyamoto
DaisukeMiyamoto / run_osu_on_aws_parallelcluster.md
Last active April 5, 2020 23:39
Run OSU Benchmark on AWS ParallelCluster

Benchmark OSU

prepare OSU micro benchmarks

wget http://mvapich.cse.ohio-state.edu/download/mvapich/osu-micro-benchmarks-5.6.2.tar.gz
tar xvf osu-micro-benchmarks-5.6.2.tar.gz
@DaisukeMiyamoto
DaisukeMiyamoto / create_compute_environment_and_job_queue.py
Last active March 20, 2022 17:04
Create compute environment and job queue for AWS Batch with Boto3 (Polling and Waiter implementations)
####################################################################
# Create compute environment and job queue for AWS Batch with Boto3
# Daisuke Miyamoto
# 20200330
#
import boto3
import botocore.waiter
from botocore.exceptions import WaiterError
import time
@DaisukeMiyamoto
DaisukeMiyamoto / efs-dashboard-without-size.template
Last active March 24, 2020 17:18
CloudFormation template for creating Amazon EFS Dashboard based on https://github.com/aws-samples/amazon-efs-tutorial/tree/master/monitoringto exclude IAM permission.
---
AWSTemplateFormatVersion: 2010-09-09
Description: Creates an Amazon CloudWatch dashboard with Metric Math expressions to monitor an Amazon EFS file system. Customized for exclude IAM permissions.
Metadata:
Authors:
Description: Darryl Osborne ([email protected]) / Modified by midaisuk@
License:
---
AWSTemplateFormatVersion: 2010-09-09
Description: Creates an Amazon CloudWatch dashboard with Metric Math expressions to monitor an Amazon EFS file system. It also includes a file system size custom metric to display the metered size of the file system.
Metadata:
Authors:
Description: Darryl Osborne ([email protected])
License: