@sanketsudake
Last active May 29, 2018 16:28
# The training job contains 1 TensorFlow master, 1 TensorFlow worker, and 2 parameter servers.
# Each replica spec sets the replica type and replica-specific details such as the container image.
apiVersion: "kubeflow.org/v1alpha1"
kind: "TFJob"
metadata:
  name: "example-job"
spec:
  replicaSpecs:
    - replicas: 1
      tfReplicaType: MASTER
      template:
        spec:
          containers:
            - image: gcr.io/tf-on-k8s-dogfood/tf_sample:dc944ff
              name: tensorflow
          restartPolicy: OnFailure
    - replicas: 1
      tfReplicaType: WORKER
      template:
        spec:
          containers:
            - image: gcr.io/tf-on-k8s-dogfood/tf_sample:dc944ff
              name: tensorflow
          restartPolicy: OnFailure
    - replicas: 2
      tfReplicaType: PS