Bohumír Zámečník (bzamecnik)

@bzamecnik
bzamecnik / tiny_keras_mnist_mlp.py
Created October 17, 2017 21:20
Tiny MLP on MNIST with Keras - stripped down from Keras examples, functional model API
from keras.datasets import mnist
from keras.models import Model
from keras.layers import Dense, Input
from keras.utils import to_categorical
num_classes = 10
(x_train, y_train), (x_test, y_test) = mnist.load_data()
x_train = x_train.reshape(60000, 784).astype('float32') / 255
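The preview above is truncated; a minimal sketch of how the rest of the script might continue using the functional API already imported (the layer sizes and training settings here are assumptions, not necessarily the gist's exact values):
x_test = x_test.reshape(10000, 784).astype('float32') / 255
y_train = to_categorical(y_train, num_classes)
y_test = to_categorical(y_test, num_classes)

inputs = Input(shape=(784,))
hidden = Dense(512, activation='relu')(inputs)
outputs = Dense(num_classes, activation='softmax')(hidden)
model = Model(inputs=inputs, outputs=outputs)

model.compile(loss='categorical_crossentropy', optimizer='adam', metrics=['accuracy'])
model.fit(x_train, y_train, batch_size=128, epochs=5,
          validation_data=(x_test, y_test))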
@bzamecnik
bzamecnik / keras_stagingarea.py
Last active October 18, 2017 11:54
Keras + TensorFlow StagingArea double-buffer
# This is a minimal example of using TensorFlow's StagingArea with Keras
# with the goal of implementing double-buffering of input batches on the GPU.
#
# Basically we want to have an input batch ready in GPU memory when batch
# computation starts and copy another batch in parallel. It should avoid
# waiting for host-device memcpy and allow better saturation of the GPU
# compute. StagingArea is a queue implementation that can have its buffer
# stored in GPU memory.
#
# https://www.tensorflow.org/api_docs/python/tf/contrib/staging/StagingArea
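A minimal sketch of the basic put()/get() wiring (TF 1.x; the tensor names and shapes are illustrative, not the gist's exact code):
import numpy as np
import tensorflow as tf
from tensorflow.python.ops.data_flow_ops import StagingArea

x_in = tf.placeholder(tf.float32, shape=(None, 784))
y_in = tf.placeholder(tf.float32, shape=(None, 10))

# The staging buffer sits between the feed and the model; whether it ends up
# in GPU memory depends on where its put/get ops are placed.
area = StagingArea(dtypes=[tf.float32, tf.float32])
put_op = area.put([x_in, y_in])
x_staged, y_staged = area.get()  # the model would be built on top of these

batch = {x_in: np.random.rand(32, 784), y_in: np.random.rand(32, 10)}
with tf.Session() as sess:
    sess.run(put_op, feed_dict=batch)  # stage the first batch
    # consume the staged batch and stage the next one in the same run()
    x_val, _ = sess.run([x_staged, put_op], feed_dict=batch)
    print(x_val.shape)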
@bzamecnik
bzamecnik / keras_stagingarea_pipelining_proof_of_concept.py
Last active October 25, 2017 23:31
A rudimentary proof-of-concept of pipelining with TF StagingArea in Keras.
# Works! In this snippet we're able to get a batch from a StagingArea
# and in parallel put another batch there, which is loaded via
# feed_dict (provided as a tf.placeholder wrapped as a Keras Input).
#
# So far it doesn't handle splitting batches, the boundary batches of
# the pipeline, or switching between the training and validation sets.
#
# But at least it's a proof of concept that it's possible to use
# StagingArea with Keras.
#
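A sketch of the pipelining loop itself, including the pipeline boundaries the note above says are not yet handled: prime the StagingArea with the first batch, overlap each training step with the put() of the next batch, and drain the last staged batch at the end (raw TF here, while the gist does this through Keras; names are illustrative):
import numpy as np
import tensorflow as tf
from tensorflow.python.ops.data_flow_ops import StagingArea

x_in = tf.placeholder(tf.float32, shape=(None, 784))
y_in = tf.placeholder(tf.float32, shape=(None, 10))
area = StagingArea(dtypes=[tf.float32, tf.float32])
put_op = area.put([x_in, y_in])
x_staged, y_staged = area.get()
# stand-in for a training step built on the staged tensors
train_fetch = tf.reduce_mean(x_staged) + tf.reduce_mean(y_staged)

batches = [(np.random.rand(32, 784), np.random.rand(32, 10)) for _ in range(5)]

with tf.Session() as sess:
    x, y = batches[0]
    sess.run(put_op, feed_dict={x_in: x, y_in: y})  # prime the pipeline
    for x, y in batches[1:]:
        # train on the staged batch while staging the next one
        sess.run([train_fetch, put_op], feed_dict={x_in: x, y_in: y})
    sess.run(train_fetch)  # drain the last staged batch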
@bzamecnik
bzamecnik / keras_stagingarea_with_variable_and_callback.py
Last active October 28, 2017 14:22
Proof of concept of using Keras with StagingArea - data is fed separately via a tf.Variable and a Keras Callback
# Is it possible to utilize Keras callbacks to encapsulate the logic? Yes.
#
# We decouple feeding inputs from StagingArea.put() - both can be called in
# a separate Session.run(). Thus there is no need to hack the Keras inputs too much.
# Instead in one run() we assign a numpy array to a Variable (via feed_dict)
# and in another run() we perform StagingArea.put().
#
# We make a callback PrefetchCallback which performs the initial assign and put()
# in its on_epoch_begin() method. Then in each on_batch_begin() it just runs an
# assign. Then get() and put() are run by Keras in the training function.
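A minimal sketch of the decoupling described above (TF 1.x; the names are illustrative): one run() assigns the numpy batch to non-trainable Variables via feed_dict, another run() stages the Variables' values.
import numpy as np
import tensorflow as tf
from tensorflow.python.ops.data_flow_ops import StagingArea

x_feed = tf.placeholder(tf.float32, shape=(None, 784))
y_feed = tf.placeholder(tf.float32, shape=(None, 10))
# non-trainable Variables holding the next batch, excluded from all collections
x_var = tf.Variable(x_feed, trainable=False, collections=[], validate_shape=False)
y_var = tf.Variable(y_feed, trainable=False, collections=[], validate_shape=False)
assign_op = tf.group(x_var.initializer, y_var.initializer)

area = StagingArea(dtypes=[tf.float32, tf.float32])
put_op = area.put([x_var.value(), y_var.value()])
x_staged, y_staged = area.get()  # a Keras model would consume these

with tf.Session() as sess:
    batch = {x_feed: np.random.rand(32, 784), y_feed: np.random.rand(32, 10)}
    sess.run(assign_op, feed_dict=batch)  # run 1: numpy -> Variable
    sess.run(put_op)                      # run 2: Variable -> StagingArea
    print(sess.run(x_staged).shape)       # run 3: consume the staged batch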
@bzamecnik
bzamecnik / staging_area_pipelining_with_tf_tuple.py
Last active April 20, 2018 12:22
Example of pipelining with StagingArea using tf.tuple()
# Example of pipelining with StagingArea using tf.tuple().
#
# In particular it shows a trick for grouping the get() and put()
# operations using tf.tuple() such that we have a single operation that
# works like a semicolon operator: first it performs put(), then get(),
# and returns the tensor output of get().
#
# We could possibly use that compound operation as a Keras model input
# so that we don't need to modify K.function() to pass additional
# fetches to Session.run() explicitly. However, it is not certain whether this
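A minimal sketch of the tf.tuple() trick, assuming put() is passed as a control input so that fetching the gated get() outputs also triggers put() in the same Session.run():
import numpy as np
import tensorflow as tf
from tensorflow.python.ops.data_flow_ops import StagingArea

x_in = tf.placeholder(tf.float32, shape=(None, 784))
y_in = tf.placeholder(tf.float32, shape=(None, 10))
area = StagingArea(dtypes=[tf.float32, tf.float32])
put_op = area.put([x_in, y_in])
# tf.tuple() returns the same tensors gated on all of its inputs, so fetching
# x_staged/y_staged forces put_op to run as well - one compound "put; get" op.
x_staged, y_staged = tf.tuple(area.get(), control_inputs=[put_op])

batch = {x_in: np.random.rand(32, 784), y_in: np.random.rand(32, 10)}
with tf.Session() as sess:
    sess.run(put_op, feed_dict=batch)            # prime the buffer
    x_val = sess.run(x_staged, feed_dict=batch)  # get one batch, put the next
    print(x_val.shape)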
@bzamecnik
bzamecnik / staging_area_variable_batch_size.py
Created October 28, 2017 13:38
TF StagingArea with variable batch size
# Does StagingArea support batches of variable size? Yes.
#
# The training or validation set might not be exactly divisible by the batch
# size. Thus at the end one batch might be smaller. We can either ignore it
# (which is incorrect with respect to the loss) or provide batches of variable size.
# On the other hand we'd like to ensure the data points have the same shape.
#
# It turns out
import numpy as np
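A minimal sketch of staging batches of varying size; leaving the batch dimension unspecified in the shapes is an assumption about how the gist achieves this:
import numpy as np
import tensorflow as tf
from tensorflow.python.ops.data_flow_ops import StagingArea

x_in = tf.placeholder(tf.float32, shape=(None, 3))
y_in = tf.placeholder(tf.float32, shape=(None, 1))
# fix every dimension except the batch size
area = StagingArea(dtypes=[tf.float32, tf.float32],
                   shapes=[(None, 3), (None, 1)])
put_op = area.put([x_in, y_in])
x_staged, y_staged = area.get()
staged_shape = tf.shape(x_staged)

with tf.Session() as sess:
    for size in [4, 4, 2]:  # the last batch is smaller
        sess.run(put_op, feed_dict={x_in: np.ones((size, 3)),
                                    y_in: np.ones((size, 1))})
        print(sess.run(staged_shape))  # [4 3], [4 3], [2 3]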
@bzamecnik
bzamecnik / tf_variable_with_dynamic_shape.py
Created October 28, 2017 14:19
TensorFlow tf.Variable with dynamic shape
# - prevent the variable from being used as a model parameter: trainable=False, collections=[]
# - allow dynamic variable shape (for the last batch): validate_shape=False
import numpy as np
import tensorflow as tf

placeholder = tf.placeholder(dtype=tf.float32, shape=(None, 10))
variable = tf.Variable(placeholder, trainable=False, collections=[], validate_shape=False)
with tf.Session() as sess:
    sess.run(variable.initializer, feed_dict={placeholder: np.arange(20).reshape(2, 10)})
    print('shape:', sess.run(tf.shape(variable)))
    print(sess.run(variable))
@bzamecnik
bzamecnik / keras_stagingarea_with_variable_and_callback_variable_batch_size.py
Last active October 28, 2017 15:01
Proof of concept of using Keras with StagingArea - data is fed separately via a tf.Variable and a Keras Callback
# Is it possible to utilize Keras callbacks to encapsulate the logic? Yes.
#
# We decouple feeding inputs from StagingArea.put() - both can be called in
# a separate Session.run(). Thus there is no need to hack the Keras inputs too much.
# Instead in one run() we assign a numpy array to a Variable (via feed_dict)
# and in another run() we perform StagingArea.put().
#
# We make a callback PrefetchCallback which performs the initial assign and put()
# in its on_epoch_begin() method. Then in each on_batch_begin() it just runs an
# assign. Then get() and put() are run by Keras in the training function.
import numpy as np
import tensorflow as tf
from tensorflow.python.ops.data_flow_ops import StagingArea
dataset_range = tf.contrib.data.Dataset.range(10)
iterator = dataset_range.make_one_shot_iterator()
next_item = iterator.get_next()
area = StagingArea(dtypes=[tf.int64])
area_put = area.put([next_item])
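A sketch of what the PrefetchCallback described above might look like; the constructor arguments and the session wiring are assumptions made for illustration, not the gist's exact code:
import keras.backend as K
from keras.callbacks import Callback

class PrefetchCallback(Callback):
    """Keeps the StagingArea one batch ahead of the Keras training function.

    assign_op: assigns a numpy batch (fed through `feeds`) to host-side Variables
    put_op:    StagingArea.put() reading from those Variables
    """
    def __init__(self, assign_op, put_op, feeds, batch_generator):
        super(PrefetchCallback, self).__init__()
        self.assign_op = assign_op
        self.put_op = put_op
        self.feeds = feeds                      # placeholders for the numpy batch
        self.batch_generator = batch_generator

    def _assign_next_batch(self, sess):
        x, y = next(self.batch_generator)
        sess.run(self.assign_op, feed_dict=dict(zip(self.feeds, (x, y))))

    def on_epoch_begin(self, epoch, logs=None):
        sess = K.get_session()
        # fill the pipeline: assign the first batch and stage it
        self._assign_next_batch(sess)
        sess.run(self.put_op)

    def on_batch_begin(self, batch, logs=None):
        # only assign here; get() and put() run inside Keras' training function
        self._assign_next_batch(K.get_session())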
# It works!
#
# GTX 980 Ti
# plain model: ~14370 images/sec
# prefetch model: ~14670 images/sec
#
# In nvprof we can see that the HtoD memcpy is really async!
# What remains is just the synchronous feed_dict that moves data from numpy to a CPU-side Variable.
import math