wassname (Michael J Clark)
wassname / SequentialRandomSampler.py
Last active July 24, 2018 01:55
torch SequentialRandomSampler
from torch.utils.data.sampler import Sampler
import itertools

class SequentialRandomSampler(Sampler):
    """Samples elements sequentially, starting from a random location.

    For when you want to sequentially sample a random subset.

    Usage:
        loader = torch.utils.data.DataLoader(
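The preview cuts off inside the docstring. A rough sketch of what such a sampler could look like (the `__iter__` and `__len__` below are my own illustration, not the gist's code):

import torch
from torch.utils.data.sampler import Sampler

class SequentialRandomSamplerSketch(Sampler):
    """Illustrative only: start at a random offset, then yield the remaining
    indices in order, wrapping around the end of the dataset."""
    def __init__(self, data_source):
        self.data_source = data_source

    def __iter__(self):
        n = len(self.data_source)
        start = torch.randint(0, n, (1,)).item()
        return iter([(start + i) % n for i in range(n)])

    def __len__(self):
        return len(self.data_source)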
wassname / torch_isfinite.py
Last active August 22, 2018 05:53
pytorch isfinite: like numpy.isfinite but for torch tensors
def isfinite(x):
    """
    Quick pytorch test that there are no NaNs or infs.
    note: torch now has torch.isnan
    url: https://gist.github.com/wassname/df8bc03e60f81ff081e1895aabe1f519
    """
    not_inf = ((x + 1) != x)  # inf + 1 == inf, so this flags infs
    not_nan = (x == x)        # nan != nan, so this flags NaNs
    return not_inf & not_nan
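A quick usage check (my example, not from the gist):

import torch

x = torch.tensor([1.0, float("inf"), float("nan"), -2.0])
print(isfinite(x))  # tensor([True, False, False, True])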
wassname / mdn-rnn-logloss.py
Last active December 26, 2018 14:30
mdn-rnn in log space
# code for question on reddit https://www.reddit.com/r/MachineLearning/comments/8poc3z/r_blog_post_on_world_models_for_sonic/e0cwb5v/
# from this
def forward(self, x):
    self.lstm.flatten_parameters()
    x = F.relu(self.fc1(x))
    z, self.hidden = self.lstm(x, self.hidden)
    sequence = x.size()[1]
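The preview stops before the mixture-density head. The gist's point, an MDN loss computed in log space, usually comes down to a logsumexp over the mixture components; a minimal sketch (names and shapes are my assumptions, not the gist's code):

import torch
import torch.nn.functional as F

def mdn_nll(pi_logits, mu, log_sigma, y):
    """Negative log-likelihood of a Gaussian mixture, computed in log space
    with logsumexp for numerical stability (illustrative only)."""
    # pi_logits, mu, log_sigma: (batch, n_mixtures); y: (batch, 1)
    log_pi = F.log_softmax(pi_logits, dim=-1)
    log_prob = torch.distributions.Normal(mu, log_sigma.exp()).log_prob(y)
    return -torch.logsumexp(log_pi + log_prob, dim=-1).mean()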
wassname / sequence_in_chunk_sampler.py
Last active July 1, 2022 18:05
Pytorch random sampler for bigger-than-memory arrays (dask, zarr, xarray, etc.) that gives you randomness while keeping the speed benefits of sequential reads. It chooses a random location, then takes an ordered batch, e.g. [[1,2,3],[9,10,11],[4,5,6]]. This way you get the speed of a sequential read.
"""
Pytorch sampler that samples ordered indices from unordered sequences.
Good for use with dask and RNNs, because:
1. Dask will slow down if sampling between chunks, so we must do one chunk at a time.
2. RNNs need sequences, so we must have sequences e.g. 1,2,3.
3. But RNNs train better with uncorrelated batches, so we want each batch to be a sequence from a different part of a chunk.
For example, given that each chunk is `range(12)` and seq_len is 3, we might end up with these indices:
- [[1,2,3],[9,10,11],[4,5,6]]
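The rest of the gist is not shown; a rough sketch of how such a sampler could be written (chunk handling and names are my assumptions, not the gist's implementation):

import numpy as np
from torch.utils.data.sampler import Sampler

class SequenceInChunkSamplerSketch(Sampler):
    """Illustrative only: within each chunk, pick sequence starts in random
    order, then yield the indices of each sequence in order."""
    def __init__(self, n, chunk_size=12, seq_len=3):
        self.n, self.chunk_size, self.seq_len = n, chunk_size, seq_len

    def __iter__(self):
        for chunk_start in range(0, self.n - self.chunk_size + 1, self.chunk_size):
            # non-overlapping sequence starts within this chunk, visited in random order
            starts = np.arange(0, self.chunk_size - self.seq_len + 1, self.seq_len)
            np.random.shuffle(starts)
            for s in starts:
                yield from range(chunk_start + s, chunk_start + s + self.seq_len)

    def __len__(self):
        chunks = self.n // self.chunk_size
        seqs_per_chunk = self.chunk_size // self.seq_len
        return chunks * seqs_per_chunk * self.seq_len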
wassname / numpy_dataset.py
Created May 8, 2018 07:02
NumpyDataset for pytorch (like tensordataset)
import torch.utils.data

class NumpyDataset(torch.utils.data.Dataset):
    """Dataset wrapping numpy arrays.

    Each sample will be retrieved by indexing the arrays along the first dimension.

    Arguments:
        *arrays (numpy.array): arrays that have the same size in the first dimension.
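The preview ends inside the docstring; the body presumably mirrors TensorDataset. A minimal sketch under that assumption:

import torch.utils.data

class NumpyDatasetSketch(torch.utils.data.Dataset):
    """Illustrative only: like TensorDataset, but holds numpy arrays."""
    def __init__(self, *arrays):
        assert all(len(a) == len(arrays[0]) for a in arrays)
        self.arrays = arrays

    def __getitem__(self, index):
        return tuple(array[index] for array in self.arrays)

    def __len__(self):
        return len(self.arrays[0])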
wassname / tqdm_dask_progressbar.py
Last active June 1, 2020 10:29
TQDMDaskProgressBar: tqdm for dask 1.2.2
from dask.callbacks import Callback
from tqdm.auto import tqdm

class TQDMDaskProgressBar(Callback, object):
    """
    A tqdm progress bar for dask.

    Usage:
    ```
    with TQDMDaskProgressBar():
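An assumed usage example following the docstring's `with` pattern (the computation itself is arbitrary):

import dask.array as da

x = da.random.random((10_000, 10_000), chunks=(1_000, 1_000))
with TQDMDaskProgressBar():
    # any dask computation run through the local scheduler
    result = x.mean().compute()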
wassname / kdtree_scaling_scipy.ipynb
Last active April 30, 2018 03:33
kdtree_scaling_scipy
wassname / jupyter_logging.py
Last active December 9, 2024 21:49
Simple logging for Jupyter or plain Python that outputs to stdout (a console or terminal) and to a log file.
"""
Simple logging to the console in a Jupyter notebook.
"""
import logging
import sys

logging.basicConfig(stream=sys.stdout, level=logging.INFO)

# Test
logger = logging.getLogger('LOGGER_NAME')
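The preview stops here; since the description also mentions a log file, here is a sketch of adding one (the filename and format string are my assumptions):

import logging
import sys

logging.basicConfig(
    level=logging.INFO,
    format="%(asctime)s %(levelname)s %(name)s: %(message)s",
    handlers=[
        logging.StreamHandler(sys.stdout),    # show messages in the notebook / terminal
        logging.FileHandler("notebook.log"),  # and also write them to a file
    ],
)
logger = logging.getLogger("LOGGER_NAME")
logger.info("logging to stdout and notebook.log")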
wassname / AdamStepLR.py
Created February 26, 2018 08:38
Combining PyTorch's Adam and a scheduled learning rate into one optimiser (for when the model doesn't have a callback for the scheduler)
import torch

class AdamStepLR(torch.optim.Adam):
    """Combine Adam and lr_scheduler.StepLR so we can use it as a normal optimiser."""
    def __init__(self, params, lr=0.001, betas=(0.9, 0.999), eps=1e-08, weight_decay=0, step_size=50000, gamma=0.5):
        super().__init__(params, lr, betas, eps, weight_decay)
        self.scheduler = torch.optim.lr_scheduler.StepLR(self, step_size, gamma)

    def step(self):
        # note: recent PyTorch versions expect optimizer.step() before scheduler.step()
        self.scheduler.step()
        return super().step()
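A small usage example (the model and data are placeholders):

import torch
import torch.nn as nn

model = nn.Linear(10, 1)
optimiser = AdamStepLR(model.parameters(), lr=1e-3, step_size=50000, gamma=0.5)

x, y = torch.randn(32, 10), torch.randn(32, 1)
loss = nn.functional.mse_loss(model(x), y)
optimiser.zero_grad()
loss.backward()
optimiser.step()  # advances both Adam and the StepLR schedule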
wassname / pytorch_window_stack.py
Created February 7, 2018 04:32
pytorch: stack a moving window in a timeseries
def window_stack(x, window=4, pad=True):
    """
    Stack along a moving window of a pytorch timeseries.

    Inputs:
        x: tensor of dims (batches/time, channels)
        pad: if true, the left side will be padded so the output length matches the input
    Outputs:
        if pad=True: a tensor of size (batches, channels, window)
        else: a tensor of size (batches-window, channels, window)
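The body is cut off in the preview. One way such a window stack could be implemented (my sketch, not the gist's code):

import torch
import torch.nn.functional as F

def window_stack_sketch(x, window=4, pad=True):
    """Illustrative only: stack a (time, channels) tensor into overlapping windows."""
    if pad:
        # pad the start of the time axis so the output keeps the input length
        x = F.pad(x.t(), (window - 1, 0)).t()
    # one shifted slice per window position, stacked along a new last dimension
    return torch.stack([x[i:len(x) - window + 1 + i] for i in range(window)], dim=-1)

# e.g. a (100, 3) timeseries becomes (100, 3, 4) with pad=True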