😀

Manuel manuel-delverme

😀

CEO@SilverStream: Heroku for browser based AI agents | PhD Student @ Mila/McGill | You likely interacted with my AI agents in the past.

manuel-delverme / test_rnn.py

Created March 12, 2019 21:19

	import torch
	import tensorboardX
	import torch.nn as nn
	import torch.nn.functional as F
	import torch.optim as optim
	from torch.autograd import Variable
	import torch.utils.data

	writer = tensorboardX.SummaryWriter()

manuel-delverme / zero_training.py

Created April 22, 2018 09:04

zero_training

	def train_network(samples, neural_network, nr_epochs=10, batch_size=64):
	optimizer = optim.Adam(neural_network.parameters())
	neural_network.train()

	for epoch_nr in range(nr_epochs):
	sample_ids = np.random.shuffle(range(len(samples)))

	for start in range(0, len(samples) // batch_size, batch_size):
	mini_batch = samples[sample_ids[start: start + batch_size]]
	boards, pis, vs = zip(*mini_batch)

manuel-delverme / zero.py

Last active June 11, 2018 14:31

zero code


	environment = environments.GoEnvironment(board_size=19)
	player_mcts = mcts.MCTS(
	environment,
	networks.NeuralNetwork(board_size=environment.getStateSize(), action_size=environment.getActionSize()),
	)

	training_samples = collections.deque(maxlen=opt.training_samples_buffer_size)

	for iteration_number in range(opt.num_iters):

manuel-delverme / WTF

Created February 7, 2018 23:58

manuel-delverme / gist:1acef08485e09661a51d59c1c60494ef

Created January 5, 2018 11:59

	/home/awok/Projects/supervised_reward/env_reward/bin/python /home/awok/Projects/supervised_reward/main.py
	(2_w,4mirr1)-aCMA-ES (mu_w=1.5,w_1=80%) in dimension 18 (seed=237288, Fri Jan 5 12:17:27 2018)
	score: 7940.826388888889 options: 8 4 5 10 11 15 16 17 23
	score: 5839.784722222223 options: 10 21 22 23 26 27 28 29 33 34 35
	score: 8771.63888888889 options: 4 4 5 10 11
	score: -1107.7361111111113 options: 14 0 1 2 3 4 6 7 8 9 10 13 14 15 16
	[7940.826388888889, 5839.784722222223, 8771.63888888889, -1107.7361111111113]
	best [-0.21687281 -0.29423262 0.10809115 -0.3457722 -0.18912326 0.17178892 0.14703262 0.94997003 -0.18883859 -0.82346577 0.50633336 -0.17325047 0.37087813 0.63369408 0.07967291 -0.47341161 -0.68896583 -0.4226999 ] fitness -8771.63888888889
	score: 11841.0625 options: 3 29 34 35
	score: 11929.47222222222 options: 4 28 29 34 35

manuel-delverme / gist:8d53afe46b222ac913b023e5155df650

Created August 27, 2017 10:13

transpose vs T vs reshape

	import timeit
	setup = 'import numpy as np; a=np.random.randn(10)'
	reshape = timeit.Timer('a.reshape(-1, 10)', setup=setup)
	transp = timeit.Timer('a.transpose()', setup=setup)
	T = timeit.Timer('a.T', setup=setup)
	print("reshape", reshape.timeit(number=int(1e6)))
	print("transp", transp.timeit(number=int(1e6)))
	print("T", T.timeit(number=int(1e6)))

	print("reshape", reshape.timeit(number=int(1e6)))

manuel-delverme / training.tsv

Last active May 11, 2017 13:44

manuel-delverme / printout.csv

Last active May 11, 2017 09:33

hearlstone0.1 output

manuel-delverme / last!

Created May 7, 2017 15:57

	/usr/bin/python3.5 /home/awok/Documents/sapienza/s1/nlp/hw2/src/homework2.py model ../ ../resources
	Using Theano backend.
	WARNING (theano.sandbox.cuda): The cuda backend is deprecated and will be removed in the next release (v0.10). Please switch to the gpuarray backend. You can get more information about how to switch at this URL:
	https://github.com/Theano/Theano/wiki/Converting-to-the-new-gpu-back-end%28gpuarray%29

	ERROR (theano.sandbox.cuda): nvcc compiler not found on $PATH. Check your nvcc installation and try again.
	model_output_path model
	homework_dir: ../

	model output: model

manuel-delverme / with replacement

Created May 7, 2017 14:29

	/usr/bin/python3.5 /home/awok/Documents/sapienza/s1/nlp/hw2/src/homework2.py model ../ ../resources
	Using Theano backend.
	model_output_path model
	homework_dir: ../

	model output: model
	homework dir: ../
	src dir: ../src/
	data dir: ../data/
	WARNING (theano.sandbox.cuda): The cuda backend is deprecated and will be removed in the next release (v0.10). Please switch to the gpuarray backend. You can get more information about how to switch at this URL: