Andrei Bârsan AndreiBarsan

PhD student @ University of Toronto Senior Scientist @ waabi.ai

karpathy / pg-pong.py

Created May 30, 2016 22:50

Training a Neural Network ATARI Pong agent with Policy Gradients from raw pixels

	""" Trains an agent with (stochastic) Policy Gradients on Pong. Uses OpenAI Gym. """
	import numpy as np
	import cPickle as pickle
	import gym

	# hyperparameters
	H = 200 # number of hidden layer neurons
	batch_size = 10 # every how many episodes to do a param update?
	learning_rate = 1e-4
	gamma = 0.99 # discount factor for reward

gocarlos / Eigen Cheat sheet

Last active November 9, 2024 13:28

Cheat sheet for the linear algebra library Eigen: http://eigen.tuxfamily.org/

	// A simple quickref for Eigen. Add anything that's missing.
	// Main author: Keir Mierle

	#include <Eigen/Dense>

	Matrix<double, 3, 3> A; // Fixed rows and cols. Same as Matrix3d.
	Matrix<double, 3, Dynamic> B; // Fixed rows, dynamic cols.
	Matrix<double, Dynamic, Dynamic> C; // Full dynamic. Same as MatrixXd.
	Matrix<double, 3, 3, RowMajor> E; // Row major; default is column-major.
	Matrix3f P, Q, R; // 3x3 float matrix.