Allie .S Ubisse AllieUbisse

🎯

Focusing

Data Scientist https://youtu.be/p72XZJQGn2k

6 followers · 21 following

SAND AI
South Africa (Cape Town)
17:27 (UTC -12:00)
allieubisse.co.za
@troinux

View GitHub Profile

Recently created

Least recently created

Recently updated

Least recently updated

AllieUbisse / python batch geocoding.py

Created May 14, 2021 07:22 — forked from shanealynn/python batch geocoding.py

Geocode as many addresses as you'd like with a powerful Python and Google Geocoding API combination

	"""
	Python script for batch geocoding of addresses using the Google Geocoding API.
	This script allows for massive lists of addresses to be geocoded for free by pausing when the
	geocoder hits the free rate limit set by Google (2500 per day). If you have an API key for paid
	geocoding from Google, set it in the API key section.
	Addresses for geocoding can be specified in a list of strings "addresses". In this script, addresses
	come from a csv file with a column "Address". Adjust the code to your own requirements as needed.
	After every 500 successul geocode operations, a temporary file with results is recorded in case of
	script failure / loss of connection later.
	Addresses and data are held in memory, so this script may need to be adjusted to process files line

AllieUbisse / forecasting_metrics.py

Created January 26, 2021 13:44 — forked from bshishov/forecasting_metrics.py

Python Numpy functions for most common forecasting metrics

	import numpy as np

	EPSILON = 1e-10


	def _error(actual: np.ndarray, predicted: np.ndarray):
	""" Simple error """
	return actual - predicted

AllieUbisse / add_policy.py

Created January 25, 2021 13:32 — forked from Kalki5/add_policy.py

Add own policy to a lambda function

	import boto3

	lamba_client = boto3.client('lambda', region_name='REGION_NAME')

	lamba_client.add_permission(
	FunctionName='create_lab',
	StatementId='AWSEventsRule',
	Action='lambda:InvokeFunction',
	Principal='events.amazonaws.com',
	SourceArn='arn:aws:events:REGION_NAME:ACCOUNT_NUMBER:rule/*',

AllieUbisse / forecasting_metrics.py

Created January 25, 2021 13:15 — forked from Kalki5/forecasting_metrics.py

Python Numpy functions for most common forecasting metrics

	import numpy as np

	EPSILON = 1e-10


	def _error(actual: np.ndarray, predicted: np.ndarray):
	""" Simple error """
	return actual - predicted

AllieUbisse / stack-processing.py

Created August 23, 2020 12:18 — forked from snehamehrin/stack-processing.py

	#Import All Functions
	from pyspark.sql import SQLContext
	from pyspark.sql import functions as F
	from pyspark.sql import SparkSession
	from pyspark.sql.functions import unix_timestamp, to_date, date_format, month, year, dayofyear, dayofweek, col
	from pyspark.sql.types import TimestampType
	from pyspark.sql import functions as F
	from pyspark.sql import SparkSession
	from pyspark.sql.functions import unix_timestamp, to_date, date_format, month, year, dayofyear, dayofweek, col
	from pyspark.sql.types import TimestampType

AllieUbisse / pyspark_help.md

Created August 23, 2020 12:13 — forked from hammadzz/pyspark_help.md

PySpark HelpSheet

Common Alias Functions

These functions are exactly equivalent

		Reference
filter	where	pyspark.sql.DataFrame.filter
drop_duplicates	dropDuplicates	pyspark.sql.DataFrame.drop_duplicates
avg	mean	pyspark.sql.GroupedData.avg

AllieUbisse / 1-ETL_TS.ipynb

Created August 23, 2020 12:11 — forked from carlleston/1-ETL_TS.ipynb

Sorry, something went wrong. Reload?

Sorry, we cannot display this file.

Sorry, this file is invalid so it cannot be displayed.

AllieUbisse / Spark Dataframe Cheat Sheet.py

Created August 21, 2020 23:19 — forked from crawles/Spark Dataframe Cheat Sheet.py

Cheat sheet for Spark Dataframes (using Python)

	# A simple cheat sheet of Spark Dataframe syntax
	# Current for Spark 1.6.1

	# import statements
	from pyspark.sql import SQLContext
	from pyspark.sql.types import *
	from pyspark.sql.functions import *

	#creating dataframes
	df = sqlContext.createDataFrame([(1, 4), (2, 5), (3, 6)], ["A", "B"]) # from manual data

AllieUbisse / 7.2 Splice MLflow Support.ipynb

Created August 10, 2020 05:20 — forked from Ben-Epstein/7.2 Splice MLflow Support.ipynb

Sorry, something went wrong. Reload?

Sorry, we cannot display this file.

Sorry, this file is invalid so it cannot be displayed.

AllieUbisse / Readme

Created June 12, 2020 06:04 — forked from ehsan2167/Readme

	1. Introduction

	What do we understand when we talk about the term Machine-Learning in today’s perspective of Technology? What can we achieve through means of complex algorithms?
	Simple answer to these questions comes from the need to recognize patterns, make predictions and the ability of a machine to operate over data without having to give static program instructions to it. Machine Learning is the field of computer science that gives machines/computers the ability to learn without being explicitly programmed. It is employed in a range of computing tasks where designing & programming explicit algorithms with great performance is infeasible, this includes email filtering, intruder detection in networks, computer vision, optical character recognition (OCR), etc.
	Machine learning is considered to be closely related to computational statistics which as we know focuses on prediction-making through the use of computers. It is also conflated with Data mining because of the exploratory data analysis involved in b

NewerOlder