bobquest33’s gists

bobquest33 / envpython2.bat

Last active June 23, 2021 14:54

Batch scripts for running mutiple versions of python together in Windows.

	@echo off
	REM "ftype /?" explains all of this assoc and ftype and PATHEXT usage
	REM https://docs.python.org/2/using/windows.html for more info around the subject.

	REM set PythonDIR to your python 2 or 3 install path; e.g. the folder with python.exe in it.

	set PythonDIR=C:\Users\IBM_ADMIN\rcs\python-2.7.9
	set PATH=%PythonDIR%;%PythonDIR%\Scripts;%PATH%
	set PYTHONPATH=%PythonDIR%\Lib;%PythonDIR%\Lib\site-packages;%PythonDIR%\DLLs;
	set PATHEXT=%PATHEXT%;.PY;.PYW

bobquest33 / dask_scr3.py

Created May 4, 2017 18:58

Raspberry Pi Experiments: Running Python3 , Jupyter Notebooks and Dask Cluster — Part 2

	start_time = time.time()
	#pdf = pd.read_csv("http://192.168.0.2:8001/pp-monthly-update-new-version.csv",names=cnames)
	pdf = pd.read_csv("/mnt/nwdrive/Backup/datasets/pp-complete.txt",names=cnames)
	elapsed_time = time.time() - start_time
	print(elapsed_time.total_seconds())
	hours, rem = divmod(elapsed_time, 3600)
	minutes, seconds = divmod(rem, 60)
	print("{:0>2}:{:0>2}:{:05.2f}".format(int(hours),int(minutes),seconds))

bobquest33 / dask_scr2.py

Created May 4, 2017 18:55

Raspberry Pi Experiments: Running Python3 , Jupyter Notebooks and Dask Cluster — Part 2

	import time
	start_time = time.time()
	df = None
	count = 0
	for chunk in pd.read_csv("/mnt/nwdrive/Backup/datasets/pp-complete.txt",names=cnames, chunksize=10000):
	# we are going to append to each table by group
	# we are not going to create indexes at this time
	# but we ARE going to create (some) data_columns
	if df is None:
	df = dd.from_pandas(chunk,npartitions=1)

bobquest33 / dask_script1.py

Created May 4, 2017 18:53

Raspberry Pi Experiments: Running Python3 , Jupyter Notebooks and Dask Cluster — Part 2

	import pandas as pd
	import dask.dataframe as dd
	from distributed import Client
	client = Client('192.168.0.7:8786')
	strcnames = """transaction
	price
	transfer_date
	postcode
	property_type
	newly_built

bobquest33 / script_27_pyspark_wordcount_example.py

Created April 29, 2017 07:22

100 Scripts in 30 Days challenge: Script 26,27,28 Learning PySpark - Script 27, Find Count of words in a file

	#Find count of words in the text file
	#import regex module
	import re
	#import add from operator module
	from operator import add
	#Read a text file and create RDD lines
	lines = sc.textFile("wordtxt.txt")
	#count total no of lines
	print 'number of lines in file:',lines.count()
	#add up lengths of each line

bobquest33 / script_26_basic_nonemptyline_count_example.py

Created April 29, 2017 05:03

	#Basic Non Empty line count example
	#Read a text file and create RDD lines
	lines = sc.textFile("wordtxt.txt")
	#Use tranformation function filter to create notEmptyLines from lines
	notEmptyLines = lines.filter(lambda line:len(line)>0)
	#Execute action count to save the count of non empty lines to count
	#And print it
	count = notEmptyLines.count()
	print(count)
	#Sample output

bobquest33 / topics.txt

Created April 28, 2017 10:13

100 Scripts in 30 Days challenge — Change of Course

	Script Series Topics
	25-40 Pyspark
	41-50 Data Analytics & Big Data
	51-60 Data Science, Machine Learning & Graph Analytics
	61-70 Miscellenious
	70-80 Raspberry Pi
	80-90 Web Development
	90-100 Deep Learning

bobquest33 / script_23_parse_tweet.py

Last active April 27, 2017 03:22

100 Scripts in 30 Days challenge: Script 23,24,25: Parsing Tweets & Graph Analytics from Pickle file

	import pickle
	import sys
	import os
	import json
	from ttp import ttp

	picklefile = sys.argv[1]
	jsonfile = picklefile.replace(".pickle",".json")
	tweets = None
	with open(picklefile,"rb") as pf:

bobquest33 / script_22_intro_py_data_struct.py

Last active April 26, 2017 19:35

100 Scripts in 30 Days challenge: Script 22 — Learning Some Python Basic Data Structures (Lists, Tuples, Numpy Arrays)

bobquest33 / conf_sample.toml

Created April 25, 2017 17:28

100 Scripts in 30 Days challenge: Script 21 — Reading Twitter Stream using Tweepy

Priyabrata Dash bobquest33