Skip to content

Instantly share code, notes, and snippets.

1.Coursera

Introduction to Big Data

Big Data Modeling and Management Systems

Big Data Integration and Processing

Ubuntu

% sudo apt install docker.io
% sudo systemctl enable --now docker
% sudo docker --version
% service docker status
% sudo docker run hello-world
@leoricklin
leoricklin / res_gcp.md
Last active May 15, 2024 06:12
20221109
@leoricklin
leoricklin / PySpark DataFrame from many small pandas DataFrames.ipynb
Created September 30, 2020 07:26 — forked from linar-jether/PySpark DataFrame from many small pandas DataFrames.ipynb
Convert a RDD of pandas DataFrames to a single Spark DataFrame using Arrow and without collecting all data in the driver.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
import wordcloud.PolarWordCloud
import wordcloud.CollisionMode
import wordcloud.PolarBlendMode
import wordcloud.bg.CircleBackground
import wordcloud.bg.PixelBoundryBackground
import wordcloud.bg.RectangleBackground
import wordcloud.font.CloudFont
import wordcloud.font.FontWeight
import wordcloud.font.scale.LinearFontScalar
import wordcloud.font.scale.SqrtFontScalar