BryanCutler/PySpark_createDataFrame_with_Arrow.ipynb

Last active September 16, 2020 02:30

Star (1) You must be signed in to star a gist
Fork (2) You must be signed in to fork a gist

Learn more about clone URLs
Clone this repository at <script src="https://gist.github.com/BryanCutler/bc73d573b7e46a984ff8b6edf228e298.js"></script>
Save BryanCutler/bc73d573b7e46a984ff8b6edf228e298 to your computer and use it in GitHub Desktop.

Download ZIP

How to create a Spark DataFrame from Pandas or NumPy with Arrow

Raw

PySpark_createDataFrame_with_Arrow.ipynb

Sorry, something went wrong. Reload?

Sorry, we cannot display this file.

Sorry, this file is invalid so it cannot be displayed.

Author

BryanCutler commented Jul 10, 2019

Thanks @aschmu! The variable spark is a default SparkSession. I forgot to mention you should be running Jupyter with a PySpark kernel. I put a sample script on how I do this here https://gist.github.com/BryanCutler/b7f10167c4face19e03330a07b24ce21 in case it could be of help. Thanks for the feedback!!