Google Colab + PySpark

Опубликовано: 21 Август 2024
на канале: Adrian Dolinay
882
16

Tutorial on using PySpark within Google Colab.

The notebook can be found in the "Google Colab Tutorials" folder within the below repo.
GitHub Repo - https://github.com/ad17171717/YouTube...

CONNECT:
LinkedIn:   / adrian-dolinay-frm-96a289106  
GitHub: https://github.com/ad17171717
X:   / dolinayg  
Odysee: https://odysee.com/@adriandolinay:0
Medium:   / adriandolinay  

|-Video Chapters-|
0:00 - Intro
0:33 - Installing PySpark within a Colab session
1:02 - Downloading a dataset
1:34 - Initializing Spark session & reading dataset into a PySpark DataFrame
3:23 - PySpark operations
3:54 - Limitations of PySpark within Google Colab
4:47 - References and additional learning