How I Work With MILLIONS OF ROWS DATA using PYTHON | PYSPARK & BIG DATA

Published: 02 August 2023
Channel: Mo Chen

🎉 Check out Bright Data ➡︎ https://brdta.com/datawithmo

NEWSLETTER, EXCLUSIVE COMMUNITY & DATA CAREER RESOURCES
🌐 My website: https://mochen.info/

GET IN TOUCH
📊 Bespoke data analytics solutions for your business: [email protected]
🤝 Collaborations & sponsorships: [email protected]

PROFESSIONAL CERTIFICATES
🖥️ Get 25% OFF DataCamp's Data Analytics and AI courses here: https://datacamp.pxf.io/DataCampMoChen
🏅 DataCamp Certifications: https://datacamp.pxf.io/DataCampCertM...
🧑‍💻 DataCamp Data Analyst Certification: https://datacamp.pxf.io/dataAnalystCe...
🧑‍💻 Data Analyst in Power BI (Microsoft PL-300): https://datacamp.pxf.io/dataAnalystPo...
📊 Data Analyst in Tableau (Tableau Certified Data Analyst): https://datacamp.pxf.io/DAinTableauMo...
🤖 AI Fundamentals: https://datacamp.pxf.io/AIfundamental...
☁️ Microsoft Azure Fundamentals (AZ-900): https://datacamp.pxf.io/az900MoChen
⛅ AWS Cloud Practitioner (CLF-C02): https://datacamp.pxf.io/awsCloudPract...
🌥️ Introduction to GCP: https://datacamp.pxf.io/gcp

ABOUT ME / mo-chen1
I'm Mo, and I work as a data analytics manager and content creator. I make videos about how best to grow your data career, whether that means starting it, transitioning into it, or taking it to the next level.

TIMESTAMPS
00:00 Intro
01:47 Data Gathering
05:36 Basic Operations with PySpark
11:28 Clean and Transform Data
16:08 Getting the Insights / Answering Key Questions
23:13 Recap
28:04 Outro

PROJECT FILES
Dataset: https://brdta.com/datawithmo
Code: https://github.com/mochen862/big-data...