Four reasons to use scikit-learn (not pandas) for ML preprocessing

Published: October 27, 2020
Channel: Data School

Reasons to use scikit-learn (not pandas) for ML preprocessing:
1. You can cross-validate the entire workflow
2. You can grid search model & preprocessing hyperparameters
3. You avoid adding new columns to the source DataFrame
4. You avoid data leakage, because scikit-learn has the separate fit/transform steps that pandas lacks
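
The four points above can be sketched in a few lines. This is a minimal illustration, not the code from the video: the toy DataFrame, its column names, and the choice of LogisticRegression are all assumptions made for the example. Preprocessing lives inside a Pipeline, so it is re-fit on the training folds only during cross-validation, can be tuned together with the model, and never writes new columns back to the DataFrame.

```python
import pandas as pd
from sklearn.compose import make_column_transformer
from sklearn.impute import SimpleImputer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV, cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import OneHotEncoder

# Hypothetical toy data, invented for this sketch (not from the video)
df = pd.DataFrame({
    "age": [22, 38, None, 35, 54, 2, 27, 14, 40, 58, 31, 19],
    "embarked": ["S", "C", "S", "S", "Q", "S", "C", "C", "S", "Q", "S", "C"],
    "survived": [0, 1, 1, 1, 0, 0, 1, 1, 0, 1, 0, 1],
})
X = df[["age", "embarked"]]
y = df["survived"]

# Preprocessing is defined as transformers instead of being applied with
# pandas, so the source DataFrame is never modified (reason 3)
preprocessor = make_column_transformer(
    (SimpleImputer(), ["age"]),
    (OneHotEncoder(handle_unknown="ignore"), ["embarked"]),
)
pipe = make_pipeline(preprocessor, LogisticRegression(max_iter=1000))

# Reasons 1 and 4: each split fits the imputer and encoder on the training
# folds only, then transforms the held-out fold, so nothing leaks across folds
scores = cross_val_score(pipe, X, y, cv=3, scoring="accuracy")

# Reason 2: one grid covers a preprocessing and a model hyperparameter.
# The step names are the lowercased class names generated by
# make_pipeline and make_column_transformer.
param_grid = {
    "columntransformer__simpleimputer__strategy": ["mean", "median"],
    "logisticregression__C": [0.1, 1, 10],
}
grid = GridSearchCV(pipe, param_grid, cv=3, scoring="accuracy")
grid.fit(X, y)

print(scores.mean())
print(grid.best_params_)
```

With only 12 rows the scores themselves mean nothing; the point is that the whole workflow, preprocessing included, is a single object passed to cross_val_score and GridSearchCV.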

👉 New tips every TUESDAY and THURSDAY! 👈

🎥 Watch all tips: scikit-learn tips
🗒️ Code for all tips: https://github.com/justmarkham/scikit...
💌 Get tips via email: https://scikit-learn.tips


=== WANT TO GET BETTER AT MACHINE LEARNING? ===

1) WATCH my video series: Machine learning in Python with sciki...

2) ENROLL in my courses: https://www.dataschool.io/ml-courses/

3) LET'S CONNECT!
Newsletter: https://www.dataschool.io/subscribe/
Twitter: /justmarkham
Facebook: /datascienceschool
LinkedIn: /justmarkham