Use "fit_transform" on training data, but "transform" (only) on testing/new data

Published: 22 October 2020
Channel: Data School

Use "fit_transform" on training data, but "transform" (only) on testing/new data.
This applies the transformations learned from the training data to both sets of data, which creates consistent columns and prevents data leakage!
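
Here is a minimal sketch of the tip, not the code from the video. The toy arrays and variable names are made up for illustration, and StandardScaler and OneHotEncoder are just two example transformers:

```python
import numpy as np
from sklearn.preprocessing import OneHotEncoder, StandardScaler

# Hypothetical toy data (not from the video)
X_train_num = np.array([[1.0], [2.0], [3.0]])
X_test_num = np.array([[4.0]])

scaler = StandardScaler()
# fit_transform: learn the mean and standard deviation from training data, then scale it
X_train_scaled = scaler.fit_transform(X_train_num)
# transform only: reuse the training mean and standard deviation on the test data
X_test_scaled = scaler.transform(X_test_num)

X_train_cat = np.array([["red"], ["green"], ["blue"]])
X_test_cat = np.array([["green"], ["purple"]])  # "purple" never appears in training

# handle_unknown="ignore" encodes unseen categories as all zeros instead of raising an error
ohe = OneHotEncoder(handle_unknown="ignore")
# fit_transform: learn the categories from training data, then encode it
train_encoded = ohe.fit_transform(X_train_cat).toarray()
# transform only: encode test data into the same columns learned from training
test_encoded = ohe.transform(X_test_cat).toarray()

print(train_encoded.shape, test_encoded.shape)
```

Both encoded arrays have the same three columns because the categories were learned once, from the training data. Calling fit_transform on the test rows would instead learn a new mean and a new set of categories, so the output columns would no longer line up with what a model saw during training.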

👉 New tips every TUESDAY and THURSDAY! 👈

🎥 Watch all tips: scikit-learn tips
🗒️ Code for all tips: https://github.com/justmarkham/scikit...
💌 Get tips via email: https://scikit-learn.tips


=== WANT TO GET BETTER AT MACHINE LEARNING? ===

1) WATCH my video series: Machine learning in Python with sciki...

2) ENROLL in my courses: https://www.dataschool.io/ml-courses/

3) LET'S CONNECT!
Newsletter: https://www.dataschool.io/subscribe/
Twitter: /justmarkham
Facebook: /datascienceschool
LinkedIn: /justmarkham