Drop the first category from binary features (only) with OneHotEncoder

Published: 28 September 2021
Channel: Data School

New in version 0.23: Use drop='if_binary' with OneHotEncoder to drop the first category ONLY if it's a binary feature (meaning it has exactly two categories).
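A minimal sketch of the behavior described above, assuming scikit-learn 0.23 or later. The toy data and the variable names (`X`, `ohe`, `encoded`) are made up for illustration: the first column has exactly two categories, the second has three.

```python
import numpy as np
from sklearn.preprocessing import OneHotEncoder

# Hypothetical toy data: column 0 is binary, column 1 has three categories
X = np.array(
    [
        ["male", "S"],
        ["female", "C"],
        ["female", "Q"],
        ["male", "S"],
    ],
    dtype=object,
)

# drop='if_binary' drops the first category only for two-category features
ohe = OneHotEncoder(drop="if_binary")
encoded = ohe.fit_transform(X).toarray()  # default output is sparse, so densify

print(ohe.categories_)  # categories learned per column, in sorted order
print(encoded.shape)    # 1 column for the binary feature + 3 for the other
print(encoded)
```

The binary column is reduced to a single indicator (its first category in sorted order, `female`, is dropped), while the three-category column keeps all three of its columns.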

Note: Beginning in scikit-learn 1.0, drop='first' and drop='if_binary' can both be used with handle_unknown='ignore'. However, the dropped category and an unknown category will both be encoded as all zeros.

👉 New tips every TUESDAY and THURSDAY! 👈

🎥 Watch all tips: scikit-learn tips
🗒️ Code for all tips: https://github.com/justmarkham/scikit...
💌 Get tips via email: https://scikit-learn.tips

