Linear Digressions
Linear Digressions
Ben Jaffe and Katie Malone
SMOTE: makin' yourself some fake minority data
14 minutes Posted Jun 12, 2016 at 8:06 pm.
0:00
14:37
Download MP3
Show notes
Machine learning on imbalanced classes: surprisingly tricky. Many (most?) algorithms tend to just assign the majority class label to all the data and call it a day. SMOTE is an algorithm for manufacturing new minority class examples for yourself, to help your algorithm better identify them in the wild.
Relevant links:
https://www.jair.org/media/953/live-953-2037-jair.pdf