arXiv (Cornell University)
Learning Deep Networks from Noisy Labels with Dropout Regularization
May 2017 • Ishan Jindal, Matthew Nokleby, Xuewen Chen
Large datasets often have unreliable labels-such as those obtained from Amazon's Mechanical Turk or social media platforms-and classifiers trained on mislabeled datasets often exhibit poor performance. We present a simple, effective technique for accounting for label noise when training deep neural networks. We augment a standard deep network with a softmax layer that models the label noise statistics. Then, we train the deep network and noise model jointly via end-to-end stochastic gradient descent on the (perhap…