Regularization in network optimization via trimmed stochastic gradient descent with noisy label

Research output: Contribution to journal › Journal article › Research › peer-review


Kensuke Nakamura
Bong Soo Sohn
Kyoung Jae Won
Byung Woo Hong

Regularization is essential for avoiding over-fitting to training data in network optimization, leading to better generalization of the trained networks. Label noise provides a strong implicit regularization by replacing the ground-truth target labels of training examples with uniform random labels. However, it can produce undesirable, misleading gradients because of the large loss associated with incorrect labels. We propose a first-order optimization method (Label-Noised Trim-SGD) that combines label noise with example trimming in order to remove outliers based on their loss. The proposed algorithm is simple, yet it allows a large amount of label noise to be imposed and yields a better regularization effect than the original methods. A quantitative analysis compares the behavior of label noise, example trimming, and the proposed algorithm. We also present empirical results on major benchmarks and fundamental networks that demonstrate the effectiveness of our algorithm, where our method outperforms state-of-the-art optimization methods.
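
The abstract's combination of label noise and loss-based trimming can be illustrated with a minimal sketch. This is not the authors' implementation: the linear softmax model, the function names, and the parameters `noise_rate`, `keep_ratio`, and `lr` are all assumptions made for illustration, and the sketch assumes that "outliers" are the examples with the largest per-example loss.

```python
import math
import random


def add_label_noise(labels, num_classes, noise_rate, rng):
    """With probability noise_rate, replace a label by a uniform random class."""
    return [rng.randrange(num_classes) if rng.random() < noise_rate else y
            for y in labels]


def softmax(logits):
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(probs, label):
    return -math.log(max(probs[label], 1e-12))


def trim_indices(losses, keep_ratio):
    """Indices of the keep_ratio fraction of examples with the smallest loss."""
    n_keep = max(1, int(round(keep_ratio * len(losses))))
    order = sorted(range(len(losses)), key=lambda i: losses[i])
    return sorted(order[:n_keep])


def label_noised_trim_sgd_step(weights, batch_x, batch_y, num_classes,
                               noise_rate, keep_ratio, lr, rng):
    """One illustrative step on a linear softmax classifier.

    weights[c][d] is updated in place. Assumed recipe: perturb the labels,
    compute per-example losses, drop the largest-loss examples, then take
    a gradient step averaged over the examples that remain.
    """
    noisy_y = add_label_noise(batch_y, num_classes, noise_rate, rng)
    probs = [softmax([sum(w * v for w, v in zip(weights[c], x))
                      for c in range(num_classes)]) for x in batch_x]
    losses = [cross_entropy(p, y) for p, y in zip(probs, noisy_y)]
    kept = trim_indices(losses, keep_ratio)
    for c in range(num_classes):
        for d in range(len(weights[c])):
            # Cross-entropy gradient w.r.t. weights[c][d]: (p_c - 1[y == c]) * x_d
            grad = sum((probs[i][c] - (1.0 if noisy_y[i] == c else 0.0))
                       * batch_x[i][d] for i in kept) / len(kept)
            weights[c][d] -= lr * grad
    return kept, losses
```

In this sketch, raising `noise_rate` strengthens the perturbation, while lowering `keep_ratio` discards more of the high-loss examples that the perturbed labels tend to create. The values the paper actually uses are not stated in the abstract.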

Original language	English
Journal	IEEE Access
Volume	10
Pages (from-to)	34706-34715
Number of pages	10
ISSN	2169-3536
DOIs	https://doi.org/10.1109/ACCESS.2022.3171910
Publication status	Published - 2022

Bibliographical note

Publisher Copyright: Author

Research areas

Data models, data trimming, label noise, Loss measurement, network optimization, Neural networks, Noise measurement, Optimization, regularization, Stochastic processes, Training
