International Journal of Research in Science and Technology (IJRST)

Breakdown Point Analysis for Deep Neural Network Estimators Under Heavy-Tailed Noise

Abdullah Mohammed Rashid

Department of Accounting and Financial Control, College of Business Economics, Al-Nahrain University, Iraq.

Hadeel Fawzi Mohammed

Al-Kindy College of Medicine, University of Baghdad, Iraq.

Hasan Talib Hendi

College of Science, University of Baghdad, Iraq.

Pages: 106-127

Vol: 16, Issue: 2, 2026

Receiving Date: 2026-02-28

Acceptance Date: 2026-04-05

Publication Date: 2026-04-25

http://doi.org/10.37648/ijrst.v16i02.005

Abstract

In this paper, we experimentally assess the breakdown point of deep neural network regression estimators trained under heavy-tailed label contamination. On the standard California Housing dataset, we randomly replace between 0% and 50% of the training labels with independent Cauchy noise (scale γ = 50). We compare Mean Squared Error (MSE) loss with Huber loss (δ = 1), training a 3-layer MLP with Adam for 100 epochs at each corruption level. Models are scored on a held-out clean test set using Mean Absolute Error (MAE). MSE-trained networks break down catastrophically at 5% corruption, the lowest nonzero level tested (levels were spaced in 5% increments), where test MAE is 3,221% higher than for a model trained without label corruption. In stark contrast, Huber-trained models remain resilient at every corruption level, with test MAE growing by only 29% at 50% corruption. At 35% corruption, where the gap between the two losses is largest, the MSE-trained model incurs an error 340x that of the Huber-trained model. These results are consistent with an empirical breakdown point of zero for MSE loss and show that Huber loss dramatically increases the robustness of DNN regression estimators to heavy-tailed label noise.
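
The corruption scheme and the two losses named in the abstract can be sketched in a few lines. This is a minimal, standard-library illustration and not the authors' code: the function names, the seed, and the use of inverse-CDF sampling for the Cauchy variates are assumptions made here, while the scale of 50 and the Huber threshold of 1 are taken from the abstract.

```python
import math
import random


def corrupt_labels(labels, fraction, scale=50.0, seed=0):
    """Replace a random `fraction` of labels with independent Cauchy(0, scale) draws.

    Hypothetical helper: the name, signature, and seed are illustrative only.
    """
    rng = random.Random(seed)
    corrupted = list(labels)
    n_bad = int(round(fraction * len(labels)))
    for i in rng.sample(range(len(labels)), n_bad):
        # Inverse-CDF sampling: scale * tan(pi * (u - 0.5)) with u uniform on [0, 1).
        corrupted[i] = scale * math.tan(math.pi * (rng.random() - 0.5))
    return corrupted


def mse_loss(residual):
    """Squared error: grows quadratically with the residual."""
    return residual ** 2


def huber_loss(residual, delta=1.0):
    """Quadratic for |residual| <= delta, linear beyond it."""
    a = abs(residual)
    if a <= delta:
        return 0.5 * a * a
    return delta * (a - 0.5 * delta)


if __name__ == "__main__":
    clean = [2.0] * 100
    noisy = corrupt_labels(clean, fraction=0.05)
    print(sum(1 for a, b in zip(clean, noisy) if a != b), "labels replaced")
    # A single residual of 1000 costs 1e6 under MSE but only 999.5 under Huber.
    print(mse_loss(1000.0), huber_loss(1000.0))
```

Because the Huber penalty is linear in large residuals, its gradient with respect to the prediction is bounded by the threshold, so one extreme Cauchy draw cannot dominate a parameter update the way it can under squared error.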

Keywords: breakdown point; robust regression; Huber loss; deep neural networks; heavy-tailed noise; outlier robustness.
