TY - GEN
T1 - Characterising deep learning loss landscapes with local optima networks
AU - Zhou, Yuyang
AU - Neri, Ferrante
AU - Bai, Ruibin
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Deep learning has gained significant popularity in recent years, particularly for tasks like image and speech recognition, natural language processing, and other intricate pattern recognition challenges. However, training a deep learning model involves tuning millions or even billions of parameters. Consequently, this training process becomes a large-scale optimisation problem associated with a mostly unknown but highly non-convex fitness landscape. In recent decades, advances in fitness landscape analysis have revolved around characterizing landscapes representing loss functions, with Local Optima Networks (LONs) emerging as a promising tool. This paper, while focusing on LeNet-5, leverages LON to address four key questions concerning the nature of the learning problem. We emphasize the impact of experimental conditions during the analysis phase on drawing conclusions about the problem's nature. The results shed light on parametrization and optimiser selection to enhance the analysis and comprehension of deep learning loss landscapes. In particular, we identify the presence and number of funnels in the landscape's structure, study the impact of the dataset on the nature of the problem, investigate how the choice of local search optimisers may influence conclusions about the problem's structure. Finally, sensitivity analysis was conducted on the perturbation strength of the Basin-Hopping sampling method for LON construction.
AB - Deep learning has gained significant popularity in recent years, particularly for tasks like image and speech recognition, natural language processing, and other intricate pattern recognition challenges. However, training a deep learning model involves tuning millions or even billions of parameters. Consequently, this training process becomes a large-scale optimisation problem associated with a mostly unknown but highly non-convex fitness landscape. In recent decades, advances in fitness landscape analysis have revolved around characterizing landscapes representing loss functions, with Local Optima Networks (LONs) emerging as a promising tool. This paper, while focusing on LeNet-5, leverages LON to address four key questions concerning the nature of the learning problem. We emphasize the impact of experimental conditions during the analysis phase on drawing conclusions about the problem's nature. The results shed light on parametrization and optimiser selection to enhance the analysis and comprehension of deep learning loss landscapes. In particular, we identify the presence and number of funnels in the landscape's structure, study the impact of the dataset on the nature of the problem, investigate how the choice of local search optimisers may influence conclusions about the problem's structure. Finally, sensitivity analysis was conducted on the perturbation strength of the Basin-Hopping sampling method for LON construction.
KW - convolutional neural networks
KW - deep learning
KW - Fitness landscape analysis
KW - local optima net-works
KW - loss land-scape
UR - http://www.scopus.com/inward/record.url?scp=85201734427&partnerID=8YFLogxK
U2 - 10.1109/CEC60901.2024.10611772
DO - 10.1109/CEC60901.2024.10611772
M3 - Conference contribution
AN - SCOPUS:85201734427
T3 - 2024 IEEE Congress on Evolutionary Computation, CEC 2024 - Proceedings
BT - 2024 IEEE Congress on Evolutionary Computation, CEC 2024 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 13th IEEE Congress on Evolutionary Computation, CEC 2024
Y2 - 30 June 2024 through 5 July 2024
ER -