TY - GEN
T1 - SiamNAS
T2 - 2025 Genetic and Evolutionary Computation Conference, GECCO 2025
AU - Zhou, Yuyang
AU - Neri, Ferrante
AU - Ong, Yew Soon
AU - Bai, Ruibin
N1 - Publisher Copyright:
© 2025 Copyright is held by the owner/author(s). Publication rights licensed to ACM.
PY - 2025/7/13
Y1 - 2025/7/13
N2 - Modern neural architecture search (NAS) is inherently multi-objective balancing trade-offs such as accuracy, parameter count, and computational cost. This complexity makes NAS computationally expensive and nearly impossible to solve without efficient approximations. To address this, we propose a novel surrogate modelling approach that leverages an ensemble of Siamese network blocks to predict dominance relationships between candidate architectures. Lightweight and easy to train, the surrogate achieves 92% accuracy and replaces the crowding distance calculation in the survivor selection strategy with a heuristic rule based on model size. Integrated into a framework termed SiamNAS, this design eliminates costly evaluations during the search process. Experiments on NAS-Bench-201 demonstrate the framework's ability to identify Pareto-optimal solutions with significantly reduced computational costs. The proposed SiamNAS identified a final non-dominated set containing the best architecture in NAS-Bench-201 for CIFAR-10 and the second-best for ImageNet, in terms of test error rate, within 0.01 GPU days. This proof-of-concept study highlights the potential of the proposed Siamese network surrogate model to generalise to multi-tasking optimisation, enabling simultaneous optimisation across tasks. Additionally, it offers opportunities to extend the approach for generating Sets of Pareto Sets (SOS), providing diverse Pareto-optimal solutions for heterogeneous task settings.
AB - Modern neural architecture search (NAS) is inherently multi-objective balancing trade-offs such as accuracy, parameter count, and computational cost. This complexity makes NAS computationally expensive and nearly impossible to solve without efficient approximations. To address this, we propose a novel surrogate modelling approach that leverages an ensemble of Siamese network blocks to predict dominance relationships between candidate architectures. Lightweight and easy to train, the surrogate achieves 92% accuracy and replaces the crowding distance calculation in the survivor selection strategy with a heuristic rule based on model size. Integrated into a framework termed SiamNAS, this design eliminates costly evaluations during the search process. Experiments on NAS-Bench-201 demonstrate the framework's ability to identify Pareto-optimal solutions with significantly reduced computational costs. The proposed SiamNAS identified a final non-dominated set containing the best architecture in NAS-Bench-201 for CIFAR-10 and the second-best for ImageNet, in terms of test error rate, within 0.01 GPU days. This proof-of-concept study highlights the potential of the proposed Siamese network surrogate model to generalise to multi-tasking optimisation, enabling simultaneous optimisation across tasks. Additionally, it offers opportunities to extend the approach for generating Sets of Pareto Sets (SOS), providing diverse Pareto-optimal solutions for heterogeneous task settings.
KW - multi-objective optimisation
KW - neural architecture search
KW - siamese networks
KW - surrogate assisted models
UR - https://www.scopus.com/pages/publications/105013077130
U2 - 10.1145/3712256.3726359
DO - 10.1145/3712256.3726359
M3 - Conference contribution
AN - SCOPUS:105013077130
T3 - GECCO 2025 - Proceedings of the 2025 Genetic and Evolutionary Computation Conference
SP - 1309
EP - 1318
BT - GECCO 2025 - Proceedings of the 2025 Genetic and Evolutionary Computation Conference
A2 - Ochoa, Gabriela
PB - Association for Computing Machinery, Inc
Y2 - 14 July 2025 through 18 July 2025
ER -