Dynamic Transfer Learning Switching Approach Using Resource Benchmark in Edge Intelligence

Lionel Nkenyereye; Chellakannu Rajkumar; Boon Giin Lee; Wan Young Chung

doi:10.1109/JIOT.2025.3557408

Dynamic Transfer Learning Switching Approach Using Resource Benchmark in Edge Intelligence

Lionel Nkenyereye, Chellakannu Rajkumar, Boon Giin Lee, Wan Young Chung

Research output: Journal Publication › Article › peer-review

Abstract

Machine learning (ML) techniques are applied for profiling computing and processing resources data collected while running deep neural network models on edge devices. Adaptive deep neural network (DNN) model switching requires proper benchmarking for categorizing AI models based on their applications and computational resources enabled by their processing accelerators. Based on benchmark metrics, DNN models can be classified into tiny, low, small, medium, and large resources, then identify DNN models that perform well within resource constraints. Ensure efficient resource allocation, latency management, and trade-off between accuracy and resource. In this work, we propose a benchmark for edge transfer artificial intelligence learning service (TALS) that uses ML techniques. They aim at classifying DNN models by their target edge applications while running edge inferences. We used both unsupervised learning (UL) and supervised learning (SL) techniques to identify the most effective features for the TALS models and to benchmark the performance of edge devices. To achieve this, two approaches were investigated: first, determining features based on edge inference’s computing resources profiling using principal component analysis; and second, classifying the DNN models at the target application level using a regression approach based on historical resource utilization data. In addition, we propose a dynamic model transfer learning that switches between a set of pre-trained and optimized and quantized DNN models based on the cost function. ML techniques learn resource-aware prediction from new resource allocation data and ensure that the multicriteria switch cost selects the inference task models that meet the edge resource constraint requirements. The experimental results highlight a strong relationship between the supervised learning model and the clustering execution method. The dynamic switching approach on real edge devices demonstrates dynamic switching between models according to inference task complexity. We conclude that dynamic switching models allows to ensure smooth operation without overloading resources in edge intelligence.

Original language	English
Journal	IEEE Internet of Things Journal
DOIs	https://doi.org/10.1109/JIOT.2025.3557408
Publication status	Published - 3 Apr 2025

Keywords

Benchmark
Deep neural network classification
Edge Computing
Edge Devices
Transfer AI Learning
Unsupervised Learning

ASJC Scopus subject areas

Signal Processing
Information Systems
Hardware and Architecture
Computer Science Applications
Computer Networks and Communications

Access to Document

10.1109/JIOT.2025.3557408

Cite this

@article{30c86dc8a4c8435eb40760f005666c6c,

title = "Dynamic Transfer Learning Switching Approach Using Resource Benchmark in Edge Intelligence",

abstract = "Machine learning (ML) techniques are applied for profiling computing and processing resources data collected while running deep neural network models on edge devices. Adaptive deep neural network (DNN) model switching requires proper benchmarking for categorizing AI models based on their applications and computational resources enabled by their processing accelerators. Based on benchmark metrics, DNN models can be classified into tiny, low, small, medium, and large resources, then identify DNN models that perform well within resource constraints. Ensure efficient resource allocation, latency management, and trade-off between accuracy and resource. In this work, we propose a benchmark for edge transfer artificial intelligence learning service (TALS) that uses ML techniques. They aim at classifying DNN models by their target edge applications while running edge inferences. We used both unsupervised learning (UL) and supervised learning (SL) techniques to identify the most effective features for the TALS models and to benchmark the performance of edge devices. To achieve this, two approaches were investigated: first, determining features based on edge inference{\textquoteright}s computing resources profiling using principal component analysis; and second, classifying the DNN models at the target application level using a regression approach based on historical resource utilization data. In addition, we propose a dynamic model transfer learning that switches between a set of pre-trained and optimized and quantized DNN models based on the cost function. ML techniques learn resource-aware prediction from new resource allocation data and ensure that the multicriteria switch cost selects the inference task models that meet the edge resource constraint requirements. The experimental results highlight a strong relationship between the supervised learning model and the clustering execution method. The dynamic switching approach on real edge devices demonstrates dynamic switching between models according to inference task complexity. We conclude that dynamic switching models allows to ensure smooth operation without overloading resources in edge intelligence.",

keywords = "Benchmark, Deep neural network classification, Edge Computing, Edge Devices, Transfer AI Learning, Unsupervised Learning",

author = "Lionel Nkenyereye and Chellakannu Rajkumar and Lee, {Boon Giin} and Chung, {Wan Young}",

note = "Publisher Copyright: {\textcopyright} 2014 IEEE.",

year = "2025",

month = apr,

day = "3",

doi = "10.1109/JIOT.2025.3557408",

language = "English",

journal = "IEEE Internet of Things Journal",

issn = "2327-4662",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - JOUR

T1 - Dynamic Transfer Learning Switching Approach Using Resource Benchmark in Edge Intelligence

AU - Nkenyereye, Lionel

AU - Rajkumar, Chellakannu

AU - Lee, Boon Giin

AU - Chung, Wan Young

PY - 2025/4/3

Y1 - 2025/4/3

N2 - Machine learning (ML) techniques are applied for profiling computing and processing resources data collected while running deep neural network models on edge devices. Adaptive deep neural network (DNN) model switching requires proper benchmarking for categorizing AI models based on their applications and computational resources enabled by their processing accelerators. Based on benchmark metrics, DNN models can be classified into tiny, low, small, medium, and large resources, then identify DNN models that perform well within resource constraints. Ensure efficient resource allocation, latency management, and trade-off between accuracy and resource. In this work, we propose a benchmark for edge transfer artificial intelligence learning service (TALS) that uses ML techniques. They aim at classifying DNN models by their target edge applications while running edge inferences. We used both unsupervised learning (UL) and supervised learning (SL) techniques to identify the most effective features for the TALS models and to benchmark the performance of edge devices. To achieve this, two approaches were investigated: first, determining features based on edge inference’s computing resources profiling using principal component analysis; and second, classifying the DNN models at the target application level using a regression approach based on historical resource utilization data. In addition, we propose a dynamic model transfer learning that switches between a set of pre-trained and optimized and quantized DNN models based on the cost function. ML techniques learn resource-aware prediction from new resource allocation data and ensure that the multicriteria switch cost selects the inference task models that meet the edge resource constraint requirements. The experimental results highlight a strong relationship between the supervised learning model and the clustering execution method. The dynamic switching approach on real edge devices demonstrates dynamic switching between models according to inference task complexity. We conclude that dynamic switching models allows to ensure smooth operation without overloading resources in edge intelligence.

AB - Machine learning (ML) techniques are applied for profiling computing and processing resources data collected while running deep neural network models on edge devices. Adaptive deep neural network (DNN) model switching requires proper benchmarking for categorizing AI models based on their applications and computational resources enabled by their processing accelerators. Based on benchmark metrics, DNN models can be classified into tiny, low, small, medium, and large resources, then identify DNN models that perform well within resource constraints. Ensure efficient resource allocation, latency management, and trade-off between accuracy and resource. In this work, we propose a benchmark for edge transfer artificial intelligence learning service (TALS) that uses ML techniques. They aim at classifying DNN models by their target edge applications while running edge inferences. We used both unsupervised learning (UL) and supervised learning (SL) techniques to identify the most effective features for the TALS models and to benchmark the performance of edge devices. To achieve this, two approaches were investigated: first, determining features based on edge inference’s computing resources profiling using principal component analysis; and second, classifying the DNN models at the target application level using a regression approach based on historical resource utilization data. In addition, we propose a dynamic model transfer learning that switches between a set of pre-trained and optimized and quantized DNN models based on the cost function. ML techniques learn resource-aware prediction from new resource allocation data and ensure that the multicriteria switch cost selects the inference task models that meet the edge resource constraint requirements. The experimental results highlight a strong relationship between the supervised learning model and the clustering execution method. The dynamic switching approach on real edge devices demonstrates dynamic switching between models according to inference task complexity. We conclude that dynamic switching models allows to ensure smooth operation without overloading resources in edge intelligence.

KW - Benchmark

KW - Deep neural network classification

KW - Edge Computing

KW - Edge Devices

KW - Transfer AI Learning

KW - Unsupervised Learning

UR - http://www.scopus.com/inward/record.url?scp=105002251178&partnerID=8YFLogxK

U2 - 10.1109/JIOT.2025.3557408

DO - 10.1109/JIOT.2025.3557408

M3 - Article

AN - SCOPUS:105002251178

SN - 2327-4662

JO - IEEE Internet of Things Journal

JF - IEEE Internet of Things Journal

ER -

Dynamic Transfer Learning Switching Approach Using Resource Benchmark in Edge Intelligence

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this