De²r: Unifying DVFS and Early-Exit for Embedded AI Inference via Reinforcement Learning

Yuting He; Jingjin Li; Chengtai Li; Qingyu Yang; Zheng Wang; Heshan Du; Jianfeng Ren; Heng Yu

doi:10.23919/DATE64628.2025.10992707

School of Computer Science

Research output: Chapter in Book/Conference proceeding › Conference contribution › peer-review

Abstract

Executing neural networks on resource-constrained embedded devices faces challenges. Efforts have been made at the application and system levels to reduce the execution cost. Among them, the early-exit networks reduce computational cost through intermediate exits, while Dynamic Voltage and Frequency Scaling (DVFS) offers system energy reduction. Existing works strive to unify early-exit and DVFS for combined benefits on both timing and energy flexibility, yet limitations exist: 1) varying time constraints that make different exit points become more, or less, important in terms of inference accuracy, are not taken care of, and 2) the optimal decisions of unifying DVFS and early-exit as a multi-objective optimization problem are not achieved due to the large configuration space. To address these challenges, we propose De²r, a reinforcement learning-based framework that jointly optimizes early-exit points and DVFS settings for continuous inference. In particular, De²r includes a cross-training mechanism that fine-tunes the early-exit network to accommodate dynamic time constraints and system conditions. Experimental results demonstrate that De²r achieves up to 22.03% energy reduction and 3.23% accuracy gain compared to contemporary techniques.
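
To make the joint configuration space concrete, the following is a minimal Python sketch of choosing an (exit point, DVFS level) pair under a deadline. It is not the method of the paper: it uses exhaustive search rather than reinforcement learning, and every name and number in it (`ExitPoint`, `DvfsLevel`, `best_config`, the cycle counts, accuracies, frequencies, voltages, and the effective capacitance `c_eff`) is a hypothetical assumption chosen for illustration. Latency is modelled as cycles divided by frequency, and dynamic energy with the common approximation E = C_eff · V² · cycles.

```python
from dataclasses import dataclass
from itertools import product


@dataclass(frozen=True)
class ExitPoint:
    name: str
    cycles: float    # hypothetical compute cost to reach this exit, in cycles
    accuracy: float  # hypothetical accuracy when stopping at this exit


@dataclass(frozen=True)
class DvfsLevel:
    freq_hz: float   # hypothetical clock frequency
    voltage: float   # hypothetical supply voltage at that frequency


def latency_s(exit_point, level):
    """Latency model: cycles / frequency."""
    return exit_point.cycles / level.freq_hz


def dynamic_energy_j(exit_point, level, c_eff=1e-9):
    """Dynamic energy model: C_eff * V^2 * cycles (c_eff is an assumed constant)."""
    return c_eff * level.voltage ** 2 * exit_point.cycles


def best_config(exits, levels, deadline_s):
    """Among pairs that meet the deadline, pick the highest accuracy,
    breaking ties by lowest energy. Returns None if nothing is feasible."""
    feasible = [
        (e, l) for e, l in product(exits, levels)
        if latency_s(e, l) <= deadline_s
    ]
    if not feasible:
        return None
    return max(feasible, key=lambda c: (c[0].accuracy, -dynamic_energy_j(*c)))


# Illustrative values only; none of these come from the paper.
EXITS = [
    ExitPoint("exit1", 2e8, 0.80),
    ExitPoint("exit2", 5e8, 0.88),
    ExitPoint("final", 1e9, 0.92),
]
LEVELS = [
    DvfsLevel(0.5e9, 0.8),
    DvfsLevel(1.0e9, 1.0),
    DvfsLevel(1.5e9, 1.2),
]

if __name__ == "__main__":
    for deadline in (0.3, 0.6, 1.2):
        exit_point, level = best_config(EXITS, LEVELS, deadline)
        print(deadline, exit_point.name, level.freq_hz)
```

With these assumed values, a tighter deadline pushes the choice toward an earlier exit, while the tie-break favours the lowest-voltage level that still meets the deadline. Enumeration works here only because the toy space has nine pairs; the abstract's point is that the real configuration space, with time constraints varying across continuous inference, is too large for this, which motivates a learned policy.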

Original language	English
Title of host publication	2025 Design, Automation and Test in Europe Conference, DATE 2025 - Proceedings
Publisher	Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)	9783982674100
DOIs	https://doi.org/10.23919/DATE64628.2025.10992707
Publication status	Published - 2025
Event	2025 Design, Automation and Test in Europe Conference, DATE 2025 - Lyon, France Duration: 31 Mar 2025 → 2 Apr 2025

Publication series

Name	Proceedings -Design, Automation and Test in Europe, DATE
ISSN (Print)	1530-1591

Conference

Conference	2025 Design, Automation and Test in Europe Conference, DATE 2025
Country/Territory	France
City	Lyon
Period	31/03/25 → 2/04/25

Keywords

DVFS
Early-Exit Neural Networks
Embedded Computing
Reinforcement Learning

ASJC Scopus subject areas

General Engineering

Cite this

He, Y., Li, J., Li, C., Yang, Q., Wang, Z., Du, H., Ren, J., & Yu, H. (2025). De²r: Unifying DVFS and Early-Exit for Embedded AI Inference via Reinforcement Learning. In 2025 Design, Automation and Test in Europe Conference, DATE 2025 - Proceedings (Proceedings -Design, Automation and Test in Europe, DATE). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.23919/DATE64628.2025.10992707

@inproceedings{5e45281ee0084e09822a296fadfad6e1,

title = "De2r: Unifying DVFS and Early-Exit for Embedded AI Inference via Reinforcement Learning",

abstract = "Executing neural networks on resource-constrained embedded devices faces challenges. Efforts have been made at the application and system levels to reduce the execution cost. Among them, the early-exit networks reduce computational cost through intermediate exits, while Dynamic Voltage and Frequency Scaling (DVFS) offers system energy reduction. Existing works strive to unify early-exit and DVFS for combined benefits on both timing and energy flexibility, yet limitations exist: 1) varying time constraints that make different exit points become more, or less, important in terms of inference accuracy, are not taken care of, and 2) the optimal decisions of unifying DVFS and early-exit as a multi-objective optimization problem are not achieved due to the large configuration space. To address these challenges, we propose De2r, a reinforcement learning-based framework that jointly optimizes early-exit points and DVFS settings for continuous inference. In particular, De2r includes a cross-training mechanism that fine-tunes the early-exit network to accommodate dynamic time constraints and system conditions. Experimental results demonstrate that De2r achieves up to 22.03% energy reduction and 3.23% accuracy gain compared to contemporary techniques.",

keywords = "DVFS, Early-Exit Neural Networks, Embedded Computing, Reinforcement Learning",

author = "Yuting He and Jingjin Li and Chengtai Li and Qingyu Yang and Zheng Wang and Heshan Du and Jianfeng Ren and Heng Yu",

note = "Publisher Copyright: {\textcopyright} 2025 EDAA.; 2025 Design, Automation and Test in Europe Conference, DATE 2025 ; Conference date: 31-03-2025 Through 02-04-2025",

year = "2025",

doi = "10.23919/DATE64628.2025.10992707",

language = "English",

series = "Proceedings -Design, Automation and Test in Europe, DATE",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

booktitle = "2025 Design, Automation and Test in Europe Conference, DATE 2025 - Proceedings",

address = "United States",

}

He, Y, Li, J, Li, C, Yang, Q, Wang, Z, Du, H, Ren, J & Yu, H 2025, De²r: Unifying DVFS and Early-Exit for Embedded AI Inference via Reinforcement Learning. in 2025 Design, Automation and Test in Europe Conference, DATE 2025 - Proceedings. Proceedings -Design, Automation and Test in Europe, DATE, Institute of Electrical and Electronics Engineers Inc., 2025 Design, Automation and Test in Europe Conference, DATE 2025, Lyon, France, 31/03/25. https://doi.org/10.23919/DATE64628.2025.10992707

De²r: Unifying DVFS and Early-Exit for Embedded AI Inference via Reinforcement Learning. / He, Yuting; Li, Jingjin; Li, Chengtai et al.
2025 Design, Automation and Test in Europe Conference, DATE 2025 - Proceedings. Institute of Electrical and Electronics Engineers Inc., 2025. (Proceedings -Design, Automation and Test in Europe, DATE).

TY - GEN

T1 - De2r

T2 - 2025 Design, Automation and Test in Europe Conference, DATE 2025

AU - He, Yuting

AU - Li, Jingjin

AU - Li, Chengtai

AU - Yang, Qingyu

AU - Wang, Zheng

AU - Du, Heshan

AU - Ren, Jianfeng

AU - Yu, Heng

PY - 2025

Y1 - 2025

N2 - Executing neural networks on resource-constrained embedded devices faces challenges. Efforts have been made at the application and system levels to reduce the execution cost. Among them, the early-exit networks reduce computational cost through intermediate exits, while Dynamic Voltage and Frequency Scaling (DVFS) offers system energy reduction. Existing works strive to unify early-exit and DVFS for combined benefits on both timing and energy flexibility, yet limitations exist: 1) varying time constraints that make different exit points become more, or less, important in terms of inference accuracy, are not taken care of, and 2) the optimal decisions of unifying DVFS and early-exit as a multi-objective optimization problem are not achieved due to the large configuration space. To address these challenges, we propose De2r, a reinforcement learning-based framework that jointly optimizes early-exit points and DVFS settings for continuous inference. In particular, De2r includes a cross-training mechanism that fine-tunes the early-exit network to accommodate dynamic time constraints and system conditions. Experimental results demonstrate that De2r achieves up to 22.03% energy reduction and 3.23% accuracy gain compared to contemporary techniques.

KW - DVFS

KW - Early-Exit Neural Networks

KW - Embedded Computing

KW - Reinforcement Learning

UR - http://www.scopus.com/inward/record.url?scp=105006910380&partnerID=8YFLogxK

U2 - 10.23919/DATE64628.2025.10992707

DO - 10.23919/DATE64628.2025.10992707

M3 - Conference contribution

AN - SCOPUS:105006910380

T3 - Proceedings -Design, Automation and Test in Europe, DATE

BT - 2025 Design, Automation and Test in Europe Conference, DATE 2025 - Proceedings

PB - Institute of Electrical and Electronics Engineers Inc.

Y2 - 31 March 2025 through 2 April 2025

ER -

He Y, Li J, Li C, Yang Q, Wang Z, Du H et al. De²r: Unifying DVFS and Early-Exit for Embedded AI Inference via Reinforcement Learning. In 2025 Design, Automation and Test in Europe Conference, DATE 2025 - Proceedings. Institute of Electrical and Electronics Engineers Inc. 2025. (Proceedings -Design, Automation and Test in Europe, DATE). doi: 10.23919/DATE64628.2025.10992707