Edge deep learning for low capabilities devices

  • Fotios Filippou
  • , Fotis Foukalas
  • , Theodoros Tsiftsis

Research output: Chapter in Book/Conference proceedingConference contributionpeer-review

Abstract

Nowadays, Deep Learning (DL) is being used to construct many applications in domains such as object detection, image classification, speech to text etc. Deep Neural Networks (DNNs) are the core of Deep Learning as they offer remarkable accuracy and performance across various tasks. Despite their powerful capabilities, DNNs often require substantial computational resources, which can be challenging to manage, especially when deploying them on edge devices. So, these models have to be optimized before being deployed to these devices. Optimizing a model means making it smaller and more efficient without losing too much performance. Even though techniques like pruning reduce the number of parameters, the goal is to keep accuracy and speed as close as possible to the original. We are going to present a hybrid solution combining two techniques, pruning and quantization. Pruning is the process of eliminating inessential weights and connections in order to reduce the model size. Once the unnecessary parameters are removed, the model is quantized by converting the weights of the remaining parameters from 32 floating point precision to half or to INT8. We verify and validate the performance of this hybrid approach using the COCO dataset (contains 80 classes) and the pre-trained YOLOv8 model. At the final stage, the hybrid model is deployed on two different edge devices for benchmarking, the NVIDIA Jetson Nano (4GB) and the Raspberry Pi 5 (16GB).

Original languageEnglish
Title of host publicationBalkancom 2025 - 8th International Balkan Conference on Communications and Networking
Subtitle of host publicationEmpowering Connections, Enabling Innovation
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798331539191
DOIs
Publication statusPublished - 2025
Event8th International Balkan Conference on Communications and Networking, Balkancom 2025 - Piraeus, Greece
Duration: 17 Jun 202520 Jun 2025

Publication series

NameBalkancom 2025 - 8th International Balkan Conference on Communications and Networking: Empowering Connections, Enabling Innovation

Conference

Conference8th International Balkan Conference on Communications and Networking, Balkancom 2025
Country/TerritoryGreece
CityPiraeus
Period17/06/2520/06/25

Free Keywords

  • deep learning
  • edge devices
  • object detection
  • pruning
  • quantization
  • YOLO

ASJC Scopus subject areas

  • Signal Processing
  • Information Systems and Management
  • Safety, Risk, Reliability and Quality
  • Artificial Intelligence
  • Computer Networks and Communications
  • Computer Science Applications

Fingerprint

Dive into the research topics of 'Edge deep learning for low capabilities devices'. Together they form a unique fingerprint.

Cite this