Abstract
In an effort to enhance the efficiency and precision of manual part assembly in industrial settings, the development of software for assembly guidance becomes imperative. Augmented reality (AR) technology offers a means to provide visual instructions for assembly tasks, rendering the guidance more comprehensible. Nevertheless, a significant challenge lies in the technology’s limited object detection capabilities, especially when distinguishing between similar assembled parts. This project proposes the utilization of deep learning neural networks to enhance the accuracy of object recognition within the AR guided assembly application. To achieve this objective, a dataset of assembly parts, known as the Visual Object Classes (VOC) dataset, was created. Data augmentation techniques were employed to expand this dataset, incorporating scale HSV (hue saturation value) transformations. Subsequently, deep learning models for the recognition of assembly parts were developed which were based on the Single Shot Multibox Detector (SSD) and the YOLOv7 detector. The models were trained and fine-tuned, targeting on the variations of the positions of detected parts. The effectiveness of this approach was evaluated using a case study involving an educational electronic blocks circuit science kit. The results demonstrated a high assembly part recognition accuracy of over 99% in mean average precision (MAP), along with favorable user testing outcomes. Consequently, the AR application was capable of offering high-quality guidance to users which holds promise for application in diverse scenarios and the resolution of real-world challenges.
Original language | English |
---|---|
Title of host publication | Intelligent Human Computer Interaction - 15th International Conference, IHCI 2023, Revised Selected Papers |
Subtitle of host publication | 15th International Conference, IHCI 2023, Daegu, South Korea, November 8–10, 2023, Revised Selected Papers, Part II |
Editors | Bong Jun Choi, Dhananjay Singh, Uma Shanker Tiwary, Wan-Young Chung |
Publisher | Springer Science and Business Media Deutschland GmbH |
Chapter | 11 |
Pages | 105-114 |
Number of pages | 10 |
ISBN (Electronic) | 9783031538308 |
ISBN (Print) | 9783031538292 |
DOIs | |
Publication status | Published - 29 Feb 2024 |
Event | 15th International Conference on Intelligent Human Computer Interaction, IHCI 2023 - Daegu, Korea, Republic of Duration: 8 Nov 2023 → 10 Nov 2023 |
Publication series
Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Volume | 14532 LNCS |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | 15th International Conference on Intelligent Human Computer Interaction, IHCI 2023 |
---|---|
Country/Territory | Korea, Republic of |
City | Daegu |
Period | 8/11/23 → 10/11/23 |
Keywords
- Augmented Reality
- Assembly Tasks
- Object Detection
- Object Recognition
ASJC Scopus subject areas
- Theoretical Computer Science
- General Computer Science
Fingerprint
Dive into the research topics of 'Deep Learning Approach for Enhanced Object Recognition and Assembly Guidance with Augmented Reality'. Together they form a unique fingerprint.Cite this
Lee, B. G., Wang, X., Han, R., Sun, L., Pike, M., & Chung, W. Y. (2024). Deep Learning Approach for Enhanced Object Recognition and Assembly Guidance with Augmented Reality. In B. J. Choi, D. Singh, U. S. Tiwary, & W.-Y. Chung (Eds.), Intelligent Human Computer Interaction - 15th International Conference, IHCI 2023, Revised Selected Papers: 15th International Conference, IHCI 2023, Daegu, South Korea, November 8–10, 2023, Revised Selected Papers, Part II (pp. 105-114). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 14532 LNCS). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-031-53830-8_11