Abstract
With the increasing popularity of open educational resources over the past few decades, more and more users watch online videos to gain knowledge. However, most educational videos provide only rudimentary navigation tools and lack explanatory annotations, which makes locating content of interest time-consuming. To address this limitation, in this article, we propose a slide-based video navigation tool that extracts the hierarchical structure and semantic relationships of visual entities in videos by integrating multichannel information. Features of visual entities are first extracted from the presentation slides by a novel deep learning framework. Then, we propose a clustering approach to extract hierarchical relationships between visual entities (e.g., formulas, text, or graphs appearing in educational slides). We use this information to associate each visual entity with its corresponding audio speech text by evaluating their semantic relationship. We present two cases in which the structured data produced by this tool are used to generate a multilevel table of contents and notes, providing additional navigation materials for learning. Evaluation experiments demonstrate the effectiveness of our proposed solutions for visual entity extraction, hierarchical relationship extraction, and corresponding speech text matching. A user study also indicates that the autogenerated table of contents and notes show promise in facilitating learning.
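As a rough illustration of the entity-to-speech matching step the abstract describes (not the paper's actual model), one could align each slide entity's text with transcript segments by cosine similarity over TF-IDF vectors. The names `entity_texts`, `transcript_segments`, and the similarity threshold below are hypothetical placeholders, and this is a minimal sketch assuming both inputs are plain text strings.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

def match_entities_to_speech(entity_texts, transcript_segments, threshold=0.2):
    """Assign each visual entity the most semantically similar speech segment.

    A simple TF-IDF/cosine stand-in for the semantic matching described in
    the abstract; the paper's own framework is not reproduced here.
    """
    vectorizer = TfidfVectorizer(stop_words="english")
    # Fit one shared vocabulary so entity and segment vectors are comparable.
    matrix = vectorizer.fit_transform(entity_texts + transcript_segments)
    entity_vecs = matrix[: len(entity_texts)]
    segment_vecs = matrix[len(entity_texts):]
    sims = cosine_similarity(entity_vecs, segment_vecs)
    matches = {}
    for i, row in enumerate(sims):
        best = row.argmax()
        # Leave an entity unmatched when no segment is similar enough.
        matches[i] = int(best) if row[best] >= threshold else None
    return matches

# Example: two slide entities, three transcript segments.
entities = ["gradient descent update rule", "loss surface contour plot"]
segments = [
    "here we derive the update rule for gradient descent",
    "next, consider the contours of the loss surface",
    "finally we summarize the lecture",
]
print(match_entities_to_speech(entities, segments))
```

In practice, sentence embeddings would likely capture the semantic relationship better than TF-IDF; the structure of the matching loop stays the same.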
| Original language | English |
| --- | --- |
| Pages (from-to) | 1-17 |
| Number of pages | 17 |
| Journal | IEEE Transactions on Learning Technologies |
| Volume | 16 |
| Issue number | 1 |
| DOIs | |
| Publication status | Published - 1 Feb 2023 |
Keywords
- AutoNote generation
- educational videos
- hierarchical relationship extraction
- video navigation
- visual entity segmentation
ASJC Scopus subject areas
- General Engineering
- Education
- Computer Science Applications