TY - GEN
T1 - Epitomized summarization of wireless capsule endoscopic videos for efficient visualization
AU - Chu, Xinqi
AU - Poh, Chee Khun
AU - Li, Liyuan
AU - Chan, Kap Luk
AU - Yan, Shuicheng
AU - Shen, Weijia
AU - Htwe, That Mon
AU - Liu, Jiang
AU - Lim, Joo Hwee
AU - Ong, Eng Hui
AU - Ho, Khek Yu
PY - 2010
Y1 - 2010
N2 - A video recording of an examination by Wireless Capsule Endoscopy (WCE) may typically contain more than 55,000 video frames, which makes the manual visual screening by an experienced gastroenterologist a highly time-consuming task. In this paper, we propose a novel method of epitomized summarization of WCE videos for efficient visualization to a gastroenterologist. For each short sequence of a WCE video, an epitomized frame is generated. New constraints are introduced into the epitome formulation to achieve the necessary visual quality for manual examination, and an EM algorithm for learning the epitome is derived. First, the local context weights are introduced to generate the epitomized frame. The epitomized frame preserves the appearance of all the input patches from the frames of the short sequence. Furthermore, by introducing spatial distributions for semantic interpretation of image patches in our epitome formulation, we show that it also provides a framework to facilitate the semantic description of visual features to generate organized visual summarization of WCE video, where the patches in different positions correspond to different semantic information. Our experiments on real WCE videos show that, using epitomized summarization, the number of frames have to be examined by the gastroenterologist can be reduced to less than one-tenth of the original frames in the video.
AB - A video recording of an examination by Wireless Capsule Endoscopy (WCE) may typically contain more than 55,000 video frames, which makes the manual visual screening by an experienced gastroenterologist a highly time-consuming task. In this paper, we propose a novel method of epitomized summarization of WCE videos for efficient visualization to a gastroenterologist. For each short sequence of a WCE video, an epitomized frame is generated. New constraints are introduced into the epitome formulation to achieve the necessary visual quality for manual examination, and an EM algorithm for learning the epitome is derived. First, the local context weights are introduced to generate the epitomized frame. The epitomized frame preserves the appearance of all the input patches from the frames of the short sequence. Furthermore, by introducing spatial distributions for semantic interpretation of image patches in our epitome formulation, we show that it also provides a framework to facilitate the semantic description of visual features to generate organized visual summarization of WCE video, where the patches in different positions correspond to different semantic information. Our experiments on real WCE videos show that, using epitomized summarization, the number of frames have to be examined by the gastroenterologist can be reduced to less than one-tenth of the original frames in the video.
UR - http://www.scopus.com/inward/record.url?scp=84872204642&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-15745-5_64
DO - 10.1007/978-3-642-15745-5_64
M3 - Conference contribution
C2 - 20879355
AN - SCOPUS:84872204642
SN - 3642157440
SN - 9783642157448
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 522
EP - 529
BT - Medical Image Computing and Computer-Assisted Intervention, MICCAI2010 - 13th International Conference, Proceedings
T2 - 13th International Conference on Medical Image Computing and Computer-Assisted Intervention, MICCAI 2010
Y2 - 20 September 2010 through 24 September 2010
ER -