SPFusionNet: Sketch segmentation using multi-modal data fusion

Fei Wang; Shujin Lin; Hefeng Wu; Hanhui Li; Ruomei Wang; Xiaonan Luo; Xiangjian He

doi:10.1109/ICME.2019.00285

SPFusionNet: Sketch segmentation using multi-modal data fusion

Fei Wang, Shujin Lin, Hefeng Wu, Hanhui Li, Ruomei Wang, Xiaonan Luo, Xiangjian He

Research output: Chapter in Book/Conference proceeding › Conference contribution › peer-review

23 Citations (Scopus)

Abstract

The sketch segmentation problem remains largely unsolved because conventional methods are greatly challenged by the highly abstract appearances of freehand sketches and their numerous shape variations. In this work, we tackle such challenges by exploiting different modes of sketch data in a unified framework. Specifically, we propose a deep neural network SPFusionNet to capture the characteristic of sketch by fusing from its image and point set modes. The image modal component SketchNet learns hierarchically abstract ro-bust features and utilizes multi-level representations to produce pixel-wise feature maps, while the point set-modal component SPointNet captures local and global contexts of the sampled point set to produce point-wise feature maps. Then our framework aggregates these feature maps by a fusion network component to generate the sketch segmentation result. The extensive experimental evaluation and comparison with peer methods on our large SketchSeg dataset verify the effectiveness of the proposed framework.

Original language	English
Title of host publication	Proceedings - 2019 IEEE International Conference on Multimedia and Expo, ICME 2019
Publisher	IEEE Computer Society
Pages	1654-1659
Number of pages	6
ISBN (Electronic)	9781538695524
DOIs	https://doi.org/10.1109/ICME.2019.00285
Publication status	Published - Jul 2019
Externally published	Yes
Event	2019 IEEE International Conference on Multimedia and Expo, ICME 2019 - Shanghai, China Duration: 8 Jul 2019 → 12 Jul 2019

Publication series

Name	Proceedings - IEEE International Conference on Multimedia and Expo
Volume	2019-July
ISSN (Print)	1945-7871
ISSN (Electronic)	1945-788X

Conference

Conference	2019 IEEE International Conference on Multimedia and Expo, ICME 2019
Country/Territory	China
City	Shanghai
Period	8/07/19 → 12/07/19

Keywords

Deep neural network
Multi-modal fusion
Sketch segmentation

ASJC Scopus subject areas

Computer Networks and Communications
Computer Science Applications

Access to Document

10.1109/ICME.2019.00285

Cite this

Wang, F., Lin, S., Wu, H., Li, H., Wang, R., Luo, X., & He, X. (2019). SPFusionNet: Sketch segmentation using multi-modal data fusion. In Proceedings - 2019 IEEE International Conference on Multimedia and Expo, ICME 2019 (pp. 1654-1659). Article 8784880 (Proceedings - IEEE International Conference on Multimedia and Expo; Vol. 2019-July). IEEE Computer Society. https://doi.org/10.1109/ICME.2019.00285

@inproceedings{1ce8b48b1bc2443f834bda0b249c26f2,

title = "SPFusionNet: Sketch segmentation using multi-modal data fusion",

abstract = "The sketch segmentation problem remains largely unsolved because conventional methods are greatly challenged by the highly abstract appearances of freehand sketches and their numerous shape variations. In this work, we tackle such challenges by exploiting different modes of sketch data in a unified framework. Specifically, we propose a deep neural network SPFusionNet to capture the characteristic of sketch by fusing from its image and point set modes. The image modal component SketchNet learns hierarchically abstract ro-bust features and utilizes multi-level representations to produce pixel-wise feature maps, while the point set-modal component SPointNet captures local and global contexts of the sampled point set to produce point-wise feature maps. Then our framework aggregates these feature maps by a fusion network component to generate the sketch segmentation result. The extensive experimental evaluation and comparison with peer methods on our large SketchSeg dataset verify the effectiveness of the proposed framework.",

keywords = "Deep neural network, Multi-modal fusion, Sketch segmentation",

author = "Fei Wang and Shujin Lin and Hefeng Wu and Hanhui Li and Ruomei Wang and Xiaonan Luo and Xiangjian He",

note = "Publisher Copyright: {\textcopyright} 2019 IEEE.; 2019 IEEE International Conference on Multimedia and Expo, ICME 2019 ; Conference date: 08-07-2019 Through 12-07-2019",

year = "2019",

month = jul,

doi = "10.1109/ICME.2019.00285",

language = "English",

series = "Proceedings - IEEE International Conference on Multimedia and Expo",

publisher = "IEEE Computer Society",

pages = "1654--1659",

booktitle = "Proceedings - 2019 IEEE International Conference on Multimedia and Expo, ICME 2019",

address = "United States",

}

Wang, F, Lin, S, Wu, H, Li, H, Wang, R, Luo, X & He, X 2019, SPFusionNet: Sketch segmentation using multi-modal data fusion. in Proceedings - 2019 IEEE International Conference on Multimedia and Expo, ICME 2019., 8784880, Proceedings - IEEE International Conference on Multimedia and Expo, vol. 2019-July, IEEE Computer Society, pp. 1654-1659, 2019 IEEE International Conference on Multimedia and Expo, ICME 2019, Shanghai, China, 8/07/19. https://doi.org/10.1109/ICME.2019.00285

SPFusionNet: Sketch segmentation using multi-modal data fusion. / Wang, Fei; Lin, Shujin; Wu, Hefeng et al.
Proceedings - 2019 IEEE International Conference on Multimedia and Expo, ICME 2019. IEEE Computer Society, 2019. p. 1654-1659 8784880 (Proceedings - IEEE International Conference on Multimedia and Expo; Vol. 2019-July).

Research output: Chapter in Book/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - SPFusionNet

T2 - 2019 IEEE International Conference on Multimedia and Expo, ICME 2019

AU - Wang, Fei

AU - Lin, Shujin

AU - Wu, Hefeng

AU - Li, Hanhui

AU - Wang, Ruomei

AU - Luo, Xiaonan

AU - He, Xiangjian

PY - 2019/7

Y1 - 2019/7

N2 - The sketch segmentation problem remains largely unsolved because conventional methods are greatly challenged by the highly abstract appearances of freehand sketches and their numerous shape variations. In this work, we tackle such challenges by exploiting different modes of sketch data in a unified framework. Specifically, we propose a deep neural network SPFusionNet to capture the characteristic of sketch by fusing from its image and point set modes. The image modal component SketchNet learns hierarchically abstract ro-bust features and utilizes multi-level representations to produce pixel-wise feature maps, while the point set-modal component SPointNet captures local and global contexts of the sampled point set to produce point-wise feature maps. Then our framework aggregates these feature maps by a fusion network component to generate the sketch segmentation result. The extensive experimental evaluation and comparison with peer methods on our large SketchSeg dataset verify the effectiveness of the proposed framework.

AB - The sketch segmentation problem remains largely unsolved because conventional methods are greatly challenged by the highly abstract appearances of freehand sketches and their numerous shape variations. In this work, we tackle such challenges by exploiting different modes of sketch data in a unified framework. Specifically, we propose a deep neural network SPFusionNet to capture the characteristic of sketch by fusing from its image and point set modes. The image modal component SketchNet learns hierarchically abstract ro-bust features and utilizes multi-level representations to produce pixel-wise feature maps, while the point set-modal component SPointNet captures local and global contexts of the sampled point set to produce point-wise feature maps. Then our framework aggregates these feature maps by a fusion network component to generate the sketch segmentation result. The extensive experimental evaluation and comparison with peer methods on our large SketchSeg dataset verify the effectiveness of the proposed framework.

KW - Deep neural network

KW - Multi-modal fusion

KW - Sketch segmentation

UR - http://www.scopus.com/inward/record.url?scp=85070968895&partnerID=8YFLogxK

U2 - 10.1109/ICME.2019.00285

DO - 10.1109/ICME.2019.00285

M3 - Conference contribution

AN - SCOPUS:85070968895

T3 - Proceedings - IEEE International Conference on Multimedia and Expo

SP - 1654

EP - 1659

BT - Proceedings - 2019 IEEE International Conference on Multimedia and Expo, ICME 2019

PB - IEEE Computer Society

Y2 - 8 July 2019 through 12 July 2019

ER -

SPFusionNet: Sketch segmentation using multi-modal data fusion

Abstract

Publication series

Conference

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this