Discriminative Dictionary Learning with Motion Weber Local Descriptor for Violence Detection

Tao Zhang; Wenjing Jia; Xiangjian He; Jie Yang

doi:10.1109/TCSVT.2016.2589858

Discriminative Dictionary Learning with Motion Weber Local Descriptor for Violence Detection

Tao Zhang, Wenjing Jia, Xiangjian He, Jie Yang

Research output: Journal Publication › Article › peer-review

100 Citations (Scopus)

Abstract

Automatic violence detection from video is a hot topic for many video surveillance applications. However, there has been little success in developing an algorithm that can detect violence in surveillance videos with high performance. In this paper, following our recently proposed idea of motion Weber local descriptor (WLD), we make two major improvements and propose a more effective and efficient algorithm for detecting violence from motion images. First, we propose an improved WLD (IWLD) to better depict low-level image appearance information, and then extend the spatial descriptor IWLD by adding a temporal component to capture local motion information and hence form the motion IWLD. Second, we propose a modified sparse-representation-based classification model to both control the reconstruction error of coding coefficients and minimize the classification error. Based on the proposed sparse model, a class-specific dictionary containing dictionary atoms corresponding to the class labels is learned using class labels of training samples. With this learned dictionary, not only the representation residual but also the representation coefficients become discriminative. A classification scheme integrating the modified sparse model is developed to exploit such discriminative information. The experimental results on three benchmark data sets have demonstrated the superior performance of the proposed approach over the state of the arts.

Original language	English
Article number	7508910
Pages (from-to)	696-709
Number of pages	14
Journal	IEEE Transactions on Circuits and Systems for Video Technology
Volume	27
Issue number	3
DOIs	https://doi.org/10.1109/TCSVT.2016.2589858
Publication status	Published - Mar 2017
Externally published	Yes

Keywords

Class-specific dictionary learning (DL)
motion improved Weber local descriptor (MoIWLD)
sparse representation
violence detection

ASJC Scopus subject areas

Media Technology
Electrical and Electronic Engineering

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

Access to Document

10.1109/TCSVT.2016.2589858

Cite this

@article{40279d1ea37144dd96c8a18e5a71c9e8,

title = "Discriminative Dictionary Learning with Motion Weber Local Descriptor for Violence Detection",

abstract = "Automatic violence detection from video is a hot topic for many video surveillance applications. However, there has been little success in developing an algorithm that can detect violence in surveillance videos with high performance. In this paper, following our recently proposed idea of motion Weber local descriptor (WLD), we make two major improvements and propose a more effective and efficient algorithm for detecting violence from motion images. First, we propose an improved WLD (IWLD) to better depict low-level image appearance information, and then extend the spatial descriptor IWLD by adding a temporal component to capture local motion information and hence form the motion IWLD. Second, we propose a modified sparse-representation-based classification model to both control the reconstruction error of coding coefficients and minimize the classification error. Based on the proposed sparse model, a class-specific dictionary containing dictionary atoms corresponding to the class labels is learned using class labels of training samples. With this learned dictionary, not only the representation residual but also the representation coefficients become discriminative. A classification scheme integrating the modified sparse model is developed to exploit such discriminative information. The experimental results on three benchmark data sets have demonstrated the superior performance of the proposed approach over the state of the arts.",

keywords = "Class-specific dictionary learning (DL), motion improved Weber local descriptor (MoIWLD), sparse representation, violence detection",

author = "Tao Zhang and Wenjing Jia and Xiangjian He and Jie Yang",

note = "Publisher Copyright: {\textcopyright} 1991-2012 IEEE.",

year = "2017",

month = mar,

doi = "10.1109/TCSVT.2016.2589858",

language = "English",

volume = "27",

pages = "696--709",

journal = "IEEE Transactions on Circuits and Systems for Video Technology",

issn = "1051-8215",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "3",

}

TY - JOUR

T1 - Discriminative Dictionary Learning with Motion Weber Local Descriptor for Violence Detection

AU - Zhang, Tao

AU - Jia, Wenjing

AU - He, Xiangjian

AU - Yang, Jie

PY - 2017/3

Y1 - 2017/3

N2 - Automatic violence detection from video is a hot topic for many video surveillance applications. However, there has been little success in developing an algorithm that can detect violence in surveillance videos with high performance. In this paper, following our recently proposed idea of motion Weber local descriptor (WLD), we make two major improvements and propose a more effective and efficient algorithm for detecting violence from motion images. First, we propose an improved WLD (IWLD) to better depict low-level image appearance information, and then extend the spatial descriptor IWLD by adding a temporal component to capture local motion information and hence form the motion IWLD. Second, we propose a modified sparse-representation-based classification model to both control the reconstruction error of coding coefficients and minimize the classification error. Based on the proposed sparse model, a class-specific dictionary containing dictionary atoms corresponding to the class labels is learned using class labels of training samples. With this learned dictionary, not only the representation residual but also the representation coefficients become discriminative. A classification scheme integrating the modified sparse model is developed to exploit such discriminative information. The experimental results on three benchmark data sets have demonstrated the superior performance of the proposed approach over the state of the arts.

AB - Automatic violence detection from video is a hot topic for many video surveillance applications. However, there has been little success in developing an algorithm that can detect violence in surveillance videos with high performance. In this paper, following our recently proposed idea of motion Weber local descriptor (WLD), we make two major improvements and propose a more effective and efficient algorithm for detecting violence from motion images. First, we propose an improved WLD (IWLD) to better depict low-level image appearance information, and then extend the spatial descriptor IWLD by adding a temporal component to capture local motion information and hence form the motion IWLD. Second, we propose a modified sparse-representation-based classification model to both control the reconstruction error of coding coefficients and minimize the classification error. Based on the proposed sparse model, a class-specific dictionary containing dictionary atoms corresponding to the class labels is learned using class labels of training samples. With this learned dictionary, not only the representation residual but also the representation coefficients become discriminative. A classification scheme integrating the modified sparse model is developed to exploit such discriminative information. The experimental results on three benchmark data sets have demonstrated the superior performance of the proposed approach over the state of the arts.

KW - Class-specific dictionary learning (DL)

KW - motion improved Weber local descriptor (MoIWLD)

KW - sparse representation

KW - violence detection

UR - http://www.scopus.com/inward/record.url?scp=85015149734&partnerID=8YFLogxK

U2 - 10.1109/TCSVT.2016.2589858

DO - 10.1109/TCSVT.2016.2589858

M3 - Article

AN - SCOPUS:85015149734

SN - 1051-8215

VL - 27

SP - 696

EP - 709

JO - IEEE Transactions on Circuits and Systems for Video Technology

JF - IEEE Transactions on Circuits and Systems for Video Technology

IS - 3

M1 - 7508910

ER -

Discriminative Dictionary Learning with Motion Weber Local Descriptor for Violence Detection

Abstract

Keywords

ASJC Scopus subject areas

UN SDGs

Access to Document

Other files and links

Fingerprint

Cite this