Using Deep Convolutional Neural Network Architectures for Object Classification and Detection within X-ray Baggage Security Imagery

Akcay, S.; Kundegorski, M.E.; Willcocks, C.G.; Breckon, T.P.

doi:10.1109/tifs.2018.2812196

Using Deep Convolutional Neural Network Architectures for Object Classification and Detection within X-ray Baggage Security Imagery

Akcay, S.; Kundegorski, M.E.; Willcocks, C.G.; Breckon, T.P.

Authors

Samet Akcay samet.akcay@durham.ac.uk
PGR Student Doctor of Philosophy

M.E. Kundegorski

Dr Chris Willcocks christopher.g.willcocks@durham.ac.uk
Associate Professor

Professor Toby Breckon toby.breckon@durham.ac.uk
Professor

Abstract

We consider the use of deep Convolutional Neural Networks (CNN) with transfer learning for the image classification and detection problems posed within the context of X-ray baggage security imagery. The use of the CNN approach requires large amounts of data to facilitate a complex end-to-end feature extraction and classification process. Within the context of Xray security screening, limited availability of object of interest data examples can thus pose a problem. To overcome this issue, we employ a transfer learning paradigm such that a pre-trained CNN, primarily trained for generalized image classification tasks where sufficient training data exists, can be optimized explicitly as a later secondary process towards this application domain. To provide a consistent feature-space comparison between this approach and traditional feature space representations, we also train Support Vector Machine (SVM) classifier on CNN features. We empirically show that fine-tuned CNN features yield superior performance to conventional hand-crafted features on object classification tasks within this context. Overall we achieve 0.994 accuracy based on AlexNet features trained with Support Vector Machine (SVM) classifier. In addition to classification, we also explore the applicability of multiple CNN driven detection paradigms such as sliding window based CNN (SW-CNN), Faster RCNN (F-RCNN), Region-based Fully Convolutional Networks (R-FCN) and YOLOv2. We train numerous networks tackling both single and multiple detections over SW-CNN/F-RCNN/RFCN/ YOLOv2 variants. YOLOv2, Faster-RCNN, and R-FCN provide superior results to the more traditional SW-CNN approaches. With the use of YOLOv2, using input images of size 544×544, we achieve 0.885 mean average precision (mAP) for a six-class object detection problem. The same approach with an input of size 416×416 yields 0.974 mAP for the two-class firearm detection problem and requires approximately 100ms per image. Overall we illustrate the comparative performance of these techniques and show that object localization strategies cope well with cluttered X-ray security imagery where classification techniques fail.

Citation

Akcay, S., Kundegorski, M., Willcocks, C., & Breckon, T. (2018). Using Deep Convolutional Neural Network Architectures for Object Classification and Detection within X-ray Baggage Security Imagery. IEEE Transactions on Information Forensics and Security, 13(9), 2203-2215. https://doi.org/10.1109/tifs.2018.2812196

Journal Article Type	Article
Acceptance Date	Feb 28, 2018
Online Publication Date	Mar 5, 2018
Publication Date	Sep 1, 2018
Deposit Date	Feb 28, 2018
Publicly Available Date	Mar 1, 2018
Journal	IEEE Transactions on Information Forensics and Security
Print ISSN	1556-6013
Electronic ISSN	1556-6021
Publisher	Institute of Electrical and Electronics Engineers
Peer Reviewed	Peer Reviewed
Volume	13
Issue	9
Pages	2203-2215
DOI	https://doi.org/10.1109/tifs.2018.2812196

Files

Accepted Journal Article (5.1 Mb)
PDF

Copyright Statement
© 2018 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.