Multi-Class 3D Object Detection Within Volumetric 3D Computed Tomography Baggage Security Screening Imagery

Wang, Q.; Bhowmik, N.; Breckon, T.P.

doi:10.1109/icmla51294.2020.00012

Multi-Class 3D Object Detection Within Volumetric 3D Computed Tomography Baggage Security Screening Imagery

Wang, Q.; Bhowmik, N.; Breckon, T.P.

Authors

Qian Wang qian.wang@durham.ac.uk
Academic Visitor

Dr Neelanjan Bhowmik neelanjan.bhowmik@durham.ac.uk
Post Doctoral Research Associate

Professor Toby Breckon toby.breckon@durham.ac.uk
Professor

Abstract

Automatic detection of prohibited objects within passenger baggage is important for aviation security. X-ray Computed Tomography (CT) based 3D imaging is widely used in airports for aviation security screening whilst prior work on automatic prohibited item detection focus primarily on 2D X-ray imagery. Whilst some prior work has proven the possibility of extending deep convolutional neural networks (CNN) based automatic prohibited item detection from 2D X-ray imagery to volumetric 3D CT baggage security screening imagery, it focuses on the detection of one specific type of objects (e.g., either bottles or handguns). As a result, multiple models are needed if more than one type of prohibited item is required to be detected in practice. In this paper, we consider the detection of multiple object categories of interest using one unified framework. To this end, we formulate a more challenging multi-class 3D object detection problem within 3D CT imagery and propose a viable solution (3D RetinaNet) to tackle this problem. To enhance the performance of detection we investigate a variety of strategies including data augmentation and varying backbone networks. Experimentation carried out to provide both quantitative and qualitative evaluations of the proposed approach to multi-class 3D object detection within 3D CT baggage security screening imagery. Experimental results demonstrate the combination of the 3D RetinaNet and a series of favorable strategies can achieve a mean Average Precision (mAP) of 65.3% over five object classes (i.e. bottles, handguns, binoculars, glock frames, iPods). The overall performance is affected by the poor performance on glock frames and iPods due to the lack of data and their resemblance with the baggage clutter.

Citation

Wang, Q., Bhowmik, N., & Breckon, T. (2021). Multi-Class 3D Object Detection Within Volumetric 3D Computed Tomography Baggage Security Screening Imagery. . https://doi.org/10.1109/icmla51294.2020.00012

Conference Name	19th IEEE International Conference on Machine Learning and Applications (ICMLA 2020)
Conference Location	Miami, Florida
Start Date	Dec 14, 2020
End Date	Dec 17, 2020
Acceptance Date	Sep 16, 2020
Online Publication Date	Feb 23, 2021
Publication Date	2021
Deposit Date	Oct 26, 2020
Publicly Available Date	Oct 27, 2020
Publisher	Institute of Electrical and Electronics Engineers
DOI	https://doi.org/10.1109/icmla51294.2020.00012

Files

Accepted Conference Proceeding (1.4 Mb)
PDF

Copyright Statement
© 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.