WO2019057200A1 - Inspection method and inspection device and computer readable medium - Google Patents

Inspection method and inspection device and computer readable medium Download PDF

Info

Publication number
WO2019057200A1
WO2019057200A1 PCT/CN2018/107317 CN2018107317W WO2019057200A1 WO 2019057200 A1 WO2019057200 A1 WO 2019057200A1 CN 2018107317 W CN2018107317 W CN 2018107317W WO 2019057200 A1 WO2019057200 A1 WO 2019057200A1
Authority
WO
WIPO (PCT)
Prior art keywords
activity map
inspected
class activity
layer
map
Prior art date
Application number
PCT/CN2018/107317
Other languages
French (fr)
Chinese (zh)
Inventor
赵自然
李强
刘耀红
顾建平
Original Assignee
清华大学
同方威视技术股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 清华大学, 同方威视技术股份有限公司 filed Critical 清华大学
Publication of WO2019057200A1 publication Critical patent/WO2019057200A1/en

Links

Images

Classifications

    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N23/00Investigating or analysing materials by the use of wave or particle radiation, e.g. X-rays or neutrons, not covered by groups G01N3/00 – G01N17/00, G01N21/00 or G01N22/00
    • G01N23/02Investigating or analysing materials by the use of wave or particle radiation, e.g. X-rays or neutrons, not covered by groups G01N3/00 – G01N17/00, G01N21/00 or G01N22/00 by transmitting the radiation through the material
    • G01N23/04Investigating or analysing materials by the use of wave or particle radiation, e.g. X-rays or neutrons, not covered by groups G01N3/00 – G01N17/00, G01N21/00 or G01N22/00 by transmitting the radiation through the material and forming images of the material
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks

Definitions

  • the plurality of paths in the convolutional neural network share at least one convolutional layer and at least one pooled layer.
  • an inspection apparatus comprising: a scanning device that scans an object to be inspected with X-rays to obtain an X-ray image; and a processor configured to: process the object with a convolutional neural network Checking an X-ray image of the object to obtain a class activity map of the object to be inspected; and determining whether the object to be inspected includes a suspicious object based on the class activity map.
  • the source 110 can be a radioisotope (e.g., cobalt-60), a low energy X-ray machine, or a high energy X-ray accelerator.
  • a radioisotope e.g., cobalt-60
  • a low energy X-ray machine e.g., a high energy X-ray accelerator.
  • the convolutional layer (eg, the first and second convolutional layers 420 and 440) is the core building block of the CNN (eg, convolutional neural network 400).
  • the parameters of this layer consist of a collection of learnable convolution kernels (or simply convolution kernels), each with a small receptive field, but extending over the entire depth of the input data.
  • each convolution kernel is convolved along the width and height of the input data, the dot product between the elements of the convolution kernel and the input data is computed, and a two-dimensional activation map of the convolution kernel is generated.
  • the network is able to learn the convolution kernel that can be activated when a particular type of feature is seen at a spatial location of the input.
  • a mean A-mean(A)

Abstract

An inspection method and an inspection device, and a computer readable medium. Using an x-ray to scan an inspected object to obtain an x-ray image of the inspected object; using a convoluted neural network to process the x-ray image of the inspected object to obtain a class activity map of the inspected object; and, on the basis of the class activity map, determining whether the inspected object comprises a suspicious object. More accurate security inspection results can be obtained.

Description

检查方法和检查设备以及计算机可读介质Inspection method and inspection device and computer readable medium 技术领域Technical field
本公开的实施例涉及安全检查,具体涉及一种检查物品夹带的检查方法和检查设备以及计算机可读介质。Embodiments of the present disclosure relate to security inspections, and more particularly to an inspection method and inspection apparatus for inspecting an article entrainment and a computer readable medium.
背景技术Background technique
集装箱的出现极大程度地提高了货物运输的效率。随着全球经济的迅猛发展,集装箱运输在现代化运输产业中占有重要地位。集装箱货物运输具有易装卸、易搬运的特点,但这也导致在运输过程中常常会有不法分子趁机夹藏一些违禁品。特别是在海关进出口方面,大宗货物居多,逐一开箱检验不切合实际,需要借助辐射成像的方法来进行快速检验。The emergence of containers has greatly improved the efficiency of cargo transportation. With the rapid development of the global economy, container transportation plays an important role in the modern transportation industry. Container cargo transportation has the characteristics of easy loading and unloading and easy to carry, but it also causes illegal elements in the transportation process to trap some contraband. Especially in the import and export of customs, there are many bulk goods, and it is not practical to open the boxes one by one. It is necessary to use the method of radiation imaging to carry out rapid inspection.
但是现有检查技术利用图像的纹理等特征在检查夹带物品。例如现有的夹带检测方法是针对被检查物体的透射图像计算局部与其周围的纹理差异。在遍历图像后,综合其他特征,对差异较大且非噪声的部分判定为夹带嫌疑物。但是由于物品种类非常多,难以准确检查出集装箱中是否有夹带物品。However, existing inspection techniques utilize features such as texture of the image to inspect entrained items. For example, the existing entrainment detection method calculates the difference in texture between the local and its surroundings for the transmission image of the object to be inspected. After traversing the image, other features are integrated, and the portion with large difference and non-noise is judged to be entrained suspect. However, due to the wide variety of items, it is difficult to accurately check whether there are any entrained items in the container.
发明内容Summary of the invention
鉴于现有技术中的一个或多个问题,提出了一种检查方法和检查设备以及计算机可读介质,能够更准确地确定集装箱之类的货物中是否包含夹带物品。In view of one or more problems in the prior art, an inspection method and inspection apparatus and a computer readable medium are proposed, which are capable of more accurately determining whether or not an entrained item is contained in a cargo such as a container.
在本公开的一个方面,提出了一种检查方法,包括步骤:用X射线扫描被检查物体,得到被检查物体的X射线图像;利用卷积神经网络处理所述被检查物体的X射线图像,得到所述被检查物体的类别活性图;基于所述类别活性图确定所述被检查物体中是否有包括可疑对象。In an aspect of the present disclosure, an inspection method is provided, comprising the steps of: scanning an object to be inspected by X-rays to obtain an X-ray image of the object to be inspected; and processing an X-ray image of the object to be inspected by using a convolutional neural network, Obtaining a class activity map of the object to be inspected; and determining whether the object to be inspected includes a suspicious object based on the class activity map.
根据本公开的一些实施例,所述卷积神经网络包括与不同尺度对应的多个通路,每个通路具有至少一个卷积层、在所述至少一个卷积层后的池化层和一个全卷积层,并且所述全卷积层用于输出相应尺度下的权重矢量,所述得到被检查物体的类别活性图的步骤包括:用每个通路输出的权重矢量与该通路中最后一个池化层之前的那个卷积层的特征进行加权求和,得到该尺度下的类别活性图;融合多个尺度下的类别活性图,得到所述被检查物体的类别活性图。According to some embodiments of the present disclosure, the convolutional neural network includes a plurality of paths corresponding to different scales, each path having at least one convolution layer, a pooling layer after the at least one convolution layer, and a whole a convolution layer, and the full convolution layer is used to output a weight vector at a corresponding scale, and the step of obtaining a class activity map of the object to be inspected includes: using a weight vector outputted by each path and a last pool in the path The features of the convolutional layer before the layer are weighted and summed to obtain the class activity map at the scale; the class activity maps of the plurality of scales are merged to obtain the class activity map of the object to be inspected.
根据本公开的一些实施例,得到了原始尺度的类别活性图和至少一个较小尺度 的类别活性图,对所述至少一个较小尺度下的类别活性图进行上采样,得到上采样的类别活性图,并融合原始尺度的类别活性图和上采样的类别活性图,得到所述被检查物体的类别活性图。According to some embodiments of the present disclosure, a class activity map of the original scale and at least one smaller scale class activity map are obtained, and the class activity map of the at least one smaller scale is upsampled to obtain an upsampled class activity The graph and the original activity map of the original scale and the class activity map of the upsampled are obtained to obtain a class activity map of the object to be inspected.
根据本公开的一些实施例,基于所述类别活性图确定所述被检查物体中是否有包括可疑对象的步骤包括:基于所述被检查物体的类别活性图和所述X射线图像得到热力图;利用阈值划分的方法判断所述热力图中是否包括可疑对象。According to some embodiments of the present disclosure, determining, according to the class activity map, whether the object to be inspected includes a suspicious object comprises: obtaining a thermogram based on a class activity map of the object to be inspected and the X-ray image; A method of threshold division is used to determine whether a suspicious object is included in the heat map.
根据本公开的一些实施例,对所述被检查物体的类别活性图和所述X射线图像进行加权求和来得到所述热力图。According to some embodiments of the present disclosure, the class map of the object to be inspected and the X-ray image are weighted and summed to obtain the thermogram.
根据本公开的一些实施例,所述卷积神经网络中的多个通路共享至少一个卷积层和至少一个池化层。According to some embodiments of the present disclosure, the plurality of paths in the convolutional neural network share at least one convolutional layer and at least one pooled layer.
在本公开的另一方面,提出了一种检查设备,包括:扫描装置,用X射线对被检查物体进行扫描,得到X射线图像;处理器,配置为:利用卷积神经网络处理所述被检查物体的X射线图像,得到所述被检查物体的类别活性图;基于所述类别活性图确定所述被检查物体中是否有包括可疑对象。In another aspect of the present disclosure, an inspection apparatus is provided, comprising: a scanning device that scans an object to be inspected with X-rays to obtain an X-ray image; and a processor configured to: process the object with a convolutional neural network Checking an X-ray image of the object to obtain a class activity map of the object to be inspected; and determining whether the object to be inspected includes a suspicious object based on the class activity map.
根据本公开的一些实施例,所述卷积神经网络包括与不同尺度对应的多个通路,每个通路具有至少一个卷积层、在所述至少一个卷积层后的池化层和一个全卷积层,并且所述全卷积层用于输出相应尺度下的权重矢量,所述处理器被配置为:用每个通路输出的权重矢量与该通路中最后一个池化层之前的那个卷积层的特征进行加权求和,得到该尺度下的类别活性图;融合多个尺度下的类别活性图,得到所述被检查物体的类别活性图。According to some embodiments of the present disclosure, the convolutional neural network includes a plurality of paths corresponding to different scales, each path having at least one convolution layer, a pooling layer after the at least one convolution layer, and a whole a convolutional layer, and the full convolutional layer is used to output a weight vector at a corresponding scale, the processor being configured to: use the weight vector output by each path and the volume before the last pooled layer in the path The features of the layer are weighted and summed to obtain a class activity map of the scale; and the class activity maps of the plurality of scales are merged to obtain a class activity map of the object to be inspected.
根据本公开的一些实施例,所述处理器被配置为:得到原始尺度的类别活性图和至少一个较小尺度的类别活性图,对所述至少一个较小尺度下的类别活性图进行上采样,得到上采样的类别活性图,以及融合原始尺度的类别活性图和上采样的类别活性图,得到所述被检查物体的类别活性图。According to some embodiments of the present disclosure, the processor is configured to: obtain a class activity map of the original scale and at least one category activity map of the smaller scale, and upsample the class activity map of the at least one smaller scale And obtaining a upsampled class activity map, and merging the class activity map of the original scale and the upsampled class activity map to obtain a class activity map of the object to be inspected.
根据本公开的一些实施例,所述处理器被配置为:基于所述被检查物体的类别活性图和所述X射线图像得到热力图;利用阈值划分的方法判断所述热力图中是否包括可疑对象。According to some embodiments of the present disclosure, the processor is configured to: obtain a thermogram based on a class activity map of the object to be inspected and the X-ray image; and determine whether the thermogram includes suspiciousness by using a method of threshold division Object.
根据本公开的一些实施例,所述处理器被配置为对所述被检查物体的类别活性图和所述X射线图像进行加权求和来得到所述热力图。According to some embodiments of the present disclosure, the processor is configured to perform a weighted summation of a class activity map of the object under inspection and the X-ray image to obtain the thermogram.
根据本公开的一些实施例,所述卷积神经网络中的多个通路共享至少一个卷积层和至少一个池化层。According to some embodiments of the present disclosure, the plurality of paths in the convolutional neural network share at least one convolutional layer and at least one pooled layer.
在本公开的再一方面,提出了一种计算机可读介质,存储有计算机程序,所述计算机程序被处理器执行时实现如下步骤:利用卷积神经网络处理被检查物体的X射线图像,得到所述被检查物体的类别活性图;基于所述类别活性图确定所述被检查物体中是否有包括可疑对象。In still another aspect of the present disclosure, a computer readable medium is provided, stored with a computer program that, when executed by a processor, implements the steps of: processing an X-ray image of an object under inspection using a convolutional neural network, a class activity map of the object to be inspected; determining whether the object to be inspected includes a suspicious object based on the class activity map.
利用上述实施例的方案,基于类别活性图来判断可疑物品的位置,能够获得更为准确的安全检查结果。With the solution of the above embodiment, the position of the suspicious item is judged based on the category activity map, and a more accurate security check result can be obtained.
附图说明DRAWINGS
为了更好地理解本公开,将根据以下附图对本公开进行详细描述:For a better understanding of the present disclosure, the present disclosure will be described in detail in accordance with the following drawings:
图1示出了根据本公开实施例的检查设备的示意图;FIG. 1 shows a schematic diagram of an inspection apparatus according to an embodiment of the present disclosure;
图2示出了在图1所示的实施例中用于图像处理的计算机的内部结构的示意图;Figure 2 is a diagram showing the internal structure of a computer for image processing in the embodiment shown in Figure 1;
图3是用来说明根据本公开实施例的检查方法的示意性流程图;FIG. 3 is a schematic flowchart for explaining an inspection method according to an embodiment of the present disclosure; FIG.
图4是描述在本公开实施例的卷积神经网络的示意图;4 is a schematic diagram depicting a convolutional neural network in an embodiment of the present disclosure;
图5示出了在本公开实施例的检查设备和检查方法中使用的卷积神经网络的示意图;FIG. 5 is a schematic diagram showing a convolutional neural network used in an inspection apparatus and an inspection method of an embodiment of the present disclosure;
图6是描述在本公开实施例的检查方法中计算类活性图的过程的示意图;以及6 is a schematic diagram describing a process of calculating a class-like activity map in an inspection method of an embodiment of the present disclosure;
图7示出了根据本公开实施例的检查方法中得到的检查结果的例子。FIG. 7 shows an example of an inspection result obtained in an inspection method according to an embodiment of the present disclosure.
具体实施方式Detailed ways
下面将详细描述本发明的具体实施例,应当注意,这里描述的实施例只用于举例说明,并不用于限制本发明。在以下描述中,为了提供对本发明的透彻理解,阐述了大量特定细节。然而,对于本领域普通技术人员显而易见的是:不必采用这些特定细节来实行本发明。在其他实例中,为了避免混淆本发明,未具体描述公知的结构、材料或方法。The embodiments of the present invention are described in detail below, and it should be noted that the embodiments described herein are for illustrative purposes only and are not intended to limit the invention. In the following description, numerous specific details are set forth in order to provide a thorough understanding of the invention. However, it will be apparent to those skilled in the art that the invention In other instances, well-known structures, materials, or methods are not specifically described in order to avoid obscuring the invention.
鉴于现有技术中存在的问题,本公开的实施例提出了一种检查技术,其利用卷积神经网络对被检查物体的X射线图像进行处理,得到类别活性图,然后基于类别活性图来判断被检查物体中是否包含有可疑物品。这样的检查技术能够更为准确地判断被检查物体中是否包括可疑物品。例如,早期的夹带检测方法是针对当前图片来求局部与其周围的纹理差异,在遍历图像后,综合其他特征,对差异较大且非噪声的部分判定为夹带嫌疑物。但是本公开提出的技术是利用深度学习网络的学习能力,以大量的训练样本为依托,学习辐射图像内部的特征结构并结合浅层的具体特 征,进而通过类别活性图来确定异常的位置。具体来说,本公开的技术通过卷积神经网络产生的类别活性图来刻画图像中可能出现异常的位置,并通过阈值判定方法来找到可能存在夹带物品存在的位置。In view of the problems in the prior art, embodiments of the present disclosure propose an inspection technique that uses a convolutional neural network to process an X-ray image of an object to be inspected to obtain a class activity map, and then judge based on the class activity map. Whether the object being inspected contains suspicious items. Such an inspection technique can more accurately determine whether or not a suspicious item is included in the object to be inspected. For example, the early entrainment detection method is to find the difference between the local and the surrounding texture for the current picture. After traversing the image, the other features are integrated, and the part with large difference and non-noise is determined as the entrainment suspect. However, the technique proposed by the present disclosure utilizes the learning ability of the deep learning network, relies on a large number of training samples, learns the feature structure inside the radiation image and combines the specific characteristics of the shallow layer, and then determines the location of the abnormality through the class activity map. In particular, the techniques of the present disclosure characterize the locations in the image where anomalies may occur by convolving neural network generated class activity maps and by threshold determination methods to find locations where there may be entrained items.
图1示出了根据本公开实施例的检查设备的结构示意图。如图1所示的检查设备100包括X射线源110,探测器模块130、数据采集装置150、控制器140和计算设备160等。射线源110包括一个或多个X射线发生器,可以在控制140的控制下进行单能透射扫描,也可以进行双能透射扫描。FIG. 1 shows a schematic structural view of an inspection apparatus according to an embodiment of the present disclosure. The inspection apparatus 100 as shown in FIG. 1 includes an X-ray source 110, a detector module 130, a data collection device 150, a controller 140, a computing device 160, and the like. The source 110 includes one or more X-ray generators that can perform a single-energy transmission scan or a dual-energy transmission scan under the control of the control 140.
如图1所示,例如集装箱卡车之类的被检查物体120运动通过射线源110与探测器130之间的扫描区域。在一些实施例中,探测器130和数据采集装置150例如是具有整体模块结构的探测器及数据采集器,例如多排探测器,用于探测透射被检物体120的射线,获得模拟信号,并且将模拟信号转换成数字信号,从而输出被检查物体120针对X射线的透射图像。在双能的情况下,可以例如针对高能射线设置一排探测器,针对低能射线设置另一排探测器,或者高能射线和低能射线分时使用同一排探测器。控制器140用于控制整个系统的各个部分同步工作。诸如计算机之类的计算设备160用来处理由数据采集装置150采集的数据,对图像数据进行处理,输出结果。例如处理设备160运行图像处理程序,对扫描得到的X射线图像进行分析和学习,例如利用卷积神经网络处理被检查物体的X射线图像,得到被检查物体120的类别活性图。进而基于类别活性图确定被检查物体中是否有包括可疑对象。As shown in FIG. 1, an inspected object 120, such as a container truck, moves through a scanning area between the source 110 and the detector 130. In some embodiments, the detector 130 and the data acquisition device 150 are, for example, detectors and data collectors having an integral modular structure, such as multiple rows of detectors, for detecting radiation transmitted through the object under inspection 120, obtaining an analog signal, and The analog signal is converted into a digital signal, thereby outputting a transmission image of the object under inspection 120 for X-rays. In the case of dual energy, for example, one row of detectors can be provided for high energy rays, another row of detectors for low energy rays, or the same row of detectors for high energy and low energy rays. The controller 140 is used to control the various parts of the entire system to work synchronously. A computing device 160, such as a computer, is used to process the data collected by the data collection device 150, process the image data, and output the results. For example, the processing device 160 runs an image processing program, analyzes and learns the scanned X-ray image, and processes the X-ray image of the object to be inspected, for example, by using a convolutional neural network, to obtain a class activity map of the object to be inspected 120. Further, based on the class activity map, it is determined whether or not the object to be inspected includes a suspicious object.
根据该实施例,探测器模块130和数据采集装置150用于获取被检查物体110的透射数据。数据采集装置150中包括数据放大成形电路,它可工作于(电流)积分方式或脉冲(计数)方式。数据采集装置150的数据输出电缆与控制器140和计算设备160连接,根据触发命令将采集的数据存储在计算设备160中。According to this embodiment, the detector module 130 and the data acquisition device 150 are used to acquire transmission data of the object 110 under inspection. The data acquisition device 150 includes a data amplification shaping circuit that operates in either (current) integration mode or pulse (count) mode. The data output cable of data collection device 150 is coupled to controller 140 and computing device 160, and the acquired data is stored in computing device 160 in accordance with a trigger command.
在一些实施例中,探测器130包括多个探测单元,接收穿透被检查物体120的X射线。数据采集装置150与探测器130耦接,将探测器130产生的信号转换为探测数据。控制器140通过控制线路与射线源110连接,通过另一控制线路与探测器130连接,并且进一步与数据采集装置150连接,控制射线源110中的一个或多个X射线发生器对被检查物体120进行单能扫描,或者对被检查物体120进行双能扫描,从而随着被检查物体120的移动而发出X射线穿透被检查物体120。此外,控制器140控制探测器130和数据采集装置150,获得相应的透射数据,例如单能透射数据或者双能透射数据。计算设备160基于透射数据得到被检查物体120的图像,对该 图像进行处理,得到被检查物体120的类别活性图,然后基于类别活性图确定被检查物体120中是否有包括可疑对象。In some embodiments, the detector 130 includes a plurality of detection units that receive X-rays that penetrate the object under inspection 120. The data acquisition device 150 is coupled to the detector 130 to convert the signal generated by the detector 130 into probe data. The controller 140 is coupled to the radiation source 110 via a control line, to the detector 130 via another control line, and further coupled to the data acquisition device 150 for controlling one or more X-ray generators in the radiation source 110 to the object being inspected 120 performs a single-energy scan, or performs a dual-energy scan on the object to be inspected 120, so that X-rays are transmitted through the object to be inspected 120 as the object to be inspected 120 moves. In addition, controller 140 controls detector 130 and data acquisition device 150 to obtain corresponding transmission data, such as single energy transmission data or dual energy transmission data. The computing device 160 obtains an image of the object under inspection 120 based on the transmission data, processes the image to obtain a class activity map of the object 120 to be inspected, and then determines whether the object to be inspected 120 includes a suspicious object based on the class activity map.
图2示出了如图1所示的计算设备的结构框图。如图2所示,计算设备160包括存储设备161、只读存储器(ROM)162、随机存取存储器(RAM)163、输入装置164、处理器165、显示设备166和接口单元167以及总线168等。FIG. 2 shows a block diagram of the structure of the computing device shown in FIG. 1. As shown in FIG. 2, computing device 160 includes storage device 161, read only memory (ROM) 162, random access memory (RAM) 163, input device 164, processor 165, display device 166 and interface unit 167, bus 168, and the like. .
数据采集装置150所采集的数据通过接口单元1677和总线168存储在存储设备161中。只读存储器(ROM)162中存储有计算机数据处理器的配置信息以及程序。随机存取存储器(RAM)163用于在处理器165工作过程中暂存各种数据。另外,存储设备161中还存储有用于进行数据处理的计算机程序。内部总线168连接上述的存储设备161、只读存储器162、随机存取存储器163、输入装置164、处理器165、显示设备168和接口单元167。The data collected by the data collection device 150 is stored in the storage device 161 via the interface unit 1677 and the bus 168. Configuration information and a program of the computer data processor are stored in a read only memory (ROM) 162. A random access memory (RAM) 163 is used to temporarily store various data during the operation of the processor 165. In addition, a computer program for performing data processing is also stored in the storage device 161. The internal bus 168 is connected to the above-described storage device 161, read only memory 162, random access memory 163, input device 164, processor 165, display device 168, and interface unit 167.
在用户通过诸如键盘和鼠标之类的输入装置164输入的操作命令后,计算机程序的指令代码命令处理器165执行数据处理算法,在得到数据处理结果之后,将其显示在诸如LCD显示器之类的显示设备166上,或者直接以诸如打印之类硬拷贝的形式输出处理结果。After the user inputs an operation command through the input device 164 such as a keyboard and a mouse, the instruction code of the computer program instructs the processor 165 to execute the data processing algorithm, and after obtaining the data processing result, displays it on an LCD display or the like. The processing result is output on the display device 166 or directly in the form of a hard copy such as printing.
例如,射线源110可以是放射性同位素(例如钴-60),也可以是低能的X光机或高能的X射线加速器等。For example, the source 110 can be a radioisotope (e.g., cobalt-60), a low energy X-ray machine, or a high energy X-ray accelerator.
例如,探测器130从材料上划分,可以是气体探测器、闪烁体探测器或固体探测器等,从阵列排布上划分,可以是单排、双排或者多排,以及单层探测器或双层高低能探测器等。For example, the detector 130 is divided into materials, which may be gas detectors, scintillator detectors or solid detectors, etc., which are divided into arrays, which may be single row, double row or multiple rows, and single layer detectors or Double-layer high and low energy detectors, etc.
以上描述的是被检查物体120,例如集装箱卡车,移动通过检查区域,但是本领域的技术人员应该想到,也可以是被检查物体120静止而射线源和探测器移动完成扫描过程。What has been described above is that the object to be inspected 120, such as a container truck, moves through the inspection area, but it will be appreciated by those skilled in the art that the object to be inspected 120 may be stationary while the source of radiation and the detector are moved to complete the scanning process.
图3是用来说明根据本公开实施例的检查方法的示意性流程图。如图3所示,在步骤S310,用如图1所示的检查设备对被检查物体120进行X射线扫描,得到被检查物体120的X射线图像。然后,可选地,在步骤S320,对被检查物体的X射线图像进行预处理,例如去噪或者归一化等处理。在步骤S330,利用卷积神经网络对X射线图像进行处理,得到类别活性图。下面详细说明从X射线图像得到类别活性图的过程。FIG. 3 is a schematic flow chart for explaining an inspection method according to an embodiment of the present disclosure. As shown in FIG. 3, in step S310, the object to be inspected 120 is subjected to X-ray scanning using the inspection apparatus shown in FIG. 1, and an X-ray image of the object to be inspected 120 is obtained. Then, optionally, in step S320, the X-ray image of the object to be inspected is pre-processed, such as denoising or normalization. In step S330, the X-ray image is processed using a convolutional neural network to obtain a class activity map. The process of obtaining a class activity map from an X-ray image is described in detail below.
深度学习是机器学习领域的一个分支,它将原来浅层的神经网络插入更多隐含 层来实现输入信息的分布式表达。与传统的浅层学习不同,深度学习更强调模型的深度,认为模型的隐层数目越多,学习能力越强。同时,深度学习强调了特征学习的重要性,认为不同深度上学习到的特征能使分类与预测更加准确。弱标注(弱监督定位),主要是利用弱监督卷积神经网络的标注,指在网络的训练过程中,仅给予数据以图像尺度的标注作为约束,能让网络来自动学习复杂场景中的多个事物。弱标注实现的基础是主要来自于:(1)CNNs的分层结构-对于空间位置的判别具有一定的倾向性;(2)有效的端到端的训练。Deep learning is a branch of the machine learning field. It inserts the original shallow neural network into more hidden layers to achieve distributed expression of input information. Different from traditional shallow learning, deep learning emphasizes the depth of the model. The more the hidden layer of the model, the stronger the learning ability. At the same time, deep learning emphasizes the importance of feature learning, and believes that features learned at different depths can make classification and prediction more accurate. Weak labeling (weak supervised positioning) is mainly based on the labeling of weakly supervised convolutional neural networks. It means that in the training process of the network, only the data is given as the constraint of the image scale, which allows the network to automatically learn more in complex scenes. Things. The basis of weak annotation implementation is mainly from: (1) the hierarchical structure of CNNs - a certain tendency for the discrimination of spatial position; (2) effective end-to-end training.
图4是描述在本公开实施例的卷积神经网络的示意图。如上所述,为了识别透射图像中的特征,本公开的实施例提出使用卷积神经网络CNN来对图像中的特征进行识别。以下结合图4来详细说明根据本公开实施例的卷积神经网络400。如图4所示卷积神经网络400通常可以包含多个卷积层420和440,这些卷积层420和440一般是由彼此部分重叠的小型神经元(其在数学的意义上也被称为卷积核,以下如无特别声明,这两个术语可以互换使用)的集合。此外,在本公开的上下文中,除非另有明确声明,否则对于卷积神经网络400中的任何两层而言,更接近输入数据(或输入层,例如图4的输入层410)的层被称为“在前”或“在下”的层,而另一个更接近输出数据(或输出层,例如图4的输出层470)的层被称为“在后”或“在上”的层。此外,在训练、验证和/或使用期间,从输入层(例如,图4的输入层410)到输出层(例如,图4的输出层470)的方向被称为前向或正向(forward),而从输出层(例如,图4的输出层470)到输入层(例如,图4的输入层410)的方向被称为后向或反向(backward)。4 is a schematic diagram depicting a convolutional neural network in an embodiment of the present disclosure. As described above, in order to identify features in a transmitted image, embodiments of the present disclosure propose to use a convolutional neural network CNN to identify features in an image. The convolutional neural network 400 in accordance with an embodiment of the present disclosure is described in detail below in conjunction with FIG. The convolutional neural network 400 as shown in FIG. 4 may generally include a plurality of convolutional layers 420 and 440, which are generally small neurons that are partially overlapping each other (which is also referred to in a mathematical sense) A collection of convolution kernels, which are used interchangeably unless otherwise stated. Moreover, in the context of the present disclosure, layers of input data (or input layers, such as input layer 410 of FIG. 4) that are closer to input data (for any two layers in convolutional neural network 400) are, unless explicitly stated otherwise, A layer referred to as "before" or "below" and another layer closer to the output data (or output layer, such as output layer 470 of FIG. 4) is referred to as a "behind" or "on" layer. Moreover, the direction from the input layer (eg, input layer 410 of FIG. 4) to the output layer (eg, output layer 470 of FIG. 4) during training, verification, and/or use is referred to as forward or forward (forward) The direction from the output layer (eg, output layer 470 of FIG. 4) to the input layer (eg, input layer 410 of FIG. 4) is referred to as backward or backward.
以图4所示的第一卷积层420为例,这些小型神经元可以对输入图像的各个局部进行处理。然后这些小型神经元的输出被合并排列为一个输出(称为特征映射,例如第一卷积层420中的方形),以获得对原始图像中某些特征进行更好表示的输出图像。同时,相邻神经元之间部分重叠的排列也使得卷积神经网络400对于原始图像中的特征具备一定程度的平移容忍度。换言之,即使原始图像中的特征在某个容忍度内以平移方式改变了其位置,该卷积神经网络400也可以正确地识别出该特征。关于卷积层的详细描述将在后文中给出,此处不再详细讨论。Taking the first convolutional layer 420 shown in FIG. 4 as an example, these small neurons can process various parts of the input image. The outputs of these small neurons are then combined into one output (referred to as a feature map, such as a square in the first convolutional layer 420) to obtain an output image that better represents certain features in the original image. At the same time, the partially overlapping arrangement between adjacent neurons also causes the convolutional neural network 400 to have a degree of translational tolerance for features in the original image. In other words, the convolutional neural network 400 can correctly identify the feature even if the feature in the original image changes its position in a translational manner within a certain tolerance. A detailed description of the convolutional layer will be given later and will not be discussed in detail herein.
接下来的一层是可选的池化(pooling)层,即第一池化层430,其主要用于在保持特征的情况下对前一卷积层420的输出数据进行下采样,减少计算量并防止过拟合。The next layer is the optional pooling layer, the first pooling layer 430, which is mainly used to downsample the output data of the previous convolution layer 420 while maintaining the features, reducing the calculation. Quantity and prevent overfitting.
接下来的一层同样是一个卷积层,第二卷积层440,可以对于第一卷积层420 所产生的、并经由池化层430下采样的输出数据进行进一步的特征采样。从直观上看,其所学习到的特征在全局性上大于第一卷积层所学习到的特征。同样地,后续的卷积层都是对前一卷积层的特征的全局化。The next layer is also a convolutional layer, and the second convolutional layer 440 can perform further feature sampling on the output data generated by the first convolutional layer 420 and downsampled via the pooling layer 430. Intuitively, the features learned are globally larger than those learned by the first convolutional layer. Similarly, subsequent convolutional layers are global to the characteristics of the previous convolutional layer.
卷积层(例如,第一和第二卷积层420和440)是CNN(例如,卷积神经网络400)的核心构造单元。该层的参数由可学习卷积核(或简称为卷积核)的集合来构成,每个卷积核具有很小的感受野,但是在输入数据的整个深度上延伸。在前向过程中,将每个卷积核沿输入数据的宽度和高度进行卷积,计算卷积核的元素与输入数据之间的点积,并产生该卷积核的二维激活映射。作为结果,网络能够学习到在输入的某个空间位置上看到某个具体类型的特征时才可以激活的卷积核。The convolutional layer (eg, the first and second convolutional layers 420 and 440) is the core building block of the CNN (eg, convolutional neural network 400). The parameters of this layer consist of a collection of learnable convolution kernels (or simply convolution kernels), each with a small receptive field, but extending over the entire depth of the input data. In the forward process, each convolution kernel is convolved along the width and height of the input data, the dot product between the elements of the convolution kernel and the input data is computed, and a two-dimensional activation map of the convolution kernel is generated. As a result, the network is able to learn the convolution kernel that can be activated when a particular type of feature is seen at a spatial location of the input.
将所有卷积核的激活映射沿深度方向进行堆叠,形成了卷积层的全输出数据。因此,输出数据中的每个元素可以被解释为看到输入中的小区域并与相同激活映射中的其他卷积核共享参数的卷积核的输出。The activation maps of all convolution kernels are stacked in the depth direction to form the full output data of the convolutional layer. Thus, each element in the output data can be interpreted as an output of a convolution kernel that sees small regions in the input and shares parameters with other convolution kernels in the same activation map.
输出数据的深度控制了层中连接到输入数据的相同区域的卷积核的数量。例如,如图4所示,第一卷积层420的深度为4,第二卷积层440的深度为6。所有这些卷积核将学习到针对输入中的不同特征来激活。例如,如果第一卷积层420以原始图像为输入,则沿着深度维度的不同卷积核(即,图4中的不同方形)可以在输入数据中出现各种定向的边、或灰度块时激活。The depth of the output data controls the number of convolution kernels in the same area of the layer that are connected to the input data. For example, as shown in FIG. 4, the depth of the first convolutional layer 420 is 4, and the depth of the second convolutional layer 440 is 6. All of these convolution kernels will learn to activate for different features in the input. For example, if the first convolutional layer 420 is input with the original image, different convolution kernels along the depth dimension (ie, different squares in FIG. 4) may have various directional edges, or grayscales, appearing in the input data. Activated when the block.
训练过程在深度学习中是一个非常重要的部分。为了保证网络能够有效收敛,可以采用随机梯度下降法。例如,可以采用Nesterov优化算法来求解。在一些实施例中,初始的学习速率可以设置为从0.01开始,并逐渐减小,直至找到一个最优值。此外,在一些实施例中,对于权重的初始值,可以使用具有较小方差的Gaussian随机过程来初始化各卷积核的权重值。在一些实施例中,图像训练集可以采用标记的物品图像,其均标记有图像中的特征位置。虽然图4中给出了两个卷积层,两个池化层和全连接层的例子,但是本领域的技术人员应该想到,可以采用其数目的卷积层和池化层来实现卷积神经网络。The training process is a very important part of deep learning. In order to ensure that the network can effectively converge, a stochastic gradient descent method can be used. For example, the Nesterov optimization algorithm can be used to solve. In some embodiments, the initial learning rate can be set to start at 0.01 and gradually decrease until an optimal value is found. Moreover, in some embodiments, for an initial value of the weight, a Gaussian random process with a smaller variance can be used to initialize the weight values of the respective convolution kernels. In some embodiments, the image training set may employ a tagged item image that is each labeled with a feature location in the image. Although two examples of convolutional layers, two pooled layers and fully connected layers are given in Figure 4, those skilled in the art will appreciate that convolutional layers and pooling layers can be used to implement convolution. Neural Networks.
本公开的实施例中使用的类活性图(Class Activation Map)是指通过卷积神经网络得到的每类图像的可判别区域。图5示出了在本公开实施例的检查设备和检查方法中使用的卷积神经网络的示意图。The Class Activation Map used in the embodiments of the present disclosure refers to a discriminable region of each type of image obtained by convolutional neural networks. FIG. 5 shows a schematic diagram of a convolutional neural network used in an inspection apparatus and an inspection method of an embodiment of the present disclosure.
如图5所示,本公开实施例的卷积神经网络可以包括第一卷积层510、第一池化层511、第二卷积层512、第二池化层513、第三卷积层514、第三池化层515、第四卷积层516、第四池化层517、第五卷积层518、第五池化层519、第六卷积层 520、第七卷积层521、全卷积层522和分类层523。如图5所示,所采用的网络主要由卷积层构成,在网络的正向传播过程中,从不同的网络深度处截取多个分支,以得到不同尺度的网络特征,这里称每一个尺度为一个通路。对于每一通路的末尾,本发明对其采取以下操作。首先,求取全局平均值,例如在第三卷积层515和第四卷积层后分别连接全局平均池化层,第四池化层516和第五池化层518,并在第四池化层516和第五池化层518后后分别连接一个全卷积层542和532,将第三池化层515的特征作为第四卷积层516(全卷积层)的输入,将第四池化层517的特征作为第五卷积层518(全卷积层)的输入,然后分别连接到分类层—softmax层543和532,得到相应的分类结果。类似地,在第七卷积层521后连接全局卷积层522,进而连接到分类层522。如果想判别分类特征在原图像中的重要程度,只需针对每个通路将输出层(分类层输出的)的权重作用于该通路最后一个池化层前的卷积层上,进行加权求和就能够得到该通路中的类活性图。在至少一个通路的类活性图的基础上分析出夹带物品的位置。As shown in FIG. 5, the convolutional neural network of the embodiment of the present disclosure may include a first convolutional layer 510, a first pooling layer 511, a second convolutional layer 512, a second pooling layer 513, and a third convolutional layer. 514, third pooling layer 515, fourth convolution layer 516, fourth pooling layer 517, fifth convolution layer 518, fifth pooling layer 519, sixth convolution layer 520, seventh convolution layer 521 The full convolutional layer 522 and the classification layer 523. As shown in Figure 5, the network used is mainly composed of convolutional layers. During the forward propagation of the network, multiple branches are intercepted from different network depths to obtain network features of different scales. For a pathway. For the end of each path, the present invention takes the following actions. First, a global average is obtained, for example, after the third convolutional layer 515 and the fourth convolutional layer are respectively connected to the global average pooling layer, the fourth pooling layer 516 and the fifth pooling layer 518, and in the fourth pool. The layer 516 and the fifth pooling layer 518 are respectively connected to a full convolution layer 542 and 532, and the features of the third pooling layer 515 are used as input of the fourth convolution layer 516 (full convolution layer). The features of the four pooling layer 517 are input to the fifth convolutional layer 518 (full convolutional layer) and then connected to the classification layers - softmax layers 543 and 532, respectively, to obtain corresponding classification results. Similarly, the global convolutional layer 522 is connected after the seventh convolutional layer 521, and is then connected to the classification layer 522. If you want to determine the importance of the classification feature in the original image, you only need to apply the weight of the output layer (output from the classification layer) to the convolution layer before the last pooling layer of the path for each path, and perform weighted summation. A class-like activity map in the pathway can be obtained. The position of the entrained item is analyzed based on the activity map of at least one of the passages.
根据本公开的实施例,卷积层对图像进行卷积操作,随着神经网络的深度的变化,得到不同尺度的特征。例如,卷积核的长、宽分别为3,步长的长、宽为1,对于图像从左到右,从上到下进行遍历。遍历中所采取的操作是,对于卷积和大小的图像和卷积和进行点乘加和,作为该位置的结果。According to an embodiment of the present disclosure, the convolution layer performs a convolution operation on the image, and as the depth of the neural network changes, features of different scales are obtained. For example, the convolution kernel has a length and a width of 3, a step length of 1 and a width of 1 for traversing the image from left to right and top to bottom. The operation taken in the traversal is to perform a dot-plus summation on the convolution and the size of the image and the convolution sum as a result of the position.
根据本公开的实施例,池化层在卷积层后,实现降采样处理,既能减少网络参数,又能增大感受野。例如,池化层的窗口长和宽为3,前三个池化层步长的长、宽为2,后三个前三个池化层步长的长、宽为1。According to an embodiment of the present disclosure, after the convolution layer, the pooling layer implements a downsampling process, which can reduce network parameters and increase the receptive field. For example, the length and width of the window of the pooling layer are 3, the length and width of the first three pooling layer steps are 2, and the length and width of the last three preceding pooling layer steps are 1.
此外,根据本公开的实施例,上采样的实现可以采用加洞卷积的方式,卷积核长、宽为3,步长为1,所采用的加洞值分别为6、12、18,加洞操作就是对卷积核中每两个值补零,补零个数为加洞值,补零后卷积核继续进行卷积操作。In addition, according to an embodiment of the present disclosure, the implementation of upsampling may adopt a method of hole-convolution, the convolution kernel length, the width is 3, the step size is 1, and the hole values used are 6, 12, and 18, respectively. The hole-adding operation is to fill in zero for every two values in the convolution kernel, and the number of zeros is the hole value. After zero-padding, the convolution kernel continues to perform the convolution operation.
根据本公开的实施例,对X射线图进行网络训练,可以得到网络模型。将集装箱图片B带入到网络中,并根据附图5、6,来根据上述方法进行处理,执行步骤如下:According to an embodiment of the present disclosure, network training is performed on an X-ray map, and a network model can be obtained. The container picture B is brought into the network, and according to the above method, according to FIG. 5 and 6, the steps are as follows:
(1)准备数据:由于夹带物品的测试数据图片数量有限,通过数据运算的方法,来生成大量的含夹带物品的图片,用于网络的训练。(1) Preparation data: Due to the limited number of test data images of entrained items, a large number of pictures containing entrained items are generated by data calculation methods for network training.
(2)数据预处理:对集装箱内货物X射线扫描图像的预处理,包括去均值和图像尺寸的剪裁。(2) Data pre-processing: Pre-processing of X-ray scanned images of goods in containers, including de-averaging and image size cropping.
由于辐射图像是单通道的灰度图,假定数据集为A,通过下式进行去均值处理, 即Since the radiation image is a single-channel grayscale image, assuming that the data set is A, the mean value is processed by the following equation, that is,
A mean=A-mean(A) A mean =A-mean(A)
其中mean()表示求均值。对去均值后的网络进行剪裁,得到训练样本。Where mean() means to find the mean. The network after de-averaging is tailored to obtain training samples.
(3)训练网络:在训练阶段,采用一次训练的方法。如图5所示,网络采用跳跃结构,分别在池化层3和池化层4后以及卷积网络7后出现三个分支。网络借用全连接网络中综合利用浅、中、深三个层次上的特征来辅助网络进行训练预测。综合不同尺度的特征对于模型预测的正确率具有很大的改进。本领域技术人员应该意识到,可以设置更多的分支,例如4个,或者更少的分支,例如2个。(3) Training network: In the training phase, a training method is adopted. As shown in FIG. 5, the network adopts a hopping structure, and three branches appear after the pooling layer 3 and the pooling layer 4 and after the convolution network 7, respectively. The network borrows all-connected networks to comprehensively utilize features at the shallow, medium, and deep levels to assist the network in training prediction. The combination of features of different scales greatly improves the accuracy of model prediction. Those skilled in the art will appreciate that more branches, for example four, or fewer branches, for example two, may be provided.
(4)测试阶段:将待测试的图片带入到网络中,通过前馈计算来在线提取卷积层的特征,记作scores。(4) Test phase: Bring the picture to be tested into the network, and extract the features of the convolution layer online by feedforward calculation, and record it as scores.
(5)提取各通路中网络末端全连接层的值,记作权重矢量w1,w2,w3。(5) Extract the values of the fully connected layers at the end of the network in each path, and record them as weight vectors w1, w2, w3.
(6)提取每个通路中卷积层的特征,记作CAM_conv1,CAM_conv2,CAM_conv3。(6) Extract the features of the convolution layer in each path, denoted as CAM_conv1, CAM_conv2, CAM_conv3.
(7)求取各个特征层的均值,即scores(i)=∑ jscores(i,j),并对其进行降序排列。取得分高于阈值的特征层进行后续操作。 (7) Find the mean of each feature layer, ie scores(i)=∑ j scores(i,j), and arrange them in descending order. The feature layer above the threshold is obtained for subsequent operations.
(8)对于每个通道,将通路中最后一个池化层前的卷积层的特征与权重对应相乘求和,得到该通道的CAM图,例如得到三个路的CAM图:w1*CAM_conv1,w2*CAM_conv2.,w3*CAM_conv3,如图6所示。(8) For each channel, the characteristics of the convolution layer before the last pooling layer in the path are multiplied and weighted to obtain a CAM map of the channel, for example, a CAM map of three paths: w1*CAM_conv1 , w2*CAM_conv2., w3*CAM_conv3, as shown in Figure 6.
具体而言,对于待测定的图片,让f k(x,y)表示空间位置(x,y)在一个通路中最后一个池化层前的卷积层上的单元k活性。这样对于单元k,其全局平均池化的结果F k表示为 Specifically, for the picture to be determined, let f k (x, y) denote the unit k activity of the spatial position (x, y) on the convolution layer before the last pooling layer in one path. Thus for unit k, the global average pooled result F k is expressed as
F k=∑ x,yf k(x,y) F k =∑ x,y f k (x,y)
对于,某一类别c,输入到分类函数,则有For a category c, enter the classification function, then
Figure PCTCN2018107317-appb-000001
Figure PCTCN2018107317-appb-000001
其中,
Figure PCTCN2018107317-appb-000002
是对应单元k上类别c的权重,指示F k在类别c中的重要程度。最后,类别c的分类输出为,
Figure PCTCN2018107317-appb-000003
among them,
Figure PCTCN2018107317-appb-000002
Is the weight of the category c on the corresponding unit k, indicating the degree of importance of F k in the category c. Finally, the classification output of category c is,
Figure PCTCN2018107317-appb-000003
由于偏置项对于分类结果没有任何影响,所以这里并不考虑偏置项,设定softmax层的输入偏置为0。将F k=∑ x,yf k(x,y)引入到
Figure PCTCN2018107317-appb-000004
中,则有
Since the offset term has no effect on the classification result, the offset term is not considered here, and the input bias of the softmax layer is set to zero. Introducing F k =∑ x,y f k (x,y)
Figure PCTCN2018107317-appb-000004
In, there is
Figure PCTCN2018107317-appb-000005
Figure PCTCN2018107317-appb-000005
这里,S c=∑ x,yM c(x,y),因此M c(x,y)直接指示空间位置类c活性的重要程度。 Here, S c = ∑ x, y M c (x, y), so M c (x, y) directly indicates the importance of the spatial position class c activity.
(9)将三个通道得到活性图CAM1、CAM2和CAM3分别进行插值(例如上采样),得到原尺度的图像,然后进行相加求和,得到最终的CAM图。(9) The three channels are subjected to interpolation (for example, upsampling) of the active images CAM1, CAM2, and CAM3, respectively, to obtain an image of the original scale, and then summed and summed to obtain a final CAM map.
(10)将CAM图转化为热度图,按相应的比例因子与原图像相加,得到的加权和作为最终的结果(10) Convert the CAM map into a heat map, add the corresponding scale factor to the original image, and obtain the weighted sum as the final result.
Output=0.3×image+0.7×CAMOutput=0.3×image+0.7×CAM
上述image表示原图,CAM表示CAM图。本领域的技术人员理解可以采用其他的权重系数对二者进行加权求和,输出结果。The above image represents the original image, and CAM represents the CAM image. Those skilled in the art understand that other weighting coefficients can be used to weight the two and output the result.
(11)利用阈值划分的方法来提取出图像中异常的区域,并用方框标示。本领域的技术人员可以理解,上述将CAM图转换成热力图的过程可以省略,或者采用其他的加权因子来得到输出结果。(11) A method of threshold division is used to extract an abnormal region in the image and is indicated by a square. Those skilled in the art will appreciate that the above process of converting a CAM map into a heat map may be omitted, or other weighting factors may be used to obtain an output result.
此外,虽然上述实施例中采用的是在一个神经网络中设置多个分支的方式,但是本领域的技术人员可以理解,也可以对输入的图像进行下采样,得到多个尺度的图像。针对多个尺度的图像,使用各自的卷积神经网络进行处理。然后,对各个通道的处理结果进行上采样,在原始图像尺寸上融合多个通道处理的结果。Further, although the above embodiment employs a method of setting a plurality of branches in one neural network, those skilled in the art can understand that the input image can also be down-sampled to obtain images of a plurality of scales. Images for multiple scales are processed using their respective convolutional neural networks. Then, the processing results of the respective channels are upsampled, and the results of the multiple channel processing are combined on the original image size.
本公开利用了卷积神经网络局部特性能够定位物体的性质,同时引入卷积神经网络多尺度性征,来描述不同类别物体的位置信息。使用上采样,能够保持分类图片与原图片在尺度上的一致性,从而更准确清楚地实现物品位置信息的刻画。The present disclosure utilizes the local characteristics of convolutional neural networks to locate the properties of objects, and introduces multi-scale features of convolutional neural networks to describe the location information of different classes of objects. By using upsampling, it is possible to maintain the consistency of the classified picture and the original picture in scale, thereby more accurately and clearly characterizing the position information of the item.
以上的详细描述通过使用示意图、流程图和/或示例,已经阐述了检查方法和检查设备的众多实施例。在这种示意图、流程图和/或示例包含一个或多个功能和/或操作的情况下,本领域技术人员应理解,这种示意图、流程图或示例中的每一功能和/或操作可以通过各种结构、硬件、软件、固件或实质上它们的任意组合来单独和/或共同实现。在一个实施例中,本发明的实施例所述主题的若干部分可以通过专用集成电路(ASIC)、现场可编程门阵列(FPGA)、数字信号处理器(DSP)、或其他集成格式来实现。然而,本领域技术人员应认识到,这里所公开的实施例的一些方面在整体上或部分地可以等同地实现在集成电路中,实现为在一台或多台计算机上运行的一个或多个计算机程序(例如,实现为在一台或多台计算机系统上运行的一个或多个程序),实现为在一个或多个处理器上运行的一个或多个程序(例如,实现为在一个或多个微处理器上运行的一个或多个程序),实现为固件,或者实质上实现为上述方式的任意组合,并且本领域技术人员根据本公开,将具备设计电路和/或写入软件和/或固件代码的能力。此外,本领域技术人员将认识到,本公开所述主题的机 制能够作为多种形式的程序产品进行分发,并且无论实际用来执行分发的信号承载介质的具体类型如何,本公开所述主题的示例性实施例均适用。信号承载介质的示例包括但不限于:可记录型介质,如软盘、硬盘驱动器、紧致盘(CD)、数字通用盘(DVD)、数字磁带、计算机存储器等;以及传输型介质,如数字和/或模拟通信介质(例如,光纤光缆、波导、有线通信链路、无线通信链路等)。The above detailed description has set forth numerous embodiments of inspection methods and inspection apparatus by using schematics, flowcharts, and/or examples. In the event that such schematics, flowcharts, and/or examples include one or more functions and/or operations, those skilled in the art will appreciate that each function and/or operation in such a schematic, flowchart, or example may They are implemented individually and/or collectively by various structures, hardware, software, firmware or virtually any combination thereof. In one embodiment, portions of the subject matter of embodiments of the present invention may be implemented in an application specific integrated circuit (ASIC), field programmable gate array (FPGA), digital signal processor (DSP), or other integrated format. However, those skilled in the art will appreciate that some aspects of the embodiments disclosed herein may be implemented in an integrated circuit as a whole or in part, as one or more of one or more computers running on one or more computers. A computer program (eg, implemented as one or more programs running on one or more computer systems) implemented as one or more programs running on one or more processors (eg, implemented as one or One or more programs running on a plurality of microprocessors, implemented as firmware, or substantially in any combination of the above, and those skilled in the art, in accordance with the present disclosure, will be provided with design circuitry and/or write software and / or firmware code capabilities. Moreover, those skilled in the art will recognize that the mechanisms of the subject matter described herein can be distributed as a variety of forms of program products, and regardless of the particular type of signal-bearing medium that is actually used to perform the distribution, the subject matter of the present disclosure The exemplary embodiments are applicable. Examples of signal bearing media include, but are not limited to, recordable media such as floppy disks, hard drives, compact disks (CDs), digital versatile disks (DVDs), digital tapes, computer memories, and the like; and transmission-type media such as digital and / or analog communication media (eg, fiber optic cable, waveguide, wired communication link, wireless communication link, etc.).
虽然已参照几个典型实施例描述了本发明,但应当理解,所用的术语是说明和示例性、而非限制性的术语。由于本发明能够以多种形式具体实施而不脱离发明的精神或实质,所以应当理解,上述实施例不限于任何前述的细节,而应在随附权利要求所限定的精神和范围内广泛地解释,因此落入权利要求或其等效范围内的全部变化和改型都应为随附权利要求所涵盖。While the invention has been described with respect to the exemplary embodiments illustrated embodiments The present invention may be embodied in a variety of forms without departing from the spirit or scope of the invention. It is to be understood that the invention is not limited to the details. All changes and modifications that come within the scope of the claims or the equivalents thereof are intended to be covered by the appended claims.

Claims (13)

  1. 一种检查方法,包括步骤:An inspection method comprising the steps of:
    用X射线扫描被检查物体,得到被检查物体的X射线图像;Scanning the object to be inspected by X-rays to obtain an X-ray image of the object to be inspected;
    利用卷积神经网络处理所述被检查物体的X射线图像,得到所述被检查物体的类别活性图;Processing an X-ray image of the object to be inspected by using a convolutional neural network to obtain a class activity map of the object to be inspected;
    基于所述类别活性图确定所述被检查物体中是否有包括可疑对象。Determining whether or not the object to be inspected includes a suspicious object based on the class activity map.
  2. 如权利要求1所述的检查方法,其中,所述卷积神经网络包括与不同尺度对应的多个通路,每个通路具有至少一个卷积层、在所述至少一个卷积层后的池化层和一个全卷积层,并且所述全卷积层用于输出相应尺度下的权重矢量,所述得到被检查物体的类别活性图的步骤包括:The inspection method according to claim 1, wherein said convolutional neural network comprises a plurality of paths corresponding to different scales, each path having at least one convolution layer, pooling after said at least one convolution layer a layer and a full convolution layer, and the full convolution layer is used to output a weight vector at a corresponding scale, and the step of obtaining a class activity map of the object to be inspected includes:
    用每个通路输出的权重矢量与该通路中最后一个池化层之前的那个卷积层的特征进行加权求和,得到该尺度下的类别活性图;Weighting and summing the weight vector outputted by each path and the feature of the convolution layer before the last pooling layer in the path to obtain a class activity map at the scale;
    融合多个尺度下的类别活性图,得到所述被检查物体的类别活性图。A class activity map of a plurality of scales is integrated to obtain a class activity map of the object to be inspected.
  3. 如权利要求2所述的检查方法,其中得到了原始尺度的类别活性图和至少一个较小尺度的类别活性图,对所述至少一个较小尺度下的类别活性图进行上采样,得到上采样的类别活性图,并融合原始尺度的类别活性图和上采样的类别活性图,得到所述被检查物体的类别活性图。The inspection method according to claim 2, wherein a class activity map of the original scale and at least one smaller scale class activity map are obtained, and the class activity map of the at least one smaller scale is upsampled to obtain an upsampling The class activity map, and the original scale class activity map and the upsampled class activity map are combined to obtain a class activity map of the object to be inspected.
  4. 如权利要求1所述的检查方法,其中基于所述类别活性图确定所述被检查物体中是否有包括可疑对象的步骤包括:The inspection method according to claim 1, wherein the determining, based on the class activity map, whether the object to be inspected includes a suspicious object comprises:
    基于所述被检查物体的类别活性图和所述X射线图像得到热力图;Generating a heat map based on the class activity map of the object to be inspected and the X-ray image;
    利用阈值划分的方法判断所述热力图中是否包括可疑对象。A method of threshold division is used to determine whether a suspicious object is included in the heat map.
  5. 如权利要求4所述的检查方法,其中对所述被检查物体的类别活性图和所述X射线图像进行加权求和来得到所述热力图。The inspection method according to claim 4, wherein the heat map is obtained by weighting and summing the class activity map of the object to be inspected and the X-ray image.
  6. 如权利要求2所述的检查方法,其中,所述卷积神经网络中的多个通路共享至少一个卷积层和至少一个池化层。The inspection method according to claim 2, wherein the plurality of paths in the convolutional neural network share at least one convolution layer and at least one pooling layer.
  7. 一种检查设备,包括:An inspection device comprising:
    扫描装置,用X射线对被检查物体进行扫描,得到X射线图像;a scanning device that scans an object to be inspected with X-rays to obtain an X-ray image;
    处理器,配置为:Processor, configured as:
    利用卷积神经网络处理所述被检查物体的X射线图像,得到所述被检查物 体的类别活性图;Processing an X-ray image of the object to be inspected by using a convolutional neural network to obtain a class activity map of the object to be inspected;
    基于所述类别活性图确定所述被检查物体中是否有包括可疑对象。Determining whether or not the object to be inspected includes a suspicious object based on the class activity map.
  8. 如权利要求7所述的检查设备,其中,所述卷积神经网络包括与不同尺度对应的多个通路,每个通路具有至少一个卷积层、在所述至少一个卷积层后的池化层和一个全卷积层,并且所述全卷积层用于输出相应尺度下的权重矢量,所述处理器被配置为:The inspection apparatus according to claim 7, wherein said convolutional neural network includes a plurality of paths corresponding to different scales, each path having at least one convolutional layer, pooling after said at least one convolutional layer a layer and a full convolution layer, and the full convolution layer is used to output a weight vector at a corresponding scale, the processor being configured to:
    用每个通路输出的权重矢量与该通路中最后一个池化层之前的那个卷积层的特征进行加权求和,得到该尺度下的类别活性图;Weighting and summing the weight vector outputted by each path and the feature of the convolution layer before the last pooling layer in the path to obtain a class activity map at the scale;
    融合多个尺度下的类别活性图,得到所述被检查物体的类别活性图。A class activity map of a plurality of scales is integrated to obtain a class activity map of the object to be inspected.
  9. 如权利要求8所述的检查设备,其中所述处理器被配置为:The inspection apparatus of claim 8 wherein said processor is configured to:
    得到原始尺度的类别活性图和至少一个较小尺度的类别活性图,Obtaining a class activity map of the original scale and at least one class activity map of the smaller scale,
    对所述至少一个较小尺度下的类别活性图进行上采样,得到上采样的类别活性图,以及Upsampling the class activity map at the at least one smaller scale to obtain an upsampled class activity map, and
    融合原始尺度的类别活性图和上采样的类别活性图,得到所述被检查物体的类别活性图。A class activity map of the original scale and an upsampled class activity map are combined to obtain a class activity map of the object to be inspected.
  10. 如权利要求7所述的检查设备,其中所述处理器被配置为:The inspection apparatus of claim 7 wherein said processor is configured to:
    基于所述被检查物体的类别活性图和所述X射线图像得到热力图;Generating a heat map based on the class activity map of the object to be inspected and the X-ray image;
    利用阈值划分的方法判断所述热力图中是否包括可疑对象。A method of threshold division is used to determine whether a suspicious object is included in the heat map.
  11. 如权利要求10所述的检查设备,其中所述处理器被配置为对所述被检查物体的类别活性图和所述X射线图像进行加权求和来得到所述热力图。The inspection apparatus according to claim 10, wherein said processor is configured to perform a weighted summation of a class activity map of said object to be inspected and said X-ray image to obtain said heat map.
  12. 如权利要求8所述的检查设备,其中,所述卷积神经网络中的多个通路共享至少一个卷积层和至少一个池化层。The inspection apparatus according to claim 8, wherein the plurality of paths in the convolutional neural network share at least one convolution layer and at least one pooling layer.
  13. 一种计算机可读介质,存储有计算机程序,所述计算机程序被处理器执行时实现如下步骤:A computer readable medium storing a computer program, the computer program being executed by a processor to implement the following steps:
    利用卷积神经网络处理被检查物体的X射线图像,得到所述被检查物体的类别活性图;Processing an X-ray image of the object to be inspected by using a convolutional neural network to obtain a class activity map of the object to be inspected;
    基于所述类别活性图确定所述被检查物体中是否有包括可疑对象。Determining whether or not the object to be inspected includes a suspicious object based on the class activity map.
PCT/CN2018/107317 2017-09-25 2018-09-25 Inspection method and inspection device and computer readable medium WO2019057200A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201710877741.1 2017-09-25
CN201710877741.1A CN109557114B (en) 2017-09-25 2017-09-25 Inspection method and inspection apparatus, and computer-readable medium

Publications (1)

Publication Number Publication Date
WO2019057200A1 true WO2019057200A1 (en) 2019-03-28

Family

ID=65810737

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/107317 WO2019057200A1 (en) 2017-09-25 2018-09-25 Inspection method and inspection device and computer readable medium

Country Status (2)

Country Link
CN (1) CN109557114B (en)
WO (1) WO2019057200A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111046939A (en) * 2019-12-06 2020-04-21 中国人民解放军战略支援部队信息工程大学 CNN (CNN) class activation graph generation method based on attention

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110648332B (en) * 2019-09-12 2022-03-15 电子科技大学 Image discriminable area extraction method based on multi-branch convolutional neural network feature orthogonality
CN111275682A (en) * 2020-01-19 2020-06-12 上海箱云物流科技有限公司 Container detection method, device and computer readable storage medium

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9547821B1 (en) * 2016-02-04 2017-01-17 International Business Machines Corporation Deep learning for algorithm portfolios
CN107145898A (en) * 2017-04-14 2017-09-08 北京航星机器制造有限公司 A kind of ray image sorting technique based on neutral net

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104977313A (en) * 2014-04-09 2015-10-14 四川省特种设备检验研究院 Method and device for detecting and identifying X-ray image defects of welding seam
CN104850836B (en) * 2015-05-15 2018-04-10 浙江大学 Insect automatic distinguishing method for image based on depth convolutional neural networks
CN105956532B (en) * 2016-04-25 2019-05-21 大连理工大学 A kind of traffic scene classification method based on multiple dimensioned convolutional neural networks
CN105975931B (en) * 2016-05-04 2019-06-14 浙江大学 A kind of convolutional neural networks face identification method based on multiple dimensioned pond
CN106485268B (en) * 2016-09-27 2020-01-21 东软集团股份有限公司 Image identification method and device
CN106529602B (en) * 2016-11-21 2019-08-13 中国科学院上海微系统与信息技术研究所 A kind of millimeter-wave image automatic target recognition method and device
CN106599967A (en) * 2016-12-08 2017-04-26 同方威视技术股份有限公司 Security check item positioning label and security check item positioning method
CN106874845B (en) * 2016-12-30 2021-03-26 东软集团股份有限公司 Image recognition method and device
CN106910192B (en) * 2017-03-06 2020-09-22 长沙全度影像科技有限公司 Image fusion effect evaluation method based on convolutional neural network

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9547821B1 (en) * 2016-02-04 2017-01-17 International Business Machines Corporation Deep learning for algorithm portfolios
CN107145898A (en) * 2017-04-14 2017-09-08 北京航星机器制造有限公司 A kind of ray image sorting technique based on neutral net

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
AKCAY, S. ET AL.: "Transfer Learning Using Convolutional Neural Networks for Object Classification within X-RAY Baggage Security Imagery", IEEE, 3 October 2016 (2016-10-03), pages 1 - 4, XP033016685 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111046939A (en) * 2019-12-06 2020-04-21 中国人民解放军战略支援部队信息工程大学 CNN (CNN) class activation graph generation method based on attention
CN111046939B (en) * 2019-12-06 2023-08-04 中国人民解放军战略支援部队信息工程大学 Attention-based CNN class activation graph generation method

Also Published As

Publication number Publication date
CN109557114A (en) 2019-04-02
CN109557114B (en) 2021-07-16

Similar Documents

Publication Publication Date Title
JP6678192B2 (en) Inspection equipment and firearm detection method
US10102641B2 (en) Methods for extracting shape feature, inspection methods and apparatuses
Rogers et al. Automated x-ray image analysis for cargo security: Critical review and future promise
KR101792927B1 (en) Object inspection method, display method and device
BR112021011253A2 (en) Machine Learning Systems Training for Seismic Interpretation
WO2015067208A1 (en) Detection method and device
WO2016107009A1 (en) Method and system for checking goods
CN109522913B (en) Inspection method and inspection apparatus, and computer-readable medium
WO2019057200A1 (en) Inspection method and inspection device and computer readable medium
WO2015171943A1 (en) Image-based object detection and feature extraction from a reconstructed charged particle image of a volume of interest
US20220366181A1 (en) Devices, systems, and methods for anomaly detection
JP2011089920A (en) X-ray inspection method and x-ray inspection apparatus using the same
Wu et al. End-to-end lung nodule detection in computed tomography
Van De Looverbosch et al. Inline nondestructive internal disorder detection in pear fruit using explainable deep anomaly detection on X-ray images
CN114898044B (en) Imaging method, device, equipment and medium for detection object
Bekara et al. Automatic QC of denoise processing using a machine learning classification
Wei et al. A real-time Threat Image Projection (TIP) model base on deep learning for X-ray baggage inspection
Zou et al. Cross-scene target detection based on feature adaptation and uncertainty-aware pseudo-label learning for high resolution sar images
Lu et al. A feature-based physics-constrained active dictionary learning scheme for image-based additive manufacturing process monitoring
CN114676759A (en) Method and device for detecting contraband in security inspection image
KR20230036030A (en) Method and system for detecting anomalies in images using a plurality of machine learning programs
Xu et al. Find the centroid: A vision‐based approach for optimal object grasping
Norrstig Visual object detection using convolutional neural networks in a virtual environment
Sun et al. Integrate Light-Weight Deep Learning Tools with Internet of Things
CN117994216A (en) Multi-scale feature fusion lesion detection method for tumor habitat analysis of medical image

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18858216

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 18858216

Country of ref document: EP

Kind code of ref document: A1