WO2023236789A1 - Monitoring system - Google Patents

Monitoring system Download PDF

Info

Publication number
WO2023236789A1
WO2023236789A1 PCT/CN2023/096516 CN2023096516W WO2023236789A1 WO 2023236789 A1 WO2023236789 A1 WO 2023236789A1 CN 2023096516 W CN2023096516 W CN 2023096516W WO 2023236789 A1 WO2023236789 A1 WO 2023236789A1
Authority
WO
WIPO (PCT)
Prior art keywords
camera
image
area
circuit
monitoring system
Prior art date
Application number
PCT/CN2023/096516
Other languages
French (fr)
Chinese (zh)
Inventor
萨里尼约瑟夫
姚晨
杉山聪
Original Assignee
索尼半导体解决方案公司
萨里尼约瑟夫
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 索尼半导体解决方案公司, 萨里尼约瑟夫 filed Critical 索尼半导体解决方案公司
Publication of WO2023236789A1 publication Critical patent/WO2023236789A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/18Closed-circuit television [CCTV] systems, i.e. systems in which the video signal is not broadcast
    • H04N7/181Closed-circuit television [CCTV] systems, i.e. systems in which the video signal is not broadcast for receiving images from a plurality of remote sources
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/50Constructional details
    • H04N23/54Mounting of pick-up tubes, electronic image sensors, deviation or focusing coils
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/60Control of cameras or camera modules
    • H04N23/61Control of cameras or camera modules based on recognised objects
    • H04N23/611Control of cameras or camera modules based on recognised objects where the recognised objects include parts of the human body
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N25/00Circuitry of solid-state image sensors [SSIS]; Control thereof
    • H04N25/70SSIS architectures; Circuits associated therewith
    • H04N25/76Addressed sensors, e.g. MOS or CMOS sensors
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/18Closed-circuit television [CCTV] systems, i.e. systems in which the video signal is not broadcast

Definitions

  • Cameras and external processing units that communicate with the cameras, such as edge computers, cloud processors, and various smart terminals such as mobile phones.
  • the camera sends real-time captured images to the processing unit, which displays and processes the images.
  • the processing unit In sales venues, some intelligent monitoring systems can process images through edge computing and obtain the sales status of goods.
  • the camera needs to capture and store a large amount of image/video data, and it needs to be sent to the processing unit, significantly increasing the camera's power consumption.
  • data transmission is limited by the bandwidth of wired or wireless networks, making the transmission quality unstable or even low.
  • the data transmitted by the camera may contain images of people, exacerbating the risk of personal privacy leakage.
  • the image processing involved in current surveillance systems is performed in a processing unit outside the camera.
  • a processing unit outside the camera.
  • cloud processors are used to process images sent by cameras. This will also cause problems such as delays, losses and even errors in data transmission due to poor network transmission quality, which will lead to a decrease in the accuracy of image processing results.
  • the present invention proposes a monitoring system, which includes a smart camera with image processing function and at least one graphic mark.
  • Smart cameras are capable of processing images inside the camera and sending only the processing results rather than the complete image to the outside of the camera.
  • the power consumption of the camera and the monitoring system's dependence on the network are significantly reduced, and the speed and accuracy of data transmission are improved. Problems such as data packet loss, transmission errors, data delays, etc. of traditional monitoring systems will not occur.
  • personal privacy can also be safely protected by sending only the processing results as feature data instead of image data.
  • the present invention uses at least one graphic mark to monitor the target, which greatly reduces the amount of image/video data that needs to be processed, and also significantly reduces the complexity of the calculation model.
  • the monitoring system may include: a camera that captures an image of a monitoring area and stores the image; and at least one graphic marker installed to at least one placement position in the monitoring area, wherein the at least Each of a graphical marker is invisible to the camera when the target object is located at the corresponding position, wherein the camera includes circuitry configured to: capture the monitoring at a predetermined frequency image of the area, determine whether the graphic mark is included in each captured image, and send the determination result to the outside of the camera.
  • the circuit of the camera is further configured to extract object information from the image of the monitoring area and send the object information to the outside of the camera.
  • each of the at least one graphical mark has a selected color that is different from the color of the target object to be placed on the corresponding said storage location.
  • a plurality of graphic marks can be installed to a plurality of storage positions in the monitoring area respectively, and the plurality of graphic marks are different from each other.
  • multiple graphic indicia may have different patterns, shapes, and/or colors.
  • the circuitry of the camera is further configured to associate each image marker with a corresponding storage location and/or an object to be placed at the corresponding storage location.
  • the circuit of the camera is further configured to adjust the camera according to the recognition accuracy of the at least one graphic mark in the multiple images of the monitoring area by capturing and analyzing multiple images of the monitoring area. shooting parameters.
  • the circuit of the camera is further configured to: intercept an area of interest from the first image of the monitoring area, where the area of interest contains the object of interest located at the corresponding object location, A second image of the area of interest is captured, and the target object information of the target of interest is extracted from the second image.
  • the circuitry of the camera is further configured to, when determining that the graphic mark is included in a captured image, send the determination result together with a fill signal indicating an empty storage location.
  • the circuit of the camera is further configured to send the determination result and a filling signal indicating an empty storage position when it is determined that the graphic mark is included in a predetermined number of consecutive captured images.
  • Figure 1 is a schematic diagram showing a first embodiment of a monitoring system according to the present invention.
  • Figure 2 is a schematic diagram illustrating one embodiment of graphical markings of the monitoring system according to the present invention.
  • FIG. 3 is a schematic diagram showing one embodiment of the chip structure of the camera of the monitoring system according to the present invention.
  • Figure 4 is a schematic diagram showing a second embodiment of the monitoring system according to the present invention.
  • Figure 5 is a schematic diagram showing a third embodiment of the monitoring system according to the present invention.
  • FIG. 6 is a schematic diagram illustrating an embodiment of a camera of the surveillance system recognizing and extracting facial features of a human face according to the present invention.
  • FIG. 7 is a schematic diagram showing an embodiment of images captured by a camera of the monitoring system according to the present invention.
  • FIG. 8 is a schematic diagram illustrating an embodiment in which a camera of a surveillance system determines graphic markers in a surveillance area according to the present invention.
  • FIG. 9 is a schematic diagram showing another embodiment of a camera of a surveillance system determining graphic markers in a surveillance area according to the present invention.
  • FIG. 11 is a schematic diagram showing the working flow of the camera of the monitoring system according to the present invention.
  • Figure 1 shows a first embodiment of the monitoring system of the present invention.
  • the camera 1 of the monitoring system is installed in the monitoring area and can perform panoramic monitoring of the monitoring area.
  • a plurality of target objects 2 are arranged in the monitoring area, and the target objects 2 are placed on corresponding storage positions 3 respectively.
  • a supply source 8 of target objects is also arranged in the monitoring area, which is used to replenish the target objects 2 to the corresponding storage position 3 in time when needed.
  • the monitoring area can be a vending machine, and the target object 2 is a commodity. Based on the real-time monitoring of the camera 1, after receiving the filling instruction, the vending machine automatically fills the corresponding storage position with the target object 2 stored in the supply source 8.
  • the monitoring area can also be a storage area such as a warehouse, where the target object 2 is the stored goods and the storage location 3 is the inventory location in the warehouse. Based on the real-time monitoring of the camera 1, after receiving the filling instruction, the target object 2 in the supply source 8 is automatically replenished to the corresponding storage position 3.
  • the monitoring area is not limited to this.
  • the monitoring system of the present invention can be applied to any area that has monitoring requirements for target items. place.
  • the monitoring system also includes at least one graphic marker 4 .
  • a plurality of graphic marks 4 are respectively installed on corresponding target object storage positions 3 .
  • the graphic mark 4 can be adhered to the storage position 3 by means of an adhesive without any modification of the storage position 3 being necessary.
  • the graphic mark can be embedded in the corresponding storage position 3 through mechanical means. This method can reliably prevent the graphic mark from falling off or falling off due to operating errors.
  • the graphic mark 4 is invisible to the camera 1 .
  • the graphic mark 4 can be adhered to the surface of the object placement position 3 so that the target object 2 will block the graphic mark 4 after it is placed.
  • the graphic mark 4 on the storage position 3 is exposed and can then be captured by the camera 1 .
  • the camera 1 may determine that the corresponding storage position is empty based on detecting the graphic mark 4 in the captured image, and then send the determination result to the outside of the camera, such as an external processing unit.
  • the external processing unit sends instructions to the monitoring area based on the received judgment result that there is an empty level (empty level 3), and the monitoring area performs corresponding measures according to the instructions, such as filling the empty level, or sending out an alarm signal, etc.
  • FIG. 2 shows that multiple target objects 2 and multiple graphic marks 4 are respectively placed in multiple target object placement positions 3 .
  • Each graphic mark 4 is different, so that when the camera 1 recognizes the graphic mark 4, it can simultaneously determine the specific location of the corresponding object location and the corresponding target object information (eg, type/category, etc.).
  • the circuit included in the camera 1 can pre-associate each graphic mark 4 with information such as the corresponding object position 3 and/or the type, position coordinates and other information of the target object 2 .
  • graphic marks can be distinguished by graphics, shape, and/or color.
  • the present invention is not limited to this. Any graphic markings that are distinguishable from each other and thereby enable the camera to distinguish them from the image can be used in the present invention.
  • each graphic mark should also be distinguished from the corresponding target object, so that the camera can accurately identify the graphic mark and the target object respectively.
  • the image mark may include one or more colors that are different from the colors contained in the target object.
  • the graphic mark 4 has a lighter color (white in the figure) than the target 2 and has diagonal stripes. The camera 1 can easily and quickly detect the graphic mark in the captured image by recognizing the graphic and the different colors, thereby determining the empty storage position.
  • Figure 2 only shows graphic markers and targets distinguished by a single shade of color and a simple stripe pattern. It is easy to understand that graphic marks can be constructed in a variety of ways, including but not limited to one or a combination of graphics, shapes, colors, special symbols, etc.
  • Camera 1 is a smart camera with its own image processing function.
  • the interior of the camera 1 includes an integrated circuit chip, which may have a stacked structure as shown in FIG. 3 , such as a stacked complementary metal oxide semiconductor (CMOS) image sensor chip.
  • CMOS complementary metal oxide semiconductor
  • the integrated chip is composed of a stacked substrate structure.
  • a pixel array composed of a plurality of pixels is arranged on the first substrate in the stacked substrate. The pixel array converts the optical signal received by the camera lens into an electrical signal through photoelectric conversion, and transmits the electrical signal to the second substrate in the stacked substrate.
  • CMOS complementary metal oxide semiconductor
  • a memory and a processing circuit are arranged in the second substrate, and the second substrate is coupled to the first substrate (the coupling structure is not shown in the figure) to communicate with each other.
  • the memory may also be located on a third substrate (not shown) in the stacked substrate. That is, the pixel array, memory and processing unit can be located on different substrates. It should be understood that the above-mentioned stacked arrangement of multiple substrates is only exemplary. pixel arrays, pixel circuits, processing circuits, memories and other components can be designed on multiple substrates and stacked according to design requirements.
  • the processing circuit includes, for example, a digital signal processor (DSP), which generates image data based on electrical signals and stores the image data in a memory.
  • Predetermined algorithm models are also stored in the memory, and the processing circuit can perform various processing on the image based on these algorithm models, such as detecting whether there are graphic markers in the surveillance image, and different processing of the image as will be described in detail below.
  • the image processing of the surveillance system can be performed on the camera side, rather than in an external processor (edge computer, cloud server, mobile phone terminal, etc.) connected to the camera through wired or wireless connections.
  • camera 1 can also access the network, for example, to obtain or update algorithm models, etc.
  • FIG. 8 shows an example of an image of the monitoring area captured by the camera 1 .
  • the left image in FIG. 8 shows the first image captured by the camera 1 at a certain moment.
  • the circuit in the camera 1 processes the first image and initially detects the graphic marks 4 and 4'. After further calculation and processing, for example, comparing it with a pre-stored standard graphic mark image, it is determined that the graphic mark 4 is detected (for example, the probability greater than 90%). At the same time, the circuit determines that the probability that the pattern mark 4' is the pattern mark 4 is 90% or less.
  • the circuit can intercept the area of interest containing the graphic mark 4' from the first image (as shown in the dotted box in the left image of Figure 8), and instruct the camera 1 to adjust the shooting parameters, such as focal length, resolution, exposure time and possible other parameters (white balance, noise reduction, etc.), and then shoot the area of interest containing the graphic mark 4' to obtain a second image with higher image quality, as shown in the right image of Figure 8.
  • the circuit processes the clear second image, confirms that graphic mark 4 is detected, and then sends the judgment result to the outside of the camera.
  • the camera circuit can send only the judgment result to the outside. For example, in the embodiment shown in FIG. 8 , only the result of detecting two graphic marks 4 is sent to the external processing unit as feature data. Based on the detection results, the external processing unit determines that two vacant material levels appear in the monitoring area, and obtains the material location information and target object information associated with these two graphic marks 4. Then, the processing unit can issue corresponding instructions to the monitoring area, such as replenishing the target object 2 to the corresponding empty level, and/or causing the monitoring area to issue an alarm.
  • the surveillance camera of the present invention Compared with traditional surveillance systems that need to send a large amount of image/video data to the outside for processing and judgment by an external processing unit, the surveillance camera of the present invention only uses the detection results as feature data (for example, "m where target 2 is placed" bits and n bits found graphic mark 4" or "bit m and n bits are vacant material levels") are sent to the outside, which greatly reduces the amount of transmitted data, thereby significantly improving the quality and speed of transmission.
  • traditional surveillance systems monitor targets by detecting and judging target images in surveillance images. The detection and judgment of target images requires complex modeling and practice processes.
  • the present invention proposes to use graphic markers to indirectly monitor target objects by detecting graphic markers in monitoring images.
  • the process of modeling and practicing computational models is greatly simplified, and the amount of data that needs to be processed is significantly reduced, thereby significantly improving image processing speed.
  • the camera can adjust the shooting parameters and angles in real time and automatically as needed. The present invention only needs to arrange one camera in the monitoring area.
  • the camera can also send part of the image data outside the camera.
  • the camera cannot always accurately determine whether a graphic mark is detected (for example, when a predetermined number of shooting and detection are repeated, the probability is always below 90%, for example).
  • the camera can send the second image as shown in Figure 8 and/or the area of interest image captured from the first image or the second image to the outside of the camera through a request from the external processing unit, for identification by the user or more complex
  • the system makes a judgment.
  • the amount of image data that needs to be sent is still much smaller than that of a traditional surveillance system, because the traditional surveillance system does not perform any processing on the captured images.
  • the camera is also capable of sending complete image data external to the camera. In this case, the external processing unit can issue a complete data request to the camera.
  • the camera 1 can use the graphic mark 4 to automatically, quickly and accurately determine whether the target object in the monitoring area exists (for example, whether it is sold, unloaded, stolen or accidentally dropped, etc.).
  • camera 1 can also determine whether the layout of the target object is correct. Specifically, the camera 1 acquires a first image of the monitoring area, and the camera circuit intercepts an area of interest from the first image, where the area of interest contains at least one target object. The camera circuit processes the area of interest to obtain target information. For example, the camera circuit can identify the type/category of the target by comparing and matching the target in the image with pre-stored images of multiple targets. When the target object is a product with a machine-readable barcode such as a QR code or barcode, the camera circuit can also obtain information such as the type/type of the target object by identifying the QR code or barcode on the package of the target object.
  • a machine-readable barcode such as a QR code or barcode
  • the camera circuit can adjust the shooting parameters to obtain a second image containing the area of interest, and then obtain the second image with improved image quality. Extract target object information from the second image. Then, the camera circuit can compare the target object information with the pre-stored arrangement information to determine whether the target object is placed in the correct position.
  • the camera circuit can also send the target object information to an external processing unit, and the external processing unit performs subsequent processing and judgment.
  • the camera circuit can only extract and identify the target object information data (for example, "such and such Coke") from the image, instead of a huge amount of image data. Sent to the outside, thereby significantly reducing the transmission pressure on network bandwidth.
  • the camera circuitry can also send partial or complete image data to the outside based on external requests.
  • the camera circuit can adjust the shooting parameters in advance based on the recognition accuracy of the graphic marks in the monitoring area image for different monitoring areas, targets therein, and the layout of the targets. set up.
  • the camera can take multiple shots of the monitoring area with multiple sets of different shooting parameters in advance, and process and analyze the multiple sets of images obtained.
  • the shooting parameters here include, for example, but are not limited to shooting angle, resolution, exposure time, white balance, noise reduction, etc.
  • the camera circuit determines a preferred shooting parameter range suitable for the monitoring area and its target objects. Therefore, based on the determined shooting parameter range, the camera circuit can set the shooting parameters of the monitoring area in advance. In this way, the judgment efficiency and accuracy during monitoring are further improved.
  • FIG 4 shows a second embodiment of a monitoring system according to the invention.
  • the monitoring system is provided in a monitoring area of a vending store, for example.
  • the camera 1 is arranged so that its viewing angle covers multiple targets 2 sales area and inventory area 5.
  • the sales area includes a plurality of storage locations (not shown), and graphic marks (not shown) are arranged on each storage location. Graphical markings are similar to those described above for Figure 2.
  • the monitoring area is also equipped with sensors (not shown) that monitor whether someone enters the monitoring area. Camera 1 is connected to this sensor and receives the signal from the sensor. When the sensor sends a signal that someone enters the monitoring area, the camera 1 ends the standby state, starts shooting the monitoring area at a predetermined frequency, and processes the images.
  • camera 1 Upon detecting one or more graphic markers, camera 1 sends the detection results to the external processing unit.
  • the camera 1 directly sends a filling signal to the inventory area 5 based on the detection result, instructing the inventory area 5 to fill the target object into the corresponding empty level.
  • the camera 1 takes a picture of the inventory area 5 and processes the captured image to determine whether the supply source of the target object is empty. If it is determined that an empty supply source exists, the determination result is sent to the external processing unit. After receiving the reminder from the external processing unit, the user can replenish the supply source accordingly.
  • FIG. 5 shows a third embodiment of a monitoring system according to the invention.
  • the third embodiment differs from the second embodiment in that the camera 1 of the surveillance system is arranged to be able to simultaneously monitor the sales area, the storage area 5 and the entrance to the surveillance area. That is, camera 1 also monitors whether anyone enters the monitoring area.
  • the camera takes pictures of the entrance to the surveillance area at a predetermined frequency, and determines whether someone has entered the surveillance area by detecting and extracting facial features in the image (i.e., the dotted facial feature map in Figure 6) district.
  • Detecting facial features typically involves locating faces in images and detecting key facial structural features.
  • the camera circuit uses, for example, a deep learning-based algorithm to locate the face in the image and obtain the face boundary points.
  • key facial structural features in the face area are detected within the face boundary, such as the mouth, right eyebrow, left eyebrow, right eye, left eye, nose, and mandible, and the feature points shown in Figure 6 are obtained.
  • any known detection and extraction method of facial features can be selected as needed, and the relevant algorithms can be stored in the chip of the camera.
  • relevant calculations can also be performed through the cloud and the calculation results can be obtained.
  • the camera circuit determines that someone has entered the monitoring area based on the detected face facial feature data, and starts photographing the sales area at a predetermined frequency based on the determination.
  • the camera circuitry may send the facial feature data externally.
  • the camera 1 sends the detected facial feature map to the processing unit 7 outside the camera through the network 6.
  • the processing unit 7 and the external processing units mentioned in this article can be a remote computer as shown in Figure 6, or a cloud processor, an application program, or an application program on a mobile phone terminal, etc.
  • what is transmitted between the camera 1 and the processing unit 7 through the network 6 is the facial feature map extracted by the camera 1. This data is greatly reduced whether compared with the original image data or the face image data. The amount of data transferred and the speed of transfer. In this way, the need to transmit images with huge amounts of data is avoided, while the privacy of consumers is also avoided.
  • camera 1 may capture an image containing human hands.
  • camera 1 can process the human hands in the image, extract visual marks such as birthmarks or tattoos that the hands may contain, and send the visual marks to the outside.
  • the external processing unit can assist in identifying the target object possessing the hand through the visual marker. Since the camera 1 processes the image and only sends the visual mark to the outside, the efficiency of the external processing unit in identifying the target object can be improved.
  • the right image of FIG. 7 shows an image captured by the camera 1 including the graphic mark 4 .
  • the camera 1 determines that the corresponding target object 2 is taken away or sold.
  • a in Figure 9 and the right image in Figure 7 appear in sequentially captured image frames, the camera 1 determines that the target 2 may not have been taken away but accidentally slipped. In this way, the detection of human hands in the image can also assist the camera 1 in determining the status of commodities or goods in the monitoring area.
  • the camera may not be able to accurately determine whether there is a graphic mark 4 in the image due to the occlusion of the human hand.
  • the camera will wait for a certain period of time (to wait for the human hand to leave) before shooting again. If the camera is still unable to make a determination after a predetermined time, for example, a suspicious person remains in the monitoring area, the camera circuit will send a notification signal to the outside to alert the user.
  • FIG. 10 shows a specific example of the camera circuit of the monitoring system according to the present invention determining whether there is a graphic mark in the image.
  • the camera circuit processes the captured image, if the possibility of detecting a graphic mark is judged to be 90% or above based on the algorithm model stored in the camera, it is determined that the graphic mark is detected; if it is determined that the graphic mark is detected When the probability of reaching a graphic mark is 30% or less, it is determined that no graphic mark is detected.
  • step S9 is entered.
  • the camera circuit optimizes the shooting parameters and repeats shooting and judgment until it can accurately judge whether the empty position is filled. Similar to the embodiment shown in FIG. 10 , the steps may be repeated up to three times. If it can be determined whether the graphic mark is detected during the x (x ⁇ 3)-th shooting and analysis (thereby being able to determine whether the vacant material level is filled), enter S7; if it is still repeated after 3 times (i.e., x>3) If it cannot be determined whether the empty material level is filled, the camera circuit sends an external reminder. Users can go to the monitored area to check, for example, to clear obstacles.
  • the above-mentioned image processing includes the processing of graphic marks, targets, and facial features in the image.
  • the calculations involved can be performed using known algorithms for recognizing graphics or using known calculation methods. Modeling. I won’t go into details here.
  • the present invention is obviously not limited to this, but can be applied to any situation where there is a need to monitor target objects.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Image Processing (AREA)
  • Closed-Circuit Television Systems (AREA)

Abstract

The present invention relates to a monitoring system. The monitoring system according to the present invention may comprise: a camera, which captures an image of a monitoring area and stores the image; and at least one graphic marker, which is mounted on at least one storage position in the monitoring area, wherein each of the at least one graphic marker is invisible to the camera when a target object is located at the corresponding storage position; and the camera comprises a circuit, which is configured to: capture an image of the monitoring area at a predetermined frequency, determine whether the graphic marker is comprised in each captured image, and send a determination result to the outside of the camera. By means of the present invention, the power consumption of a camera and the dependence of a monitoring system on a network are significantly reduced, the amount of images/video data to be processed by the camera and the complexity of a computation model are reduced, the speed and accuracy of data transmission are improved, and personal privacy can be securely protected.

Description

监控系统surveillance system
相关申请的引用References to related applications
本申请要求于2022年06月7日向中华人民共和国国家知识产权局提交的第202210636273.X号中国专利申请的权益,在此将其全部内容以援引的方式整体并入本文中。This application claims the rights and interests of Chinese Patent Application No. 202210636273.
技术领域Technical field
本发明涉及一种监控系统,其包括具有图像处理功能的智能相机和至少一个图形标记。该监控系统可对各种应用场所的监控区域中的目标物进行监控。The invention relates to a monitoring system, which includes a smart camera with image processing function and at least one graphic mark. The monitoring system can monitor targets in the monitoring area of various application places.
背景技术Background technique
当前的监控系统一般包括相机和与相机通信的外部处理单元,例如,边缘计算机、云端处理器,以及诸如手机的各种智能终端等。在这样的监控系统中,相机将实时拍摄的图像发送到处理单元,处理单元对图像进行显示和处理。在销售场所,一些智能监控系统能够通过边缘计算等对图像进行处理,获得商品的销售情况等。Current surveillance systems generally include cameras and external processing units that communicate with the cameras, such as edge computers, cloud processors, and various smart terminals such as mobile phones. In such a monitoring system, the camera sends real-time captured images to the processing unit, which displays and processes the images. In sales venues, some intelligent monitoring systems can process images through edge computing and obtain the sales status of goods.
然而,这些监控系统存在诸多问题。首先,相机需要拍摄和存储大量的图像/视频数据,并且需要将其发送给处理单元,显著增加了相机的功耗。并且,数据传输受限于有线或无线网络的带宽,使得传输质量不稳定甚至低下。其次,相机传输的数据可能包含人的图像,使得个人隐私泄露的风险加剧。However, there are many problems with these monitoring systems. First, the camera needs to capture and store a large amount of image/video data, and it needs to be sent to the processing unit, significantly increasing the camera's power consumption. Moreover, data transmission is limited by the bandwidth of wired or wireless networks, making the transmission quality unstable or even low. Secondly, the data transmitted by the camera may contain images of people, exacerbating the risk of personal privacy leakage.
另外,当前的监控系统涉及的图像处理都是在相机外部的处理单元中进行。例如,在诸如智能销售商店/自动售卖机以及智能仓储监控等的自动监控系统中,使用云端处理器等对相机发送的图像进行处理。这也会产生因网络传输质量差而造成的数据传输的延迟、丢失甚至错误等问题,从而导致图像处理结果的准确度下降。In addition, the image processing involved in current surveillance systems is performed in a processing unit outside the camera. For example, in automatic monitoring systems such as smart sales stores/vending machines and smart warehouse monitoring, cloud processors are used to process images sent by cameras. This will also cause problems such as delays, losses and even errors in data transmission due to poor network transmission quality, which will lead to a decrease in the accuracy of image processing results.
再者,传统的自动监控系统采用的都是直接检测和分析监控图像/视频中的目标物的检测方法。这种检测方法需要大量的图像数据和计算,并且需要建立和训练复杂的计算模型。Furthermore, traditional automatic surveillance systems all use detection methods that directly detect and analyze target objects in surveillance images/videos. This detection method requires a large amount of image data and calculations, and requires the establishment and training of complex computational models.
发明内容Contents of the invention
为解决上述技术问题。本发明提出一种监控系统,其包括具有图像处理功能的智能相机和至少一个图形标记。智能相机能够在相机内部对图像进行处理,且仅将处理结果而不是完整图像发送到相机的外部。如此,显著降低了相机的功耗以及监控系统对网络的依赖,并且提高了数据传输的速度和准确性,不会发生传统监控系统的诸如数据包丢失、传输错误、数据延迟等等问题。同时,仅发送作为特征数据的处理结果而不是图像数据,也能够安全保护个人隐私。另外,本发明利用至少一个图形标记来对目标物进行监控,大大降低了需要处理的图像/视频数据的量,同时也显著降低了计算模型的复杂度。In order to solve the above technical problems. The present invention proposes a monitoring system, which includes a smart camera with image processing function and at least one graphic mark. Smart cameras are capable of processing images inside the camera and sending only the processing results rather than the complete image to the outside of the camera. In this way, the power consumption of the camera and the monitoring system's dependence on the network are significantly reduced, and the speed and accuracy of data transmission are improved. Problems such as data packet loss, transmission errors, data delays, etc. of traditional monitoring systems will not occur. At the same time, personal privacy can also be safely protected by sending only the processing results as feature data instead of image data. In addition, the present invention uses at least one graphic mark to monitor the target, which greatly reduces the amount of image/video data that needs to be processed, and also significantly reduces the complexity of the calculation model.
根据本发明的监控系统可包括:相机,其捕获监控区的图像并且存储所述图像;以及至少一个图形标记,其被安装到在所述监控区中的至少一个置物位,其中,所述至少一个图形标记中的每一者在目标物位于相应的所述置物位时对于所述相机时不可见的,其中,所述相机包括电路,所述电路被配置为:以预定频率捕获所述监控区的图像,确定所述图形标记是否包括在每个捕获图像中,并且发送确定结果到所述相机的外部。The monitoring system according to the present invention may include: a camera that captures an image of a monitoring area and stores the image; and at least one graphic marker installed to at least one placement position in the monitoring area, wherein the at least Each of a graphical marker is invisible to the camera when the target object is located at the corresponding position, wherein the camera includes circuitry configured to: capture the monitoring at a predetermined frequency image of the area, determine whether the graphic mark is included in each captured image, and send the determination result to the outside of the camera.
优选地,相机的电路进一步被配置为从所述监控区的图像中提取目标物信息,并且将所述目标物信息发送到所述相机的外部。Preferably, the circuit of the camera is further configured to extract object information from the image of the monitoring area and send the object information to the outside of the camera.
优选地,至少一个图形标记中的每一者具有选定的颜色,所述选定的颜色与要被放置在相应的所述置物位上的所述目标物的颜色不同。Preferably, each of the at least one graphical mark has a selected color that is different from the color of the target object to be placed on the corresponding said storage location.
根据本发明的监控系统,多个图形标记可分别被安装到所述监控区中的多个置物位,并且所述多个图形标记彼此是不同的。例如,多个图形标记可具有不同的图案、形状和/或颜色。According to the monitoring system of the present invention, a plurality of graphic marks can be installed to a plurality of storage positions in the monitoring area respectively, and the plurality of graphic marks are different from each other. For example, multiple graphic indicia may have different patterns, shapes, and/or colors.
优选地,相机的电路还被配置为将每个图像标记与相应的置物位和/或要被放置在相应的置物位处的目标物关联起来。 Preferably, the circuitry of the camera is further configured to associate each image marker with a corresponding storage location and/or an object to be placed at the corresponding storage location.
优选地,相机的电路还被配置为通过捕获、分析所述监控区的多个图像,根据所述至少一个图形标记在所述监控区的所述多个图像中的识别精度来调整所述相机的拍摄参数。Preferably, the circuit of the camera is further configured to adjust the camera according to the recognition accuracy of the at least one graphic mark in the multiple images of the monitoring area by capturing and analyzing multiple images of the monitoring area. shooting parameters.
优选地,为了确定和/或检查目标物,相机的电路还被配置为:从所述监控区的第一图像中截取关注区域,所述关注区域包含位于相应的置物位处的关注目标物,捕获所述关注区域的第二图像,以及从所述第二图像中提取所述关注目标物的所述目标物信息。Preferably, in order to determine and/or inspect the target object, the circuit of the camera is further configured to: intercept an area of interest from the first image of the monitoring area, where the area of interest contains the object of interest located at the corresponding object location, A second image of the area of interest is captured, and the target object information of the target of interest is extracted from the second image.
类似地,为了确定图形标记,优选地,相机的电路还被配置为:从所述监控区的第一图像中截取关注区域,所述关注区域包含所述至少一个图形标记中的关注图形标记,捕获所述关注区域的第二图像,以及确定所述关注图形标记是否包括在所述第二图像中。Similarly, in order to determine the graphic mark, preferably, the circuit of the camera is further configured to: intercept an area of interest from the first image of the monitoring area, the area of interest containing the graphic mark of interest in the at least one graphic mark, A second image of the area of interest is captured, and it is determined whether the graphic marker of interest is included in the second image.
优选地,相机的电路还被配置为接收外部信号,并且当所述外部信号指示有人进入所述监控区时,所述相机开始以所述预定频率捕获所述监控区的图像。可选地,相机的电路还被配置为通过从所述监控区的图像检测和提取人脸面部特征来检测是否有人进入所述监控区。Preferably, the circuit of the camera is further configured to receive an external signal, and when the external signal indicates that someone enters the monitoring area, the camera starts to capture images of the monitoring area at the predetermined frequency. Optionally, the circuit of the camera is also configured to detect whether someone enters the monitoring area by detecting and extracting facial features from images of the monitoring area.
优选地,相机的电路还被配置为当确定所述图形标记被包括在一个捕获图像中时,发送所述确定结果以及指示空的置物位的填充信号。另外优选地,相机的电路还可被配置为当确定所述图形标记被包括在预定数量的连续的捕获图像中时,发送所述确定结果以及指示空的置物位的填充信号。Preferably, the circuitry of the camera is further configured to, when determining that the graphic mark is included in a captured image, send the determination result together with a fill signal indicating an empty storage location. Preferably, the circuit of the camera is further configured to send the determination result and a filling signal indicating an empty storage position when it is determined that the graphic mark is included in a predetermined number of consecutive captured images.
根据本发明的监控系统的相机包括集成芯片,上述相机的电路布置在所述集成芯片上。集成芯片中还包括存储器,所述存储器存储有用于所述电路的图像处理的预定算法。The camera of the monitoring system according to the present invention includes an integrated chip, and the circuit of the above-mentioned camera is arranged on the integrated chip. A memory is also included in the integrated chip, and the memory stores a predetermined algorithm for image processing of the circuit.
本发明的详细以及优选的实施方式将在下面参照附图进行说明。Detailed and preferred embodiments of the present invention will be described below with reference to the accompanying drawings.
附图说明Description of the drawings
图1是示出了根据本发明的监控系统的第一实施例的示意图。Figure 1 is a schematic diagram showing a first embodiment of a monitoring system according to the present invention.
图2是示出了根据本发明的监控系统的图形标记的一个实施例的示意图。Figure 2 is a schematic diagram illustrating one embodiment of graphical markings of the monitoring system according to the present invention.
图3是示出了根据本发明的监控系统的相机的芯片结构的一个实施例的示意图。FIG. 3 is a schematic diagram showing one embodiment of the chip structure of the camera of the monitoring system according to the present invention.
图4是示出了根据本发明的监控系统的第二实施例的示意图。Figure 4 is a schematic diagram showing a second embodiment of the monitoring system according to the present invention.
图5是示出了根据本发明的监控系统的第三实施例的示意图。Figure 5 is a schematic diagram showing a third embodiment of the monitoring system according to the present invention.
图6是示出了根据本发明的监控系统的相机识别和提取人脸面部特征的实施例的示意图。FIG. 6 is a schematic diagram illustrating an embodiment of a camera of the surveillance system recognizing and extracting facial features of a human face according to the present invention.
图7是示出了根据本发明的监控系统的相机拍摄的图像的实施例的示意图。7 is a schematic diagram showing an embodiment of images captured by a camera of the monitoring system according to the present invention.
图8是示出了根据本发明的监控系统的相机确定监控区中图形标记的实施例的示意图。8 is a schematic diagram illustrating an embodiment in which a camera of a surveillance system determines graphic markers in a surveillance area according to the present invention.
图9是示出了根据本发明的监控系统的相机确定监控区中图形标记的另一实施例的示意图。FIG. 9 is a schematic diagram showing another embodiment of a camera of a surveillance system determining graphic markers in a surveillance area according to the present invention.
图10是示出了根据本发明的监控系统的相机确定图像中是否存在图形标记的实施例的流程示意图。FIG. 10 is a flowchart illustrating an embodiment of a camera of a surveillance system determining whether a graphic mark exists in an image according to the present invention.
图11是示出了根据本发明的监控系统的相机的工作流程的示意图。FIG. 11 is a schematic diagram showing the working flow of the camera of the monitoring system according to the present invention.
具体实施方式Detailed ways
在下文中,参照附图对本公开的优选实施方案进行详细地说明。除非特别说明,相同的附图标记表示相同或等同的部件。Hereinafter, preferred embodiments of the present disclosure are explained in detail with reference to the accompanying drawings. Unless otherwise stated, the same reference numbers refer to the same or equivalent parts.
图1示出本发明的监控系统的第一实施例。如图1所示,监控系统的相机1安装在监控区中,可对监控区进行全景监控。监控区内布置有多个目标物2,目标物2分别放置在相应的置物位3上。监控区内还布置有目标物的供应源8,用于在需要时及时向相应的置物位3补充目标物2。例如,监控区可为自动售卖机,目标物2为商品,基于相机1的实时监控,自动售卖机在收到填充指令后,将供应源8中存储的目标物2自动填充相应的置物位。或者,监控区还可以是仓库等的仓储区域,目标物2为仓储货物,置物位3为仓库中的存货位。基于相机1的实时监控,在收到填充指令后,供应源8中的目标物2被自动地补充到相应的置物位3。当然,监控区不限于此,本发明的监控系统可应用于任何有对目标物品的监控需求 的场所。Figure 1 shows a first embodiment of the monitoring system of the present invention. As shown in Figure 1, the camera 1 of the monitoring system is installed in the monitoring area and can perform panoramic monitoring of the monitoring area. A plurality of target objects 2 are arranged in the monitoring area, and the target objects 2 are placed on corresponding storage positions 3 respectively. A supply source 8 of target objects is also arranged in the monitoring area, which is used to replenish the target objects 2 to the corresponding storage position 3 in time when needed. For example, the monitoring area can be a vending machine, and the target object 2 is a commodity. Based on the real-time monitoring of the camera 1, after receiving the filling instruction, the vending machine automatically fills the corresponding storage position with the target object 2 stored in the supply source 8. Alternatively, the monitoring area can also be a storage area such as a warehouse, where the target object 2 is the stored goods and the storage location 3 is the inventory location in the warehouse. Based on the real-time monitoring of the camera 1, after receiving the filling instruction, the target object 2 in the supply source 8 is automatically replenished to the corresponding storage position 3. Of course, the monitoring area is not limited to this. The monitoring system of the present invention can be applied to any area that has monitoring requirements for target items. place.
监控系统还包括至少一个图形标记4。在如图2示出的实施例中,多个图形标记4分别安装在相应的目标物置物位3上。例如,图形标记4可通过粘结剂粘附到置物位3上,而不必对置物位3进行任何修改。或者可选地,可以通过机械手段将图形标记嵌入相应的置物位3中,这种方式可靠地避免了图形标记掉落或由于操作失误而脱落。The monitoring system also includes at least one graphic marker 4 . In the embodiment shown in FIG. 2 , a plurality of graphic marks 4 are respectively installed on corresponding target object storage positions 3 . For example, the graphic mark 4 can be adhered to the storage position 3 by means of an adhesive without any modification of the storage position 3 being necessary. Or alternatively, the graphic mark can be embedded in the corresponding storage position 3 through mechanical means. This method can reliably prevent the graphic mark from falling off or falling off due to operating errors.
如图2的上侧图像所示,在目标物2按照预定布局安放在相应的置物位3上时,图形标记4对相机1是不可见的。例如,图形标记4可粘附在置物位3的表面,使得目标物2安放好后将遮挡住图形标记4。如图2的下侧图像所示,当目标物2不在相应的置物位3上时,置物位3上的图形标记4露出,然后可被相机1捕捉到。相机1可基于在捕获到的图像中检测到图形标记4而判断相应的置物位为空,然后将判断结果发送到相机外部,例如外部处理单元。外部处理单元基于收到的存在空置物位(空的置物位3)的判断结果发送指令给监控区,监控区则根据指令执行相应措施,例如填充空置物位,或发出报警信号等。As shown in the upper image of FIG. 2 , when the target object 2 is placed on the corresponding object position 3 according to the predetermined layout, the graphic mark 4 is invisible to the camera 1 . For example, the graphic mark 4 can be adhered to the surface of the object placement position 3 so that the target object 2 will block the graphic mark 4 after it is placed. As shown in the lower image of FIG. 2 , when the target object 2 is not on the corresponding storage position 3 , the graphic mark 4 on the storage position 3 is exposed and can then be captured by the camera 1 . The camera 1 may determine that the corresponding storage position is empty based on detecting the graphic mark 4 in the captured image, and then send the determination result to the outside of the camera, such as an external processing unit. The external processing unit sends instructions to the monitoring area based on the received judgment result that there is an empty level (empty level 3), and the monitoring area performs corresponding measures according to the instructions, such as filling the empty level, or sending out an alarm signal, etc.
图2示出多个目标物2和多个图形标记4分别安放在多个目标物置物位3中。每个图形标记4是不同的,使得在相机1识别到图形标记4时,可同时确定相应的置物位的具体位置以及相应的目标物信息(例如,种类/类别等)。在这个方面,相机1中包含的电路可预先将每个图形标记4和与其对应的置物位3和/或目标物2的种类、位置坐标等信息关联起来。FIG. 2 shows that multiple target objects 2 and multiple graphic marks 4 are respectively placed in multiple target object placement positions 3 . Each graphic mark 4 is different, so that when the camera 1 recognizes the graphic mark 4, it can simultaneously determine the specific location of the corresponding object location and the corresponding target object information (eg, type/category, etc.). In this regard, the circuit included in the camera 1 can pre-associate each graphic mark 4 with information such as the corresponding object position 3 and/or the type, position coordinates and other information of the target object 2 .
多个图形标记可以在图形、形状和/或颜色上加以区别。当然,本发明不限于此。任何能够彼此区分进而使得相机能够从图像中加以区分的图形标记都可应用于本发明。同时,每个图形标记还应与对应的目标物区分开来,使得相机能够准确地分别识别图形标记和目标物。例如,图像标记可包括与目标物包含的颜色不同的一个或多个颜色。具体地,如图2的下侧图像所示,图形标记4与目标物2相比,呈现较浅的颜色(图中为白色)且具有斜条纹。相机1可通过对图形并结合对该不同颜色的识别,容易且迅速地在拍摄的图像中检测到图形标记,从而判断出空的置物位。出于便于说明和理解的目的,图2仅示出了以单一的颜色深浅以及简单条纹图案加以区分的图形标记和目标物。容易理解的是,图形标记存在多种构成方式,包括但不限于图形、形状、颜色、特殊符号等中的一种或多种的组合。Multiple graphic marks can be distinguished by graphics, shape, and/or color. Of course, the present invention is not limited to this. Any graphic markings that are distinguishable from each other and thereby enable the camera to distinguish them from the image can be used in the present invention. At the same time, each graphic mark should also be distinguished from the corresponding target object, so that the camera can accurately identify the graphic mark and the target object respectively. For example, the image mark may include one or more colors that are different from the colors contained in the target object. Specifically, as shown in the lower image of FIG. 2 , the graphic mark 4 has a lighter color (white in the figure) than the target 2 and has diagonal stripes. The camera 1 can easily and quickly detect the graphic mark in the captured image by recognizing the graphic and the different colors, thereby determining the empty storage position. For ease of illustration and understanding, Figure 2 only shows graphic markers and targets distinguished by a single shade of color and a simple stripe pattern. It is easy to understand that graphic marks can be constructed in a variety of ways, including but not limited to one or a combination of graphics, shapes, colors, special symbols, etc.
下面将详细说明本发明的监控系统使用的相机1。相机1是自身具有图像处理功能的智能相机。相机1的内部包括集成电路芯片,该集成电路芯片可具有如图3所示的层叠结构,例如堆叠式互补金属氧化物半导体(CMOS)图像传感器芯片。具体地,集成芯片由层叠基板结构构成。由多个像素组成的像素阵列布置在层叠基板中的第一基板上,像素阵列通过光电转换将相机镜头接收的光信号转换为电信号,并将该电信号传输到层叠基板中的第二基板上。第二基板中布置有存储器和处理电路,且该第二基板结合到第一基板(图中未示出结合结构)以相互通信。可选地,存储器还可位于层叠基板中的第三基板(未图示)上。即,像素阵列、存储器和处理单路可分别位于不同的基板上。应当理解的是,上述的多个基板的层叠布置方式仅仅是示例性的,可以根据设计需要将像素阵列、像素电路、处理电路、存储器等部件设计在多个基板上,并将这些基板层叠。处理电路例如包括数字信号处理器(DSP,Digital Signal Processor),其基于电信号生成图像数据,并将该图像数据存储在存储器中。存储器中还存储有预先确定的算法模型,处理电路可基于这些算法模型对图像进行各种处理,例如检测监控图像中是否存在图形标记,以及下文中将详细描述的对图像进行的不同处理。The camera 1 used in the monitoring system of the present invention will be described in detail below. Camera 1 is a smart camera with its own image processing function. The interior of the camera 1 includes an integrated circuit chip, which may have a stacked structure as shown in FIG. 3 , such as a stacked complementary metal oxide semiconductor (CMOS) image sensor chip. Specifically, the integrated chip is composed of a stacked substrate structure. A pixel array composed of a plurality of pixels is arranged on the first substrate in the stacked substrate. The pixel array converts the optical signal received by the camera lens into an electrical signal through photoelectric conversion, and transmits the electrical signal to the second substrate in the stacked substrate. superior. A memory and a processing circuit are arranged in the second substrate, and the second substrate is coupled to the first substrate (the coupling structure is not shown in the figure) to communicate with each other. Optionally, the memory may also be located on a third substrate (not shown) in the stacked substrate. That is, the pixel array, memory and processing unit can be located on different substrates. It should be understood that the above-mentioned stacked arrangement of multiple substrates is only exemplary. pixel arrays, pixel circuits, processing circuits, memories and other components can be designed on multiple substrates and stacked according to design requirements. The processing circuit includes, for example, a digital signal processor (DSP), which generates image data based on electrical signals and stores the image data in a memory. Predetermined algorithm models are also stored in the memory, and the processing circuit can perform various processing on the image based on these algorithm models, such as detecting whether there are graphic markers in the surveillance image, and different processing of the image as will be described in detail below.
由此,监控系统的图像处理可在相机端进行,而不是在与相机有线或无线连接的外部处理器(边缘计算机、云服务器、手机终端等)中进行。当然,相机1也能够接入网络,例如获取或更新算法模型等。As a result, the image processing of the surveillance system can be performed on the camera side, rather than in an external processor (edge computer, cloud server, mobile phone terminal, etc.) connected to the camera through wired or wireless connections. Of course, camera 1 can also access the network, for example, to obtain or update algorithm models, etc.
相机1中进行的图像处理的示例如图8所示。图8示出相机1拍摄的监控区的图像示例。图8中左侧图像示出相机1在某时刻拍摄的第一图像。相机1中的电路对第一图像进行处理后初步检测出图形标记4和4’,经过进一步计算处理,例如与预先存储的标准图形标的图像进行比较,确定检测到图形标记4(例如,可能性大于90%)。同时,电路判断图形标记4’为图形标记4的可能性为90%以下。此时,电路可从第一图像中截取包含图形标记4’的关注区域(如图8的左侧图像中虚线框所示),并指示相机1调整拍摄参数,例如焦距、分辨率、曝光时间以及可能的其它参数(白平衡、降噪等等),然后对包含图形标记4’的关注区域进行拍摄,获得图像质量较高的第二图像,如图8右侧图像所示。电路对清晰的第二图像进行处理,确认检测到图形标记4,然后将判断结果发送到相机外部。 An example of image processing performed in Camera 1 is shown in Figure 8. FIG. 8 shows an example of an image of the monitoring area captured by the camera 1 . The left image in FIG. 8 shows the first image captured by the camera 1 at a certain moment. The circuit in the camera 1 processes the first image and initially detects the graphic marks 4 and 4'. After further calculation and processing, for example, comparing it with a pre-stored standard graphic mark image, it is determined that the graphic mark 4 is detected (for example, the probability greater than 90%). At the same time, the circuit determines that the probability that the pattern mark 4' is the pattern mark 4 is 90% or less. At this time, the circuit can intercept the area of interest containing the graphic mark 4' from the first image (as shown in the dotted box in the left image of Figure 8), and instruct the camera 1 to adjust the shooting parameters, such as focal length, resolution, exposure time and possible other parameters (white balance, noise reduction, etc.), and then shoot the area of interest containing the graphic mark 4' to obtain a second image with higher image quality, as shown in the right image of Figure 8. The circuit processes the clear second image, confirms that graphic mark 4 is detected, and then sends the judgment result to the outside of the camera.
相机电路可仅将判断结果发送到外部。例如,在图8所示实施例中,仅将检测到2个图形标记4的结果作为特征数据发送给外部处理单元。外部处理单元基于该检测结果确定监控区出现2个空置物位,并得到与这2个图形标记4相关联的置物位信息和目标物信息。然后,处理单元可向监控区发出相应的指令,例如补充目标物2至相应的空置物位,和/或使监控区发出警报。The camera circuit can send only the judgment result to the outside. For example, in the embodiment shown in FIG. 8 , only the result of detecting two graphic marks 4 is sent to the external processing unit as feature data. Based on the detection results, the external processing unit determines that two vacant material levels appear in the monitoring area, and obtains the material location information and target object information associated with these two graphic marks 4. Then, the processing unit can issue corresponding instructions to the monitoring area, such as replenishing the target object 2 to the corresponding empty level, and/or causing the monitoring area to issue an alarm.
与传统监控系统需要将大量的图像/视频数据发送到外部由外部处理单元进行处理和判断相比,本发明的监控相机仅将作为特征数据的检测结果(例如,“在放置目标物2的m位和n位发现图形标记4”或者“m位和n位为空置物位”)发送到外部,极大地降低了传输数据的量,由此能够显著提高传输质量和速度。并且,传统监控系统采用在监控图像中检测和判断目标物图像的方式来对目标物实施监控,对目标物图像的检测和判断需要复杂的建模和练习过程。与之相比,本发明提出使用图形标记,通过在监控图像中检测图形标记来间接监控目标物。通过使用简单的图形标记,极大地简化了计算模型的建模和练习过程,并且显著降低需要处理的数据量,从而明显改善图像处理速度。再者,与传统监控系统相比,取决于本发明的监控相机本身的处理功能,相机能够实时且自动地根据需要调整拍摄参数和角度,本发明仅需要在监控区布置一个相机。Compared with traditional surveillance systems that need to send a large amount of image/video data to the outside for processing and judgment by an external processing unit, the surveillance camera of the present invention only uses the detection results as feature data (for example, "m where target 2 is placed" bits and n bits found graphic mark 4" or "bit m and n bits are vacant material levels") are sent to the outside, which greatly reduces the amount of transmitted data, thereby significantly improving the quality and speed of transmission. Moreover, traditional surveillance systems monitor targets by detecting and judging target images in surveillance images. The detection and judgment of target images requires complex modeling and practice processes. In contrast, the present invention proposes to use graphic markers to indirectly monitor target objects by detecting graphic markers in monitoring images. By using simple graphical notations, the process of modeling and practicing computational models is greatly simplified, and the amount of data that needs to be processed is significantly reduced, thereby significantly improving image processing speed. Furthermore, compared with the traditional monitoring system, depending on the processing function of the monitoring camera of the present invention, the camera can adjust the shooting parameters and angles in real time and automatically as needed. The present invention only needs to arrange one camera in the monitoring area.
可选地,在某些情况下,相机也可将部分图像数据发送到相机外部。例如,在某些特殊情况下,相机始终无法精确判断是否检测到图形标记(例如,重复进行预定次数的拍摄和检测,可能性始终在例如90%以下时)。此时,相机可通过外部处理单元的请求将如图8所示的第二图像和/或从第一图像或第二图像中截取的关注区域图像发送到相机外部,由用户或更复杂的识别系统进行判断。在这种情况下,需要发送的图像数据量仍然远小于传统监控系统需要发送的图像数据量,因为传统监控系统不会对拍摄的图像进行任何处理。额外可选地,容易理解的是,相机也能够将完整图像数据发送到相机外部。在此情况下,外部处理单元可向相机发出完整数据请求。Optionally, in some cases, the camera can also send part of the image data outside the camera. For example, in some special cases, the camera cannot always accurately determine whether a graphic mark is detected (for example, when a predetermined number of shooting and detection are repeated, the probability is always below 90%, for example). At this time, the camera can send the second image as shown in Figure 8 and/or the area of interest image captured from the first image or the second image to the outside of the camera through a request from the external processing unit, for identification by the user or more complex The system makes a judgment. In this case, the amount of image data that needs to be sent is still much smaller than that of a traditional surveillance system, because the traditional surveillance system does not perform any processing on the captured images. Additionally optionally, it will be appreciated that the camera is also capable of sending complete image data external to the camera. In this case, the external processing unit can issue a complete data request to the camera.
以如上所述方式,相机1能够利用图形标记4自动、快速且准确地判断监控区中的目标物是否存在(例如,是否销售、卸货、盗窃或者意外掉落等)。In the manner described above, the camera 1 can use the graphic mark 4 to automatically, quickly and accurately determine whether the target object in the monitoring area exists (for example, whether it is sold, unloaded, stolen or accidentally dropped, etc.).
并且,以类似的方式,相机1还可对目标物的布局是否正确进行判断。具体来说,相机1获取监控区的第一图像,相机电路从第一图像中截取关注区域,该关注区域包含至少一个目标物。相机电路对关注区域进行处理,获取目标物信息。例如,相机电路可通过将图像中的目标物与预先存储的多种目标物图像进行比较和匹配,识别出目标物的种类/类别。在目标物为具有QR码或条形码等机器可读条码的商品时,相机电路还可通过识别目标物包装上的QR码或条形码等获得目标物的种类/类型等信息。此外,与图8所示方式类似的,相机电路在从第一图像中无法准确获取目标物信息时,可对拍摄参数进行调整,获取包含关注区域的第二图像,然后从具有改进的图像质量的第二图像中提取目标物信息。然后,相机电路可将目标物信息与预先存储的布置信息进行比对,判断该目标物是否放置在正确的位置上。可选地,相机电路还可将目标物信息发送给外部处理单元,由外部处理单元进行后续处理和判断。And, in a similar manner, camera 1 can also determine whether the layout of the target object is correct. Specifically, the camera 1 acquires a first image of the monitoring area, and the camera circuit intercepts an area of interest from the first image, where the area of interest contains at least one target object. The camera circuit processes the area of interest to obtain target information. For example, the camera circuit can identify the type/category of the target by comparing and matching the target in the image with pre-stored images of multiple targets. When the target object is a product with a machine-readable barcode such as a QR code or barcode, the camera circuit can also obtain information such as the type/type of the target object by identifying the QR code or barcode on the package of the target object. In addition, similar to the method shown in Figure 8, when the target information cannot be accurately obtained from the first image, the camera circuit can adjust the shooting parameters to obtain a second image containing the area of interest, and then obtain the second image with improved image quality. Extract target object information from the second image. Then, the camera circuit can compare the target object information with the pre-stored arrangement information to determine whether the target object is placed in the correct position. Optionally, the camera circuit can also send the target object information to an external processing unit, and the external processing unit performs subsequent processing and judgment.
同样与图8实施例的处理方式类似的,相机电路可仅将从图像中提取并识别出的目标物信息数据(例如,“某某可乐”),而不是与之相比数量巨大的图像数据发送到外部,从而显著降低网络带宽的传输压力。当然,相机电路也可基于外部请求将部分或完整图像数据发送至外部。Also similar to the processing method of the embodiment of FIG. 8 , the camera circuit can only extract and identify the target object information data (for example, "such and such Coke") from the image, instead of a huge amount of image data. Sent to the outside, thereby significantly reducing the transmission pressure on network bandwidth. Of course, the camera circuitry can also send partial or complete image data to the outside based on external requests.
另外,在将相机安装到监控区域时,针对不同的监控区域、其中的目标物,以及目标物的布局,相机电路可基于对监控区图像中的图形标记的识别精度预先对拍摄参数进行调整和设置。具体来说,相机可预先对监控区域以多组不同的拍摄参数进行多次拍摄,并对获得的多组图像进行处理和分析。这里的拍摄参数例如包括但不限于拍摄角度、分辨率、曝光时间以及白平衡、降噪等等。基于对多组图像中图形标记的识别精度较高的图像帧的拍摄参数,相机电路确定出适合该监控区及其目标物的优选的拍摄参数范围。从而,基于确定的拍摄参数范围,相机电路可预先对该监控区的拍摄参数进行设置。以此方式,进一步提高了监控时的判断效率和准确率。In addition, when the camera is installed in the monitoring area, the camera circuit can adjust the shooting parameters in advance based on the recognition accuracy of the graphic marks in the monitoring area image for different monitoring areas, targets therein, and the layout of the targets. set up. Specifically, the camera can take multiple shots of the monitoring area with multiple sets of different shooting parameters in advance, and process and analyze the multiple sets of images obtained. The shooting parameters here include, for example, but are not limited to shooting angle, resolution, exposure time, white balance, noise reduction, etc. Based on the shooting parameters of image frames with higher recognition accuracy of graphic marks in multiple sets of images, the camera circuit determines a preferred shooting parameter range suitable for the monitoring area and its target objects. Therefore, based on the determined shooting parameter range, the camera circuit can set the shooting parameters of the monitoring area in advance. In this way, the judgment efficiency and accuracy during monitoring are further improved.
图4示出根据本发明的监控系统的第二实施例。在第二实施例中,监控系统设置在例如自动售卖商店的监控区中。如图4所示,相机1布置为其视角范围覆盖包括多个目标物2 的销售区以及存货区5。销售区包括多个置物位(未图示),每个置物位上都布置有图形标记(未图示)。图形标记与上述针对图2所述的类似。监控区还布置有监测是否有人进入监控区的传感器(未图示)。相机1连接至该传感器,并接收传感器发出的信号。当传感器发出有人进入监控区的信号时,相机1结束待机状态,以预定频率开始对监控区进行拍摄,并对图像进行处理。在检测到一个或多个图形标记时,相机1将检测结果发送到外部处理单元。可选地,相机1基于检测结果直接向存货区5发出填充信号,指示存货区5向相应的空置物位填充目标物。在确定空置物位被填充后,相机1对存货区5进行拍摄,并对拍摄的图像进行处理,判断是否有目标物的供应源为空。如果判断存在空的供应源,则将该判断结果发送到外部处理单元。用户在收到外部处理单元的提醒后,可相应补充供应源。Figure 4 shows a second embodiment of a monitoring system according to the invention. In the second embodiment, the monitoring system is provided in a monitoring area of a vending store, for example. As shown in Figure 4, the camera 1 is arranged so that its viewing angle covers multiple targets 2 sales area and inventory area 5. The sales area includes a plurality of storage locations (not shown), and graphic marks (not shown) are arranged on each storage location. Graphical markings are similar to those described above for Figure 2. The monitoring area is also equipped with sensors (not shown) that monitor whether someone enters the monitoring area. Camera 1 is connected to this sensor and receives the signal from the sensor. When the sensor sends a signal that someone enters the monitoring area, the camera 1 ends the standby state, starts shooting the monitoring area at a predetermined frequency, and processes the images. Upon detecting one or more graphic markers, camera 1 sends the detection results to the external processing unit. Optionally, the camera 1 directly sends a filling signal to the inventory area 5 based on the detection result, instructing the inventory area 5 to fill the target object into the corresponding empty level. After confirming that the empty material position is filled, the camera 1 takes a picture of the inventory area 5 and processes the captured image to determine whether the supply source of the target object is empty. If it is determined that an empty supply source exists, the determination result is sent to the external processing unit. After receiving the reminder from the external processing unit, the user can replenish the supply source accordingly.
图5示出根据本发明的监控系统的第三实施例。第三实施例与第二实施例的不同之处在于,监控系统的相机1布置为能够同时监控销售区、存储区5以及监控区入口。即,相机1还对是否有人进入监控区进行监控。Figure 5 shows a third embodiment of a monitoring system according to the invention. The third embodiment differs from the second embodiment in that the camera 1 of the surveillance system is arranged to be able to simultaneously monitor the sales area, the storage area 5 and the entrance to the surveillance area. That is, camera 1 also monitors whether anyone enters the monitoring area.
具体地,如图5所示,相机以预定频率对监控区入口进行拍摄,并通过检测和提取图像中的人脸面部特征(即,图6中的点状面部特征图)来判断有人进入监控区。检测面部特征通常包含定位图像中的人脸和检测关键面部结构特征。首先,相机电路使用例如基于深度学习的算法来在图像中进行人脸定位,获得人脸边界点。然后,在人脸边界内检测脸部区域中的关键脸部结构特征,例如嘴、右眉、左眉、右眼、左眼、鼻子、下颌,获得如图6所示的特征点。应当理解的是,可以根据需要选择任何已知的面部特征的检测和提取方法,并将相关算法存储在相机的芯片中。当然,在相机连接到网络中时,还可以通过云端进行相关计算,并获得计算结果。Specifically, as shown in Figure 5, the camera takes pictures of the entrance to the surveillance area at a predetermined frequency, and determines whether someone has entered the surveillance area by detecting and extracting facial features in the image (i.e., the dotted facial feature map in Figure 6) district. Detecting facial features typically involves locating faces in images and detecting key facial structural features. First, the camera circuit uses, for example, a deep learning-based algorithm to locate the face in the image and obtain the face boundary points. Then, key facial structural features in the face area are detected within the face boundary, such as the mouth, right eyebrow, left eyebrow, right eye, left eye, nose, and mandible, and the feature points shown in Figure 6 are obtained. It should be understood that any known detection and extraction method of facial features can be selected as needed, and the relevant algorithms can be stored in the chip of the camera. Of course, when the camera is connected to the network, relevant calculations can also be performed through the cloud and the calculation results can be obtained.
相机电路基于检测到人脸面部特征数据而判断有人进入监控区,并且基于该判断开始以预定频率对销售区进行拍摄。可选地或优选地,相机电路可将面部特征数据发送到外部。例如,如图6所示,相机1将检测到的面部特征图通过网络6发送到相机外部的处理单元7。处理单元7以及本文中提及的外部处理单元可以是如图6所示的远程计算机,或者是云端处理器、应用程序,也可以是手机终端的应用程序等等。如图6所示,通过网络6在相机1和处理单元7之间传输的是相机1提取出的面部特征图,该数据无论是与原始图像数据还是与人脸图像数据相比均极大地降低了数据传输量以及传输速度。以此方式,既避免了传输数据量巨大的图像的需要,又可避免暴露消费者的隐私。The camera circuit determines that someone has entered the monitoring area based on the detected face facial feature data, and starts photographing the sales area at a predetermined frequency based on the determination. Alternatively or preferably, the camera circuitry may send the facial feature data externally. For example, as shown in Figure 6, the camera 1 sends the detected facial feature map to the processing unit 7 outside the camera through the network 6. The processing unit 7 and the external processing units mentioned in this article can be a remote computer as shown in Figure 6, or a cloud processor, an application program, or an application program on a mobile phone terminal, etc. As shown in Figure 6, what is transmitted between the camera 1 and the processing unit 7 through the network 6 is the facial feature map extracted by the camera 1. This data is greatly reduced whether compared with the original image data or the face image data. The amount of data transferred and the speed of transfer. In this way, the need to transmit images with huge amounts of data is avoided, while the privacy of consumers is also avoided.
另外,在这个方面,如图7的左侧图像所示,相机1可能会拍摄到包含人手的图像。在某些应用场合,相机1可对该图像中的人手进行处理,提取手部可能包含的胎记或纹身等视觉标记,并将该视觉标记发送到外部。从而,外部处理单元可通过该视觉标记辅助识别拥有该手部的目标对象。由于相机1通过对图像进行处理,仅将视觉标记发送到外部,从而可提高外部处理单元识别目标对象的效率。相反地,在其他应用场合,在目标对象是隐私信息需要被保护的人群时(例如,当监控区为销售场所,而目标对象为普通消费者时),当相机1检测到人手时,则不会将包含人手的图像发送到外部。以此方式,通过在不同的监控场所/情形中,对相机1进行不同的设置,而实现不同的目的。In addition, in this regard, as shown in the left image of Figure 7, camera 1 may capture an image containing human hands. In some applications, camera 1 can process the human hands in the image, extract visual marks such as birthmarks or tattoos that the hands may contain, and send the visual marks to the outside. Thus, the external processing unit can assist in identifying the target object possessing the hand through the visual marker. Since the camera 1 processes the image and only sends the visual mark to the outside, the efficiency of the external processing unit in identifying the target object can be improved. On the contrary, in other application situations, when the target object is a group of people whose private information needs to be protected (for example, when the surveillance area is a sales place and the target object is ordinary consumers), when camera 1 detects human hands, it will not An image containing a human hand is sent externally. In this way, different purposes are achieved by setting up the camera 1 differently in different monitoring places/situations.
此外,图7的右侧图像示出相机1拍摄的包含图形标记4的图像。如图7的箭头所示,如果左侧图像和右侧图像以先后顺序分别出现在连续拍摄(例如,15fps)的图像帧中,则相机1判断对应的目标物2被取走或销售。相反,如果是如图9的A和图7的右侧图像以先后顺序分别出现在连续拍摄的图像帧中,则相机1判断目标物2可能不是被取走而是意外滑落。以此方式,对图像中的人手的检测还能够辅助相机1判断监控区中的商品或货物状况。Furthermore, the right image of FIG. 7 shows an image captured by the camera 1 including the graphic mark 4 . As shown by the arrows in Figure 7, if the left image and the right image respectively appear in sequential order in the image frames of continuous shooting (for example, 15fps), the camera 1 determines that the corresponding target object 2 is taken away or sold. On the contrary, if A in Figure 9 and the right image in Figure 7 appear in sequentially captured image frames, the camera 1 determines that the target 2 may not have been taken away but accidentally slipped. In this way, the detection of human hands in the image can also assist the camera 1 in determining the status of commodities or goods in the monitoring area.
另外,当相机拍摄到如图7中的左侧图像时,由于人手的遮挡,相机可能无法准确判断该图像中是否存在图形标记4。此时,相机将等待一定时间(以等待人手离开),再次进行拍摄。如果经过预定时间,相机仍然无法进行判断,例如,可疑人员滞留在监控区内,则相机电路向外部发出通知信号,以提醒用户。In addition, when the camera captures the image on the left as shown in Figure 7, the camera may not be able to accurately determine whether there is a graphic mark 4 in the image due to the occlusion of the human hand. At this time, the camera will wait for a certain period of time (to wait for the human hand to leave) before shooting again. If the camera is still unable to make a determination after a predetermined time, for example, a suspicious person remains in the monitoring area, the camera circuit will send a notification signal to the outside to alert the user.
图9示出相机1检测到图形标记4的一个实施例。图9的A示出相机1在对监控区进行监控时拍摄到的图像,在该图像中没有检测到图形标记,那么相机1不发送任何信号或图像到外部。相机1继续操作,在拍摄到如图9的B所示的图像时,相机电路检测到图形标记4,然后仅将检测结果发送给外部。可选地,相机电路将检测结果连同指示空的置物位的填充信号一起发送到外部。另外可选地,相机电路将填充信号直接发送给如图1所示的供应 源8或如图4和5所示的存货区5。供应源8或存货区5在接收到来自外部或相机电路的填充信号后,将目标物2补充至相应的置物位。图9的C示出相机1拍摄到的正在填充目标物2的图像,基于该图像以及随后将拍摄到补货完成的图像(如图9的A),相机1判断补货成功。如果在发出填充信号之后的预定时间段,相机拍摄到如图9的D所示的图像,则判断补货失败,并将该判断结果发送给外部。同时,相机1对供应源8或存货区5进行拍摄,并对图片进行分析,判断供应源8或存货区5中相应目标物的供应源是否为空,并将判断结果发送到外部。如果相机1判断相应目标物的供应源为空,可选地,可同时发送补充供应源的信号到外部。另一方面,如果相机1判断相应目标物的供应源不为空,则可能是由于自动填充设备出现故障或将要填充的目标物出现卡顿等原因而导致无法正常填充,此时相机1的电路向外部发出提醒信号,外部处理单元可提醒用户进行查看和检修。FIG. 9 shows an embodiment in which the camera 1 detects the graphic mark 4 . A of FIG. 9 shows an image captured by the camera 1 when monitoring the monitoring area. If no graphic mark is detected in the image, the camera 1 does not send any signal or image to the outside. The camera 1 continues to operate, and when the image shown in B of FIG. 9 is captured, the camera circuit detects the graphic mark 4, and then only sends the detection result to the outside. Optionally, the camera circuitry sends the detection result externally together with a fill signal indicating an empty storage location. Additionally optionally, the camera circuitry sends the fill signal directly to the supply as shown in Figure 1 Source 8 or inventory area 5 as shown in Figures 4 and 5. After receiving the filling signal from the external or camera circuit, the supply source 8 or the inventory area 5 replenishes the target object 2 to the corresponding storage position. C in FIG. 9 shows an image of the target object 2 being filled captured by the camera 1. Based on this image and the subsequent image of the replenishment completed (A in FIG. 9), the camera 1 determines that the replenishment is successful. If the camera captures an image as shown in D in Figure 9 within a predetermined period of time after the filling signal is sent, it is judged that the replenishment has failed, and the judgment result is sent to the outside. At the same time, the camera 1 takes pictures of the supply source 8 or the inventory area 5, analyzes the pictures, determines whether the supply source of the corresponding target object in the supply source 8 or the inventory area 5 is empty, and sends the judgment result to the outside. If the camera 1 determines that the supply source of the corresponding target object is empty, optionally, a signal for replenishing the supply source can be sent to the outside at the same time. On the other hand, if camera 1 determines that the supply source of the corresponding target object is not empty, it may be that the automatic filling equipment fails or the target object to be filled is stuck, resulting in the failure to fill normally. At this time, the circuit of camera 1 Send a reminder signal to the outside, and the external processing unit can remind the user to check and repair.
值得指出的是,图形标记4在如图9的C所示的时刻也会部分露出并被相机1拍摄和检测到,但相机1在这样的情况下并不会不当地发出填充信号。基于相机1存储的算法,相机只有在检测到如图9的B所示的完整的图形标记4时,才会发送存在空置物位的判断结果(以及可选地,填充信号)。相应地,在图9的ABC所示的时刻顺序以外拍摄到如图9的C的图像时,例如,目标物2正在被取出时,相机1也不会错误地发送存在空置物位的判断结果。It is worth pointing out that the graphic mark 4 will also be partially exposed and photographed and detected by the camera 1 at the moment shown in C of Figure 9 , but the camera 1 will not inappropriately send out a fill signal in such a situation. Based on the algorithm stored in camera 1, the camera will only send a judgment result (and optionally, a filling signal) that there is an empty material level when it detects a complete graphic mark 4 as shown in B of Figure 9 . Correspondingly, when the image as shown in C in FIG. 9 is captured outside the time sequence shown in ABC in FIG. 9 , for example, when the target object 2 is being taken out, the camera 1 will not erroneously send the judgment result that there is an empty object level. .
图10示出根据本发明的监控系统的相机电路判断图像中是否存在图形标记的一个具体示例。如图10所示,在相机电路对拍摄的图像进行处理时,如果基于相机中存储的算法模型判断检测到图形标记的可能性为90%及以上时,则确定检测到图形标记;如果判断检测到图形标记的可能性为30%及以下时,则确定没有检测到图形标记。FIG. 10 shows a specific example of the camera circuit of the monitoring system according to the present invention determining whether there is a graphic mark in the image. As shown in Figure 10, when the camera circuit processes the captured image, if the possibility of detecting a graphic mark is judged to be 90% or above based on the algorithm model stored in the camera, it is determined that the graphic mark is detected; if it is determined that the graphic mark is detected When the probability of reaching a graphic mark is 30% or less, it is determined that no graphic mark is detected.
另一方面,如果判断检测到图形标记的可能性为大于30%且小于90%,则相机电路可对拍摄参数进行调整后再次拍摄(count+=1),以基于优化后的图像再次进行分析和判断,直到能够确定是否检测到图形标记。该步骤至多重复3次。如果在第x(x≤3,即图10中所示的count<=3)次的拍摄和分析时,确定检测到图形标记(即,可能性为90%及以上),则进入如图11所示的步骤S5;如果在第x(x≤3)次的拍摄和分析时,确定没有检测到图形标记(即,可能性为30%及以下),则进入如图11所示的步骤S4。On the other hand, if it is determined that the possibility of detecting a graphic mark is greater than 30% and less than 90%, the camera circuit can adjust the shooting parameters and shoot again (count+=1) to analyze and analyze again based on the optimized image. Judge until it can be determined whether a graphic mark is detected. This step is repeated up to 3 times. If it is determined that a graphic mark is detected (ie, the probability is 90% and above) during the xth (x≤3, that is, count<=3 as shown in Figure 10) times of shooting and analysis, then enter Figure 11 Step S5 shown; if it is determined that no graphic mark is detected during the xth (x≤3) shooting and analysis (that is, the probability is 30% and below), then enter step S4 as shown in Figure 11 .
另一方面,如果重复3次以后仍不能确定是否检测到图形标记(可能性均落在30%和90%之间)(即图10中所示的count>3),则相机电路向外部发出提醒信号。用户收到提醒后,可通过外部处理单元向相机请求监控区图像,由用户基于相机发送的监控的部分/完整图像进行人工判断。或者,在相机电路无法判断的原因是出现故障或障碍物时,用户可前往监控区检查。On the other hand, if after repeating 3 times it is still not possible to determine whether the graphic mark is detected (the probability falls between 30% and 90%) (i.e. count>3 shown in Figure 10), the camera circuit sends an external signal Alert signal. After receiving the reminder, the user can request the surveillance area image from the camera through the external processing unit, and the user will make manual judgment based on the partial/complete image of the surveillance sent by the camera. Alternatively, when the camera circuit cannot determine that the cause is a malfunction or obstacle, the user can go to the monitoring area for inspection.
应当理解的是,图10中所示出的可能性数值90%和30%仅是示例性的,用户可以根据设计需要、使用场景等因素选择其它的数值作为阈值。例如,80%和20%等。It should be understood that the possibility values 90% and 30% shown in Figure 10 are only exemplary, and users can select other values as thresholds based on design needs, usage scenarios and other factors. For example, 80% and 20% etc.
图11示出根据本发明的监控系统的相机的工作步骤的一个示例。如图11所示,在步骤S1中,判断是否有人进入监控区。例如,在图4所示的实施例中,设置在监控区入口的传感器检测到有人进入监控区时发送信号给相机1。没有收到该信号时,相机1处于待机状态。或者可选地,如在图5所示的实施例中,在相机1布置为视角覆盖监控区入口时,可对监控区入口进行拍摄,通过对监控区入口图像的处理来判断是否有人进入监控区。这里的判断方式如在上面基于图6描述的。Figure 11 shows an example of the working steps of the camera of the monitoring system according to the present invention. As shown in Figure 11, in step S1, it is determined whether someone enters the monitoring area. For example, in the embodiment shown in FIG. 4 , a sensor installed at the entrance of the monitoring area sends a signal to the camera 1 when it detects that someone enters the monitoring area. When this signal is not received, camera 1 is in standby mode. Or alternatively, as in the embodiment shown in Figure 5, when the camera 1 is arranged with a viewing angle covering the entrance of the monitoring area, the entrance of the monitoring area can be photographed, and whether someone enters the monitoring area can be determined by processing the image of the entrance of the monitoring area. district. The judgment method here is as described above based on Figure 6 .
在相机1检测到有人进入监控区或者收到有人进入监控区的外部信号后,相机1开始对监控区进行拍摄,如步骤S2所示。在S2中,相机1以预定的频率对监控区进行拍摄。相机电路在拍摄的图像中可能检测到图形标记4时进入步骤S3。在S3中,相机电路通过确定是否检测到图形标记来判断是否存在空置物位。当确定未在当前拍摄的一个图像中检测到图形标记4时,或者优选地,在连续拍摄的多帧(预先设定的数目)图像中都未检测到图形标记4时,相机电路作出没有空置物位的判断,然后返回S1。另一方面,在S3中,当相机电路确定在当前拍摄的一个图像中存在图形标记4时,判断与该图形标记4对应的置物位为空。可选地或优选地,例如为了提高判断的准确性,当相机电路在连续拍摄的多帧(预先设定的数目)图像中均检测到图形标记4时,确定与该图形标记4对应的置物位为空,并获得与该空置物位关联的置物位和/或目标物的信息。在同时检测到多个图形标记4时,相机电路分别确定与每个图形标记4对应的置物位的信息(例如,在监控区的具体位置)和/或目标物2 的信息(例如,目标物的种类/类型)。另外,在S3中,相机1的电路可能检测到监控区被障碍物阻挡,例如消费者或其他人员在销售区前徘徊,或者消费者或其他人员的手等部位作为障碍物遮挡了相机1的部分视角。此时,如图10所示,相机在重复检测预定次数(例如,3次)仍然不能判断拍摄图像中是否存在目标物或图形标记时,相机电路将向外部发出提醒信号,提醒用户前往处理障碍物/障碍情形。可选地,相机也能够基于用户请求将部分拍摄图像发送到外部,由用户进行人工判断。After the camera 1 detects that someone enters the monitoring area or receives an external signal that someone enters the monitoring area, the camera 1 starts to photograph the monitoring area, as shown in step S2. In S2, camera 1 takes pictures of the monitoring area at a predetermined frequency. The camera circuit proceeds to step S3 when the graphic mark 4 may be detected in the captured image. In S3, the camera circuit determines whether a vacant level exists by determining whether a graphic mark is detected. When it is determined that the graphic mark 4 is not detected in one image currently captured, or preferably, when the graphic mark 4 is not detected in a plurality of consecutively captured images (a preset number), the camera circuit makes a decision that there is no vacancy. Judgment of material level, and then return to S1. On the other hand, in S3, when the camera circuit determines that there is a graphic mark 4 in an image currently captured, it determines that the storage position corresponding to the graphic mark 4 is empty. Alternatively or preferably, for example, in order to improve the accuracy of judgment, when the camera circuit detects the graphic mark 4 in multiple frames (a preset number) of images continuously captured, determine the object corresponding to the graphic mark 4. The position is empty, and information about the position and/or target associated with the empty position is obtained. When multiple graphic marks 4 are detected at the same time, the camera circuit determines the location information corresponding to each graphic mark 4 (for example, the specific location in the monitoring area) and/or the target object 2 information (e.g., type/type of target). In addition, in S3, the circuit of camera 1 may detect that the monitoring area is blocked by obstacles, such as consumers or other personnel wandering in front of the sales area, or the hands and other parts of consumers or other personnel acting as obstacles blocking camera 1 Partial perspective. At this time, as shown in Figure 10, when the camera still cannot determine whether there is a target or graphic mark in the captured image after repeated detection a predetermined number of times (for example, 3 times), the camera circuit will send a reminder signal to the outside to remind the user to deal with the obstacle. object/obstacle situation. Optionally, the camera can also send part of the captured images to the outside based on the user's request, and the user can make manual judgment.
在S3中检测到一个或多个图形标记4时,进入步骤S5。在S5中,相机电路将S3中的确定结果以及获取的特征信息(例如,“放置目标物X的位于某行某列处的置物位为空”,“放置目标物X的某处置物位、放置目标物Y的某处置物位……为空”)发送到外部,而不是将拍摄到的图像发送到外部。可选地,相机电路同时将填充信号与上述确定结果一起发送到外部。在收到上述确定结果(以及可选地,填充信号)后,外部处理单元发出指令,指示监控区向相应的空置物位填充目标物。另外可选地,相机电路可直接将填充信号发送到监控区的供应源(例如,图1所示的供应源8或图4所示的存货区5),指示供应源8或存货区5向对应的空置物位填充相应的目标物。接着,进入步骤S6。在S6中,等待预定时间(例如,2秒),以待完成目标物填充。然后进入步骤S7。在S7中,相机电路检测空置物位是否确实被填充。在该步骤中,与S3类似地,确定是否在拍摄图像中检测到图形标记。如果没有检测到图形标记4,相机电路判断空置物位已经被填充,然后返回S1;如果检测到图形标记4,则进入步骤S8。在S8中,判断供应源8或存货区5中存放的相应目标物是否为空。具体地,相机1对供应源8或存货区5进行拍摄,通过对所拍摄的图像进行处理,检测相应目标物的供应源8或存货区5是否为空。如果判断相应目标物的供应物源为空时,进入步骤S10。在S10中,相机电路将判断结果发送到外部。可选地,相机电路将补充供应源的指令与判断结果一起发送到外部。收到来自外部处理单元的补充供应源的信号后,用户相应地对供应源8或存货区5进行补给。When one or more graphic marks 4 are detected in S3, step S5 is entered. In S5, the camera circuit combines the determination result in S3 and the acquired feature information (for example, "the object location at a certain row and column where the target object The object position somewhere where the target Y is placed...is empty") is sent to the outside instead of sending the captured image to the outside. Optionally, the camera circuit simultaneously sends the fill signal to the outside together with the above determination result. After receiving the above determination result (and optionally, the filling signal), the external processing unit issues an instruction to instruct the monitoring area to fill the target object into the corresponding vacant material level. Alternatively, the camera circuit may directly send a filling signal to the supply source of the monitoring area (for example, the supply source 8 shown in Figure 1 or the inventory area 5 shown in Figure 4), instructing the supply source 8 or the inventory area 5 to The corresponding empty material level is filled with the corresponding target object. Next, proceed to step S6. In S6, wait for a predetermined time (for example, 2 seconds) to complete the target filling. Then proceed to step S7. In the S7, the camera circuit detects whether the empty level is actually filled. In this step, similarly to S3, it is determined whether a graphic mark is detected in the captured image. If graphic mark 4 is not detected, the camera circuit determines that the empty material level has been filled, and then returns to S1; if graphic mark 4 is detected, it proceeds to step S8. In S8, it is determined whether the corresponding target stored in the supply source 8 or the inventory area 5 is empty. Specifically, the camera 1 photographs the supply source 8 or the inventory area 5, and processes the photographed image to detect whether the supply source 8 or the inventory area 5 of the corresponding target object is empty. If it is determined that the supply source of the corresponding target object is empty, step S10 is entered. In S10, the camera circuit sends the judgment result to the outside. Optionally, the camera circuit sends an instruction to replenish the supply source to the outside together with the judgment result. After receiving a signal from the external processing unit to replenish the supply source, the user replenishes the supply source 8 or the inventory area 5 accordingly.
在S7中,当相机电路无法判断空置物位是否被填充时(例如,拍摄图像不清晰、有障碍物遮挡等),进入步骤S9。在S9中,相机电路优化拍摄参数,重复进行拍摄和判断,直到能够准确判断空置物位是否被填充。可与图10所示实施例类似地,重复步骤至多进行3次。如果第x(x≤3)次的拍摄和分析时,能够确定是否检测到图形标记(从而能够判断空置物位是否被填充),进入S7;如果重复3次后(即,x>3)仍无法确定空置物位是否被填充,则相机电路向外部发出提醒。用户可前往监控区检查,以例如清除障碍物。In S7, when the camera circuit cannot determine whether the vacant material position is filled (for example, the captured image is unclear, blocked by obstacles, etc.), step S9 is entered. In the S9, the camera circuit optimizes the shooting parameters and repeats shooting and judgment until it can accurately judge whether the empty position is filled. Similar to the embodiment shown in FIG. 10 , the steps may be repeated up to three times. If it can be determined whether the graphic mark is detected during the x (x≤3)-th shooting and analysis (thereby being able to determine whether the vacant material level is filled), enter S7; if it is still repeated after 3 times (i.e., x>3) If it cannot be determined whether the empty material level is filled, the camera circuit sends an external reminder. Users can go to the monitored area to check, for example, to clear obstacles.
上述处理步骤作为程序存储在相机1的集成芯片的存储器中,用于在相机中的处理电路读取程序时指示相机在监控过程中的相应操作。可以理解的是,上述程序也可存储在外部处理单元或计算机可读存储介质中,用于在特殊情况下(例如,相机电路无法对图像进行正常操作),由外部处理单元/计算机获取相机拍摄的照片,并在相机外部进行处理。The above processing steps are stored as a program in the memory of the integrated chip of the camera 1, and are used to instruct the camera to perform corresponding operations during the monitoring process when the processing circuit in the camera reads the program. It can be understood that the above program can also be stored in an external processing unit or a computer-readable storage medium, and is used to obtain the camera shots by the external processing unit/computer under special circumstances (for example, the camera circuit cannot operate the image normally). photos and process them outside the camera.
上述提及的图像处理,包括对图像中的图形标记、目标物以及人脸面部特征的处理等,所涉及的计算均可采用已知的用于识别图形的算法或采用已知的计算方式进行建模。在此不再赘述。The above-mentioned image processing includes the processing of graphic marks, targets, and facial features in the image. The calculations involved can be performed using known algorithms for recognizing graphics or using known calculation methods. Modeling. I won’t go into details here.
虽然在上面的实施例中以布置在自动售卖机/商店的监控系统为例进行了描述,但本发明显然不限于此,而是可以应用于任何有监控目标物需求的场合。 Although the above embodiment is described by taking a monitoring system arranged in a vending machine/store as an example, the present invention is obviously not limited to this, but can be applied to any situation where there is a need to monitor target objects.

Claims (13)

  1. 一种监控系统,包括:A monitoring system including:
    相机,其捕获监控区的图像并且存储所述图像;以及a camera that captures images of the surveillance area and stores the images; and
    至少一个图形标记,其被安装到所述监控区中的至少一个置物位,其中,所述至少一个图形标记中的每一者在目标物位于相应的所述置物位时对所述相机是不可见的,At least one graphical mark is installed to at least one placement position in the monitoring area, wherein each of the at least one graphical mark is invisible to the camera when the target object is located at the corresponding placement position. visible,
    其中,所述相机包括电路,所述电路被配置为:Wherein, the camera includes a circuit configured as:
    以预定频率捕获所述监控区的图像,capturing images of the surveillance area at a predetermined frequency,
    确定所述图形标记是否包括在每个捕获图像中,并且determine whether the graphic marker is included in each captured image, and
    发送确定结果到所述相机的外部。Send the determination results to the outside of the camera.
  2. 根据权利要求1所述的监控系统,其中,所述电路进一步被配置为从所述监控区的图像中提取目标物信息,并且将所述目标物信息发送到所述相机的外部。The monitoring system of claim 1, wherein the circuit is further configured to extract target information from the image of the monitoring area and send the target information to the outside of the camera.
  3. 根据权利要求1所述的监控系统,其中,所述至少一个图形标记中的每一者具有选定的颜色,所述选定的颜色与要被放置在相应的所述置物位上的所述目标物的颜色不同。The monitoring system of claim 1, wherein each of said at least one graphical mark has a selected color that is consistent with said item to be placed on the corresponding said storage location. The target objects are of different colors.
  4. 根据权利要求1所述的监控系统,其中,多个图形标记分别被安装到所述监控区中的多个置物位,并且所述多个图形标记彼此是不同的。The monitoring system according to claim 1, wherein a plurality of graphic marks are respectively installed to a plurality of storage positions in the monitoring area, and the plurality of graphic marks are different from each other.
  5. 根据权利要求4所述的监控系统,其中,所述多个图形标记具有不同的图案、形状和/或颜色。The monitoring system of claim 4, wherein the plurality of graphic markers have different patterns, shapes and/or colors.
  6. 根据权利要求4所述的监控系统,其中,所述电路还被配置为将每个所述图像标记与相应的所述置物位和/或要被放置在相应的所述置物位处的所述目标物关联起来。The monitoring system of claim 4, wherein the circuit is further configured to associate each of the image markers with the corresponding storage location and/or the object to be placed at the corresponding storage location. Target objects are associated.
  7. 根据权利要求1所述的监控系统,其中,所述电路还被配置为通过捕获、分析所述监控区的多个图像,根据所述至少一个图形标记在所述监控区的所述多个图像中的识别精度来调整所述相机的拍摄参数。The monitoring system of claim 1, wherein the circuit is further configured to mark the plurality of images in the monitoring area according to the at least one graphic by capturing and analyzing the plurality of images of the monitoring area. The shooting parameters of the camera are adjusted based on the recognition accuracy.
  8. 根据权利要求2所述的监控系统,其中,所述电路还被配置为:The monitoring system of claim 2, wherein the circuit is further configured to:
    从所述监控区的第一图像中截取关注区域,所述关注区域包含位于相应的置物位处的关注目标物,An area of interest is intercepted from the first image of the monitoring area, where the area of interest includes an object of interest located at a corresponding object location,
    捕获所述关注区域的第二图像,以及capturing a second image of the area of interest, and
    从所述第二图像中提取所述关注目标物的所述目标物信息。The target object information of the target object of interest is extracted from the second image.
  9. 根据权利要求1所述的监控系统,其中,所述电路还被配置为:The monitoring system of claim 1, wherein the circuit is further configured to:
    从所述监控区的第一图像中截取关注区域,所述关注区域包含所述至少一个图形标记中的关注图形标记,A region of interest is intercepted from the first image of the monitoring area, and the region of interest contains a graphic mark of concern in the at least one graphic mark,
    捕获所述关注区域的第二图像,以及capturing a second image of the area of interest, and
    确定所述关注图形标记是否包括在所述第二图像中。It is determined whether the graphic mark of interest is included in the second image.
  10. 根据权利要求1所述的监控系统,其中,所述电路还被配置为接收外部信号,并且当所述外部信号指示有人进入所述监控区时,所述相机开始以所述预定频率捕获所述监控区的图像,或者,The monitoring system of claim 1, wherein the circuit is further configured to receive an external signal, and when the external signal indicates that someone enters the monitoring area, the camera begins to capture the predetermined frequency. images of the surveillance area, or,
    其中,所述电路还被配置为通过从所述监控区的图像中检测和提取人脸面部特征来检测是否有人进入所述监控区。Wherein, the circuit is further configured to detect whether someone enters the monitoring area by detecting and extracting facial features from the image of the monitoring area.
  11. 根据权利要求1所述的监控系统,其中,所述电路还被配置为当确定所述图形标记被包括在一个捕获图像中时,发送所述确定结果以及指示空的置物位的填充信号。The monitoring system of claim 1 , wherein the circuit is further configured to, when it is determined that the graphic mark is included in a captured image, send the determination result together with a fill signal indicating an empty storage location.
  12. 根据权利要求1所述的监控系统,其中,所述电路还被配置为当确定所述图形标记被包括在预定数量的连续的捕获图像中时,发送所述确定结果以及指示空的置物位的填充信号。The monitoring system of claim 1 , wherein the circuit is further configured to, when it is determined that the graphic mark is included in a predetermined number of consecutive captured images, transmit the determination result together with a message indicating an empty storage location. Fill signal.
  13. 根据权利要求1-12中任一项所述的监控系统,其中,所述相机包括集成芯片,所述电 路布置在所述集成芯片上,并且所述集成芯片还包括存储器,所述存储器存储有用于所述电路的图像处理的预定算法。 The monitoring system according to any one of claims 1-12, wherein the camera includes an integrated chip, and the electronic Roads are arranged on the integrated chip, and the integrated chip further includes a memory storing a predetermined algorithm for image processing of the circuit.
PCT/CN2023/096516 2022-06-07 2023-05-26 Monitoring system WO2023236789A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202210636273.X 2022-06-07
CN202210636273.XA CN117255173A (en) 2022-06-07 2022-06-07 Monitoring system

Publications (1)

Publication Number Publication Date
WO2023236789A1 true WO2023236789A1 (en) 2023-12-14

Family

ID=89117616

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2023/096516 WO2023236789A1 (en) 2022-06-07 2023-05-26 Monitoring system

Country Status (2)

Country Link
CN (1) CN117255173A (en)
WO (1) WO2023236789A1 (en)

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130134178A1 (en) * 2011-11-29 2013-05-30 Pin Lu Vending machine with automated detection of product position
CN108629659A (en) * 2018-04-27 2018-10-09 北京无人店科技有限公司 The self-service system made an inventory using vision measurement
CN110135324A (en) * 2019-05-09 2019-08-16 程度 A kind of logistics finance cargo monitoring system and operation method
JP2020021367A (en) * 2018-08-02 2020-02-06 富士電機株式会社 Automatic vending machine
DE202020100579U1 (en) * 2020-02-03 2020-02-20 KEMAS Gesellschaft für Elektronik, Elektromechanik, Mechanik und Systeme mbH Goods store
CN111294503A (en) * 2018-12-07 2020-06-16 阿里巴巴集团控股有限公司 Commodity replenishment and commodity identification processing method and device and electronic equipment
CN213024564U (en) * 2020-08-28 2021-04-20 北京映翰通网络技术股份有限公司 Automatic vending machine with goods detection mechanism

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130134178A1 (en) * 2011-11-29 2013-05-30 Pin Lu Vending machine with automated detection of product position
CN108629659A (en) * 2018-04-27 2018-10-09 北京无人店科技有限公司 The self-service system made an inventory using vision measurement
JP2020021367A (en) * 2018-08-02 2020-02-06 富士電機株式会社 Automatic vending machine
CN111294503A (en) * 2018-12-07 2020-06-16 阿里巴巴集团控股有限公司 Commodity replenishment and commodity identification processing method and device and electronic equipment
CN110135324A (en) * 2019-05-09 2019-08-16 程度 A kind of logistics finance cargo monitoring system and operation method
DE202020100579U1 (en) * 2020-02-03 2020-02-20 KEMAS Gesellschaft für Elektronik, Elektromechanik, Mechanik und Systeme mbH Goods store
CN213024564U (en) * 2020-08-28 2021-04-20 北京映翰通网络技术股份有限公司 Automatic vending machine with goods detection mechanism

Also Published As

Publication number Publication date
CN117255173A (en) 2023-12-19

Similar Documents

Publication Publication Date Title
US7733404B2 (en) Fast imaging system calibration
CN102833478B (en) Fault-tolerant background model
CN109389719B (en) Community door access control system and door opening method
JP5480777B2 (en) Object display device and object display method
CN110934572A (en) Body temperature monitoring method and device and readable storage medium
EP3493525B1 (en) Imaging processing program, imaging processing method, and imaging processing device
CN108764052A (en) Image processing method, device, computer readable storage medium and electronic equipment
US20170316575A1 (en) Image processing apparatus, image processing method and program
CN108805024A (en) Image processing method, device, computer readable storage medium and electronic equipment
US8881982B2 (en) Portable encoded information reading terminal configured to acquire images
JP2000278584A (en) Image input device provided with image processing function and recording medium recording its image processing program
TWI709110B (en) Camera calibration method and apparatus, electronic device
CN111358443A (en) Body temperature monitoring method and device
TW201727537A (en) Face recognition system and face recognition method
CN109086694A (en) A kind of face identification system and method
CN111429194B (en) User track determination system, method, device and server
WO2023236789A1 (en) Monitoring system
KR20200090403A (en) Electronic apparatus and the control method thereof
TWI631480B (en) Entry access system having facil recognition
KR101799335B1 (en) Fake Face Distinction System by Using Monocular Camera and Method thereof
CN110621962A (en) Positioning method of movable platform and related device and system
CN112052706B (en) Electronic device and face recognition method
Chowdhury et al. Human surveillance system for security application
JP2000251077A (en) Customer face recognizing method for bank transaction system
US20210084263A1 (en) Surveillance assist device

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 23818954

Country of ref document: EP

Kind code of ref document: A1