WO2019184275A1 - Image processing method, device and system - Google Patents

Info

Publication number
WO2019184275A1
Authority
WO
WIPO (PCT)
Prior art keywords
image
video frame
label
collection device
tag
Prior art date
Application number
PCT/CN2018/106752
Other languages
French (fr)
Chinese (zh)
Inventor
金海善
林圣拿
何溯
杨俊�
Original Assignee
杭州海康威视系统技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from CN201810272370.9A (CN109274926B)
Application filed by 杭州海康威视系统技术有限公司
Publication of WO2019184275A1

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06T: IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T19/00: Manipulating 3D models or images for computer graphics
    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04N: PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00: Television systems
    • H04N7/18: Closed-circuit television [CCTV] systems, i.e. systems in which the video signal is not broadcast

Definitions

  • the present application relates to the field of video surveillance technologies, and in particular, to an image processing method, device, and system.
  • image acquisition devices are provided in many scenes, and related personnel can monitor the scene through video frame images collected by the device.
  • the display content only includes the image itself and the acquisition time of the image.
  • A user who views the video frame image can understand the specific content of the image only by first becoming familiar with the real environment it corresponds to. This display method is therefore not intuitive, and the display effect is poor.
  • An object of the embodiments of the present application is to provide an image processing method, device, and system, which improve the display effect of a video frame image.
  • an image processing method including:
  • the video frame image after the tag is added is displayed according to the preset display rule.
  • the video frame image is a panoramic image
  • the first collecting device is configured to correspond to at least one second collecting device
  • the second collecting device performs image capturing on the sub-scene corresponding to the panoramic image
  • the method further includes:
  • the step of determining at least one target location in the video frame image includes:
  • the first collection device is an augmented reality AR panoramic camera.
  • the step of generating a label according to the image of the sub-scene includes:
  • the step of adding the target information in the sub-scene image to the content of the label includes:
  • identifying the sub-scene image; determining target information in the sub-scene image according to the recognition result; adding the target information to the content of the label;
  • the step of displaying the tagged video frame image according to the preset display rule includes:
  • the content of the added tag is displayed.
  • the step of displaying the tagged video frame image according to the preset display rule includes:
  • the video frame image after the tag is added, and the content of the added tag.
  • display the contents of the added tags including:
  • the method further includes:
  • the clicked label is determined as the target label
  • the content of the target tag is displayed in the video frame image.
  • before the step of determining the at least one target location in the video frame image, the method further includes:
  • the step of determining at least one target location in the video frame image includes:
  • a target location of the added tag is determined according to the tag addition instruction.
  • the step of displaying the tagged video frame image according to the preset display rule includes:
  • Determining a layer display strategy and determining, according to the layer display strategy, a current display layer and a display manner of the current display layer;
  • the label corresponding to the current display layer is displayed.
  • the step of acquiring the sub-scene image collected by the second collection device includes:
  • the step of generating a label includes:
  • the step of detecting whether an abnormal event occurs in the panoramic image comprises:
  • the step of determining the target second collection device corresponding to the abnormal event includes:
  • the method further includes:
  • the step of displaying the tagged video frame image according to the preset display rule includes:
  • the label is displayed in the video frame image in a preset alarm mode.
  • an embodiment of the present application further discloses an image processing apparatus, including: a processor and a memory;
  • a memory for storing a computer program
  • the processor, when executing the program stored in the memory, implements the following steps:
  • the video frame image after the tag is added is displayed according to the preset display rule.
  • the video frame image is a panoramic image
  • the first collecting device is configured to correspond to at least one second collecting device
  • the second collecting device performs image capturing on the sub-scene corresponding to the panoramic image
  • the processor is further configured to implement the following steps:
  • processor is further configured to implement the following steps:
  • processor is further configured to implement the following steps:
  • identifying the sub-scene image; determining target information in the sub-scene image according to the recognition result; adding the target information to the content of the label;
  • processor is further configured to implement the following steps:
  • the content of the added tag is displayed.
  • processor is further configured to implement the following steps:
  • the video frame image after the tag is added, and the content of the added tag.
  • processor is further configured to implement the following steps:
  • processor is further configured to implement the following steps:
  • the clicked label is determined as the target label
  • the content of the target tag is displayed in the video frame image.
  • processor is further configured to implement the following steps:
  • a target location of the added tag is determined according to the tag addition instruction.
  • processor is further configured to implement the following steps:
  • Determining a layer display strategy and determining, according to the layer display strategy, a current display layer and a display manner of the current display layer;
  • the label corresponding to the current display layer is displayed.
  • processor is further configured to implement the following steps:
  • processor is further configured to implement the following steps:
  • processor is further configured to implement the following steps:
  • processor is further configured to implement the following steps:
  • the label is displayed in the video frame image in a preset alarm mode.
  • an embodiment of the present application further discloses an image processing system, including: a first collection device and an image processing device, where
  • the first collecting device is configured to collect a video frame image, and send the collected video frame image to the image processing device;
  • the image processing device is configured to: determine, for a video frame image acquired by the first collection device, at least one target location in the video frame image; add a label at each determined target location, the label being generated according to content input by a user or an image acquired by the second collection device; and display the video frame image with the added label according to the preset display rule.
  • the system further includes: at least one second collection device,
  • the second collection device is configured to perform image collection on a sub-scene corresponding to the panoramic image, where the panoramic image is a video frame image collected by the first collection device;
  • the image processing device is further configured to: acquire a sub-scene image collected by the second collection device; generate a label according to the sub-scene image; and determine, according to calibration information of the first collection device and the second collection device acquired in advance, the target position in the panoramic image of the label corresponding to the second collection device.
  • the first collection device is an augmented reality AR panoramic camera.
  • an embodiment of the present application further discloses a computer readable storage medium that stores a computer program; when the computer program is executed by a processor, any one of the foregoing image processing methods is implemented.
  • an embodiment of the present application further discloses executable program code that, when run, performs any of the image processing methods described above.
  • the label can help the user understand the specific content included in the video frame image; therefore, displaying the video frame image with the added label presents the image content more intuitively, and the display effect is better.
  • FIG. 1 is a first schematic flowchart of an image processing method according to an embodiment of the present application.
  • FIG. 1a is a schematic diagram of a display interface according to an embodiment of the present application.
  • FIG. 1b is a schematic diagram of another display interface provided by an embodiment of the present application.
  • FIG. 2 is a second schematic flowchart of an image processing method according to an embodiment of the present application.
  • FIG. 2a is a schematic diagram of an application scenario provided by an embodiment of the present application.
  • FIG. 3 is a third schematic flowchart of an image processing method according to an embodiment of the present application.
  • FIG. 4 is a schematic structural diagram of an image processing apparatus according to an embodiment of the present application.
  • FIG. 4b is a schematic structural diagram of another image processing device according to an embodiment of the present disclosure.
  • FIG. 5 is a schematic structural diagram of an image processing system according to an embodiment of the present application.
  • an embodiment of the present application provides an image processing method, device, and system.
  • the method can be applied to various image processing devices, and is not specifically limited.
  • FIG. 1 is a schematic flowchart of an image processing method according to an embodiment of the present disclosure, including:
  • S101 Determine, for the video frame image acquired by the first collection device, at least one target location in the video frame image.
  • The processing object of the embodiment of the present application is a video frame image, and the processing provided by the embodiment may be applied to each frame of the video.
  • There are several ways to determine the target location. For example, one or more fixed positions may be set in advance in the video frame image as target positions; for instance, the center of the video frame image may be set as the target position.
  • Alternatively, a user-specified location may be determined as the target location according to a user instruction. Note that for the same video, or for multiple videos of the same scene, the user needs to send the instruction only once; according to that instruction, the target position can be determined in this video or in each of the multiple videos.
  • The installation location of the first collection device is generally fixed, so the scene corresponding to the captured video frame images is substantially unchanged. Across video frame images, the picture content at a preset position therefore usually differs little, as does the picture content at the position specified in the user instruction, so the target position can be determined in multiple video frame images from a single instruction sent by the user.
  • Other ways of determining the target location are also possible; this is not limited.
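  • As an illustration only, the two ways of determining target positions described above (a preset fixed position and a position taken once from a user instruction) can be sketched as follows. The class name `TargetLocator`, its methods, and the 1920x1080 frame size are hypothetical and do not come from the application.

```python
from dataclasses import dataclass, field
from typing import List, Tuple

Point = Tuple[int, int]


@dataclass
class TargetLocator:
    """Holds target positions that are reused for every frame of a fixed camera."""
    width: int
    height: int
    preset: List[Point] = field(default_factory=list)
    user_specified: List[Point] = field(default_factory=list)

    def add_center_preset(self) -> None:
        # A fixed position set in advance: the center of the frame.
        self.preset.append((self.width // 2, self.height // 2))

    def on_user_instruction(self, x: int, y: int) -> None:
        # The user sends the instruction once; the position is remembered.
        if not (0 <= x < self.width and 0 <= y < self.height):
            raise ValueError("position lies outside the frame")
        self.user_specified.append((x, y))

    def targets_for_frame(self, frame_index: int) -> List[Point]:
        # The scene barely changes between frames, so every frame gets
        # the same target positions regardless of its index.
        return self.preset + self.user_specified


# Hypothetical usage: one preset position and one user click.
locator = TargetLocator(width=1920, height=1080)
locator.add_center_preset()
locator.on_user_instruction(100, 200)
```

  • Because the positions are stored rather than recomputed, a single instruction is enough for all later frames, which mirrors the reasoning about a fixed installation location.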
  • S102 Add a label at each determined target position, and the label is generated according to an input instruction or an image acquired by the second collection device.
  • The input instruction may be a tag addition instruction entered by the user.
  • the second collection device may be an acquisition device that is configured in the same scenario as the first collection device. For example, the first collection device performs image collection for the scene A, and the second collection device performs image collection for the sub-scenario A1 in the scene A.
  • the label may include a "tag symbol" and "tag content".
  • the "tag symbol" may be an arrow, a triangle, etc.
  • the "tag symbol" marks a position in the video frame image.
  • the specific format of the label is not limited; the content of the label may be an image collected by other collection devices, or may be some image analysis data, or may be associated data of the scene at the label, and the like, and is not limited.
  • the image analysis data may be a face recognition result, a vehicle recognition result, or the like
  • the associated data of the scene may be an introduction to the scene; if the scene is a traffic checkpoint, the associated data may be traffic flow data or the like.
  • the tag may also include a "tag name", which may be simple text such as "some building" or "some park".
  • For example, if the input instruction consists of the text "some building" entered by the user together with a specific introduction to the building, a label may be generated whose symbol is an arrow, whose name is the text "some building", and whose content is the specific introduction to the building.
  • If the target location is a traffic checkpoint, the label content added there may be video data collected at the checkpoint, a captured image at the checkpoint, traffic flow data at the checkpoint, and the like.
  • Users can design their own labels as needed. Specifically, the user may click a position in the video frame image and input some text or image content; the device performing the scheme may generate a corresponding label from that content and, in that video frame image and in subsequent video frame images, determine the clicked position as the target location and add the generated label there.
  • the label may be generated according to the image collected by the other collection device.
  • the first collection device performs image collection for the scene A
  • the second collection device performs image collection for the sub-scenario A1 in the scene A.
  • the tag may be generated according to the image collected by the second collecting device, and the position corresponding to the sub-scene A1 is determined as the target position in S101, and the tag corresponding to the sub-scene A1 is added at the target position.
  • The labels added to the video frame image may include both labels generated according to the user's needs and labels generated from images collected by other collection devices, so the label types are richer.
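  • A minimal sketch of the label structure described above, with a symbol, an optional name, and content, generated either from user input or from an image collected by a second collection device. The class `Label`, the `source` field, and both helper functions are hypothetical names chosen for illustration.

```python
from dataclasses import dataclass
from typing import Any, Tuple


@dataclass
class Label:
    position: Tuple[int, int]  # target position in the video frame image
    symbol: str                # "tag symbol", e.g. "arrow" or "triangle"
    name: str                  # optional "tag name"; empty if unused
    content: Any               # "tag content": text, image data, analysis data
    source: str                # hypothetical field: "user" or "second_device"


def label_from_user_input(click_xy, name, content, symbol="arrow"):
    # The clicked position becomes the target position of the new label.
    return Label(click_xy, symbol, name, content, "user")


def label_from_sub_scene(target_xy, sub_scene_image, name="", symbol="triangle"):
    # The image from the second collection device becomes the label content.
    return Label(target_xy, symbol, name, sub_scene_image, "second_device")


# Hypothetical usage: one user-defined label and one device-generated label.
building = label_from_user_input(
    (640, 300), "some building", "A specific introduction to the building."
)
a1 = label_from_sub_scene((1200, 700), b"<sub-scene A1 image bytes>", name="A1")
labels = [building, a1]
```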
  • the tag may include a "tag symbol" and "tag content".
  • the "tag symbol" and the "tag content" may be displayed separately.
  • For example, the "tag symbol" may be added to the video frame image while the "tag content" is displayed in an area outside the video frame image, so that the content of the tag does not cover the video frame image and the display effect is better.
  • If the tag further includes a "tag name", the "tag name" may be displayed in the video frame image or in an area outside the video frame image; this is not limited.
  • In the first area, the video frame image after the tag is added may be displayed, and in the second area, the content of the added tag may be displayed.
  • the first area and the second area may be different areas of the same display device, or may be a display area in an adjacent display device, which is not limited.
  • Alternatively, the video frame image with the added label and the content of the added label may be displayed in picture-in-picture form.
  • For example, the video frame image with the added label may be displayed in the main screen area, and the content of the added label in the small screen area.
  • The small screen area may be located on the right, left, upper, or lower side of the main screen area; this is not limited.
  • the "content of the tag” can be of various types, such as video data, captured images, image analysis data, and the like, and different types of data can be displayed in different areas.
  • For example, video data and captured images may be displayed in the small screen area of the picture-in-picture layout or in the second area described above, while image analysis data may be displayed in the video frame image; the specific display manner is not limited.
  • The shape, color, and transparency of the "tag symbol" and the specific type of "tag content" may be set in advance or changed according to user selection.
  • the current display label may be determined; and the content of the current display label is displayed.
  • the display order can be set, and the current display label is determined according to the order.
  • the display order can be determined randomly, or can be set according to the importance degree of each label.
  • Alternatively, the label corresponding to a received display instruction may be determined as the current display label; this is not limited.
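  • The selection of the current display label can be sketched as below, assuming each label carries a numeric `importance` value and that labels are cycled once per display tick. Both assumptions, and the function names, are hypothetical; the application only states that the order may be random or importance-based, or that a display instruction may pick the label.

```python
import random


def display_order(labels, by_importance=True, seed=None):
    """Return labels in display order: by importance, or shuffled."""
    if by_importance:
        return sorted(labels, key=lambda label: label["importance"], reverse=True)
    shuffled = list(labels)
    random.Random(seed).shuffle(shuffled)
    return shuffled


def current_display_label(ordered, tick, instruction=None):
    """Pick the label to show: by display instruction, else by position in the order."""
    if instruction is not None:
        for label in ordered:
            if label["name"] == instruction:
                return label
        raise KeyError(instruction)
    return ordered[tick % len(ordered)]


# Hypothetical labels with made-up importance values.
labels = [
    {"name": "park", "importance": 1},
    {"name": "intersection", "importance": 3},
    {"name": "building", "importance": 2},
]
order = display_order(labels)
```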
  • the clicked tag may be determined as the target tag; the content of the target tag is displayed in the video frame image.
  • the content of the label can be directly displayed in the video frame image.
  • A layer classification policy may be preset, and the layer category of each label is determined according to the policy. In other words, each label is assigned to a layer category. For example, labels may be divided into an intersection label layer, a checkpoint label layer, an area label layer, a building label layer, and so on.
  • the layer display strategy can be determined based on user instructions.
  • the layer display strategy can include the current display layer and how the current display layer is displayed.
  • In the first case, the user instruction includes only the current display layer information, and the device determines the current display layer from it. The device stores the display manner corresponding to each layer, so it can then determine the display manner of the current display layer.
  • In the second case, the user instruction includes both the current display layer information and the display manner information, and the device determines the current display layer and its display manner from the instruction. Both cases are reasonable.
  • the display mode may include: flashing display, jitter display, static display, etc., and is not limited.
  • If the label is displayed separately from its content, the display manner may cover how the label is displayed or how the label content is displayed. For example, the display manner of the building label layer may be: the label is displayed in the video frame image, and the corresponding label content flashes in another area (the second area or the picture-in-picture area).
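  • The two cases of the layer display strategy can be sketched as follows. The stored per-layer display manners in `STORED_MODES`, the instruction format (a dictionary with `layer` and an optional `mode`), and the function names are all assumptions made for illustration.

```python
# Hypothetical display manners stored on the device, one entry per layer category.
STORED_MODES = {
    "intersection": "static",
    "area": "static",
    "building": "flashing",
}


def resolve_layer_strategy(instruction, stored_modes=STORED_MODES):
    """Return (current display layer, display manner) for a user instruction."""
    layer = instruction["layer"]
    if layer not in stored_modes:
        raise KeyError(f"unknown layer: {layer}")
    # Second case: the instruction carries the display manner itself.
    # First case: fall back to the manner stored for that layer.
    mode = instruction.get("mode", stored_modes[layer])
    return layer, mode


def labels_in_layer(labels, layer):
    """Keep only the labels whose layer category is the current display layer."""
    return [label for label in labels if label["layer"] == layer]


# Hypothetical labels already classified into layers.
labels = [
    {"name": "some building", "layer": "building"},
    {"name": "north crossing", "layer": "intersection"},
    {"name": "some tower", "layer": "building"},
]
layer, mode = resolve_layer_strategy({"layer": "building"})
visible = labels_in_layer(labels, layer)
```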
  • A detail image corresponding to the video frame image collected by the first collection device may also be acquired. After S101, according to a pixel correspondence between the detail image and the video frame image acquired in advance, the position in the detail image that corresponds to the target location is determined as the to-be-processed position, and the label added at the target location is also added at the corresponding to-be-processed position. In this embodiment, S103 may include: displaying, according to the preset display rule, the video frame image with the added label and the detail image with the added label.
  • The video frame image acquired in S101 may be a panoramic image. In addition, a detail image corresponding to the panoramic image may be acquired; according to the pixel correspondence between the panoramic image and the detail image, the label added to the panoramic image is mapped to the detail image, so that the label is also added in the detail image.
  • A third collection device may be provided in addition to the first collection device. The first collection device and the third collection device perform image collection for the same scene: the first collection device collects the panoramic image, and the third collection device collects the detail image.
  • The third collection device may be a dome camera, which can rotate and collect detail images from different viewing angles.
  • the pixel point correspondence between the panoramic image and the detail image may be obtained according to calibration information between the first collection device and the third collection device.
  • The dome camera can collect a detail image for each of the four regions of the panoramic image A: detail image B1, detail image B2, and so on.
  • the four detail images can be displayed in turn in a preset order.
  • Suppose the currently displayed detail image is B1 and 10 target positions are determined in area 1, with a label added at each. Correspondingly, there are 10 to-be-processed positions in the detail image B1, and the same 10 labels are added at those positions. Because the number of labels is large, only some of them may be displayed in area 1 of the panoramic image A, while all 10 are displayed in the detail image B1.
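  • The example with area 1 and detail image B1 can be sketched as below. The sketch assumes, purely for illustration, that area 1 is an axis-aligned rectangle of the panoramic image and that the detail image covers exactly that rectangle, so the pixel correspondence reduces to linear scaling; in the application the correspondence comes from calibration information and need not be this simple. All names and sizes are hypothetical.

```python
def panorama_to_detail(point, region, detail_size):
    """Map a panoramic pixel inside `region` to detail-image coordinates.

    region is (left, top, width, height) in panoramic pixels;
    detail_size is (width, height) of the detail image.
    """
    x, y = point
    left, top, region_w, region_h = region
    if not (left <= x < left + region_w and top <= y < top + region_h):
        raise ValueError("point lies outside the region")
    detail_w, detail_h = detail_size
    return ((x - left) * detail_w / region_w, (y - top) * detail_h / region_h)


def split_display(labels, region, detail_size, max_in_panorama):
    """Show only some labels in the panorama, but all of them in the detail image."""
    in_panorama = labels[:max_in_panorama]
    in_detail = [
        (name, panorama_to_detail(pos, region, detail_size)) for name, pos in labels
    ]
    return in_panorama, in_detail


# Hypothetical data: 10 labelled target positions inside area 1.
area_1 = (0, 0, 960, 540)
labels = [(f"tag{i}", (90 * i + 10, 50 * i + 5)) for i in range(10)]
in_panorama, in_detail = split_display(labels, area_1, (1920, 1080), max_in_panorama=3)
```

  • Truncating to the first `max_in_panorama` labels is only one possible policy; ordering by importance before truncating would fit the display-order discussion equally well.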
  • The video frame image with the added label may be displayed in the first area, and the detail image with the added label in the third area; or the video frame image with the added label and the detail image with the added label may be displayed in picture-in-picture form.
  • Alternatively, displaying the label here may mean displaying only the "tag symbol" in the image, with the "tag content" displayed in another area.
  • In the first area, the video frame image with the added label may be displayed; in the second area, the content of the added label; and in the third area, the detail image with the added label.
  • the first area, the second area, and the third area mentioned herein may be different areas of the same display device, or may be display areas in different display devices.
  • the video frame image after the tag is added, the detailed image after the tag is added, and the content of the added tag can be displayed in the form of picture-in-picture.
  • For example, the video frame image with the added label is displayed in the main screen area, the detail image with the added label in the small screen area in the lower left corner, and the content of the added label in the small screen area on the right side.
  • the "tag name" may be displayed in the video frame image, or may be displayed in an area outside the video frame image, which is not limited.
  • FIG. 2 is a second schematic flowchart of an image processing method according to an embodiment of the present disclosure. The embodiment shown in FIG. 2 is based on the embodiment shown in FIG.
  • S201 Acquire a sub-scene image collected by the second collection device.
  • the video frame image collected by the first collection device is a panoramic image
  • the first collection device corresponds to at least one second collection device
  • the second collection device performs image collection on the sub-scene corresponding to the panoramic image.
  • the image collected by the second collection device is a sub-scene image.
  • the first collection device may be an augmented reality AR panoramic camera, so that the collected panoramic image is better.
  • The first collection device may also be a plurality of bullet cameras, whose images are stitched to obtain a panoramic image.
  • The second collection device may be an ordinary camera, such as a dome camera or a capture camera. If the second collection device is a dome camera, the sub-scene image may be a surveillance video image; if it is a capture camera, the sub-scene image may be a snapshot image; this is not limited.
  • For example, a large scene A includes four sub-scenes: A1, A2, A3, and A4. The first collection device performs image collection on scene A, the second collection device 1 on A1, the second collection device 2 on A2, the second collection device 3 on A3, and the second collection device 4 on A4.
  • The first collection device and the second collection device may be the same device, such as an AR eagle eye device, which has an augmented reality function. The AR eagle eye device may integrate a plurality of camera lenses and one dome camera lens: the image obtained by stitching the images from the plurality of camera lenses serves as the panoramic image, and the image captured by the dome camera lens serves as the sub-scene image.
  • The AR eagle eye device may also be provided with a platform for scheduling and managing the plurality of camera lenses and the dome camera lens.
  • the second collection device sends the collected sub-scene image to the device that executes the solution in real time.
  • the device that executes the solution acquires the sub-scene image from the second collection device after receiving the user instruction.
  • the device that executes the solution acquires the sub-scene image from the second collection device corresponding to the abnormal event after detecting an abnormal event in the video frame image (panoramic image) of S101.
  • the abnormal event may be a traffic accident, a robbery event, etc., and is not limited.
  • the embodiment of the present application does not limit the timing of acquiring a sub-scene image.
  • the label may include a "tag symbol" and "tag content".
  • the "tag symbol" may be an arrow, a triangle, etc., and marks a position in the video frame image.
  • The specific form of the label is not limited; the "content of the label" may include the sub-scene image.
  • the tag may also include a "tag name", which may be simple text such as "some building" or "some park".
  • the sub-scene image and/or the target information in the sub-scene image may be added to the content of the tag.
  • the tag contains the target information in the sub-scene image.
  • the target information may include vehicle information in the image, such as a license plate number, a vehicle body color, etc., and may also include road information, such as traffic flow in the road; or
  • the target information may be abnormal event information, such as a traffic accident.
  • the target information may be person information in the image, such as height and gender; or, in the third scheme, the target information may be abnormal event information, such as a robbery or a fire.
  • The target information may be obtained in several ways.
  • The device that executes the solution may identify the sub-scene image acquired in S201 and determine the target information in the sub-scene image according to the recognition result;
  • or the second collection device may have an image recognition function and send the identified target information to the device;
  • or a server connected to the second collection device may identify the sub-scene image and send the identified target information to the device. All of these ways are reasonable.
  • the tag contains both the sub-scene image and the target information in the sub-scene image.
  • the target information can be understood as an introduction or description of the sub-scene image, and the target information can be set around the sub-scene image so that the user can better understand what is happening in the sub-scene image.
  • S101 may be S101A: determining, according to the calibration information of the first collection device and the second collection device that are acquired in advance, a target position of the label corresponding to the second collection device in the panoramic image.
  • The calibration relationship can be understood as the conversion relationship between the panoramic image coordinate system and the sub-scene image coordinate system. For example, suppose there is a position X in the sub-scene A1 whose pixel coordinates in the panoramic image are (x1, y1) and whose pixel coordinates in the sub-scene image acquired by the second collection device 1 are (x2, y2); the calibration relationship is the conversion relationship between (x1, y1) and (x2, y2).
  • related information (calibration information) of the calibration relationship may be acquired in advance, and the calibration information may be used to determine a position of the label of the second collection device in the panoramic image.
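  • One common way to represent such a conversion between two image coordinate systems is a 3x3 projective matrix (a homography). The application does not state which form the calibration information takes, so the matrix form, the function name, and the numeric values below are assumptions; the sketch only shows how a sub-scene pixel (x2, y2) could be converted to a panoramic pixel (x1, y1) once such a matrix is available.

```python
from typing import Sequence, Tuple

Matrix3 = Sequence[Sequence[float]]


def sub_scene_to_panorama(H: Matrix3, x2: float, y2: float) -> Tuple[float, float]:
    """Convert sub-scene pixel (x2, y2) to panoramic pixel (x1, y1) using H."""
    u = H[0][0] * x2 + H[0][1] * y2 + H[0][2]
    v = H[1][0] * x2 + H[1][1] * y2 + H[1][2]
    w = H[2][0] * x2 + H[2][1] * y2 + H[2][2]
    if w == 0:
        raise ValueError("point maps to infinity under this calibration")
    # Divide by the homogeneous coordinate to get pixel coordinates.
    return u / w, v / w


# Hypothetical calibration for second collection device 1: the sub-scene image
# is scaled by 0.25 and placed at offset (400, 300) in the panoramic image.
H_A1 = [
    [0.25, 0.0, 400.0],
    [0.0, 0.25, 300.0],
    [0.0, 0.0, 1.0],
]
x1, y1 = sub_scene_to_panorama(H_A1, 640, 360)
```

  • With this hypothetical matrix, the center (640, 360) of a 1280x720 sub-scene image lands at (560.0, 390.0) in the panoramic image, which would then serve as the target position of the label for that device.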
  • a third collection device is further disposed in addition to the first collection device and the second collection device.
  • For example, the first collection device is a plurality of bullet cameras that acquire the panoramic image, the second collection device is a capture camera that captures snapshot images as sub-scene images, and the third collection device acquires a detail image.
  • Determining at least one target position in the panoramic image and determining, according to the calibration information between the first collection device and the third collection device, the target position corresponding to the position in the detail image as the to-be-processed position;
  • the panoramic image after the label is added, the detailed image after the label is added, and the content of the added label are displayed.
  • When images collected by different devices can only be displayed separately (with no association between the images), a user who needs to follow the images collected by multiple devices has to switch back and forth between them, which is cumbersome.
  • In this solution, the first collection device collects the panoramic image, and the second collection device collects an image of a sub-scene of the panoramic image to obtain a sub-scene image; a label is generated according to the sub-scene image and added to the panoramic image, and the panoramic image with the added label is displayed. The solution thus displays the image collected by the first collection device (the panoramic image) together with the image collected by the second collection device (the label), so the user can follow the images collected by multiple devices without switching, and the operation is simple.
  • it may be detected whether an abnormal event occurs in the panoramic image collected by the first collection device; if so, the target second collection device corresponding to the abnormal event is determined, and the sub-scene image collected by the target second collection device is acquired.
  • the abnormality model may be preset: according to the above description, the abnormal events may include traffic accidents, robberies, fires, etc., and these abnormal events may be simulated in advance to generate corresponding abnormal models.
  • the panoramic image is then matched with the preset anomaly model. If the matching is successful, an abnormal event occurs in the panoramic image.
  • the position where the match is successful is the position of the abnormal event in the panoramic image.
  • abnormal event alarm information sent by another device or by the user for the panoramic image may be received; if the alarm information is received, an abnormal event has occurred in the panoramic image.
  • the device that implements the solution can communicate with other devices, and other devices can send abnormal event alarm information to the device after determining that an abnormal event occurs in the panoramic image.
  • the user can also send an abnormal event alarm message to the device, which is also reasonable.
  • the abnormal event alarm information may carry the position of the abnormal event in the panoramic image.
  • there is a calibration relationship between the first collection device and each of the four second collection devices, and related information of the calibration relationship (calibration information) may be acquired in advance.
  • the calibration information can determine the target second collection device corresponding to the above-mentioned "position of the abnormal event in the panoramic image", that is, the second collection device that performs image acquisition for the sub-scene where the abnormal event is located.
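The lookup described above can be sketched as follows, assuming the calibration information has already been reduced to one rectangle per second collection device in panoramic pixel coordinates. That reduction, the `Coverage` type, the function `find_target_device` and the quadrant layout in `REGIONS` are all hypothetical and serve only as an illustration.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class Coverage:
    """Panoramic-image rectangle covered by one second collection device (assumed)."""
    device_id: str
    left: int
    top: int
    right: int   # exclusive
    bottom: int  # exclusive


def find_target_device(regions: Sequence[Coverage], x: int, y: int) -> Optional[str]:
    """Return the id of the first device whose region contains (x, y), else None."""
    for region in regions:
        if region.left <= x < region.right and region.top <= y < region.bottom:
            return region.device_id
    return None


# Made-up layout: four second collection devices, one per quadrant of a
# 1920x1080 panoramic image.
REGIONS = [
    Coverage("second-1", 0, 0, 960, 540),
    Coverage("second-2", 960, 0, 1920, 540),
    Coverage("second-3", 0, 540, 960, 1080),
    Coverage("second-4", 960, 540, 1920, 1080),
]

if __name__ == "__main__":
    # Position of the abnormal event in the panoramic image.
    print(find_target_device(REGIONS, 1000, 600))
```

The sub-scene image would then be requested from the device whose id is returned; a `None` result means no second collection device covers that position.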
  • S202 is: generating a label corresponding to the abnormal event according to the sub-scene image.
  • a focus area may be divided in the panoramic image in advance; when it is detected that an abnormal event occurs in the panoramic image, it may be determined whether the position of the abnormal event in the panoramic image is within the preset focus area; if so, the label is displayed in the video frame image in a preset alarm mode.
  • for example, if intersection A in the panoramic image is an area that needs attention, intersection A is set as the focus area in the panoramic image in advance; if an abnormal event occurs in the panoramic image and it occurs at intersection A, the label is displayed in the video frame image in the preset alarm mode.
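The focus-area check can be sketched as below, assuming each focus area is stored as a polygon in panoramic pixel coordinates and tested with the standard ray-casting method. The polygon representation, the names `point_in_polygon`, `display_mode` and `INTERSECTION_A`, and the coordinates are assumptions for illustration; the text does not say how focus areas are represented.

```python
from typing import Sequence, Tuple

Point = Tuple[float, float]


def point_in_polygon(x: float, y: float, polygon: Sequence[Point]) -> bool:
    """Ray casting: count polygon edges crossed by a ray going right from (x, y)."""
    inside = False
    n = len(polygon)
    for i in range(n):
        x1, y1 = polygon[i]
        x2, y2 = polygon[(i + 1) % n]
        # Only edges that straddle the horizontal line through y can be crossed.
        if (y1 > y) != (y2 > y):
            x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < x_cross:
                inside = not inside
    return inside


def display_mode(event_pos: Point, focus_areas: Sequence[Sequence[Point]]) -> str:
    """Return "alarm" if the event lies in any focus area, otherwise "normal"."""
    x, y = event_pos
    if any(point_in_polygon(x, y, area) for area in focus_areas):
        return "alarm"
    return "normal"


# Made-up outline of intersection A in the panoramic image.
INTERSECTION_A = [(100.0, 100.0), (300.0, 100.0), (300.0, 250.0), (100.0, 250.0)]

if __name__ == "__main__":
    print(display_mode((200.0, 200.0), [INTERSECTION_A]))
    print(display_mode((50.0, 50.0), [INTERSECTION_A]))
```

A polygon is used rather than a rectangle because areas such as intersections are rarely axis-aligned in a panoramic image; a rectangle is simply the four-vertex case.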
  • if the label and the content of the label are displayed separately, the content may be displayed in the second area or the picture-in-picture area in an alarm mode, for example, a colour change or a shake of the pop-up window; the specific mode is not limited.
  • FIG. 3 is a third schematic flowchart of an image processing method according to an embodiment of the present disclosure. The embodiment shown in FIG. 3 is based on the embodiment shown in FIG.
  • S301 Receive a label adding instruction sent by a user.
  • the user can click on a target such as a building or an intersection in a video frame image, and then input content related to the target (target content); the target content may include text information (such as a building name or other relevant description), and may also contain images.
  • the tag addition instruction can carry the target location (the location clicked by the user) and the target content (the content, text or image input by the user).
  • the user may also obtain the sub-scene image collected by the second collection device and use the acquired sub-scene image as the target content; or the user may use the sub-scene image and the target information in the sub-scene image as the target content (the target information has the same meaning as in the embodiment shown in FIG. 2 and will not be described again).
  • S302 Generate a label according to the label adding instruction.
  • the label may include a "tag symbol” and a "tag content”.
  • the "tag symbol” may be an arrow, a triangle, etc.
  • the "tag symbol” is for marking a position in the video frame image.
  • the specific form of the label is not limited; in this embodiment, the target content input by the user may be used as the content of the label.
  • the tag may also include a "tag name", which may be, for example, some simple text information such as “some building” or “some park”. Part of the content input by the user may also be used as the name of the tag.
  • S101 is S101B: determining the target position of the added tag according to the tag adding instruction.
  • the target location is the location that the above user clicks.
  • the location and content of the label are determined by the user, that is, the user can design his own label according to his own needs, and the user experience is better.
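As an illustration of steps S301 and S302, the sketch below builds a label from a tag adding instruction that carries the clicked position and the target content. The types `TagAddInstruction` and `Label`, the function `generate_label`, the default symbol and the name-length limit are all hypothetical; the text only states that a label may include a symbol, a name and content.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional, Tuple


@dataclass
class TagAddInstruction:
    """Instruction sent by the user: clicked position plus target content."""
    position: Tuple[int, int]
    text: Optional[str] = None
    image_ref: Optional[str] = None  # reference to an image, e.g. a sub-scene image


@dataclass
class Label:
    position: Tuple[int, int]  # target position in the video frame image
    symbol: str                # e.g. "arrow" or "triangle"
    name: str
    content: Dict[str, str] = field(default_factory=dict)


def generate_label(instruction: TagAddInstruction,
                   symbol: str = "arrow",
                   name_length: int = 16) -> Label:
    """Use the target content as label content and part of the text as the name."""
    if instruction.text is None and instruction.image_ref is None:
        raise ValueError("instruction carries no target content")
    content: Dict[str, str] = {}
    if instruction.text is not None:
        content["text"] = instruction.text
    if instruction.image_ref is not None:
        content["image"] = instruction.image_ref
    name = (instruction.text or "image label")[:name_length]
    return Label(instruction.position, symbol, name, content)


if __name__ == "__main__":
    print(generate_label(TagAddInstruction((640, 360), text="City park")))
```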
  • the embodiment of the present application further provides an image processing device, as shown in FIG. 4a, comprising: a processor 401 and a memory 402;
  • the processor 401 is configured to implement any of the above image processing methods when executing a program stored on the memory 402.
  • FIG. 4b is a schematic structural diagram of another image processing apparatus according to an embodiment of the present disclosure, including: a housing 501, a processor 502, a memory 503, a circuit board 504, and a power supply circuit 505, wherein the circuit board 504 is disposed in the housing 501.
  • the processor 502 and the memory 503 are disposed on the circuit board 504;
  • the power supply circuit 505 is configured to supply power to the respective circuits or devices of the image processing apparatus;
  • the memory 503 is configured to store executable program code;
  • the processor 502 runs a program corresponding to the executable program code by reading the executable program code stored in the memory 503, so as to perform the following steps:
  • the video frame image with the tag added is displayed according to the preset display rule.
  • the video frame image is a panoramic image
  • the first collection device corresponds to at least one second collection device
  • the second collection device performs image collection on the sub-scene corresponding to the panoramic image
  • the processor is further configured to implement the following steps:
  • identifying the sub-scene image, and determining target information in the sub-scene image according to the recognition result; adding the target information to the content of the label;
  • the processor is further configured to implement the following steps:
  • the content of the added tag is displayed.
  • the processor is further configured to implement the following steps:
  • the video frame image after the tag is added, and the content of the added tag.
  • the processor is further configured to implement the following steps:
  • the clicked label is determined as the target label
  • the content of the target tag is displayed in the video frame image.
  • the processor is further configured to implement the following steps:
  • a target location of the added tag is determined according to the tag addition instruction.
  • the processor is further configured to implement the following steps:
  • Determining a layer display strategy and determining, according to the layer display strategy, a current display layer and a display manner of the current display layer;
  • the label corresponding to the current display layer is displayed.
  • the processor is further configured to implement the following steps:
  • the label is displayed in the video frame image in a preset alarm mode.
  • the processor is further configured to implement the following steps:
  • the video frame image with the tag added and the detail image with the tag added are displayed according to the preset display rule.
  • the processor is further configured to implement the following steps:
  • in the first area, displaying the video frame image with the label added; in the third area, displaying the detail image with the label added;
  • the video frame image after the tag is added, and the detail image after the tag is added.
  • the label can help the user understand the specific content included in the video frame image; therefore, the video frame image with the label added can display the image content more intuitively, and the display effect is better.
  • the embodiment of the present application further provides an image processing system, where the system may include: a first collection device and an image processing device, where
  • the first collecting device is configured to collect a video frame image, and send the collected video frame image to the image processing device;
  • the image processing device is configured to: for a video frame image collected by the first collection device, determine at least one target location in the video frame image; add a label at each determined target location, the label being generated according to an input instruction or an image collected by the second collection device; and display the video frame image with the label added according to the preset display rule.
  • the system further includes: at least one second collection device (second collection device 1, second collection device 2, second collection device 3, and second collection device 4),
  • the second collection device is configured to perform image collection on a sub-scene corresponding to the panoramic image, where the panoramic image is a video frame image collected by the first collection device;
  • the image processing device is further configured to acquire a sub-scene image collected by the second collection device; generate a label according to the sub-scene image; and determine, according to the pre-acquired calibration information of the first collection device and the second collection device, the target position in the panoramic image of the label corresponding to the second collection device.
  • the image processing device in this embodiment may be a platform device, which may acquire resources from multiple collection devices, display images, and interact with users.
  • the first collection device is an augmented reality AR panoramic camera.
  • the image processing device can also be used to:
  • identifying the sub-scene image, and determining target information in the sub-scene image according to the recognition result; adding the target information to the content of the label;
  • the image processing device can also be used to:
  • the content of the added tag is displayed.
  • the image processing device can also be used to:
  • the video frame image after the tag is added, and the content of the added tag.
  • the image processing device can also be used to:
  • the clicked label is determined as the target label
  • the content of the target tag is displayed in the video frame image.
  • the image processing device can also be used to:
  • a target location of the added tag is determined according to the tag addition instruction.
  • the image processing device can also be used to:
  • Determining a layer display strategy and determining, according to the layer display strategy, a current display layer and a display manner of the current display layer;
  • the label corresponding to the current display layer is displayed.
  • the image processing device can also be used to:
  • the label is displayed in the video frame image in a preset alarm mode.
  • the system may further include: a third collection device;
  • the third collection device is configured to collect a detailed image corresponding to the panoramic image, where the panoramic image is a video frame image collected by the first collection device;
  • the image processing device is further configured to: acquire the detail image collected by the third collection device; determine, according to a pixel point correspondence between the detail image and the video frame image, the position in the detail image corresponding to the target location as a to-be-processed location; add the label added at the target location to the to-be-processed location corresponding to that target location; and display, according to a preset display rule, the video frame image with the label added and the detail image with the label added.
  • the image processing device acquires a video frame image collected by the first collection device, adds a label at a target position in the video frame image, and then displays the video frame image with the label added; the label can help users understand the specific content contained in the video frame image. Therefore, the video frame image with the label added can display the image content more intuitively, and the display effect is better.
  • the embodiment of the present application further provides a computer readable storage medium, where the computer readable storage medium stores a computer program, and when the computer program is executed by the processor, implements any of the above image processing methods.
  • the embodiment of the present application also provides an executable program code for being executed to execute any of the image processing methods described above.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Graphics (AREA)
  • Computer Hardware Design (AREA)
  • General Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Television Signal Processing For Recording (AREA)

Abstract

Disclosed in an embodiment of the present invention are an image processing method, device and system. The method comprises: adding a tag to a target position in a video frame image, and displaying the video frame image added with the tag. The tag can help a user understand a specific content contained in the video frame image. The video frame image added with the tag can more intuitively present a content therein, thereby providing better presentation effect.

Description

Image processing method, device and system
The present application claims priority to Chinese Patent Application No. 201810272370.9, entitled "Image Processing Method, Device and System", filed with the Chinese Patent Office on March 29, 2018, the entire contents of which are incorporated herein by reference.
Technical field
The present application relates to the field of video surveillance technologies, and in particular, to an image processing method, device, and system.
Background
At present, image acquisition devices are installed in many scenes, and relevant personnel can monitor a scene through the video frame images collected by the devices. In general, when a video frame image is displayed, the displayed content includes only the image itself and the acquisition time of the image. A user who views the video frame image has to be familiar with the real environment corresponding to the image and, based on that environment, work out the specific content contained in the image. This way of displaying images is therefore not intuitive, and the display effect is poor.
Summary of the invention
An object of the embodiments of the present application is to provide an image processing method, device, and system that improve the display effect of video frame images.
To achieve the above objective, an embodiment of the present application discloses an image processing method, including:
for a video frame image collected by a first collection device, determining at least one target position in the video frame image;
adding a label at each determined target position, the label being generated according to an input instruction or an image collected by a second collection device;
displaying the video frame image with the label added according to a preset display rule.
Optionally, the video frame image is a panoramic image, the first collection device corresponds to at least one second collection device, and the second collection device performs image collection on a sub-scene corresponding to the panoramic image;
before determining at least one target position in the video frame image, the method further includes:
acquiring a sub-scene image collected by the second collection device;
generating a label according to the sub-scene image;
the step of determining at least one target position in the video frame image includes:
determining, according to pre-acquired calibration information of the first collection device and the second collection device, the target position in the panoramic image of the label corresponding to the second collection device.
Optionally, the first collection device is an augmented reality (AR) panoramic camera.
Optionally, the step of generating a label according to the sub-scene image includes:
adding the sub-scene image and/or target information in the sub-scene image to the content of the label.
Optionally, the step of adding the target information in the sub-scene image to the content of the label includes:
recognizing the sub-scene image, and determining target information in the sub-scene image according to the recognition result; adding the target information to the content of the label;
or, receiving the target information sent by the second collection device; adding the target information to the content of the label;
or, receiving the target information sent by a server communicatively connected to the second collection device; adding the target information to the content of the label.
Optionally, the step of displaying the video frame image with the label added according to the preset display rule includes:
in a first area, displaying the video frame image with the label added;
in a second area, displaying the content of the added label.
Optionally, the step of displaying the video frame image with the label added according to the preset display rule includes:
displaying, in picture-in-picture form, the video frame image with the label added and the content of the added label.
Optionally, displaying the content of the added label includes:
determining a current display label among the added labels;
displaying the content of the current display label.
Optionally, the method further includes:
after detecting that a user clicks a label in the video frame image, determining the clicked label as a target label;
displaying the content of the target label in the video frame image.
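The click handling above can be sketched as a simple hit test, assuming each label is known by its name and its position in the video frame image and that a click within a fixed pixel radius counts as clicking the label. The function `find_clicked_label`, the `LabelEntry` alias and the 20-pixel radius are hypothetical choices, not taken from the application.

```python
import math
from typing import Optional, Sequence, Tuple

LabelEntry = Tuple[str, Tuple[float, float]]  # (label name, position in the frame)


def find_clicked_label(labels: Sequence[LabelEntry],
                       click: Tuple[float, float],
                       radius: float = 20.0) -> Optional[str]:
    """Return the name of the label nearest to the click within radius, else None."""
    best_name: Optional[str] = None
    best_distance = radius
    for name, (lx, ly) in labels:
        distance = math.hypot(click[0] - lx, click[1] - ly)
        if distance <= best_distance:
            best_name, best_distance = name, distance
    return best_name


if __name__ == "__main__":
    labels = [("building", (100.0, 100.0)), ("intersection", (300.0, 200.0))]
    print(find_clicked_label(labels, (305.0, 198.0)))
```

The returned label would be treated as the target label whose content is then displayed in the video frame image.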
Optionally, before the step of determining at least one target position in the video frame image, the method further includes:
receiving a label adding instruction;
generating a label according to the label adding instruction;
the step of determining at least one target position in the video frame image includes:
determining the target position of the added label according to the label adding instruction.
Optionally, the step of displaying the video frame image with the label added according to the preset display rule includes:
determining the layer corresponding to each label according to a preset layer classification strategy;
determining a layer display strategy, and determining, according to the layer display strategy, a current display layer and a display manner of the current display layer;
displaying, in the display manner, the labels corresponding to the current display layer.
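The layer steps above can be sketched as follows, assuming the layer classification strategy is a mapping from a label category to a layer name and the layer display strategy names one current layer and one display manner. The dictionary shapes, the function names `classify_into_layers` and `labels_to_display`, and the category and manner values are hypothetical.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

LabelDict = Dict[str, str]  # e.g. {"name": "Bank", "category": "building"}


def classify_into_layers(labels: List[LabelDict],
                         layer_of_category: Dict[str, str]) -> Dict[str, List[LabelDict]]:
    """Assign each label to a layer; unknown categories go to the "other" layer."""
    layers: Dict[str, List[LabelDict]] = defaultdict(list)
    for label in labels:
        layers[layer_of_category.get(label["category"], "other")].append(label)
    return dict(layers)


def labels_to_display(layers: Dict[str, List[LabelDict]],
                      strategy: Dict[str, str]) -> List[Tuple[str, str]]:
    """Return (label name, display manner) pairs for the current display layer."""
    current = strategy["current_layer"]
    manner = strategy.get("manner", "icon")
    return [(label["name"], manner) for label in layers.get(current, [])]


if __name__ == "__main__":
    all_labels = [
        {"name": "Bank", "category": "building"},
        {"name": "Crossing 1", "category": "traffic"},
        {"name": "Hotel", "category": "building"},
    ]
    policy = {"building": "buildings", "traffic": "traffic"}
    layers = classify_into_layers(all_labels, policy)
    print(labels_to_display(layers, {"current_layer": "buildings", "manner": "icon"}))
```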
Optionally, the step of acquiring the sub-scene image collected by the second collection device includes:
detecting whether an abnormal event occurs in the panoramic image;
if so, determining a target second collection device corresponding to the abnormal event;
acquiring the sub-scene image collected by the target second collection device;
the step of generating a label according to the sub-scene image includes:
generating, according to the sub-scene image, a label corresponding to the abnormal event.
Optionally, the step of detecting whether an abnormal event occurs in the panoramic image includes:
matching the panoramic image with a preset anomaly model;
if the matching is successful, it indicates that an abnormal event occurs in the panoramic image;
or, determining whether abnormal event alarm information for the panoramic image is received;
if it is received, it indicates that an abnormal event occurs in the panoramic image.
Optionally, the step of determining the target second collection device corresponding to the abnormal event includes:
determining the position of the abnormal event in the panoramic image;
determining, according to pre-acquired calibration information of the first collection device and each second collection device, the target second collection device corresponding to the position.
Optionally, in a case where it is detected that an abnormal event occurs in the panoramic image, the method further includes:
determining whether the position of the abnormal event in the panoramic image is located in a preset focus area;
if so, the step of displaying the video frame image with the label added according to the preset display rule includes:
displaying the label in the video frame image in a preset alarm mode.
To achieve the above objective, an embodiment of the present application further discloses an image processing device, including: a processor and a memory;
the memory is configured to store a computer program;
the processor is configured to implement the following steps when executing the program stored in the memory:
for a video frame image collected by a first collection device, determining at least one target position in the video frame image;
adding a label at each determined target position, the label being generated according to user input content or an image collected by a second collection device;
displaying the video frame image with the label added according to a preset display rule.
Optionally, the video frame image is a panoramic image, the first collection device corresponds to at least one second collection device, and the second collection device performs image collection on a sub-scene corresponding to the panoramic image;
the processor is further configured to implement the following steps:
acquiring a sub-scene image collected by the second collection device;
generating a label according to the sub-scene image;
determining, according to pre-acquired calibration information of the first collection device and the second collection device, the target position in the panoramic image of the label corresponding to the second collection device.
Optionally, the processor is further configured to implement the following steps:
adding the sub-scene image and/or target information in the sub-scene image to the content of the label.
Optionally, the processor is further configured to implement the following steps:
recognizing the sub-scene image, and determining target information in the sub-scene image according to the recognition result; adding the target information to the content of the label;
or, receiving the target information sent by the second collection device; adding the target information to the content of the label;
or, receiving the target information sent by a server communicatively connected to the second collection device; adding the target information to the content of the label.
Optionally, the processor is further configured to implement the following steps:
in a first area, displaying the video frame image with the label added;
in a second area, displaying the content of the added label.
Optionally, the processor is further configured to implement the following steps:
displaying, in picture-in-picture form, the video frame image with the label added and the content of the added label.
Optionally, the processor is further configured to implement the following steps:
determining a current display label among the added labels;
displaying the content of the current display label.
Optionally, the processor is further configured to implement the following steps:
after detecting that a user clicks a label in the video frame image, determining the clicked label as a target label;
displaying the content of the target label in the video frame image.
Optionally, the processor is further configured to implement the following steps:
receiving a label adding instruction;
generating a label according to the label adding instruction;
determining the target position of the added label according to the label adding instruction.
Optionally, the processor is further configured to implement the following steps:
determining the layer corresponding to each label according to a preset layer classification strategy;
determining a layer display strategy, and determining, according to the layer display strategy, a current display layer and a display manner of the current display layer;
displaying, in the display manner, the labels corresponding to the current display layer.
Optionally, the processor is further configured to implement the following steps:
detecting whether an abnormal event occurs in the panoramic image;
if so, determining a target second collection device corresponding to the abnormal event;
acquiring the sub-scene image collected by the target second collection device;
generating, according to the sub-scene image, a label corresponding to the abnormal event.
Optionally, the processor is further configured to implement the following steps:
matching the panoramic image with a preset anomaly model;
if the matching is successful, it indicates that an abnormal event occurs in the panoramic image;
or, determining whether abnormal event alarm information for the panoramic image is received;
if it is received, it indicates that an abnormal event occurs in the panoramic image.
Optionally, the processor is further configured to implement the following steps:
determining the position of the abnormal event in the panoramic image;
determining, according to pre-acquired calibration information of the first collection device and each second collection device, the target second collection device corresponding to the position.
Optionally, the processor is further configured to implement the following steps:
in a case where it is detected that an abnormal event occurs in the panoramic image, determining whether the position of the abnormal event in the panoramic image is located in a preset focus area;
if so, displaying the label in the video frame image in a preset alarm mode.
To achieve the above object, an embodiment of the present application further discloses an image processing system, including a first collection device and an image processing device, wherein:
The first collection device is configured to collect video frame images and send the collected video frame images to the image processing device;
The image processing device is configured to: determine, for a video frame image collected by the first collection device, at least one target position in the video frame image; add a label at each determined target position, the label being generated according to content input by a user or an image collected by a second collection device; and display the labeled video frame image according to a preset display rule.
Optionally, the system further includes at least one second collection device;
The second collection device is configured to collect images of a sub-scene corresponding to a panoramic image, the panoramic image being the video frame image collected by the first collection device;
The image processing device is further configured to: acquire the sub-scene image collected by the second collection device; generate a label according to the sub-scene image; and determine, according to pre-acquired calibration information of the first collection device and the second collection device, the target position in the panoramic image of the label corresponding to the second collection device.
Optionally, the first collection device is an augmented reality (AR) panoramic camera.
To achieve the above object, an embodiment of the present application further discloses a computer-readable storage medium storing a computer program which, when executed by a processor, implements any one of the image processing methods described above.
To achieve the above object, an embodiment of the present application further discloses executable program code which, when run, performs any one of the image processing methods described above.
By applying the embodiments of the present application, a label is added at a target position in a video frame image, and the labeled video frame image is then displayed. The label helps the user understand the specific content of the video frame image, so the labeled video frame image presents the image content more intuitively and the display effect is better.
Brief Description of the Drawings
To describe the embodiments of the present application and the technical solutions of the prior art more clearly, the drawings required by the embodiments and the prior art are briefly introduced below. Obviously, the drawings described below show merely some embodiments of the present application, and a person of ordinary skill in the art can obtain other drawings from them without creative effort.
FIG. 1 is a first schematic flowchart of an image processing method according to an embodiment of the present application;
FIG. 1a is a schematic diagram of a display interface according to an embodiment of the present application;
FIG. 1b is a schematic diagram of another display interface according to an embodiment of the present application;
FIG. 2 is a second schematic flowchart of an image processing method according to an embodiment of the present application;
FIG. 2a is a schematic diagram of an application scenario according to an embodiment of the present application;
FIG. 3 is a third schematic flowchart of an image processing method according to an embodiment of the present application;
FIG. 4a is a schematic structural diagram of an image processing device according to an embodiment of the present application;
FIG. 4b is a schematic structural diagram of another image processing device according to an embodiment of the present application;
FIG. 5 is a schematic structural diagram of an image processing system according to an embodiment of the present application.
Detailed Description
To make the objects, technical solutions, and advantages of the present application clearer, the present application is described in further detail below with reference to the drawings and embodiments. Obviously, the described embodiments are only some, not all, of the embodiments of the present application. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present application without creative effort fall within the scope of protection of the present application.
To solve the above technical problem, embodiments of the present application provide an image processing method, device, and system. The method can be applied to various image processing devices, which is not specifically limited.
An image processing method provided by an embodiment of the present application is first described in detail below.
FIG. 1 is a schematic flowchart of an image processing method according to an embodiment of the present application, including:
S101: For a video frame image collected by a first collection device, determine at least one target position in the video frame image.
The processing object of the embodiments of the present application is a video frame image; each frame of a video can be processed using the solution provided by the embodiments of the present application.
There are several ways to determine the target position. For example, one or more fixed positions may be preset in the video frame image as target positions; for instance, the center of the video frame image may be set as the target position. Alternatively, a position specified by the user may be determined as the target position according to a user instruction. It should be noted that, for one video segment, or for multiple video segments of the same scene, the user may send the instruction only once, and the target position can then be determined in every frame of that segment or those segments according to the instruction.
It can be understood that the installation position of the first collection device is usually fixed, and the scene corresponding to the video frame images it collects is largely unchanged. Therefore, across the video frame images, the picture content at the preset position usually differs little, as does the picture content at the position specified in the user instruction, so the target position can be determined in multiple video frame images according to a single instruction sent by the user.
The target position may also be determined in other ways, which is not specifically limited.
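The two approaches to S101 described above (a preset fixed position, or positions specified once by the user and reused for every frame) can be sketched as follows. This is a minimal illustration only; the function name `determine_target_positions` and the choice of the frame center as the preset position are assumptions for this example.

```python
from typing import List, Optional, Tuple

Position = Tuple[int, int]  # (x, y) pixel coordinates in the video frame image


def determine_target_positions(
    frame_width: int,
    frame_height: int,
    user_positions: Optional[List[Position]] = None,
) -> List[Position]:
    """Return the target positions for one video frame image.

    Positions specified by the user (sent once per video or scene) are reused
    for every frame; otherwise a preset fixed position is used, here the
    center of the frame.
    """
    if user_positions:
        return list(user_positions)
    return [(frame_width // 2, frame_height // 2)]


# Preset position for a 1920x1080 frame, then a user-specified position.
preset = determine_target_positions(1920, 1080)
chosen = determine_target_positions(1920, 1080, [(300, 420)])
```

Because the first collection device is usually fixed, the same call can be applied unchanged to each frame of the video.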
S102: Add a label at each determined target position, the label being generated according to an input instruction or an image collected by a second collection device.
The input instruction may be an instruction to add a label, entered by the user. The second collection device may be a collection device located in the same scene as the first collection device; for example, the first collection device collects images of scene A, and the second collection device collects images of sub-scene A1 within scene A.
For example, a label may include a "label symbol" and "label content". The "label symbol" may be a geometric figure such as an arrow or a triangle; its purpose is to mark, in the video frame image, that a label exists at that position, and its specific form is not limited. The "label content" may be an image collected by another collection device, image analysis data, data associated with the scenery at the label, and so on, which is not specifically limited.
The image analysis data may be a face recognition result, a vehicle recognition result, or the like. The data associated with the scenery may be an introduction to the scenery or, if the scenery is a traffic checkpoint, traffic flow data or the like. In addition, the label may include a "label name", which may be brief text such as "XX Building" or "XX Park".
For example, assuming the input instruction consists of the text "XX Building" entered by the user together with a detailed introduction to the building, a label can be generated whose label symbol is an arrow, whose label name is the text "XX Building", and whose label content is the detailed introduction to the building.
As another example, if the target position is a traffic checkpoint, the content of the label added there may be video data collected at the checkpoint, snapshot images taken at the checkpoint, traffic flow data at the checkpoint, and so on.
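The label structure described above (symbol, optional name, content) might be represented as in the following sketch. The class name `Label`, its field names, and the dictionary layout of the content are assumptions introduced for illustration; the application does not prescribe a data format.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, Optional


@dataclass
class Label:
    """One label placed at a target position (hypothetical representation)."""
    symbol: str = "arrow"              # "label symbol": arrow, triangle, ...
    name: Optional[str] = None         # "label name": brief text, optional
    content: Dict[str, Any] = field(default_factory=dict)  # "label content"


def label_from_user_input(name: str, description: str) -> Label:
    """Build a label from text entered by the user, as in the building example."""
    return Label(symbol="arrow", name=name, content={"description": description})


building = label_from_user_input("XX Building", "A detailed introduction to the building.")

# A checkpoint label whose content mixes several kinds of data; the keys and
# values are placeholders, not real data.
checkpoint = Label(
    symbol="triangle",
    name="Checkpoint",
    content={"snapshot": "snapshot.jpg", "traffic_flow_per_hour": 0},
)
```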
As one implementation, users can design their own labels according to their needs. Specifically, the user may click a position in the video frame image and enter some text or image content. The device executing this solution generates a corresponding label from the content entered by the user, determines the clicked position as the target position in that video frame image and the subsequent video frame images, and adds the generated label at that target position.
Alternatively, as another implementation, the label may be generated from images collected by other collection devices. For example, the first collection device collects images of scene A, and the second collection device collects images of sub-scene A1 within scene A. A label can then be generated from the image collected by the second collection device, the position corresponding to sub-scene A1 is determined as the target position in S101, and the label corresponding to sub-scene A1 is added at that target position.
Alternatively, the labels added to the video frame image may include both labels generated according to user needs and labels generated from images collected by other collection devices, which gives a richer variety of labels.
S103: Display the labeled video frame image according to a preset display rule.
As described above, a label may include a "label symbol" and "label content". As one implementation, the "label symbol" and the "label content" may be displayed separately. For example, the "label symbol" may be added to the video frame image while the "label content" is displayed in an area outside the video frame image, so that the label content does not cover the video frame image and the display effect is better. If the label also includes a "label name", the "label name" may be displayed either in the video frame image or in an area outside it, which is not specifically limited.
For example, the labeled video frame image may be displayed in a first area, and the content of the added labels in a second area. The first area and the second area may be different areas of the same display device, or display areas of adjacent display devices, which is not specifically limited.
Alternatively, as in the interface shown in FIG. 1a, the labeled video frame image and the content of the added labels may be displayed in picture-in-picture form. Specifically, the labeled video frame image may be displayed in the main screen area and the content of the added labels in a small screen area. The small screen area may be located anywhere relative to the main screen area, such as on its right, left, upper, or lower side, which is not specifically limited.
As described above, the "label content" may be of several types, such as video data, snapshot images, and image analysis data, and different types of data may be displayed in different areas. For example, the video data and snapshot images may be displayed in the small screen area of the picture-in-picture layout or in the second area described above, while the image analysis data is displayed in the video frame image, and so on. The specific display manner is not limited.
In addition, the specific shape, color, and transparency of the "label symbol", the specific type of the "label content", and so on may be preset or changed according to user selection.
If many labels are added, the labels may be displayed in an overlapping manner, or only the content of some of the labels may be displayed in the second area or the picture-in-picture small screen area. Specifically, a current display label may be determined among the added labels, and the content of the current display label is displayed.
There are several ways to determine the current display label. For example, a display order may be set and the current display label determined according to that order; the display order may be determined randomly or set according to the importance of each label, which is not specifically limited. Alternatively, after a display instruction from the user for a certain label is received, the label corresponding to the display instruction may be determined as the current display label, and so on, which is not specifically limited.
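As a rough sketch of the selection logic described above, the following function gives priority to a user display instruction and otherwise ranks labels by importance. The function name, the numeric importance scores, and the `max_shown` limit are assumptions for illustration only.

```python
from typing import Dict, List, Optional


def select_current_labels(
    label_ids: List[str],
    importance: Dict[str, int],
    max_shown: int,
    requested: Optional[str] = None,
) -> List[str]:
    """Choose which labels have their content displayed right now.

    A display instruction from the user (requested) takes priority; otherwise
    labels are ordered by importance, highest first, and only the first
    max_shown are kept. Labels without a score are treated as importance 0.
    """
    if requested is not None and requested in label_ids:
        return [requested]
    ranked = sorted(label_ids, key=lambda lid: importance.get(lid, 0), reverse=True)
    return ranked[:max_shown]


ids = ["gate", "tower", "park"]
scores = {"gate": 1, "tower": 3, "park": 2}
by_importance = select_current_labels(ids, scores, max_shown=2)
by_request = select_current_labels(ids, scores, max_shown=2, requested="gate")
```

A random display order, which the text also allows, could be obtained by shuffling the list instead of sorting it.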
As one implementation, if it is detected that the user clicks a label in the video frame image, the clicked label may be determined as a target label, and the content of the target label is displayed in the video frame image.
It can be understood that if the user clicks a label in the video frame image, the content of that label can be displayed directly in the video frame image to better respond to the user's needs.
As one implementation, a layer classification policy may be preset, and the layer category of each label is determined according to that policy. In other words, the labels are divided into different layer categories. For example, labels may be divided into an intersection label layer, a checkpoint label layer, an area label layer, a building label layer, and so on.
In this implementation, a layer display strategy may be determined according to a user instruction. The layer display strategy may include the current display layer and the display manner of the current display layer.
In the first case, the user instruction contains only the current display layer information. The device determines the current display layer from the user instruction; in addition, the device stores the display manner corresponding to each layer, so it can further determine the display manner of the current display layer. In the second case, the user instruction contains both the current display layer information and the display manner information, and the device determines both the current display layer and its display manner from the user instruction. Both cases are reasonable.
The display manner may include flashing display, jitter display, static display, and so on, which is not specifically limited.
It should be noted that if, in this implementation, the label and the label content are displayed separately, the display manner may cover both the label and the label content. For example, the display manner corresponding to the building label layer may be: the label is displayed with jitter in the video frame image, and the corresponding label content is displayed flashing in another area (the second area or the picture-in-picture area).
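The two cases for resolving the layer display strategy can be illustrated with the following sketch. The dictionary format of the user instruction, the layer names, and the stored default manners are all assumptions made for this example.

```python
from typing import Dict, Optional, Tuple

# Display manner stored on the device for each layer (hypothetical values).
STORED_DISPLAY_MANNERS: Dict[str, str] = {
    "intersection": "static",
    "checkpoint": "flashing",
    "area": "static",
    "building": "jitter",
}


def resolve_layer_display(
    instruction: Dict[str, Optional[str]],
    stored: Dict[str, str] = STORED_DISPLAY_MANNERS,
) -> Tuple[str, str]:
    """Return (current display layer, display manner).

    Case 1: the instruction names only the layer, so the manner stored on the
    device is used (falling back to "static" for an unknown layer).
    Case 2: the instruction names both the layer and the manner.
    """
    layer = instruction["layer"]
    manner = instruction.get("manner") or stored.get(layer, "static")
    return layer, manner


case_1 = resolve_layer_display({"layer": "building"})
case_2 = resolve_layer_display({"layer": "building", "manner": "flashing"})
```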
As one implementation, a detail image corresponding to the video frame image collected by the first collection device may also be acquired. After S101, the position in the detail image corresponding to the target position is determined as a pending position according to a pre-acquired pixel correspondence between the detail image and the video frame image, and the label added at the target position is also added at the corresponding pending position. In this implementation, S103 may include: displaying the labeled video frame image and the labeled detail image according to the preset display rule.
For example, the video frame image acquired in S101 may be a panoramic image. In addition, a detail image corresponding to the panoramic image may be acquired, and, according to the pixel correspondence between the panoramic image and the detail image, the labels added to the panoramic image are mapped to the detail image so that labels are also added to the detail image.
Specifically, a third collection device may be provided in addition to the first collection device. The first collection device and the third collection device collect images of the same scene: the first collection device collects the panoramic image, and the third collection device collects the detail image. The third collection device may be a dome camera, which can rotate and collect detail images from different viewing angles. The pixel correspondence between the panoramic image and the detail image can be obtained from calibration information between the first collection device and the third collection device.
For example, assume that panoramic image A includes four areas: area 1, area 2, area 3, and area 4. The dome camera can collect the detail images corresponding to these four areas, namely detail image B1, detail image B2, detail image B3, and detail image B4. The four detail images may be displayed in turn in a preset order.
Assume that the currently displayed detail image is B1, and that 10 target positions have been determined in area 1 and labels added at them. Correspondingly, detail image B1 also contains 10 pending positions, and the same 10 labels are added at those pending positions. In one case, because the number of labels is large, only some of the labels may be displayed in area 1 of panoramic image A, while all 10 labels are displayed in detail image B1.
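To make the mapping from target positions to pending positions concrete, the following sketch assumes the simplest possible pixel correspondence: the detail image shows one axis-aligned rectangular area of the panoramic image, scaled to the detail image size. A real correspondence derived from calibration will generally be more complex; the rectangle model and the function name `panorama_to_detail` are assumptions for illustration.

```python
from typing import List, Tuple

Point = Tuple[float, float]
Rect = Tuple[float, float, float, float]  # (x, y, width, height) in the panorama


def panorama_to_detail(
    target_positions: List[Point],
    area: Rect,
    detail_size: Tuple[float, float],
) -> List[Point]:
    """Map target positions inside one panorama area to pending positions in
    the detail image of that area; positions outside the area are skipped."""
    ax, ay, aw, ah = area
    dw, dh = detail_size
    pending = []
    for x, y in target_positions:
        if ax <= x < ax + aw and ay <= y < ay + ah:
            pending.append(((x - ax) * dw / aw, (y - ay) * dh / ah))
    return pending


# Area 1 is assumed to be the 400x200 top-left corner of the panorama, shown
# in a 1600x800 detail image; the second position lies outside area 1.
pending = panorama_to_detail([(100, 50), (900, 50)], (0, 0, 400, 200), (1600, 800))
```

The label added at each target position would then be added again at the corresponding pending position in the detail image.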
As one implementation, the labeled video frame image may be displayed in the first area and the labeled detail image in a third area; alternatively, the labeled video frame image and the labeled detail image may be displayed in picture-in-picture form.
In one case, displaying a label here means displaying only the "label symbol", with the "label content" displayed in another area. For example, the labeled video frame image may be displayed in the first area, the content of the added labels in the second area, and the labeled detail image in the third area. The first area, second area, and third area mentioned here may be different areas of the same display device or display areas of different display devices.
As another example, as shown in FIG. 1b, the labeled video frame image, the labeled detail image, and the content of the added labels may be displayed in picture-in-picture form. In FIG. 1b, the labeled video frame image is displayed in the main screen area, the labeled detail image in the small screen area at the lower left corner, and the content of the added labels in the small screen area on the right. Many display manners are possible, which is not specifically limited.
If the label also includes a "label name", the "label name" may be displayed either in the video frame image or in an area outside the video frame image, which is not specifically limited.
By applying the embodiment shown in FIG. 1 of the present application, a label is added at a target position in a video frame image, and the labeled video frame image is then displayed. The label helps the user understand the specific content of the video frame image, so the labeled video frame image presents the image content more intuitively and the display effect is better.
FIG. 2 is a second schematic flowchart of an image processing method according to an embodiment of the present application. On the basis of the embodiment shown in FIG. 1, the embodiment shown in FIG. 2 further includes, before S101:
S201: Acquire a sub-scene image collected by a second collection device.
In the embodiment shown in FIG. 2, the video frame image collected by the first collection device is a panoramic image, and the first collection device corresponds to at least one second collection device. The second collection device collects images of a sub-scene corresponding to the panoramic image, and the image collected by the second collection device is a sub-scene image.
As one implementation, the first collection device may be an augmented reality (AR) panoramic camera, so that the collected panoramic image is of better quality.
Alternatively, the first collection device may be multiple bullet cameras, whose collected images are stitched to obtain the panoramic image.
The second collection device may be an ordinary camera, such as a dome camera or a snapshot camera. If the second collection device is a dome camera, the sub-scene image may be a surveillance video image; if the second collection device is a snapshot camera, the sub-scene image may be a snapshot picture, and so on, which is not specifically limited.
For example, as shown in FIG. 2a, a larger scene A contains four sub-scenes A1, A2, A3, and A4. The first collection device collects images of scene A, second collection device 1 collects images of A1, second collection device 2 collects images of A2, second collection device 3 collects images of A3, and second collection device 4 collects images of A4.
As another example, the first collection device and the second collection device may be the same device, such as an AR Hawkeye device, which has an augmented reality function. Multiple bullet camera lenses and one dome camera lens may be integrated in the AR Hawkeye device; the image stitched from the multiple bullet camera lenses serves as the panoramic image, and the image collected by the dome camera lens serves as the sub-scene image. The AR Hawkeye device may also be provided with a platform that schedules and manages the multiple bullet camera lenses and the dome camera lens.
In the first scheme, the second collection device sends the collected sub-scene images in real time to the device executing this solution.
In the second scheme, the device executing this solution acquires the sub-scene image from the second collection device after receiving a user instruction.
In the third scheme, after detecting that an abnormal event has occurred in the video frame image (panoramic image) of S101, the device executing this solution acquires the sub-scene image from the second collection device corresponding to the abnormal event. The abnormal event may be a traffic accident, a robbery, and so on, which is not specifically limited.
The embodiments of the present application do not limit the timing of acquiring the sub-scene image.
S202: Generate a label according to the sub-scene image.
For example, a label may include a "label symbol" and "label content". The "label symbol" may be a geometric figure such as an arrow or a triangle; its purpose is to mark, in the video frame image, that a label exists at that position, and its specific form is not limited. The "label content" may include the sub-scene image. In addition, the label may include a "label name", which may be brief text such as "XX Building" or "XX Park".
As one implementation, the sub-scene image and/or target information in the sub-scene image may be added to the content of the label.
That is to say, in the first case, the label contains only the sub-scene image: the sub-scene image acquired in S201 is added to the content of the label.
In the second case, the label contains the target information in the sub-scene image.
For example, if the scene covered by the panoramic image in S101 is a traffic intersection, the target information may include vehicle information in the image, such as license plate number and body color, or road information, such as the traffic flow on the road. Alternatively, in the third scheme described above, the target information may be abnormal event information, such as a traffic accident.
If the scene covered by the panoramic image in S101 is the interior of a corridor, the target information may be information about persons in the image, such as height and gender. Alternatively, in the third scheme described above, the target information may be abnormal event information, such as a robbery or a fire.
There are several ways to obtain the target information. For example: (1) the device executing this solution may perform recognition on the sub-scene image acquired in S201 and determine the target information in the sub-scene image from the recognition result; (2) the second collection device may have an image recognition function and send the recognized target information to this device; (3) a server communicatively connected to the second collection device may perform recognition on the sub-scene image and send the recognized target information to this device. All of these ways are reasonable.
In the third case, the label contains both the sub-scene image and the target information in the sub-scene image.
The target information can be understood as an introduction to or description of the sub-scene image. It may be placed around the sub-scene image so that the user can better understand what is happening in the sub-scene image.
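The three cases for the label content (sub-scene image only, target information only, or both) can be sketched as follows. The function name, the dictionary keys, and the example values are hypothetical; how the target information is recognized is outside the scope of this sketch.

```python
from typing import Any, Dict, Optional


def build_label_content(
    sub_scene_image: Optional[Any] = None,
    target_info: Optional[Dict[str, Any]] = None,
) -> Dict[str, Any]:
    """Assemble the label content from whichever inputs are available.

    Case 1: only the sub-scene image is given.
    Case 2: only target information (e.g. plate number, event type) is given.
    Case 3: both are given, and the information accompanies the image.
    """
    content: Dict[str, Any] = {}
    if sub_scene_image is not None:
        content["image"] = sub_scene_image
    if target_info:
        content["target_info"] = dict(target_info)
    return content


# Placeholder values: a file name standing in for image data, and made-up
# recognition results for a traffic intersection scene.
both = build_label_content("sub_scene_A1.jpg", {"body_color": "white", "event": "traffic accident"})
image_only = build_label_content("sub_scene_A1.jpg")
```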
In the embodiment shown in FIG. 2, S101 may be S101A: determining, according to pre-acquired calibration information of the first collection device and the second collection device, the target position in the panoramic image of the label corresponding to the second collection device.
Those skilled in the art will understand that, in the scene shown in FIG. 2a, a calibration relationship exists between the first collection device and the four second collection devices. The calibration relationship can be understood as the conversion relationship between the panoramic image coordinate system and the sub-scene image coordinate system. For example, suppose a position X exists in sub-scene A1, its pixel coordinates in the panoramic image are (x1, y1), and its pixel coordinates in the sub-scene image collected by second collection device 1 are (x2, y2); the calibration relationship is then the conversion relationship between (x1, y1) and (x2, y2).
In this embodiment, information about the calibration relationship (the calibration information) may be acquired in advance, and the calibration information can then be used to determine the position in the panoramic image that corresponds to the label of the second collection device.
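The coordinate conversion described above can be sketched as follows. The source does not say what form the calibration information takes; this sketch assumes, purely for illustration, that it is a 3x3 planar homography from sub-scene pixel coordinates to panoramic pixel coordinates. The names `map_point` and `SUB_TO_PANO` and the matrix values are hypothetical.

```python
from typing import Sequence, Tuple

Matrix3 = Sequence[Sequence[float]]
Point = Tuple[float, float]


def map_point(h: Matrix3, point: Point) -> Point:
    """Apply a 3x3 planar homography to a pixel coordinate."""
    x, y = point
    u = h[0][0] * x + h[0][1] * y + h[0][2]
    v = h[1][0] * x + h[1][1] * y + h[1][2]
    w = h[2][0] * x + h[2][1] * y + h[2][2]
    if w == 0:
        raise ValueError("point maps to infinity under this homography")
    return (u / w, v / w)


# Assumed calibration for one second collection device: its sub-scene image
# is a 2x magnified view of the panorama region whose corner is (100, 50),
# so (x1, y1) = (0.5 * x2 + 100, 0.5 * y2 + 50).
SUB_TO_PANO = [
    [0.5, 0.0, 100.0],
    [0.0, 0.5, 50.0],
    [0.0, 0.0, 1.0],
]

# (x2, y2) in the sub-scene image -> (x1, y1) in the panoramic image.
print(map_point(SUB_TO_PANO, (40.0, 20.0)))  # (120.0, 60.0)
```

With such a mapping, the label position could be obtained by converting a reference point of the sub-scene image, such as its center, into panoramic coordinates; which point is used is likewise an assumption.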
In one implementation, a third collection device is provided in addition to the first collection device and the second collection device. For example, the first collection device is a plurality of bullet cameras that collect the panoramic image, the second collection device is a snapshot camera that collects snapshot images as sub-scene images, and the third collection device collects a detail image.
This implementation may proceed as follows:
1. Acquire the panoramic image collected by the first collection device, the sub-scene image collected by the second collection device, and the detail image collected by the third collection device;
2. Determine at least one target position in the panoramic image and, according to calibration information between the first and third collection devices, determine the position in the detail image that corresponds to the target position, as the to-be-processed position;
3. Generate a label from the sub-scene image collected by the second collection device, in other words, use the sub-scene image as the content of the label;
4. According to calibration information between the first and second collection devices, determine the target position in the panoramic image of the label corresponding to the second collection device, and add the label at the determined target position; also add the label at the to-be-processed position corresponding to the determined target position;
5. According to a preset display rule, display the labeled panoramic image, the labeled detail image, and the content of the added label.
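Steps 2 and 4 above amount to placing the same label twice: once at the target position in the panoramic image and once at the corresponding to-be-processed position in the detail image. A minimal sketch follows, assuming the calibration between the first and third collection devices is available as a function from panoramic to detail coordinates. `LabeledImage`, `add_label_to_both`, and `scale_by_four` are hypothetical names, and the 4x scale is an invented example.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Tuple

Point = Tuple[float, float]


@dataclass
class LabeledImage:
    """An image plus the labels placed on it, each as (position, content)."""
    name: str
    labels: List[Tuple[Point, str]] = field(default_factory=list)


def add_label_to_both(
    panorama: LabeledImage,
    detail: LabeledImage,
    target_position: Point,
    content: str,
    pano_to_detail: Callable[[Point], Point],
) -> Point:
    """Add one label to the panorama and mirror it onto the detail image."""
    # Map the target position into the detail image (the to-be-processed position).
    to_be_processed = pano_to_detail(target_position)
    panorama.labels.append((target_position, content))
    detail.labels.append((to_be_processed, content))
    return to_be_processed


def scale_by_four(p: Point) -> Point:
    """Assumed calibration: the detail image is the panorama magnified 4x."""
    return (p[0] * 4.0, p[1] * 4.0)


panorama = LabeledImage("panorama")
detail = LabeledImage("detail")
pending = add_label_to_both(
    panorama, detail, (100.0, 50.0), "sub_scene_1.jpg", scale_by_four
)
print(pending)  # (400.0, 200.0)
```

Storing the label content as a file name is only a placeholder; in the text the content is the sub-scene image itself.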
In existing solutions, images collected by different devices can only be displayed separately (no association exists between the images). A user who needs to follow images from multiple devices has to switch back and forth among them, which is cumbersome.
By contrast, in the embodiment shown in FIG. 2 of the present application, the first collection device collects the panoramic image, and the second collection device collects images of a sub-scene in the panoramic image to produce a sub-scene image; a label is generated from the sub-scene image and added to the panoramic image, and the labeled panoramic image is displayed. This solution therefore displays the image collected by the first collection device (the panoramic image) in association with the image collected by the second collection device (the label), so the user can follow images from multiple devices without switching, and operation is simple.
The third scheme mentioned in the embodiment shown in FIG. 2 is described below.
Specifically, whether an abnormal event occurs in the panoramic image collected by the first collection device may be detected; if so, the target second collection device corresponding to the abnormal event is determined, and the sub-scene image collected by the target second collection device is acquired.
In one implementation, abnormal models may be set in advance. As described above, abnormal events may include traffic accidents, robberies, fires, and the like; these abnormal events can be simulated in advance to generate corresponding abnormal models. The panoramic image is then matched against the preset abnormal models. A successful match indicates that an abnormal event has occurred in the panoramic image, and the position of the match is the position of the abnormal event in the panoramic image.
Alternatively, in another implementation, abnormal event alarm information for the panoramic image may be received from another device or from a user; receiving such alarm information also indicates that an abnormal event has occurred in the panoramic image.
It will be understood that the device executing this solution may be communicatively connected to other devices, and another device may send abnormal event alarm information to this device after determining that an abnormal event has occurred in the panoramic image. A user may also send abnormal event alarm information to this device, which is likewise reasonable. The abnormal event alarm information may carry the position of the abnormal event in the panoramic image.
As described above, in the scene shown in FIG. 2a a calibration relationship exists between the first collection device and the four second collection devices. In this embodiment, information about the calibration relationship (the calibration information) may be acquired in advance; using it, the target second collection device corresponding to the "position of the abnormal event in the panoramic image" can be determined, that is, the second collection device that collects images of the sub-scene in which the abnormal event is located.
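One way to picture this lookup is given below. The sketch assumes the calibration information has already been reduced to one axis-aligned rectangle per second collection device, expressed in panoramic pixel coordinates; the source does not specify this representation. The device identifiers, the 1920x1080 panorama size, and `find_target_device` are hypothetical.

```python
from typing import Dict, Optional, Tuple

# (left, top, right, bottom) in panoramic pixel coordinates.
Rect = Tuple[int, int, int, int]


def find_target_device(
    regions: Dict[str, Rect], position: Tuple[int, int]
) -> Optional[str]:
    """Return the device whose sub-scene region contains the event position."""
    x, y = position
    for device_id, (left, top, right, bottom) in regions.items():
        if left <= x < right and top <= y < bottom:
            return device_id
    return None  # the event lies outside every calibrated sub-scene


# Assumed layout: four second collection devices, one per panorama quadrant.
REGIONS: Dict[str, Rect] = {
    "second_device_1": (0, 0, 960, 540),
    "second_device_2": (960, 0, 1920, 540),
    "second_device_3": (0, 540, 960, 1080),
    "second_device_4": (960, 540, 1920, 1080),
}

print(find_target_device(REGIONS, (1200, 700)))  # second_device_4
```

If sub-scenes overlap, this sketch returns the first match in dictionary order; a real deployment would need its own tie-breaking rule.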
In this scheme, S202 is: generating, according to the sub-scene image, a label corresponding to the abnormal event.
In addition, in this scheme, key areas may be marked out in the panoramic image in advance. When an abnormal event is detected in the panoramic image, it may be determined whether the position of the abnormal event in the panoramic image lies within a preset key area; if so, the label is displayed in the video frame image in a preset alarm mode.
For example, if intersection A in the panoramic image is an area that needs close attention, intersection A is set as a key area in the panoramic image in advance; if an abnormal event is detected in the panoramic image and it occurs at intersection A, the label is displayed in the video frame image in the preset alarm mode.
There are various preset alarm modes, such as flashing, shaking, or directly outputting a prompt message. Note that if the implementation that displays the label and the label content separately is used, the label content may also be displayed in an alarm mode in the second area or the picture-in-picture area, for example with a pop-up window that changes color or shakes; no specific limitation is imposed.
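The key-area check can be sketched as follows, assuming key areas are stored as rectangles in panoramic pixel coordinates. The rectangle for intersection A and the names `AlarmMode`, `in_key_area`, and `choose_display_mode` are hypothetical; the three enum members mirror the alarm modes listed above.

```python
from enum import Enum
from typing import List, Optional, Tuple

# (left, top, right, bottom) in panoramic pixel coordinates.
Rect = Tuple[int, int, int, int]


class AlarmMode(Enum):
    FLASH = "flash"
    SHAKE = "shake"
    PROMPT = "prompt"


def in_key_area(position: Tuple[int, int], key_areas: List[Rect]) -> bool:
    """Check whether the event position lies inside any preset key area."""
    x, y = position
    return any(
        left <= x < right and top <= y < bottom
        for (left, top, right, bottom) in key_areas
    )


def choose_display_mode(
    position: Tuple[int, int],
    key_areas: List[Rect],
    alarm_mode: AlarmMode = AlarmMode.FLASH,
) -> Optional[AlarmMode]:
    """Return the alarm mode for events in a key area, else None (normal display)."""
    return alarm_mode if in_key_area(position, key_areas) else None


# Assumed key area covering intersection A.
KEY_AREAS: List[Rect] = [(200, 300, 600, 700)]

print(choose_display_mode((300, 400), KEY_AREAS))  # AlarmMode.FLASH
print(choose_display_mode((50, 50), KEY_AREAS))    # None
```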
With the above scheme, the occurrence of abnormal events in the panoramic image can be monitored and labels can be generated for them, which improves the monitoring effect.
FIG. 3 is a third schematic flowchart of the image processing method provided by an embodiment of the present application. On the basis of the embodiment shown in FIG. 1, the embodiment shown in FIG. 3 further includes, before S101:
S301: receiving a label adding instruction sent by a user.
For example, in the interface shown in FIG. 1a, the user may click a target in the video frame image, such as a building or an intersection, and then enter content related to the target (the target content). The target content may include text information (such as the building name or other relevant description), or it may include an image.
When the device executing this solution detects the user's click and receives the target content sent by the user, it considers that a label adding instruction sent by the user has been received. In other words, the label adding instruction may carry the target position (the position the user clicked) and the target content (the content the user entered, whether text or an image).
Note that the user may also obtain the sub-scene image collected by the second collection device and use it as the target content, or the user may use the sub-scene image together with the target information in the sub-scene image (which has the same meaning as the target information in the embodiment shown in FIG. 2 and is not described again) as the target content.
S302: generating a label according to the label adding instruction.
For example, a label may include a "label symbol" and the "label content". The label symbol may be a geometric figure such as an arrow or a triangle; its purpose is to mark in the video frame image that a label exists at that position, and its specific form is not limited. In this embodiment, the target content entered by the user may be used as the label content.
In addition, a label may include a "label name", which may be brief text such as "XX Building" or "XX Park". Part of the content entered by the user may also be used as the label name.
In this case, S101 is S101B: determining the target position of the added label according to the label adding instruction. The target position is the position the user clicked, as described above.
With the embodiment shown in FIG. 3 of the present application, the position and content of a label are determined by the user; that is, users can design their own labels according to their needs, which gives a better user experience.
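S301, S302, and S101B can be pictured with a small data model. This is a sketch under stated assumptions, not the structure used in the application: the class and field names are hypothetical, the target content is limited to text for brevity (the text also allows images), and deriving the label name from the text before the first comma is an invented rule standing in for "part of the content entered by the user".

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class LabelAddInstruction:
    """What the instruction carries: the clicked position and the entered content."""
    target_position: Tuple[int, int]
    target_content: str


@dataclass(frozen=True)
class Label:
    position: Tuple[int, int]  # S101B: where the label is added
    symbol: str                # marks that a label exists at this position
    name: str                  # brief text shown with the symbol
    content: str               # full content entered by the user


def generate_label(instruction: LabelAddInstruction, symbol: str = "arrow") -> Label:
    """S302: build a label from a label adding instruction."""
    # Assumed naming rule: use the text before the first comma as the name.
    name = instruction.target_content.split(",", 1)[0].strip()
    return Label(
        position=instruction.target_position,
        symbol=symbol,
        name=name,
        content=instruction.target_content,
    )


instruction = LabelAddInstruction((320, 240), "Example Tower, main entrance")
label = generate_label(instruction)
print(label.name)      # Example Tower
print(label.position)  # (320, 240)
```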
Corresponding to the above method embodiments, an embodiment of the present application further provides an image processing device.
As shown in FIG. 4a, the image processing device includes a processor 401 and a memory 402;
the memory 402 is configured to store a computer program;
the processor 401 is configured to implement any of the above image processing methods when executing the program stored in the memory 402.
FIG. 4b is a schematic structural diagram of another image processing device provided by an embodiment of the present application. It includes a housing 501, a processor 502, a memory 503, a circuit board 504, and a power supply circuit 505. The circuit board 504 is placed inside the space enclosed by the housing 501, and the processor 502 and the memory 503 are arranged on the circuit board 504. The power supply circuit 505 supplies power to the circuits and components of the image processing device; the memory 503 stores executable program code; and the processor 502 runs a program corresponding to the executable program code by reading the executable program code stored in the memory 503, so as to perform the following steps:
for a video frame image collected by a first collection device, determining at least one target position in the video frame image;
adding a label at each determined target position, the label being generated according to an input instruction or an image collected by a second collection device;
displaying the labeled video frame image according to a preset display rule.
In one implementation, the video frame image is a panoramic image, the first collection device corresponds to at least one second collection device, and each second collection device collects images of a sub-scene corresponding to the panoramic image;
the processor is further configured to implement the following steps:
acquiring a sub-scene image collected by the second collection device;
generating a label according to the sub-scene image;
determining, according to pre-acquired calibration information of the first collection device and the second collection device, the target position in the panoramic image of the label corresponding to the second collection device.
In one implementation, the processor is further configured to implement the following steps:
adding the sub-scene image and/or the target information in the sub-scene image to the content of the label.
In one implementation, the processor is further configured to implement the following steps:
recognizing the sub-scene image, determining the target information in the sub-scene image from the recognition result, and adding the target information to the content of the label;
or, receiving the target information sent by the second collection device, and adding the target information to the content of the label;
or, receiving the target information sent by a server communicatively connected to the second collection device, and adding the target information to the content of the label.
In one implementation, the processor is further configured to implement the following steps:
displaying the labeled video frame image in a first area;
displaying the content of the added label in a second area.
In one implementation, the processor is further configured to implement the following steps:
displaying the labeled video frame image and the content of the added label in picture-in-picture form.
In one implementation, the processor is further configured to implement the following steps:
determining a current display label among the added labels;
displaying the content of the current display label.
In one implementation, the processor is further configured to implement the following steps:
after detecting that the user has clicked a label in the video frame image, determining the clicked label as the target label;
displaying the content of the target label in the video frame image.
In one implementation, the processor is further configured to implement the following steps:
receiving a label adding instruction;
generating a label according to the label adding instruction;
determining the target position of the added label according to the label adding instruction.
In one implementation, the processor is further configured to implement the following steps:
determining the layer corresponding to each label according to a preset layer classification policy;
determining a layer display policy and, according to the layer display policy, determining the current display layer and the display manner of the current display layer;
displaying the labels corresponding to the current display layer in that display manner.
In one implementation, the processor is further configured to implement the following steps:
detecting whether an abnormal event occurs in the panoramic image;
if so, determining the target second collection device corresponding to the abnormal event;
acquiring the sub-scene image collected by the target second collection device;
generating, according to the sub-scene image, a label corresponding to the abnormal event.
In one implementation, the processor is further configured to implement the following steps:
matching the panoramic image against a preset abnormal model;
if the match succeeds, an abnormal event has occurred in the panoramic image;
or, determining whether abnormal event alarm information for the panoramic image has been received;
if it has been received, an abnormal event has occurred in the panoramic image.
In one implementation, the processor is further configured to implement the following steps:
determining the position of the abnormal event in the panoramic image;
determining, according to pre-acquired calibration information of the first collection device and each second collection device, the target second collection device corresponding to that position.
In one implementation, the processor is further configured to implement the following steps:
when an abnormal event is detected in the panoramic image, determining whether the position of the abnormal event in the panoramic image lies within a preset key area;
if so, displaying the label in the video frame image in a preset alarm mode.
In one implementation, the processor is further configured to implement the following steps:
acquiring a detail image corresponding to the video frame image collected by the first collection device;
determining, according to a pre-acquired pixel correspondence between the detail image and the video frame image, the position in the detail image that corresponds to the target position, as the to-be-processed position;
adding the label added at the target position to the to-be-processed position corresponding to the target position;
displaying the labeled video frame image and the labeled detail image according to a preset display rule.
In one implementation, the processor is further configured to implement the following steps:
displaying the labeled video frame image in a first area, and displaying the labeled detail image in a third area;
or, displaying the labeled video frame image and the labeled detail image in picture-in-picture form.
With the embodiments of the present application, a label is added at a target position in the video frame image, and the labeled video frame image is then displayed. The label helps the user understand the specific content of the video frame image, so the labeled video frame image presents the image content more intuitively and the display effect is better.
An embodiment of the present application further provides an image processing system, which may include a first collection device and an image processing device, wherein
the first collection device is configured to collect a video frame image and send the collected video frame image to the image processing device;
the image processing device is configured to: for the video frame image collected by the first collection device, determine at least one target position in the video frame image; add a label at each determined target position, the label being generated according to an input instruction or an image collected by a second collection device; and display the labeled video frame image according to a preset display rule.
In one implementation, as shown in FIG. 5, the system further includes at least one second collection device (second collection device 1, second collection device 2, second collection device 3, and second collection device 4),
the second collection device is configured to collect images of a sub-scene corresponding to the panoramic image, the panoramic image being the video frame image collected by the first collection device;
the image processing device is further configured to acquire the sub-scene image collected by the second collection device; generate a label according to the sub-scene image; and determine, according to pre-acquired calibration information of the first collection device and the second collection device, the target position in the panoramic image of the label corresponding to the second collection device.
The image processing device in this implementation may be a platform device, which can obtain resources from multiple collection devices, display images, interact with users, and so on.
In one implementation, the first collection device is an augmented reality (AR) panoramic camera.
In one implementation, the image processing device may be further configured to:
add the sub-scene image and/or the target information in the sub-scene image to the content of the label.
In one implementation, the image processing device may be further configured to:
recognize the sub-scene image, determine the target information in the sub-scene image from the recognition result, and add the target information to the content of the label;
or, receive the target information sent by the second collection device, and add the target information to the content of the label;
or, receive the target information sent by a server communicatively connected to the second collection device, and add the target information to the content of the label.
In one implementation, the image processing device may be further configured to:
display the labeled video frame image in a first area;
display the content of the added label in a second area.
In one implementation, the image processing device may be further configured to:
display the labeled video frame image and the content of the added label in picture-in-picture form.
In one implementation, the image processing device may be further configured to:
determine a current display label among the added labels;
display the content of the current display label.
In one implementation, the image processing device may be further configured to:
after detecting that the user has clicked a label in the video frame image, determine the clicked label as the target label;
display the content of the target label in the video frame image.
In one implementation, the image processing device may be further configured to:
receive a label adding instruction;
generate a label according to the label adding instruction;
determine the target position of the added label according to the label adding instruction.
In one implementation, the image processing device may be further configured to:
determine the layer corresponding to each label according to a preset layer classification policy;
determine a layer display policy and, according to the layer display policy, determine the current display layer and the display manner of the current display layer;
display the labels corresponding to the current display layer in that display manner.
In one implementation, the image processing device may be further configured to:
detect whether an abnormal event occurs in the panoramic image;
if so, determine the target second collection device corresponding to the abnormal event;
acquire the sub-scene image collected by the target second collection device;
generate, according to the sub-scene image, a label corresponding to the abnormal event.
In one implementation, the image processing device may be further configured to:
match the panoramic image against a preset abnormal model;
if the match succeeds, an abnormal event has occurred in the panoramic image;
or, determine whether abnormal event alarm information for the panoramic image has been received;
if it has been received, an abnormal event has occurred in the panoramic image.
In one implementation, the image processing device may be further configured to:
determine the position of the abnormal event in the panoramic image;
determine, according to pre-acquired calibration information of the first collection device and each second collection device, the target second collection device corresponding to that position.
In one implementation, the image processing device may be further configured to:
when an abnormal event is detected in the panoramic image, determine whether the position of the abnormal event in the panoramic image lies within a preset key area;
if so, display the label in the video frame image in a preset alarm mode.
In one implementation, the system may further include a third collection device;
the third collection device is configured to collect a detail image corresponding to the panoramic image, the panoramic image being the video frame image collected by the first collection device;
the image processing device is further configured to acquire the detail image collected by the third collection device; determine, according to a pre-acquired pixel correspondence between the detail image and the video frame image, the position in the detail image that corresponds to the target position, as the to-be-processed position; add the label added at the target position to the to-be-processed position corresponding to the target position; and display the labeled video frame image and the labeled detail image according to a preset display rule.
With the system provided by the embodiments of the present application, the image processing device acquires the video frame image collected by the first collection device, adds a label at a target position in the video frame image, and then displays the labeled video frame image. The label helps the user understand the specific content of the video frame image, so the labeled video frame image presents the image content more intuitively and the display effect is better.
An embodiment of the present application further provides a computer-readable storage medium storing a computer program which, when executed by a processor, implements any of the image processing methods described above.
An embodiment of the present application further provides executable program code, which is run to perform any of the image processing methods described above.
It should be noted that, in this document, relational terms such as "first" and "second" are used only to distinguish one entity or operation from another, and do not necessarily require or imply any actual relationship or order between those entities or operations. Moreover, the terms "comprise", "include", and any variants thereof are intended to cover non-exclusive inclusion, so that a process, method, article, or device that comprises a series of elements includes not only those elements but also other elements not expressly listed, or elements inherent to such a process, method, article, or device. Without further limitation, an element defined by the phrase "comprising a ..." does not exclude the presence of additional identical elements in the process, method, article, or device that comprises the element.
The embodiments in this specification are described in a related manner; for identical or similar parts, the embodiments may be referred to one another, and each embodiment focuses on its differences from the others. In particular, the device embodiments shown in FIG. 4a and FIG. 4b, the system embodiment shown in FIG. 5, the computer-readable storage medium embodiment, and the executable program code embodiment are described relatively briefly because they are substantially similar to the method embodiments; for relevant details, refer to the corresponding description of the method embodiments.
A person of ordinary skill in the art will understand that all or some of the steps of the above method embodiments may be carried out by a program instructing the relevant hardware, and that the program may be stored in a computer-readable storage medium, such as a ROM/RAM, a magnetic disk, or an optical disc.
The above are merely preferred embodiments of the present application and are not intended to limit it. Any modification, equivalent replacement, or improvement made within the spirit and principles of the present application shall fall within its scope of protection.

Claims (24)

  1. An image processing method, comprising:
    for a video frame image collected by a first collection device, determining at least one target position in the video frame image;
    adding a label at each determined target position, the label being generated according to an input instruction or an image collected by a second collection device; and
    displaying the labeled video frame image according to a preset display rule.
  2. The method according to claim 1, wherein the video frame image is a panoramic image, the first collection device corresponds to at least one second collection device, and each second collection device collects images of a sub-scene corresponding to the panoramic image;
    before determining at least one target position in the video frame image, the method further comprises:
    acquiring a sub-scene image collected by the second collection device; and
    generating a label according to the sub-scene image;
    and the step of determining at least one target position in the video frame image comprises:
    determining, according to pre-acquired calibration information of the first collection device and the second collection device, the target position in the panoramic image of the label corresponding to the second collection device.
  3. The method according to claim 2, wherein the first collection device is an augmented reality (AR) panoramic camera.
  4. The method according to claim 2, wherein the step of generating a label according to the sub-scene image comprises:
    adding the sub-scene image and/or target information in the sub-scene image to the content of the label.
  5. The method according to claim 4, wherein the step of adding the target information in the sub-scene image to the content of the label comprises:
    performing recognition on the sub-scene image, determining the target information in the sub-scene image according to the recognition result, and adding the target information to the content of the label;
    or receiving the target information sent by the second collection device, and adding the target information to the content of the label;
    or receiving the target information sent by a server communicatively connected to the second collection device, and adding the target information to the content of the label.
  6. The method according to claim 1, wherein the step of displaying the labeled video frame image according to a preset display rule comprises:
    displaying the labeled video frame image in a first area; and
    displaying the content of the added label in a second area.
  7. The method according to claim 1, wherein the step of displaying the labeled video frame image according to a preset display rule comprises:
    displaying the labeled video frame image and the content of the added label in picture-in-picture form.
  8. The method according to claim 6 or 7, wherein displaying the content of the added label comprises:
    determining a currently displayed label among the added labels; and
    displaying the content of the currently displayed label.
  9. The method according to claim 6 or 7, further comprising:
    after detecting that a user has clicked a label in the video frame image, determining the clicked label as a target label; and
    displaying the content of the target label in the video frame image.
  10. The method according to claim 1, wherein before the step of determining at least one target position in the video frame image, the method further comprises:
    receiving a label addition instruction; and
    generating a label according to the label addition instruction;
    and the step of determining at least one target position in the video frame image comprises:
    determining the target position of the added label according to the label addition instruction.
  11. The method according to claim 1, wherein the step of displaying the labeled video frame image according to a preset display rule comprises:
    determining the layer corresponding to each label according to a preset layer classification policy;
    determining a layer display policy and, according to the layer display policy, determining a currently displayed layer and a display mode of the currently displayed layer; and
    displaying the labels corresponding to the currently displayed layer in the display mode.
  12. The method according to claim 2, wherein the step of acquiring the sub-scene image collected by the second collection device comprises:
    detecting whether an abnormal event occurs in the panoramic image;
    if so, determining a target second collection device corresponding to the abnormal event; and
    acquiring the sub-scene image collected by the target second collection device;
    and the step of generating a label according to the sub-scene image comprises:
    generating a label corresponding to the abnormal event according to the sub-scene image.
  13. The method according to claim 12, wherein the step of detecting whether an abnormal event occurs in the panoramic image comprises:
    matching the panoramic image against a preset abnormality model;
    if the matching succeeds, determining that an abnormal event has occurred in the panoramic image;
    or determining whether abnormal event alarm information for the panoramic image has been received;
    if it has been received, determining that an abnormal event has occurred in the panoramic image.
  14. The method according to claim 12, wherein the step of determining the target second collection device corresponding to the abnormal event comprises:
    determining the position of the abnormal event in the panoramic image; and
    determining, according to pre-acquired calibration information of the first collection device and each second collection device, the target second collection device corresponding to the position.
  15. The method according to claim 12, wherein when an abnormal event is detected in the panoramic image, the method further comprises:
    determining whether the position of the abnormal event in the panoramic image lies within a preset key area;
    if so, the step of displaying the labeled video frame image according to a preset display rule comprises:
    displaying the label in the video frame image in a preset alarm mode.
  16. The method according to claim 1, further comprising:
    acquiring a detail image corresponding to the video frame image collected by the first collection device;
    after determining at least one target position in the video frame image, the method further comprises:
    determining, according to a pre-acquired pixel correspondence between the detail image and the video frame image, the position in the detail image that corresponds to the target position, as a to-be-processed position; and
    adding the label that was added at the target position to the to-be-processed position corresponding to the target position;
    and displaying the labeled video frame image according to a preset display rule comprises:
    displaying the labeled video frame image and the labeled detail image according to the preset display rule.
  17. The method according to claim 16, wherein displaying the labeled video frame image and the labeled detail image according to the preset display rule comprises:
    displaying the labeled video frame image in a first area, and displaying the labeled detail image in a third area;
    or displaying the labeled video frame image and the labeled detail image in picture-in-picture form.
  18. An image processing device, comprising a processor and a memory, wherein:
    the memory is configured to store a computer program; and
    the processor is configured to implement the image processing method according to any one of claims 1-17 when executing the program stored in the memory.
  19. An image processing system, comprising a first collection device and an image processing device, wherein:
    the first collection device is configured to collect a video frame image and send the collected video frame image to the image processing device; and
    the image processing device is configured to: for the video frame image collected by the first collection device, determine at least one target position in the video frame image; add a label at each determined target position, the label being generated according to an input instruction or an image collected by a second collection device; and display the labeled video frame image according to a preset display rule.
  20. The system according to claim 19, further comprising at least one second collection device, wherein:
    the second collection device is configured to collect images of a sub-scene corresponding to a panoramic image, the panoramic image being the video frame image collected by the first collection device; and
    the image processing device is further configured to: acquire a sub-scene image collected by the second collection device; generate a label according to the sub-scene image; and determine, according to pre-acquired calibration information of the first collection device and the second collection device, the target position in the panoramic image of the label corresponding to the second collection device.
  21. The system according to claim 19, wherein the first collection device is an augmented reality (AR) panoramic camera.
  22. The system according to claim 19, further comprising a third collection device, wherein:
    the third collection device is configured to collect a detail image corresponding to a panoramic image, the panoramic image being the video frame image collected by the first collection device; and
    the image processing device is further configured to: acquire the detail image collected by the third collection device; determine, according to a pre-acquired pixel correspondence between the detail image and the video frame image, the position in the detail image that corresponds to the target position, as a to-be-processed position; add the label that was added at the target position to the to-be-processed position corresponding to the target position; and display, according to a preset display rule, the labeled video frame image and the labeled detail image.
  23. A computer-readable storage medium storing a computer program which, when executed by a processor, implements the method steps according to any one of claims 1-17.
  24. Executable program code, wherein the executable program code is run to perform the method steps according to any one of claims 1-17.
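
The layer-based display recited in claim 11 can be illustrated with a short sketch. It is not the claimed implementation: it assumes the preset layer classification policy is a function from a label to a layer name and that the layer display policy names one current layer and one display mode. The dictionary keys (`kind`, `text`, `layer`, `style`) and function names are hypothetical.

```python
from typing import Callable, Dict, List, Tuple

LabelDict = Dict[str, str]


def assign_layers(labels: List[LabelDict],
                  classify: Callable[[LabelDict], str]) -> Dict[str, List[LabelDict]]:
    """Group labels into layers using an assumed classification policy."""
    layers: Dict[str, List[LabelDict]] = {}
    for label in labels:
        layers.setdefault(classify(label), []).append(label)
    return layers


def by_kind(label: LabelDict) -> str:
    """Hypothetical classification policy: one layer per label kind."""
    return label["kind"]


def show_layer(layers: Dict[str, List[LabelDict]],
               display_policy: Dict[str, str]) -> List[Tuple[str, str]]:
    """Return (label text, display mode) pairs for the currently displayed layer."""
    current = display_policy["layer"]
    style = display_policy["style"]
    return [(label["text"], style) for label in layers.get(current, [])]
```

Labels on layers other than the current one are left out of the result, which corresponds to showing only the labels of the currently displayed layer.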
PCT/CN2018/106752 2018-03-29 2018-09-20 Image processing method, device and system WO2019184275A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201810272370.9 2018-03-29
CN201810272370.9A CN109274926B (en) 2017-07-18 2018-03-29 Image processing method, device and system

Publications (1)

Publication Number Publication Date
WO2019184275A1 true WO2019184275A1 (en) 2019-10-03

Family

ID=68062694

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/106752 WO2019184275A1 (en) 2018-03-29 2018-09-20 Image processing method, device and system

Country Status (1)

Country Link
WO (1) WO2019184275A1 (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102457718A (en) * 2010-10-14 2012-05-16 霍尼韦尔国际公司 Graphical bookmarking of video data with user inputs in video surveillance
CN103929618A (en) * 2014-04-18 2014-07-16 卢旭东 Operational control method for outdoor advertising board state marking system
CN104285244A (en) * 2012-05-23 2015-01-14 高通股份有限公司 Image-driven view management for annotations
US20170364747A1 (en) * 2016-06-15 2017-12-21 International Business Machines Corporation AUGEMENTED VIDEO ANALYTICS FOR TESTING INTERNET OF THINGS (IoT) DEVICES

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18911456

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 18911456

Country of ref document: EP

Kind code of ref document: A1

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 19.05.2021)