WO2022142890A1 - Data processing method and related apparatus - Google Patents

Data processing method and related apparatus Download PDF

Info

Publication number
WO2022142890A1
WO2022142890A1 PCT/CN2021/132867 CN2021132867W WO2022142890A1 WO 2022142890 A1 WO2022142890 A1 WO 2022142890A1 CN 2021132867 W CN2021132867 W CN 2021132867W WO 2022142890 A1 WO2022142890 A1 WO 2022142890A1
Authority
WO
WIPO (PCT)
Prior art keywords
point cloud
bounding box
vertex
target
key
Prior art date
Application number
PCT/CN2021/132867
Other languages
French (fr)
Chinese (zh)
Inventor
程莉莉
苏飞
Original Assignee
华为技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 华为技术有限公司 filed Critical 华为技术有限公司
Publication of WO2022142890A1 publication Critical patent/WO2022142890A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T17/00Three dimensional [3D] modelling, e.g. data description of 3D objects
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T19/00Manipulating 3D models or images for computer graphics
    • G06T19/20Editing of 3D images, e.g. changing shapes or colours, aligning objects or positioning parts
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10028Range image; Depth image; 3D point clouds

Definitions

  • the present application relates to the technical field of automatic driving, and in particular, to a data processing method and a related device.
  • the embodiment of the present application discloses a data processing method and a related device, which can mark extended constraint information of the point cloud bounding box on a point cloud bounding box containing incomplete point cloud data of a target object, so as to improve the accuracy of size processing and data usage.
  • an embodiment of the present application discloses a data processing method, wherein: the incomplete point cloud data of the target object is annotated according to the image of the target object, and the annotation information is obtained, and the annotation information includes the moving direction of the target object and the Point cloud bounding box of incomplete point cloud data, the z direction of the point cloud bounding box is parallel to the z axis, and parallel to the direction corresponding to the height of the point cloud bounding box, and the z axis is perpendicular to the horizontal plane; according to the moving direction and/or z direction
  • the extended constraint information of the point cloud bounding box is determined, the extended constraint information includes at least one of the critical long side, the critical wide side and the critical high side intersecting with the key vertices; the extended constraint information is marked on the point cloud bounding box.
  • the expansion constraint information for size processing of the point cloud bounding box is added to the point cloud bounding box, so that the target bounding box that meets the actual needs can be obtained according to the expanded constraint information, which is convenient to improve the utilization rate of data.
  • the point cloud bounding box contains incomplete point cloud data, and the z direction is kept in the direction corresponding to the height of the point cloud bounding box, and the z axis perpendicular to the horizontal plane is parallel, which can avoid the direction offset of the cuboid due to the acquisition angle, and improve the labeling.
  • the accuracy of the point cloud bounding box is determined according to the moving direction and/or the z-direction, so as to improve the accuracy of size processing.
  • determining the extended constraint information of the bounding box of the point cloud according to the moving direction and/or the z direction includes: determining the vertex confidence of each vertex in the bounding box of the point cloud, where the vertex confidence is used to describe that the vertex is The probability of the vertices of the object bounding box of the target object; the vertex corresponding to the maximum vertex confidence is determined as the key vertex; the three combined edges in the point cloud bounding box that intersect the key vertex are determined as three reference edges; according to the moving direction and/or The z-direction determines at least one of a critical long side, a critical wide side, and a critical high side among the three reference sides.
  • the vertices in the point cloud bounding box that are most likely to coincide with the vertices in the object bounding box are used as key vertices, and then the three reference edges connected to the key vertices are determined according to the moving direction and/or the z direction to surround the object.
  • the sides corresponding to the length, width and height of the box are used as the key long side, the key wide side and the key high side respectively, which can improve the accuracy of determining the extended constraint information.
  • determining the vertex confidence of each vertex in the point cloud bounding box includes: determining the vertex confidence of the vertex according to the number of point clouds corresponding to each vertex in the point cloud bounding box, wherein, when the number of point clouds When the value is larger, the vertex confidence is higher; and/or, the vertex confidence of the vertex is determined according to the distance between each vertex in the point cloud bounding box and the acquisition device, wherein, when the distance is smaller, the vertex confidence is higher,
  • the acquisition device collects incomplete point cloud data. It can be understood that the point cloud can reflect the information collected by the target object. The closer the distance between the collection device and the point cloud, the higher the accuracy of the point cloud collected.
  • the probability that the vertex is a vertex of the object bounding box is determined according to the number of point clouds corresponding to the vertex and/or the distance between the vertex and the acquisition device, which can improve the accuracy of determining the vertex confidence.
  • determining the extended constraint information of the bounding box of the point cloud according to the moving direction and/or the z direction includes: determining the overall confidence of three combined edges intersecting with each vertex in the bounding box of the point cloud, the overall confidence The degree is used to describe the probability that the three combined edges are the edges of the object bounding box of the target object; the three combined edges corresponding to the maximum value of the overall confidence are determined as the three reference edges; the vertices where the three reference edges intersect are determined as key vertices; according to the movement The direction and/or the z-direction determines at least one of a critical long side, a critical broad side, and a critical high side of the three reference sides.
  • the three combined edges in the point cloud bounding box that are most likely to coincide with the edges in the object bounding box are used as the three reference edges, and then the length of the three reference edges and the object bounding box is determined according to the moving direction and/or the z direction.
  • the edges corresponding to , width and height are used as the key long edge, the key wide edge and the key high edge respectively, which can improve the accuracy of determining the extended constraint information.
  • determining the overall confidence of the three combined edges in the point cloud bounding box intersecting with each vertex includes: determining according to the number of point clouds corresponding to the three combined edges in the point cloud bounding box intersecting with each vertex The overall confidence of the three combined edges, where the larger the number of point clouds, the larger the overall confidence; and/or, according to the distance between each vertex in the point cloud bounding box and the acquisition device, determine the point cloud bounding box in the The overall confidence of the three combined edges that intersect the vertex, where the smaller the distance, the greater the overall confidence, and the acquisition device has collected incomplete point cloud data. It can be understood that the point cloud can reflect the information collected by the target object.
  • the number of point clouds corresponding to the three combined edges intersecting one vertex in the point cloud bounding box, and/or the distance between the vertexes intersected by the three combined edges and the acquisition device it is determined that the three combined edges are all surrounded by objects
  • the probability of the edges of the box ie, the overall confidence
  • the target object is a vehicle
  • the annotation information further includes a vehicle type.
  • the method further includes: determining a first size of the point cloud bounding box according to the vehicle type; enclosing the point cloud according to the extended constraint information and the first size The box is dimensioned to obtain the first target bounding box. In this way, the accuracy of the size processing of the point cloud bounding box is improved, and the authenticity of the first target bounding box can be improved.
  • performing size processing on the bounding box of the point cloud according to the extended constraint information and the first size, and obtaining the first target bounding box includes: determining the key long side, the key wide side and the key width according to the extended constraint information and the first size At least one target edge in the key high side, as well as the target length and target extension direction of at least one target edge; according to the target length and target extension direction, the size of the target edge and the corresponding edge of the target edge in the point cloud bounding box are processed to obtain The first target bounding box. In this way, the first target bounding box satisfying the vehicle type is obtained based on the first size and the expansion constraint information, which improves the accuracy of the size processing of the point cloud bounding box.
  • the method further includes: storing the reference point cloud data obtained by marking the extended constraint information on the point cloud bounding box. In this way, it is convenient to further improve the utilization rate of data.
  • the target object is a vehicle
  • the annotation information further includes a vehicle type.
  • the method further includes: receiving an annotation instruction for the reference point cloud data; determining the second size of the point cloud bounding box according to the annotation instruction and the vehicle type ; Perform size processing on the point cloud bounding box according to the extended constraint information and the second size to obtain the second target bounding box.
  • the second target bounding box that satisfies the labeling instruction and the vehicle type is obtained based on the second size and the extended constraint information, which improves the accuracy of size processing of the point cloud bounding box and improves the data usage rate.
  • an embodiment of the present application discloses a data processing device, wherein the labeling unit is configured to label incomplete point cloud data of the target object according to an image of the target object, and obtain labeling information, where the labeling information includes the target object's data.
  • the moving direction and the point cloud bounding box containing incomplete point cloud data the z direction of the point cloud bounding box is parallel to the z axis and the direction corresponding to the height of the point cloud bounding box, and the z axis is perpendicular to the horizontal plane;
  • the determination unit is used for
  • the extended constraint information of the bounding box of the point cloud is determined according to the moving direction and/or the z direction, the extended constraint information includes at least one of the key long side, the key wide side and the key high side intersecting with the key vertices;
  • the labeling unit is also used for Annotate extended constraint information on the point cloud bounding box.
  • the expansion constraint information for size processing of the point cloud bounding box is added to the point cloud bounding box, so that the target bounding box that meets the actual needs can be obtained according to the expanded constraint information, which is convenient to improve the utilization rate of data.
  • the point cloud bounding box contains incomplete point cloud data, and the z direction is kept in the direction corresponding to the height of the point cloud bounding box, and the z axis perpendicular to the horizontal plane is parallel, which can avoid the direction offset of the cuboid due to the acquisition angle, and improve the Accuracy of labeling point cloud bounding boxes.
  • the extended constraint information is determined according to the moving direction and/or the z-direction, so as to improve the accuracy of size processing.
  • the determining unit is specifically configured to determine the vertex confidence of each vertex in the point cloud bounding box, wherein the vertex confidence is used to describe the probability that the vertex is the vertex of the object bounding box of the target object; determine the vertex The vertex corresponding to the maximum confidence value is the key vertex; determine the three combined edges that intersect the key vertex in the point cloud bounding box as the three reference edges; determine the key long edge, key edge, and key edge among the three reference edges according to the moving direction and/or the z direction At least one of wide side and critical high side.
  • the vertices in the point cloud bounding box that are most likely to coincide with the vertices in the object bounding box are used as key vertices, and then the three reference edges connected to the key vertices are determined according to the moving direction and/or the z direction to surround the object.
  • the sides corresponding to the length, width and height of the box are used as the key long side, the key wide side and the key high side respectively, which can improve the accuracy of determining the extended constraint information.
  • the determining unit is specifically configured to determine the vertex confidence of the vertex according to the number of point clouds corresponding to each vertex in the point cloud bounding box, wherein, when the number of point clouds is larger, the vertex confidence is larger; And/or, the vertex confidence of the vertex is determined according to the distance between each vertex in the point cloud bounding box and the acquisition device, wherein, when the distance is smaller, the vertex confidence is larger, and the acquisition device has collected incomplete point cloud data.
  • the point cloud can reflect the information collected by the target object. The closer the distance between the collection device and the point cloud, the accuracy of the point cloud collected.
  • the probability that the vertex is a vertex of the object bounding box is determined according to the number of point clouds corresponding to the vertex and/or the distance between the vertex and the acquisition device, which can improve the accuracy of determining the vertex confidence.
  • the determining unit is specifically configured to determine the overall confidence level of the three combined edges intersecting each vertex in the point cloud bounding box, wherein the overall confidence level is used to describe the object whose three combined edges are the target object
  • the probability of the edges of the bounding box determine the three combined edges corresponding to the maximum value of the overall confidence as the three reference edges; determine the vertex where the three reference edges intersect as the key vertex; determine the key among the three reference edges according to the movement direction and/or the z direction At least one of Long Side, Critical Wide Side, and Critical High Side.
  • the three combined edges in the point cloud bounding box that are most likely to coincide with the edges in the object bounding box are used as the three reference edges, and then the length of the three reference edges and the object bounding box is determined according to the moving direction and/or the z direction.
  • the edges corresponding to , width and height are used as the key long edge, the key wide edge and the key high edge respectively, which can improve the accuracy of determining the extended constraint information.
  • the determining unit is specifically configured to determine the overall confidence of the three combined edges according to the number of point clouds corresponding to the three combined edges intersecting each vertex in the point cloud bounding box, wherein, when the number of point clouds is larger, the greater the number of point clouds When , the overall confidence is greater; and/or, according to the distance between each vertex in the point cloud bounding box and the acquisition device, determine the overall confidence of the three combined edges in the point cloud bounding box that intersect with the vertices, wherein, when the distance The smaller the value, the greater the overall confidence, and the acquisition device has collected incomplete point cloud data.
  • the point cloud can reflect the information collected by the target object.
  • the number of point clouds corresponding to the three combined edges intersecting one vertex in the point cloud bounding box, and/or the distance between the vertexes intersected by the three combined edges and the acquisition device it is determined that the three combined edges are all surrounded by objects
  • the probability of the edges of the box ie, the overall confidence
  • the target object is a vehicle
  • the annotation information further includes a vehicle type
  • the determining unit is further configured to determine the first size of the bounding box of the point cloud according to the vehicle type
  • the data processing apparatus further includes a processing unit configured to determine the first size of the bounding box of the point cloud according to the expansion
  • the constraint information and the first size are used to perform size processing on the bounding box of the point cloud to obtain the first target bounding box. In this way, the accuracy of the size processing of the point cloud bounding box is improved, and the authenticity of the first target bounding box can be improved.
  • the processing unit is specifically configured to determine, according to the first size and the expansion constraint information, at least one target side among the critical long side, the critical wide side, and the critical high side, as well as the target length and the target side of the at least one target side Expansion direction: According to the target length and the target expansion direction, size processing is performed on the target edge and the edge corresponding to the target edge in the point cloud bounding box to obtain the first target bounding box. In this way, the first target bounding box satisfying the vehicle type is obtained based on the first size and the expansion constraint information, which improves the accuracy of the size processing of the point cloud bounding box.
  • the data processing apparatus further includes: a storage unit configured to store the reference point cloud data obtained by marking the extended constraint information on the point cloud bounding box. In this way, it is convenient to further improve the utilization rate of data.
  • the target object is a vehicle
  • the labeling information further includes the vehicle type
  • the data processing device further includes a communication unit and a processing unit, wherein: the communication unit is used for receiving labeling instructions for the reference point cloud data; the determining unit, It is also used to determine the second size of the point cloud bounding box according to the annotation instruction and the vehicle type; the processing unit is used to perform size processing on the point cloud bounding box according to the extended constraint information and the second size to obtain the second target bounding box.
  • the second target bounding box that satisfies the labeling instruction and the vehicle type is obtained based on the second size and the extended constraint information, which improves the accuracy of size processing of the point cloud bounding box and improves the data usage rate.
  • an embodiment of the present application discloses another data processing apparatus, comprising a processor and a memory connected to the processor, the memory is used to store one or more programs, and is configured to be executed by the processor to execute the above-mentioned first aspect A step of.
  • the present application provides a computer-readable storage medium, where instructions are stored in the computer-readable storage medium, when the computer-readable storage medium runs on a computer, the computer executes the method of the first aspect.
  • the present application provides a computer program product.
  • the computer program product is used to store a computer program, and when the computer program runs on a computer, the computer can execute the method of the first aspect.
  • the present application provides a chip, including a processor and a memory, where the processor is configured to call and execute instructions stored in the memory from the memory, so that a device equipped with the chip executes the method of the above-mentioned first aspect.
  • the present application provides another chip, comprising: an input interface, an output interface and a processing circuit, the input interface, the output interface and the processing circuit are connected through an internal connection path, and the processing circuit is used to execute the above-mentioned first aspect Methods.
  • the present application provides another chip, including: an input interface, an output interface, a processor, and optionally, a memory, and the input interface, the output interface, the processor, and the memory are connected through an internal connection path,
  • the processor is used to execute code in the memory, and when the code is executed, the processor is used to perform the method of any of the above aspects.
  • an embodiment of the present application provides a chip system, including at least one processor, a memory and an interface circuit, the memory, the transceiver and the at least one processor are interconnected by lines, and at least one memory stores a computer program; the computer program is The processor performs the method of the first aspect described above.
  • FIG. 1 is a schematic structural diagram of a data processing system provided by an embodiment of the present application.
  • FIG. 2 is a schematic structural diagram of an electronic device provided by an embodiment of the present application.
  • FIG. 3 is a schematic diagram of optional vehicle types in a labeling platform provided by an embodiment of the present application.
  • FIG. 4 is a two-dimensional image and a point cloud image collected by a collection device provided in an embodiment of the present application;
  • FIG. 5 is a schematic diagram of a point cloud bounding box provided by the prior art extended to an imaginary bounding box;
  • FIG. 6 is a schematic flowchart of a data processing method provided by an embodiment of the present application.
  • FIG. 7 is a schematic diagram of marking a point cloud bounding box and a moving direction provided by an embodiment of the present application.
  • FIG. 8 is a schematic diagram of annotating extended constraint information provided by an embodiment of the present application.
  • FIG. 9 is a two-dimensional image and a point cloud image collected by another collection device provided in an embodiment of the present application.
  • FIG. 10 is a schematic flowchart of another data processing method provided by an embodiment of the present application.
  • FIG. 11 is a schematic diagram of size processing of a point cloud bounding box provided by an embodiment of the present application.
  • FIG. 12 is a schematic structural diagram of a data processing apparatus provided by an embodiment of the present application.
  • FIG. 13 is a schematic structural diagram of another data processing apparatus provided by an embodiment of the present application.
  • FIG. 1 is a system architecture diagram of a data transmission method applied to an embodiment of the present application.
  • the system includes an electronic device 10 and a collection device 20 .
  • the present application does not limit the number of electronic devices 10 and collection devices 20 .
  • the electronic devices in the embodiments of the present application may include, but are not limited to, personal computers, server computers, handheld or laptop devices, mobile devices (such as cell phones, mobile phones, tablet computers, personal digital assistants, media players, etc.), consumer electronic devices, minicomputers, mainframe computers, mobile robots, drones, etc.
  • the electronic device may be an in-vehicle device in a computer system (or an in-vehicle system), or may be other devices, which are not limited herein.
  • the electronic device 10 is depicted as a personal computer.
  • FIG. 2 is a schematic structural diagram of an electronic device according to an embodiment of the present application.
  • the electronic device 10 may include a display device 110 , a processor 120 and a memory 130 .
  • the memory 130 may be used to store software programs and data, and the processor 120 may execute various functional applications and data processing of the electronic device 10 by running the software programs and data stored in the memory 130 .
  • the memory 130 may mainly include a stored program area and a stored data area, wherein the stored program area may store an operating system, an application program required for at least one function (such as an image acquisition function, etc.), and the like; Use the created data (such as audio data, text information, image data, etc.) and the like. Additionally, memory 130 may include high-speed random access memory, and may also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other volatile solid state storage device.
  • the processor 120 is the control center of the electronic device 10, uses various interfaces and lines to connect various parts of the entire electronic device 10, and executes various functions of the electronic device 10 by running or executing the software programs and/or data stored in the memory 130. function and process data for overall monitoring of the electronic device 10 .
  • the processor 120 may include one or more processing units, for example, the processor 120 may include an application processor (application processor, AP), a modem processor, a graphics processor (graphics processing unit, GPU), an image signal processor (image signal processor, ISP), controller, memory, video codec, digital signal processor (digital signal processor, DSP), baseband processor, and/or neural-network processing unit (NPU) Wait. Wherein, different processing units may be independent devices, or may be integrated in one or more processors.
  • the NPU is a neural-network (NN) computing processor.
  • NN neural-network
  • the NPU By borrowing the structure of biological neural networks, such as the transmission mode between neurons in the human brain, it can quickly process the input information and can continuously learn by itself.
  • Applications such as intelligent cognition of the electronic device 10 can be realized through the NPU, for example: image recognition, face recognition, speech recognition, text understanding, etc.
  • the processor 120 may include one or more interfaces.
  • the interface may include an integrated circuit (inter-integrated circuit, I2C) interface, an integrated circuit built-in audio (inter-integrated circuit sound, I2S) interface, a pulse code modulation (pulse code modulation, PCM) interface, a universal asynchronous transceiver (universal asynchronous transmitter) receiver/transmitter, UART) interface, mobile industry processor interface (MIPI), general-purpose input/output (GPIO) interface, subscriber identity module (SIM) interface, and / or universal serial bus (universal serial bus, USB) interface, etc.
  • I2C integrated circuit
  • I2S integrated circuit built-in audio
  • PCM pulse code modulation
  • PCM pulse code modulation
  • UART universal asynchronous transceiver
  • MIPI mobile industry processor interface
  • GPIO general-purpose input/output
  • SIM subscriber identity module
  • USB universal serial bus
  • the I2C interface is a bidirectional synchronous serial bus that includes a serial data line (SDA) and a serial clock line (CL).
  • the processor 120 may contain multiple sets of I2C buses.
  • the processor 120 can be respectively coupled to the touch sensor, the charger, the flash, the camera 160 and the like through different I2C bus interfaces.
  • the processor 120 can couple the touch sensor through the I2C interface, so that the processor 120 communicates with the touch sensor through the I2C bus interface, so as to realize the touch function of the electronic device 10 .
  • the I2S interface can be used for audio communication.
  • the processor 120 may contain multiple sets of I2S buses.
  • the processor 120 may be coupled with the audio module through an I2S bus to implement communication between the processor 120 and the audio module.
  • the audio module can transmit audio signals to the wireless fidelity (WiFi) module 190 through the I2S interface, so as to realize the function of answering calls through the Bluetooth headset.
  • WiFi wireless fidelity
  • the PCM interface can also be used for audio communications, sampling, quantizing and encoding analog signals.
  • the audio module and WiFi module 190 may be coupled through a PCM bus interface.
  • the audio module can also transmit audio signals to the WiFi module 190 through the PCM interface, so as to realize the function of answering calls through the Bluetooth headset. Both the I2S interface and the PCM interface can be used for audio communication.
  • the UART interface is a universal serial data bus used for asynchronous communication.
  • the bus may be a bidirectional communication bus. It converts the data to be transmitted between serial communication and parallel communication.
  • a UART interface is typically used to connect the processor 120 and the WiFi module 190 .
  • the processor 120 communicates with the Bluetooth module in the WiFi module 190 through the UART interface to implement the Bluetooth function.
  • the audio module can transmit the audio signal to the WiFi module 190 through the UART interface, so as to realize the function of playing music through the Bluetooth headset.
  • the MIPI interface may be used to connect the processor 120 with peripheral devices such as the display device 110 and the camera 160 .
  • MIPI interfaces include camera 160 serial interface (camera serial interface, CSI), display serial interface (display serial interface, DSI) and so on.
  • the processor 120 communicates with the camera 160 through a CSI interface to implement the photographing function of the electronic device 10 .
  • the processor 120 communicates with the display screen through the DSI interface to implement the display function of the electronic device 10 .
  • the GPIO interface can be configured by software.
  • the GPIO interface can be configured as a control signal or as a data signal.
  • the GPIO interface may be used to connect the processor 120 with the camera 160, the display device 110, the WiFi module 190, the audio module, the sensor module, and the like.
  • the GPIO interface can also be configured as I2C interface, I2S interface, UART interface, MIPI interface, etc.
  • the USB interface is an interface that conforms to the USB standard specification, which can be a Mini USB interface, a Micro USB interface, a USB Type C interface, etc.
  • the USB interface can be used to connect a charger to charge the electronic device 10, and can also be used to transmit data between the electronic device 10 and peripheral devices. It can also be used to connect headphones to play audio through the headphones.
  • the interface can also be used to connect other electronic devices, for example, Augmented Reality (AR) devices.
  • AR Augmented Reality
  • the interface connection relationship between the modules illustrated in the embodiment of the present invention is only a schematic illustration, and does not constitute a structural limitation of the electronic device 10 .
  • the electronic device 10 may also adopt different interface connection manners in the foregoing embodiments, or a combination of multiple interface connection manners.
  • the electronic device 10 also includes a camera 160 for capturing images or videos.
  • the camera 160 may be a common camera or a focusing camera.
  • the electronic device 10 may further include an input device 140 for receiving input numerical information, character information or contact touch operations/non-contact gestures, and generating signal inputs related to user settings and function control of the electronic device 10 .
  • the display device 110 including a display panel, is used to display the information input by the user or the information provided to the user and various menu interfaces of the electronic device 10, etc., and is mainly used to display the camera in the electronic device 10 or The image to be detected collected by the sensor.
  • the display panel may be configured in the form of a liquid crystal display (liquid crystal display, LCD) or an organic light-emitting diode (organic light-emitting diode, OLED) or the like.
  • the electronic device 10 may also include one or more sensors 170, such as image sensors, infrared sensors, laser sensors (which may include laser displacement sensors and lidar sensors, etc.), pressure sensors, gyroscope sensors, air pressure sensors, magnetic sensors, acceleration sensors , distance sensor, proximity light sensor, ambient light sensor, fingerprint sensor, touch sensor, temperature sensor, bone conduction sensor, etc., wherein the image sensor can be time of flight (TOF) sensor, structured light sensor, etc.
  • TOF time of flight
  • the electronic device 10 may also include a power supply 150 for powering other modules.
  • the electronic device 10 may further include a radio frequency (RF) circuit 180 for performing network communication with wireless network devices, and may also include a WiFi module 190 for performing WiFi communication with other devices, for example, for receiving other devices transmitted images or data, etc.
  • RF radio frequency
  • the electronic device 10 may also include other possible functional modules such as a flashlight, a Bluetooth module, an external interface, a button, a motor, etc., which will not be repeated here.
  • other possible functional modules such as a flashlight, a Bluetooth module, an external interface, a button, a motor, etc., which will not be repeated here.
  • the collection device in this embodiment of the present application may be a movable device, and the movable device may include, but is not limited to, an airplane, a ship, a robot, a vehicle, etc., and may also be a device on the road, for example, a roadside unit (roadside unit, RSU).
  • the airplanes, ships, and vehicles described in the embodiments of the present application may be human-driven devices or unmanned devices, which are not limited herein.
  • the acquisition device 20 is depicted as a vehicle.
  • the collection device may include a processor, a display device, and a memory, and reference may be made to the description of the electronic device, which will not be repeated here.
  • the acquisition device 20 may also include sensors, such as image pickup devices (eg, cameras, etc.), lidar sensors, and the like. Among them, the image pickup device is used to collect two-dimensional images.
  • the lidar sensor is used to detect the reflected signal of the laser signal sent by the lidar, thereby obtaining the laser point cloud (or point cloud).
  • the collection device for collecting the two-dimensional graphics and the point cloud may be the same device or different devices, which are not limited here.
  • the processor of the acquisition device may include a point cloud processing module for processing point cloud data.
  • the collection device may be used to collect data of the target object (for example, at least one of two-dimensional graphics, point cloud data, distance to the target object, etc.), and send the data to the corresponding Electronic equipment.
  • the electronic device may be configured to receive data sent from the collection device, and execute the data processing method described in the embodiments of the present application according to the data.
  • the data processing method described in the embodiments of the present application is directly executed by the acquisition device (or the processor in the acquisition device or the point cloud processing module in the processor, etc.).
  • An application program corresponding to the labeling platform can be run in the electronic device, and the labeling platform can be used to display the data received from the collection device, and can be provided to the labeling personnel for labeling.
  • the data processing method described in the embodiments of the present application may also be executed by an application program corresponding to the labeling platform, etc., which is not limited herein.
  • the target object is an object that needs to be identified by the acquisition device or the electronic device.
  • the target objects include objects on the road and objects outside the road.
  • the objects on the road include people, cars, traffic lights, traffic signs (such as speed limit signs, etc.), traffic sign poles and foreign objects on the road.
  • Foreign objects refer to objects that should not appear on the road, such as cartons, tires, etc. left on the road.
  • Objects outside the road include buildings on both sides of the road, trees, and isolation belts between roads.
  • the target object may also be a device such as an airplane, a ship, a robot, etc., which is not limited here.
  • the moving direction refers to the moving direction of the target object.
  • the target object is a vehicle
  • the moving direction of the vehicle is usually the forward direction
  • the moving direction can also be referred to as the head direction.
  • Object type refers to the classification of the target object, for example, aircraft type, ship type, robot type, vehicle type, etc. Object types can also be further classified according to aircraft types, ship types, robot types, vehicle types, etc., or specifically, drone types, unmanned ship types, unmanned vehicle types, and the like.
  • the vehicle type in the labeling platform can include types such as buses, motorcycles, bicycles, construction vehicles, tricycles, tank trucks or pickups for labeling Personnel selection.
  • the above vehicle types can also be supplied to the electronic device for identification based on the image features of the vehicle type. It can be understood that the vehicle sizes are different between different vehicle types, and each type of vehicle corresponds to one size.
  • Object sizes vary between different object types, so the approximate size of the target object in the point cloud data can be determined based on the object type.
  • the moving direction of the target object of different object types is different from the direction of the length and width of the cuboid corresponding to the object.
  • the moving direction of the target object is usually consistent with the direction corresponding to the length of the cuboid corresponding to the vehicle.
  • the moving direction of the target object is usually the direction of upright walking, that is, the direction corresponding to the width of the rectangular parallelepiped corresponding to the humanoid robot.
  • Point cloud data also known as laser point cloud (PCD), three-dimensional point cloud or point cloud
  • PCD laser point cloud
  • three-dimensional point cloud or point cloud is the use of laser to obtain the three-dimensional spatial coordinates of each sampling point on the surface of the object (usually x, y) in the same spatial reference system.
  • z in the form of three-dimensional coordinates
  • mass points that express the spatial distribution of the target and the characteristics of the target surface are obtained.
  • point clouds lack detailed texture information, they contain rich three-dimensional spatial information.
  • point cloud data may also include color information, gray value, depth, segmentation results, etc., which are not limited here.
  • the image obtained by projecting the point cloud data to a two-dimensional plane is called a point cloud image.
  • the acquisition device can only collect part of the point cloud for objects that are far away or occluded. In the embodiment of this application, all the collected point cloud data are called incomplete point cloud data. If the point cloud data of the target object is insufficient, all the collected point cloud data of the target object are called incomplete point cloud data of the target object. point cloud data.
  • FIG. 4 takes the target object as a vehicle for illustration.
  • (a) in FIG. 4 is a two-dimensional graph collected by the collecting device 20
  • (b) in FIG. 4 is a point cloud image corresponding to all the point cloud data collected by the collecting device 20 . As can be seen from (a) in FIG.
  • the front of the acquisition device 20 includes 4 target objects (ie, 4 vehicles) 21 , and the distance between the acquisition device 20 and the front target object 21 is relatively far, and the road is adjacent to the road
  • the target object 21 on the side may be occluded by the leaves on the roadside, and the point cloud data of the target object 21 may be incomplete.
  • there are sparse point cloud data in the labeling frame of the target object 21 so that it can be determined that the point cloud data of the target object 21 is insufficient, and all the target objects 21 collected by the collection device 20 have sparse point cloud data.
  • the point cloud data is called incomplete point cloud data of the target object 21 .
  • the image of the target object includes a two-dimensional image collected by the acquisition device for the target object, and may also include a three-dimensional image corresponding to incomplete point cloud data of the target object, etc., which is not limited herein.
  • Bounding box is an algorithm for solving the optimal bounding space of discrete point sets.
  • the most common bounding boxes are sphere, axis-aligned bounding box (AABB), bounding sphere (sphere), oriented bounding box (OBB), and fixed directions hulls or k -DOP, FDH).
  • the axis-aligned bounding box and the oriented bounding box are bounding boxes corresponding to cuboids, and the axis-aligned bounding box of a given object is defined as the smallest hexahedron containing the object and each side parallel to the coordinate axis.
  • the directed bounding box of a given object is defined as the smallest cuboid that contains the object and has an arbitrary direction relative to the coordinate axis.
  • the shape feature encloses the object as tightly as possible.
  • the y-direction corresponding to the x-axis/y-axis may have a certain angle with the x-axis/y such that the x-direction/y-direction
  • the area of the constituent faces is the smallest.
  • the z-axis described in the embodiments of the present application may be the z-axis direction in the geodetic coordinate system, etc., which is not limited herein.
  • the point cloud bounding box in this embodiment of the present application is a cuboid that includes all point clouds of a given object (that is, incomplete point cloud data of the target object), and the z direction of the cuboid (that is, the point cloud bounding box) is the same as the
  • the z-axis and the direction corresponding to the height of the point cloud bounding box are parallel, and the z-axis is perpendicular to the horizontal plane.
  • the axis of the coordinate axis corresponding to the point cloud bounding box can be located in the center of the point cloud bounding box, and the x-axis, y-axis and z-axis directions of the coordinate axis can be parallel to the length, width and height of the point cloud bounding box respectively.
  • the height is perpendicular to the horizontal plane, and the length is greater than the width. Therefore, the z direction of the point cloud bounding box is parallel to the direction corresponding to the height of the point cloud bounding box, and the x direction of the point cloud bounding box is parallel to the direction corresponding to the length of the point cloud bounding box, and the y direction is parallel to the point cloud bounding box.
  • the directions corresponding to the width of the box are parallel.
  • the length of the edge is related to the length of the point cloud data actually collected.
  • the edge with a longer number of collected point clouds can be used as the length of the bounding box of the point cloud, and the edge that intersects with the length and height at the other edge of a vertex or is parallel to the edge can be used as the length of the bounding box of the point cloud.
  • the z direction of the object bounding box corresponding to the target object is parallel to the z axis.
  • the z-direction that defines the bounding box of the point cloud is parallel to the z-axis, regardless of the acquisition angle, it can be ensured that the height direction corresponding to the bounding box of the point cloud and the bounding box of the object is perpendicular to the horizontal plane. That is to say, the z direction of the point cloud bounding box is parallel to the z direction of the target object, which can improve the accuracy of labeling the point cloud bounding box.
  • the point cloud bounding box containing the incomplete point cloud data of the target object is marked.
  • the type of the point cloud bounding box is a directed bounding box
  • the z direction of the point cloud bounding box is parallel to the z axis. Since the directed bounding box is the smallest cuboid defined to contain the object and any direction relative to the coordinate axis, the compactness of the bounding box of the point cloud can be guaranteed, thereby further improving the accuracy of labeling the bounding box of the point cloud.
  • the object bounding box in this embodiment of the present application is a cuboid containing a given object (ie, a target object), and the z-direction of the cuboid is parallel to the z-axis.
  • the direction corresponding to the height of the object bounding box is parallel to the z-axis
  • the x direction of the object bounding box is parallel to the direction corresponding to the length of the object bounding box
  • the y direction of the object bounding box is parallel to the object bounding box.
  • the width corresponds to the direction.
  • the length or width of the object bounding box can be determined by the moving direction of the object bounding box and the object type.
  • the moving direction of the target object is usually consistent with the direction corresponding to the length of the object bounding box, so that The edge in the bounding box of the object that is parallel to the moving direction of the target object can be determined as the length of the bounding box of the object, and the other side of the bounding box of the object that intersects with the length and height at a vertex or the edge parallel to the edge is used as the bounding box of the object.
  • the width of the bounding box of the object is usually consistent with the direction corresponding to the length of the object bounding box, so that The edge in the bounding box of the object that is parallel to the moving direction of the target object can be determined as the length of the bounding box of the object, and the other side of the bounding box of the object that intersects with the length and height at a vertex or the edge parallel to the edge is used as the bounding box of the object.
  • the width of the bounding box of the object is usually consistent with the direction corresponding to the length of the object bounding box, so that The edge in the bound
  • the moving direction of the target object is usually the direction of upright walking, that is, the direction corresponding to the width of the bounding box of the object, so it can be determined that the bounding box of the object is parallel to the moving direction of the target object
  • the side of the bounding box of the object is used as the length of the bounding box of the object, and the other side of the bounding box of the object that intersects with the length and height at a vertex or the side parallel to the side is used as the width of the bounding box of the object.
  • the currently commonly used method for labeling incomplete point cloud data is shown in (a) in Figure 5.
  • the bounding box 30 corresponding to the incomplete point cloud data and the moving direction indicated by the arrow A1 are first marked; Perform frame expansion processing on the bounding box 30 to obtain the imaginary bounding box 31 and the moving direction indicated by the arrow A2, so that the imaginary bounding box 31 conforms to the size of a normal vehicle (for example, the type of vehicle marked in (b) in FIG. 5 is an engineering vehicle, And the length, width and height (unit is meter) of the virtual bounding box 31 corresponding to the construction vehicle are 1.77, 2.78 and 2.00 respectively).
  • the specification of the expansion box is difficult to determine, and the bounding box is usually expanded by the annotator based on subjective experience, which lacks objectivity. And the labeling efficiency is reduced due to the frame expansion operation. In addition, the imaginary bounding box obtained by frame expansion is difficult to be used by other teams or personnel.
  • a data processing method provided by an example of this application can be applied to a data processing apparatus, and the data processing apparatus may be the above-mentioned electronic equipment or collection equipment.
  • the embodiment of the present application takes an electronic device as an example to describe the data processing method. Please refer to FIG. 6 , which is a schematic flowchart of the data processing method applied by the embodiment of the present application.
  • the method may include the following steps S601-S603, wherein:
  • S601 Label the incomplete point cloud data of the target object according to the image of the target object, and obtain label information, wherein the label information includes the moving direction of the target object and the point cloud bounding box containing the incomplete point cloud data, and the point cloud bounding box
  • the z direction of is parallel to the z axis and the direction corresponding to the height of the point cloud bounding box, and the z axis is perpendicular to the horizontal plane.
  • the target object, the image of the target object and incomplete point cloud data, the moving direction of the point cloud bounding box and the target object, the z direction of the point cloud bounding box and the direction corresponding to the height, and the z axis can refer to the aforementioned definition, in This will not be repeated here.
  • the labeling information includes a point cloud bounding box containing incomplete point cloud data, and the moving direction of the target object. As shown in (b) in FIG. 7 , it may include a point cloud bounding box 30 represented by a cuboid. , the moving direction indicated by the arrow A1.
  • the annotation information may also include object type, occlusion situation, up-down direction, zoom ratio, etc., which are not limited herein.
  • the object types can be referred to above, and are not repeated here.
  • the annotation information may further include the vehicle type.
  • the occlusion situation is used to describe whether the target object is occluded, as well as the occluded parts and other information. It can determine the lack of incomplete point cloud data. When most of the data is missing, the incomplete point cloud data is obtained by size processing. The accuracy of the target bounding box is insufficient.
  • the up and down direction corresponds to the width of the object bounding box, and the width of the object bounding box can be directly determined based on the up and down direction. If the up and down directions are marked in the box, the width direction corresponding to the bounding box of the point cloud and the bounding box of the object can be determined, which is convenient to improve the speed of determining the expansion direction of the bounding box of the point cloud.
  • the zoom ratio is used to describe the zoom size between the point cloud data collected by the acquisition device and the real object. The zoom ratio can be understood as the zoom size between the incomplete point cloud data corresponding to the target object and the actual target object.
  • This embodiment of the present application does not limit the method for marking the bounding box of the point cloud.
  • the target area corresponding to the target object in the point cloud data collected by the collection device is determined;
  • the point cloud bounding box corresponding to the point cloud data is marked.
  • the target area is the position corresponding to the target object in the point cloud data collected by the collection device.
  • the labeler can compare the point cloud data with the two-dimensional graphics, and the obtained label information can determine the target area corresponding to the target object.
  • the device determines the position of the target object in the point cloud data as the target area according to the mapping relationship between the two-dimensional graphics and the point cloud data, so that the three-dimensional point cloud corresponding to the target area is used as the incomplete point cloud data corresponding to the target object.
  • the present application does not limit the method for determining the target area.
  • This application does not limit the preset algorithm. It can be based on the cuboid containing all the point clouds of the target objects (that is, incomplete point cloud data) required in the point cloud bounding box, and the z direction of the cuboid is parallel to the z axis, so that It is obtained by shrinking the box corresponding to the point cloud data in the target area.
  • the point cloud bounding box can be obtained by fine-tuning the cuboid obtained by the above preset algorithm by an annotator. It can be understood that the accuracy of the point cloud bounding box can be further improved by fine-tuning the point cloud bounding box obtained by the annotator.
  • S602 Determine extension constraint information of the bounding box of the point cloud according to the moving direction and/or the z direction, wherein the extension constraint information includes at least one of a key long side, a key wide side and a key high side intersecting with key vertices.
  • the expansion constraint information is used to define the expansion direction of the bounding box of the point cloud, which may include key vertices, and at least one of the key long sides, key broad sides, and key high sides intersecting with the key vertices, and also It may include the expansion directions corresponding to the key vertices and the key long sides, the key wide sides, and the key high sides, etc., which are not limited here.
  • the key vertex is used to describe the vertices in the point cloud bounding box that are closest to the object bounding box, that is, the key vertex is the vertex in the point cloud bounding box that most likely coincides with the vertices in the object bounding box.
  • the vertex d1 in the point cloud bounding box 30 shown in (b) of FIG. 8 is a key vertex.
  • the key long edge, the key wide edge and the key high edge are the three combined edges that intersect at the key vertices and correspond to the length, width and height of the bounding box of the object respectively. That is, the critical long side, critical wide side, and critical high side are most likely to coincide with the three edges in the object's bounding box that intersect at a vertex.
  • the arrow A1 in FIG. 8 indicates the moving direction of the target object, the line segment L1, the line segment L2 and the line segment L3 intersect at the key vertex d1, and the edges corresponding to the line segment L1, the line segment L2 and the line segment L3 in the point cloud bounding box 30 can be called as Critical Long Side, Critical Wide Side, and Critical High Side.
  • the present application does not limit the method for determining the extension constraint information, which may include the following two implementation manners, wherein:
  • the vertex confidence is used to describe the probability that the vertex of the point cloud bounding box is the vertex of the object bounding box of the target object.
  • This application does not limit the method of determining the vertex confidence.
  • the vertex confidence of each vertex is determined according to the number of point clouds corresponding to each vertex in the bounding box of the point cloud. When , the vertex confidence is greater.
  • the number of point clouds corresponding to the vertices can be understood as the number of point clouds within a preset range corresponding to the vertices, and the preset range can be the distance between the three combined edges connected to the vertices in the bounding box of the point cloud and the vertices
  • the 1/4 sphere, Mitsubishi cone or cube formed by connecting points that differ by the same threshold is not limited here.
  • a vertex corresponding to a plane in the point cloud bounding box may be selected. As shown in (a) in FIG.
  • vertex d1, vertex d2, vertex d3 and vertex d4 in the point cloud bounding box are selected , the size relationship between the number of point clouds in the preset range corresponding to each of the vertex d1, vertex d2, vertex d3 and vertex d4 is vertex d1 > vertex d2 > vertex d3 > vertex d4.
  • the vertex confidence the relationship between the vertex confidences of each vertex in vertex d1, vertex d2, vertex d3 and vertex d4 is vertex d1> vertex d2> vertex d3> vertex d4, so , the key vertex can be determined as vertex d1.
  • the point cloud can reflect the collected information of the target object.
  • determining the probability that the vertex is a vertex of the object bounding box (ie, the vertex confidence) according to the number of point clouds corresponding to the vertex can improve the accuracy of determining the vertex confidence.
  • the vertex confidence of the vertex is determined according to the distance between the vertex of the bounding box of the point cloud and the acquisition device. When the distance is smaller, the vertex confidence is higher.
  • the acquisition device collects incomplete point cloud data
  • the acquisition device is a device that collects incomplete point cloud data.
  • the acquisition device may also be a device for acquiring a two-dimensional image of the target object, which is not limited herein.
  • the distance between the vertices of the point cloud bounding box and the acquisition device can be calculated by the three-dimensional coordinates corresponding to the vertices of the point cloud bounding box and the three-dimensional coordinates corresponding to the acquisition device (which can be a lidar sensor in the acquisition device).
  • the 3D coordinates corresponding to the vertices of the object bounding box corresponding to the vertices of the point cloud bounding box and the 3D coordinates corresponding to the acquisition device can be calculated and obtained, which are not limited here.
  • vertex d1> vertex d2> vertex d3> vertex d4 therefore, it can be Determine the key vertex as vertex d1.
  • the vertex confidence determines the probability that the vertex is the vertex of the object bounding box (that is, the vertex confidence), which can improve the accuracy of determining the vertex confidence.
  • the occlusion probability of each vertex in the bounding box of the point cloud is determined according to the two-dimensional image of the target object; the vertex confidence is determined according to the occlusion probability.
  • the vertex confidence is smaller.
  • the occlusion probability of the vertex is used to describe the probability that the vertex is occluded, that is, the probability that the vertex can be used to restore the outline.
  • the magnitude relationship between the occlusion probabilities of the vertex d1, vertex d2, vertex d3 and vertex d4 can be determined according to the two-dimensional image as vertex d1 ⁇ vertex d2 ⁇ vertex d3 ⁇ vertex d4.
  • vertex d1> vertex d2> vertex d3> vertex d4 the size relationship between the vertex confidences of each vertex in vertex d1, vertex d2, vertex d3 and vertex d4 is vertex d1> vertex d2> vertex d3> vertex d4, therefore, The key vertex can be determined to be vertex d1.
  • the two-dimensional image can reflect the situation that the target object is occluded.
  • the probability of occlusion is larger, it means that the probability that the vertex can be restored is smaller, that is, the overall confidence is smaller.
  • the probability of each vertex being occluded (that is, the occlusion probability) can be determined according to the two-dimensional graph, and then the vertex confidence can be determined according to the occlusion probability, which can improve the accuracy of determining the vertex confidence.
  • the cloud data determines the occlusion probability of the vertices of the point cloud bounding box, and then determines the vertex confidence according to the occlusion probability, and also determines the vertex confidence according to the number and distance of the point cloud (for example, obtain the weighted average value corresponding to the number and distance of the point cloud, Determine vertex confidence by weighted average, etc.); or determine key vertices according to point cloud data and distance (for example, when it is determined that the maximum number of point clouds corresponds to 2 or more vertices, it can be determined according to the 2 or The distance between two or more vertices and the acquisition device, the vertex with the smallest distance is used as the key vertex; or when it is determined that the smallest distance corresponds to two or more vert
  • the vertex confidence is determined by the weighted average, etc.); or key vertices are determined according to the number of point clouds and occlusion probability (for example, when it is determined that the maximum number of point clouds corresponds to 2 or more vertices, according to the 2 here
  • occlusion probability of two or more vertices the vertex with the smallest occlusion probability is used as the key vertex; or when it is determined that the minimum occlusion probability corresponds to 2 or more vertices, the 2 or more vertices can be The number of point clouds corresponding to the vertices of , and the vertices corresponding to the maximum number of point clouds are regarded as key vertices, etc.) and so on.
  • each vertex can connect three edges.
  • the three combined edges in the point cloud bounding box that intersect with the key vertices may be called three reference edges, that is, the reference edges are the edges in the point cloud bounding box that connect with the key vertices.
  • the key high side is the side corresponding to the height of the bounding box of the object.
  • the height of the bounding box of the object and the height of the bounding box of the point cloud are both parallel to the z direction of the bounding box of the point cloud. It can be determined that among the three reference edges The edge corresponding to the z direction is the critical high edge.
  • the long side or the wide side corresponding to the moving direction in the point cloud bounding box (object bounding box) can be determined.
  • the long edge corresponding to the moving direction can be determined according to the object type, the long edge corresponding to the moving direction can be selected as the key long edge of the point cloud bounding box among the three reference edges that intersect with the key vertices.
  • the broadside corresponding to the moving direction can be selected from the three reference edges intersecting with the key vertices as the key broadside of the point cloud bounding box. It can be understood that each of the key long side, the key wide side and the key high side is a reference edge except the other two reference edges among the three reference edges respectively.
  • the reference edge other than the reference edge corresponding to the moving direction and the reference edge corresponding to the z direction among the three reference edges can be determined as the key broad edge.
  • the critical broad side corresponding to the moving direction can be determined according to the object type
  • the reference side except the reference side corresponding to the moving direction and the reference side corresponding to the z direction among the three reference sides can be determined as the critical long side.
  • the moving direction corresponds to the direction of the long side, that is, the side corresponding to the moving direction among the three reference sides can be determined as the key long side.
  • the key vertex is d1
  • the edge intersecting with the key vertex d1 is the key long edge, that is, the key
  • the long side is the line segment L1.
  • the edge intersecting with the key vertex d1 is the key high edge, that is, the line segment L3 in the point cloud bounding box 30 is determined as the key high edge.
  • the remaining reference edges are key broad edges. Therefore, the edge that intersects with the key vertex d1 in the point cloud bounding box, and the remaining line segment L2, is the key broad edge.
  • This application does not limit which side of the key long side, the key wide side and the key high side is determined.
  • the key long side, the key wide side and the key high side can be determined respectively, and the incomplete point cloud data can also be analyzed. Get the edge that needs to be extended. For example, determine the target size of the target object in the point cloud image according to the object type or specific object type (for example, vehicle type, aircraft type, ship type, robot type, etc.), and the target size includes the length corresponding to the length, width and height. .
  • the critical long side and/or the critical broad side can be determined according to the movement direction and the z-direction, and the critical high side can be determined according to the z-direction.
  • the key long side, the key wide side and the key high side that need to be expanded can be determined. side.
  • the vertex confidence of each vertex in the bounding box of the point cloud is determined first, and then the vertex corresponding to the maximum vertex confidence is used as the key vertex. That is to say, first take the vertices in the point cloud bounding box that are most likely to coincide with the vertices in the object bounding box as the key vertices, and then determine the three combined edges that intersect with the key vertices as the three reference edges, and then according to the moving direction and/or z Among the three reference edges for direction determination, the edges corresponding to the length, width and height of the bounding box of the object are used as the key long edge, key wide edge and key high edge respectively, which can improve the accuracy of determining extended constraint information.
  • the second is to determine the overall confidence of the three combined edges that intersect each vertex in the point cloud bounding box; determine the three combined edges corresponding to the maximum value of the overall confidence as the three reference edges; determine the vertex where the three reference edges intersect as the key Vertices; determine at least one of the critical long side, the critical wide side, and the critical high side of the three reference edges according to the movement direction and/or the z-direction.
  • the overall confidence is used to describe the probability that the three combined edges are the edges of the object bounding box of the target object.
  • This application does not limit the method for determining the overall confidence.
  • the overall confidence of the three combined edges is determined according to the number of point clouds corresponding to the three combined edges in the point cloud bounding box that intersect each vertex. , when the number of point clouds is larger, the overall confidence is larger.
  • the number of point clouds corresponding to the three combined edges intersecting each vertex in the point cloud bounding box can be understood as the number of point clouds within the preset range corresponding to each vertex and the three combined edges, and the preset range can be three A 1/4 sphere, a Mitsubishi cone or a cube, etc., formed by connecting points whose distances from the vertices differ by the same threshold in the combined edge are not limited here.
  • three combined edges corresponding to a vertex corresponding to a plane in the point cloud bounding box may be selected. As shown in (a) in FIG.
  • the vertex d1, vertex d2, Vertex d3 and vertex d4 the three combined edges corresponding to vertex d1 are L1, L2 and L3, the three combined edges corresponding to vertex d2 are L2, L7 and L6, the three combined edges corresponding to vertex d3 are L1, L4 and L5, and the three combined edges corresponding to vertex d3 are L1, L4 and L5.
  • the three combined edges corresponding to d4 are L5, L6 and L8.
  • the size relationship between the number of point clouds corresponding to the three combined edges that intersect each vertex is: three combined edges corresponding to vertex d1 > three combined edges corresponding to vertex d2 > three combined edges corresponding to vertex d3 > three combined edges corresponding to vertex d4 Combine edges.
  • the relationship between the overall confidences of the three combined edges corresponding to each of the vertex d1, vertex d2, vertex d3 and vertex d4 is the three combined edges corresponding to vertex d1>
  • the point cloud can reflect the collected information of the target object.
  • determining the probability that the three combined edges are the edges of the object bounding box ie, the overall confidence
  • the overall confidence level of the three combined edges in the point cloud bounding box intersecting the vertex is determined according to the distance between the vertex of the point cloud bounding box and the acquisition device. When the distance is smaller, the overall confidence level bigger.
  • the acquisition device collects incomplete point cloud data
  • the acquisition device is a device that collects incomplete point cloud data.
  • the acquisition device may also be a device for acquiring a two-dimensional image of the target object, which is not limited herein.
  • the distance between the vertices of the point cloud bounding box and the acquisition device can be calculated by the three-dimensional coordinates corresponding to the vertices of the point cloud bounding box and the three-dimensional coordinates corresponding to the acquisition device (which can be a lidar sensor in the acquisition device).
  • the 3D coordinates corresponding to the vertices of the object bounding box corresponding to the vertices of the point cloud bounding box and the 3D coordinates corresponding to the acquisition device can be calculated and obtained, which are not limited here.
  • the relationship between the overall confidences of the three combined edges corresponding to each of the vertex d1, vertex d2, vertex d3 and vertex d4 is that the three combined edges corresponding to vertex d1 > vertex d2
  • the probability that the three combined edges corresponding to the vertex are all the edges of the object bounding box can improve the overall confidence in determining degree of accuracy.
  • the occlusion probability of the three combined edges intersecting each vertex in the point cloud bounding box is determined according to the two-dimensional image of the target object; the overall confidence is determined according to the occlusion probability. The overall confidence is smaller.
  • the occlusion probability of the three combined edges is used to describe the probability that the area corresponding to the three combined edges in the object bounding box is blocked, that is, the probability that the preset area corresponding to the three combined edges can be used to restore the outline.
  • the magnitude relationship between the occlusion probabilities of the three combined edges corresponding to each of the vertex d1, vertex d2, vertex d3 and vertex d4 can be determined in the two-dimensional image. is three combined edges corresponding to vertex d1 ⁇ three combined edges corresponding to vertex d2 ⁇ three combined edges corresponding to vertex d3 ⁇ three combined edges corresponding to vertex d4.
  • the relationship between the overall confidences of the three combined edges corresponding to each of the vertex d1, vertex d2, vertex d3 and vertex d4 is that the three combined edges corresponding to vertex d1 > vertex
  • the two-dimensional image can reflect the situation that the target object is occluded.
  • the probability of occlusion is larger, it means that the probability that the vertex can be restored is smaller, that is, the overall confidence is smaller.
  • the probability that each of the three combined edges intersecting each vertex is occluded ie, the occlusion probability
  • the overall confidence level can be determined according to the occlusion probability, which can improve the accuracy of determining the overall confidence level.
  • the point cloud determines the occlusion probability of the vertices of the point cloud bounding box, and then determines the overall confidence according to the occlusion probability, and can also determine the overall confidence according to the number and distance of the point cloud (for example, obtain the weighted average corresponding to the number and distance of the point cloud, The overall confidence is determined by the weighted average, etc.); or three key edges are determined according to the point cloud data and distance (for example, when it is determined that the maximum number of point clouds corresponds to 2 or more vertices, it can be determined according to the 2 here or the distance between two or more vertices and the acquisition device, the three combined edges that intersect with the vertex with the smallest distance are used as three key edges; or when the smallest distance is determined to correspond to two or more
  • the weighted average corresponding to the number of point clouds and distance, and determine the overall confidence by the weighted average, etc. determine three key edges according to the number of point clouds and the occlusion probability (for example, when determining the maximum number of point clouds corresponding to When there are 2 or more vertices, the three combined edges that intersect with the vertex with the smallest occlusion probability can be used as the three key edges according to the occlusion probability of the 2 or more vertices here; or when the minimum occlusion probability is determined.
  • the three combined edges that intersect the vertices corresponding to the maximum number of point clouds can be used as three keys. side etc.) etc.
  • the edge confidence of each edge in the bounding box of the point cloud can be determined, the overall confidence corresponding to the three combinations can be determined according to the edge confidences of the three combined edges intersecting at one vertex.
  • the edge confidence here is used to describe the probability that the edge corresponding to the edge confidence is the edge of the object bounding box.
  • the overall confidence can be based on the preset weights corresponding to the length, width and height respectively. Confidence is weighted.
  • the preset weights here can be set according to the importance of the length, width, and height to be expanded, etc., which are not limited here.
  • Determining the critical long side, critical wide side, and critical high side, and determining which of the critical long side, critical wide side, and critical high side, can refer to the description in the first method of determining extended constraint information, which is not repeated here. Repeat.
  • the overall confidence level of the three combined edges that intersect each vertex in the point cloud bounding box is determined first, and the three combined edges corresponding to the largest overall confidence level are used as three references. side. That is to say, the edge in the bounding box of the point cloud that is most likely to coincide with the edge in the bounding box of the object is used as the reference edge, and then the three combined edges that intersect with the key vertices are determined as the three reference edges, and then according to the moving direction and/or z Among the three reference edges for direction determination, the edges corresponding to the length, width and height of the bounding box of the object are used as the key long edge, key wide edge and key high edge respectively, which can improve the accuracy of determining extended constraint information.
  • the method further includes: determining, according to the two-dimensional image, that the unoccluded position of the target object includes boundary information of the target object.
  • the boundary information includes the vertices or edges of the object bounding box containing the target object. It can be understood that when it is determined according to the two-dimensional image that the incomplete point cloud data includes the boundary information of the target object, the expansion operation can be performed based on the boundary information. Moreover, the location of the boundary information is fixed, which is convenient for improving the accuracy of determining the extended constraint information and improving the effect of data multiplexing.
  • step S602 if it is determined according to the two-dimensional image that the unoccluded position of the target object does not include boundary information of the target object, step S602 is not performed.
  • step S602 is executed, that is, the extension constraint information of the bounding box of the point cloud is determined.
  • S603 Mark extended constraint information on the point cloud bounding box.
  • the extended constraint information is marked on the bounding box of the point cloud, which can be understood as adding extended constraint information to the marked information, for example, marking the key long side, key wide side and key high side on the marking information At least one of the key vertices and key long sides, key wide sides and key high sides, or at least one of the key vertices and key long sides, key wide sides and key high sides correspond to direction of expansion. As shown in (b) of FIG. 8 and FIG.
  • arrows A3, A4 and A5 intersecting with the key vertex d1 are marked on the point cloud bounding box 30, and each of the arrows A3, A4 and A5 are the expansion directions corresponding to the key long side L1, the key wide side L2 and the key high side L3, respectively.
  • the incomplete point cloud data corresponding to the target object is marked according to the image of the target object, so as to obtain the point cloud bounding box and the marking information of the moving direction of the target object.
  • the extended constraint information of the point cloud bounding box is determined according to the moving direction of the target object and/or the z direction of the point cloud bounding box, and the extended constraint information is marked on the point cloud bounding box, so that the actual needs can be obtained according to the extended constraint information.
  • the target bounding box is convenient to improve the utilization rate of data.
  • the point cloud bounding box contains incomplete point cloud data, and the z direction is kept in the direction corresponding to the height of the point cloud bounding box, and the z axis perpendicular to the horizontal plane is parallel, which can avoid the direction offset of the cuboid due to the acquisition angle, and improve the labeling.
  • the accuracy of the point cloud bounding box is determined according to the moving direction and/or the z-direction, so as to improve the accuracy of size processing.
  • the target object is a vehicle
  • the annotation information includes a vehicle type
  • the method further includes: determining a first size of the point cloud bounding box according to the vehicle type; Perform size processing to obtain the first target bounding box.
  • the first size may include the length, width and height of the target object represented in the image, that is, the length, width and height of the rectangular parallelepiped corresponding to the first target bounding box. Since the shape of the vehicle is not a rectangular parallelepiped, the target object contained in the rectangular parallelepiped may contain redundant space, and the first size may further include the length of each side of the cube corresponding to the target object, etc., which is not limited here.
  • the first size of the target object mapped in the point cloud data is determined according to the vehicle type, that is, the size to be obtained by processing the bounding box of the point cloud. It should be noted that since the target object in the point cloud data is incomplete point cloud data, that is to say, the image of the target object is incomplete, in most cases, the size processing operation of the point cloud bounding box is an expansion operation. .
  • the first size can also be determined according to the scaling ratio in the annotation information, and the scaling ratio can be used to obtain the size relationship between the target object in the point cloud data and the actual target object, so the target can be obtained based on the scaling ratio and the vehicle type
  • the size of the object in the point cloud image can improve the accuracy of determining the first size.
  • the first target bounding box is a bounding box obtained by size processing according to the vehicle type and extension constraint information of the point cloud bounding box.
  • the first size of the point cloud bounding box can be determined according to the vehicle type, and then the size of the point cloud bounding box is processed according to the extended constraint information and the first size to obtain the first target bounding box, which improves the performance of the point cloud bounding box.
  • the accuracy of size processing can improve the authenticity of the first target bounding box.
  • the present application does not limit the method for obtaining the first target bounding box.
  • at least one target side among the key long side, the key wide side and the key high side is determined according to the extended constraint information and the first size, and the target length of the target edge and the target extension direction; according to the target length and target extension direction, the size of the target edge and the corresponding edge of the target edge in the point cloud bounding box are processed to obtain the first target bounding box.
  • the target side is the side that needs size processing among the key long side, the key wide side, and the key high side, and the target length may include the length of the target side for size processing, or the length corresponding to the target side in the first size, etc.
  • the target extension direction is the extension direction corresponding to the key vertex and the target edge. As shown in (a) in FIG. 11 , the target extension direction may be the direction indicated by at least one arrow in the overlapping arrows of the line segment L1, the line segment L2 and the line segment L3 , that is, at least one direction of arrow A3, arrow A4, and arrow A5.
  • the edge corresponding to the target edge in the point cloud bounding box refers to the edge that needs to follow the target edge for size processing.
  • the edge corresponding to the target edge can be the edge parallel to the target edge in the cuboid. It should be noted that, when the first size includes the length of each side in the cube corresponding to the target object, the side corresponding to the target side may be another side whose range needs to be reduced.
  • the side corresponding to the line segment L1 in the box 30 further includes the line segment L6, the line segment L9 and the line segment L10.
  • the key long edge (ie the line segment L1) and the edge parallel to the line segment L11 in the point cloud bounding box 30 ie the line segment L6, the line segment L9 and the line segment L10) Perform size processing to obtain the first target bounding box 33 .
  • the first target bounding box 33 and the point cloud data corresponding to the moving direction indicated by the arrow A1 are called target point cloud data.
  • the edge corresponding to the line segment L2 in the point cloud bounding box 30 also includes the line segment L5 and the line segment L11 and line segment L12.
  • the key broadside ie the line segment L2
  • the edge parallel to the line segment L2 in the point cloud bounding box 30 ie the line segment L5, the line segment L11 and the line segment L12
  • the first target bounding box 34 and the point cloud data corresponding to the moving direction indicated by the arrow A1 are called target point cloud data.
  • the edge corresponding to the line segment L3 in the point cloud bounding box 30 also includes the line segment L4 and the line segment L7 and line segment L8.
  • the key high side ie the line segment L3
  • the edge parallel to the line segment L3 in the point cloud bounding box 30 ie the line segment L4, the line segment L7 and the line segment L8.
  • the first target bounding box 35 and the point cloud data corresponding to the moving direction indicated by the arrow A1 are called target point cloud data.
  • the above example uses one target edge for size processing.
  • the target edge and the corresponding target edge are sequentially processed according to the corresponding target size. Size processing is performed, which is not repeated here.
  • the embodiments of the present application take the target object as a vehicle for illustration, and the processing method of point cloud bounding boxes of other object types (eg, aircraft type, ship type, robot type, etc.) can refer to this method, and will not be repeated here.
  • the edge that does not meet the first size among the key long side, the key wide side and the key high side is used as the target side, and then the target length of the first size and the target extension direction are determined, so as to determine the target length according to the target length.
  • the size of each target edge and the corresponding edge of each target edge in the point cloud bounding box is processed with the target expansion direction to obtain the first target bounding box that meets the vehicle type, which improves the accuracy of the size processing of the point cloud bounding box.
  • FIG. 10 is a schematic flowchart of another data processing method provided by an embodiment of the present application.
  • the method is described by taking an electronic device as an example, and the specific process may include the following steps S1001-S1004, wherein:
  • S1001 Label the incomplete point cloud data of the target object according to the image of the target object, and obtain label information, wherein the label information includes the moving direction of the target object and the point cloud bounding box containing the incomplete point cloud data, and the point cloud bounding box
  • the z direction of is parallel to the z axis and the direction corresponding to the height of the point cloud bounding box, and the z axis is perpendicular to the horizontal plane.
  • S1002 Determine extended constraint information of the bounding box of the point cloud according to the moving direction and/or the z direction, wherein the extended constraint information includes at least one of a key long side, a key wide side and a key high side intersecting with key vertices.
  • S1003 Mark extended constraint information on the point cloud bounding box to obtain reference point cloud data.
  • steps S1001-S1003 reference may be made to the description of steps S601-S603, which will not be repeated here.
  • the data obtained by labeling the extended constraint information on the point cloud bounding box of the labeling information may be referred to as reference point cloud data. That is to say, the reference point cloud data includes the annotation information obtained in step S601 or S1001 and the extension constraint information of the bounding box of the point cloud in the annotation information obtained in step S602 or S1002. As shown in (b) of FIG. 8 , the point cloud image corresponding to the reference point cloud data is marked with the moving direction of the target object, the point cloud bounding box 30 and the extended constraint information.
  • the incomplete point cloud data corresponding to the target object is annotated according to the image of the target object, so as to obtain the point cloud bounding box and the annotation information of the moving direction of the target object. Then determine the extended constraint information of the point cloud bounding box according to the moving direction of the target object and/or the z direction of the point cloud bounding box, mark the extended constraint information on the point cloud bounding box, obtain the reference point cloud data, and store the reference point Cloud data, so that the target bounding box that meets the actual demand can be obtained according to the extended constraint information, which is convenient to further improve the utilization rate of the data.
  • the point cloud bounding box contains incomplete point cloud data, and the z direction is kept in the direction corresponding to the height of the point cloud bounding box, and the z axis perpendicular to the horizontal plane is parallel, which can avoid the direction offset of the cuboid due to the acquisition angle, and improve the labeling.
  • the accuracy of the point cloud bounding box is determined according to the moving direction and/or the z-direction, so as to improve the accuracy of size processing.
  • the target object is a vehicle
  • the annotation information further includes a vehicle type.
  • the method further includes: receiving an annotation instruction for the reference point cloud data; determining the second size of the point cloud bounding box according to the annotation instruction and the vehicle type ; Perform size processing on the point cloud bounding box according to the extended constraint information and the second size to obtain the second target bounding box.
  • the labeling instruction may include size processing accuracy and size requirements corresponding to the bounding box of the point cloud. It can be understood that there are differences in the accuracy of dimensions for different teams. For example, the dimensions of vehicles in Team 1 are required to be accurate to decimeters, and the dimensions of vehicles in Team 2 are required to be accurate to millimeters.
  • the algorithms of different teams are different, and the required target size may also be different. For example, the target size of team 1 requires length, width, and height to be 5.2, 4.3, and 2.0, respectively, and the vehicle size of team 2 requires 5.25, 3.55 , 2.00, etc.
  • the tagging instruction may also include identification information of the electronic device that sends the tagging instruction, etc., which is not limited here.
  • After acquiring the second target bounding box send the point cloud data corresponding to the second target bounding box to the electronic device that sends the labeling instruction according to the identification information.
  • the labeling instruction is used to instruct the electronic device to use the incomplete point cloud data in the reference point cloud data. It can be understood that the size of the point cloud bounding box corresponding to the incomplete point cloud data is processed, so that the incomplete point cloud data can be used for data.
  • the labeling instruction may be an instruction obtained according to the information input by the labeling person in the electronic device, or may be an instruction received from other electronic devices, which is not limited herein.
  • the second size of the target object mapped in the point cloud data is determined according to the labeling instruction and the vehicle type, that is, the size to be obtained by processing the bounding box of the point cloud. It should be noted that since the target object in the point cloud data is incomplete point cloud data, that is to say, the image of the target object is incomplete, in most cases, the size processing operation of the point cloud bounding box is an expansion operation. , but there may also be scaling operations depending on the dimensioning instructions.
  • the target size can also be determined according to the scaling ratio in the annotation information, which can be used to obtain the size relationship between the target object in the point cloud data and the actual target object, so it can be based on the scaling ratio, annotation instructions and vehicle type. Obtaining the size of the target object in the point cloud image can improve the accuracy of obtaining the second size.
  • the second target bounding box is a bounding box obtained by size processing according to the labeling instruction of the point cloud bounding box, the vehicle type, and the extended constraint information.
  • the present application does not limit the method for obtaining the second target bounding box, and reference may be made to the description of the method for obtaining the first target bounding box, which will not be repeated here.
  • the embodiments of the present application take the target object as a vehicle for illustration, and the method for processing point cloud bounding boxes of other object types (eg, aircraft type, ship type, robot type, etc.) can also refer to this method, and will not be repeated here.
  • the second size of the bounding box of the point cloud can be determined according to the labeling instruction and the vehicle type, and then the bounding box of the point cloud can be sized according to the extended constraint information and the second size.
  • the second target bounding box that satisfies the vehicle type and the labeling instruction can be obtained, the accuracy of the size processing of the point cloud bounding box is improved, and the utilization rate of data is improved.
  • FIG. 12 is a schematic structural diagram of a data processing apparatus provided by an embodiment of the present application.
  • the data processing apparatus may include a labeling unit 1201 , a determining unit 1202 , a processing unit 1203 , a storage unit 1204 , and a communication unit 1205 .
  • the communication unit 1205 can be used to receive the information collected by the collecting device, or receive the labeling instruction sent by other electronic devices, and send the target bounding box or the target bounding box obtained after data processing with other electronic devices. Data such as point cloud data of the target bounding box.
  • the communication unit 1205 may be configured to send data such as the target bounding box obtained after data processing or the point cloud data including the target bounding box to the electronic device.
  • the embodiments of the present application take the data processing apparatus as an electronic device for illustration, and the detailed description of each unit is as follows.
  • the labeling unit 1201 is used to label the incomplete point cloud data of the target object according to the image of the target object, and obtain label information, wherein the label information includes the moving direction of the target object and the point cloud bounding box containing the incomplete point cloud data.
  • the z direction of the cloud bounding box is parallel to the z axis, and parallel to the direction corresponding to the height of the point cloud bounding box, and the z axis is perpendicular to the horizontal plane;
  • the determining unit 1202 is configured to determine extension constraint information of the bounding box of the point cloud according to the moving direction and/or the z-direction, wherein the extension constraint information includes at least one of a key long side, a key wide side and a key high side intersecting with key vertices ;
  • the labeling unit 1201 is also used for labeling extended constraint information on the bounding box of the point cloud.
  • the determining unit 1202 is specifically configured to determine the vertex confidence of each vertex in the point cloud bounding box, wherein the vertex confidence is used to describe the probability that the vertex is the vertex of the object bounding box of the target object; determine The vertex corresponding to the maximum value of the vertex confidence is the key vertex; the three combined edges that intersect the key vertex in the point cloud bounding box are determined as the three reference edges; the key long edge, At least one of the critical wide side and the critical high side.
  • the determining unit 1202 is specifically configured to determine the vertex confidence of the vertex according to the number of point clouds corresponding to each vertex in the bounding box of the point cloud, wherein, the larger the number of point clouds, the higher the vertex confidence ; and/or, determine the vertex confidence of the vertex according to the distance between each vertex in the point cloud bounding box and the acquisition device, wherein, when the distance is smaller, the vertex confidence is greater, and the acquisition device has collected incomplete point cloud data .
  • the determining unit 1202 is specifically configured to determine the overall confidence level of the three combined edges intersecting each vertex in the point cloud bounding box, wherein the overall confidence level is used to describe that the three combined edges are all target objects.
  • the probability of the edge of the bounding box of the object determine the three combined edges corresponding to the maximum value of the overall confidence as the three reference edges; determine the vertex where the three reference edges intersect as the key vertex; determine the three reference edges according to the moving direction and/or the z direction. At least one of the critical long side, the critical wide side, and the critical high side.
  • the determining unit 1202 is specifically configured to determine the overall confidence of the three combined edges according to the number of point clouds corresponding to the three combined edges intersecting each vertex in the point cloud bounding box, wherein, when the number of point clouds is greater When it is large, the overall confidence is larger; and/or, according to the distance between each vertex in the point cloud bounding box and the acquisition device, determine the overall confidence of the three combined edges in the point cloud bounding box that intersect the vertices, wherein, when The smaller the distance, the greater the overall confidence, and the acquisition device has collected incomplete point cloud data.
  • the target object is a vehicle
  • the annotation information also includes the vehicle type
  • the determining unit 1202 is further configured to determine the first size of the point cloud bounding box according to the vehicle type
  • the data processing apparatus further includes a processing unit 1203, configured to Size processing is performed on the bounding box of the point cloud according to the extended constraint information and the first size to obtain the first target bounding box.
  • the processing unit 1203 is specifically configured to determine, according to the first size and the expansion constraint information, at least one target side among the critical long side, the critical wide side, and the critical high side, and the target length and the target length of the at least one target side.
  • Target extension direction According to the target length and target extension direction, size processing is performed on the target edge and the edge corresponding to the target edge in the point cloud bounding box to obtain the first target bounding box.
  • the data processing apparatus further includes: the storage unit 1204 is configured to store the reference point cloud data obtained by marking the extended constraint information on the bounding box of the point cloud.
  • the target object is a vehicle
  • the labeling information also includes the vehicle type
  • the data processing apparatus further includes a communication unit 1205 and a processing unit 1203, where the communication unit 1205 is configured to receive labeling instructions for the reference point cloud data
  • the determining unit 1202 is also used to determine the second size of the point cloud bounding box according to the annotation instruction and the vehicle type; the processing unit 1203 is used to perform size processing on the point cloud bounding box according to the extended constraint information and the second size to obtain the second target bounding box.
  • each unit may also correspond to the corresponding description of the method embodiment shown in FIG. 6 or FIG. 10 .
  • FIG. 13 is a data processing apparatus provided by an embodiment of the present application.
  • the data processing apparatus includes a processor 1301 , a memory 1302 and a communication interface 1303 .
  • the processor 1301 , the memory 1302 and the communication interface 1303 communicate with each other through a bus 1304 connect.
  • the relevant functions realized by the communication unit 1205 shown in FIG. 12 can be realized through the communication interface 1303, the relevant functions realized by the storage unit 1204 shown in FIG. 12 can be realized through the memory 1302, and the labeling unit 1201,
  • the related functions implemented by the determining unit 1202 and the processing unit 1203 can be implemented by the processor 1301 .
  • the memory 1302 includes, but is not limited to, random access memory (RAM), read-only memory (ROM), erasable programmable read only memory (EPROM), or A portable read-only memory (compact disc read-only memory, CD-ROM), the memory 1302 is used for related computer programs and data.
  • the communication interface 1303 is used to receive and transmit data.
  • the processor 1301 may be one or more central processing units (central processing units, CPUs).
  • CPUs central processing units
  • the CPU may be a single-core CPU or a multi-core CPU.
  • the processor 1301 of the data processing apparatus is configured to read the computer program codes stored in the memory 1302, and perform the following operations:
  • the direction is parallel to the z-axis and the direction corresponding to the height of the point cloud bounding box, and the z-axis is perpendicular to the horizontal plane;
  • the extension constraint information includes at least one of a key long side, a key wide side and a key high side intersecting with key vertices;
  • the processor 1301 is specifically configured to perform the following operations:
  • vertex confidence is used to describe the probability that the vertex is the vertex of the object bounding box of the target object
  • At least one of the critical long side, the critical broad side and the critical high side of the three reference sides is determined according to the movement direction and/or the z direction.
  • the processor 1301 in determining the vertex confidence of each vertex in the point cloud bounding box, is specifically configured to perform the following operations:
  • the vertex confidence of a vertex is determined according to the number of point clouds corresponding to each vertex in the point cloud bounding box, wherein, when the number of point clouds is larger, the vertex confidence is larger; and/or, according to each vertex in the point cloud bounding box
  • the distance from the acquisition device determines the vertex confidence of the vertex, wherein, when the distance is smaller, the vertex confidence is larger, and the acquisition device has collected incomplete point cloud data.
  • the processor 1301 is specifically configured to perform the following operations:
  • At least one of the critical long side, the critical broad side and the critical high side of the three reference sides is determined according to the movement direction and/or the z direction.
  • the processor 1301 in determining the overall confidence of the three combined edges intersecting each vertex in the point cloud bounding box, is specifically configured to perform the following operations:
  • the overall confidence of the three combined edges is determined according to the number of point clouds corresponding to the three combined edges intersecting each vertex in the point cloud bounding box, wherein the greater the number of point clouds, the greater the overall confidence; and/or, according to The distance between each vertex in the point cloud bounding box and the acquisition device determines the overall confidence of the three combined edges in the point cloud bounding box that intersect with the vertex. Incomplete point cloud data.
  • the target object is a vehicle
  • the annotation information further includes the vehicle type
  • the processor 1301 is further configured to perform the following operations:
  • Size processing is performed on the bounding box of the point cloud according to the extended constraint information and the first size to obtain the first target bounding box.
  • the processor 1301 is specifically configured to perform the following operations:
  • size processing is performed on the target edge and the edge corresponding to the target edge in the point cloud bounding box to obtain the first target bounding box.
  • the processor 1301 is further configured to perform the following operations:
  • the target object is a vehicle
  • the annotation information further includes the vehicle type
  • the processor 1301 is further configured to perform the following operations:
  • each operation may also correspond to the corresponding description with reference to the method embodiment shown in FIG. 6 or FIG. 10 .
  • Embodiments of the present application further provide a computer-readable storage medium, where a computer program is stored in the computer-readable storage medium, and when the computer program is executed on an electronic device, the method flow shown in FIG. 6 or FIG. 10 is implemented.
  • the embodiment of the present application further provides a computer program product, when the computer program product runs on an electronic device, the method flow shown in FIG. 6 or FIG. 10 is implemented.
  • An embodiment of the present application further provides a chip, including a processor, configured to call and execute instructions stored in the memory from the memory, so that the terminal device with the chip installed executes the method shown in FIG. 6 or FIG. 10 .
  • the embodiment of the present application further provides another chip, which may be a chip in a terminal device or an access network device, and the chip includes: an input interface, an output interface, and a processing circuit, and the input interface, the output interface and the circuit pass through the internal The connection paths are connected, and the processing circuit is used to execute the method shown in FIG. 6 or FIG. 10 .
  • the embodiment of the present application further provides another chip, including: an input interface, an output interface, a processor, and optionally, a memory.
  • the input interface, the output interface, the processor, and the memory are connected through an internal connection path, and the processor uses an internal connection path.
  • the processor For executing the code in the memory, when the code is executed, the processor is used to execute the method shown in FIG. 6 or FIG. 10 .
  • An embodiment of the present application further provides a chip system, the chip system includes at least one processor, a memory, and an interface circuit, the memory, the transceiver, and the at least one processor are interconnected by lines, and the at least one memory
  • a computer program is stored in the computer; when the computer program is executed by the processor, the method flow shown in FIG. 6 or FIG. 10 is realized.
  • extended constraint information for size processing is added to the point cloud bounding box, so that the target bounding box that meets the actual needs can be obtained according to the extended constraint information, which is convenient to improve the utilization rate of data .
  • the point cloud bounding box contains incomplete point cloud data, and the z direction is kept in the direction corresponding to the height of the point cloud bounding box, and the z axis perpendicular to the horizontal plane is parallel, which can avoid the direction offset of the cuboid due to the acquisition angle, and improve the labeling.
  • the accuracy of the point cloud bounding box is determined according to the moving direction and/or the z-direction, so as to improve the accuracy of size processing.
  • the aforementioned storage medium includes: ROM or random storage memory RAM, magnetic disk or optical disk and other mediums that can store computer program codes.

Abstract

A data processing method and apparatus, and a storage medium. The method comprises: labeling incomplete point cloud data of a target object according to an image of the target object, so as to obtain labeling information, wherein the labeling information comprises the moving direction of the target object and a point cloud bounding box which includes the incomplete point cloud data, the z direction of the point cloud bounding box being parallel to a z axis and being parallel to the direction that corresponds to the height of the point cloud bounding box, and the z axis being perpendicular to a horizontal plane (S601); determining expansion constraint information of the point cloud bounding box according to the moving direction and/or the z direction, wherein the expansion constraint information comprises at least one of a key long side, a key wide side and a key high side, which intersect at a key vertex (S602); and labeling the point cloud bounding box with the expansion constraint information (S603). By means of the method, expansion constraint information of a point cloud bounding box can be labeled on the point cloud bounding box that includes incomplete point cloud data of a target object, thereby improving the accuracy of size processing and the utilization rate of data.

Description

数据处理方法及相关装置Data processing method and related device
本申请要求于2020年12月29日提交中国专利局、申请号为202011613732.X、申请名称为“数据处理方法及相关装置”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of the Chinese patent application with the application number 202011613732.X and the application name "Data Processing Method and Related Apparatus" filed with the China Patent Office on December 29, 2020, the entire contents of which are incorporated herein by reference middle.
技术领域technical field
本申请涉及自动驾驶技术领域,尤其涉及一种数据处理方法及相关装置。The present application relates to the technical field of automatic driving, and in particular, to a data processing method and a related device.
背景技术Background technique
随着自动驾驶技术的发展,一些无人行驶车辆或无人机等设备已渐渐落地。为了得到好的感知和控制模型,算法团队需要丰富且有效的数据集,其背后离不开数据标注信息的支持。然而,在采集设备采集图像(例如,点云数据、二维图像等)时,对于距离远或者有遮挡的物体,只能采集到部分的点云数据,即非完整点云数据。由于部分的点云数据不能还原物体的完整轮廓,导致该部分的点云数据难以被使用。With the development of autonomous driving technology, some devices such as unmanned vehicles or drones have gradually landed. In order to obtain a good perception and control model, the algorithm team needs a rich and effective data set, which is inseparable from the support of data annotation information. However, when the acquisition device acquires images (eg, point cloud data, two-dimensional images, etc.), for objects that are far away or occluded, only part of the point cloud data, that is, incomplete point cloud data, can be acquired. Since part of the point cloud data cannot restore the complete outline of the object, it is difficult to use this part of the point cloud data.
发明内容SUMMARY OF THE INVENTION
本申请实施例公开了一种数据处理方法及相关装置,能够在包含目标物体的非完整点云数据的点云包围盒上标注该点云包围盒的扩展约束信息,便于提高尺寸处理的准确率以及数据的使用率。The embodiment of the present application discloses a data processing method and a related device, which can mark extended constraint information of the point cloud bounding box on a point cloud bounding box containing incomplete point cloud data of a target object, so as to improve the accuracy of size processing and data usage.
第一方面,本申请实施例公开了一种数据处理方法,其中:根据目标物体的图像对目标物体的非完整点云数据进行标注,得到标注信息,该标注信息包括目标物体的移动方向以及包含非完整点云数据的点云包围盒,点云包围盒的z方向与z轴平行,且与点云包围盒的高对应的方向平行,z轴与水平面垂直;根据移动方向和/或z方向确定点云包围盒的扩展约束信息,该扩展约束信息包括相交于关键顶点的关键长边、关键宽边和关键高边中的至少一项;在点云包围盒上标注扩展约束信息。也就是说,在点云包围盒上添加用于点云包围盒进行尺寸处理的扩展约束信息,从而可根据该扩展约束信息获取满足实际需求的目标包围盒,便于提高数据的使用率。点云包围盒包含非完整点云数据,且保持z方向与点云包围盒的高对应的方向,以及与水平面垂直的z轴平行,可避免由于采集角度导致长方体的方向偏移,提高了标注点云包围盒的准确率。又扩展约束信息是根据移动方向和/或z方向确定的,便于提高尺寸处理的准确率。In a first aspect, an embodiment of the present application discloses a data processing method, wherein: the incomplete point cloud data of the target object is annotated according to the image of the target object, and the annotation information is obtained, and the annotation information includes the moving direction of the target object and the Point cloud bounding box of incomplete point cloud data, the z direction of the point cloud bounding box is parallel to the z axis, and parallel to the direction corresponding to the height of the point cloud bounding box, and the z axis is perpendicular to the horizontal plane; according to the moving direction and/or z direction The extended constraint information of the point cloud bounding box is determined, the extended constraint information includes at least one of the critical long side, the critical wide side and the critical high side intersecting with the key vertices; the extended constraint information is marked on the point cloud bounding box. That is to say, the expansion constraint information for size processing of the point cloud bounding box is added to the point cloud bounding box, so that the target bounding box that meets the actual needs can be obtained according to the expanded constraint information, which is convenient to improve the utilization rate of data. The point cloud bounding box contains incomplete point cloud data, and the z direction is kept in the direction corresponding to the height of the point cloud bounding box, and the z axis perpendicular to the horizontal plane is parallel, which can avoid the direction offset of the cuboid due to the acquisition angle, and improve the labeling. The accuracy of the point cloud bounding box. Furthermore, the extended constraint information is determined according to the moving direction and/or the z-direction, so as to improve the accuracy of size processing.
在一种可能的示例中,根据移动方向和/或z方向确定点云包围盒的扩展约束信息包括:确定点云包围盒中每个顶点的顶点置信度,该顶点置信度用于描述顶点为目标物体的物体包围盒的顶点的概率;确定顶点置信度的最大值对应的顶点为关键顶点;确定点云包围盒中与关键顶点相交的三条组合边为三条参考边;根据移动方向和/或z方向确定三条参考边中的关键长边、关键宽边和关键高边中的至少一项。也就是说,先将点云包围盒中最可能和物体包围盒中的顶点重合的顶点作为关键顶点,再根据移动方向和/或z方向确定与关键顶点相连接的三条参考边中与物体包围盒的长、宽、高对应的边分别作为关键长边、关键宽边和关键高边,可提高确定扩展约束信息的准确率。In a possible example, determining the extended constraint information of the bounding box of the point cloud according to the moving direction and/or the z direction includes: determining the vertex confidence of each vertex in the bounding box of the point cloud, where the vertex confidence is used to describe that the vertex is The probability of the vertices of the object bounding box of the target object; the vertex corresponding to the maximum vertex confidence is determined as the key vertex; the three combined edges in the point cloud bounding box that intersect the key vertex are determined as three reference edges; according to the moving direction and/or The z-direction determines at least one of a critical long side, a critical wide side, and a critical high side among the three reference sides. That is to say, the vertices in the point cloud bounding box that are most likely to coincide with the vertices in the object bounding box are used as key vertices, and then the three reference edges connected to the key vertices are determined according to the moving direction and/or the z direction to surround the object. The sides corresponding to the length, width and height of the box are used as the key long side, the key wide side and the key high side respectively, which can improve the accuracy of determining the extended constraint information.
在一种可能的示例中,确定点云包围盒中每个顶点的顶点置信度包括:根据点云包围盒中每个顶点对应的点云数量确定顶点的顶点置信度,其中,当点云数量越大时,顶点置信度越大;和/或,根据点云包围盒中每个顶点与采集设备之间的距离确定顶点的顶点置信度,其中,当距离越小时,顶点置信度越大,采集设备采集了非完整点云数据。可以理解,点云可体现目标物体被采集到的信息,采集设备与点云之间的距离越近时,采集的点云的准确率越高。在该示例中,根据顶点对应的点云数量和/或顶点与采集设备之间的距离确定该顶点为物体包围盒的顶点的概率(即顶点置信度),可提高确定顶点置信度的准确率。In a possible example, determining the vertex confidence of each vertex in the point cloud bounding box includes: determining the vertex confidence of the vertex according to the number of point clouds corresponding to each vertex in the point cloud bounding box, wherein, when the number of point clouds When the value is larger, the vertex confidence is higher; and/or, the vertex confidence of the vertex is determined according to the distance between each vertex in the point cloud bounding box and the acquisition device, wherein, when the distance is smaller, the vertex confidence is higher, The acquisition device collects incomplete point cloud data. It can be understood that the point cloud can reflect the information collected by the target object. The closer the distance between the collection device and the point cloud, the higher the accuracy of the point cloud collected. In this example, the probability that the vertex is a vertex of the object bounding box (ie, the vertex confidence) is determined according to the number of point clouds corresponding to the vertex and/or the distance between the vertex and the acquisition device, which can improve the accuracy of determining the vertex confidence. .
在一种可能的示例中,根据移动方向和/或z方向确定点云包围盒的扩展约束信息包括:确定点云包围盒中相交于每个顶点的三条组合边的整体置信度,该整体置信度用于描述三条组合边均为目标物体的物体包围盒的边的概率;确定整体置信度的最大值对应的三条组合边为三条参考边;确定三条参考边相交的顶点为关键顶点;根据移动方向和/或z方向确定三条参考边中的关键长边、关键宽边和关键高边中的至少一项。也就是说,先将点云包围盒中最可能和物体包围盒中的边重合的三条组合边作为三条参考边,再根据移动方向和/或z方向确定三条参考边中与物体包围盒的长、宽、高对应的边分别作为关键长边、关键宽边和关键高边,可提高确定扩展约束信息的准确率。In a possible example, determining the extended constraint information of the bounding box of the point cloud according to the moving direction and/or the z direction includes: determining the overall confidence of three combined edges intersecting with each vertex in the bounding box of the point cloud, the overall confidence The degree is used to describe the probability that the three combined edges are the edges of the object bounding box of the target object; the three combined edges corresponding to the maximum value of the overall confidence are determined as the three reference edges; the vertices where the three reference edges intersect are determined as key vertices; according to the movement The direction and/or the z-direction determines at least one of a critical long side, a critical broad side, and a critical high side of the three reference sides. That is to say, the three combined edges in the point cloud bounding box that are most likely to coincide with the edges in the object bounding box are used as the three reference edges, and then the length of the three reference edges and the object bounding box is determined according to the moving direction and/or the z direction. The edges corresponding to , width and height are used as the key long edge, the key wide edge and the key high edge respectively, which can improve the accuracy of determining the extended constraint information.
在一种可能的示例中,确定点云包围盒中相交于每个顶点的三条组合边的整体置信度包括:根据点云包围盒中相交于每个顶点的三条组合边对应的点云数量确定三条组合边的整体置信度,其中,当点云数量越大时,整体置信度越大;和/或,根据点云包围盒中每个顶点与采集设备之间的距离确定点云包围盒中相交于顶点的三条组合边的整体置信度,其中,当距离越小时,整体置信度越大,采集设备采集了非完整点云数据。可以理解,点云可体现目标物体被采集到的信息,采集设备与点云之间的距离越近时,采集的点云的准确率越高。在该示例中,根据点云包围盒中相交于一个顶点的三条组合边对应的点云数量,和/或三条组合边相交的顶点与采集设备之间的距离确定该三条组合边均为物体包围盒的边的概率(即整体置信度),可提高确定整体置信度的准确率。In a possible example, determining the overall confidence of the three combined edges in the point cloud bounding box intersecting with each vertex includes: determining according to the number of point clouds corresponding to the three combined edges in the point cloud bounding box intersecting with each vertex The overall confidence of the three combined edges, where the larger the number of point clouds, the larger the overall confidence; and/or, according to the distance between each vertex in the point cloud bounding box and the acquisition device, determine the point cloud bounding box in the The overall confidence of the three combined edges that intersect the vertex, where the smaller the distance, the greater the overall confidence, and the acquisition device has collected incomplete point cloud data. It can be understood that the point cloud can reflect the information collected by the target object. The closer the distance between the collection device and the point cloud, the higher the accuracy of the point cloud collected. In this example, according to the number of point clouds corresponding to the three combined edges intersecting one vertex in the point cloud bounding box, and/or the distance between the vertexes intersected by the three combined edges and the acquisition device, it is determined that the three combined edges are all surrounded by objects The probability of the edges of the box (ie, the overall confidence), which improves the accuracy of determining the overall confidence.
在一种可能的示例中,目标物体为车辆,标注信息还包括车辆类型,该方法还包括:根据车辆类型确定点云包围盒的第一尺寸;根据扩展约束信息和第一尺寸对点云包围盒进行尺寸处理,得到第一目标包围盒。如此,提高了点云包围盒进行尺寸处理的准确率,可提高第一目标包围盒的真实性。In a possible example, the target object is a vehicle, and the annotation information further includes a vehicle type. The method further includes: determining a first size of the point cloud bounding box according to the vehicle type; enclosing the point cloud according to the extended constraint information and the first size The box is dimensioned to obtain the first target bounding box. In this way, the accuracy of the size processing of the point cloud bounding box is improved, and the authenticity of the first target bounding box can be improved.
在一种可能的示例中,根据扩展约束信息和第一尺寸对点云包围盒进行尺寸处理,得到第一目标包围盒包括:根据扩展约束信息和第一尺寸确定关键长边、关键宽边和关键高边中的至少一条目标边,以及至少一条目标边的目标长度和目标扩展方向;根据目标长度和目标扩展方向,对点云包围盒中目标边和目标边对应的边进行尺寸处理,得到第一目标包围盒。如此,基于第一尺寸和扩展约束信息获取满足车辆类型的第一目标包围盒,提高了点云包围盒进行尺寸处理的准确率。In a possible example, performing size processing on the bounding box of the point cloud according to the extended constraint information and the first size, and obtaining the first target bounding box includes: determining the key long side, the key wide side and the key width according to the extended constraint information and the first size At least one target edge in the key high side, as well as the target length and target extension direction of at least one target edge; according to the target length and target extension direction, the size of the target edge and the corresponding edge of the target edge in the point cloud bounding box are processed to obtain The first target bounding box. In this way, the first target bounding box satisfying the vehicle type is obtained based on the first size and the expansion constraint information, which improves the accuracy of the size processing of the point cloud bounding box.
在一种可能的示例中,该方法还包括:对在点云包围盒上标注扩展约束信息得到的参考点云数据进行存储。如此,便于进一步提高数据的使用率。In a possible example, the method further includes: storing the reference point cloud data obtained by marking the extended constraint information on the point cloud bounding box. In this way, it is convenient to further improve the utilization rate of data.
在一种可能的示例中,目标物体为车辆,标注信息还包括车辆类型,该方法还包括:接收针对参考点云数据的标注指令;根据标注指令和车辆类型确定点云包围盒的第二尺寸;根据扩展约束信息和第二尺寸对点云包围盒进行尺寸处理,得到第二目标包围盒。如此,基于第二尺寸和扩展约束信息获取满足标注指令和车辆类型的第二目标包围盒,提高了点云包围盒进行尺寸处理的准确率,且提高了数据的使用率。In a possible example, the target object is a vehicle, and the annotation information further includes a vehicle type. The method further includes: receiving an annotation instruction for the reference point cloud data; determining the second size of the point cloud bounding box according to the annotation instruction and the vehicle type ; Perform size processing on the point cloud bounding box according to the extended constraint information and the second size to obtain the second target bounding box. In this way, the second target bounding box that satisfies the labeling instruction and the vehicle type is obtained based on the second size and the extended constraint information, which improves the accuracy of size processing of the point cloud bounding box and improves the data usage rate.
第二方面,本申请实施例公开了一种数据处理装置,其中:标注单元用于根据目标物体的图像对目标物体的非完整点云数据进行标注,得到标注信息,该标注信息包括目标物体的移动方向以及包含非完整点云数据的点云包围盒,点云包围盒的z方向与z轴平行,且与点云包围盒的高对应的方向平行,z轴与水平面垂直;确定单元用于根据移动方向和/或z方向确定点云包围盒的扩展约束信息,该扩展约束信息包括相交于关键顶点的关键长边、关键宽边和关键高边中的至少一项;标注单元还用于在点云包围盒上标注扩展约束信息。也就是说,在点云包围盒上添加用于点云包围盒进行尺寸处理的扩展约束信息,从而可根据该扩展约束信息获取满足实际需求的目标包围盒,便于提高数据的使用率。且点云包围盒包含非完整点云数据,且保持z方向与点云包围盒的高对应的方向,以及与水平面垂直的z轴平行,可避免由于采集角度导致长方体的方向偏移,提高了标注点云包围盒的准确率。又扩展约束信息是根据移动方向和/或z方向确定的,便于提高尺寸处理的准确率。In a second aspect, an embodiment of the present application discloses a data processing device, wherein the labeling unit is configured to label incomplete point cloud data of the target object according to an image of the target object, and obtain labeling information, where the labeling information includes the target object's data. The moving direction and the point cloud bounding box containing incomplete point cloud data, the z direction of the point cloud bounding box is parallel to the z axis and the direction corresponding to the height of the point cloud bounding box, and the z axis is perpendicular to the horizontal plane; the determination unit is used for The extended constraint information of the bounding box of the point cloud is determined according to the moving direction and/or the z direction, the extended constraint information includes at least one of the key long side, the key wide side and the key high side intersecting with the key vertices; the labeling unit is also used for Annotate extended constraint information on the point cloud bounding box. That is to say, the expansion constraint information for size processing of the point cloud bounding box is added to the point cloud bounding box, so that the target bounding box that meets the actual needs can be obtained according to the expanded constraint information, which is convenient to improve the utilization rate of data. And the point cloud bounding box contains incomplete point cloud data, and the z direction is kept in the direction corresponding to the height of the point cloud bounding box, and the z axis perpendicular to the horizontal plane is parallel, which can avoid the direction offset of the cuboid due to the acquisition angle, and improve the Accuracy of labeling point cloud bounding boxes. Furthermore, the extended constraint information is determined according to the moving direction and/or the z-direction, so as to improve the accuracy of size processing.
在一种可能的示例中,确定单元具体用于确定点云包围盒中每个顶点的顶点置信度,其中,顶点置信度用于描述顶点为目标物体的物体包围盒的顶点的概率;确定顶点置信度的最大值对应的顶点为关键顶点;确定点云包围盒中与关键顶点相交的三条组合边为三条参考边;根据移动方向和/或z方向确定三条参考边中的关键长边、关键宽边和关键高边中的至少一项。也就是说,先将点云包围盒中最可能和物体包围盒中的顶点重合的顶点作为关键顶点,再根据移动方向和/或z方向确定与关键顶点相连接的三条参考边中与物体包围盒的长、宽、高对应的边分别作为关键长边、关键宽边和关键高边,可提高确定扩展约束信息的准确率。In a possible example, the determining unit is specifically configured to determine the vertex confidence of each vertex in the point cloud bounding box, wherein the vertex confidence is used to describe the probability that the vertex is the vertex of the object bounding box of the target object; determine the vertex The vertex corresponding to the maximum confidence value is the key vertex; determine the three combined edges that intersect the key vertex in the point cloud bounding box as the three reference edges; determine the key long edge, key edge, and key edge among the three reference edges according to the moving direction and/or the z direction At least one of wide side and critical high side. That is to say, the vertices in the point cloud bounding box that are most likely to coincide with the vertices in the object bounding box are used as key vertices, and then the three reference edges connected to the key vertices are determined according to the moving direction and/or the z direction to surround the object. The sides corresponding to the length, width and height of the box are used as the key long side, the key wide side and the key high side respectively, which can improve the accuracy of determining the extended constraint information.
在一种可能的示例中,确定单元具体用于根据点云包围盒中每个顶点对应的点云数量确定顶点的顶点置信度,其中,当点云数量越大时,顶点置信度越大;和/或,根据点云包围盒中每个顶点与采集设备之间的距离确定顶点的顶点置信度,其中,当距离越小时,顶点置信度越大,采集设备采集了非完整点云数据。可以理解,点云可体现目标物体被采集到的信息,采集设备与点云之间的距离越近时,采集的点云的准确率。在该示例中,根据顶点对应的点云数量和/或顶点与采集设备之间的距离确定该顶点为物体包围盒的顶点的概率(即顶点置信度),可提高确定顶点置信度的准确率。In a possible example, the determining unit is specifically configured to determine the vertex confidence of the vertex according to the number of point clouds corresponding to each vertex in the point cloud bounding box, wherein, when the number of point clouds is larger, the vertex confidence is larger; And/or, the vertex confidence of the vertex is determined according to the distance between each vertex in the point cloud bounding box and the acquisition device, wherein, when the distance is smaller, the vertex confidence is larger, and the acquisition device has collected incomplete point cloud data. It can be understood that the point cloud can reflect the information collected by the target object. The closer the distance between the collection device and the point cloud, the accuracy of the point cloud collected. In this example, the probability that the vertex is a vertex of the object bounding box (ie, the vertex confidence) is determined according to the number of point clouds corresponding to the vertex and/or the distance between the vertex and the acquisition device, which can improve the accuracy of determining the vertex confidence. .
在一种可能的示例中,确定单元具体用于确定点云包围盒中相交于每个顶点的三条组合边的整体置信度,其中,整体置信度用于描述三条组合边均为目标物体的物体包围盒的边的概率;确定整体置信度的最大值对应的三条组合边为三条参考边;确定三条参考边相交的顶点为关键顶点;根据移动方向和/或z方向确定三条参考边中的关键长边、关键宽边和关键高边中的至少一项。也就是说,先将点云包围盒中最可能和物体包围盒中的边重合的三条组合边作为三条参考边,再根据移动方向和/或z方向确定三条参考边中与物体包围盒的长、宽、高对应的边分别作为关键长边、关键宽边和关键高边,可提高确定扩展约束信息的准确率。In a possible example, the determining unit is specifically configured to determine the overall confidence level of the three combined edges intersecting each vertex in the point cloud bounding box, wherein the overall confidence level is used to describe the object whose three combined edges are the target object The probability of the edges of the bounding box; determine the three combined edges corresponding to the maximum value of the overall confidence as the three reference edges; determine the vertex where the three reference edges intersect as the key vertex; determine the key among the three reference edges according to the movement direction and/or the z direction At least one of Long Side, Critical Wide Side, and Critical High Side. That is to say, the three combined edges in the point cloud bounding box that are most likely to coincide with the edges in the object bounding box are used as the three reference edges, and then the length of the three reference edges and the object bounding box is determined according to the moving direction and/or the z direction. The edges corresponding to , width and height are used as the key long edge, the key wide edge and the key high edge respectively, which can improve the accuracy of determining the extended constraint information.
在一种可能的示例中,确定单元具体用于根据点云包围盒中相交于每个顶点的三条组合边对应的点云数量确定三条组合边的整体置信度,其中,当点云数量越大时,整体置信度越大;和/或,根据点云包围盒中每个顶点与采集设备之间的距离确定点云包围盒中相交于顶点的三条组合边的整体置信度,其中,当距离越小时,整体置信度越大,采集设备采集了非完整点云数据。可以理解,点云可体现目标物体被采集到的信息,采集设备与点云之间的距离越近时,采集的点云的准确率。在该示例中,根据点云包围盒中相交于一个顶点的三条组合边对应的点云数量,和/或三条组合边相交的顶点与采集设备之间的距离确定该三条组合边均为物体包围盒的边的概率(即整体置信度),可提高确定整体置信度的准确率。In a possible example, the determining unit is specifically configured to determine the overall confidence of the three combined edges according to the number of point clouds corresponding to the three combined edges intersecting each vertex in the point cloud bounding box, wherein, when the number of point clouds is larger, the greater the number of point clouds When , the overall confidence is greater; and/or, according to the distance between each vertex in the point cloud bounding box and the acquisition device, determine the overall confidence of the three combined edges in the point cloud bounding box that intersect with the vertices, wherein, when the distance The smaller the value, the greater the overall confidence, and the acquisition device has collected incomplete point cloud data. It can be understood that the point cloud can reflect the information collected by the target object. The closer the distance between the collection device and the point cloud, the accuracy of the point cloud collected. In this example, according to the number of point clouds corresponding to the three combined edges intersecting one vertex in the point cloud bounding box, and/or the distance between the vertexes intersected by the three combined edges and the acquisition device, it is determined that the three combined edges are all surrounded by objects The probability of the edges of the box (ie, the overall confidence), which improves the accuracy of determining the overall confidence.
在一种可能的示例中,目标物体为车辆,标注信息还包括车辆类型,确定单元还用于根 据车辆类型确定点云包围盒的第一尺寸;数据处理装置还包括处理单元,用于根据扩展约束信息和第一尺寸对点云包围盒进行尺寸处理,得到第一目标包围盒。如此,提高了点云包围盒进行尺寸处理的准确率,可提高第一目标包围盒的真实性。In a possible example, the target object is a vehicle, the annotation information further includes a vehicle type, and the determining unit is further configured to determine the first size of the bounding box of the point cloud according to the vehicle type; the data processing apparatus further includes a processing unit configured to determine the first size of the bounding box of the point cloud according to the expansion The constraint information and the first size are used to perform size processing on the bounding box of the point cloud to obtain the first target bounding box. In this way, the accuracy of the size processing of the point cloud bounding box is improved, and the authenticity of the first target bounding box can be improved.
在一种可能的示例中,处理单元具体用于根据第一尺寸和扩展约束信息确定关键长边、关键宽边和关键高边中的至少一条目标边,以及至少一条目标边的目标长度和目标扩展方向;根据目标长度和目标扩展方向,对点云包围盒中目标边和目标边对应的边进行尺寸处理,得到第一目标包围盒。如此,基于第一尺寸和扩展约束信息获取满足车辆类型的第一目标包围盒,提高了点云包围盒进行尺寸处理的准确率。In a possible example, the processing unit is specifically configured to determine, according to the first size and the expansion constraint information, at least one target side among the critical long side, the critical wide side, and the critical high side, as well as the target length and the target side of the at least one target side Expansion direction: According to the target length and the target expansion direction, size processing is performed on the target edge and the edge corresponding to the target edge in the point cloud bounding box to obtain the first target bounding box. In this way, the first target bounding box satisfying the vehicle type is obtained based on the first size and the expansion constraint information, which improves the accuracy of the size processing of the point cloud bounding box.
在一种可能的示例中,数据处理装置还包括:存储单元用于对在点云包围盒上标注扩展约束信息得到的参考点云数据进行存储。如此,便于进一步提高数据的使用率。In a possible example, the data processing apparatus further includes: a storage unit configured to store the reference point cloud data obtained by marking the extended constraint information on the point cloud bounding box. In this way, it is convenient to further improve the utilization rate of data.
在一种可能的示例中,目标物体为车辆,标注信息还包括车辆类型,数据处理装置还包括通信单元和处理单元,其中:通信单元用于接收针对参考点云数据的标注指令;确定单元,还用于根据标注指令和车辆类型确定点云包围盒的第二尺寸;处理单元用于根据扩展约束信息和第二尺寸对点云包围盒进行尺寸处理,得到第二目标包围盒。如此,基于第二尺寸和扩展约束信息获取满足标注指令和车辆类型的第二目标包围盒,提高了点云包围盒进行尺寸处理的准确率,且提高了数据的使用率。In a possible example, the target object is a vehicle, the labeling information further includes the vehicle type, and the data processing device further includes a communication unit and a processing unit, wherein: the communication unit is used for receiving labeling instructions for the reference point cloud data; the determining unit, It is also used to determine the second size of the point cloud bounding box according to the annotation instruction and the vehicle type; the processing unit is used to perform size processing on the point cloud bounding box according to the extended constraint information and the second size to obtain the second target bounding box. In this way, the second target bounding box that satisfies the labeling instruction and the vehicle type is obtained based on the second size and the extended constraint information, which improves the accuracy of size processing of the point cloud bounding box and improves the data usage rate.
第三方面,本申请实施例公开了另一种数据处理装置,包括处理器和与处理器连接的存储器,存储器用于存储一个或多个程序,并且被配置由处理器执行上述的第一方面的步骤。In a third aspect, an embodiment of the present application discloses another data processing apparatus, comprising a processor and a memory connected to the processor, the memory is used to store one or more programs, and is configured to be executed by the processor to execute the above-mentioned first aspect A step of.
第四方面,本申请提供了一种计算机可读存储介质,计算机可读存储介质中存储有指令,当其在计算机上运行时,使得计算机执行上述的第一方面的方法。In a fourth aspect, the present application provides a computer-readable storage medium, where instructions are stored in the computer-readable storage medium, when the computer-readable storage medium runs on a computer, the computer executes the method of the first aspect.
第五方面,本申请提供了一种计算机程序产品,计算机程序产品用于存储计算机程序,当计算机程序在计算机上运行时,使得计算机执行上述的第一方面的方法。In a fifth aspect, the present application provides a computer program product. The computer program product is used to store a computer program, and when the computer program runs on a computer, the computer can execute the method of the first aspect.
第六方面,本申请提供了一种芯片,包括处理器和存储器,处理器用于从存储器中调用并运行存储器中存储的指令,使得安装有芯片的设备执行上述的第一方面的方法。In a sixth aspect, the present application provides a chip, including a processor and a memory, where the processor is configured to call and execute instructions stored in the memory from the memory, so that a device equipped with the chip executes the method of the above-mentioned first aspect.
第七方面,本申请提供了另一种芯片,包括:输入接口、输出接口和处理电路,输入接口、输出接口与处理电路之间通过内部连接通路相连,处理电路用于执行上述的第一方面的方法。In a seventh aspect, the present application provides another chip, comprising: an input interface, an output interface and a processing circuit, the input interface, the output interface and the processing circuit are connected through an internal connection path, and the processing circuit is used to execute the above-mentioned first aspect Methods.
第八方面,本申请提供了另一种芯片,包括:输入接口、输出接口、处理器,可选的,还包括存储器,输入接口、输出接口、处理器以及存储器之间通过内部连接通路相连,处理器用于执行存储器中的代码,当代码被执行时,处理器用于执行上述任一方面中的方法。In an eighth aspect, the present application provides another chip, including: an input interface, an output interface, a processor, and optionally, a memory, and the input interface, the output interface, the processor, and the memory are connected through an internal connection path, The processor is used to execute code in the memory, and when the code is executed, the processor is used to perform the method of any of the above aspects.
第九方面,本申请实施例提供一种芯片系统,包括至少一个处理器,存储器和接口电路,存储器、收发器和至少一个处理器通过线路互联,至少一个存储器中存储有计算机程序;计算机程序被处理器执行上述的第一方面中的方法。In a ninth aspect, an embodiment of the present application provides a chip system, including at least one processor, a memory and an interface circuit, the memory, the transceiver and the at least one processor are interconnected by lines, and at least one memory stores a computer program; the computer program is The processor performs the method of the first aspect described above.
附图说明Description of drawings
以下对本申请实施例用到的附图进行介绍。The accompanying drawings used in the embodiments of the present application will be introduced below.
图1是本申请实施例提供的一种数据处理系统的结构示意图;1 is a schematic structural diagram of a data processing system provided by an embodiment of the present application;
图2是本申请实施例提供的一种电子设备的结构示意图;2 is a schematic structural diagram of an electronic device provided by an embodiment of the present application;
图3是本申请实施例提供的一种标注平台中可供选择的车辆类型的示意图;3 is a schematic diagram of optional vehicle types in a labeling platform provided by an embodiment of the present application;
图4是本申请实施例提供的一种采集设备采集的二维图像和点云图像;4 is a two-dimensional image and a point cloud image collected by a collection device provided in an embodiment of the present application;
图5是现有技术提供的一种点云包围盒扩展为假想包围盒的示意图;5 is a schematic diagram of a point cloud bounding box provided by the prior art extended to an imaginary bounding box;
图6是本申请实施例提供的一种数据处理方法的流程示意图;6 is a schematic flowchart of a data processing method provided by an embodiment of the present application;
图7是本申请实施例提供的一种标注点云包围盒和移动方向的示意图;FIG. 7 is a schematic diagram of marking a point cloud bounding box and a moving direction provided by an embodiment of the present application;
图8是本申请实施例提供的一种标注扩展约束信息的示意图;8 is a schematic diagram of annotating extended constraint information provided by an embodiment of the present application;
图9是本申请实施例提供的另一种采集设备采集的二维图像和点云图像;9 is a two-dimensional image and a point cloud image collected by another collection device provided in an embodiment of the present application;
图10是本申请实施例提供的另一种数据处理方法的流程示意图;10 is a schematic flowchart of another data processing method provided by an embodiment of the present application;
图11是本申请实施例提供的一种点云包围盒进行尺寸处理的示意图;11 is a schematic diagram of size processing of a point cloud bounding box provided by an embodiment of the present application;
图12是本申请实施例提供的一种数据处理装置的结构示意图;12 is a schematic structural diagram of a data processing apparatus provided by an embodiment of the present application;
图13是本申请实施例提供的另一种数据处理装置的结构示意图。FIG. 13 is a schematic structural diagram of another data processing apparatus provided by an embodiment of the present application.
具体实施方式Detailed ways
下面结合本申请实施例中的附图对本申请实施例进行描述。The embodiments of the present application will be described below with reference to the accompanying drawings in the embodiments of the present application.
请参照图1,图1是为应用于本申请实施例的数据传输方法的系统构架图。如图1所示,该系统包括电子设备10和采集设备20。本申请对于电子设备10和采集设备20的数量不做限定。Please refer to FIG. 1 . FIG. 1 is a system architecture diagram of a data transmission method applied to an embodiment of the present application. As shown in FIG. 1 , the system includes an electronic device 10 and a collection device 20 . The present application does not limit the number of electronic devices 10 and collection devices 20 .
本申请实施例中的电子设备可以包括但不限于个人计算机、服务器计算机、手持式或膝上型设备、移动设备(比如手机、移动电话、平板电脑、个人数字助理、媒体播放器等)、消费型电子设备、小型计算机、大型计算机、移动机器人、无人机等。该电子设备可以是计算机系统(或车载系统)中的车载设备,也可以是其他的设备,在此不做限定。在图1中,电子设备10以个人计算机进行描述。The electronic devices in the embodiments of the present application may include, but are not limited to, personal computers, server computers, handheld or laptop devices, mobile devices (such as cell phones, mobile phones, tablet computers, personal digital assistants, media players, etc.), consumer electronic devices, minicomputers, mainframe computers, mobile robots, drones, etc. The electronic device may be an in-vehicle device in a computer system (or an in-vehicle system), or may be other devices, which are not limited herein. In Figure 1, the electronic device 10 is depicted as a personal computer.
请参照图2,图2是为本申请实施例提供的一种电子设备的结构示意图。如图2所示,电子设备10可以包括显示设备110、处理器120以及存储器130。其中,存储器130可用于存储软件程序以及数据,处理器120可以通过运行存储在存储器130的软件程序以及数据,从而执行电子设备10的各种功能应用以及数据处理。Please refer to FIG. 2 , which is a schematic structural diagram of an electronic device according to an embodiment of the present application. As shown in FIG. 2 , the electronic device 10 may include a display device 110 , a processor 120 and a memory 130 . The memory 130 may be used to store software programs and data, and the processor 120 may execute various functional applications and data processing of the electronic device 10 by running the software programs and data stored in the memory 130 .
存储器130可主要包括存储程序区和存储数据区,其中,存储程序区可存储操作系统、至少一个功能所需的应用程序(比如图像采集功能等)等;存储数据区可存储根据电子设备10的使用所创建的数据(比如音频数据、文本信息、图像数据等)等。此外,存储器130可以包括高速随机存取存储器,还可以包括非易失性存储器,例如至少一个磁盘存储器件、闪存器件、或其他易失性固态存储器件。The memory 130 may mainly include a stored program area and a stored data area, wherein the stored program area may store an operating system, an application program required for at least one function (such as an image acquisition function, etc.), and the like; Use the created data (such as audio data, text information, image data, etc.) and the like. Additionally, memory 130 may include high-speed random access memory, and may also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other volatile solid state storage device.
处理器120是电子设备10的控制中心,利用各种接口和线路连接整个电子设备10的各个部分,通过运行或执行存储在存储器130内的软件程序和/或数据,执行电子设备10的各种功能和处理数据,从而对电子设备10进行整体监控。处理器120可以包括一个或多个处理单元,例如:处理器120可以包括应用处理器(application processor,AP),调制解调处理器,图形处理器(graphics processing unit,GPU),图像信号处理器(image signal processor,ISP),控制器,存储器,视频编解码器,数字信号处理器(digital signal processor,DSP),基带处理器,和/或神经网络处理器(neural-network processing unit,NPU)等。其中,不同的处理单元可以是独立的器件,也可以集成在一个或多个处理器中。The processor 120 is the control center of the electronic device 10, uses various interfaces and lines to connect various parts of the entire electronic device 10, and executes various functions of the electronic device 10 by running or executing the software programs and/or data stored in the memory 130. function and process data for overall monitoring of the electronic device 10 . The processor 120 may include one or more processing units, for example, the processor 120 may include an application processor (application processor, AP), a modem processor, a graphics processor (graphics processing unit, GPU), an image signal processor (image signal processor, ISP), controller, memory, video codec, digital signal processor (digital signal processor, DSP), baseband processor, and/or neural-network processing unit (NPU) Wait. Wherein, different processing units may be independent devices, or may be integrated in one or more processors.
其中,NPU为神经网络(neural-network,NN)计算处理器,通过借鉴生物神经网络结构,例如借鉴人脑神经元之间传递模式,对输入信息快速处理,还可以不断的自学习。通过NPU可以实现电子设备10的智能认知等应用,例如:图像识别,人脸识别,语音识别,文本理解 等。Among them, the NPU is a neural-network (NN) computing processor. By borrowing the structure of biological neural networks, such as the transmission mode between neurons in the human brain, it can quickly process the input information and can continuously learn by itself. Applications such as intelligent cognition of the electronic device 10 can be realized through the NPU, for example: image recognition, face recognition, speech recognition, text understanding, etc.
在一些实施例中,处理器120可以包括一个或多个接口。接口可以包括集成电路(inter-integrated circuit,I2C)接口,集成电路内置音频(inter-integrated circuit sound,I2S)接口,脉冲编码调制(pulse code modulation,PCM)接口,通用异步收发传输器(universal asynchronous receiver/transmitter,UART)接口,移动产业处理器接口(mobile industry processor interface,MIPI),通用输入输出(general-purpose input/output,GPIO)接口,用户标识模块(subscriber identity module,SIM)接口,和/或通用串行总线(universal serial bus,USB)接口等。In some embodiments, the processor 120 may include one or more interfaces. The interface may include an integrated circuit (inter-integrated circuit, I2C) interface, an integrated circuit built-in audio (inter-integrated circuit sound, I2S) interface, a pulse code modulation (pulse code modulation, PCM) interface, a universal asynchronous transceiver (universal asynchronous transmitter) receiver/transmitter, UART) interface, mobile industry processor interface (MIPI), general-purpose input/output (GPIO) interface, subscriber identity module (SIM) interface, and / or universal serial bus (universal serial bus, USB) interface, etc.
I2C接口是一种双向同步串行总线,包括一根串行数据线(serial data line,SDA)和一根串行时钟线(derail clock line,CL)。在一些实施例中,处理器120可以包含多组I2C总线。处理器120可以通过不同的I2C总线接口分别耦合触摸传感器,充电器,闪光灯,摄像头160等。例如:处理器120可以通过I2C接口耦合触摸传感器,使处理器120与触摸传感器通过I2C总线接口通信,实现电子设备10的触摸功能。The I2C interface is a bidirectional synchronous serial bus that includes a serial data line (SDA) and a serial clock line (CL). In some embodiments, the processor 120 may contain multiple sets of I2C buses. The processor 120 can be respectively coupled to the touch sensor, the charger, the flash, the camera 160 and the like through different I2C bus interfaces. For example, the processor 120 can couple the touch sensor through the I2C interface, so that the processor 120 communicates with the touch sensor through the I2C bus interface, so as to realize the touch function of the electronic device 10 .
I2S接口可以用于音频通信。在一些实施例中,处理器120可以包含多组I2S总线。处理器120可以通过I2S总线与音频模块耦合,实现处理器120与音频模块之间的通信。在一些实施例中,音频模块可以通过I2S接口向无线保真(wireless fidelity,WiFi)模块190传递音频信号,实现通过蓝牙耳机接听电话的功能。The I2S interface can be used for audio communication. In some embodiments, the processor 120 may contain multiple sets of I2S buses. The processor 120 may be coupled with the audio module through an I2S bus to implement communication between the processor 120 and the audio module. In some embodiments, the audio module can transmit audio signals to the wireless fidelity (WiFi) module 190 through the I2S interface, so as to realize the function of answering calls through the Bluetooth headset.
PCM接口也可以用于音频通信,将模拟信号抽样,量化和编码。在一些实施例中,音频模块与WiFi模块190可以通过PCM总线接口耦合。在一些实施例中,音频模块也可以通过PCM接口向WiFi模块190传递音频信号,实现通过蓝牙耳机接听电话的功能。I2S接口和PCM接口都可以用于音频通信。The PCM interface can also be used for audio communications, sampling, quantizing and encoding analog signals. In some embodiments, the audio module and WiFi module 190 may be coupled through a PCM bus interface. In some embodiments, the audio module can also transmit audio signals to the WiFi module 190 through the PCM interface, so as to realize the function of answering calls through the Bluetooth headset. Both the I2S interface and the PCM interface can be used for audio communication.
UART接口是一种通用串行数据总线,用于异步通信。该总线可以为双向通信总线。它将要传输的数据在串行通信与并行通信之间转换。在一些实施例中,UART接口通常被用于连接处理器120与WiFi模块190。例如:处理器120通过UART接口与WiFi模块190中的蓝牙模块通信,实现蓝牙功能。在一些实施例中,音频模块可以通过UART接口向WiFi模块190传递音频信号,实现通过蓝牙耳机播放音乐的功能。The UART interface is a universal serial data bus used for asynchronous communication. The bus may be a bidirectional communication bus. It converts the data to be transmitted between serial communication and parallel communication. In some embodiments, a UART interface is typically used to connect the processor 120 and the WiFi module 190 . For example, the processor 120 communicates with the Bluetooth module in the WiFi module 190 through the UART interface to implement the Bluetooth function. In some embodiments, the audio module can transmit the audio signal to the WiFi module 190 through the UART interface, so as to realize the function of playing music through the Bluetooth headset.
MIPI接口可以被用于连接处理器120与显示设备110、摄像头160等外围器件。MIPI接口包括摄像头160串行接口(camera serial interface,CSI),显示屏串行接口(display serial interface,DSI)等。在一些实施例中,处理器120和摄像头160通过CSI接口通信,实现电子设备10的拍摄功能。处理器120和显示屏通过DSI接口通信,实现电子设备10的显示功能。The MIPI interface may be used to connect the processor 120 with peripheral devices such as the display device 110 and the camera 160 . MIPI interfaces include camera 160 serial interface (camera serial interface, CSI), display serial interface (display serial interface, DSI) and so on. In some embodiments, the processor 120 communicates with the camera 160 through a CSI interface to implement the photographing function of the electronic device 10 . The processor 120 communicates with the display screen through the DSI interface to implement the display function of the electronic device 10 .
GPIO接口可以通过软件配置。GPIO接口可以被配置为控制信号,也可被配置为数据信号。在一些实施例中,GPIO接口可以用于连接处理器120与摄像头160,显示设备110,WiFi模块190,音频模块,传感器模块等。GPIO接口还可以被配置为I2C接口,I2S接口,UART接口,MIPI接口等。The GPIO interface can be configured by software. The GPIO interface can be configured as a control signal or as a data signal. In some embodiments, the GPIO interface may be used to connect the processor 120 with the camera 160, the display device 110, the WiFi module 190, the audio module, the sensor module, and the like. The GPIO interface can also be configured as I2C interface, I2S interface, UART interface, MIPI interface, etc.
USB接口是符合USB标准规范的接口,具体可以是Mini USB接口,Micro USB接口,USB Type C接口等。USB接口可以用于连接充电器为电子设备10充电,也可以用于电子设备10与外围设备之间传输数据。也可以用于连接耳机,通过耳机播放音频。该接口还可以用于连接其他电子设备,例如,增强现实(Augmented Reality,AR)设备等。The USB interface is an interface that conforms to the USB standard specification, which can be a Mini USB interface, a Micro USB interface, a USB Type C interface, etc. The USB interface can be used to connect a charger to charge the electronic device 10, and can also be used to transmit data between the electronic device 10 and peripheral devices. It can also be used to connect headphones to play audio through the headphones. The interface can also be used to connect other electronic devices, for example, Augmented Reality (AR) devices.
可以理解的是,本发明实施例示意的各模块间的接口连接关系,只是示意性说明,并不构成对电子设备10的结构限定。在本申请另一些实施例中,电子设备10也可以采用上述实施例中不同的接口连接方式,或多种接口连接方式的组合。It can be understood that the interface connection relationship between the modules illustrated in the embodiment of the present invention is only a schematic illustration, and does not constitute a structural limitation of the electronic device 10 . In other embodiments of the present application, the electronic device 10 may also adopt different interface connection manners in the foregoing embodiments, or a combination of multiple interface connection manners.
电子设备10中还包括用于拍摄图像或视频的摄像头160。摄像头160可以是普通摄像头, 也可以是对焦摄像头。The electronic device 10 also includes a camera 160 for capturing images or videos. The camera 160 may be a common camera or a focusing camera.
电子设备10还可以包括输入设备140,用于接收输入的数字信息、字符信息或接触式触摸操作/非接触式手势,以及产生与电子设备10的用户设置以及功能控制有关的信号输入等。The electronic device 10 may further include an input device 140 for receiving input numerical information, character information or contact touch operations/non-contact gestures, and generating signal inputs related to user settings and function control of the electronic device 10 .
显示设备110,包括的显示面板,用于显示由用户输入的信息或提供给用户的信息以及电子设备10的各种菜单界面等,在本申请实施例中主要用于显示电子设备10中摄像头或者传感器采集的待检测图像。可选的,显示面板可以采用液晶显示器(liquid crystal display,LCD)或有机发光二极管(organic light-emitting diode,OLED)等形式来配置显示面板。The display device 110, including a display panel, is used to display the information input by the user or the information provided to the user and various menu interfaces of the electronic device 10, etc., and is mainly used to display the camera in the electronic device 10 or The image to be detected collected by the sensor. Optionally, the display panel may be configured in the form of a liquid crystal display (liquid crystal display, LCD) or an organic light-emitting diode (organic light-emitting diode, OLED) or the like.
电子设备10还可以包括一个或多个传感器170,例如图像传感器、红外传感器、激光传感器(可包括激光位移传感器和激光雷达传感器等)、压力传感器、陀螺仪传感器、气压传感器、磁传感器、加速度传感器、距离传感器、接近光传感器、环境光传感器、指纹传感器、触摸传感器、温度传感器、骨传导传感器等,其中图像传感器可以为飞行时间(time of flight,TOF)传感器、结构光传感器等。The electronic device 10 may also include one or more sensors 170, such as image sensors, infrared sensors, laser sensors (which may include laser displacement sensors and lidar sensors, etc.), pressure sensors, gyroscope sensors, air pressure sensors, magnetic sensors, acceleration sensors , distance sensor, proximity light sensor, ambient light sensor, fingerprint sensor, touch sensor, temperature sensor, bone conduction sensor, etc., wherein the image sensor can be time of flight (TOF) sensor, structured light sensor, etc.
除此之外,电子设备10还可以包括用于给其他模块供电的电源150。电子设备10还可以包括无线射频(radio frequency,RF)电路180,用于与无线网络设备进行网络通信,还可以包括WiFi模块190,用于与其他设备进行WiFi通信,比如,用于接收其他设备传输的图像或者数据等。In addition to this, the electronic device 10 may also include a power supply 150 for powering other modules. The electronic device 10 may further include a radio frequency (RF) circuit 180 for performing network communication with wireless network devices, and may also include a WiFi module 190 for performing WiFi communication with other devices, for example, for receiving other devices transmitted images or data, etc.
尽管并未在图2中示出,电子设备10还可以包括闪光灯、蓝牙模块、外部接口、按键、马达等其他可能的功能模块,在此不再赘述。Although not shown in FIG. 2 , the electronic device 10 may also include other possible functional modules such as a flashlight, a Bluetooth module, an external interface, a button, a motor, etc., which will not be repeated here.
本申请实施例中的采集设备可以为可移动设备,该可移动设备可以包括但不限于飞机、轮船、机器人、车辆等,也可以是道路上的设备,例如,路侧单元(road side unit,RSU)。本申请实施例描述的飞机、轮船和车辆可以是人为驾驶的设备,也可以是无人驾驶的设备,在此不做限定。在图1中,采集设备20以车辆进行描述。采集设备可包括处理器、显示设备和存储器,可参照电子设备的描述,在此不再赘述。采集设备20还可包括传感器,例如,图像拾取设备(例如,摄像头等)和激光雷达传感器等。其中,图像拾取设备用于采集二维图像。激光雷达传感器用于探测激光雷达发送的激光信号的反射信号,从而得到激光点云(或点云)。需要说明的是,采集二维图形和采集点云的采集设备可以是同一个设备,也可以是不同的设备,在此不做限定。采集设备的处理器可包括点云处理模块,用于处理点云的数据。The collection device in this embodiment of the present application may be a movable device, and the movable device may include, but is not limited to, an airplane, a ship, a robot, a vehicle, etc., and may also be a device on the road, for example, a roadside unit (roadside unit, RSU). The airplanes, ships, and vehicles described in the embodiments of the present application may be human-driven devices or unmanned devices, which are not limited herein. In Figure 1, the acquisition device 20 is depicted as a vehicle. The collection device may include a processor, a display device, and a memory, and reference may be made to the description of the electronic device, which will not be repeated here. The acquisition device 20 may also include sensors, such as image pickup devices (eg, cameras, etc.), lidar sensors, and the like. Among them, the image pickup device is used to collect two-dimensional images. The lidar sensor is used to detect the reflected signal of the laser signal sent by the lidar, thereby obtaining the laser point cloud (or point cloud). It should be noted that the collection device for collecting the two-dimensional graphics and the point cloud may be the same device or different devices, which are not limited here. The processor of the acquisition device may include a point cloud processing module for processing point cloud data.
在本申请实施例中,采集设备可用于采集目标物体的数据(例如,二维图形、点云数据、与目标物体之间的距离等中的至少一项),并将该数据发送给对应的电子设备。电子设备可用于接收来自采集设备发送的数据,并根据该数据执行本申请实施例中所描述的数据处理方法。在一种可能的示例中,由采集设备(或采集设备中的处理器或处理器中的点云处理模块等)直接执行本申请实施例中所描述的数据处理方法。In this embodiment of the present application, the collection device may be used to collect data of the target object (for example, at least one of two-dimensional graphics, point cloud data, distance to the target object, etc.), and send the data to the corresponding Electronic equipment. The electronic device may be configured to receive data sent from the collection device, and execute the data processing method described in the embodiments of the present application according to the data. In a possible example, the data processing method described in the embodiments of the present application is directly executed by the acquisition device (or the processor in the acquisition device or the point cloud processing module in the processor, etc.).
电子设备中可运行标注平台对应的应用程序,该标注平台可用于展示从采集设备接收到的数据,可提供给标注人员进行标注。本申请实施例中所描述的数据处理方法也可由该标注平台对应的应用程序等执行,在此不做限定。An application program corresponding to the labeling platform can be run in the electronic device, and the labeling platform can be used to display the data received from the collection device, and can be provided to the labeling personnel for labeling. The data processing method described in the embodiments of the present application may also be executed by an application program corresponding to the labeling platform, etc., which is not limited herein.
为了便于理解,下文首先介绍几个本申请涉及到的概念和术语。For ease of understanding, the following first introduces several concepts and terms involved in the present application.
(1)目标物体、目标物体的移动方向和物体类型。(1) The target object, the moving direction of the target object and the object type.
在本申请实施例中,目标物体是采集设备或电子设备需要识别的对象。目标物体包括道路上的对象和道路外的对象。其中,道路上的对象包括道路上的人、车、交通灯、交通指示牌(如限速指示牌等)、交通标志杆和异物等。异物是指本不该出现在道路上的物体,如遗落在道路上的纸箱、轮胎等。道路外的对象包括道路两旁的建筑物、树木及道路之间的隔离带 等。目标物体还可以是飞机、轮船、机器人等设备,在此不做限定。In this embodiment of the present application, the target object is an object that needs to be identified by the acquisition device or the electronic device. The target objects include objects on the road and objects outside the road. Among them, the objects on the road include people, cars, traffic lights, traffic signs (such as speed limit signs, etc.), traffic sign poles and foreign objects on the road. Foreign objects refer to objects that should not appear on the road, such as cartons, tires, etc. left on the road. Objects outside the road include buildings on both sides of the road, trees, and isolation belts between roads. The target object may also be a device such as an airplane, a ship, a robot, etc., which is not limited here.
移动方向是指目标物体的移动方向。当目标物体为车辆时,由于车辆的移动方向通常为前进方向,因此,也可将移动方向称为车头方向。The moving direction refers to the moving direction of the target object. When the target object is a vehicle, since the moving direction of the vehicle is usually the forward direction, the moving direction can also be referred to as the head direction.
物体类型是指目标物体的分类,例如,飞机类型、轮船类型、机器人类型、车辆类型等。物体类型还可按照飞机类型、轮船类型、机器人类型、车辆类型等进一步分类,或具体为无人机类型、无人船类型、无人车类型等。以车辆类型进行举例说明,如图3所示,当目标物体为车辆时,标注平台中的车辆类型可包括公交车、摩托车、自行车、工程车、三轮车、罐式货车或皮卡等类型供标注人员选择。以上的车辆类型还可以基于该车辆类型的图像特征供给电子设备进行识别。可以理解,不同的车辆类型之间的车辆大小不同,每一种车辆对应一个尺寸。不同的物体类型之间的物体大小也不同,因此,基于物体类型可确定目标物体在点云数据中的大致尺寸。且不同物体类型的目标物体的移动方向与该物体对应的长方体的长和宽的方向不同,例如,当目标物体为车辆时,目标物体的移动方向通常与车辆对应的长方体的长对应的方向一致。当目标物体为人型的机器人时,目标物体的移动方向通常为直立行走的方向,即与人型的机器人对应的长方体的宽对应的方向一致。Object type refers to the classification of the target object, for example, aircraft type, ship type, robot type, vehicle type, etc. Object types can also be further classified according to aircraft types, ship types, robot types, vehicle types, etc., or specifically, drone types, unmanned ship types, unmanned vehicle types, and the like. Taking the vehicle type as an example, as shown in Figure 3, when the target object is a vehicle, the vehicle type in the labeling platform can include types such as buses, motorcycles, bicycles, construction vehicles, tricycles, tank trucks or pickups for labeling Personnel selection. The above vehicle types can also be supplied to the electronic device for identification based on the image features of the vehicle type. It can be understood that the vehicle sizes are different between different vehicle types, and each type of vehicle corresponds to one size. Object sizes vary between different object types, so the approximate size of the target object in the point cloud data can be determined based on the object type. And the moving direction of the target object of different object types is different from the direction of the length and width of the cuboid corresponding to the object. For example, when the target object is a vehicle, the moving direction of the target object is usually consistent with the direction corresponding to the length of the cuboid corresponding to the vehicle. . When the target object is a humanoid robot, the moving direction of the target object is usually the direction of upright walking, that is, the direction corresponding to the width of the rectangular parallelepiped corresponding to the humanoid robot.
(2)点云数据和点云图像。(2) Point cloud data and point cloud images.
点云数据,也称为激光点云(point cloud,PCD)、三维点云或点云,是利用激光在同一空间参考系下获取物体表面每个采样点的三维空间坐标(通常以x,y,z三维坐标的形式表示),所得到的一系列表达目标空间分布和目标表面特性的海量点的集合。相比于图像,点云虽然缺乏详细的纹理信息,但是包含了丰富的三维空间信息。除了三维空间信息,点云数据还可包括颜色信息,灰度值,深度,分割结果等,在此不做限定。本申请实施例中将点云数据投影到二维平面得到的图像称为点云图像。Point cloud data, also known as laser point cloud (PCD), three-dimensional point cloud or point cloud, is the use of laser to obtain the three-dimensional spatial coordinates of each sampling point on the surface of the object (usually x, y) in the same spatial reference system. , z in the form of three-dimensional coordinates), a series of mass points that express the spatial distribution of the target and the characteristics of the target surface are obtained. Compared with images, although point clouds lack detailed texture information, they contain rich three-dimensional spatial information. In addition to three-dimensional space information, point cloud data may also include color information, gray value, depth, segmentation results, etc., which are not limited here. In the embodiment of the present application, the image obtained by projecting the point cloud data to a two-dimensional plane is called a point cloud image.
(3)目标物体的图像和非完整点云数据。(3) The image and incomplete point cloud data of the target object.
采集设备对于距离远或者有遮挡的物体,只能采集到部分的点云。本申请实施例中将所有采集到的部分的点云数据称为非完整点云数据,若目标物体的点云数据不足,则所有采集到的目标物体的点云数据称为目标物体的非完整点云数据。请参照图4,图4以目标物体为车辆进行举例说明。图4中的(a)为采集设备20采集的二维图形,图4中的(b)为采集设备20采集的所有点云数据对应的点云图像。从图4中的(a)可以看出,采集设备20的前方包括4个目标物体(即4辆车)21,且采集设备20与前方的目标物体21之间的距离较远,且邻近路边的目标物体21可能会被路边的树叶遮挡,目标物体21的点云数据存在不完整的可能性。从图4中的(b)可以看出,目标物体21的标注框中存在稀疏的点云数据,从而可确定该目标物体21的点云数据不足,采集设备20所有采集到的目标物体21的点云数据称为目标物体21的非完整点云数据。The acquisition device can only collect part of the point cloud for objects that are far away or occluded. In the embodiment of this application, all the collected point cloud data are called incomplete point cloud data. If the point cloud data of the target object is insufficient, all the collected point cloud data of the target object are called incomplete point cloud data of the target object. point cloud data. Please refer to FIG. 4 . FIG. 4 takes the target object as a vehicle for illustration. (a) in FIG. 4 is a two-dimensional graph collected by the collecting device 20 , and (b) in FIG. 4 is a point cloud image corresponding to all the point cloud data collected by the collecting device 20 . As can be seen from (a) in FIG. 4 , the front of the acquisition device 20 includes 4 target objects (ie, 4 vehicles) 21 , and the distance between the acquisition device 20 and the front target object 21 is relatively far, and the road is adjacent to the road The target object 21 on the side may be occluded by the leaves on the roadside, and the point cloud data of the target object 21 may be incomplete. As can be seen from (b) in FIG. 4 , there are sparse point cloud data in the labeling frame of the target object 21 , so that it can be determined that the point cloud data of the target object 21 is insufficient, and all the target objects 21 collected by the collection device 20 have sparse point cloud data. The point cloud data is called incomplete point cloud data of the target object 21 .
目标物体的图像包括采集设备针对目标物体采集的二维图像,还可包括目标物体的非完整点云数据对应的三维图像等,在此不做限定。The image of the target object includes a two-dimensional image collected by the acquisition device for the target object, and may also include a three-dimensional image corresponding to incomplete point cloud data of the target object, etc., which is not limited herein.
(4)包围盒(bounding box,BB)、点云包围盒和物体包围盒。(4) Bounding box (BB), point cloud bounding box and object bounding box.
包围盒是一种求解离散点集最优包围空间的算法,基本思想是用体积稍大且特性简单的几何体(称为包围盒)来近似地代替复杂的几何对象。最常见的包围盒有球体,轴对齐包围盒(axis-aligned bounding box,AABB),包围球(sphere),有向包围盒(oriented bounding box,OBB)以及固定方向凸包(fixed directions hulls或k-DOP,FDH)。其中,轴对齐包围盒和有向包围盒为长方体对应的包围盒,一个给定对象的轴对齐包围盒被定义为包含该对象且各边平行于坐标轴的最小的六面体。一个给定对象的有向包围盒被定义为包含该对象且相对于坐 标轴方向任意的最小的长方体,有向包围盒的最大特点是它的方向的任意性,这使得它可以根据被包围对象的形状特点尽可能紧密的包围对象。示例性的,当有向包围盒的z轴对应的z方向与水平面垂直时,x轴对应的x方向/y轴对应的y方向可与x轴/y有一定夹角使得x方向/y方向组成面的面积最小。本申请实施例中描述的z轴可以是大地坐标系中的z轴方向等,在此不做限定。Bounding box is an algorithm for solving the optimal bounding space of discrete point sets. The most common bounding boxes are sphere, axis-aligned bounding box (AABB), bounding sphere (sphere), oriented bounding box (OBB), and fixed directions hulls or k -DOP, FDH). Among them, the axis-aligned bounding box and the oriented bounding box are bounding boxes corresponding to cuboids, and the axis-aligned bounding box of a given object is defined as the smallest hexahedron containing the object and each side parallel to the coordinate axis. The directed bounding box of a given object is defined as the smallest cuboid that contains the object and has an arbitrary direction relative to the coordinate axis. The shape feature encloses the object as tightly as possible. Exemplarily, when the z-direction corresponding to the z-axis of the directional bounding box is perpendicular to the horizontal plane, the y-direction corresponding to the x-axis/y-axis may have a certain angle with the x-axis/y such that the x-direction/y-direction The area of the constituent faces is the smallest. The z-axis described in the embodiments of the present application may be the z-axis direction in the geodetic coordinate system, etc., which is not limited herein.
本申请实施例中的点云包围盒是一种包含给定对象的所有的点云(即目标物体的非完整点云数据)的长方体,且该长方体(即点云包围盒)的z方向与z轴以及点云包围盒的高对应的方向平行,z轴与水平面垂直。点云包围盒对应的坐标轴的轴心可以位于点云包围盒的中心,该坐标轴的x轴、y轴和z轴方向可分别与该点云包围盒的长、宽和高的方向平行。对于长方体而言,高垂直于水平面,长大于宽的长度。因此,该点云包围盒的z方向与点云包围盒的高对应的方向平行,且与该点云包围盒的x方向与点云包围盒的长对应的方向平行,y方向与点云包围盒的宽对应的方向平行。而边的长度与实际采集的点云数据的长度相关。也就是说,除了高对应的边以外,采集的点云数量较长的边可作为点云包围盒的长,与该长和高相交于一个顶点的另一边或与该边平行的边可作为点云包围盒的宽。The point cloud bounding box in this embodiment of the present application is a cuboid that includes all point clouds of a given object (that is, incomplete point cloud data of the target object), and the z direction of the cuboid (that is, the point cloud bounding box) is the same as the The z-axis and the direction corresponding to the height of the point cloud bounding box are parallel, and the z-axis is perpendicular to the horizontal plane. The axis of the coordinate axis corresponding to the point cloud bounding box can be located in the center of the point cloud bounding box, and the x-axis, y-axis and z-axis directions of the coordinate axis can be parallel to the length, width and height of the point cloud bounding box respectively. . For a cuboid, the height is perpendicular to the horizontal plane, and the length is greater than the width. Therefore, the z direction of the point cloud bounding box is parallel to the direction corresponding to the height of the point cloud bounding box, and the x direction of the point cloud bounding box is parallel to the direction corresponding to the length of the point cloud bounding box, and the y direction is parallel to the point cloud bounding box. The directions corresponding to the width of the box are parallel. The length of the edge is related to the length of the point cloud data actually collected. That is to say, in addition to the edge corresponding to the height, the edge with a longer number of collected point clouds can be used as the length of the bounding box of the point cloud, and the edge that intersects with the length and height at the other edge of a vertex or is parallel to the edge can be used as the length of the bounding box of the point cloud. The width of the point cloud bounding box.
可以理解,目标物体对应的物体包围盒的z方向是与z轴平行的。当限定点云包围盒的z方向与z轴平行时,不论采集角度如何,均可保证点云包围盒和物体包围盒对应的高的方向与水平面垂直。也就是说,点云包围盒的z方向与目标物体的z方向平行,可提高标注点云包围盒的准确率。再根据x轴、y轴和z轴之间两两相互垂直的方向关系,标注出包含目标物体的非完整点云数据的点云包围盒。可选的,点云包围盒的类型为有向包围盒,且点云包围盒的z方向与z轴平行。由于有向包围盒为被定义为包含该对象且相对于坐标轴方向任意的最小的长方体,可保证点云包围盒的紧致性,从而进一步提高了标注点云包围盒的准确率。It can be understood that the z direction of the object bounding box corresponding to the target object is parallel to the z axis. When the z-direction that defines the bounding box of the point cloud is parallel to the z-axis, regardless of the acquisition angle, it can be ensured that the height direction corresponding to the bounding box of the point cloud and the bounding box of the object is perpendicular to the horizontal plane. That is to say, the z direction of the point cloud bounding box is parallel to the z direction of the target object, which can improve the accuracy of labeling the point cloud bounding box. Then, according to the mutually perpendicular direction relationship between the x-axis, the y-axis and the z-axis, the point cloud bounding box containing the incomplete point cloud data of the target object is marked. Optionally, the type of the point cloud bounding box is a directed bounding box, and the z direction of the point cloud bounding box is parallel to the z axis. Since the directed bounding box is the smallest cuboid defined to contain the object and any direction relative to the coordinate axis, the compactness of the bounding box of the point cloud can be guaranteed, thereby further improving the accuracy of labeling the bounding box of the point cloud.
本申请实施例中的物体包围盒是一种包含给定对象(即目标物体)的长方体,且该长方体的z方向与z轴平行。可参照点云包围盒的描述,物体包围盒的高对应的方向与z轴平行,物体包围盒的x方向平行于物体包围盒的长对应的方向,物体包围盒的y方向平行于物体包围盒的宽对应的方向。通过物体包围盒的移动方向和物体类型可确定物体包围盒的长或宽,例如,当目标物体的物体类型为车辆时,目标物体的移动方向通常与物体包围盒的长对应的方向一致,从而可确定物体包围盒中与目标物体的移动方向平行的边作为该物体包围盒的长,再将该物体包围盒中与长和高相交于一个顶点的另一边或与该边平行的边作为该物体包围盒的宽。当目标物体为物体类型为人型的机器人时,目标物体的移动方向通常为直立行走的方向,即与物体包围盒的宽对应的方向一致,从而可确定物体包围盒中与目标物体的移动方向平行的边作为该物体包围盒的长,再将该物体包围盒中与长和高相交于一个顶点的另一边或与该边平行的边作为该物体包围盒的宽。The object bounding box in this embodiment of the present application is a cuboid containing a given object (ie, a target object), and the z-direction of the cuboid is parallel to the z-axis. You can refer to the description of the point cloud bounding box. The direction corresponding to the height of the object bounding box is parallel to the z-axis, the x direction of the object bounding box is parallel to the direction corresponding to the length of the object bounding box, and the y direction of the object bounding box is parallel to the object bounding box. The width corresponds to the direction. The length or width of the object bounding box can be determined by the moving direction of the object bounding box and the object type. For example, when the object type of the target object is a vehicle, the moving direction of the target object is usually consistent with the direction corresponding to the length of the object bounding box, so that The edge in the bounding box of the object that is parallel to the moving direction of the target object can be determined as the length of the bounding box of the object, and the other side of the bounding box of the object that intersects with the length and height at a vertex or the edge parallel to the edge is used as the bounding box of the object. The width of the bounding box of the object. When the target object is a humanoid robot, the moving direction of the target object is usually the direction of upright walking, that is, the direction corresponding to the width of the bounding box of the object, so it can be determined that the bounding box of the object is parallel to the moving direction of the target object The side of the bounding box of the object is used as the length of the bounding box of the object, and the other side of the bounding box of the object that intersects with the length and height at a vertex or the side parallel to the side is used as the width of the bounding box of the object.
目前常用的标注非完整点云数据的方法如图5中的(a)所示,在数据标注时,先对非完整点云数据对应的包围盒30和箭头A1表示的移动方向进行标注;再对包围盒30进行扩框处理,得到假想包围盒31和箭头A2表示的移动方向,使得假想包围盒31符合正常车辆的尺寸(例如,图5中的(b)标注的车辆类型为工程车,且该工程车对应的假想包围盒31的长、宽、高(单位为米)分别为1.77、2.78、2.00)。The currently commonly used method for labeling incomplete point cloud data is shown in (a) in Figure 5. When labeling the data, the bounding box 30 corresponding to the incomplete point cloud data and the moving direction indicated by the arrow A1 are first marked; Perform frame expansion processing on the bounding box 30 to obtain the imaginary bounding box 31 and the moving direction indicated by the arrow A2, so that the imaginary bounding box 31 conforms to the size of a normal vehicle (for example, the type of vehicle marked in (b) in FIG. 5 is an engineering vehicle, And the length, width and height (unit is meter) of the virtual bounding box 31 corresponding to the construction vehicle are 1.77, 2.78 and 2.00 respectively).
在此方法中,扩框的规范难以确定,通常是由标注人员根据主观经验对包围盒进行扩框处理,缺乏客观性。且由于扩框操作会导致标注效率降低。此外,扩框得到的假想包围盒难以被其他团队或人员利用。In this method, the specification of the expansion box is difficult to determine, and the bounding box is usually expanded by the annotator based on subjective experience, which lacks objectivity. And the labeling efficiency is reduced due to the frame expansion operation. In addition, the imaginary bounding box obtained by frame expansion is difficult to be used by other teams or personnel.
基于此,本申请实例提供的一种数据处理方法,可应用于数据处理装置,该数据处理装 置可以是上述的电子设备或采集设备。本申请实施例以电子设备为例,对数据处理方法进行描述,请参照图6,图6为本申请实施例应用的数据处理方法的流程示意图。该方法可以包括以下步骤S601~S603,其中:Based on this, a data processing method provided by an example of this application can be applied to a data processing apparatus, and the data processing apparatus may be the above-mentioned electronic equipment or collection equipment. The embodiment of the present application takes an electronic device as an example to describe the data processing method. Please refer to FIG. 6 , which is a schematic flowchart of the data processing method applied by the embodiment of the present application. The method may include the following steps S601-S603, wherein:
S601:根据目标物体的图像对目标物体的非完整点云数据进行标注,得到标注信息,其中,标注信息包括目标物体的移动方向以及包含非完整点云数据的点云包围盒,点云包围盒的z方向与z轴平行,且与点云包围盒的高对应的方向平行,z轴与水平面垂直。S601: Label the incomplete point cloud data of the target object according to the image of the target object, and obtain label information, wherein the label information includes the moving direction of the target object and the point cloud bounding box containing the incomplete point cloud data, and the point cloud bounding box The z direction of is parallel to the z axis and the direction corresponding to the height of the point cloud bounding box, and the z axis is perpendicular to the horizontal plane.
其中,目标物体、目标物体的图像和非完整点云数据、点云包围盒和目标物体的移动方向、点云包围盒的z方向和高对应的方向,以及z轴可参照前述的定义,在此不再赘述。在本申请实施例中,标注信息包括包含非完整点云数据的点云包围盒,以及目标物体的移动方向,如图7中的(b)所示可包括用长方体表示的点云包围盒30,用箭头A1表示的移动方向。Among them, the target object, the image of the target object and incomplete point cloud data, the moving direction of the point cloud bounding box and the target object, the z direction of the point cloud bounding box and the direction corresponding to the height, and the z axis can refer to the aforementioned definition, in This will not be repeated here. In this embodiment of the present application, the labeling information includes a point cloud bounding box containing incomplete point cloud data, and the moving direction of the target object. As shown in (b) in FIG. 7 , it may include a point cloud bounding box 30 represented by a cuboid. , the moving direction indicated by the arrow A1.
标注信息还可包括物体类型、遮挡情况、上下方向、缩放比例等,在此不做限定。其中,物体类型可参照前述,在此不再赘述。当目标物体为车辆时,标注信息还可包括车辆类型。遮挡情况用于描述目标物体有无遮挡,以及被遮挡的部位等信息,可确定非完整点云数据的缺失情况,当缺失了大部分的数据时,对该非完整点云数据进行尺寸处理得到的目标包围盒的准确率不足。当目标物体的物体类型为车辆、轮船、飞机或其他的载人设备时,上下方向与物体包围盒的宽的方向对应,基于上下方向可直接确定物体包围盒的宽的方向,当点云包围盒中标注了上下方向,即可确定点云包围盒和物体包围盒对应的宽的方向,便于提高确定点云包围盒的扩展方向的速率。缩放比例用于描述采集设备采集的点云数据与现实中对象之间的缩放大小,缩放比例可以理解为目标物体对应的非完整点云数据和实际的目标物体之间的缩放大小。The annotation information may also include object type, occlusion situation, up-down direction, zoom ratio, etc., which are not limited herein. The object types can be referred to above, and are not repeated here. When the target object is a vehicle, the annotation information may further include the vehicle type. The occlusion situation is used to describe whether the target object is occluded, as well as the occluded parts and other information. It can determine the lack of incomplete point cloud data. When most of the data is missing, the incomplete point cloud data is obtained by size processing. The accuracy of the target bounding box is insufficient. When the object type of the target object is a vehicle, ship, aircraft or other manned equipment, the up and down direction corresponds to the width of the object bounding box, and the width of the object bounding box can be directly determined based on the up and down direction. If the up and down directions are marked in the box, the width direction corresponding to the bounding box of the point cloud and the bounding box of the object can be determined, which is convenient to improve the speed of determining the expansion direction of the bounding box of the point cloud. The zoom ratio is used to describe the zoom size between the point cloud data collected by the acquisition device and the real object. The zoom ratio can be understood as the zoom size between the incomplete point cloud data corresponding to the target object and the actual target object.
本申请实施例对于标注点云包围盒的方法不做限定,在一种可能的示例中,确定采集设备采集的点云数据中目标物体对应的目标区域;根据预设算法对目标区域中非完整点云数据对应的点云包围盒进行标注。This embodiment of the present application does not limit the method for marking the bounding box of the point cloud. In a possible example, the target area corresponding to the target object in the point cloud data collected by the collection device is determined; The point cloud bounding box corresponding to the point cloud data is marked.
其中,目标区域为采集设备采集的点云数据中目标物体对应的位置,本申请可由标注人员针对点云数据和二维图形进行对比,得到的标注信息确定目标物体对应的目标区域,还可由电子设备根据二维图形和点云数据之间的映射关系确定目标物体在点云数据中的位置作为目标区域,从而将目标区域对应的三维点云作为目标物体对应的非完整点云数据。本申请对于确定目标区域的方法不做限定。Among them, the target area is the position corresponding to the target object in the point cloud data collected by the collection device. In this application, the labeler can compare the point cloud data with the two-dimensional graphics, and the obtained label information can determine the target area corresponding to the target object. The device determines the position of the target object in the point cloud data as the target area according to the mapping relationship between the two-dimensional graphics and the point cloud data, so that the three-dimensional point cloud corresponding to the target area is used as the incomplete point cloud data corresponding to the target object. The present application does not limit the method for determining the target area.
本申请对于预设算法不做限定,可以根据点云包围盒中要求的包含所有的目标物体的点云(即非完整点云数据)的长方体,且该长方体的z方向与z轴平行,从而对目标区域中的点云数据对应的长方体进行缩框操作得到。This application does not limit the preset algorithm. It can be based on the cuboid containing all the point clouds of the target objects (that is, incomplete point cloud data) required in the point cloud bounding box, and the z direction of the cuboid is parallel to the z axis, so that It is obtained by shrinking the box corresponding to the point cloud data in the target area.
采用该示例,如图7中的(a)所示,先确定采集设备采集的点云数据中目标物体对应的目标区域32,再根据预设算法确定目标区域32中非完整点云数据对应的长方体,再对该长方体进行标注,即如图7中的(b)所示的非完整点云数据对应的点云包围盒30。如此,可提高标注点云包围盒的准确率,便于后续基于实际需求对该点云包围盒进行扩展。Using this example, as shown in (a) in FIG. 7 , first determine the target area 32 corresponding to the target object in the point cloud data collected by the collection device, and then determine the target area 32 corresponding to the incomplete point cloud data according to a preset algorithm cuboid, and then label the cuboid, that is, the point cloud bounding box 30 corresponding to the incomplete point cloud data as shown in (b) in FIG. 7 . In this way, the accuracy of labeling the bounding box of the point cloud can be improved, which facilitates subsequent expansion of the bounding box of the point cloud based on actual needs.
进一步的,可通过标注人员对上述预设算法得到的长方体进行微调,得到点云包围盒。可以理解,通过标注人员微调得到的点云包围盒,可进一步提高点云包围盒的准确率。Further, the point cloud bounding box can be obtained by fine-tuning the cuboid obtained by the above preset algorithm by an annotator. It can be understood that the accuracy of the point cloud bounding box can be further improved by fine-tuning the point cloud bounding box obtained by the annotator.
S602:根据移动方向和/或z方向确定点云包围盒的扩展约束信息,其中,扩展约束信息包括相交于关键顶点的关键长边、关键宽边和关键高边中的至少一项。S602: Determine extension constraint information of the bounding box of the point cloud according to the moving direction and/or the z direction, wherein the extension constraint information includes at least one of a key long side, a key wide side and a key high side intersecting with key vertices.
在本申请实施例中,扩展约束信息用于限定点云包围盒的扩展方向,可包括关键顶点,以及相交于关键顶点的关键长边、关键宽边和关键高边中的至少一项,还可包括关键顶点分 别和关键长边、关键宽边和关键高边对应的扩展方向等,在此不做限定。In this embodiment of the present application, the expansion constraint information is used to define the expansion direction of the bounding box of the point cloud, which may include key vertices, and at least one of the key long sides, key broad sides, and key high sides intersecting with the key vertices, and also It may include the expansion directions corresponding to the key vertices and the key long sides, the key wide sides, and the key high sides, etc., which are not limited here.
其中,关键顶点用于描述点云包围盒中与物体包围盒中最接近的顶点,也就是说,关键顶点为点云包围盒中最可能和物体包围盒中的顶点重合的顶点。例如,图8中的(b)所示的点云包围盒30中的顶点d1为关键顶点。Among them, the key vertex is used to describe the vertices in the point cloud bounding box that are closest to the object bounding box, that is, the key vertex is the vertex in the point cloud bounding box that most likely coincides with the vertices in the object bounding box. For example, the vertex d1 in the point cloud bounding box 30 shown in (b) of FIG. 8 is a key vertex.
关键长边、关键宽边和关键高边为相交于关键顶点的三条组合边,且分别与物体包围盒的长、宽和高对应的边。也就是说,关键长边、关键宽边和关键高边最可能和物体包围盒中相交于一个顶点的三条边重合。例如,图8中箭头A1表示目标物体的移动方向,线段L1、线段L2和线段L3相交于关键顶点d1,在点云包围盒30中线段L1、线段L2和线段L3对应的边可分别称为关键长边、关键宽边和关键高边。The key long edge, the key wide edge and the key high edge are the three combined edges that intersect at the key vertices and correspond to the length, width and height of the bounding box of the object respectively. That is, the critical long side, critical wide side, and critical high side are most likely to coincide with the three edges in the object's bounding box that intersect at a vertex. For example, the arrow A1 in FIG. 8 indicates the moving direction of the target object, the line segment L1, the line segment L2 and the line segment L3 intersect at the key vertex d1, and the edges corresponding to the line segment L1, the line segment L2 and the line segment L3 in the point cloud bounding box 30 can be called as Critical Long Side, Critical Wide Side, and Critical High Side.
本申请对于确定扩展约束信息的方法不做限定,可包括以下两种实施方式,其中:The present application does not limit the method for determining the extension constraint information, which may include the following two implementation manners, wherein:
第一种,确定点云包围盒中每个顶点的顶点置信度;确定顶点置信度的最大值对应的顶点为关键顶点;确定点云包围盒中与关键顶点相交的三条组合边为三条参考边;根据移动方向和/或z方向确定三条参考边中的关键长边、关键宽边和关键高边中的至少一项。First, determine the vertex confidence of each vertex in the point cloud bounding box; determine the vertex corresponding to the maximum vertex confidence as the key vertex; determine the three combined edges in the point cloud bounding box that intersect with the key vertex as the three reference edges ; Determine at least one of the key long side, the key wide side and the key high side of the three reference sides according to the movement direction and/or the z direction.
其中,顶点置信度用于描述点云包围盒的顶点为目标物体的物体包围盒的顶点的概率。本申请对于确定顶点置信度的方法不做限定,在第一种可能的示例中,根据点云包围盒中每个顶点对应的点云数量确定该顶点的顶点置信度,当点云数量越大时,顶点置信度越大。Among them, the vertex confidence is used to describe the probability that the vertex of the point cloud bounding box is the vertex of the object bounding box of the target object. This application does not limit the method of determining the vertex confidence. In a first possible example, the vertex confidence of each vertex is determined according to the number of point clouds corresponding to each vertex in the bounding box of the point cloud. When , the vertex confidence is greater.
其中,顶点对应的点云数量可以理解为顶点对应的预设范围内的点云的数量,该预设范围可以是点云包围盒中与顶点相连接的三条组合边中与顶点之间的距离相差同一个阈值的点连接而成的1/4球体,三菱锥或者正方体等,在此不做限定。可选的,本申请实施例可选取点云包围盒中一个平面对应的顶点,如图8中的(a)所示,选取点云包围盒中的顶点d1、顶点d2、顶点d3和顶点d4,顶点d1、顶点d2、顶点d3和顶点d4中各个顶点对应的预设范围内的点云数量之间的大小关系为顶点d1>顶点d2>顶点d3>顶点d4。根据点云数量越大顶点置信度越大可知,顶点d1、顶点d2、顶点d3和顶点d4中各个顶点的顶点置信度之间的大小关系为顶点d1>顶点d2>顶点d3>顶点d4,因此,可确定关键顶点为顶点d1。The number of point clouds corresponding to the vertices can be understood as the number of point clouds within a preset range corresponding to the vertices, and the preset range can be the distance between the three combined edges connected to the vertices in the bounding box of the point cloud and the vertices The 1/4 sphere, Mitsubishi cone or cube formed by connecting points that differ by the same threshold is not limited here. Optionally, in this embodiment of the present application, a vertex corresponding to a plane in the point cloud bounding box may be selected. As shown in (a) in FIG. 8 , vertex d1, vertex d2, vertex d3 and vertex d4 in the point cloud bounding box are selected , the size relationship between the number of point clouds in the preset range corresponding to each of the vertex d1, vertex d2, vertex d3 and vertex d4 is vertex d1 > vertex d2 > vertex d3 > vertex d4. According to the greater the number of point clouds, the greater the vertex confidence, the relationship between the vertex confidences of each vertex in vertex d1, vertex d2, vertex d3 and vertex d4 is vertex d1> vertex d2> vertex d3> vertex d4, so , the key vertex can be determined as vertex d1.
可以理解,点云可体现目标物体被采集到的信息,当点云数量越大时,表示该点云数据对应的区域被采集到的概率越大。在该示例中,根据顶点对应的点云数量确定该顶点为物体包围盒的顶点的概率(即顶点置信度),可提高确定顶点置信度的准确率。It can be understood that the point cloud can reflect the collected information of the target object. The larger the number of point clouds, the greater the probability that the area corresponding to the point cloud data is collected. In this example, determining the probability that the vertex is a vertex of the object bounding box (ie, the vertex confidence) according to the number of point clouds corresponding to the vertex can improve the accuracy of determining the vertex confidence.
在第二种可能的示例中,根据点云包围盒的顶点与采集设备之间的距离确定顶点的顶点置信度,当距离越小时,顶点置信度越大。In a second possible example, the vertex confidence of the vertex is determined according to the distance between the vertex of the bounding box of the point cloud and the acquisition device. When the distance is smaller, the vertex confidence is higher.
其中,采集设备采集了非完整点云数据,也就是说,采集设备为采集非完整点云数据的设备。该采集设备还可以是采集目标物体的二维图像的设备,在此不做限定。点云包围盒的顶点与采集设备之间的距离,可以通过点云包围盒的顶点对应的三维坐标,以及采集设备(可以是采集设备中的激光雷达传感器)对应的三维坐标进行计算得到,也可以通过与点云包围盒的顶点对应的物体包围盒的顶点对应的三维坐标,以及采集设备(可以是采集设备中的激光雷达传感器)对应的三维坐标进行计算得到等,在此不做限定。Among them, the acquisition device collects incomplete point cloud data, that is, the acquisition device is a device that collects incomplete point cloud data. The acquisition device may also be a device for acquiring a two-dimensional image of the target object, which is not limited herein. The distance between the vertices of the point cloud bounding box and the acquisition device can be calculated by the three-dimensional coordinates corresponding to the vertices of the point cloud bounding box and the three-dimensional coordinates corresponding to the acquisition device (which can be a lidar sensor in the acquisition device). The 3D coordinates corresponding to the vertices of the object bounding box corresponding to the vertices of the point cloud bounding box and the 3D coordinates corresponding to the acquisition device (which may be a lidar sensor in the acquisition device) can be calculated and obtained, which are not limited here.
示例性的,如图8中的(a)所示,选取点云包围盒中的顶点d1、顶点d2、顶点d3和顶点d4,顶点d1、顶点d2、顶点d3和顶点d4中各个顶点和采集设备之间的距离的大小关系为顶点d1<顶点d2<顶点d3<顶点d4。根据距离越近顶点置信度越大可知,顶点d1、顶点d2、顶点d3和顶点d4中各个顶点的顶点置信度之间的大小关系为顶点d1>顶点d2>顶点d3>顶点d4,因此,可确定关键顶点为顶点d1。Exemplarily, as shown in (a) in FIG. 8 , select the vertex d1, vertex d2, vertex d3 and vertex d4 in the point cloud bounding box, and each vertex in vertex d1, vertex d2, vertex d3 and vertex d4 and the acquisition The size relationship of the distance between the devices is vertex d1 < vertex d2 < vertex d3 < vertex d4. According to the closer the distance is, the higher the vertex confidence is, it can be seen that the size relationship between the vertex confidences of each vertex in vertex d1, vertex d2, vertex d3 and vertex d4 is vertex d1> vertex d2> vertex d3> vertex d4, therefore, it can be Determine the key vertex as vertex d1.
可以理解,当采集设备的采集距离越近时,采集的点云数据的准确率越高。因此,在该 示例中,根据点云包围盒的顶点与采集设备之间的距离确定该顶点为物体包围盒的顶点的概率(即顶点置信度),可提高确定顶点置信度的准确率。It can be understood that when the collection distance of the collection device is closer, the accuracy of the collected point cloud data is higher. Therefore, in this example, according to the distance between the vertex of the point cloud bounding box and the acquisition device, determine the probability that the vertex is the vertex of the object bounding box (that is, the vertex confidence), which can improve the accuracy of determining the vertex confidence.
在第三种可能的示例中,根据目标物体的二维图像确定点云包围盒中每个顶点的遮挡概率;根据遮挡概率确定顶点置信度,当遮挡概率越大时,顶点置信度越小。In a third possible example, the occlusion probability of each vertex in the bounding box of the point cloud is determined according to the two-dimensional image of the target object; the vertex confidence is determined according to the occlusion probability. When the occlusion probability is larger, the vertex confidence is smaller.
其中,顶点的遮挡概率用于描述顶点被遮挡的概率,也就是说,该顶点能用于还原轮廓的概率。举例来说,如图8中的(a)所示,假设根据二维图像中可确定顶点d1、顶点d2、顶点d3和顶点d4的遮挡概率之间的大小关系为顶点d1<顶点d2<顶点d3<顶点d4。根据遮挡概率越小顶点置信度越大可知,顶点d1、顶点d2、顶点d3和顶点d4中各个顶点的顶点置信度之间的大小关系为顶点d1>顶点d2>顶点d3>顶点d4,因此,可确定关键顶点为顶点d1。Among them, the occlusion probability of the vertex is used to describe the probability that the vertex is occluded, that is, the probability that the vertex can be used to restore the outline. For example, as shown in (a) of FIG. 8 , it is assumed that the magnitude relationship between the occlusion probabilities of the vertex d1, vertex d2, vertex d3 and vertex d4 can be determined according to the two-dimensional image as vertex d1<vertex d2<vertex d3 < vertex d4. According to the smaller the occlusion probability, the greater the vertex confidence, it can be seen that the size relationship between the vertex confidences of each vertex in vertex d1, vertex d2, vertex d3 and vertex d4 is vertex d1> vertex d2> vertex d3> vertex d4, therefore, The key vertex can be determined to be vertex d1.
可以理解,二维图像可体现目标物体被遮挡的情况,当遮挡概率越大时,表示该顶点能进行还原的概率越小,即整体置信度越小。在该示例中,根据二维图形可确定各个顶点被遮挡的概率(即遮挡概率),再根据遮挡概率确定顶点置信度,可提高确定顶点置信度的准确率。It can be understood that the two-dimensional image can reflect the situation that the target object is occluded. When the probability of occlusion is larger, it means that the probability that the vertex can be restored is smaller, that is, the overall confidence is smaller. In this example, the probability of each vertex being occluded (that is, the occlusion probability) can be determined according to the two-dimensional graph, and then the vertex confidence can be determined according to the occlusion probability, which can improve the accuracy of determining the vertex confidence.
需要说明的是,上述三种可能的示例并不构成对本申请实施例的限定,实际应用中,还可以采用其他实施方式确定顶点置信度或确定关键顶点,例如,根据二维图像和非完整点云数据确定点云包围盒的顶点的遮挡概率,再根据该遮挡概率确定顶点置信度,还可根据点云数量和距离确定顶点置信度(例如,获取点云数量和距离对应的加权平均值,由加权平均值确定顶点置信度等);或者根据点云数据和距离确定关键顶点(例如,当确定最大的点云数量对应2个或2个以上的顶点时,可根据此处的2个或2个以上的顶点与采集设备之间的距离,将距离最小的顶点作为关键顶点;或者当确定最小的距离对应2个或2个以上的顶点时,可根据此处的2个或2个以上的顶点对应的点云数量,将点云数量的最大值对应的顶点作为关键顶点等);或者根据点云数量和遮挡概率确定顶点置信度(例如,获取点云数量和距离对应的加权平均值,由加权平均值确定顶点置信度等);或者根据点云数量和遮挡概率确定关键顶点(例如,当确定最大的点云数量对应2个或2个以上的顶点时,可根据此处的2个或2个以上的顶点的遮挡概率,将遮挡概率最小的顶点作为关键顶点;或者当确定最小的遮挡概率对应2个或2个以上的顶点时,可根据此处的2个或2个以上的顶点对应的点云数量,将点云数量的最大值对应的顶点作为关键顶点等)等。It should be noted that the above three possible examples do not constitute limitations to the embodiments of the present application. In practical applications, other implementations may also be used to determine vertex confidence or determine key vertices, for example, based on two-dimensional images and incomplete points. The cloud data determines the occlusion probability of the vertices of the point cloud bounding box, and then determines the vertex confidence according to the occlusion probability, and also determines the vertex confidence according to the number and distance of the point cloud (for example, obtain the weighted average value corresponding to the number and distance of the point cloud, Determine vertex confidence by weighted average, etc.); or determine key vertices according to point cloud data and distance (for example, when it is determined that the maximum number of point clouds corresponds to 2 or more vertices, it can be determined according to the 2 or The distance between two or more vertices and the acquisition device, the vertex with the smallest distance is used as the key vertex; or when it is determined that the smallest distance corresponds to two or more vertices, the two or more The number of point clouds corresponding to the vertices of the vertices, the vertex corresponding to the maximum number of point clouds is regarded as the key vertex, etc.); or the vertex confidence is determined according to the number of point clouds and the occlusion probability (for example, the weighted average value corresponding to the number of point clouds and the distance is obtained. , the vertex confidence is determined by the weighted average, etc.); or key vertices are determined according to the number of point clouds and occlusion probability (for example, when it is determined that the maximum number of point clouds corresponds to 2 or more vertices, according to the 2 here For the occlusion probability of two or more vertices, the vertex with the smallest occlusion probability is used as the key vertex; or when it is determined that the minimum occlusion probability corresponds to 2 or more vertices, the 2 or more vertices can be The number of point clouds corresponding to the vertices of , and the vertices corresponding to the maximum number of point clouds are regarded as key vertices, etc.) and so on.
在一个长方体中,每个顶点可连接三条边。在本申请实施例中,可将点云包围盒中与关键顶点相交的三条组合边称为三条参考边,也就是说,参考边为点云包围盒中与关键顶点相连接的边。In a cuboid, each vertex can connect three edges. In the embodiment of the present application, the three combined edges in the point cloud bounding box that intersect with the key vertices may be called three reference edges, that is, the reference edges are the edges in the point cloud bounding box that connect with the key vertices.
在本申请实施例中,关键高边是与物体包围盒的高对应的边,物体包围盒的高和点云包围盒的高均与点云包围盒的z方向平行,可确定三条参考边中与z方向对应的边为关键高边。如前所述,可根据目标物体的物体类型可确定点云包围盒(物体包围盒)中与移动方向对应的长边或宽边。因此,当根据物体类型可确定与移动方向对应的长边时,可将与关键顶点相交的三条参考边中选取与移动方向对应的长边作为点云包围盒的关键长边。当根据物体类型可确定与移动方向对应的宽边时,可将与关键顶点相交的三条参考边中选取与移动方向对应的宽边作为点云包围盒的关键宽边。可以理解,关键长边、关键宽边和关键高边中每一条边均为分别三条参考边中除了其他两条参考边之外的参考边,当根据物体类型可确定与移动方向对应的关键长边时,可确定三条参考边中除了与移动方向对应的参考边和与z方向对应的参考边之外的参考边为关键宽边。当根据物体类型可确定与移动方向对应的关键宽边时,可确定三条参考边中除了与移动方向对应的参考边和与z方向对应的参考边之外的参考边为关键长边。In the embodiment of the present application, the key high side is the side corresponding to the height of the bounding box of the object. The height of the bounding box of the object and the height of the bounding box of the point cloud are both parallel to the z direction of the bounding box of the point cloud. It can be determined that among the three reference edges The edge corresponding to the z direction is the critical high edge. As mentioned above, according to the object type of the target object, the long side or the wide side corresponding to the moving direction in the point cloud bounding box (object bounding box) can be determined. Therefore, when the long edge corresponding to the moving direction can be determined according to the object type, the long edge corresponding to the moving direction can be selected as the key long edge of the point cloud bounding box among the three reference edges that intersect with the key vertices. When the broadside corresponding to the moving direction can be determined according to the object type, the broadside corresponding to the moving direction can be selected from the three reference edges intersecting with the key vertices as the key broadside of the point cloud bounding box. It can be understood that each of the key long side, the key wide side and the key high side is a reference edge except the other two reference edges among the three reference edges respectively. When the key length corresponding to the moving direction can be determined according to the object type When the edge is selected, the reference edge other than the reference edge corresponding to the moving direction and the reference edge corresponding to the z direction among the three reference edges can be determined as the key broad edge. When the critical broad side corresponding to the moving direction can be determined according to the object type, the reference side except the reference side corresponding to the moving direction and the reference side corresponding to the z direction among the three reference sides can be determined as the critical long side.
以目标物体为车辆进行举例说明,移动方向与长边的方向对应,也就是说,可确定三条参考边中与移动方向对应的边为关键长边。如图8中的(b)所示,当关键顶点为d1时,可确定点云包围盒30中与箭头A1对应的移动方向对应,且与关键顶点d1相交的边为关键长边,即关键长边为线段L1。根据点云包围盒30的z方向与z轴方向对应,且与关键顶点d1相交的边为关键高边,即确定点云包围盒30中的线段L3为关键高边。最后,由于三条参考边除了与移动方向对应的参考边,和与z方向对应的参考边之外,剩余的参考边为关键宽边。因此,可在点云包围盒中与关键顶点d1相交的边,还剩下线段L2,即为关键宽边。Taking the target object as a vehicle for illustration, the moving direction corresponds to the direction of the long side, that is, the side corresponding to the moving direction among the three reference sides can be determined as the key long side. As shown in (b) of FIG. 8 , when the key vertex is d1, it can be determined that the moving direction corresponding to the arrow A1 in the point cloud bounding box 30 corresponds, and the edge intersecting with the key vertex d1 is the key long edge, that is, the key The long side is the line segment L1. According to the z direction of the point cloud bounding box 30 corresponding to the z axis direction, and the edge intersecting with the key vertex d1 is the key high edge, that is, the line segment L3 in the point cloud bounding box 30 is determined as the key high edge. Finally, since the three reference edges are in addition to the reference edge corresponding to the moving direction and the reference edge corresponding to the z direction, the remaining reference edges are key broad edges. Therefore, the edge that intersects with the key vertex d1 in the point cloud bounding box, and the remaining line segment L2, is the key broad edge.
本申请对于确定关键长边、关键宽边和关键高边中的哪一条边不做限定,可以分别确定关键长边、关键宽边和关键高边,也可以对非完整点云数据进行分析,得到需要进行扩展的边。例如,根据物体类型或具体的物体类型(例如,车辆类型、飞机类型、轮船类型、机器人类型等)确定目标物体在点云图像中的目标尺寸,该目标尺寸包括长、宽和高对应的长度。根据移动方向和z方向可确定关键长边和/或关键宽边,根据z方向可确定关键高边。再根据目标尺寸中三边的长度,和关键长边、关键宽边和关键高边中在点云包围盒中的尺寸,从而可确定关键长边、关键宽边和关键高边中需要扩展的边。This application does not limit which side of the key long side, the key wide side and the key high side is determined. The key long side, the key wide side and the key high side can be determined respectively, and the incomplete point cloud data can also be analyzed. Get the edge that needs to be extended. For example, determine the target size of the target object in the point cloud image according to the object type or specific object type (for example, vehicle type, aircraft type, ship type, robot type, etc.), and the target size includes the length corresponding to the length, width and height. . The critical long side and/or the critical broad side can be determined according to the movement direction and the z-direction, and the critical high side can be determined according to the z-direction. Then according to the length of the three sides in the target size, and the size of the key long side, the key wide side and the key high side in the bounding box of the point cloud, the key long side, the key wide side and the key high side that need to be expanded can be determined. side.
可以理解,在第一种确定扩展约束信息的方法中,先确定点云包围盒中各个顶点的顶点置信度,再将最大的顶点置信度对应的顶点作为关键顶点。也就是说,先将点云包围盒中最可能和物体包围盒中的顶点重合的顶点作为关键顶点,再确定与关键顶点相交的三条组合边为三条参考边,再根据移动方向和/或z方向确定三条参考边中与物体包围盒的长、宽、高对应的边分别作为关键长边、关键宽边和关键高边,可提高确定扩展约束信息的准确率。It can be understood that, in the first method for determining extended constraint information, the vertex confidence of each vertex in the bounding box of the point cloud is determined first, and then the vertex corresponding to the maximum vertex confidence is used as the key vertex. That is to say, first take the vertices in the point cloud bounding box that are most likely to coincide with the vertices in the object bounding box as the key vertices, and then determine the three combined edges that intersect with the key vertices as the three reference edges, and then according to the moving direction and/or z Among the three reference edges for direction determination, the edges corresponding to the length, width and height of the bounding box of the object are used as the key long edge, key wide edge and key high edge respectively, which can improve the accuracy of determining extended constraint information.
第二种,确定点云包围盒中相交于每个顶点的三条组合边的整体置信度;确定整体置信度的最大值对应的三条组合边为三条参考边;确定三条参考边相交的顶点为关键顶点;根据移动方向和/或z方向确定三条参考边中的关键长边、关键宽边和关键高边中的至少一项。The second is to determine the overall confidence of the three combined edges that intersect each vertex in the point cloud bounding box; determine the three combined edges corresponding to the maximum value of the overall confidence as the three reference edges; determine the vertex where the three reference edges intersect as the key Vertices; determine at least one of the critical long side, the critical wide side, and the critical high side of the three reference edges according to the movement direction and/or the z-direction.
其中,整体置信度用于描述三条组合边均为目标物体的物体包围盒的边的概率。本申请对于确定整体置信度的方法不做限定,在第一种可能的示例中,根据点云包围盒中相交于每个顶点的三条组合边对应的点云数量确定三条组合边的整体置信度,当点云数量越大时,整体置信度越大。Among them, the overall confidence is used to describe the probability that the three combined edges are the edges of the object bounding box of the target object. This application does not limit the method for determining the overall confidence. In a first possible example, the overall confidence of the three combined edges is determined according to the number of point clouds corresponding to the three combined edges in the point cloud bounding box that intersect each vertex. , when the number of point clouds is larger, the overall confidence is larger.
其中,点云包围盒中相交于每个顶点的三条组合边对应的点云数量可以理解为每个顶点和三条组合边对应的预设范围内的点云的数量,该预设范围可以是三条组合边中与顶点之间的距离相差同一个阈值的点连接而成的1/4球体,三菱锥或者正方体等,在此不做限定。可选的,本申请实施例可选取点云包围盒中一个平面对应的顶点对应的三条组合边,如图8中的(a)所示,选取点云包围盒中的顶点d1、顶点d2、顶点d3和顶点d4,则顶点d1对应的三条组合边为L1、L2和L3,顶点d2对应的三条组合边为L2、L7和L6,顶点d3对应的三条组合边为L1、L4和L5,顶点d4对应的三条组合边为L5、L6和L8。各个相交于每个顶点的三条组合边对应的点云数量之间的大小关系为顶点d1对应的三条组合边>顶点d2对应的三条组合边>顶点d3对应的三条组合边>顶点d4对应的三条组合边。根据点云数量越大整体置信度越大可知,顶点d1、顶点d2、顶点d3和顶点d4中各个顶点对应的三条组合边的整体置信度之间的大小关系为顶点d1对应的三条组合边>顶点d2对应的三条组合边>顶点d3对应的三条组合边>顶点d4对应的三条组合边,因此,可确定三条参考边为顶点d1对应的三条组合边,即L1、L2和L3。Among them, the number of point clouds corresponding to the three combined edges intersecting each vertex in the point cloud bounding box can be understood as the number of point clouds within the preset range corresponding to each vertex and the three combined edges, and the preset range can be three A 1/4 sphere, a Mitsubishi cone or a cube, etc., formed by connecting points whose distances from the vertices differ by the same threshold in the combined edge are not limited here. Optionally, in this embodiment of the present application, three combined edges corresponding to a vertex corresponding to a plane in the point cloud bounding box may be selected. As shown in (a) in FIG. 8 , the vertex d1, vertex d2, Vertex d3 and vertex d4, the three combined edges corresponding to vertex d1 are L1, L2 and L3, the three combined edges corresponding to vertex d2 are L2, L7 and L6, the three combined edges corresponding to vertex d3 are L1, L4 and L5, and the three combined edges corresponding to vertex d3 are L1, L4 and L5. The three combined edges corresponding to d4 are L5, L6 and L8. The size relationship between the number of point clouds corresponding to the three combined edges that intersect each vertex is: three combined edges corresponding to vertex d1 > three combined edges corresponding to vertex d2 > three combined edges corresponding to vertex d3 > three combined edges corresponding to vertex d4 Combine edges. According to the larger the number of point clouds, the larger the overall confidence, the relationship between the overall confidences of the three combined edges corresponding to each of the vertex d1, vertex d2, vertex d3 and vertex d4 is the three combined edges corresponding to vertex d1> The three combined edges corresponding to the vertex d2 > the three combined edges corresponding to the vertex d3 > the three combined edges corresponding to the vertex d4. Therefore, it can be determined that the three reference edges are the three combined edges corresponding to the vertex d1, namely L1, L2 and L3.
可以理解,点云可体现目标物体被采集到的信息,当点云数量越大时,表示该点云数据对应的区域被采集到的概率越大。在该示例中,根据点云包围盒中相交于每个顶点的三条组 合边对应的点云数量确定该三条组合边均为物体包围盒的边的概率(即整体置信度),可提高确定整体置信度的准确率。It can be understood that the point cloud can reflect the collected information of the target object. The larger the number of point clouds, the greater the probability that the area corresponding to the point cloud data is collected. In this example, according to the number of point clouds corresponding to the three combined edges intersecting each vertex in the point cloud bounding box, determining the probability that the three combined edges are the edges of the object bounding box (ie, the overall confidence) can improve the determination of the overall Confidence accuracy.
在第二种可能的示例中,根据点云包围盒的顶点与采集设备之间的距离确定点云包围盒中相交于该顶点的三条组合边的整体置信度,当距离越小时,整体置信度越大。In a second possible example, the overall confidence level of the three combined edges in the point cloud bounding box intersecting the vertex is determined according to the distance between the vertex of the point cloud bounding box and the acquisition device. When the distance is smaller, the overall confidence level bigger.
其中,采集设备采集了非完整点云数据,也就是说,采集设备为采集非完整点云数据的设备。该采集设备还可以是采集目标物体的二维图像的设备,在此不做限定。点云包围盒的顶点与采集设备之间的距离,可以通过点云包围盒的顶点对应的三维坐标,以及采集设备(可以是采集设备中的激光雷达传感器)对应的三维坐标进行计算得到,也可以通过与点云包围盒的顶点对应的物体包围盒的顶点对应的三维坐标,以及采集设备(可以是采集设备中的激光雷达传感器)对应的三维坐标进行计算得到等,在此不做限定。Among them, the acquisition device collects incomplete point cloud data, that is, the acquisition device is a device that collects incomplete point cloud data. The acquisition device may also be a device for acquiring a two-dimensional image of the target object, which is not limited herein. The distance between the vertices of the point cloud bounding box and the acquisition device can be calculated by the three-dimensional coordinates corresponding to the vertices of the point cloud bounding box and the three-dimensional coordinates corresponding to the acquisition device (which can be a lidar sensor in the acquisition device). The 3D coordinates corresponding to the vertices of the object bounding box corresponding to the vertices of the point cloud bounding box and the 3D coordinates corresponding to the acquisition device (which may be a lidar sensor in the acquisition device) can be calculated and obtained, which are not limited here.
示例性的,如图8中的(a)所示,选取点云包围盒中的顶点d1、顶点d2、顶点d3和顶点d4,顶点d1、顶点d2、顶点d3和顶点d4中各个顶点和采集设备之间的距离的大小关系为顶点d1<顶点d2<顶点d3<顶点d4。根据距离越近整体置信度越大可知,顶点d1、顶点d2、顶点d3和顶点d4中各个顶点对应的三条组合边的整体置信度之间的大小关系为顶点d1对应的三条组合边>顶点d2对应的三条组合边>顶点d3对应的三条组合边>顶点d4对应的三条组合边,因此,可确定三条参考边为顶点d1对应的三条组合边,即L1、L2和L3。Exemplarily, as shown in (a) in FIG. 8 , select the vertex d1, vertex d2, vertex d3 and vertex d4 in the point cloud bounding box, and each vertex in vertex d1, vertex d2, vertex d3 and vertex d4 and the acquisition The size relationship of the distance between the devices is vertex d1 < vertex d2 < vertex d3 < vertex d4. According to the closer the distance, the greater the overall confidence, the relationship between the overall confidences of the three combined edges corresponding to each of the vertex d1, vertex d2, vertex d3 and vertex d4 is that the three combined edges corresponding to vertex d1 > vertex d2 The corresponding three combined edges>the three combined edges corresponding to the vertex d3>the three combined edges corresponding to the vertex d4, therefore, it can be determined that the three reference edges are the three combined edges corresponding to the vertex d1, namely L1, L2 and L3.
可以理解,当采集设备的采集距离越近时,采集的点云数据的准确率越高。因此,在该示例中,根据点云包围盒的顶点与采集设备之间的距离确定该顶点对应的三条组合边均为物体包围盒的边的概率(即整体置信度),可提高确定整体置信度的准确率。It can be understood that when the collection distance of the collection device is closer, the accuracy of the collected point cloud data is higher. Therefore, in this example, according to the distance between the vertex of the point cloud bounding box and the acquisition device, the probability that the three combined edges corresponding to the vertex are all the edges of the object bounding box (that is, the overall confidence) can improve the overall confidence in determining degree of accuracy.
在第三种可能的示例中,根据目标物体的二维图像确定点云包围盒中相交于每个顶点的三条组合边的遮挡概率;根据遮挡概率确定整体置信度,当遮挡概率越大时,整体置信度越小。In a third possible example, the occlusion probability of the three combined edges intersecting each vertex in the point cloud bounding box is determined according to the two-dimensional image of the target object; the overall confidence is determined according to the occlusion probability. The overall confidence is smaller.
其中,三条组合边的遮挡概率用于描述物体包围盒中与该三条组合边对应的区域被遮挡的概率,也就是说,该三条组合边对应的预设区域能用于还原轮廓的概率。举例来说,如图8中的(a)所示,假设二维图像中可确定顶点d1、顶点d2、顶点d3和顶点d4中每一顶点对应的三条组合边的遮挡概率之间的大小关系为顶点d1对应的三条组合边<顶点d2对应的三条组合边<顶点d3对应的三条组合边<顶点d4对应的三条组合边。根据遮挡概率越小整体置信度越大可知,顶点d1、顶点d2、顶点d3和顶点d4中各个顶点对应的三条组合边的整体置信度之间的大小关系为顶点d1对应的三条组合边>顶点d2对应的三条组合边>顶点d3对应的三条组合边>顶点d4对应的三条组合边,因此,可确定三条参考边为顶点d1对应的三条组合边,即L1、L2和L3。The occlusion probability of the three combined edges is used to describe the probability that the area corresponding to the three combined edges in the object bounding box is blocked, that is, the probability that the preset area corresponding to the three combined edges can be used to restore the outline. For example, as shown in (a) of FIG. 8, it is assumed that the magnitude relationship between the occlusion probabilities of the three combined edges corresponding to each of the vertex d1, vertex d2, vertex d3 and vertex d4 can be determined in the two-dimensional image. is three combined edges corresponding to vertex d1 < three combined edges corresponding to vertex d2 < three combined edges corresponding to vertex d3 < three combined edges corresponding to vertex d4. According to the smaller the occlusion probability, the larger the overall confidence, the relationship between the overall confidences of the three combined edges corresponding to each of the vertex d1, vertex d2, vertex d3 and vertex d4 is that the three combined edges corresponding to vertex d1 > vertex The three combined edges corresponding to d2>the three combined edges corresponding to vertex d3>the three combined edges corresponding to vertex d4, therefore, it can be determined that the three reference edges are the three combined edges corresponding to vertex d1, namely L1, L2 and L3.
可以理解,二维图像可体现目标物体被遮挡的情况,当遮挡概率越大时,表示该顶点能进行还原的概率越小,即整体置信度越小。在该示例中,根据二维图形可确定各个相交于每个顶点的三条组合边被遮挡的概率(即遮挡概率),再根据遮挡概率确定整体置信度,可提高确定整体置信度的准确率。It can be understood that the two-dimensional image can reflect the situation that the target object is occluded. When the probability of occlusion is larger, it means that the probability that the vertex can be restored is smaller, that is, the overall confidence is smaller. In this example, the probability that each of the three combined edges intersecting each vertex is occluded (ie, the occlusion probability) can be determined according to the two-dimensional graph, and then the overall confidence level can be determined according to the occlusion probability, which can improve the accuracy of determining the overall confidence level.
需要说明的是,上述三种可能的示例并不构成对本申请实施例的限定,实际应用中,还可以采用其他实施方式确定整体置信度或确定三条参考边,例如,根据二维图像和非完整点云确定点云包围盒的顶点的遮挡概率,再根据该遮挡概率确定整体置信度,还可根据点云数量和距离确定整体置信度(例如,获取点云数量和距离对应的加权平均值,由加权平均值确定整体置信度等);或者根据点云数据和距离确定三条关键边(例如,当确定最大的点云数量对应2个或2个以上的顶点时,可根据此处的2个或2个以上的顶点与采集设备之间的距离, 将与距离最小的顶点相交的三条组合边作为三条关键边;或者当确定最小的距离对应2个或2个以上的顶点时,可根据此处的2个或2个以上的顶点对应的点云数量,将与点云数量的最大值对应的顶点相交的三条组合边作为三条关键边等);或者根据点云数量和遮挡概率确定整体置信度(例如,获取点云数量和距离对应的加权平均值,由加权平均值确定整体置信度等);或者根据点云数量和遮挡概率确定三条关键边(例如,当确定最大的点云数量对应2个或2个以上的顶点时,可根据此处的2个或2个以上的顶点的遮挡概率,将与遮挡概率最小的顶点相交的三条组合边作为三条关键边;或者当确定最小的遮挡概率对应2个或2个以上的顶点时,可根据此处的2个或2个以上的顶点对应的点云数量,将与点云数量的最大值对应的顶点相交的三条组合边作为三条关键边等)等。当可确定点云包围盒中每条边的边置信度时,可根据相交于一个顶点的三条组合边的边置信度确定该三条组合变对应的整体置信度。此处的边置信度用于描述该边置信度对应的边为物体包围盒的边的概率,该整体置信度可按照长、宽和高分别对应的预设权值,对三条组合边的边置信度进行加权得到。此处的预设权值可根据长、宽和高需要进行扩展的重要性进行设置等,在此不做限定。It should be noted that the above three possible examples do not constitute limitations to the embodiments of the present application. In practical applications, other implementations may also be used to determine the overall confidence level or to determine the three reference edges, for example, according to two-dimensional images and incomplete The point cloud determines the occlusion probability of the vertices of the point cloud bounding box, and then determines the overall confidence according to the occlusion probability, and can also determine the overall confidence according to the number and distance of the point cloud (for example, obtain the weighted average corresponding to the number and distance of the point cloud, The overall confidence is determined by the weighted average, etc.); or three key edges are determined according to the point cloud data and distance (for example, when it is determined that the maximum number of point clouds corresponds to 2 or more vertices, it can be determined according to the 2 here or the distance between two or more vertices and the acquisition device, the three combined edges that intersect with the vertex with the smallest distance are used as three key edges; or when the smallest distance is determined to correspond to two or more vertices, it can be based on this The number of point clouds corresponding to 2 or more vertices at the location, and the three combined edges intersected by the vertices corresponding to the maximum number of point clouds are regarded as three key edges, etc.); or the overall confidence is determined according to the number of point clouds and the occlusion probability. (for example, obtain the weighted average corresponding to the number of point clouds and distance, and determine the overall confidence by the weighted average, etc.); or determine three key edges according to the number of point clouds and the occlusion probability (for example, when determining the maximum number of point clouds corresponding to When there are 2 or more vertices, the three combined edges that intersect with the vertex with the smallest occlusion probability can be used as the three key edges according to the occlusion probability of the 2 or more vertices here; or when the minimum occlusion probability is determined. When the probability corresponds to 2 or more vertices, according to the number of point clouds corresponding to 2 or more vertices here, the three combined edges that intersect the vertices corresponding to the maximum number of point clouds can be used as three keys. side etc.) etc. When the edge confidence of each edge in the bounding box of the point cloud can be determined, the overall confidence corresponding to the three combinations can be determined according to the edge confidences of the three combined edges intersecting at one vertex. The edge confidence here is used to describe the probability that the edge corresponding to the edge confidence is the edge of the object bounding box. The overall confidence can be based on the preset weights corresponding to the length, width and height respectively. Confidence is weighted. The preset weights here can be set according to the importance of the length, width, and height to be expanded, etc., which are not limited here.
确定关键长边、关键宽边和关键高边,以及确定关键长边、关键宽边和关键高边中的哪一条边可参照第一种确定扩展约束信息的方法中的描述,在此不再赘述。Determining the critical long side, critical wide side, and critical high side, and determining which of the critical long side, critical wide side, and critical high side, can refer to the description in the first method of determining extended constraint information, which is not repeated here. Repeat.
可以理解,在第二种确定扩展约束信息的方法中,先确定点云包围盒中相交于每个顶点的三条组合边的整体置信度,将最大的整体置信度对应的三条组合边作为三条参考边。也就是说,先将点云包围盒中最可能和物体包围盒中的边重合的边作为参考边,再确定与关键顶点相交的三条组合边为三条参考边,再根据移动方向和/或z方向确定三条参考边中与物体包围盒的长、宽、高对应的边分别作为关键长边、关键宽边和关键高边,可提高确定扩展约束信息的准确率。It can be understood that in the second method of determining extended constraint information, the overall confidence level of the three combined edges that intersect each vertex in the point cloud bounding box is determined first, and the three combined edges corresponding to the largest overall confidence level are used as three references. side. That is to say, the edge in the bounding box of the point cloud that is most likely to coincide with the edge in the bounding box of the object is used as the reference edge, and then the three combined edges that intersect with the key vertices are determined as the three reference edges, and then according to the moving direction and/or z Among the three reference edges for direction determination, the edges corresponding to the length, width and height of the bounding box of the object are used as the key long edge, key wide edge and key high edge respectively, which can improve the accuracy of determining extended constraint information.
在步骤S602之前,该方法还包括:根据二维图像确定目标物体的未遮挡位置包括目标物体的边界信息。Before step S602, the method further includes: determining, according to the two-dimensional image, that the unoccluded position of the target object includes boundary information of the target object.
其中,边界信息包括包含目标物体的物体包围盒的顶点或边。可以理解,在根据二维图像确定非完整点云数据中包括目标物体的边界信息时,可基于边界信息进行扩展操作。且边界信息的位置固定,便于提高确定扩展约束信息的准确率,便于提高数据复用的效果。The boundary information includes the vertices or edges of the object bounding box containing the target object. It can be understood that when it is determined according to the two-dimensional image that the incomplete point cloud data includes the boundary information of the target object, the expansion operation can be performed based on the boundary information. Moreover, the location of the boundary information is fixed, which is convenient for improving the accuracy of determining the extended constraint information and improving the effect of data multiplexing.
在另一种可能的示例中,若根据二维图像确定目标物体的未遮挡位置不包括目标物体的边界信息,则不执行步骤S602。In another possible example, if it is determined according to the two-dimensional image that the unoccluded position of the target object does not include boundary information of the target object, step S602 is not performed.
如图9中的(a)所示,椭圆框里面的目标物体21的车头部位被道路中的树木遮挡,目标物体的车身部位被旁边的车辆遮挡。因此,采集设备采集的点云数据如图9中的(b)所示的椭圆框中,目标物体的非完整点云数据仅包括目标物体未遮挡的部位对应的点云。且未遮挡位置不包括目标物体的边界信息,从而很难在目标物体的非完整点云数据中确定关键长边、关键宽边以及关键高边,不执行步骤S602。否则,执行步骤S602,即确定点云包围盒的扩展约束信息。As shown in (a) of FIG. 9 , the front part of the target object 21 in the elliptical frame is blocked by the trees on the road, and the body part of the target object is blocked by the next vehicle. Therefore, the point cloud data collected by the collection device is in an elliptical frame as shown in (b) of FIG. 9 , and the incomplete point cloud data of the target object only includes the point cloud corresponding to the unoccluded part of the target object. And the unoccluded position does not include the boundary information of the target object, so it is difficult to determine the key long side, the key wide side and the key high side in the incomplete point cloud data of the target object, and step S602 is not performed. Otherwise, step S602 is executed, that is, the extension constraint information of the bounding box of the point cloud is determined.
S603:在点云包围盒上标注扩展约束信息。S603: Mark extended constraint information on the point cloud bounding box.
在本申请实施例中,在点云包围盒上标注扩展约束信息,可以理解为,在标注信息上添加了扩展约束信息,例如,在标注信息上标注关键长边、关键宽边和关键高边中的至少一项,或标注关键顶点和关键长边、关键宽边和关键高边中的至少一项,或关键顶点和关键长边、关键宽边和关键高边中的至少一项对应的扩展方向。如图8中(b)和图11所示,在点云包围盒30上标注了相交于关键顶点d1的箭头A3、箭头A4和箭头A5,且箭头A3、箭头A4和箭头A5中每一箭头分别为关键长边L1、关键宽边L2和关键高边L3对应的扩展方向。In the embodiment of the present application, the extended constraint information is marked on the bounding box of the point cloud, which can be understood as adding extended constraint information to the marked information, for example, marking the key long side, key wide side and key high side on the marking information At least one of the key vertices and key long sides, key wide sides and key high sides, or at least one of the key vertices and key long sides, key wide sides and key high sides correspond to direction of expansion. As shown in (b) of FIG. 8 and FIG. 11 , arrows A3, A4 and A5 intersecting with the key vertex d1 are marked on the point cloud bounding box 30, and each of the arrows A3, A4 and A5 are the expansion directions corresponding to the key long side L1, the key wide side L2 and the key high side L3, respectively.
在图6所描述的方法中,根据目标物体的图像对目标物体对应的非完整点云数据进行标注,从而得到点云包围盒以及该目标物体的移动方向的标注信息。然后根据目标物体的移动方向和/或点云包围盒的z方向确定点云包围盒的扩展约束信息,在点云包围盒上标注该扩展约束信息,从而可根据该扩展约束信息获取满足实际需求的目标包围盒,便于提高数据的使用率。点云包围盒包含非完整点云数据,且保持z方向与点云包围盒的高对应的方向,以及与水平面垂直的z轴平行,可避免由于采集角度导致长方体的方向偏移,提高了标注点云包围盒的准确率。又扩展约束信息是根据移动方向和/或z方向确定的,便于提高尺寸处理的准确率。In the method described in FIG. 6 , the incomplete point cloud data corresponding to the target object is marked according to the image of the target object, so as to obtain the point cloud bounding box and the marking information of the moving direction of the target object. Then, the extended constraint information of the point cloud bounding box is determined according to the moving direction of the target object and/or the z direction of the point cloud bounding box, and the extended constraint information is marked on the point cloud bounding box, so that the actual needs can be obtained according to the extended constraint information. The target bounding box is convenient to improve the utilization rate of data. The point cloud bounding box contains incomplete point cloud data, and the z direction is kept in the direction corresponding to the height of the point cloud bounding box, and the z axis perpendicular to the horizontal plane is parallel, which can avoid the direction offset of the cuboid due to the acquisition angle, and improve the labeling. The accuracy of the point cloud bounding box. Furthermore, the extended constraint information is determined according to the moving direction and/or the z-direction, so as to improve the accuracy of size processing.
在一种可能的示例中,目标物体为车辆,标注信息包括车辆类型,该方法还包括:根据车辆类型确定点云包围盒的第一尺寸;根据扩展约束信息和第一尺寸对点云包围盒进行尺寸处理,得到第一目标包围盒。In a possible example, the target object is a vehicle, the annotation information includes a vehicle type, and the method further includes: determining a first size of the point cloud bounding box according to the vehicle type; Perform size processing to obtain the first target bounding box.
在本申请实施例中,第一尺寸可包括目标物体在图像中表示的长、宽和高的长度,即第一目标包围盒对应的长方体的长、宽和高的长度。由于车辆的形状非长方体,则长方体所包含的目标物体可能包含了多余的空间,第一尺寸还可进一步包括目标物体对应的立方体中各个边的长度等,在此不做限定。In this embodiment of the present application, the first size may include the length, width and height of the target object represented in the image, that is, the length, width and height of the rectangular parallelepiped corresponding to the first target bounding box. Since the shape of the vehicle is not a rectangular parallelepiped, the target object contained in the rectangular parallelepiped may contain redundant space, and the first size may further include the length of each side of the cube corresponding to the target object, etc., which is not limited here.
可以理解,不同车辆类型的车辆尺寸不同。在本申请实施例中,根据车辆类型确定目标物体映射在点云数据中的第一尺寸,即点云包围盒需处理得到的尺寸。需要说明的是,由于点云数据中目标物体为非完整点云数据,也就是说,目标物体的图像是残缺的,对于大部分情况而言,对点云包围盒的尺寸处理操作是扩展操作。此外,第一尺寸还可根据标注信息中的缩放比例进行确定,该缩放比例可用于获取点云数据中目标物体和实际的目标物体之间的大小关系,因此可基于缩放比例和车辆类型获取目标物体在点云图像中的尺寸,可提高确定第一尺寸的准确率。It will be appreciated that different vehicle types have different vehicle sizes. In the embodiment of the present application, the first size of the target object mapped in the point cloud data is determined according to the vehicle type, that is, the size to be obtained by processing the bounding box of the point cloud. It should be noted that since the target object in the point cloud data is incomplete point cloud data, that is to say, the image of the target object is incomplete, in most cases, the size processing operation of the point cloud bounding box is an expansion operation. . In addition, the first size can also be determined according to the scaling ratio in the annotation information, and the scaling ratio can be used to obtain the size relationship between the target object in the point cloud data and the actual target object, so the target can be obtained based on the scaling ratio and the vehicle type The size of the object in the point cloud image can improve the accuracy of determining the first size.
在本申请实施例中,第一目标包围盒为按照点云包围盒的车辆类型和扩展约束信息进行尺寸处理得到的包围盒。在该示例中,可根据车辆类型确定点云包围盒的第一尺寸,再根据扩展约束信息和第一尺寸对点云包围盒进行尺寸处理得到第一目标包围盒,提高了点云包围盒进行尺寸处理的准确率,可提高第一目标包围盒的真实性。In the embodiment of the present application, the first target bounding box is a bounding box obtained by size processing according to the vehicle type and extension constraint information of the point cloud bounding box. In this example, the first size of the point cloud bounding box can be determined according to the vehicle type, and then the size of the point cloud bounding box is processed according to the extended constraint information and the first size to obtain the first target bounding box, which improves the performance of the point cloud bounding box. The accuracy of size processing can improve the authenticity of the first target bounding box.
本申请对于获取第一目标包围盒的方法不做限定,在一种可能的示例中,根据扩展约束信息和第一尺寸确定关键长边、关键宽边和关键高边中的至少一条目标边,以及目标边的目标长度和目标扩展方向;根据目标长度和目标扩展方向对点云包围盒中目标边和目标边对应的边进行尺寸处理,得到第一目标包围盒。The present application does not limit the method for obtaining the first target bounding box. In a possible example, at least one target side among the key long side, the key wide side and the key high side is determined according to the extended constraint information and the first size, and the target length of the target edge and the target extension direction; according to the target length and target extension direction, the size of the target edge and the corresponding edge of the target edge in the point cloud bounding box are processed to obtain the first target bounding box.
其中,目标边为关键长边、关键宽边和关键高边中需要尺寸处理的边,目标长度可包括目标边进行尺寸处理的长度,或者包括第一尺寸中目标边对应的长度等,在此不做限定。目标扩展方向为关键顶点和目标边对应的延展方向,如图11中的(a)所示,目标扩展方向可以是线段L1、线段L2和线段L3重合的箭头中的至少一个箭头所指示的方向,也就是说,箭头A3、箭头A4和箭头A5中的至少一个方向。点云包围盒中与目标边对应的边是指需跟随目标边进行尺寸处理的边,由于点云包围盒为长方体,目标边对应的边可以是长方体中与目标边平行的边。需要说明的是,当第一尺寸包括目标物体对应的立方体中各个边的长度时,目标边对应的边可以是在其他的需要缩小范围的边。The target side is the side that needs size processing among the key long side, the key wide side, and the key high side, and the target length may include the length of the target side for size processing, or the length corresponding to the target side in the first size, etc. Here Not limited. The target extension direction is the extension direction corresponding to the key vertex and the target edge. As shown in (a) in FIG. 11 , the target extension direction may be the direction indicated by at least one arrow in the overlapping arrows of the line segment L1, the line segment L2 and the line segment L3 , that is, at least one direction of arrow A3, arrow A4, and arrow A5. The edge corresponding to the target edge in the point cloud bounding box refers to the edge that needs to follow the target edge for size processing. Since the point cloud bounding box is a cuboid, the edge corresponding to the target edge can be the edge parallel to the target edge in the cuboid. It should be noted that, when the first size includes the length of each side in the cube corresponding to the target object, the side corresponding to the target side may be another side whose range needs to be reduced.
以下以目标边对应的边为目标边平行的边进行举例说明,如图11中的(a)和(b1)所示,假设线段L1为关键长边,目标边为线段L1,则点云包围盒30中与线段L1对应的边,还包括线段L6、线段L9和线段L10。根据第一尺寸和关键边长对应的方向(即箭头A3的方 向),对关键长边(即线段L1)和点云包围盒30中与线段L11平行的边(即线段L6、线段L9和线段L10)进行尺寸处理,得到第一目标包围盒33。该第一目标包围盒33、箭头A1表示的移动方向对应的点云数据称为目标点云数据。The following is an example of the side corresponding to the target side as the parallel side of the target side. As shown in (a) and (b1) in Figure 11, assuming that the line segment L1 is the key long side and the target side is the line segment L1, the point cloud is surrounded by The side corresponding to the line segment L1 in the box 30 further includes the line segment L6, the line segment L9 and the line segment L10. According to the direction corresponding to the first size and the length of the key edge (ie, the direction of the arrow A3), the key long edge (ie the line segment L1) and the edge parallel to the line segment L11 in the point cloud bounding box 30 (ie the line segment L6, the line segment L9 and the line segment L10) Perform size processing to obtain the first target bounding box 33 . The first target bounding box 33 and the point cloud data corresponding to the moving direction indicated by the arrow A1 are called target point cloud data.
如图11中的(a)和(b2)所示,假设线段L2为关键宽边,目标边为线段L2,则点云包围盒30中与线段L2对应的边,还包括线段L5、线段L11和线段L12。根据第一尺寸和关键宽边对应的方向(即箭头A4的方向),对关键宽边(即线段L2)和点云包围盒30中与线段L2平行的边(即线段L5、线段L11和线段L12)进行尺寸处理,得到第一目标包围盒34。该第一目标包围盒34、箭头A1表示的移动方向对应的点云数据称为目标点云数据。As shown in (a) and (b2) of FIG. 11 , assuming that the line segment L2 is the key broadside and the target edge is the line segment L2, the edge corresponding to the line segment L2 in the point cloud bounding box 30 also includes the line segment L5 and the line segment L11 and line segment L12. According to the first size and the direction corresponding to the key broadside (ie the direction of the arrow A4), the key broadside (ie the line segment L2) and the edge parallel to the line segment L2 in the point cloud bounding box 30 (ie the line segment L5, the line segment L11 and the line segment L12) Perform size processing to obtain the first target bounding box 34 . The first target bounding box 34 and the point cloud data corresponding to the moving direction indicated by the arrow A1 are called target point cloud data.
如图11中的(a)和(b3)所示,假设线段L3为关键高边,目标边为线段L3,则点云包围盒30中与线段L3对应的边,还包括线段L4、线段L7和线段L8。根据第一尺寸和关键高边对应的方向(即箭头A5的方向),对关键高边(即线段L3)和点云包围盒30中与线段L3平行的边(即线段L4、线段L7和线段L8)进行尺寸处理,得到第一目标包围盒35。该第一目标包围盒35、箭头A1表示的移动方向对应的点云数据称为目标点云数据。As shown in (a) and (b3) in FIG. 11 , assuming that the line segment L3 is the key high edge and the target edge is the line segment L3, the edge corresponding to the line segment L3 in the point cloud bounding box 30 also includes the line segment L4 and the line segment L7 and line segment L8. According to the first size and the direction corresponding to the key high side (ie the direction of the arrow A5), the key high side (ie the line segment L3) and the edge parallel to the line segment L3 in the point cloud bounding box 30 (ie the line segment L4, the line segment L7 and the line segment L8) Perform size processing to obtain the first target bounding box 35 . The first target bounding box 35 and the point cloud data corresponding to the moving direction indicated by the arrow A1 are called target point cloud data.
需要说明的是,以上举例分别以一条目标边进行尺寸处理,在实际尺寸处理时,可能存在两条目标边,或者三条目标边的情况,依次按照对应的目标尺寸对目标边以及对应的目标边进行尺寸处理,在此不再赘述。本申请实施例以目标物体为车辆进行举例说明,其他的物体类型(例如,飞机类型、轮船类型、机器人类型等)的点云包围盒的处理方法可参照此方法,在此不再赘述。It should be noted that the above example uses one target edge for size processing. In the case of actual size processing, there may be two target edges or three target edges. The target edge and the corresponding target edge are sequentially processed according to the corresponding target size. Size processing is performed, which is not repeated here. The embodiments of the present application take the target object as a vehicle for illustration, and the processing method of point cloud bounding boxes of other object types (eg, aircraft type, ship type, robot type, etc.) can refer to this method, and will not be repeated here.
可以理解,在该示例中,先将关键长边、关键宽边以及关键高边中不满足第一尺寸的边作为目标边,再确定第一尺寸的目标长度以及目标扩展方向,从而根据目标长度和目标扩展方向对点云包围盒中每条目标边以及每条目标边对应的边进行尺寸处理,得到满足车辆类型的第一目标包围盒,提高了点云包围盒进行尺寸处理的准确率。It can be understood that in this example, the edge that does not meet the first size among the key long side, the key wide side and the key high side is used as the target side, and then the target length of the first size and the target extension direction are determined, so as to determine the target length according to the target length. The size of each target edge and the corresponding edge of each target edge in the point cloud bounding box is processed with the target expansion direction to obtain the first target bounding box that meets the vehicle type, which improves the accuracy of the size processing of the point cloud bounding box.
与图6所示的实施例一致的,请参照图10,图10为本申请实施例提供的另一种数据处理方法的流程示意图。该方法以电子设备为例进行描述,具体流程可以包括以下步骤S1001~S1004,其中:Consistent with the embodiment shown in FIG. 6 , please refer to FIG. 10 , which is a schematic flowchart of another data processing method provided by an embodiment of the present application. The method is described by taking an electronic device as an example, and the specific process may include the following steps S1001-S1004, wherein:
S1001:根据目标物体的图像对目标物体的非完整点云数据进行标注,得到标注信息,其中,标注信息包括目标物体的移动方向以及包含非完整点云数据的点云包围盒,点云包围盒的z方向与z轴平行,且与点云包围盒的高对应的方向平行,z轴与水平面垂直。S1001: Label the incomplete point cloud data of the target object according to the image of the target object, and obtain label information, wherein the label information includes the moving direction of the target object and the point cloud bounding box containing the incomplete point cloud data, and the point cloud bounding box The z direction of is parallel to the z axis and the direction corresponding to the height of the point cloud bounding box, and the z axis is perpendicular to the horizontal plane.
S1002:根据移动方向和/或z方向确定点云包围盒的扩展约束信息,其中,扩展约束信息包括相交于关键顶点的关键长边、关键宽边和关键高边中的至少一项。S1002: Determine extended constraint information of the bounding box of the point cloud according to the moving direction and/or the z direction, wherein the extended constraint information includes at least one of a key long side, a key wide side and a key high side intersecting with key vertices.
S1003:在点云包围盒上标注扩展约束信息,得到参考点云数据。S1003: Mark extended constraint information on the point cloud bounding box to obtain reference point cloud data.
S1004:存储参考点云数据。S1004: Store reference point cloud data.
其中,步骤S1001~S1003可参照步骤S601~S603的描述,在此不再赘述。Wherein, for steps S1001-S1003, reference may be made to the description of steps S601-S603, which will not be repeated here.
在本申请实施例中,可将在标注信息的点云包围盒上标注扩展约束信息得到的数据称为参考点云数据。也就是说,参考点云数据包括步骤S601或S1001得到的标注信息以及步骤S602或S1002得到的标注信息中点云包围盒的扩展约束信息。如图8中的(b)所示,参考点云数据对应的点云图像上标注了目标物体的移动方向、点云包围盒30以及扩展约束信息。In the embodiment of the present application, the data obtained by labeling the extended constraint information on the point cloud bounding box of the labeling information may be referred to as reference point cloud data. That is to say, the reference point cloud data includes the annotation information obtained in step S601 or S1001 and the extension constraint information of the bounding box of the point cloud in the annotation information obtained in step S602 or S1002. As shown in (b) of FIG. 8 , the point cloud image corresponding to the reference point cloud data is marked with the moving direction of the target object, the point cloud bounding box 30 and the extended constraint information.
在图10所描述的方法中,根据目标物体的图像对目标物体对应的非完整点云数据进行标注,从而得到点云包围盒以及该目标物体的移动方向的标注信息。然后根据目标物体的移动方向和/或点云包围盒的z方向确定点云包围盒的扩展约束信息,在点云包围盒上标注该扩展 约束信息,得到参考点云数据,并存储该参考点云数据,从而可根据该扩展约束信息获取满足实际需求的目标包围盒,便于进一步提高数据的使用率。点云包围盒包含非完整点云数据,且保持z方向与点云包围盒的高对应的方向,以及与水平面垂直的z轴平行,可避免由于采集角度导致长方体的方向偏移,提高了标注点云包围盒的准确率。又扩展约束信息是根据移动方向和/或z方向确定的,便于提高尺寸处理的准确率。In the method described in FIG. 10 , the incomplete point cloud data corresponding to the target object is annotated according to the image of the target object, so as to obtain the point cloud bounding box and the annotation information of the moving direction of the target object. Then determine the extended constraint information of the point cloud bounding box according to the moving direction of the target object and/or the z direction of the point cloud bounding box, mark the extended constraint information on the point cloud bounding box, obtain the reference point cloud data, and store the reference point Cloud data, so that the target bounding box that meets the actual demand can be obtained according to the extended constraint information, which is convenient to further improve the utilization rate of the data. The point cloud bounding box contains incomplete point cloud data, and the z direction is kept in the direction corresponding to the height of the point cloud bounding box, and the z axis perpendicular to the horizontal plane is parallel, which can avoid the direction offset of the cuboid due to the acquisition angle, and improve the labeling. The accuracy of the point cloud bounding box. Furthermore, the extended constraint information is determined according to the moving direction and/or the z-direction, so as to improve the accuracy of size processing.
在一种可能的示例中,目标物体为车辆,标注信息还包括车辆类型,该方法还包括:接收针对参考点云数据的标注指令;根据标注指令和车辆类型确定点云包围盒的第二尺寸;根据扩展约束信息和第二尺寸对点云包围盒进行尺寸处理,得到第二目标包围盒。In a possible example, the target object is a vehicle, and the annotation information further includes a vehicle type. The method further includes: receiving an annotation instruction for the reference point cloud data; determining the second size of the point cloud bounding box according to the annotation instruction and the vehicle type ; Perform size processing on the point cloud bounding box according to the extended constraint information and the second size to obtain the second target bounding box.
在本申请实施例中,标注指令可包括点云包围盒对应的尺寸处理精度以及尺寸要求。可以理解,对于不同的团队来说,对于尺寸的精确度也存在差别,例如,团队1中的车辆尺寸要求精确到分米,团队2中的车辆尺寸要求到毫米。此外,不同团队的算法存在区别,所要求的目标尺寸可能也存在区别,例如,团队1的目标尺寸要求长、宽、高分别为5.2、4.3、2.0,团队2中的车辆尺寸要求5.25、3.55、2.00等。In this embodiment of the present application, the labeling instruction may include size processing accuracy and size requirements corresponding to the bounding box of the point cloud. It can be understood that there are differences in the accuracy of dimensions for different teams. For example, the dimensions of vehicles in Team 1 are required to be accurate to decimeters, and the dimensions of vehicles in Team 2 are required to be accurate to millimeters. In addition, the algorithms of different teams are different, and the required target size may also be different. For example, the target size of team 1 requires length, width, and height to be 5.2, 4.3, and 2.0, respectively, and the vehicle size of team 2 requires 5.25, 3.55 , 2.00, etc.
标注指令还可包括发送该标注指令的电子设备的标识信息等,在此不做限定。在获取第二目标包围盒之后,根据该标识信息向发送标注指令的电子设备发送第二目标包围盒对应的点云数据。该标注指令用于指示电子设备对参考点云数据中的非完整点云数据进行数据使用。可以理解为,对非完整点云数据对应的点云包围盒进行尺寸处理,以使非完整点云数据进行数据使用。该标注指令可以是根据标注人员在电子设备中输入的信息得到的指令,还可以是从其他的电子设备中接收到的指令,在此不做限定。The tagging instruction may also include identification information of the electronic device that sends the tagging instruction, etc., which is not limited here. After acquiring the second target bounding box, send the point cloud data corresponding to the second target bounding box to the electronic device that sends the labeling instruction according to the identification information. The labeling instruction is used to instruct the electronic device to use the incomplete point cloud data in the reference point cloud data. It can be understood that the size of the point cloud bounding box corresponding to the incomplete point cloud data is processed, so that the incomplete point cloud data can be used for data. The labeling instruction may be an instruction obtained according to the information input by the labeling person in the electronic device, or may be an instruction received from other electronic devices, which is not limited herein.
可以理解,不同车辆类型的车辆尺寸不同。在本申请实施例中,根据标注指令和车辆类型确定目标物体映射在点云数据中的第二尺寸,即点云包围盒需处理得到的尺寸。需要说明的是,由于点云数据中目标物体为非完整点云数据,也就是说,目标物体的图像是残缺的,对于大部分情况而言,对点云包围盒的尺寸处理操作是扩展操作,但根据不同的标注指令也可能存在缩放操作。此外,目标尺寸还可根据标注信息中的缩放比例进行确定,该缩放比例可用于获取点云数据中目标物体和实际的目标物体之间的大小关系,因此可基于缩放比例、标注指令和车辆类型获取目标物体在点云图像中的尺寸,可提高获取第二尺寸的准确率。It will be appreciated that different vehicle types have different vehicle sizes. In the embodiment of the present application, the second size of the target object mapped in the point cloud data is determined according to the labeling instruction and the vehicle type, that is, the size to be obtained by processing the bounding box of the point cloud. It should be noted that since the target object in the point cloud data is incomplete point cloud data, that is to say, the image of the target object is incomplete, in most cases, the size processing operation of the point cloud bounding box is an expansion operation. , but there may also be scaling operations depending on the dimensioning instructions. In addition, the target size can also be determined according to the scaling ratio in the annotation information, which can be used to obtain the size relationship between the target object in the point cloud data and the actual target object, so it can be based on the scaling ratio, annotation instructions and vehicle type. Obtaining the size of the target object in the point cloud image can improve the accuracy of obtaining the second size.
在本申请实施例中,第二目标包围盒为按照点云包围盒的标注指令、车辆类型和扩展约束信息进行尺寸处理得到的包围盒。本申请对于获取第二目标包围盒的方法不做限定,可参照获取第一目标包围盒的方法的描述,在此不再赘述。本申请实施例以目标物体为车辆进行举例说明,其他的物体类型(例如,飞机类型、轮船类型、机器人类型等)的点云包围盒的处理方法也可参照此方法,在此不再赘述。In the embodiment of the present application, the second target bounding box is a bounding box obtained by size processing according to the labeling instruction of the point cloud bounding box, the vehicle type, and the extended constraint information. The present application does not limit the method for obtaining the second target bounding box, and reference may be made to the description of the method for obtaining the first target bounding box, which will not be repeated here. The embodiments of the present application take the target object as a vehicle for illustration, and the method for processing point cloud bounding boxes of other object types (eg, aircraft type, ship type, robot type, etc.) can also refer to this method, and will not be repeated here.
可以理解,在该示例中,在接收到标注指令时,可先根据该标注指令和车辆类型确定点云包围盒的第二尺寸,再根据扩展约束信息和第二尺寸对点云包围盒进行尺寸处理,从而可得到满足车辆类型和标注指令的第二目标包围盒,提高了点云包围盒进行尺寸处理的准确率,且提高了数据的使用率。It can be understood that in this example, when a labeling instruction is received, the second size of the bounding box of the point cloud can be determined according to the labeling instruction and the vehicle type, and then the bounding box of the point cloud can be sized according to the extended constraint information and the second size. Through processing, the second target bounding box that satisfies the vehicle type and the labeling instruction can be obtained, the accuracy of the size processing of the point cloud bounding box is improved, and the utilization rate of data is improved.
上述详细阐述了本申请实施例的方法,下面提供了本申请实施例的装置。The methods of the embodiments of the present application are described in detail above, and the apparatuses of the embodiments of the present application are provided below.
请参见图12,图12是本申请实施例提供的一种数据处理装置的结构示意图,该数据处理装置可以包括标注单元1201、确定单元1202、处理单元1203、存储单元1204和通信单元1205。当该数据处理装置为电子设备时,通信单元1205可用于接收采集设备采集的信息,或接收其他的电子设备发送的标注指令,以及与其他的电子设备发送数据处理之后得到的目标 包围盒或包含目标包围盒的点云数据等数据。当该数据处理装置为采集设备时,通信单元1205可用于向电子设备发送数据处理之后得到的目标包围盒或包含目标包围盒的点云数据等数据。本申请实施例以数据处理装置为电子设备进行举例说明,各个单元的详细描述如下。Referring to FIG. 12 , FIG. 12 is a schematic structural diagram of a data processing apparatus provided by an embodiment of the present application. The data processing apparatus may include a labeling unit 1201 , a determining unit 1202 , a processing unit 1203 , a storage unit 1204 , and a communication unit 1205 . When the data processing device is an electronic device, the communication unit 1205 can be used to receive the information collected by the collecting device, or receive the labeling instruction sent by other electronic devices, and send the target bounding box or the target bounding box obtained after data processing with other electronic devices. Data such as point cloud data of the target bounding box. When the data processing apparatus is a collection device, the communication unit 1205 may be configured to send data such as the target bounding box obtained after data processing or the point cloud data including the target bounding box to the electronic device. The embodiments of the present application take the data processing apparatus as an electronic device for illustration, and the detailed description of each unit is as follows.
标注单元1201用于根据目标物体的图像对目标物体的非完整点云数据进行标注,得到标注信息,其中,标注信息包括目标物体的移动方向以及包含非完整点云数据的点云包围盒,点云包围盒的z方向与z轴平行,且与点云包围盒的高对应的方向平行,z轴与水平面垂直;The labeling unit 1201 is used to label the incomplete point cloud data of the target object according to the image of the target object, and obtain label information, wherein the label information includes the moving direction of the target object and the point cloud bounding box containing the incomplete point cloud data. The z direction of the cloud bounding box is parallel to the z axis, and parallel to the direction corresponding to the height of the point cloud bounding box, and the z axis is perpendicular to the horizontal plane;
确定单元1202用于根据移动方向和/或z方向确定点云包围盒的扩展约束信息,其中,扩展约束信息包括相交于关键顶点的关键长边、关键宽边和关键高边中的至少一项;The determining unit 1202 is configured to determine extension constraint information of the bounding box of the point cloud according to the moving direction and/or the z-direction, wherein the extension constraint information includes at least one of a key long side, a key wide side and a key high side intersecting with key vertices ;
标注单元1201还用于在点云包围盒上标注扩展约束信息。The labeling unit 1201 is also used for labeling extended constraint information on the bounding box of the point cloud.
在一种可能的示例中,确定单元1202具体用于确定点云包围盒中每个顶点的顶点置信度,其中,顶点置信度用于描述顶点为目标物体的物体包围盒的顶点的概率;确定顶点置信度的最大值对应的顶点为关键顶点;确定点云包围盒中与关键顶点相交的三条组合边为三条参考边;根据移动方向和/或z方向确定三条参考边中的关键长边、关键宽边和关键高边中的至少一项。In a possible example, the determining unit 1202 is specifically configured to determine the vertex confidence of each vertex in the point cloud bounding box, wherein the vertex confidence is used to describe the probability that the vertex is the vertex of the object bounding box of the target object; determine The vertex corresponding to the maximum value of the vertex confidence is the key vertex; the three combined edges that intersect the key vertex in the point cloud bounding box are determined as the three reference edges; the key long edge, At least one of the critical wide side and the critical high side.
在一种可能的示例中,确定单元1202具体用于根据点云包围盒中每个顶点对应的点云数量确定顶点的顶点置信度,其中,当点云数量越大时,顶点置信度越大;和/或,根据点云包围盒中每个顶点与采集设备之间的距离确定顶点的顶点置信度,其中,当距离越小时,顶点置信度越大,采集设备采集了非完整点云数据。In a possible example, the determining unit 1202 is specifically configured to determine the vertex confidence of the vertex according to the number of point clouds corresponding to each vertex in the bounding box of the point cloud, wherein, the larger the number of point clouds, the higher the vertex confidence ; and/or, determine the vertex confidence of the vertex according to the distance between each vertex in the point cloud bounding box and the acquisition device, wherein, when the distance is smaller, the vertex confidence is greater, and the acquisition device has collected incomplete point cloud data .
在一种可能的示例中,确定单元1202具体用于确定点云包围盒中相交于每个顶点的三条组合边的整体置信度,其中,整体置信度用于描述三条组合边均为目标物体的物体包围盒的边的概率;确定整体置信度的最大值对应的三条组合边为三条参考边;确定三条参考边相交的顶点为关键顶点;根据移动方向和/或z方向确定三条参考边中的关键长边、关键宽边和关键高边中的至少一项。In a possible example, the determining unit 1202 is specifically configured to determine the overall confidence level of the three combined edges intersecting each vertex in the point cloud bounding box, wherein the overall confidence level is used to describe that the three combined edges are all target objects. The probability of the edge of the bounding box of the object; determine the three combined edges corresponding to the maximum value of the overall confidence as the three reference edges; determine the vertex where the three reference edges intersect as the key vertex; determine the three reference edges according to the moving direction and/or the z direction. At least one of the critical long side, the critical wide side, and the critical high side.
在一种可能的示例中,确定单元1202具体用于根据点云包围盒中相交于每个顶点的三条组合边对应的点云数量确定三条组合边的整体置信度,其中,当点云数量越大时,整体置信度越大;和/或,根据点云包围盒中每个顶点与采集设备之间的距离确定点云包围盒中相交于顶点的三条组合边的整体置信度,其中,当距离越小时,整体置信度越大,采集设备采集了非完整点云数据。In a possible example, the determining unit 1202 is specifically configured to determine the overall confidence of the three combined edges according to the number of point clouds corresponding to the three combined edges intersecting each vertex in the point cloud bounding box, wherein, when the number of point clouds is greater When it is large, the overall confidence is larger; and/or, according to the distance between each vertex in the point cloud bounding box and the acquisition device, determine the overall confidence of the three combined edges in the point cloud bounding box that intersect the vertices, wherein, when The smaller the distance, the greater the overall confidence, and the acquisition device has collected incomplete point cloud data.
在一种可能的示例中,目标物体为车辆,标注信息还包括车辆类型,确定单元1202还用于根据车辆类型确定点云包围盒的第一尺寸;数据处理装置还包括处理单元1203,用于根据扩展约束信息和第一尺寸对点云包围盒进行尺寸处理,得到第一目标包围盒。In a possible example, the target object is a vehicle, the annotation information also includes the vehicle type, and the determining unit 1202 is further configured to determine the first size of the point cloud bounding box according to the vehicle type; the data processing apparatus further includes a processing unit 1203, configured to Size processing is performed on the bounding box of the point cloud according to the extended constraint information and the first size to obtain the first target bounding box.
在一种可能的示例中,处理单元1203具体用于根据第一尺寸和扩展约束信息确定关键长边、关键宽边和关键高边中的至少一条目标边,以及至少一条目标边的目标长度和目标扩展方向;根据目标长度和目标扩展方向,对点云包围盒中目标边和目标边对应的边进行尺寸处理,得到第一目标包围盒。In a possible example, the processing unit 1203 is specifically configured to determine, according to the first size and the expansion constraint information, at least one target side among the critical long side, the critical wide side, and the critical high side, and the target length and the target length of the at least one target side. Target extension direction: According to the target length and target extension direction, size processing is performed on the target edge and the edge corresponding to the target edge in the point cloud bounding box to obtain the first target bounding box.
在一种可能的示例中,数据处理装置还包括:存储单元1204用于对在点云包围盒上标注扩展约束信息得到的参考点云数据进行存储。In a possible example, the data processing apparatus further includes: the storage unit 1204 is configured to store the reference point cloud data obtained by marking the extended constraint information on the bounding box of the point cloud.
在一种可能的示例中,目标物体为车辆,标注信息还包括车辆类型,数据处理装置还包括通信单元1205和处理单元1203,其中:通信单元1205,用于接收针对参考点云数据的标注指令;确定单元1202,还用于根据标注指令和车辆类型确定点云包围盒的第二尺寸;处理单元1203用于根据扩展约束信息和第二尺寸对点云包围盒进行尺寸处理,得到第二目标包围 盒。In a possible example, the target object is a vehicle, the labeling information also includes the vehicle type, and the data processing apparatus further includes a communication unit 1205 and a processing unit 1203, where the communication unit 1205 is configured to receive labeling instructions for the reference point cloud data The determining unit 1202 is also used to determine the second size of the point cloud bounding box according to the annotation instruction and the vehicle type; the processing unit 1203 is used to perform size processing on the point cloud bounding box according to the extended constraint information and the second size to obtain the second target bounding box.
需要说明的是,各个单元的实现还可以对应参照图6或图10所示的方法实施例的相应描述。It should be noted that, the implementation of each unit may also correspond to the corresponding description of the method embodiment shown in FIG. 6 or FIG. 10 .
请参见图13,图13是本申请实施例提供的一种数据处理装置,该数据处理装置包括处理器1301、存储器1302和通信接口1303,处理器1301、存储器1302和通信接口1303通过总线1304相互连接。图12所示的通信单元1205所实现的相关功能可通过通信接口1303来实现,图12所示的存储单元1204所实现的相关功能可通过存储器1302来实现,图12所示的标注单元1201、确定单元1202和处理单元1203所实现的相关功能可通过处理器1301来实现。Please refer to FIG. 13 . FIG. 13 is a data processing apparatus provided by an embodiment of the present application. The data processing apparatus includes a processor 1301 , a memory 1302 and a communication interface 1303 . The processor 1301 , the memory 1302 and the communication interface 1303 communicate with each other through a bus 1304 connect. The relevant functions realized by the communication unit 1205 shown in FIG. 12 can be realized through the communication interface 1303, the relevant functions realized by the storage unit 1204 shown in FIG. 12 can be realized through the memory 1302, and the labeling unit 1201, The related functions implemented by the determining unit 1202 and the processing unit 1203 can be implemented by the processor 1301 .
存储器1302包括但不限于是随机存储记忆体(random access memory,RAM)、只读存储器(read-only memory,ROM)、可擦除可编程只读存储器(erasable programmable read only memory,EPROM)、或便携式只读存储器(compact disc read-only memory,CD-ROM),该存储器1302用于相关计算机程序及数据。通信接口1303用于接收和发送数据。The memory 1302 includes, but is not limited to, random access memory (RAM), read-only memory (ROM), erasable programmable read only memory (EPROM), or A portable read-only memory (compact disc read-only memory, CD-ROM), the memory 1302 is used for related computer programs and data. The communication interface 1303 is used to receive and transmit data.
处理器1301可以是一个或多个中央处理器(central processing unit,CPU),在处理器1301是一个CPU的情况下,该CPU可以是单核CPU,也可以是多核CPU。The processor 1301 may be one or more central processing units (central processing units, CPUs). When the processor 1301 is a CPU, the CPU may be a single-core CPU or a multi-core CPU.
该数据处理装置的处理器1301用于读取存储器1302中存储的计算机程序代码,执行以下操作:The processor 1301 of the data processing apparatus is configured to read the computer program codes stored in the memory 1302, and perform the following operations:
根据目标物体的图像对目标物体的非完整点云数据进行标注,得到标注信息,其中,标注信息包括目标物体的移动方向以及包含非完整点云数据的点云包围盒,点云包围盒的z方向与z轴平行,且与点云包围盒的高对应的方向平行,z轴与水平面垂直;Annotate the incomplete point cloud data of the target object according to the image of the target object, and obtain the annotation information, wherein the annotation information includes the moving direction of the target object, the point cloud bounding box containing the incomplete point cloud data, and the z of the point cloud bounding box. The direction is parallel to the z-axis and the direction corresponding to the height of the point cloud bounding box, and the z-axis is perpendicular to the horizontal plane;
根据移动方向和/或z方向确定点云包围盒的扩展约束信息,其中,扩展约束信息包括相交于关键顶点的关键长边、关键宽边和关键高边中的至少一项;Determine the extension constraint information of the point cloud bounding box according to the moving direction and/or the z direction, wherein the extension constraint information includes at least one of a key long side, a key wide side and a key high side intersecting with key vertices;
在点云包围盒上标注扩展约束信息。Annotate extended constraint information on the point cloud bounding box.
在一种可能的示例中,在根据移动方向和/或z方向确定点云包围盒的扩展约束信息方面,处理器1301具体用于执行以下操作:In a possible example, in terms of determining the expansion constraint information of the bounding box of the point cloud according to the moving direction and/or the z-direction, the processor 1301 is specifically configured to perform the following operations:
确定点云包围盒中每个顶点的顶点置信度,其中,顶点置信度用于描述顶点为目标物体的物体包围盒的顶点的概率;Determine the vertex confidence of each vertex in the point cloud bounding box, where the vertex confidence is used to describe the probability that the vertex is the vertex of the object bounding box of the target object;
确定顶点置信度的最大值对应的顶点为关键顶点;Determine the vertex corresponding to the maximum vertex confidence as the key vertex;
确定点云包围盒中与关键顶点相交的三条组合边为三条参考边;Determine the three combined edges that intersect with key vertices in the bounding box of the point cloud as three reference edges;
根据移动方向和/或z方向确定三条参考边中的关键长边、关键宽边和关键高边中的至少一项。At least one of the critical long side, the critical broad side and the critical high side of the three reference sides is determined according to the movement direction and/or the z direction.
在一种可能的示例中,在确定点云包围盒中每个顶点的顶点置信度方面,处理器1301具体用于执行以下操作:In a possible example, in determining the vertex confidence of each vertex in the point cloud bounding box, the processor 1301 is specifically configured to perform the following operations:
根据点云包围盒中每个顶点对应的点云数量确定顶点的顶点置信度,其中,当点云数量越大时,顶点置信度越大;和/或,根据点云包围盒中每个顶点与采集设备之间的距离确定顶点的顶点置信度,其中,当距离越小时,顶点置信度越大,采集设备采集了非完整点云数据。The vertex confidence of a vertex is determined according to the number of point clouds corresponding to each vertex in the point cloud bounding box, wherein, when the number of point clouds is larger, the vertex confidence is larger; and/or, according to each vertex in the point cloud bounding box The distance from the acquisition device determines the vertex confidence of the vertex, wherein, when the distance is smaller, the vertex confidence is larger, and the acquisition device has collected incomplete point cloud data.
在一种可能的示例中,在根据移动方向和/或z方向确定点云包围盒的扩展约束信息方面,处理器1301具体用于执行以下操作:In a possible example, in terms of determining the expansion constraint information of the bounding box of the point cloud according to the moving direction and/or the z-direction, the processor 1301 is specifically configured to perform the following operations:
确定点云包围盒中相交于每个顶点的三条组合边的整体置信度,其中,整体置信度用于描述三条组合边均为目标物体的物体包围盒的边的概率;Determine the overall confidence of the three combined edges that intersect each vertex in the point cloud bounding box, where the overall confidence is used to describe the probability that the three combined edges are the edges of the object bounding box of the target object;
确定整体置信度的最大值对应的三条组合边为三条参考边;Determine the three combined edges corresponding to the maximum value of the overall confidence as three reference edges;
确定三条参考边相交的顶点为关键顶点;Determine the vertex where the three reference edges intersect as the key vertex;
根据移动方向和/或z方向确定三条参考边中的关键长边、关键宽边和关键高边中的至少一项。At least one of the critical long side, the critical broad side and the critical high side of the three reference sides is determined according to the movement direction and/or the z direction.
在一种可能的示例中,在确定点云包围盒中相交于每个顶点的三条组合边的整体置信度方面,处理器1301具体用于执行以下操作:In a possible example, in determining the overall confidence of the three combined edges intersecting each vertex in the point cloud bounding box, the processor 1301 is specifically configured to perform the following operations:
根据点云包围盒中相交于每个顶点的三条组合边对应的点云数量确定三条组合边的整体置信度,其中,当点云数量越大时,整体置信度越大;和/或,根据点云包围盒中每个顶点与采集设备之间的距离确定点云包围盒中相交于顶点的三条组合边的整体置信度,其中,当距离越小时,整体置信度越大,采集设备采集了非完整点云数据。The overall confidence of the three combined edges is determined according to the number of point clouds corresponding to the three combined edges intersecting each vertex in the point cloud bounding box, wherein the greater the number of point clouds, the greater the overall confidence; and/or, according to The distance between each vertex in the point cloud bounding box and the acquisition device determines the overall confidence of the three combined edges in the point cloud bounding box that intersect with the vertex. Incomplete point cloud data.
在一种可能的示例中,目标物体为车辆,标注信息还包括车辆类型,处理器1301还用于执行以下操作:In a possible example, the target object is a vehicle, the annotation information further includes the vehicle type, and the processor 1301 is further configured to perform the following operations:
根据车辆类型确定点云包围盒的第一尺寸;Determine the first size of the point cloud bounding box according to the vehicle type;
根据扩展约束信息和第一尺寸对点云包围盒进行尺寸处理,得到第一目标包围盒。Size processing is performed on the bounding box of the point cloud according to the extended constraint information and the first size to obtain the first target bounding box.
在一种可能的示例中,在根据扩展约束信息和第一尺寸对点云包围盒进行尺寸处理,得到第一目标包围盒方面,处理器1301具体用于执行以下操作:In a possible example, in terms of performing size processing on the point cloud bounding box according to the extended constraint information and the first size to obtain the first target bounding box, the processor 1301 is specifically configured to perform the following operations:
根据扩展约束信息和第一尺寸确定关键长边、关键宽边和关键高边中的至少一条目标边,以及至少一条目标边的目标长度和目标扩展方向;Determine at least one target side among the key long side, the key wide side and the key high side, and the target length and target extension direction of the at least one target side according to the extension constraint information and the first size;
根据目标长度和目标扩展方向,对点云包围盒中目标边和目标边对应的边进行尺寸处理,得到第一目标包围盒。According to the target length and the target extension direction, size processing is performed on the target edge and the edge corresponding to the target edge in the point cloud bounding box to obtain the first target bounding box.
在一种可能的示例中,处理器1301还用于执行以下操作:In a possible example, the processor 1301 is further configured to perform the following operations:
对在点云包围盒上标注扩展约束信息得到的参考点云数据进行存储。Store the reference point cloud data obtained by labeling the extended constraint information on the point cloud bounding box.
在一种可能的示例中,目标物体为车辆,标注信息还包括车辆类型,处理器1301还用于执行以下操作:In a possible example, the target object is a vehicle, the annotation information further includes the vehicle type, and the processor 1301 is further configured to perform the following operations:
接收针对参考点云数据的标注指令;Receive annotation instructions for reference point cloud data;
根据标注指令和车辆类型确定点云包围盒的第二尺寸;Determine the second size of the point cloud bounding box according to the annotation instruction and the vehicle type;
根据扩展约束信息和第二尺寸对点云包围盒进行尺寸处理,得到第二目标包围盒。Perform size processing on the point cloud bounding box according to the extended constraint information and the second size to obtain the second target bounding box.
需要说明的是,各个操作的实现还可以对应参照图6或图10所示的方法实施例的相应描述。It should be noted that, the implementation of each operation may also correspond to the corresponding description with reference to the method embodiment shown in FIG. 6 or FIG. 10 .
本申请实施例还提供一种计算机可读存储介质,所述计算机可读存储介质中存储有计算机程序,当其在电子设备上运行时,图6或图10所示的方法流程得以实现。Embodiments of the present application further provide a computer-readable storage medium, where a computer program is stored in the computer-readable storage medium, and when the computer program is executed on an electronic device, the method flow shown in FIG. 6 or FIG. 10 is implemented.
本申请实施例还提供一种计算机程序产品,当所述计算机程序产品在电子设备上运行时,图6或图10所示的方法流程得以实现。The embodiment of the present application further provides a computer program product, when the computer program product runs on an electronic device, the method flow shown in FIG. 6 or FIG. 10 is implemented.
本申请实施例还提供一种芯片,包括处理器,用于从存储器中调用并运行存储器中存储的指令,使得安装有芯片的终端设备执行图6或图10所示的方法。An embodiment of the present application further provides a chip, including a processor, configured to call and execute instructions stored in the memory from the memory, so that the terminal device with the chip installed executes the method shown in FIG. 6 or FIG. 10 .
本申请实施例还提供另一种芯片,该芯片可以为终端设备或接入网设备内的芯片,该芯片包括:输入接口、输出接口和处理电路,输入接口、输出接口与电路之间通过内部连接通路相连,处理电路用于执行图6或图10所示的方法。The embodiment of the present application further provides another chip, which may be a chip in a terminal device or an access network device, and the chip includes: an input interface, an output interface, and a processing circuit, and the input interface, the output interface and the circuit pass through the internal The connection paths are connected, and the processing circuit is used to execute the method shown in FIG. 6 or FIG. 10 .
本申请实施例还提供另一种芯片,包括:输入接口、输出接口、处理器,可选的,还包括存储器,输入接口、输出接口、处理器以及存储器之间通过内部连接通路相连,处理器用 于执行存储器中的代码,当代码被执行时,处理器用于执行图6或图10所示的方法。The embodiment of the present application further provides another chip, including: an input interface, an output interface, a processor, and optionally, a memory. The input interface, the output interface, the processor, and the memory are connected through an internal connection path, and the processor uses an internal connection path. For executing the code in the memory, when the code is executed, the processor is used to execute the method shown in FIG. 6 or FIG. 10 .
本申请实施例还提供一种芯片系统,所述芯片系统包括至少一个处理器,存储器和接口电路,所述存储器、所述收发器和所述至少一个处理器通过线路互联,所述至少一个存储器中存储有计算机程序;所述计算机程序被所述处理器执行时,图6或图10所示的方法流程得以实现。An embodiment of the present application further provides a chip system, the chip system includes at least one processor, a memory, and an interface circuit, the memory, the transceiver, and the at least one processor are interconnected by lines, and the at least one memory A computer program is stored in the computer; when the computer program is executed by the processor, the method flow shown in FIG. 6 or FIG. 10 is realized.
综上所述,通过实施本申请实施例,在点云包围盒上添加用于尺寸处理的扩展约束信息,从而可根据该扩展约束信息获取满足实际需求的目标包围盒,便于提高数据的使用率。点云包围盒包含非完整点云数据,且保持z方向与点云包围盒的高对应的方向,以及与水平面垂直的z轴平行,可避免由于采集角度导致长方体的方向偏移,提高了标注点云包围盒的准确率。又扩展约束信息是根据移动方向和/或z方向确定的,便于提高尺寸处理的准确率。To sum up, by implementing the embodiments of the present application, extended constraint information for size processing is added to the point cloud bounding box, so that the target bounding box that meets the actual needs can be obtained according to the extended constraint information, which is convenient to improve the utilization rate of data . The point cloud bounding box contains incomplete point cloud data, and the z direction is kept in the direction corresponding to the height of the point cloud bounding box, and the z axis perpendicular to the horizontal plane is parallel, which can avoid the direction offset of the cuboid due to the acquisition angle, and improve the labeling. The accuracy of the point cloud bounding box. Furthermore, the extended constraint information is determined according to the moving direction and/or the z-direction, so as to improve the accuracy of size processing.
本领域普通技术人员可以理解实现上述实施例方法中的全部或部分流程,该流程可以由计算机程序来计算机程序相关的硬件完成,该计算机程序可存储于计算机可读取存储介质中,该计算机程序在执行时,可包括如上述各方法实施例的流程。而前述的存储介质包括:ROM或随机存储记忆体RAM、磁碟或者光盘等各种可存储计算机程序代码的介质。Those of ordinary skill in the art can understand that all or part of the processes in the methods of the above embodiments can be implemented, and the process can be completed by a computer program or computer program-related hardware, and the computer program can be stored in a computer-readable storage medium. During execution, the processes of the foregoing method embodiments may be included. The aforementioned storage medium includes: ROM or random storage memory RAM, magnetic disk or optical disk and other mediums that can store computer program codes.

Claims (21)

  1. 一种数据处理方法,其特征在于,包括:A data processing method, comprising:
    根据目标物体的图像对所述目标物体的非完整点云数据进行标注,得到标注信息,其中,所述标注信息包括所述目标物体的移动方向以及包含所述非完整点云数据的点云包围盒,所述点云包围盒的z方向与z轴平行,且与所述点云包围盒的高对应的方向平行,所述z轴与水平面垂直;Annotate the incomplete point cloud data of the target object according to the image of the target object to obtain annotation information, wherein the annotation information includes the moving direction of the target object and the point cloud surrounding the incomplete point cloud data. box, the z direction of the point cloud bounding box is parallel to the z axis, and parallel to the direction corresponding to the height of the point cloud bounding box, and the z axis is perpendicular to the horizontal plane;
    根据所述移动方向和/或所述z方向确定所述点云包围盒的扩展约束信息,其中,所述扩展约束信息包括相交于关键顶点的关键长边、关键宽边和关键高边中的至少一项;Expansion constraint information of the bounding box of the point cloud is determined according to the movement direction and/or the z direction, wherein the expansion constraint information includes key long sides, key broad sides and key high sides intersecting with key vertices at least one;
    在所述点云包围盒上标注所述扩展约束信息。The extended constraint information is marked on the point cloud bounding box.
  2. 根据权利要求1所述的数据处理方法,其特征在于,所述根据所述移动方向和/或所述z方向确定所述点云包围盒的扩展约束信息,包括:The data processing method according to claim 1, wherein the determining the expansion constraint information of the point cloud bounding box according to the moving direction and/or the z direction comprises:
    确定所述点云包围盒中每个顶点的顶点置信度,其中,所述顶点置信度用于描述所述顶点为所述目标物体的物体包围盒的顶点的概率;determining the vertex confidence of each vertex in the point cloud bounding box, wherein the vertex confidence is used to describe the probability that the vertex is a vertex of the object bounding box of the target object;
    确定所述顶点置信度的最大值对应的顶点为所述关键顶点;Determine that the vertex corresponding to the maximum value of the vertex confidence is the key vertex;
    确定所述点云包围盒中与所述关键顶点相交的三条组合边为三条参考边;Determine that the three combined edges in the point cloud bounding box that intersect with the key vertices are three reference edges;
    根据所述移动方向和/或所述z方向确定所述三条参考边中的所述关键长边、所述关键宽边和所述关键高边中的至少一项。At least one of the critical long side, the critical broad side and the critical high side of the three reference sides is determined according to the moving direction and/or the z-direction.
  3. 根据权利要求2所述的数据处理方法,其特征在于,所述确定所述点云包围盒中每个顶点的顶点置信度,包括:The data processing method according to claim 2, wherein the determining the vertex confidence of each vertex in the point cloud bounding box comprises:
    根据所述点云包围盒中每个顶点对应的点云数量确定所述顶点的顶点置信度,其中,当所述点云数量越大时,所述顶点置信度越大;和/或,The vertex confidence of the vertex is determined according to the number of point clouds corresponding to each vertex in the point cloud bounding box, wherein, when the number of the point cloud is larger, the confidence of the vertex is larger; and/or,
    根据所述点云包围盒中每个顶点与采集设备之间的距离确定所述顶点的顶点置信度,其中,当所述距离越小时,所述顶点置信度越大,所述采集设备采集了所述非完整点云数据。The vertex confidence of the vertex is determined according to the distance between each vertex in the point cloud bounding box and the acquisition device, wherein, when the distance is smaller, the vertex confidence is greater, and the acquisition device has collected the incomplete point cloud data.
  4. 根据权利要求1所述的数据处理方法,其特征在于,所述根据所述移动方向和/或所述z方向确定所述点云包围盒的扩展约束信息,包括:The data processing method according to claim 1, wherein the determining the expansion constraint information of the point cloud bounding box according to the moving direction and/or the z direction comprises:
    确定所述点云包围盒中相交于每个顶点的三条组合边的整体置信度,其中,所述整体置信度用于描述所述三条组合边均为所述目标物体的物体包围盒的边的概率;Determine the overall confidence level of the three combined edges intersecting at each vertex in the point cloud bounding box, wherein the overall confidence level is used to describe that the three combined edges are all the edges of the object bounding box of the target object probability;
    确定所述整体置信度的最大值对应的三条组合边为三条参考边;Determine that the three combined edges corresponding to the maximum value of the overall confidence are three reference edges;
    确定所述三条参考边相交的顶点为所述关键顶点;Determine the vertex where the three reference edges intersect as the key vertex;
    根据所述移动方向和/或所述z方向确定所述三条参考边中的所述关键长边、所述关键宽边和所述关键高边中的至少一项。At least one of the critical long side, the critical broad side and the critical high side of the three reference sides is determined according to the moving direction and/or the z-direction.
  5. 根据权利要求4所述的数据处理方法,其特征在于,所述确定所述点云包围盒中相交于每个顶点的三条组合边的整体置信度,包括:The data processing method according to claim 4, wherein the determining the overall confidence of the three combined edges in the point cloud bounding box intersecting with each vertex comprises:
    根据所述点云包围盒中相交于每个顶点的三条组合边对应的点云数量确定所述三条组合边的整体置信度,其中,当所述点云数量越大时,所述整体置信度越大;和/或,The overall confidence level of the three combined edges is determined according to the number of point clouds corresponding to the three combined edges intersecting each vertex in the point cloud bounding box, wherein, when the number of the point clouds is larger, the overall confidence level the greater; and/or,
    根据所述点云包围盒中每个顶点与采集设备之间的距离确定所述点云包围盒中相交于所述顶点的三条组合边的整体置信度,其中,当所述距离越小时,所述整体置信度越大,所述采集设备采集了所述非完整点云数据。The overall confidence of the three combined edges in the point cloud bounding box intersecting with the vertex is determined according to the distance between each vertex in the point cloud bounding box and the acquisition device, wherein, when the distance is smaller, the The larger the overall confidence is, the more incomplete point cloud data is collected by the collection device.
  6. 根据权利要求1-5中任一项所述的数据处理方法,其特征在于,所述目标物体为车辆,所述标注信息还包括车辆类型,所述方法还包括:The data processing method according to any one of claims 1-5, wherein the target object is a vehicle, the label information further includes a vehicle type, and the method further includes:
    根据所述车辆类型确定所述点云包围盒的第一尺寸;determining a first size of the point cloud bounding box according to the vehicle type;
    根据所述扩展约束信息和所述第一尺寸对所述点云包围盒进行尺寸处理,得到第一目标包围盒。Size processing is performed on the point cloud bounding box according to the extended constraint information and the first size to obtain a first target bounding box.
  7. 根据权利要求6所述的数据处理方法,其特征在于,所述根据所述扩展约束信息和所述第一尺寸对所述点云包围盒进行尺寸处理,得到第一目标包围盒,包括:The data processing method according to claim 6, wherein the performing size processing on the point cloud bounding box according to the expansion constraint information and the first size to obtain a first target bounding box, comprising:
    根据所述扩展约束信息和所述第一尺寸确定所述关键长边、所述关键宽边和所述关键高边中的至少一条目标边,以及所述至少一条目标边的目标长度和目标扩展方向;Determine at least one target side of the critical long side, the critical wide side and the critical high side, and the target length and target extension of the at least one target side according to the extension constraint information and the first size direction;
    根据所述目标长度和所述目标扩展方向,对所述点云包围盒中所述目标边和所述目标边对应的边进行尺寸处理,得到第一目标包围盒。According to the target length and the target extension direction, size processing is performed on the target edge and the edge corresponding to the target edge in the point cloud bounding box to obtain a first target bounding box.
  8. 根据权利要求1-5中任一项所述的数据处理方法,其特征在于,所述方法还包括:The data processing method according to any one of claims 1-5, wherein the method further comprises:
    对在所述点云包围盒上标注所述扩展约束信息得到的参考点云数据进行存储。The reference point cloud data obtained by marking the extended constraint information on the point cloud bounding box is stored.
  9. 根据权利要求8所述的数据处理方法,其特征在于,所述目标物体为车辆,所述标注信息还包括车辆类型,所述方法还包括:The data processing method according to claim 8, wherein the target object is a vehicle, the label information further includes a vehicle type, and the method further comprises:
    接收针对所述参考点云数据的标注指令;receiving an annotation instruction for the reference point cloud data;
    根据所述标注指令和所述车辆类型确定所述点云包围盒的第二尺寸;determining a second size of the point cloud bounding box according to the annotation instruction and the vehicle type;
    根据所述扩展约束信息和所述第二尺寸对所述点云包围盒进行尺寸处理,得到第二目标包围盒。Size processing is performed on the point cloud bounding box according to the extended constraint information and the second size to obtain a second target bounding box.
  10. 一种数据处理装置,其特征在于,包括:A data processing device, comprising:
    标注单元,用于根据目标物体的图像对所述目标物体的非完整点云数据进行标注,得到标注信息,其中,所述标注信息包括所述目标物体的移动方向以及包含所述非完整点云数据的点云包围盒,所述点云包围盒的z方向与z轴平行,且与所述点云包围盒的高对应的方向平行,所述z轴与水平面垂直;A labeling unit, configured to label the incomplete point cloud data of the target object according to the image of the target object to obtain label information, wherein the label information includes the moving direction of the target object and the incomplete point cloud containing the target object. The point cloud bounding box of the data, the z direction of the point cloud bounding box is parallel to the z axis, and is parallel to the direction corresponding to the height of the point cloud bounding box, and the z axis is perpendicular to the horizontal plane;
    确定单元,用于根据所述移动方向和/或所述z方向确定所述点云包围盒的扩展约束信息,其中,所述扩展约束信息包括相交于关键顶点的关键长边、关键宽边和关键高边中的至少一项;a determining unit, configured to determine extended constraint information of the bounding box of the point cloud according to the moving direction and/or the z direction, wherein the extended constraint information includes a key long side, a key broad side and At least one of the critical high sides;
    所述标注单元,还用于在所述点云包围盒上标注所述扩展约束信息。The labeling unit is further configured to label the extended constraint information on the point cloud bounding box.
  11. 根据权利要求10所述的数据处理装置,其特征在于,所述确定单元具体用于确定所述点云包围盒中每个顶点的顶点置信度,其中,所述顶点置信度用于描述所述顶点为所述目标物体的物体包围盒的顶点的概率;确定所述顶点置信度的最大值对应的顶点为所述关键顶点;确定所述点云包围盒中与所述关键顶点相交的三条组合边为三条参考边;根据所述移动方向和/或所述z方向确定所述三条参考边中的所述关键长边、所述关键宽边和所述关键高边中的至少一项。The data processing apparatus according to claim 10, wherein the determining unit is specifically configured to determine the vertex confidence of each vertex in the point cloud bounding box, wherein the vertex confidence is used to describe the The probability that the vertex is the vertex of the object bounding box of the target object; determine that the vertex corresponding to the maximum value of the vertex confidence is the key vertex; determine the three combinations in the point cloud bounding box that intersect with the key vertex The sides are three reference sides; at least one of the critical long side, the critical broad side and the critical high side of the three reference sides is determined according to the movement direction and/or the z-direction.
  12. 根据权利要求11所述的数据处理装置,其特征在于,所述确定单元具体用于根据所述点云包围盒中每个顶点对应的点云数量确定所述顶点的顶点置信度,其中,当所述点云数量越大时,所述顶点置信度越大;和/或,根据所述点云包围盒中每个顶点与采集设备之间的距离确定所述顶点的顶点置信度,其中,当所述距离越小时,所述顶点置信度越大,所述采集设备采集了所述非完整点云数据。The data processing device according to claim 11, wherein the determining unit is specifically configured to determine the vertex confidence of the vertex according to the number of point clouds corresponding to each vertex in the point cloud bounding box, wherein, when When the number of the point cloud is larger, the confidence of the vertex is larger; and/or, the vertex confidence of the vertex is determined according to the distance between each vertex in the bounding box of the point cloud and the acquisition device, wherein, When the distance is smaller, the vertex confidence is larger, and the collection device has collected the incomplete point cloud data.
  13. 根据权利要求10所述的数据处理装置,其特征在于,所述确定单元具体用于确定所述点云包围盒中相交于每个顶点的三条组合边的整体置信度,其中,所述整体置信度用于描述所述三条组合边均为所述目标物体的物体包围盒的边的概率;确定所述整体置信度的最大值对应的三条组合边为三条参考边;确定所述三条参考边相交的顶点为所述关键顶点;根据 所述移动方向和/或所述z方向确定所述三条参考边中的所述关键长边、所述关键宽边和所述关键高边中的至少一项。The data processing device according to claim 10, wherein the determining unit is specifically configured to determine the overall confidence of the three combined edges intersecting each vertex in the point cloud bounding box, wherein the overall confidence The degree is used to describe the probability that the three combined edges are the edges of the object bounding box of the target object; determine that the three combined edges corresponding to the maximum value of the overall confidence are three reference edges; determine that the three reference edges intersect The vertex is the key vertex; at least one of the key long side, the key wide side and the key high side of the three reference sides is determined according to the moving direction and/or the z direction .
  14. 根据权利要求10所述的数据处理装置,其特征在于,所述确定单元具体用于根据所述点云包围盒中相交于每个顶点的三条组合边对应的点云数量确定所述三条组合边的整体置信度,其中,当所述点云数量越大时,所述整体置信度越大;和/或,根据所述点云包围盒中每个顶点与采集设备之间的距离确定所述点云包围盒中相交于所述顶点的三条组合边的整体置信度,其中,当所述距离越小时,所述整体置信度越大,所述采集设备采集了所述非完整点云数据。The data processing device according to claim 10, wherein the determining unit is specifically configured to determine the three combined edges according to the number of point clouds corresponding to the three combined edges intersecting each vertex in the point cloud bounding box The overall confidence of , wherein, when the number of the point cloud is larger, the overall confidence is larger; and/or, according to the distance between each vertex in the point cloud bounding box and the acquisition device to determine the The overall confidence level of the three combined edges in the point cloud bounding box that intersect the vertex, wherein, when the distance is smaller, the overall confidence level is larger, and the collection device has collected the incomplete point cloud data.
  15. 根据权利要求10-14中任一项所述的数据处理装置,其特征在于,所述目标物体为车辆,所述标注信息还包括车辆类型,所述确定单元还用于根据所述车辆类型确定所述点云包围盒的第一尺寸;所述数据处理装置还包括处理单元,用于根据所述扩展约束信息和所述第一尺寸对所述点云包围盒进行尺寸处理,得到第一目标包围盒。The data processing device according to any one of claims 10-14, wherein the target object is a vehicle, the labeling information further includes a vehicle type, and the determining unit is further configured to determine according to the vehicle type the first size of the point cloud bounding box; the data processing device further includes a processing unit, configured to perform size processing on the point cloud bounding box according to the extended constraint information and the first size to obtain a first target bounding box.
  16. 根据权利要求15所述的数据处理装置,其特征在于,所述处理单元具体用于根据所述第一尺寸和所述扩展约束信息确定所述关键长边、所述关键宽边和所述关键高边中的至少一条目标边,以及所述至少一条目标边的目标长度和目标扩展方向;根据所述目标长度和所述目标扩展方向,对所述点云包围盒中所述目标边和所述目标边对应的边进行尺寸处理,得到第一目标包围盒。The data processing apparatus according to claim 15, wherein the processing unit is specifically configured to determine the critical long side, the critical broad side and the critical long side according to the first size and the expansion constraint information At least one target side in the high side, and the target length and target extension direction of the at least one target side; according to the target length and the target extension direction, the target side and all the Size processing is performed on the edge corresponding to the target edge to obtain the first target bounding box.
  17. 根据权利要求10-14中任一项所述的数据处理装置,其特征在于,所述数据处理装置还包括:The data processing device according to any one of claims 10-14, wherein the data processing device further comprises:
    存储单元,用于对在所述点云包围盒上标注所述扩展约束信息得到的参考点云数据进行存储。The storage unit is configured to store the reference point cloud data obtained by marking the extended constraint information on the point cloud bounding box.
  18. 根据权利要求17所述的数据处理装置,其特征在于,所述目标物体为车辆,所述标注信息还包括车辆类型,所述数据处理装置还包括通信单元和处理单元,其中:The data processing device according to claim 17, wherein the target object is a vehicle, the label information further includes a vehicle type, the data processing device further comprises a communication unit and a processing unit, wherein:
    所述通信单元,用于接收针对所述参考点云数据的标注指令;the communication unit, configured to receive an annotation instruction for the reference point cloud data;
    所述确定单元,还用于根据所述标注指令和所述车辆类型确定所述点云包围盒的第二尺寸;The determining unit is further configured to determine the second size of the point cloud bounding box according to the labeling instruction and the vehicle type;
    所述处理单元,用于根据所述扩展约束信息和所述第二尺寸对所述点云包围盒进行尺寸处理,得到第二目标包围盒。The processing unit is configured to perform size processing on the point cloud bounding box according to the extended constraint information and the second size to obtain a second target bounding box.
  19. 一种数据处理装置,其特征在于,包括处理器和与所述处理器连接的存储器,其中,所述存储器用于存储一个或多个程序,并且被配置由所述处理器执行,所述程序包括用于执行如权利要求1-9中任一项所述的方法中的步骤的指令。A data processing device, comprising a processor and a memory connected to the processor, wherein the memory is used to store one or more programs and is configured to be executed by the processor, the programs comprising instructions for performing steps in the method of any of claims 1-9.
  20. 一种计算机存储介质,其特征在于,包括计算机指令,当所述计算机指令在电子设备上运行时,使得所述电子设备执行如权利要求1-9任一项所述的执行命令的方法。A computer storage medium, characterized in that it includes computer instructions, which, when the computer instructions are executed on an electronic device, cause the electronic device to execute the method for executing a command according to any one of claims 1-9.
  21. 一种计算机程序产品,其特征在于,所述计算机程序产品用于存储计算机程序,当所述计算机程序在计算机上运行时,使得所述计算机如权利要求1-9任一项所述的执行命令的方法。A computer program product, characterized in that, the computer program product is used to store a computer program, and when the computer program runs on a computer, the computer can execute the command according to any one of claims 1-9. Methods.
PCT/CN2021/132867 2020-12-29 2021-11-24 Data processing method and related apparatus WO2022142890A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202011613732.X 2020-12-29
CN202011613732.XA CN114693865A (en) 2020-12-29 2020-12-29 Data processing method and related device

Publications (1)

Publication Number Publication Date
WO2022142890A1 true WO2022142890A1 (en) 2022-07-07

Family

ID=82133235

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/132867 WO2022142890A1 (en) 2020-12-29 2021-11-24 Data processing method and related apparatus

Country Status (2)

Country Link
CN (1) CN114693865A (en)
WO (1) WO2022142890A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116413740A (en) * 2023-06-09 2023-07-11 广汽埃安新能源汽车股份有限公司 Laser radar point cloud ground detection method and device

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106407947A (en) * 2016-09-29 2017-02-15 百度在线网络技术(北京)有限公司 Target object recognition method and device applied to unmanned vehicle
CN109509236A (en) * 2017-09-15 2019-03-22 百度在线网络技术(北京)有限公司 Vehicle bounding box generation method, device and storage medium in unmanned scene
CN110222626A (en) * 2019-06-03 2019-09-10 宁波智能装备研究院有限公司 A kind of unmanned scene point cloud target mask method based on deep learning algorithm
CN111009040A (en) * 2018-10-08 2020-04-14 阿里巴巴集团控股有限公司 Point cloud entity marking system, method and device and electronic equipment
US20200218944A1 (en) * 2017-11-29 2020-07-09 Beijing Greenvalley Technology Co., Ltd. Method, Apparatus, and Electronic Device for Processing Point Cloud Data, and Computer Readable Storage Medium

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106407947A (en) * 2016-09-29 2017-02-15 百度在线网络技术(北京)有限公司 Target object recognition method and device applied to unmanned vehicle
CN109509236A (en) * 2017-09-15 2019-03-22 百度在线网络技术(北京)有限公司 Vehicle bounding box generation method, device and storage medium in unmanned scene
US20200218944A1 (en) * 2017-11-29 2020-07-09 Beijing Greenvalley Technology Co., Ltd. Method, Apparatus, and Electronic Device for Processing Point Cloud Data, and Computer Readable Storage Medium
CN111009040A (en) * 2018-10-08 2020-04-14 阿里巴巴集团控股有限公司 Point cloud entity marking system, method and device and electronic equipment
CN110222626A (en) * 2019-06-03 2019-09-10 宁波智能装备研究院有限公司 A kind of unmanned scene point cloud target mask method based on deep learning algorithm

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116413740A (en) * 2023-06-09 2023-07-11 广汽埃安新能源汽车股份有限公司 Laser radar point cloud ground detection method and device
CN116413740B (en) * 2023-06-09 2023-09-05 广汽埃安新能源汽车股份有限公司 Laser radar point cloud ground detection method and device

Also Published As

Publication number Publication date
CN114693865A (en) 2022-07-01

Similar Documents

Publication Publication Date Title
US20210279503A1 (en) Image processing method, apparatus, and device, and storage medium
US10055879B2 (en) 3D human face reconstruction method, apparatus and server
CN109947886B (en) Image processing method, image processing device, electronic equipment and storage medium
WO2019100839A1 (en) Method and apparatus for identifying damaged vehicle parts, server, client terminal, and system
CN111126182B (en) Lane line detection method, lane line detection device, electronic device, and storage medium
WO2022105395A1 (en) Data processing method, apparatus, and system, computer device, and non-transitory storage medium
CN110400304B (en) Object detection method, device, equipment and storage medium based on deep learning
WO2021046829A1 (en) Positioning method, device and system
CN112802111B (en) Object model construction method and device
WO2021147113A1 (en) Plane semantic category identification method and image data processing apparatus
CN112927362A (en) Map reconstruction method and device, computer readable medium and electronic device
CN113591518B (en) Image processing method, network training method and related equipment
CN111192341A (en) Method and device for generating high-precision map, automatic driving equipment and storage medium
WO2022142890A1 (en) Data processing method and related apparatus
CN113343457B (en) Automatic driving simulation test method, device, equipment and storage medium
CN112269939B (en) Automatic driving scene searching method, device, terminal, server and medium
WO2021088497A1 (en) Virtual object display method, global map update method, and device
CN113822263A (en) Image annotation method and device, computer equipment and storage medium
US20230169680A1 (en) Beijing baidu netcom science technology co., ltd.
An et al. Image-based positioning system using LED Beacon based on IoT central management
CN114690144A (en) Point cloud data labeling method, device and system
US20230204755A1 (en) Target tracking method and apparatus
US20210348938A1 (en) Sensor calibration for space translation
CN114462580A (en) Training method of text recognition model, text recognition method, device and equipment
CN113920222A (en) Method, device and equipment for acquiring map building data and readable storage medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21913632

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21913632

Country of ref document: EP

Kind code of ref document: A1