WO2021213241A1 - Target detection method and apparatus, and electronic device, storage medium and program - Google Patents

Target detection method and apparatus, and electronic device, storage medium and program Download PDF

Info

Publication number
WO2021213241A1
WO2021213241A1 PCT/CN2021/087424 CN2021087424W WO2021213241A1 WO 2021213241 A1 WO2021213241 A1 WO 2021213241A1 CN 2021087424 W CN2021087424 W CN 2021087424W WO 2021213241 A1 WO2021213241 A1 WO 2021213241A1
Authority
WO
WIPO (PCT)
Prior art keywords
obstacle
information
grid
point
area
Prior art date
Application number
PCT/CN2021/087424
Other languages
French (fr)
Chinese (zh)
Inventor
周辉
洪方舟
王哲
石建萍
Original Assignee
上海商汤临港智能科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 上海商汤临港智能科技有限公司 filed Critical 上海商汤临港智能科技有限公司
Priority to KR1020217043313A priority Critical patent/KR20220016221A/en
Priority to JP2021577017A priority patent/JP2022539093A/en
Publication of WO2021213241A1 publication Critical patent/WO2021213241A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/0002Inspection of images, e.g. flaw detection
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • G06T7/11Region-based segmentation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • G06T7/187Segmentation; Edge detection involving region growing; involving region merging; involving connected component labelling
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10028Range image; Depth image; 3D point clouds
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/30Subject of image; Context of image processing
    • G06T2207/30248Vehicle exterior or interior
    • G06T2207/30252Vehicle exterior; Vicinity of vehicle
    • G06T2207/30261Obstacle

Definitions

  • the present disclosure relates to the field of automatic driving technology, and in particular to a target detection method and device, electronic equipment, storage medium, and program.
  • Target detection of obstacles is an important part of ensuring safe driving in automatic driving.
  • Target detection can use deep learning technology based on neural networks to predict the possible size and location of obstacles.
  • accuracy of target detection based on deep learning technology depends on specific types of training data and the pros and cons of training algorithms, resulting in low target detection accuracy for obstacles.
  • the present disclosure proposes a technical solution for target detection.
  • a target detection method includes: acquiring point cloud information, the point cloud information includes at least a target object and point cloud information corresponding to the object to be detected, wherein the to be detected The object is a person or thing around the target object; according to the point cloud information, grid information is obtained, and the grid information includes at least obstacle point information indicating the object to be detected; according to the grid information, identification Obstacles in the object to be detected that affect the movement of the target object are extracted.
  • a target detection device including: an acquisition unit configured to acquire point cloud information, the point cloud information including at least the target object and the point cloud information corresponding to the object to be detected;
  • the object to be detected is a person or thing around the target object;
  • an information processing unit is configured to obtain grid information according to the point cloud information, and the grid information includes at least obstacle point information indicating the object to be detected
  • the detection unit is used to identify obstacles in the object to be detected that affect the movement of the target object according to the grid information.
  • an electronic device including: a processor; and a memory for storing instructions executable by the processor.
  • the processor is configured to execute the above-mentioned target detection method.
  • a computer-readable storage medium on which computer program instructions are stored, and the computer program instructions implement the above-mentioned target detection method when executed by a processor.
  • a computer program is also provided, the computer program is stored in a storage medium, and when a processor executes the computer program, the processor is used to execute the above-mentioned target detection method.
  • grid information is obtained according to point cloud information corresponding to at least the target object and the object to be detected, and the grid information includes at least obstacle point information indicating the object to be detected, so that the grid information can be Grid information to identify obstacles in the object to be detected that affect the movement of the target object. Since the content of the point cloud information is relatively rich and is not limited to a specific type of object, such as a vehicle or a pedestrian, the technical solution of the present disclosure is suitable for more target detection scenarios. In addition, by identifying the obstacle in the object to be detected according to the grid information including the obstacle point information, the target detection accuracy for the obstacle is effectively improved.
  • Fig. 1 shows a flowchart of a target detection method according to an embodiment of the present disclosure.
  • Fig. 2 shows a schematic diagram of grid information according to an embodiment of the present disclosure.
  • Fig. 3 shows a schematic diagram of different ring IDs of pixel sources in a grid area according to an embodiment of the present disclosure.
  • Fig. 4 shows a schematic diagram of the source of pixels in the grid area with the same ring ID according to an embodiment of the present disclosure.
  • Fig. 5 shows a schematic diagram of obstacle point information in each grid area according to an embodiment of the present disclosure.
  • Figures 6a-6b show schematic diagrams of a communication manner of a connected area according to an embodiment of the present disclosure.
  • Fig. 7 shows a schematic diagram of an obstacle in a grid map according to an embodiment of the present disclosure.
  • FIG. 8 shows a schematic diagram of deleting obstructed obstacles in a grid image according to an embodiment of the present disclosure.
  • Fig. 9 shows a block diagram of a target detection device according to an embodiment of the present disclosure.
  • FIG. 10 shows a block diagram of an electronic device according to an embodiment of the present disclosure.
  • FIG. 11 shows a block diagram of an electronic device according to an embodiment of the present disclosure.
  • a and/or B can mean: A alone exists, A and B exist at the same time, and B exists alone.
  • at least one herein means any one of a plurality of types or any combination of at least two of the plurality of types.
  • including at least one of A, B, and C may mean including any one or more elements selected from the set consisting of A, B, and C.
  • Detecting target objects such as detecting target objects such as vehicles or pedestrians in autonomous driving or unmanned driving scenes, can be achieved by using deep learning technology based on neural networks.
  • the accuracy of target detection based on deep learning technology depends on specific types of training data, which limits its applicable application scenarios. That is to say, the neural network trained according to the deep learning technology is feasible for a certain specific scene related to the selected training data, but cannot be generalized to other non-specific scenes. For example, for a specific scene, such as target detection of a vehicle or pedestrian, since the specific scene is relatively common, a large amount of data related to the target detection of the vehicle or pedestrian has been accumulated. Regarding these data as a specific type of training data, a neural network trained based on deep learning technology will look for objects that meet these types of features in the input data, thereby ensuring the accuracy of target detection in the specific scene.
  • the training process may become overly complicated, which is prone to overfitting.
  • the results given may not be accurate. Because it is difficult for training data to cover all possible road conditions, it can only give highly reliable results for specific training data and related specific scenarios.
  • the accuracy of target detection based on deep learning technology also depends on the quality of the training algorithm.
  • the characteristics of deep learning are not completely controllable, that is, the prediction result for a given input data is unpredictable, so it is difficult to achieve the ideal value of 100% recall rate.
  • the recall rate refers to the number of objects identified through target detection divided by the number of actual objects.
  • the recall rate the higher the recall rate, the higher the safety of driving.
  • the use of deep learning technology to achieve target detection in autonomous driving or unmanned driving scenarios is more suitable for the detection of target objects such as vehicles or pedestrians.
  • target detection of obstacles in the road to avoid collisions cannot reach the accuracy required for obstacle detection.
  • the accuracy of obstacle detection is an important part of automatic driving in order to ensure safe driving. For example, if the target detection of obstacles fails to achieve the accuracy of obstacle detection, the safety of autonomous driving or unmanned driving cannot be guaranteed.
  • Fig. 1 shows a flowchart of a target detection method according to an embodiment of the present disclosure.
  • the method is applied to a target detection device.
  • the device can be deployed in a terminal device or a server or other processing equipment, and can perform processing such as target detection or target classification in automatic driving.
  • the terminal device may be a user equipment (UE, User Equipment), mobile device, cellular phone, cordless phone, personal digital assistant (PDA, Personal Digital Assistant), handheld device, computing device, in-vehicle device, wearable device, etc.
  • the method may be implemented by a processor invoking computer-readable instructions stored in the memory. As shown in Figure 1, the process includes:
  • Step S101 Obtain point cloud information, where the point cloud information includes at least the point cloud information corresponding to the target object and the object to be detected.
  • multiple pieces of to-be-processed point cloud information obtained through scanning by at least two sensors may be obtained, and the multiple pieces of to-be-processed point cloud information may be spliced to obtain the point cloud information.
  • grid processing can be performed according to the point cloud information to obtain grid information.
  • the at least two sensors may be sensors with laser emitting and receiving functions in the lidar.
  • the target object may refer to a target device scanned by at least two sensors during the target detection process, such as a vehicle in an autonomous driving or unmanned driving scene.
  • the target object in the present disclosure is not limited to the target device, and may also include pedestrians who guide the blind.
  • the object to be detected may refer to an object related to the target object in the target detection process.
  • the target object is a vehicle in an autonomous driving or unmanned driving scene, for safe driving
  • the object to be detected may be stones, leaves, roadblocks, etc. on the driving route of the vehicle.
  • the object to be detected may also refer to an object in the same observation frame as the target object during the target detection process.
  • the target object is still a vehicle as an example, and the object to be detected may be a roadside billboard, a tree and its canopy in the same observation screen as the vehicle.
  • Step S102 Obtain grid information according to the point cloud information.
  • the grid information includes at least obstacle point information indicating the object to be detected.
  • the point cloud information may include the point cloud information corresponding to the target object, such as the point cloud information corresponding to the vehicle in the autonomous driving or unmanned driving scene, and the point cloud information corresponding to the object to be detected, such as pebbles, Leaves, roadblocks, roadside billboards, trees and their canopies, etc.
  • the point cloud information corresponding to the object to be detected such as pebbles, Leaves, roadblocks, roadside billboards, trees and their canopies, etc.
  • the point cloud information can be gridded to obtain a grid map composed of multiple grid regions.
  • Fig. 2 shows a schematic diagram of grid information according to an embodiment of the present disclosure.
  • An implementation manner of the grid information of the present disclosure may be a grid graph or other chart forms, which is not limited.
  • the grid map contains multiple grid areas 11, and each grid area includes one or more pixels (in Figure 2, each grid area includes multiple pixels as an example) .
  • an obstacle point For each grid area in the grid map, it is necessary to identify whether the grid area has a pixel point corresponding to the object to be detected or even an obstacle (hereinafter referred to as an obstacle point) and identify it with obstacle point information.
  • the grid graph obtained by the gridding process can be regarded as the initial grid graph, that is, the obstacle point information of each grid area is the first value representing "none", such as "0".
  • the process of identifying obstacles can be regarded as updating the obstacle point information of each grid area in the grid map (it can be specifically, updating the obstacle point information of a certain grid area from a first value to a second value, for example, " 1”), and at least the sensor identification (ring ID) in the point cloud information can be used as the update basis.
  • the obstacle point information can be marked in the grid area according to the ring ID.
  • FIG. 5 shows a schematic diagram of obstacle point information in each grid area according to an embodiment of the present disclosure, taking the number “0" and the number "1" as the obstacle point information as an example.
  • marking the grid area as the first value "0" indicates that there are no obstacles in the grid area
  • marking the grid area as the second value "1" indicates that there are obstacle points in the grid area.
  • a grid map containing obstacle point information is obtained, so that the obstacle in the object to be detected can be identified according to the grid map containing obstacle point information.
  • Step S103 Identify obstacles in the object to be detected that affect the movement of the target object according to the grid information.
  • the grid information may be a grid graph containing obstacle point information.
  • obstacles in the object to be detected can be identified. For example, marking the grid area as "1" indicates that there are obstacle points in the grid area. Connecting multiple obstacle points can obtain the connected area corresponding to the multiple obstacle points, and determine the shape of the object to be detected corresponding to the multiple obstacle points, and even the shape of the obstacle.
  • point cloud information corresponding to the target object and the object to be detected is obtained, and according to the point cloud information, information at least including obstacle points indicating whether the object to be detected exists is obtained
  • the obstacles in the object to be detected can be identified according to the obstacle point information contained in the grid information, which improves the accuracy of target detection for the obstacles.
  • the point cloud information can be obtained according to the scanning detection signal sent by the sensor and the return signal received.
  • the sensor transmits a scanning detection signal to the vehicle and its surroundings, and then the sensor receives the return signal reflected from the vehicle and its surrounding objects, and compares the return signal with the transmitted scanning detection signal to obtain the vehicle and its surroundings.
  • Objects such as position information, height information, distance information, speed information, posture information, shape information and other parameters, so that the vehicle and its surrounding objects can be tracked and identified based on these parameters.
  • the point cloud information of the present disclosure is a collection of massive points that express the spatial distribution and surface characteristics of objects in the target area under the same spatial reference system, and the three-dimensional coordinates of each pixel point are recorded in the form of pixels ( Among them, the X/Y two-dimensional coordinates in the three-dimensional coordinates are used to calibrate the position information in the above parameters, and the third dimension Z in the three-dimensional coordinates is used to calibrate the height information in the above parameters), color information (RGB), and laser reflection intensity (Intensity ) A combination of multiple items in information, etc.
  • the ring ID information of each pixel can be obtained from the point cloud information, and the ring ID included in the target grid area in the multiple grid areas can be determined to be in the target grid area. Whether there are obstacles. Further, in the case that there are obstacle points in the target grid area, updating the grid information may include the following content:
  • Fig. 3 shows a schematic diagram of different ring IDs of pixel points in a grid area according to an embodiment of the present disclosure, including a sensor 21, a sensor 22, and a sensor 23, an object to be detected 24, and a plurality of pixels (respectively use 1-6 to Logo).
  • the triangular shape of the object 24 to be detected is merely illustrative, and is not intended to limit the actual shape of the object to be detected.
  • the laser beams emitted by the sensor 21 and the sensor 22 should not originally fall into the target grid area where the object 24 to be detected is located.
  • the laser beams emitted by the sensor 21 and the sensor 22 are generated. ⁇ Reflected.
  • the sensor 21 is scanned to obtain point cloud information composed of multiple pixels
  • the laser beam 211 emitted by the sensor 21 meets the object 24 to be detected and is reflected, and the pixel 1 falls into the target grid area
  • the sensor 22 scans to obtain point cloud information composed of multiple pixels
  • the laser beam 221 and the laser beam 222 emitted by the sensor 22 meet the object 24 to be detected and reflect, and the pixel points 2 and the pixel points 3 fall into the target network.
  • the laser beam 231, laser beam 232, and laser beam 233 emitted by the sensor 23 do not encounter the object to be detected 24, and the pixel points 4, Pixel 5 and pixel 6 fall into the target grid area.
  • the ring IDs corresponding to multiple pixels are different identifiers, which means that the multiple pixels are obtained by different sensors.
  • the obstacle point information corresponding to the target grid area is updated from the initial first value to the second value to mark the The existence of obstacle points.
  • the multiple sensors (sensor 21, sensor 22, and sensor 23) in Figure 3 are not necessarily arranged separately in actual applications, they can also be arranged next to each other, or even multiple sensors can be arranged together and displayed. Different projection angles.
  • multiple sensors are dispersedly arranged for more intuitiveness. Multiple sensor placement positions that can be imagined by those skilled in the art without creative work are within the protection scope of the present disclosure.
  • FIG. 4 shows a schematic diagram of the source of pixels in the grid area with the same ring ID according to an embodiment of the present disclosure, including a sensor 31 and a plurality of pixels (identified by 7-10, respectively).
  • the laser beam 311, laser beam 312, laser beam 313, and laser beam 314 emitted by the sensor 31 have not encountered obstacles, and the pixel point 7, pixel Point 8, pixel point 9 and pixel point 10 fall into the target grid area.
  • the ring IDs corresponding to multiple pixels are the same identifier, which means that the multiple pixels are obtained by different sensors. At this time, it can be determined that there are no obstacle points in the target grid area, and the obstacle point information corresponding to the target grid area is maintained as the initial first value.
  • the object to be detected included in the point cloud information may be obstacles such as pebbles and leaves, as well as non-obstacles such as tree crowns and signs in autonomous driving or unmanned driving scenarios. Therefore, on the basis of the obstacle point judgment based on the above ring ID, the height information can be further added, and the obstacle points determined by the above ring ID can be verified to avoid possible misjudgments, such as tree crowns and signs. Other non-obstacles are also recognized as obstacles. Because, in the case of autonomous driving or unmanned driving, the target object is a vehicle, and objects in the sky such as tree crowns and signboards should not be obstacles, and are usually much higher than obstacles such as stones and leaves. Therefore, the height information of the pixels in the point cloud information can be added to exclude non-obstacles such as tree crowns and signs from the grid area.
  • updating the grid information further includes: determining the height information according to the height information.
  • the category of the obstacle point existing in the target grid area; and the obstacle point information corresponding to the target grid area in the grid information is updated according to the category of the obstacle point.
  • the grid information is a grid graph marked with obstacle point information
  • the target grid area corresponds to The obstruction point information is updated from the second value to the first value, which can effectively reduce the probability of occurrence of the above-mentioned misjudgment.
  • the obstacle point information corresponding to the target grid area is maintained at a second value. In this way, after the grid information is updated, a more accurate grid map containing only the obstacle point information corresponding to the obstacle can be obtained for subsequent target detection processing.
  • determining the category of obstacle points existing in the target grid area according to the height information includes: obtaining ring IDs and height information corresponding to at least two pixels in the target grid area; The at least two pixels are divided according to the ring ID, and the pixels corresponding to the same ring ID are used as a set of data to obtain multiple sets of pixel data.
  • the minimum height value in each group of pixel data is determined; by classifying and counting the minimum height values in the multiple groups of pixel data, one or more minimum height categories are obtained, and each group is determined accordingly.
  • the minimum height category includes the number of height values and the minimum value.
  • the type of obstacle point in the target grid area can be determined according to the number of height values included in each minimum height category corresponding to the target grid area and the minimum value thereof, that is, whether the obstacle point is a corresponding obstacle Of pixels.
  • the number of height values included in each minimum height class corresponding to the target grid area can be compared with the number threshold (ring_count_th), and the smallest height value included in each minimum height class can be The value is compared with the height threshold (height_th) to determine the category of obstacle points in the target grid area.
  • the number of height values included in the target minimum height class is greater than or equal to the number threshold (ring_count_th) and the minimum value of the included height values If it is less than the height threshold (height_th), it is considered that the obstacle points existing in the target grid area correspond to the obstacle.
  • ring_count_th 3
  • height_th can be the height of the vehicle, for example, 2m. .
  • a connected area analysis can be performed on the grid graph to obtain a connected area, and an obstacle in the object to be detected can be identified according to the connected area.
  • the obstacle can be represented in the form of a polygon such as a concave polygon, a convex polygon, a rectangle, or a triangle, as long as it can be recognized that the obstacle is different from other objects.
  • convex polygons are used.
  • the network with the obstacle point information "1" connected together can be searched based on the obstacle point information marked as "0" or "1" in the grid area as shown in FIG. 5.
  • Grid area thus forming a "connected area”.
  • Figures 6a-6b show schematic diagrams of a communication manner of a connected area according to an embodiment of the present disclosure.
  • the connected area calculation can be implemented by the Breadth First Search (BFS) algorithm.
  • BFS Breadth First Search
  • the smallest unit in an image is a pixel, and there are 8 adjacent pixels around each pixel, and there are 2 types of adjacent relationships: 4-adjacent (as shown in Figure 6a) and 8-adjacent (as shown in Figure 6b).
  • 4 is adjacent to a total of 4 points, that is, there are a total of four pixel points up, down, left, and right.
  • the 8 adjacent points also include 4 points on the diagonal position, that is, a total of 8 pixel points.
  • FIG. 7 shows a schematic diagram of obstacles in a grid diagram according to an embodiment of the present disclosure. As shown in FIG. 7, the grid diagram contains multiple obstacles represented by convex polygons.
  • the method further includes: acquiring a plurality of points to be processed on the first line segment of the connected area, and selecting at least two points from the plurality of points to be processed
  • the reference point is connected to the at least two reference points to obtain a second line segment, and the connected area is adjusted according to the second line segment to obtain the first area.
  • the first area may be smaller than the connected area. If the obstacle is a convex polygon, the adjustment process of the connected area can be called convex hull processing.
  • a certain line segment (called the first line segment) that constitutes a connected region has 10 points to be processed, and 6 reference points are selected from the 10 points to be processed, and the 6 reference points are connected to obtain a line segment (called the first line segment).
  • the first area can be obtained after adjusting the connected area according to the second line segment, and the first area is smaller than the connected area. That is to say, after the convex hull is processed, the number of convex edges used to represent obstacles is reduced (because there are fewer points, the convex edges are reduced accordingly), and the convex polygon is smaller than its original shape. Convex hull processing can reduce the amount of calculation.
  • the method further includes: extracting the point cloud information corresponding to the target object from the point cloud information, and according to the target object The coordinates of the pixel points in the corresponding point cloud information are obtained to obtain the target position corresponding to the target object; at least two obstacles identified based on the grid information are obtained; the center point of the target position is used as a reference, according to the prediction
  • the guide line issued by the angle obtains a fan-shaped area; when the fan-shaped area covers the first obstacle and the second obstacle, and the second obstacle is blocked by the first obstacle, the second obstacle The obstacle point information of the obstacle is deleted from the grid information.
  • the grid diagram contains a target object and at least two obstacles.
  • the target object may be a vehicle 41
  • at least The first obstacle among the two obstacles may be the warning object 42
  • the second obstacle among the at least two obstacles may be one or more stones 43.
  • a fan-shaped area is obtained according to the guide line issued by the preset angle ⁇ , and the warning object 42 and one or more stones 43 are all covered by the fan-shaped area.
  • the second obstacle is not limited to small stones that are blocked, and can also be grass on the side of the road.
  • the method includes: sending a message that there is an obstacle on the navigation path to a target object (such as a vehicle), so that the target object performs obstacle avoidance processing and/or replans the navigation path in response to the message.
  • a target object such as a vehicle
  • the grid information may be a grid graph marked with obstacle point information.
  • the grid area should be a plane that almost matches the height of the ground.
  • the laser light emitted by the adjacent sensor of the grid area corresponds to the sensor. Will not be hindered by the grid area, which makes the laser light emitted to the grid area come from the same sensor. Therefore, it is assumed that all pixels falling in a certain grid area originate from the same sensor, that is, the ring IDs corresponding to all pixels in the grid area are the same, that is, all the pixels falling in the grid area are the same. If the pixel points are scanned by the same sensor, it can be considered that there are no obstacle points that may correspond to obstacles in the grid area.
  • the laser light emitted by the adjacent sensor of the sensor corresponding to the grid area will be blocked and reflected by the protruding object on the grid area, which makes it hit
  • the laser in the grid area comes from different sensors. Therefore, it is assumed that there are multiple sensors corresponding to pixels that fall in a certain grid area, that is, the ring IDs corresponding to the pixels in the grid area are different, that is, the pixels that fall into the grid area If it is scanned by different sensors, it can be considered that there are obstacle points that may correspond to obstacles in the grid area.
  • the number of ring IDs of pixels falling in the grid area is used to determine whether there may be an obstacle in the grid area, which can be further optimized.
  • a certain grid area that includes objects in the air such as tree crowns, signboards, etc.
  • lasers belonging to multiple sensors will also be emitted into the grid area, that is, the pixels that fall into the grid area
  • the ring IDs corresponding to the points are different.
  • the target object is a vehicle
  • objects in the sky such as tree crowns and signboards do not belong to the obstacles that the vehicle pays attention to, and it is necessary to eliminate the situation where the tree canopy and signboards are also identified as obstacles that the vehicle needs to avoid. Therefore, the height information of the pixels can be taken into consideration to check the possible obstacles obtained by the ring ID to filter out objects higher than a certain height, thereby further improving the accuracy of obstacle detection.
  • an N ⁇ M grid can be constructed for each point cloud information scanned by lidar, and each grid can be preset The side length of represents 0.1m in reality, and the coordinates (N/2, M/2) are set as the center of the vehicle.
  • an N ⁇ M grid map is directly constructed. Regardless of whether the point cloud information includes the fusion of multiple lidar scan results or one lidar scan result, the following obstacle identification method is used to judge the obstacles to obtain a grid map with obstacle point information.
  • the pixels in the point cloud information scanned by a single lidar can be allocated to the grid according to the position information. For each grid area, count the ring IDs of the pixels allocated to it (the same ring ID is not counted repeatedly). Then, the pixels corresponding to the same ring ID are used as a set of data to obtain multiple sets of pixel data. Then, according to the height information, the minimum height value in each group of pixel data is determined, and the minimum height values of the multiple groups of pixels are classified and counted to obtain at least one minimum height category.
  • the class of possible obstacles is determined according to the number of height values included in the minimum height class and the minimum of these height values.
  • the network can be determined by comparing the number of height values included in each minimum height class with a threshold value, and comparing the minimum value of the height values included in each minimum height class with the height threshold value.
  • the category of obstacle points that exist in the grid area if the following target minimum height class exists in the at least one minimum height class, the number of height values included in the target minimum height class is greater than or equal to the number threshold (ring_count_th), and the minimum value of the included height values If it is less than the height threshold (height_th), it is considered that the obstacle points in the grid area correspond to obstacles that will really affect the target object.
  • the advantage of using classification statistics is to find a continuous segment of obstacles in height, rather than a single point.
  • a grid map can be obtained for each lidar, and each element of the multiple grid maps is merged by "OR" operation, and the output result is obtained, that is, the grid map with obstacle point information is obtained.
  • An example of the "or” operation is: "1" in the grid graph indicates an obstacle point, and "0" indicates an obstacle-free point.
  • the corresponding position of the grid area is marked with "1”, and the result of the "or” operation on these two grid graphs is [1, 1, 0].
  • the compensation method of the surrounding grid area can be: also need to count the grid area as the center, n ⁇ n size range. in,
  • the around function represents rounding, and a is a small predetermined constant.
  • a is a small predetermined constant.
  • gap_th is used as a correction function, and its value can be corrected according to the distance between the grid area and the center of the vehicle (distance). For example, according to different conditions such as the installation position, angle, and point cloud sparseness of the sensor, different compensation schemes are adopted. In an example,
  • the unit of the threshold gap_th is meters, and a and b are relatively small constants.
  • the calculated gap_th is a small value, which can be 0.1m.
  • the value of the number threshold ring_count_th compensation can be made according to the sparseness of the point cloud information.
  • a fixed value may be used, for example, 3.
  • the value of the height threshold height_th since the sensor (the lidar that the sensor can be installed on the vehicle) has a certain elevation angle, the height threshold height_th cannot be set to a fixed value. It can be based on the distance between the grid area and the center of the vehicle (distance ) Perform a certain angle correction. For example, in an example, assuming that the tangent of the correction angle is a, then let
  • the unit of the height threshold height_th is meters. It should be noted that the values of the above-mentioned parameters can be set according to actual conditions, and the specific setting methods are not limited here.
  • each grid area indicates whether there is an obstacle point corresponding to the obstacle in the grid area. Due to the sparseness of the point cloud information, some larger objects are divided into many parts, and the image expansion algorithm can be used to process the grid first to connect multiple parts of the same object. Next, perform connected area analysis (each connected area can represent an object, such as an obstacle). For each connected region, calculate its convex hull, and then use convex hull operations for each convex hull, such as the Ramer–Douglas–Peucker algorithm, which can simplify the number of edges of the convex hull and reduce the amount of computation. Finally, FOV analysis is performed to remove small obstacles that cannot be observed from the center of the vehicle.
  • An example of convex hull operation includes:
  • the polylines formed by connecting each dividing point in turn can be used as an approximation of the initial polylines to obtain the updated convex hull.
  • An example of FOV analysis includes: for every two convex hulls C1 and C2, it is necessary to detect whether the convex hull C1 can be observed from the position of the vehicle, such as the center point A of the vehicle, under the occlusion of the convex hull C2. Specifically, it can include:
  • n is greater than or equal to a certain threshold fov_th, the convex hull C1 is considered invisible, and the convex Package C1 is deleted.
  • the value of the threshold fov_th needs to be corrected according to the distance from the obstacle to the vehicle.
  • An example of a correction is:
  • fov_th min(1, ceil(convex_point_num ⁇ (1–distance/a))
  • the writing order of the steps does not mean a strict execution order but constitutes any limitation on the implementation process.
  • the specific execution order of each step should be based on its function and possibility.
  • the inner logic is determined.
  • the present disclosure also provides a target detection device, electronic equipment, computer-readable storage medium, and a program, all of which can be used to implement any target detection method provided in the present disclosure.
  • a target detection device electronic equipment, computer-readable storage medium, and a program, all of which can be used to implement any target detection method provided in the present disclosure.
  • Fig. 9 shows a block diagram of a target detection device according to an embodiment of the present disclosure.
  • the device includes: an acquiring unit 51 for acquiring point cloud information.
  • the point cloud information includes at least a target object and a target object to be detected.
  • Point cloud information corresponding to the object, where the target object can move, and the object to be detected is people or things around the target object;
  • the information processing unit 52 is configured to obtain grid information according to the point cloud information, Wherein, the grid information includes at least obstacle point information indicating the object to be detected;
  • the detection unit 53 is configured to identify, according to the grid information, that the object to be detected affects the movement of the target object Obstacles.
  • the acquiring unit is configured to: acquire a plurality of to-be-processed point cloud information scanned by at least two sensors; and perform stitching processing on the plurality of to-be-processed point cloud information to obtain Describe point cloud information.
  • the point cloud information further includes a sensor identification (ring ID).
  • the information processing unit is configured to: perform grid processing on the point cloud information to obtain a grid graph, the grid graph includes a plurality of grid regions, and each of the grid regions corresponds to the The obstacle point information is the first value; for each grid area, according to the ring ID included in the grid area, determine whether there is an obstacle point corresponding to the object to be detected in the target grid area; If the obstacle point exists in the grid area, the obstacle point information corresponding to the grid area in the grid information is updated to a second value.
  • the information processing unit is configured to determine that the obstacle point exists in the grid area when the ring IDs corresponding to at least two pixel points in the grid area are different.
  • the point cloud information further includes height information
  • the device further includes a category determining unit configured to determine the category of the obstacle point existing in the grid area according to the height information ; According to the category of the obstacle point, update the obstacle point information corresponding to the grid area in the grid information.
  • the category determining unit is configured to: obtain ring IDs and height information respectively corresponding to at least two pixels in the grid area; and correspond to the same ring ID in the at least two pixels
  • a set of data multiple sets of pixel data are obtained; according to the height information, the minimum height value in each set of pixel data is determined; the minimum height value in the multiple sets of pixel data is determined
  • Categorize statistics to obtain one or more minimum height categories; determine the categories of obstacle points existing in the grid area according to the number of height values included in each minimum height category and the minimum value thereof.
  • the detection unit is configured to: perform a connected area analysis according to the obstacle point information in the grid information to obtain a connected area; according to the connected area, identify the object to be detected The obstacles.
  • the device further includes a connected area adjustment unit, configured to: obtain a plurality of points to be processed on the first line segment of the connected area; select at least two points from the plurality of points to be processed Reference point; connecting the at least two reference points to obtain a second line segment, and adjusting the connected area according to the second line segment to obtain the first area.
  • the first area may be smaller than the connected area.
  • the device further includes: an occlusion processing unit, configured to: extract the point cloud information corresponding to the target object from the point cloud information, and according to the coordinates of the pixel points in the point cloud information corresponding to the target object , Obtain the target position corresponding to the target object; obtain at least two obstacles identified based on the grid information; use the center point of the target position as a reference, and obtain a fan-shaped area according to a guide line issued at a preset angle; In the case that the fan-shaped area covers the first obstacle and the second obstacle, and the second obstacle is blocked by the first obstacle, the obstacle point information of the second obstacle is obtained from the network Delete from the grid information.
  • an occlusion processing unit configured to: extract the point cloud information corresponding to the target object from the point cloud information, and according to the coordinates of the pixel points in the point cloud information corresponding to the target object , Obtain the target position corresponding to the target object; obtain at least two obstacles identified based on the grid information; use the center point of the target position as
  • the device further includes a sending unit, configured to send a message that there is an obstacle on the navigation path to the target object, so that the target object performs obstacle avoidance processing and/or in response to the message Re-plan the navigation path.
  • a sending unit configured to send a message that there is an obstacle on the navigation path to the target object, so that the target object performs obstacle avoidance processing and/or in response to the message Re-plan the navigation path.
  • the functions or modules contained in the device provided in the embodiments of the present disclosure can be used to execute the methods described in the above method embodiments.
  • the functions or modules contained in the device provided in the embodiments of the present disclosure can be used to execute the methods described in the above method embodiments.
  • the embodiments of the present disclosure also provide a computer-readable storage medium on which computer program instructions are stored, and the computer program instructions implement the above-mentioned method when executed by a processor.
  • the computer-readable storage medium may be a non-volatile computer-readable storage medium.
  • An embodiment of the present disclosure also provides an electronic device, including: a processor; a memory for storing executable instructions of the processor; wherein the processor is configured as the above-mentioned method.
  • the electronic device can be provided as a terminal, server or other form of device.
  • Fig. 10 is a block diagram showing an electronic device 800 according to an exemplary embodiment.
  • the electronic device 800 may be a mobile phone, a computer, a digital broadcasting terminal, a messaging device, a game console, a tablet device, a medical device, a fitness device, a personal digital assistant, and other terminals.
  • the electronic device 800 may include one or more of the following components: a processing component 802, a memory 804, a power component 806, a multimedia component 808, an audio component 810, an input/output (I/O) interface 812, and a sensor component 814 , And communication component 816.
  • the processing component 802 generally controls the overall operations of the electronic device 800, such as operations associated with display, telephone calls, data communications, camera operations, and recording operations.
  • the processing component 802 may include one or more processors 820 to execute instructions to complete all or part of the steps of the foregoing method.
  • the processing component 802 may include one or more modules to facilitate the interaction between the processing component 802 and other components.
  • the processing component 802 may include a multimedia module to facilitate the interaction between the multimedia component 808 and the processing component 802.
  • the memory 804 is configured to store various types of data to support operations in the electronic device 800. Examples of these data include instructions for any application or method to operate on the electronic device 800, contact data, phone book data, messages, pictures, videos, etc.
  • the memory 804 can be implemented by any type of volatile or non-volatile storage device or a combination thereof, such as static random access memory (SRAM), electrically erasable programmable read-only memory (EEPROM), erasable and Programmable Read Only Memory (EPROM), Programmable Read Only Memory (PROM), Read Only Memory (ROM), Magnetic Memory, Flash Memory, Magnetic Disk or Optical Disk.
  • SRAM static random access memory
  • EEPROM electrically erasable programmable read-only memory
  • EPROM erasable and Programmable Read Only Memory
  • PROM Programmable Read Only Memory
  • ROM Read Only Memory
  • Magnetic Memory Flash Memory
  • Magnetic Disk Magnetic Disk or Optical Disk.
  • the power supply component 806 provides power for various components of the electronic device 800.
  • the power supply component 806 may include a power management system, one or more power supplies, and other components associated with the generation, management, and distribution of power for the electronic device 800.
  • the multimedia component 808 includes a screen of an output interface between the electronic device 800 and the user.
  • the screen may include a liquid crystal display (LCD) and a touch panel (TP). If the screen includes a touch panel, the screen may be implemented as a touch screen to receive input signals from the user.
  • the touch panel includes one or more touch sensors to sense touch, sliding, and gestures on the touch panel. The touch sensor may not only sense the boundary of a touch or slide action, but also detect the duration and pressure related to the touch or slide operation.
  • the multimedia component 808 includes a front camera and/or a rear camera. When the electronic device 800 is in an operation mode, such as a shooting mode or a video mode, the front camera and/or the rear camera can receive external multimedia data. Each front camera and rear camera can be a fixed optical lens system or have focal length and optical zoom capabilities.
  • the audio component 810 is configured to output and/or input audio signals.
  • the audio component 810 includes a microphone (MIC), and when the electronic device 800 is in an operation mode, such as a call mode, a recording mode, and a voice recognition mode, the microphone is configured to receive an external audio signal.
  • the received audio signal may be further stored in the memory 804 or transmitted via the communication component 816.
  • the audio component 810 further includes a speaker for outputting audio signals.
  • An input/output (I/O) interface 812 provides an interface between the processing component 802 and a peripheral interface module.
  • the peripheral interface module may be a keyboard, a click wheel, a button, and the like. These buttons may include, but are not limited to: home button, volume button, start button, and lock button.
  • the sensor component 814 includes one or more sensors for providing the electronic device 800 with various aspects of state evaluation.
  • the sensor component 814 can detect the on/off state of the electronic device 800 and the relative positioning of the components.
  • the components are the display and keypad of the electronic device 800, the sensor component 814 can also detect the position change of the electronic device 800 or a component of the electronic device 800, the presence or absence of contact between the user and the electronic device 800, the position of the electronic device 800 or Acceleration/deceleration and temperature changes of the electronic device 800.
  • the sensor component 814 may include a proximity sensor configured to detect the presence of nearby objects when there is no physical contact.
  • the sensor component 814 may also include a light sensor, such as a CMOS or CCD image sensor, for use in imaging applications.
  • the sensor component 814 may also include an acceleration sensor, a gyroscope sensor, a magnetic sensor, a pressure sensor, or a temperature sensor.
  • the communication component 816 is configured to facilitate wired or wireless communication between the electronic device 800 and other devices.
  • the electronic device 800 can access a wireless network based on a communication standard, such as WiFi, 2G, or 3G, or a combination thereof.
  • the communication component 816 receives a broadcast signal or broadcast related information from an external broadcast management system via a broadcast channel.
  • the communication component 816 also includes a near field communication (NFC) module to facilitate short-range communication.
  • the NFC module can be implemented based on radio frequency identification (RFID) technology, infrared data association (IrDA) technology, ultra-wideband (UWB) technology, Bluetooth (BT) technology and other technologies.
  • RFID radio frequency identification
  • IrDA infrared data association
  • UWB ultra-wideband
  • Bluetooth Bluetooth
  • the electronic device 800 may be implemented by one or more application-specific integrated circuits (ASIC), digital signal processors (DSP), digital signal processing devices (DSPD), programmable logic devices (PLD), field-available A programmable gate array (FPGA), controller, microcontroller, microprocessor, or other electronic components are implemented to implement the above methods.
  • ASIC application-specific integrated circuits
  • DSP digital signal processors
  • DSPD digital signal processing devices
  • PLD programmable logic devices
  • FPGA field-available A programmable gate array
  • controller microcontroller, microprocessor, or other electronic components are implemented to implement the above methods.
  • a non-volatile computer-readable storage medium such as the memory 804 including computer program instructions, which can be executed by the processor 820 of the electronic device 800 to complete the foregoing method.
  • Fig. 11 is a block diagram showing an electronic device 900 according to an exemplary embodiment.
  • the electronic device 900 may be provided as a server.
  • the electronic device 900 includes a processing component 922, which further includes one or more processors, and a memory resource represented by a memory 932, for storing instructions that can be executed by the processing component 922, such as an application program.
  • the application program stored in the memory 932 may include one or more modules each corresponding to a set of instructions.
  • the processing component 922 is configured to execute instructions to perform the above-described methods.
  • the electronic device 900 may also include a power component 926 configured to perform power management of the electronic device 900, a wired or wireless network interface 950 configured to connect the electronic device 900 to a network, and an input/output (I/O) interface 958.
  • the electronic device 900 can operate based on an operating system stored in the memory 932, such as Windows ServerTM, Mac OS XTM, UnixTM, LinuxTM, FreeBSDTM or the like.
  • a non-volatile computer-readable storage medium is also provided, such as a memory 932 including computer program instructions, which can be executed by the processing component 922 of the electronic device 900 to complete the foregoing method.
  • the present disclosure may be a system, method and/or computer program product.
  • the computer program product may include a computer-readable storage medium loaded with computer-readable program instructions for enabling a processor to implement various aspects of the present disclosure.
  • the computer-readable storage medium may be a tangible device that can hold and store instructions used by the instruction execution device.
  • the computer-readable storage medium may include, but is not limited to, an electrical storage device, a magnetic storage device, an optical storage device, an electromagnetic storage device, a semiconductor storage device, or any suitable combination of the foregoing, for example.
  • Computer-readable storage media include: portable computer disks, hard disks, random access memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM) Or flash memory), static random access memory (SRAM), portable compact disk read-only memory (CD-ROM), digital versatile disk (DVD), memory stick, floppy disk, mechanical encoding device, such as a printer with instructions stored thereon
  • RAM random access memory
  • ROM read-only memory
  • EPROM erasable programmable read-only memory
  • flash memory flash memory
  • SRAM static random access memory
  • CD-ROM compact disk read-only memory
  • DVD digital versatile disk
  • memory stick floppy disk
  • mechanical encoding device such as a printer with instructions stored thereon
  • the computer-readable storage medium used here is not interpreted as the instantaneous signal itself, such as radio waves or other freely propagating electromagnetic waves, electromagnetic waves propagating through waveguides or other transmission media (for example, light pulses through fiber optic cables), or through wires Transmission of electrical signals.
  • the computer-readable program instructions described herein can be downloaded from a computer-readable storage medium to various computing/processing devices, or downloaded to an external computer or external storage device via a network, such as the Internet, a local area network, a wide area network, and/or a wireless network.
  • the network may include copper transmission cables, optical fiber transmission, wireless transmission, routers, firewalls, switches, gateway computers, and/or edge servers.
  • the network adapter card or network interface in each computing/processing device receives computer-readable program instructions from the network, and forwards the computer-readable program instructions for storage in the computer-readable storage medium in each computing/processing device .
  • the computer program instructions used to perform the operations of the present disclosure may be assembly instructions, instruction set architecture (ISA) instructions, machine instructions, machine-related instructions, microcode, firmware instructions, state setting data, or in one or more programming languages.
  • Source code or object code written in any combination, the programming language includes object-oriented programming languages such as Smalltalk, C++, etc., and conventional procedural programming languages such as "C" language or similar programming languages.
  • Computer-readable program instructions can be executed entirely on the user's computer, partly on the user's computer, executed as a stand-alone software package, partly on the user's computer and partly executed on a remote computer, or entirely on the remote computer or server implement.
  • the remote computer can be connected to the user's computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or it can be connected to an external computer (for example, using an Internet service provider to access the Internet). connect).
  • LAN local area network
  • WAN wide area network
  • an electronic circuit such as a programmable logic circuit, a field programmable gate array (FPGA), or a programmable logic array (PLA), can be customized by using the status information of the computer-readable program instructions.
  • FPGA field programmable gate array
  • PDA programmable logic array
  • the computer-readable program instructions are executed to realize various aspects of the present disclosure.
  • These computer-readable program instructions can be provided to the processor of a general-purpose computer, a special-purpose computer, or other programmable data processing device, thereby producing a machine that makes these instructions when executed by the processor of the computer or other programmable data processing device , A device that implements the functions/actions specified in one or more blocks in the flowcharts and/or block diagrams is produced. It is also possible to store these computer-readable program instructions in a computer-readable storage medium. These instructions make computers, programmable data processing apparatuses, and/or other devices work in a specific manner, so that the computer-readable medium storing the instructions includes An article of manufacture, which includes instructions for implementing various aspects of the functions/actions specified in one or more blocks in the flowcharts and/or block diagrams.
  • each block in the flowchart or block diagram may represent a module, program segment, or part of an instruction, and the module, program segment, or part of an instruction contains one or more components for realizing the specified logical function.
  • Executable instructions may also occur in a different order from the order marked in the drawings. For example, two consecutive blocks can actually be executed substantially in parallel, or they can sometimes be executed in the reverse order, depending on the functions involved.
  • each block in the block diagram and/or flowchart, and the combination of the blocks in the block diagram and/or flowchart can be implemented by a dedicated hardware-based system that performs the specified functions or actions Or it can be realized by a combination of dedicated hardware and computer instructions.

Abstract

The present disclosure relates to a target detection method and apparatus, and an electronic device, a storage medium and a program. An example of the method comprises: obtaining point cloud information, the point cloud information at least comprising a target object and point cloud information corresponding to an object to be detected, and said object being a person or thing around the target object; obtaining grid information according to the point cloud information, the grid information at least comprising obstacle point information indicating said object; and identifying, according to the grid information, an obstacle in said object that affects the movement of the target object.

Description

目标检测方法及装置、电子设备、存储介质及程序Target detection method and device, electronic equipment, storage medium and program
相关申请的交叉引用Cross-references to related applications
本专利申请要求于2020年4月20日提交的、申请号为2020103141666、发明名称为“目标检测方法及装置、电子设备和存储介质”的中国专利申请的优先权,该申请以引用的方式并入文本中。This patent application claims the priority of the Chinese patent application filed on April 20, 2020, the application number is 2020103141666, and the invention title is "target detection method and device, electronic equipment and storage medium". This application is incorporated by reference. Into the text.
技术领域Technical field
本公开涉及自动驾驶技术领域,尤其涉及一种目标检测方法及装置、电子设备、存储介质及程序。The present disclosure relates to the field of automatic driving technology, and in particular to a target detection method and device, electronic equipment, storage medium, and program.
背景技术Background technique
对障碍物进行目标检测,是自动驾驶中确保安全驾驶的一项重要环节。目标检测可以使用基于神经网络的深度学习技术,对障碍物可能的大小及位置进行预测。然而,基于深度学习技术实现目标检测的精度,依赖于特定类型的训练数据及训练算法的优劣,从而导致对障碍物的目标检测精度不高。然而,相关技术中未存在有效的解决方案。Target detection of obstacles is an important part of ensuring safe driving in automatic driving. Target detection can use deep learning technology based on neural networks to predict the possible size and location of obstacles. However, the accuracy of target detection based on deep learning technology depends on specific types of training data and the pros and cons of training algorithms, resulting in low target detection accuracy for obstacles. However, there is no effective solution in the related art.
发明内容Summary of the invention
本公开提出了一种目标检测的技术方案。The present disclosure proposes a technical solution for target detection.
根据本公开的一方面,提供了一种目标检测方法,所述方法包括:获取点云信息,所述点云信息至少包括目标对象及待检测对象对应的点云信息,其中,所述待检测对象为所述目标对象周围的人或物;根据所述点云信息,得到网格信息,所述网格信息至少包括指示所述待检测对象的障碍点信息;根据所述网格信息,识别出所述待检测对象中对所述目标对象的移动造成影响的障碍物。According to an aspect of the present disclosure, there is provided a target detection method, the method includes: acquiring point cloud information, the point cloud information includes at least a target object and point cloud information corresponding to the object to be detected, wherein the to be detected The object is a person or thing around the target object; according to the point cloud information, grid information is obtained, and the grid information includes at least obstacle point information indicating the object to be detected; according to the grid information, identification Obstacles in the object to be detected that affect the movement of the target object are extracted.
根据本公开的一方面,还提供了一种目标检测装置,包括:获取单元,用于获取点云信息,所述点云信息至少包括目标对象及待检测对象对应的点云信息;其中,所述待检测对象为所述目标对象周围的人或物;信息处理单元,用于根据所述点云信息,得到网格信息,所述网格信息至少包括指示所述待检测对象的障碍点信息;检测单元,用于根据所述网格信息,识别出所述待检测对象中对所述目标对象的移动造成影响的障碍物。According to an aspect of the present disclosure, there is also provided a target detection device, including: an acquisition unit configured to acquire point cloud information, the point cloud information including at least the target object and the point cloud information corresponding to the object to be detected; The object to be detected is a person or thing around the target object; an information processing unit is configured to obtain grid information according to the point cloud information, and the grid information includes at least obstacle point information indicating the object to be detected The detection unit is used to identify obstacles in the object to be detected that affect the movement of the target object according to the grid information.
根据本公开的一方面,还提供了一种电子设备,包括:处理器;用于存储处理器可执行指令的存储器。其中,所述处理器被配置为:执行上述目标检测方法。According to an aspect of the present disclosure, there is also provided an electronic device, including: a processor; and a memory for storing instructions executable by the processor. Wherein, the processor is configured to execute the above-mentioned target detection method.
根据本公开的一方面,还提供了一种计算机可读存储介质,其上存储有计算机程序指令,所述计算机程序指令被处理器执行时实现上述目标检测方法。According to an aspect of the present disclosure, there is also provided a computer-readable storage medium on which computer program instructions are stored, and the computer program instructions implement the above-mentioned target detection method when executed by a processor.
根据本公开的一方面,还提供了一种计算机程序,所述计算机程序存储在存储介质中,当处理器执行所述计算机程序时,所述处理器用于执行上述目标检测方法。According to an aspect of the present disclosure, a computer program is also provided, the computer program is stored in a storage medium, and when a processor executes the computer program, the processor is used to execute the above-mentioned target detection method.
根据本公开的示例,通过根据至少包括目标对象及待检测对象对应的点云信息得到 网格信息,所述网格信息至少包括指示所述待检测对象的障碍点信息,使得可根据所述网格信息来识别出所述待检测对象中对所述目标对象的移动造成影响的障碍物。由于点云信息的内容较为丰富,而且不限于特定的某一类对象,如车辆或行人等,因此本公开技术方案适用于更多的目标检测场景。并且,通过根据包括障碍点信息的网格信息识别出所述待检测对象中的障碍物,有效提高了针对障碍物的目标检测精度。According to an example of the present disclosure, grid information is obtained according to point cloud information corresponding to at least the target object and the object to be detected, and the grid information includes at least obstacle point information indicating the object to be detected, so that the grid information can be Grid information to identify obstacles in the object to be detected that affect the movement of the target object. Since the content of the point cloud information is relatively rich and is not limited to a specific type of object, such as a vehicle or a pedestrian, the technical solution of the present disclosure is suitable for more target detection scenarios. In addition, by identifying the obstacle in the object to be detected according to the grid information including the obstacle point information, the target detection accuracy for the obstacle is effectively improved.
以上的一般描述和后文的细节描述仅是示例性和解释性的,而非限制本公开。The above general description and the following detailed description are only exemplary and explanatory, and do not limit the present disclosure.
根据下面参考附图对示例性实施例的详细说明,本公开其它特征及方面将变得清楚。According to the following detailed description of exemplary embodiments with reference to the accompanying drawings, other features and aspects of the present disclosure will become clear.
附图说明Description of the drawings
此处的附图被并入说明书中并构成本说明书的一部分,这些附图示出了符合本公开的实施例,并与说明书一起用于说明本公开的技术方案。The drawings herein are incorporated into the specification and constitute a part of the specification. These drawings illustrate embodiments that conform to the present disclosure, and are used together with the specification to explain the technical solutions of the present disclosure.
图1示出根据本公开实施例的目标检测方法的流程图。Fig. 1 shows a flowchart of a target detection method according to an embodiment of the present disclosure.
图2示出根据本公开实施例的网格信息的示意图。Fig. 2 shows a schematic diagram of grid information according to an embodiment of the present disclosure.
图3示出根据本公开实施例的网格区域中像素点来源不同ring ID的示意图。Fig. 3 shows a schematic diagram of different ring IDs of pixel sources in a grid area according to an embodiment of the present disclosure.
图4示出根据本公开实施例的网格区域中像素点来源同一ring ID的示意图。Fig. 4 shows a schematic diagram of the source of pixels in the grid area with the same ring ID according to an embodiment of the present disclosure.
图5示出根据本公开实施例的每个网格区域中障碍点信息的示意图。Fig. 5 shows a schematic diagram of obstacle point information in each grid area according to an embodiment of the present disclosure.
图6a-图6b示出根据本公开实施例的连通区域连通方式的示意图。Figures 6a-6b show schematic diagrams of a communication manner of a connected area according to an embodiment of the present disclosure.
图7示出根据本公开实施例的网格图中障碍物的示意图。Fig. 7 shows a schematic diagram of an obstacle in a grid map according to an embodiment of the present disclosure.
图8示出根据本公开实施例的删除网格图中被遮挡障碍物的示意图。FIG. 8 shows a schematic diagram of deleting obstructed obstacles in a grid image according to an embodiment of the present disclosure.
图9示出根据本公开实施例的目标检测装置的框图。Fig. 9 shows a block diagram of a target detection device according to an embodiment of the present disclosure.
图10示出根据本公开实施例的电子设备的框图。FIG. 10 shows a block diagram of an electronic device according to an embodiment of the present disclosure.
图11示出根据本公开实施例的电子设备的框图。FIG. 11 shows a block diagram of an electronic device according to an embodiment of the present disclosure.
具体实施方式Detailed ways
以下将参考附图详细说明本公开的各种示例性实施例、特征和方面。附图中相同的附图标记表示功能相同或相似的元件。尽管在附图中示出了实施例的各种方面,但是除非特别指出,不必按比例绘制附图。Various exemplary embodiments, features, and aspects of the present disclosure will be described in detail below with reference to the drawings. The same reference numerals in the drawings indicate elements with the same or similar functions. Although various aspects of the embodiments are shown in the drawings, unless otherwise noted, the drawings are not necessarily drawn to scale.
在这里专用的词“示例性”意为“用作例子、实施例或说明性”。这里作为“示例性”所说明的任何实施例不必解释为优于或好于其它实施例。The dedicated word "exemplary" here means "serving as an example, embodiment, or illustration." Any embodiment described herein as "exemplary" need not be construed as being superior or better than other embodiments.
本文中术语“和/或”,仅仅是一种描述关联对象的关联关系,表示可以存在三种关系。例如,A和/或B可以表示:单独存在A,同时存在A和B,单独存在B这三种情况。另外,本文中术语“至少一种”表示多种中的任意一种或多种中的至少两种的任意 组合。例如,包括A、B、C中的至少一种,可以表示包括从A、B和C构成的集合中选择的任意一个或多个元素。The term "and/or" in this document is merely an association relationship that describes the associated objects, which means that there can be three types of relationships. For example, A and/or B can mean: A alone exists, A and B exist at the same time, and B exists alone. In addition, the term "at least one" herein means any one of a plurality of types or any combination of at least two of the plurality of types. For example, including at least one of A, B, and C may mean including any one or more elements selected from the set consisting of A, B, and C.
另外,在下文的具体实施方式中给出了众多的具体细节。本领域技术人员应当理解,没有某些具体细节,本公开同样可以实施。在一些实例中,对于本领域技术人员熟知的方法、手段、元件和电路未作详细描述,以便于凸显本公开的主旨。In addition, numerous specific details are given in the following specific embodiments. Those skilled in the art should understand that the present disclosure can also be implemented without certain specific details. In some instances, the methods, means, elements, and circuits well known to those skilled in the art have not been described in detail, so as to highlight the gist of the present disclosure.
对目标对象进行检测,如针对自动驾驶或无人驾驶场景中的车辆或行人等目标对象进行检测,可以采用基于神经网络的深度学习技术来实现。Detecting target objects, such as detecting target objects such as vehicles or pedestrians in autonomous driving or unmanned driving scenes, can be achieved by using deep learning technology based on neural networks.
一方面,基于深度学习技术实现目标检测的精度,依赖于特定类型的训练数据,导致其适用的应用场景受限。也就是说,根据深度学习技术训练出的神经网络,对与所选择训练数据相关的某种特定场景是可行的,而无法泛化到其他非特定的场景中。比如,对于特定场景,如车辆或行人的目标检测,由于该特定场景比较常见,因此,积累了大量与该车辆或行人的目标检测相关的数据。将这些数据作为特定类型的训练数据,则基于深度学习技术训练出的神经网络会在输入的数据中寻找符合这些类型特征的物体,从而保证了在该特定场景下的目标检测精度。可是,对于并不常见的物体,如一个形状随机的树干或者石块等障碍物,由于根据深度学习技术在训练神经网络过程中没有使用过这类物体的训练数据,因此,很难检测出该障碍物,从而难以将所训练出的神经网络应用于其他非特定的任意场景中。换言之,在某个特定场景下训练的神经网络,在另一个不同类型的场景中的表现就会较差,导致该神经网络的泛化能力较弱。而且,深度学习技术本质上就是对于给定的数据(预期目标)拟合一个复杂的函数,使得符合相同分布的数据输入该函数之后能够给出正确的结果,以得到匹配的假设。但是,常常为了得到这个假设,训练过程有可能变得过度复杂,从而容易出现过拟合。而且,如果输入的数据不符合训练数据的分布,则给出的结果不一定准确。由于训练数据很难覆盖所有可能的道路情况,所以只能针对特定的训练数据以及相关的某种特定场景下给出可靠性较高的结果。On the one hand, the accuracy of target detection based on deep learning technology depends on specific types of training data, which limits its applicable application scenarios. That is to say, the neural network trained according to the deep learning technology is feasible for a certain specific scene related to the selected training data, but cannot be generalized to other non-specific scenes. For example, for a specific scene, such as target detection of a vehicle or pedestrian, since the specific scene is relatively common, a large amount of data related to the target detection of the vehicle or pedestrian has been accumulated. Regarding these data as a specific type of training data, a neural network trained based on deep learning technology will look for objects that meet these types of features in the input data, thereby ensuring the accuracy of target detection in the specific scene. However, for uncommon objects, such as a random-shaped tree trunk or obstacles such as rocks, since the training data of such objects has not been used in the process of training neural networks according to deep learning technology, it is difficult to detect the object. Obstacles, it is difficult to apply the trained neural network to other non-specific arbitrary scenes. In other words, a neural network trained in a certain scene will perform poorly in a different type of scene, resulting in a weaker generalization ability of the neural network. Moreover, the deep learning technology essentially fits a complex function to the given data (expected target), so that the data that conforms to the same distribution can be input into the function to give correct results to get the matching hypothesis. However, often in order to obtain this hypothesis, the training process may become overly complicated, which is prone to overfitting. Moreover, if the input data does not conform to the distribution of the training data, the results given may not be accurate. Because it is difficult for training data to cover all possible road conditions, it can only give highly reliable results for specific training data and related specific scenarios.
另一方面,基于深度学习技术实现目标检测的精度,还依赖于训练算法的优劣。深度学习的特性是不完全可控,即对于给定的输入数据的预测结果不可预期,从而很难达到100%召回率这一理想值。其中,召回率指通过目标检测所识别出来的物体的个数除以实际上物体的个数。一般来说,在自动驾驶或无人驾驶场景中,召回率越高,驾驶的安全性越高。On the other hand, the accuracy of target detection based on deep learning technology also depends on the quality of the training algorithm. The characteristics of deep learning are not completely controllable, that is, the prediction result for a given input data is unpredictable, so it is difficult to achieve the ideal value of 100% recall rate. Among them, the recall rate refers to the number of objects identified through target detection divided by the number of actual objects. Generally speaking, in autonomous driving or unmanned driving scenarios, the higher the recall rate, the higher the safety of driving.
综上所述,采用深度学习技术来实现自动驾驶或无人驾驶场景中的目标检测,更适用于对车辆或行人等目标对象的检测。而对于道路中障碍物的目标检测以避碰,达不到障碍物检测所需要的精度。而障碍物检测的精度,是自动驾驶中为了确保安全驾驶的一项重要环节。例如,如果对障碍物的目标检测达不到障碍物检测的精度,则自动驾驶或无人驾驶的安全性无法得到保障。In summary, the use of deep learning technology to achieve target detection in autonomous driving or unmanned driving scenarios is more suitable for the detection of target objects such as vehicles or pedestrians. However, the target detection of obstacles in the road to avoid collisions cannot reach the accuracy required for obstacle detection. The accuracy of obstacle detection is an important part of automatic driving in order to ensure safe driving. For example, if the target detection of obstacles fails to achieve the accuracy of obstacle detection, the safety of autonomous driving or unmanned driving cannot be guaranteed.
图1示出根据本公开实施例的目标检测方法的流程图。该方法应用于目标检测装置,例如,该装置可以部署于终端设备或服务器或其它处理设备,可执行自动驾驶中的目标检测或目标分类等处理。其中,终端设备可以为用户设备(UE,User Equipment)、移动设备、蜂窝电话、无绳电话、个人数字助理(PDA,Personal Digital Assistant)、手持设备、计算设备、车载设备、可穿戴设备等。在一些可能的实现方式中,该方法可通过处理器调用存储器中存储的计算机可读指令的方式来实现。如图1所示,该流程包括:Fig. 1 shows a flowchart of a target detection method according to an embodiment of the present disclosure. The method is applied to a target detection device. For example, the device can be deployed in a terminal device or a server or other processing equipment, and can perform processing such as target detection or target classification in automatic driving. Among them, the terminal device may be a user equipment (UE, User Equipment), mobile device, cellular phone, cordless phone, personal digital assistant (PDA, Personal Digital Assistant), handheld device, computing device, in-vehicle device, wearable device, etc. In some possible implementation manners, the method may be implemented by a processor invoking computer-readable instructions stored in the memory. As shown in Figure 1, the process includes:
步骤S101、获取点云信息,所述点云信息至少包括目标对象及待检测对象对应的点云信息。Step S101: Obtain point cloud information, where the point cloud information includes at least the point cloud information corresponding to the target object and the object to be detected.
一示例中,可以获取通过至少两个传感器分别扫描得到的多个待处理的点云信息,将所述多个待处理的点云信息进行拼接处理,得到所述点云信息。此外,可以根据该点云信息进行网格化处理,以得到网格信息。In an example, multiple pieces of to-be-processed point cloud information obtained through scanning by at least two sensors may be obtained, and the multiple pieces of to-be-processed point cloud information may be spliced to obtain the point cloud information. In addition, grid processing can be performed according to the point cloud information to obtain grid information.
一示例中,至少两个传感器可以为激光雷达中具备激光发射及接收功能的传感器。In an example, the at least two sensors may be sensors with laser emitting and receiving functions in the lidar.
一示例中,所述目标对象可以指在目标检测过程中通过至少两个传感器扫描的目标设备,如自动驾驶或无人驾驶场景中的车辆。本公开中的目标对象不限于该目标设备,还可以包括导盲的行人等。In an example, the target object may refer to a target device scanned by at least two sensors during the target detection process, such as a vehicle in an autonomous driving or unmanned driving scene. The target object in the present disclosure is not limited to the target device, and may also include pedestrians who guide the blind.
一示例中,所述待检测对象可以指在目标检测过程中与目标对象相关的物体。例如,如目标对象为自动驾驶或无人驾驶场景中的车辆,则为了安全驾驶,待检测对象可以为车辆驾驶路线上的石子、树叶、路障等。所述待检测对象还可以指在目标检测过程中与目标对象处于同一观测画面中的物体。比如,目标对象仍以车辆为例,待检测对象可以为与所述车辆在同一观测画面中的路边广告牌、树木及其树冠等。In an example, the object to be detected may refer to an object related to the target object in the target detection process. For example, if the target object is a vehicle in an autonomous driving or unmanned driving scene, for safe driving, the object to be detected may be stones, leaves, roadblocks, etc. on the driving route of the vehicle. The object to be detected may also refer to an object in the same observation frame as the target object during the target detection process. For example, the target object is still a vehicle as an example, and the object to be detected may be a roadside billboard, a tree and its canopy in the same observation screen as the vehicle.
步骤S102、根据所述点云信息,得到网格信息。其中,所述网格信息至少包括指示所述待检测对象的障碍点信息。Step S102: Obtain grid information according to the point cloud information. Wherein, the grid information includes at least obstacle point information indicating the object to be detected.
一示例中,点云信息可以包括目标对象对应的点云信息,如自动驾驶或无人驾驶场景中的车辆对应的点云信息,还可以包括待检测对象对应的点云信息,如小石子、树叶、路障、路边广告牌、树木及其树冠等。需要指出的是,自动驾驶或无人驾驶场景中,待检测对象中的小石子、树叶、路障是后续要识别的障碍物,而路边广告牌、树木及其树冠由于位于车辆驾驶路径之外,可不作为障碍物来考虑。这样,不仅可以降低运算量,且可以提高对障碍物的检测精度。In an example, the point cloud information may include the point cloud information corresponding to the target object, such as the point cloud information corresponding to the vehicle in the autonomous driving or unmanned driving scene, and the point cloud information corresponding to the object to be detected, such as pebbles, Leaves, roadblocks, roadside billboards, trees and their canopies, etc. It should be pointed out that in autonomous driving or unmanned driving scenarios, small stones, leaves, and roadblocks in the object to be detected are obstacles to be identified later, while roadside billboards, trees and their canopies are located outside the driving path of the vehicle. , Can not be considered as an obstacle. In this way, not only the amount of calculation can be reduced, but also the detection accuracy of obstacles can be improved.
一示例中,可以对点云信息进行网格化处理,得到由多个网格区域构成的网格图。图2示出根据本公开实施例的网格信息的示意图。本公开网格信息的一种实现方式可以是网格图,也可以是其他图表形式,并不做限定。图2中,网格图中包含多个网格区域11,在每个网格区域中包括一个或是多个像素点(图2中,以每个网格区域包括多个像素点为例)。针对网格图中的每个网格区域,需要识别出该网格区域是否存在对应待检测对象、甚至障碍物的像素点(以下简称为障碍点)并以障碍点信息进行标识。这样, 通过网格化处理得到的网格图可视为初始网格图,即其中各网格区域的障碍点信息均为表示“无”的第一值、例如“0”。识别障碍物的过程可视为对网格图中各网格区域的障碍点信息进行更新(可具体为,将某个网格区域的障碍点信息从第一值更新为第二值、例如“1”)的过程,并至少可以采用点云信息中的传感器标识(ring ID)作为更新依据。比如,可以根据ring ID在该网格区域中标记障碍点信息。图5示出根据本公开实施例的每个网格区域中障碍点信息的示意图,以数字“0”及数字“1”作为障碍点信息为例。其中,将网格区域标记为第一值“0”,则表示该网格区域中无障碍点,将网格区域标记为第二值“1”,则表示该网格区域中存在障碍点,从而得到包含障碍点信息的网格图,以便根据该包含障碍点信息的网格图,识别出上述待检测对象中的障碍物。In an example, the point cloud information can be gridded to obtain a grid map composed of multiple grid regions. Fig. 2 shows a schematic diagram of grid information according to an embodiment of the present disclosure. An implementation manner of the grid information of the present disclosure may be a grid graph or other chart forms, which is not limited. In Figure 2, the grid map contains multiple grid areas 11, and each grid area includes one or more pixels (in Figure 2, each grid area includes multiple pixels as an example) . For each grid area in the grid map, it is necessary to identify whether the grid area has a pixel point corresponding to the object to be detected or even an obstacle (hereinafter referred to as an obstacle point) and identify it with obstacle point information. In this way, the grid graph obtained by the gridding process can be regarded as the initial grid graph, that is, the obstacle point information of each grid area is the first value representing "none", such as "0". The process of identifying obstacles can be regarded as updating the obstacle point information of each grid area in the grid map (it can be specifically, updating the obstacle point information of a certain grid area from a first value to a second value, for example, " 1”), and at least the sensor identification (ring ID) in the point cloud information can be used as the update basis. For example, the obstacle point information can be marked in the grid area according to the ring ID. FIG. 5 shows a schematic diagram of obstacle point information in each grid area according to an embodiment of the present disclosure, taking the number "0" and the number "1" as the obstacle point information as an example. Wherein, marking the grid area as the first value "0" indicates that there are no obstacles in the grid area, and marking the grid area as the second value "1" indicates that there are obstacle points in the grid area. In this way, a grid map containing obstacle point information is obtained, so that the obstacle in the object to be detected can be identified according to the grid map containing obstacle point information.
步骤S103、根据所述网格信息,识别出所述待检测对象中对所述目标对象的移动造成影响的障碍物。Step S103: Identify obstacles in the object to be detected that affect the movement of the target object according to the grid information.
一示例中,网格信息可以为包含障碍点信息的网格图。根据该包含障碍点信息的网格图,可以识别出待检测对象中的障碍物。比如,将网格区域标记为“1”,表示为该网格区域中存在障碍点。将多个障碍点连通起来,可以得到该多个障碍点对应的连通区域,并据此确定该多个障碍点对应的待检测对象、甚至障碍物的形状。In an example, the grid information may be a grid graph containing obstacle point information. According to the grid map containing obstacle point information, obstacles in the object to be detected can be identified. For example, marking the grid area as "1" indicates that there are obstacle points in the grid area. Connecting multiple obstacle points can obtain the connected area corresponding to the multiple obstacle points, and determine the shape of the object to be detected corresponding to the multiple obstacle points, and even the shape of the obstacle.
本公开中,通过根据至少两个传感器对目标对象进行扫描,得到包含目标对象及待检测对象对应的点云信息,并根据所述点云信息得到至少包括指示是否存在待检测对象的障碍点信息的网格信息,使得可以根据该网格信息中包含的障碍点信息识别出待检测对象中的障碍物,提高了针对障碍物的目标检测精度。In the present disclosure, by scanning the target object according to at least two sensors, point cloud information corresponding to the target object and the object to be detected is obtained, and according to the point cloud information, information at least including obstacle points indicating whether the object to be detected exists is obtained According to the grid information of the grid information, the obstacles in the object to be detected can be identified according to the obstacle point information contained in the grid information, which improves the accuracy of target detection for the obstacles.
一示例中,至少两个传感器分别对目标对象进行扫描的过程中,根据传感器发出的扫描探测信号及接收的回传信号,可以得到点云信息。比如,传感器向车辆及其周边发射扫描探测信号,然后,传感器接收从车辆及其周边物体反射回来的回传信号,将该回传信号与发射的扫描探测信号进行比较,可以获得车辆及其周边物体的如位置信息、高度信息、距离信息、速度信息、姿态信息、形状信息等参数,从而根据这些参数可以对车辆及其周边物体进行跟踪和识别。In an example, in the process of scanning the target object by at least two sensors, the point cloud information can be obtained according to the scanning detection signal sent by the sensor and the return signal received. For example, the sensor transmits a scanning detection signal to the vehicle and its surroundings, and then the sensor receives the return signal reflected from the vehicle and its surrounding objects, and compares the return signal with the transmitted scanning detection signal to obtain the vehicle and its surroundings. Objects such as position information, height information, distance information, speed information, posture information, shape information and other parameters, so that the vehicle and its surrounding objects can be tracked and identified based on these parameters.
需要指出的是,本公开的点云信息,是同一空间参考系下表达目标区域内物体的空间分布和表面特性的海量点的集合,并以像素点的形式记录每个像素点的三维坐标(其中,采用三维坐标中X/Y两维坐标来标定上述参数中的位置信息,采用三维坐标中的第三维Z来标定上述参数中的高度信息)、颜色信息(RGB)以及激光反射强度(Intensity)信息等中的多项的组合。It should be pointed out that the point cloud information of the present disclosure is a collection of massive points that express the spatial distribution and surface characteristics of objects in the target area under the same spatial reference system, and the three-dimensional coordinates of each pixel point are recorded in the form of pixels ( Among them, the X/Y two-dimensional coordinates in the three-dimensional coordinates are used to calibrate the position information in the above parameters, and the third dimension Z in the three-dimensional coordinates is used to calibrate the height information in the above parameters), color information (RGB), and laser reflection intensity (Intensity ) A combination of multiple items in information, etc.
一示例中,可以从所述点云信息中获取每个像素点的ring ID信息,根据所述多个网格区域中目标网格区域包括的所述ring ID,确定所述目标网格区域中是否存在障碍点。进一步,在所述目标网格区域中存在有障碍点的情况下,更新所述网格信息,可包括如下内容:In an example, the ring ID information of each pixel can be obtained from the point cloud information, and the ring ID included in the target grid area in the multiple grid areas can be determined to be in the target grid area. Whether there are obstacles. Further, in the case that there are obstacle points in the target grid area, updating the grid information may include the following content:
在所述目标网格区域中的至少两个像素点对应的ring ID不同的情况下,确定所述目标网格区域中存在障碍点。图3示出根据本公开实施例的网格区域中像素点来源不同ring ID的示意图,包括传感器21、传感器22及传感器23,待检测对象24,及多个像素点(分别用①-⑥来标识)。需要指出的是,该待检测对象24形状为三角形仅为示意,并不作为对待检测对象实际形状的限定。传感器21和传感器22发射的激光束原本不应当落入待检测对象24所在目标网格区域中,由于目标网格区域存在该待检测对象24,所以,导致传感器21和传感器22发射的激光束产生了反射。其中,通过传感器21扫描以得到由多个像素点构成的点云信息情况下,传感器21发射的激光束211遇到待检测对象24发生反射,则像素点①落入目标网格区域中;通过传感器22扫描以得到由多个像素点构成的点云信息情况下,传感器22发射的激光束221及激光束222遇到待检测对象24发生反射,则像素点②及像素点③落入目标网格区域中;通过传感器23扫描以得到由多个像素点构成的点云信息情况下,传感器23发射的激光束231、激光束232及激光束233未遇到待检测对象24,像素点④、像素点⑤及像素点⑥落入目标网格区域中。可见:多个像素点(分别用①-⑥来标识)分别对应的ring ID为不同标识,意味着所述多个像素点是通过不同传感器得到的。此时,可确定目标网格区域中存在有待检测对象对应的像素点、即障碍点,并将目标网格区域对应的障碍点信息从初始的第一值更新为第二值,以标记所述障碍点的存在。In a case where the ring IDs corresponding to at least two pixel points in the target grid area are different, it is determined that there are obstacle points in the target grid area. Fig. 3 shows a schematic diagram of different ring IDs of pixel points in a grid area according to an embodiment of the present disclosure, including a sensor 21, a sensor 22, and a sensor 23, an object to be detected 24, and a plurality of pixels (respectively use ①-⑥ to Logo). It should be pointed out that the triangular shape of the object 24 to be detected is merely illustrative, and is not intended to limit the actual shape of the object to be detected. The laser beams emitted by the sensor 21 and the sensor 22 should not originally fall into the target grid area where the object 24 to be detected is located. Because the object 24 to be detected exists in the target grid area, the laser beams emitted by the sensor 21 and the sensor 22 are generated.了Reflected. Wherein, when the sensor 21 is scanned to obtain point cloud information composed of multiple pixels, the laser beam 211 emitted by the sensor 21 meets the object 24 to be detected and is reflected, and the pixel ① falls into the target grid area; When the sensor 22 scans to obtain point cloud information composed of multiple pixels, the laser beam 221 and the laser beam 222 emitted by the sensor 22 meet the object 24 to be detected and reflect, and the pixel points ② and the pixel points ③ fall into the target network. In the grid area; in the case of scanning by the sensor 23 to obtain point cloud information composed of multiple pixels, the laser beam 231, laser beam 232, and laser beam 233 emitted by the sensor 23 do not encounter the object to be detected 24, and the pixel points ④, Pixel ⑤ and pixel ⑥ fall into the target grid area. It can be seen that the ring IDs corresponding to multiple pixels (identified by ①-⑥ respectively) are different identifiers, which means that the multiple pixels are obtained by different sensors. At this time, it can be determined that there are pixel points corresponding to the object to be detected in the target grid area, that is, obstacle points, and the obstacle point information corresponding to the target grid area is updated from the initial first value to the second value to mark the The existence of obstacle points.
需要指出的是,图3中的多个传感器(传感器21、传感器22及传感器23)在实际应用中不一定是分散设置,也可以是挨着设置,甚至可将多个传感器设置在一起并呈现不同的投射角度。本示例中为了方便解释对待检测对象、包括障碍物的识别,将多个传感器分散设置,是为了更加直观。本领域技术人员不付出创造性劳动所能够想到的多个传感器摆放位置,都在本公开的保护范围之内。It should be pointed out that the multiple sensors (sensor 21, sensor 22, and sensor 23) in Figure 3 are not necessarily arranged separately in actual applications, they can also be arranged next to each other, or even multiple sensors can be arranged together and displayed. Different projection angles. In this example, in order to facilitate the explanation of the recognition of the object to be detected, including obstacles, multiple sensors are dispersedly arranged for more intuitiveness. Multiple sensor placement positions that can be imagined by those skilled in the art without creative work are within the protection scope of the present disclosure.
在目标网格区域中的至少两个像素点对应的ring ID同一个的情况下,确定所述目标网格区域中不存在障碍点。图4示出根据本公开实施例的网格区域中像素点来源同一ring ID的示意图,包括传感器31,及多个像素点(分别用⑦-⑩来标识)。通过传感器31扫描以得到由多个像素点构成的点云信息情况下,传感器31发射的激光束311、激光束312、激光束313、及激光束314未遇到障碍物,像素点⑦、像素点⑧、像素点⑨及像素点⑩落入目标网格区域中。可见:多个像素点(分别用⑦-⑩来标识)分别对应的ring ID为同一标识,意味着所述多个像素点是通过不同传感器得到的。此时,可确定目标网格区域中不存在障碍点,并将目标网格区域对应的障碍点信息保持为初始的第一值。In a case where the ring IDs corresponding to at least two pixel points in the target grid area are the same, it is determined that there are no obstacle points in the target grid area. FIG. 4 shows a schematic diagram of the source of pixels in the grid area with the same ring ID according to an embodiment of the present disclosure, including a sensor 31 and a plurality of pixels (identified by ⑦-⑩, respectively). In the case of scanning by the sensor 31 to obtain point cloud information composed of multiple pixels, the laser beam 311, laser beam 312, laser beam 313, and laser beam 314 emitted by the sensor 31 have not encountered obstacles, and the pixel point ⑦, pixel Point ⑧, pixel point ⑨ and pixel point ⑩ fall into the target grid area. It can be seen that the ring IDs corresponding to multiple pixels (identified by ⑦-⑩ respectively) are the same identifier, which means that the multiple pixels are obtained by different sensors. At this time, it can be determined that there are no obstacle points in the target grid area, and the obstacle point information corresponding to the target grid area is maintained as the initial first value.
由于点云信息中所包括的待检测对象可以是如小石子、树叶等障碍物,以及在自动驾驶或无人驾驶场景中如树冠,标牌等的非障碍物。因此,在通过上述ring ID进行障碍点判断的基础上,可以进一步增加高度信息,对通过上述ring ID判断出的障碍点进行校验,以避免可能发生的误判断,比如,将如树冠、标牌等非障碍物也识别为障碍物。 因为,以自动驾驶或无人驾驶场景而言,目标对象为车辆,树冠、标识牌等空中的物体不应该属于障碍物,且通常会比石子、树叶等障碍物要高的多。因此,可以将点云信息中像素点的高度信息加入,以从网格区域中排除诸如树冠、标识牌等非障碍物。Since the object to be detected included in the point cloud information may be obstacles such as pebbles and leaves, as well as non-obstacles such as tree crowns and signs in autonomous driving or unmanned driving scenarios. Therefore, on the basis of the obstacle point judgment based on the above ring ID, the height information can be further added, and the obstacle points determined by the above ring ID can be verified to avoid possible misjudgments, such as tree crowns and signs. Other non-obstacles are also recognized as obstacles. Because, in the case of autonomous driving or unmanned driving, the target object is a vehicle, and objects in the sky such as tree crowns and signboards should not be obstacles, and are usually much higher than obstacles such as stones and leaves. Therefore, the height information of the pixels in the point cloud information can be added to exclude non-obstacles such as tree crowns and signs from the grid area.
一示例中,点云信息还包括高度信息的情况下,在所述目标网格区域中存在有障碍点的情况下,更新所述网格信息,还包括:根据所述高度信息,确定所述目标网格区域中存在的障碍点的类别;根据所述障碍点的类别,更新所述网格信息中对应所述目标网格区域的障碍点信息。比如,在网格信息为网格图标记有障碍点信息的例子中,在确定所述目标网格区域中存在的障碍点对应树冠等的非障碍物时,将所述目标网格区域对应的障碍点信息从第二值更新为第一值,从而可有效降低上述误判发生的几率。此外,在确定所述障碍点对应例如路障等的障碍物时,将所述目标网格区域对应的障碍点信息保持为第二值。这样,在更新网格信息后,可以得到更为精确的仅包含障碍物对应的障碍点信息的网格图,以用于后续的目标检测处理。In an example, when the point cloud information further includes height information, and when there are obstacle points in the target grid area, updating the grid information further includes: determining the height information according to the height information. The category of the obstacle point existing in the target grid area; and the obstacle point information corresponding to the target grid area in the grid information is updated according to the category of the obstacle point. For example, in an example where the grid information is a grid graph marked with obstacle point information, when it is determined that the obstacle points existing in the target grid area correspond to non-obstacles such as tree crowns, the target grid area corresponds to The obstruction point information is updated from the second value to the first value, which can effectively reduce the probability of occurrence of the above-mentioned misjudgment. In addition, when it is determined that the obstacle point corresponds to an obstacle such as a roadblock, the obstacle point information corresponding to the target grid area is maintained at a second value. In this way, after the grid information is updated, a more accurate grid map containing only the obstacle point information corresponding to the obstacle can be obtained for subsequent target detection processing.
一示例中,根据所述高度信息,确定所述目标网格区域中存在的障碍点的类别,包括:获取所述目标网格区域中至少两个像素点分别对应的ring ID及高度信息;将所述至少两个像素点根据所述ring ID进行划分,将对应同一个ring ID的像素点作为一组数据,得到多组像素点数据。根据高度信息,确定每一组像素点数据中的最小高度值;通过对所述多组像素点数据中的最小高度值进行归类统计,获得一个或多个最小高度类,并相应确定每个最小高度类所包括的高度值的数量及其中最小值。然后,可根据该目标网格区域对应的每个最小高度类所包括的高度值的数量及其中最小值,确定该目标网格区域中障碍点的类别,即所述障碍点是否为对应障碍物的像素点。一示例中,可以通过将该目标网格区域对应的每个最小高度类所包括的高度值的数量与数量阈值(ring_count_th)进行比较,并将每个最小高度类所包括的高度值中的最小值与高度阈值(height_th)进行比较,来确定该目标网格区域中障碍点的类别。其中,若所述一个或多个最小高度类存在如下的目标最小高度类,该目标最小高度类所包括的高度值的数量大于或等于数量阈值(ring_count_th)并且所包括的高度值中的最小值小于高度阈值(height_th),则认为该目标网格区域中存在的障碍点对应于障碍物。比如,设置ring_count_th=3,而height_th可以取车辆的高度,例如2m。.In an example, determining the category of obstacle points existing in the target grid area according to the height information includes: obtaining ring IDs and height information corresponding to at least two pixels in the target grid area; The at least two pixels are divided according to the ring ID, and the pixels corresponding to the same ring ID are used as a set of data to obtain multiple sets of pixel data. According to the height information, the minimum height value in each group of pixel data is determined; by classifying and counting the minimum height values in the multiple groups of pixel data, one or more minimum height categories are obtained, and each group is determined accordingly. The minimum height category includes the number of height values and the minimum value. Then, the type of obstacle point in the target grid area can be determined according to the number of height values included in each minimum height category corresponding to the target grid area and the minimum value thereof, that is, whether the obstacle point is a corresponding obstacle Of pixels. In an example, the number of height values included in each minimum height class corresponding to the target grid area can be compared with the number threshold (ring_count_th), and the smallest height value included in each minimum height class can be The value is compared with the height threshold (height_th) to determine the category of obstacle points in the target grid area. Wherein, if the one or more minimum height classes have the following target minimum height classes, the number of height values included in the target minimum height class is greater than or equal to the number threshold (ring_count_th) and the minimum value of the included height values If it is less than the height threshold (height_th), it is considered that the obstacle points existing in the target grid area correspond to the obstacle. For example, set ring_count_th=3, and height_th can be the height of the vehicle, for example, 2m. .
通过上述基于某个ring ID的划分得到多个划分结果后,可以该对该网格图进行连通区域分析得到连通区域,并根据所述连通区域识别出待检测对象中的障碍物。可以通过诸如凹多边形、凸多边形、矩形或者三角形等多边形的形式来表示该障碍物,只要可以识别出该障碍物有别于其他对象即可。本公开一示例中采用凸多边形,一方面,就凸多边形的形状特性而言,相比矩形或者三角形,边的数目比矩形或者三角形的更多,更容易准确的表示出障碍物的形状;另一方面,采用凸多边形相比凹多边形,不会引入多余的计算量,运算成本适中。After multiple division results are obtained through the above-mentioned division based on a certain ring ID, a connected area analysis can be performed on the grid graph to obtain a connected area, and an obstacle in the object to be detected can be identified according to the connected area. The obstacle can be represented in the form of a polygon such as a concave polygon, a convex polygon, a rectangle, or a triangle, as long as it can be recognized that the obstacle is different from other objects. In an example of the present disclosure, convex polygons are used. On the one hand, in terms of the shape characteristics of convex polygons, compared with rectangles or triangles, the number of sides is more than that of rectangles or triangles, which makes it easier to accurately represent the shape of obstacles; On the one hand, compared with concave polygons, the use of convex polygons does not introduce redundant calculations, and the computational cost is moderate.
一示例中,对于上述连通区域分析,可以根据如图5所示网格区域中标记为“0”或 “1”的障碍点信息,来搜索连在一起的障碍点信息为“1”的网格区域,从而形成“连通区域”。In an example, for the above-mentioned connected area analysis, the network with the obstacle point information "1" connected together can be searched based on the obstacle point information marked as "0" or "1" in the grid area as shown in FIG. 5. Grid area, thus forming a "connected area".
图6a-图6b示出根据本公开实施例的连通区域连通方式的示意图。连通区域运算可以通过广度优先搜索(BFS,Breadth First Search)算法来实现。一个例子中,图像中最小的单位是像素,每个像素周围有8个邻接像素,则邻接关系有2种:4邻接(如图6a所示)与8邻接(如图6b所示)。其中,4邻接一共4个点,即上下左右共四个像素点。而8邻接的点由于还包括了对角线位置的4个点,即共8个像素点。如果某一个像素点A与像素点B邻接且连通,则形成了一个区域。这样一来,所有彼此连通的像素点构成的集合,称为一个“连通区域”。通过连通区域运算可以得到障碍物,图7示出根据本公开实施例的网格图中障碍物的示意图,如图7所示,网格图中包含多个用凸多边形表示的障碍物。Figures 6a-6b show schematic diagrams of a communication manner of a connected area according to an embodiment of the present disclosure. The connected area calculation can be implemented by the Breadth First Search (BFS) algorithm. In an example, the smallest unit in an image is a pixel, and there are 8 adjacent pixels around each pixel, and there are 2 types of adjacent relationships: 4-adjacent (as shown in Figure 6a) and 8-adjacent (as shown in Figure 6b). Among them, 4 is adjacent to a total of 4 points, that is, there are a total of four pixel points up, down, left, and right. The 8 adjacent points also include 4 points on the diagonal position, that is, a total of 8 pixel points. If a certain pixel point A and pixel point B are adjacent and connected, a region is formed. In this way, the set of all connected pixels is called a "connected region". Obstacles can be obtained through connected region operations. FIG. 7 shows a schematic diagram of obstacles in a grid diagram according to an embodiment of the present disclosure. As shown in FIG. 7, the grid diagram contains multiple obstacles represented by convex polygons.
一示例中,根据上述连通区域,识别出待检测对象中的障碍物之后,还包括:获取连通区域的第一线段上的多个待处理点,从多个待处理点中选取至少两个参考点,连接该至少两个参考点得到第二线段,根据该第二线段调整连通区域后得到第一区域。比如,该第一区域可以小于连通区域。若障碍物是凸多边形,则该连通区域的调整过程可以称之为凸包处理。比如,构成连通区域某个线段(称之为第一线段)有10个待处理点,从10个待处理点中选取6个参考点,连接该6个参考点重新得到一个线段(称之为第二线段),则根据该第二线段调整连通区域后可以得到第一区域,且该第一区域小于连通区域。也就是说经凸包处理后,减少了用于表示障碍物的凸边数量(因为点少了,凸边相应减少),凸多边形较其初始形状变小了。采用凸包处理,可以降低运算量。In an example, after identifying the obstacle in the object to be detected based on the above-mentioned connected area, the method further includes: acquiring a plurality of points to be processed on the first line segment of the connected area, and selecting at least two points from the plurality of points to be processed The reference point is connected to the at least two reference points to obtain a second line segment, and the connected area is adjusted according to the second line segment to obtain the first area. For example, the first area may be smaller than the connected area. If the obstacle is a convex polygon, the adjustment process of the connected area can be called convex hull processing. For example, a certain line segment (called the first line segment) that constitutes a connected region has 10 points to be processed, and 6 reference points are selected from the 10 points to be processed, and the 6 reference points are connected to obtain a line segment (called the first line segment). Is the second line segment), the first area can be obtained after adjusting the connected area according to the second line segment, and the first area is smaller than the connected area. That is to say, after the convex hull is processed, the number of convex edges used to represent obstacles is reduced (because there are fewer points, the convex edges are reduced accordingly), and the convex polygon is smaller than its original shape. Convex hull processing can reduce the amount of calculation.
一示例中,根据所述连通区域,识别出所述待检测对象中的障碍物之后,所述方法还包括:从所述点云信息中提取目标对象对应的点云信息,根据所述目标对象对应的点云信息中像素点的坐标,得到所述目标对象对应的目标位置;获取基于所述网格信息识别出的至少两个障碍物;以所述目标位置的中心点为基准,根据预设角度发出的指引线得到扇形区域;在所述扇形区域覆盖第一障碍物和第二障碍物,且所述第二障碍物被所述第一障碍物遮挡的情况下,将所述第二障碍物的障碍点信息从所述网格信息中删除。图8示出根据本公开实施例的删除网格图中被遮挡障碍物的示意图,如图8所示,网格图中包含目标对象及至少两个障碍物,目标对象可以为车辆41,至少两个障碍物中的第一障碍物可以为警示对象42,至少两个障碍物中的第二障碍物可以为一个或多个石子43。以车辆41当前位置的中心点为基准,根据预设角度α发出的指引线得到扇形区域,警示对象42及一个或多个石子43均被该扇形区域所覆盖。然而,由于一个或多个石子43被警示对象42遮挡,因此,从车辆41当前的中心位置去观察障碍物,是看不到一个或多个石子43,只能看到警示对象42。换言之,一个或多个石子43,作为比警示对象42更小的物体,可以不去关心。因此,可将一个或多个石子43从网格图中删除。需要指出的是,第二障碍物不限于被遮挡的小石子,还可以是路边的草丛等。In an example, after the obstacle in the object to be detected is identified according to the connected area, the method further includes: extracting the point cloud information corresponding to the target object from the point cloud information, and according to the target object The coordinates of the pixel points in the corresponding point cloud information are obtained to obtain the target position corresponding to the target object; at least two obstacles identified based on the grid information are obtained; the center point of the target position is used as a reference, according to the prediction Suppose the guide line issued by the angle obtains a fan-shaped area; when the fan-shaped area covers the first obstacle and the second obstacle, and the second obstacle is blocked by the first obstacle, the second obstacle The obstacle point information of the obstacle is deleted from the grid information. FIG. 8 shows a schematic diagram of deleting obstructed obstacles in a grid diagram according to an embodiment of the present disclosure. As shown in FIG. 8, the grid diagram contains a target object and at least two obstacles. The target object may be a vehicle 41, at least The first obstacle among the two obstacles may be the warning object 42, and the second obstacle among the at least two obstacles may be one or more stones 43. Taking the center point of the current position of the vehicle 41 as a reference, a fan-shaped area is obtained according to the guide line issued by the preset angle α, and the warning object 42 and one or more stones 43 are all covered by the fan-shaped area. However, since one or more stones 43 are blocked by the warning object 42, when observing the obstacle from the current center position of the vehicle 41, one or more stones 43 can not be seen, but only the warning object 42 can be seen. In other words, one or more stones 43, as objects smaller than the warning object 42, can be ignored. Therefore, one or more stones 43 can be deleted from the grid map. It should be pointed out that the second obstacle is not limited to small stones that are blocked, and can also be grass on the side of the road.
一示例中,所述方法包括:向目标对象(如车辆)发送导航路径上存在障碍物的消息,以使所述目标对象响应于该消息进行避障处理和/或重新规划导航路径。In an example, the method includes: sending a message that there is an obstacle on the navigation path to a target object (such as a vehicle), so that the target object performs obstacle avoidance processing and/or replans the navigation path in response to the message.
应用示例:Application example:
根据上述实施例的一个应用示例,包括如下内容:An application example according to the above embodiment includes the following content:
一、根据激光雷达中多个传感器对目标对象进行扫描,得到包含目标对象及待检测对象对应的点云信息。然后,基于所述点云信息,得到至少包含指示待检测对象的障碍点信息的网格信息,如网格信息可以是标记有障碍点信息的网格图。1. Scan the target object according to multiple sensors in the lidar, and obtain the point cloud information corresponding to the target object and the object to be detected. Then, based on the point cloud information, grid information containing at least obstacle point information indicating the object to be detected is obtained. For example, the grid information may be a grid graph marked with obstacle point information.
一个或多个激光雷达中可以有多个传感器,多个传感器共同构建整个场景的点云信息,将整个扫描区域(每个激光雷达在同一时刻或是同一时段内,扫描出来的一组点云所覆盖的区域)对应到网格图上。由于每个激光雷达中每个激光发射器指向的方向与水平面的夹角都不一样,因此,激光雷达每扫描一次,每个传感器都会转一圈扫描到某个角度下一圈的点云信息。There can be multiple sensors in one or more lidars, and multiple sensors jointly construct the point cloud information of the entire scene, and the entire scanning area (a group of point clouds scanned by each lidar at the same time or within the same period of time) The area covered) corresponds to the grid map. Since the angle between the direction of each laser transmitter in each lidar and the horizontal plane is different, every time the lidar scans, each sensor will turn and scan the point cloud information at a certain angle and the next circle. .
若是网格图中某个网格区域上无例如障碍物的物体凸起,则该网格区域应当是与地面高度差不多匹配的平面,该网格区域对应的传感器的相邻传感器发射的激光就不会被该网格区域所阻碍,这使得发射到该网格区域的激光都来源于同一个传感器。因此,假设落入某个网格区域内的所有像素点来源于同一个传感器,即该网格区域内的所有像素点对应的ring ID是相同的,也即落到该网格区域内的所有像素点是同一个传感器所扫描得到的,则可认为该网格区域内不存在可能对应障碍物的障碍点。然而,如果某个网格区域有例如障碍物的物体凸起,则该网格区域对应的传感器的相邻传感器发射的激光会被该网格区域上突出的物体阻碍并反射,这使得射到该网格区域的激光来源于不同的传感器。因此,假设落到某个网格区域内的像素点对应的传感器有多个,即该网格区域内的像素点对应的ring ID是不同的,也即落入该网格区域内的像素点是不同传感器所扫描得到的,则可认为该网格区域内存在可能对应障碍物的障碍点。If there is no object such as an obstacle on a certain grid area in the grid map, the grid area should be a plane that almost matches the height of the ground. The laser light emitted by the adjacent sensor of the grid area corresponds to the sensor. Will not be hindered by the grid area, which makes the laser light emitted to the grid area come from the same sensor. Therefore, it is assumed that all pixels falling in a certain grid area originate from the same sensor, that is, the ring IDs corresponding to all pixels in the grid area are the same, that is, all the pixels falling in the grid area are the same. If the pixel points are scanned by the same sensor, it can be considered that there are no obstacle points that may correspond to obstacles in the grid area. However, if a certain grid area has a raised object such as an obstacle, the laser light emitted by the adjacent sensor of the sensor corresponding to the grid area will be blocked and reflected by the protruding object on the grid area, which makes it hit The laser in the grid area comes from different sensors. Therefore, it is assumed that there are multiple sensors corresponding to pixels that fall in a certain grid area, that is, the ring IDs corresponding to the pixels in the grid area are different, that is, the pixels that fall into the grid area If it is scanned by different sensors, it can be considered that there are obstacle points that may correspond to obstacles in the grid area.
进一步的,利用落入网格区域中的像素点的ring ID的个数来判断该网格区域内是否可能存在有障碍物,还可以进一步优化。具言之,对于包括例如树冠,标识牌等空中的物体的某个网格区域,虽然也会有属于多个传感器的激光射到该网格区域中,即落到该网格区域内的像素点对应的ring ID是不同的。但是对于目标对象为车辆的情况下,树冠、标识牌等空中的物体不属于车辆所关注的障碍物,并需要排除将树冠、标识牌也识别作为车辆需要避让的障碍物的情况。因此,可将像素点的高度信息加入考虑,来对由ring ID得到的可能障碍物进行校验以过滤掉高于某个高度的物体,从而进一步提高对障碍物检测的精度。Further, the number of ring IDs of pixels falling in the grid area is used to determine whether there may be an obstacle in the grid area, which can be further optimized. In particular, for a certain grid area that includes objects in the air such as tree crowns, signboards, etc., although lasers belonging to multiple sensors will also be emitted into the grid area, that is, the pixels that fall into the grid area The ring IDs corresponding to the points are different. However, when the target object is a vehicle, objects in the sky such as tree crowns and signboards do not belong to the obstacles that the vehicle pays attention to, and it is necessary to eliminate the situation where the tree canopy and signboards are also identified as obstacles that the vehicle needs to avoid. Therefore, the height information of the pixels can be taken into consideration to check the possible obstacles obtained by the ring ID to filter out objects higher than a certain height, thereby further improving the accuracy of obstacle detection.
需要指出的,如果输入的点云信息为多个激光雷达扫描结果的融合,则对于每一个激光雷达扫描出的点云信息都可以构建N×M的网格图,可以预先设定每一格的边长在现实中表示0.1m,并将坐标(N/2,M/2)处设置为车辆中心。对于输入的点云信息为 一个激光雷达扫描结果的情况,则直接构建N×M的网格图。无论点云信息包括多个激光雷达扫描结果的融合,还是包括一个激光雷达扫描结果,都采用下述识别障碍物的方法来判断障碍物,以得到带障碍点信息的网格图。It should be pointed out that if the input point cloud information is the fusion of multiple lidar scan results, then an N×M grid can be constructed for each point cloud information scanned by lidar, and each grid can be preset The side length of represents 0.1m in reality, and the coordinates (N/2, M/2) are set as the center of the vehicle. For the case where the input point cloud information is the result of a lidar scan, an N×M grid map is directly constructed. Regardless of whether the point cloud information includes the fusion of multiple lidar scan results or one lidar scan result, the following obstacle identification method is used to judge the obstacles to obtain a grid map with obstacle point information.
根据ring ID以及高度信息来判断某个网格区域是否存在障碍物的过程中,可将单个激光雷达扫描出的点云信息中的像素点按照位置信息分配到网格中。对于每个网格区域,统计分配到其中的像素点的ring ID(相同的ring ID不重复统计)。然后,将对应同一个ring ID的像素点作为一组数据,得到多组像素点数据。接着,根据高度信息,确定每一组像素点数据中的最小高度值,并对所述多组像素点的最小高度值进行归类统计,获得至少一个最小高度类。针对每个最小高度类,根据该最小高度类所包括的高度值的数量以及这些高度值中的最小值,确定可能障碍物的类别。一示例中,可以通过将每个最小高度类所包括的高度值的数量与数量阈值进行比较,并将每个最小高度类所包括高度值中的最小值与高度阈值进行比较,来确定该网格区域中存在的障碍点的类别。其中,若所述至少一个最小高度类中存在如下的目标最小高度类,该目标最小高度类所包括的高度值的数量大于或等于数量阈值(ring_count_th),并且所包括的高度值中的最小值小于高度阈值(height_th),则认为该网格区域内存在的障碍点对应真的会影响目标对象行进的障碍物。采用归类统计的好处是:找到在高度上连续的一段障碍物,而不是某个单点。最终,对于每个激光雷达都可以获得一张网格图,将多个网格图的每个元素进行“或”操作进行融合,得到输出结果,即得到带障碍点信息的网格图。针对“或”操作的一个例子是:网格图中用“1”表示有障碍点,用“0”表示无障碍点,有两个1x3的网格图,分别为[1,0,0]和[0,1,0],则将两个网格图做“或”操作是对于其中对应的网格区域,若有其中一个或者两个网格区域标为“1”,则叠加后网格区域的对应位置就标“1”,则针对这两个网格图执行“或”操作之后的结果就是[1,1,0]。In the process of judging whether there is an obstacle in a grid area according to the ring ID and height information, the pixels in the point cloud information scanned by a single lidar can be allocated to the grid according to the position information. For each grid area, count the ring IDs of the pixels allocated to it (the same ring ID is not counted repeatedly). Then, the pixels corresponding to the same ring ID are used as a set of data to obtain multiple sets of pixel data. Then, according to the height information, the minimum height value in each group of pixel data is determined, and the minimum height values of the multiple groups of pixels are classified and counted to obtain at least one minimum height category. For each minimum height class, the class of possible obstacles is determined according to the number of height values included in the minimum height class and the minimum of these height values. In one example, the network can be determined by comparing the number of height values included in each minimum height class with a threshold value, and comparing the minimum value of the height values included in each minimum height class with the height threshold value. The category of obstacle points that exist in the grid area. Wherein, if the following target minimum height class exists in the at least one minimum height class, the number of height values included in the target minimum height class is greater than or equal to the number threshold (ring_count_th), and the minimum value of the included height values If it is less than the height threshold (height_th), it is considered that the obstacle points in the grid area correspond to obstacles that will really affect the target object. The advantage of using classification statistics is to find a continuous segment of obstacles in height, rather than a single point. In the end, a grid map can be obtained for each lidar, and each element of the multiple grid maps is merged by "OR" operation, and the output result is obtained, that is, the grid map with obstacle point information is obtained. An example of the "or" operation is: "1" in the grid graph indicates an obstacle point, and "0" indicates an obstacle-free point. There are two 1x3 grid graphs, respectively [1, 0, 0] And [0, 1, 0], then the two grid graphs will be "or" for the corresponding grid area, if one or two of the grid areas are marked as "1", then the grid will be superimposed. The corresponding position of the grid area is marked with "1", and the result of the "or" operation on these two grid graphs is [1, 1, 0].
在以ring ID判断可能存在障碍物的基础上加以高度信息来校验某个网格区域是否真的存在障碍物的过程中,可以采用补偿方式来提升检测距离,从而保证检测质量。在统计每个网格区域中像素点的ring ID时,由于点云信息在远处物体上会变稀疏,因此,网格区域距离车辆中心的距离(distance)越远时,需要采用一并统计其周围网格区域的补偿方式,该补偿方式可以为:还需要统计以该网格区域为中心,n×n大小的范围。其中,In the process of judging that there may be obstacles based on the ring ID, height information is added to verify whether there are obstacles in a certain grid area. Compensation methods can be used to increase the detection distance to ensure the detection quality. When counting the ring IDs of pixels in each grid area, because the point cloud information will become sparse on distant objects, when the distance between the grid area and the center of the vehicle (distance) is farther, it is necessary to use collective statistics The compensation method of the surrounding grid area, the compensation method can be: also need to count the grid area as the center, n×n size range. in,
n=around(1+a×distance),n=around(1+a×distance),
around函数表示四舍五入,a为一个预先设定的较小常数。这样,在进行归类统计时,可先对于最小高度值数组中所有高度值进行排序,排序后的数列中若连续两项的高度差大于某个阈值(gap_th),则分为两个类。The around function represents rounding, and a is a small predetermined constant. In this way, when performing classification statistics, all the height values in the minimum height value array can be sorted first, and if the height difference between two consecutive two items in the sorted sequence is greater than a certain threshold (gap_th), it is divided into two categories.
其中,由于远处物体上反射的像素点的分布一定比近处的分布更加分散,因此,gap_th作为修正函数,其取值可以根据网格区域距离车辆中心的距离(distance)进行一 定的修正。比如,根据传感器的安装位置、角度、点云稀疏性等不同的情况,分别采用不同的补偿方案。一个示例中,Among them, because the distribution of pixels reflected on distant objects must be more scattered than the distribution nearby, gap_th is used as a correction function, and its value can be corrected according to the distance between the grid area and the center of the vehicle (distance). For example, according to different conditions such as the installation position, angle, and point cloud sparseness of the sensor, different compensation schemes are adopted. In an example,
gap_th=a×distance+bgap_th=a×distance+b
其中,阈值gap_th的单位为米,a、b为较小常数。计算得到的gap_th是一个较小的值,可以取0.1m。Among them, the unit of the threshold gap_th is meters, and a and b are relatively small constants. The calculated gap_th is a small value, which can be 0.1m.
就数量阈值ring_count_th取值而言,可以根据点云信息的稀疏程度进行补偿。一个示例中,可以取定值,例如取3。就高度阈值height_th的取值而言,由于传感器(传感器可以设置于车辆上的激光雷达)有一定的仰角,因此,高度阈值height_th不能取定值,可以根据网格区域距离车辆中心的距离(distance)进行一定的角度修正。比如,一个示例中,假设修正角度的正切值为a,则令As for the value of the number threshold ring_count_th, compensation can be made according to the sparseness of the point cloud information. In an example, a fixed value may be used, for example, 3. As for the value of the height threshold height_th, since the sensor (the lidar that the sensor can be installed on the vehicle) has a certain elevation angle, the height threshold height_th cannot be set to a fixed value. It can be based on the distance between the grid area and the center of the vehicle (distance ) Perform a certain angle correction. For example, in an example, assuming that the tangent of the correction angle is a, then let
height_th=1+a×distanceheight_th=1+a×distance
其中,高度阈值height_th的单位为米。需要说明的是,对于上述各参数的取值,可以结合实际情况进行设置,在此不限定具体的设置方式。Among them, the unit of the height threshold height_th is meters. It should be noted that the values of the above-mentioned parameters can be set according to actual conditions, and the specific setting methods are not limited here.
二、对网格图中的障碍点进行连通区域分析,得到连通区域,根据连通区域可以得到凸多边形表示的障碍物。2. Analyze the connected area of the obstacle points in the grid graph to obtain the connected area. According to the connected area, the obstacle represented by the convex polygon can be obtained.
得到上述网格图后,每个网格区域的值表示了该网格区域是否存在有对应障碍物的障碍点。由于点云信息的稀疏性,有些较大的物体被分成了很多部分,可以先用图像膨胀算法处理该网格图,从而将同一个物体的多个部分连通。接着,进行连通区域分析(每个连通区域可表示一个物体,如障碍物)。对于每个连通区域,计算其凸包,接着对于每个凸包使用凸包运算,如Ramer–Douglas–Peucker算法,从而可以简化凸包的边数,降低运算量。最终,做FOV分析去除从车辆中心点观察所无法观察到的小障碍物。After obtaining the above grid map, the value of each grid area indicates whether there is an obstacle point corresponding to the obstacle in the grid area. Due to the sparseness of the point cloud information, some larger objects are divided into many parts, and the image expansion algorithm can be used to process the grid first to connect multiple parts of the same object. Next, perform connected area analysis (each connected area can represent an object, such as an obstacle). For each connected region, calculate its convex hull, and then use convex hull operations for each convex hull, such as the Ramer–Douglas–Peucker algorithm, which can simplify the number of edges of the convex hull and reduce the amount of computation. Finally, FOV analysis is performed to remove small obstacles that cannot be observed from the center of the vehicle.
凸包运算的一个例子中,包括:An example of convex hull operation includes:
1.对于一条需要简化的折线,在折线首尾两点A、B之间连接一条直线AB;1. For a polyline that needs to be simplified, connect a straight line AB between the two points A and B at the beginning and end of the polyline;
2.遍历找到折线上距离直线AB最远的点C,计算其与直线AB的距离;2. Traverse to find the point C farthest from the line AB on the polyline, and calculate the distance between it and the line AB;
3.比较该距离与预先给定的阈值,若该距离小于或等于该阈值,则将直线AB作为这段折线的近似,该段折线处理完成;3. Compare the distance with a predetermined threshold. If the distance is less than or equal to the threshold, the straight line AB is taken as the approximation of this section of polyline, and the processing of this section of polyline is completed;
4.如果该距离大于该阈值,则用点C将直线AB分为两段直线AC和BC,并分别对两段直线AC和BC进行上述1-4步骤的处理;4. If the distance is greater than the threshold, use point C to divide the straight line AB into two straight lines AC and BC, and perform the above-mentioned steps 1-4 on the two straight lines AC and BC respectively;
5.当所有的折线都处理完毕时,依次连接各个分割点形成的折线,既可作为初始折线的近似,得到更新后的凸包。5. When all the polylines have been processed, the polylines formed by connecting each dividing point in turn can be used as an approximation of the initial polylines to obtain the updated convex hull.
FOV分析的一个例子中,包括:对于每两个凸包C1和C2,需要检测在凸包C2的遮挡下,从本车位置、例如车辆的中心点A能否观察到凸包C1。具言之,可包括:An example of FOV analysis includes: for every two convex hulls C1 and C2, it is necessary to detect whether the convex hull C1 can be observed from the position of the vehicle, such as the center point A of the vehicle, under the occlusion of the convex hull C2. Specifically, it can include:
1.对于凸包C1上的每个点P,连接中心点A和点P,1. For each point P on the convex hull C1, connect the center point A and point P,
2.检测直线AP是否穿过凸包C2,这一步的检测可以通过叉乘运算判断凸包C2上所有点是否都在直线AP的同一边,如果都在同一边,则认为直线AP没有穿过凸包C2。2. Detect whether the straight line AP passes through the convex hull C2. In this step of detection, whether all the points on the convex hull C2 are on the same side of the straight line AP through the cross product operation. If they are on the same side, it is considered that the straight line AP does not pass through Convex hull C2.
3.遍历凸包C1上的点后可得到在本车位置无法观察到凸包C1上的点的个数n,如果n大于等于某个阈值fov_th,则认为凸包C1不可见,可将凸包C1删除。3. After traversing the points on the convex hull C1, the number n of points on the convex hull C1 that cannot be observed at the position of the vehicle can be obtained. If n is greater than or equal to a certain threshold fov_th, the convex hull C1 is considered invisible, and the convex Package C1 is deleted.
其中,阈值fov_th的取值需要根据障碍物到本车的距离进行修正。一个修正示例为:Among them, the value of the threshold fov_th needs to be corrected according to the distance from the obstacle to the vehicle. An example of a correction is:
fov_th=min(1,ceil(convex_point_num×(1–distance/a))),fov_th=min(1, ceil(convex_point_num×(1–distance/a))),
其中,convex_point_num为对应凸包上的点的个数;distance为凸包到本车的距离;a为某个较大的常数,可以取值为最大可感知的距离值;ceil为向上取整函数。Among them, convex_point_num is the number of points on the corresponding convex hull; distance is the distance from the convex hull to the vehicle; a is a larger constant, which can be taken as the maximum perceivable distance value; ceil is a rounding up function .
本领域技术人员可以理解,在具体实施方式的上述方法中,各步骤的撰写顺序并不意味着严格的执行顺序而对实施过程构成任何限定,各步骤的具体执行顺序应当以其功能和可能的内在逻辑确定。Those skilled in the art can understand that in the above-mentioned methods of the specific implementation, the writing order of the steps does not mean a strict execution order but constitutes any limitation on the implementation process. The specific execution order of each step should be based on its function and possibility. The inner logic is determined.
本公开提及的上述各个方法实施例,在不违背原理逻辑的情况下,均可以彼此相互结合形成结合后的实施例,限于篇幅,本公开不再赘述。The foregoing various method embodiments mentioned in the present disclosure can all be combined with each other to form a combined embodiment without violating the principle and logic, and the length is limited, and the details of this disclosure will not be repeated.
此外,本公开还提供了一种目标检测装置、电子设备、计算机可读存储介质、程序,上述均可用来实现本公开提供的任一种目标检测方法,相应技术方案和描述和参见方法部分的相应记载,不再赘述。In addition, the present disclosure also provides a target detection device, electronic equipment, computer-readable storage medium, and a program, all of which can be used to implement any target detection method provided in the present disclosure. For the corresponding technical solutions and descriptions, refer to the method section. The corresponding records will not be repeated here.
图9示出根据本公开实施例的目标检测装置的框图,如图9所示,该装置,包括:获取单元51,用于获取点云信息,所述点云信息至少包括目标对象及待检测对象对应的点云信息,其中所述目标对象能够移动,所述待检测对象为所述目标对象周围的人或物;信息处理单元52,用于根据所述点云信息,得到网格信息,其中,所述网格信息至少包括指示所述待检测对象的障碍点信息;检测单元53,用于根据所述网格信息,识别出所述待检测对象中对所述目标对象的移动造成影响的障碍物。Fig. 9 shows a block diagram of a target detection device according to an embodiment of the present disclosure. As shown in Fig. 9, the device includes: an acquiring unit 51 for acquiring point cloud information. The point cloud information includes at least a target object and a target object to be detected. Point cloud information corresponding to the object, where the target object can move, and the object to be detected is people or things around the target object; the information processing unit 52 is configured to obtain grid information according to the point cloud information, Wherein, the grid information includes at least obstacle point information indicating the object to be detected; the detection unit 53 is configured to identify, according to the grid information, that the object to be detected affects the movement of the target object Obstacles.
可能的实现方式中,所述获取单元,用于:获取通过至少两个传感器分别扫描得到的多个待处理的点云信息;将所述多个待处理的点云信息进行拼接处理,得到所述点云信息。In a possible implementation manner, the acquiring unit is configured to: acquire a plurality of to-be-processed point cloud information scanned by at least two sensors; and perform stitching processing on the plurality of to-be-processed point cloud information to obtain Describe point cloud information.
可能的实现方式中,所述点云信息还包括传感器标识(ring ID)。所述信息处理单元,用于:对所述点云信息进行网格化处理,得到网格图,所述网格图包括多个网格区域,且每个所述网格区域对应的所述障碍点信息为第一值;针对每个所述网格区域,根据所述网格区域包括的ring ID,确定所述目标网格区域中是否存在对应所述待检测对象的障碍点;在所述网格区域中存在所述有障碍点的情况下,将所述网格信息中对应所述网格区域的所述障碍点信息更新为第二值。In a possible implementation manner, the point cloud information further includes a sensor identification (ring ID). The information processing unit is configured to: perform grid processing on the point cloud information to obtain a grid graph, the grid graph includes a plurality of grid regions, and each of the grid regions corresponds to the The obstacle point information is the first value; for each grid area, according to the ring ID included in the grid area, determine whether there is an obstacle point corresponding to the object to be detected in the target grid area; If the obstacle point exists in the grid area, the obstacle point information corresponding to the grid area in the grid information is updated to a second value.
可能的实现方式中,所述信息处理单元,用于:在所述网格区域中的至少两个像素点对应的ring ID不同的情况下,确定所述网格区域中存在所述障碍点。In a possible implementation manner, the information processing unit is configured to determine that the obstacle point exists in the grid area when the ring IDs corresponding to at least two pixel points in the grid area are different.
可能的实现方式中,所述点云信息还包括高度信息,所述装置,还包括类别确定单元,用于:根据所述高度信息,确定所述网格区域中存在的所述障碍点的类别;根据所述障碍点的类别,更新所述网格信息中对应所述网格区域的所述障碍点信息。In a possible implementation manner, the point cloud information further includes height information, and the device further includes a category determining unit configured to determine the category of the obstacle point existing in the grid area according to the height information ; According to the category of the obstacle point, update the obstacle point information corresponding to the grid area in the grid information.
可能的实现方式中,所述类别确定单元,用于:获取所述网格区域中至少两个像素点分别对应的ring ID及高度信息;将所述至少两个像素点中对应同一个ring ID的像素点作为一组数据,得到多组像素点数据;根据所述高度信息,确定每一组所述像素点数据中的最小高度值;对所述多组像素点数据中的最小高度值进行归类统计,获得一个或多个最小高度类;根据每个最小高度类所包括的高度值的数量及其中最小值,确定所述网格区域中存在的障碍点的类别。In a possible implementation manner, the category determining unit is configured to: obtain ring IDs and height information respectively corresponding to at least two pixels in the grid area; and correspond to the same ring ID in the at least two pixels As a set of data, multiple sets of pixel data are obtained; according to the height information, the minimum height value in each set of pixel data is determined; the minimum height value in the multiple sets of pixel data is determined Categorize statistics to obtain one or more minimum height categories; determine the categories of obstacle points existing in the grid area according to the number of height values included in each minimum height category and the minimum value thereof.
可能的实现方式中,所述检测单元,用于:根据所述网格信息中的所述障碍点信息进行连通区域分析,得到连通区域;根据所述连通区域,识别出所述待检测对象中所述的障碍物。In a possible implementation manner, the detection unit is configured to: perform a connected area analysis according to the obstacle point information in the grid information to obtain a connected area; according to the connected area, identify the object to be detected The obstacles.
可能的实现方式中,所述装置还包括连通区域调整单元,用于:获取所述连通区域的第一线段上的多个待处理点;从所述多个待处理点中选取至少两个参考点;连接所述至少两个参考点得到第二线段,根据所述第二线段调整所述连通区域后得到第一区域。一示例中,所述第一区域可以小于所述连通区域。In a possible implementation manner, the device further includes a connected area adjustment unit, configured to: obtain a plurality of points to be processed on the first line segment of the connected area; select at least two points from the plurality of points to be processed Reference point; connecting the at least two reference points to obtain a second line segment, and adjusting the connected area according to the second line segment to obtain the first area. In an example, the first area may be smaller than the connected area.
可能的实现方式中,所述装置还包括:遮挡处理单元,用于:从所述点云信息中提取目标对象对应的点云信息,根据所述目标对象对应的点云信息中像素点的坐标,得到所述目标对象对应的目标位置;获取基于所述网格信息识别出的至少两个障碍物;以所述目标位置的中心点为基准,根据预设角度发出的指引线得到扇形区域;在所述扇形区域覆盖第一障碍物和第二障碍物,且所述第二障碍物被所述第一障碍物遮挡的情况下,将所述第二障碍物的障碍点信息从所述网格信息中删除。In a possible implementation manner, the device further includes: an occlusion processing unit, configured to: extract the point cloud information corresponding to the target object from the point cloud information, and according to the coordinates of the pixel points in the point cloud information corresponding to the target object , Obtain the target position corresponding to the target object; obtain at least two obstacles identified based on the grid information; use the center point of the target position as a reference, and obtain a fan-shaped area according to a guide line issued at a preset angle; In the case that the fan-shaped area covers the first obstacle and the second obstacle, and the second obstacle is blocked by the first obstacle, the obstacle point information of the second obstacle is obtained from the network Delete from the grid information.
可能的实现方式中,所述装置还包括发送单元,用于:向所述目标对象发送导航路径上存在障碍物的消息,以使所述目标对象响应于所述消息进行避障处理和/或重新规划导航路径。In a possible implementation manner, the device further includes a sending unit, configured to send a message that there is an obstacle on the navigation path to the target object, so that the target object performs obstacle avoidance processing and/or in response to the message Re-plan the navigation path.
在一些实施例中,本公开实施例提供的装置具有的功能或包含的模块可以用于执行上文方法实施例描述的方法,其具体实现可以参照上文方法实施例的描述,为了简洁,这里不再赘述。In some embodiments, the functions or modules contained in the device provided in the embodiments of the present disclosure can be used to execute the methods described in the above method embodiments. For specific implementation, refer to the description of the above method embodiments. For brevity, here No longer.
本公开实施例还提出一种计算机可读存储介质,其上存储有计算机程序指令,所述计算机程序指令被处理器执行时实现上述方法。计算机可读存储介质可以是非易失性计算机可读存储介质。The embodiments of the present disclosure also provide a computer-readable storage medium on which computer program instructions are stored, and the computer program instructions implement the above-mentioned method when executed by a processor. The computer-readable storage medium may be a non-volatile computer-readable storage medium.
本公开实施例还提出一种电子设备,包括:处理器;用于存储处理器可执行指令的存储器;其中,所述处理器被配置为上述方法。An embodiment of the present disclosure also provides an electronic device, including: a processor; a memory for storing executable instructions of the processor; wherein the processor is configured as the above-mentioned method.
电子设备可以被提供为终端、服务器或其它形态的设备。The electronic device can be provided as a terminal, server or other form of device.
图10是根据一示例性实施例示出的一种电子设备800的框图。例如,电子设备800可以是移动电话,计算机,数字广播终端,消息收发设备,游戏控制台,平板设备,医疗设备,健身设备,个人数字助理等终端。Fig. 10 is a block diagram showing an electronic device 800 according to an exemplary embodiment. For example, the electronic device 800 may be a mobile phone, a computer, a digital broadcasting terminal, a messaging device, a game console, a tablet device, a medical device, a fitness device, a personal digital assistant, and other terminals.
参照图10,电子设备800可以包括以下一个或多个组件:处理组件802,存储器804,电源组件806,多媒体组件808,音频组件810,输入/输出(I/O)的接口812,传感器组件814,以及通信组件816。10, the electronic device 800 may include one or more of the following components: a processing component 802, a memory 804, a power component 806, a multimedia component 808, an audio component 810, an input/output (I/O) interface 812, and a sensor component 814 , And communication component 816.
处理组件802通常控制电子设备800的整体操作,诸如与显示,电话呼叫,数据通信,相机操作和记录操作相关联的操作。处理组件802可以包括一个或多个处理器820来执行指令,以完成上述的方法的全部或部分步骤。此外,处理组件802可以包括一个或多个模块,便于处理组件802和其他组件之间的交互。例如,处理组件802可以包括多媒体模块,以方便多媒体组件808和处理组件802之间的交互。The processing component 802 generally controls the overall operations of the electronic device 800, such as operations associated with display, telephone calls, data communications, camera operations, and recording operations. The processing component 802 may include one or more processors 820 to execute instructions to complete all or part of the steps of the foregoing method. In addition, the processing component 802 may include one or more modules to facilitate the interaction between the processing component 802 and other components. For example, the processing component 802 may include a multimedia module to facilitate the interaction between the multimedia component 808 and the processing component 802.
存储器804被配置为存储各种类型的数据以支持在电子设备800的操作。这些数据的示例包括用于在电子设备800上操作的任何应用程序或方法的指令,联系人数据,电话簿数据,消息,图片,视频等。存储器804可以由任何类型的易失性或非易失性存储设备或者它们的组合实现,如静态随机存取存储器(SRAM),电可擦除可编程只读存储器(EEPROM),可擦除可编程只读存储器(EPROM),可编程只读存储器(PROM),只读存储器(ROM),磁存储器,快闪存储器,磁盘或光盘。The memory 804 is configured to store various types of data to support operations in the electronic device 800. Examples of these data include instructions for any application or method to operate on the electronic device 800, contact data, phone book data, messages, pictures, videos, etc. The memory 804 can be implemented by any type of volatile or non-volatile storage device or a combination thereof, such as static random access memory (SRAM), electrically erasable programmable read-only memory (EEPROM), erasable and Programmable Read Only Memory (EPROM), Programmable Read Only Memory (PROM), Read Only Memory (ROM), Magnetic Memory, Flash Memory, Magnetic Disk or Optical Disk.
电源组件806为电子设备800的各种组件提供电力。电源组件806可以包括电源管理系统,一个或多个电源,及其他与为电子设备800生成、管理和分配电力相关联的组件。The power supply component 806 provides power for various components of the electronic device 800. The power supply component 806 may include a power management system, one or more power supplies, and other components associated with the generation, management, and distribution of power for the electronic device 800.
多媒体组件808包括在所述电子设备800和用户之间的一个输出接口的屏幕。在一些实施例中,屏幕可以包括液晶显示器(LCD)和触摸面板(TP)。如果屏幕包括触摸面板,屏幕可以被实现为触摸屏,以接收来自用户的输入信号。触摸面板包括一个或多个触摸传感器以感测触摸、滑动和触摸面板上的手势。所述触摸传感器可以不仅感测触摸或滑动动作的边界,还检测与所述触摸或滑动操作相关的持续时间和压力。在一些实施例中,多媒体组件808包括一个前置摄像头和/或后置摄像头。当电子设备800处于操作模式,如拍摄模式或视频模式时,前置摄像头和/或后置摄像头可以接收外部的多媒体数据。每个前置摄像头和后置摄像头可以是一个固定的光学透镜系统或具有焦距和光学变焦能力。The multimedia component 808 includes a screen of an output interface between the electronic device 800 and the user. In some embodiments, the screen may include a liquid crystal display (LCD) and a touch panel (TP). If the screen includes a touch panel, the screen may be implemented as a touch screen to receive input signals from the user. The touch panel includes one or more touch sensors to sense touch, sliding, and gestures on the touch panel. The touch sensor may not only sense the boundary of a touch or slide action, but also detect the duration and pressure related to the touch or slide operation. In some embodiments, the multimedia component 808 includes a front camera and/or a rear camera. When the electronic device 800 is in an operation mode, such as a shooting mode or a video mode, the front camera and/or the rear camera can receive external multimedia data. Each front camera and rear camera can be a fixed optical lens system or have focal length and optical zoom capabilities.
音频组件810被配置为输出和/或输入音频信号。例如,音频组件810包括一个麦克风(MIC),当电子设备800处于操作模式,如呼叫模式、记录模式和语音识别模式时, 麦克风被配置为接收外部音频信号。所接收的音频信号可以被进一步存储在存储器804或经由通信组件816发送。在一些实施例中,音频组件810还包括一个扬声器,用于输出音频信号。The audio component 810 is configured to output and/or input audio signals. For example, the audio component 810 includes a microphone (MIC), and when the electronic device 800 is in an operation mode, such as a call mode, a recording mode, and a voice recognition mode, the microphone is configured to receive an external audio signal. The received audio signal may be further stored in the memory 804 or transmitted via the communication component 816. In some embodiments, the audio component 810 further includes a speaker for outputting audio signals.
输入/输出(I/O)接口812为处理组件802和外围接口模块之间提供接口,上述外围接口模块可以是键盘,点击轮,按钮等。这些按钮可包括但不限于:主页按钮、音量按钮、启动按钮和锁定按钮。An input/output (I/O) interface 812 provides an interface between the processing component 802 and a peripheral interface module. The peripheral interface module may be a keyboard, a click wheel, a button, and the like. These buttons may include, but are not limited to: home button, volume button, start button, and lock button.
传感器组件814包括一个或多个传感器,用于为电子设备800提供各个方面的状态评估。例如,传感器组件814可以检测到电子设备800的打开/关闭状态,组件的相对定位。例如所述组件为电子设备800的显示器和小键盘,传感器组件814还可以检测电子设备800或电子设备800一个组件的位置改变,用户与电子设备800接触的存在或不存在,电子设备800方位或加速/减速和电子设备800的温度变化。传感器组件814可以包括接近传感器,被配置用来在没有任何的物理接触时检测附近物体的存在。传感器组件814还可以包括光传感器,如CMOS或CCD图像传感器,用于在成像应用中使用。在一些实施例中,该传感器组件814还可以包括加速度传感器,陀螺仪传感器,磁传感器,压力传感器或温度传感器。The sensor component 814 includes one or more sensors for providing the electronic device 800 with various aspects of state evaluation. For example, the sensor component 814 can detect the on/off state of the electronic device 800 and the relative positioning of the components. For example, the components are the display and keypad of the electronic device 800, the sensor component 814 can also detect the position change of the electronic device 800 or a component of the electronic device 800, the presence or absence of contact between the user and the electronic device 800, the position of the electronic device 800 or Acceleration/deceleration and temperature changes of the electronic device 800. The sensor component 814 may include a proximity sensor configured to detect the presence of nearby objects when there is no physical contact. The sensor component 814 may also include a light sensor, such as a CMOS or CCD image sensor, for use in imaging applications. In some embodiments, the sensor component 814 may also include an acceleration sensor, a gyroscope sensor, a magnetic sensor, a pressure sensor, or a temperature sensor.
通信组件816被配置为便于电子设备800和其他设备之间有线或无线方式的通信。电子设备800可接入基于通信标准的无线网络,如WiFi,2G或3G,或它们的组合。在一个示例性实施例中,通信组件816经由广播信道接收来自外部广播管理系统的广播信号或广播相关信息。在一个示例性实施例中,通信组件816还包括近场通信(NFC)模块,以促进短程通信。例如,在NFC模块可基于射频识别(RFID)技术,红外数据协会(IrDA)技术,超宽带(UWB)技术,蓝牙(BT)技术和其他技术来实现。The communication component 816 is configured to facilitate wired or wireless communication between the electronic device 800 and other devices. The electronic device 800 can access a wireless network based on a communication standard, such as WiFi, 2G, or 3G, or a combination thereof. In an exemplary embodiment, the communication component 816 receives a broadcast signal or broadcast related information from an external broadcast management system via a broadcast channel. In an exemplary embodiment, the communication component 816 also includes a near field communication (NFC) module to facilitate short-range communication. For example, the NFC module can be implemented based on radio frequency identification (RFID) technology, infrared data association (IrDA) technology, ultra-wideband (UWB) technology, Bluetooth (BT) technology and other technologies.
在示例性实施例中,电子设备800可被一个或多个应用专用集成电路(ASIC)、数字信号处理器(DSP)、数字信号处理设备(DSPD)、可编程逻辑器件(PLD)、现场可编程门阵列(FPGA)、控制器、微控制器、微处理器或其他电子元件实现,用于执行上述方法。In an exemplary embodiment, the electronic device 800 may be implemented by one or more application-specific integrated circuits (ASIC), digital signal processors (DSP), digital signal processing devices (DSPD), programmable logic devices (PLD), field-available A programmable gate array (FPGA), controller, microcontroller, microprocessor, or other electronic components are implemented to implement the above methods.
在示例性实施例中,还提供了一种非易失性计算机可读存储介质,例如包括计算机程序指令的存储器804,上述计算机程序指令可由电子设备800的处理器820执行以完成上述方法。In an exemplary embodiment, there is also provided a non-volatile computer-readable storage medium, such as the memory 804 including computer program instructions, which can be executed by the processor 820 of the electronic device 800 to complete the foregoing method.
图11是根据一示例性实施例示出的一种电子设备900的框图。例如,电子设备900可以被提供为一服务器。参照图11,电子设备900包括处理组件922,其进一步包括一个或多个处理器,以及由存储器932所代表的存储器资源,用于存储可由处理组件922的执行的指令,例如应用程序。存储器932中存储的应用程序可以包括一个或一个以上的每一个对应于一组指令的模块。此外,处理组件922被配置为执行指令,以执行上述方法。Fig. 11 is a block diagram showing an electronic device 900 according to an exemplary embodiment. For example, the electronic device 900 may be provided as a server. 11, the electronic device 900 includes a processing component 922, which further includes one or more processors, and a memory resource represented by a memory 932, for storing instructions that can be executed by the processing component 922, such as an application program. The application program stored in the memory 932 may include one or more modules each corresponding to a set of instructions. In addition, the processing component 922 is configured to execute instructions to perform the above-described methods.
电子设备900还可以包括一个电源组件926被配置为执行电子设备900的电源管理,一个有线或无线网络接口950被配置为将电子设备900连接到网络,和一个输入/输出(I/O)接口958。电子设备900可以操作基于存储在存储器932的操作系统,例如Windows ServerTM,Mac OS XTM,UnixTM,LinuxTM,FreeBSDTM或类似。The electronic device 900 may also include a power component 926 configured to perform power management of the electronic device 900, a wired or wireless network interface 950 configured to connect the electronic device 900 to a network, and an input/output (I/O) interface 958. The electronic device 900 can operate based on an operating system stored in the memory 932, such as Windows ServerTM, Mac OS XTM, UnixTM, LinuxTM, FreeBSDTM or the like.
在示例性实施例中,还提供了一种非易失性计算机可读存储介质,例如包括计算机程序指令的存储器932,上述计算机程序指令可由电子设备900的处理组件922执行以完成上述方法。In an exemplary embodiment, a non-volatile computer-readable storage medium is also provided, such as a memory 932 including computer program instructions, which can be executed by the processing component 922 of the electronic device 900 to complete the foregoing method.
本公开可以是系统、方法和/或计算机程序产品。计算机程序产品可以包括计算机可读存储介质,其上载有用于使处理器实现本公开的各个方面的计算机可读程序指令。The present disclosure may be a system, method and/or computer program product. The computer program product may include a computer-readable storage medium loaded with computer-readable program instructions for enabling a processor to implement various aspects of the present disclosure.
计算机可读存储介质可以是可以保持和存储由指令执行设备使用的指令的有形设备。计算机可读存储介质例如可以包括但不限于电存储设备、磁存储设备、光存储设备、电磁存储设备、半导体存储设备或者上述的任意合适的组合。计算机可读存储介质的更具体的例子(非穷举的列表)包括:便携式计算机盘、硬盘、随机存取存储器(RAM)、只读存储器(ROM)、可擦式可编程只读存储器(EPROM或闪存)、静态随机存取存储器(SRAM)、便携式压缩盘只读存储器(CD-ROM)、数字多功能盘(DVD)、记忆棒、软盘、机械编码设备、例如其上存储有指令的打孔卡或凹槽内凸起结构、以及上述的任意合适的组合。这里所使用的计算机可读存储介质不被解释为瞬时信号本身,诸如无线电波或者其他自由传播的电磁波、通过波导或其他传输媒介传播的电磁波(例如,通过光纤电缆的光脉冲)、或者通过电线传输的电信号。The computer-readable storage medium may be a tangible device that can hold and store instructions used by the instruction execution device. The computer-readable storage medium may include, but is not limited to, an electrical storage device, a magnetic storage device, an optical storage device, an electromagnetic storage device, a semiconductor storage device, or any suitable combination of the foregoing, for example. More specific examples of computer-readable storage media (non-exhaustive list) include: portable computer disks, hard disks, random access memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM) Or flash memory), static random access memory (SRAM), portable compact disk read-only memory (CD-ROM), digital versatile disk (DVD), memory stick, floppy disk, mechanical encoding device, such as a printer with instructions stored thereon The protruding structure in the hole card or the groove, and any suitable combination of the above. The computer-readable storage medium used here is not interpreted as the instantaneous signal itself, such as radio waves or other freely propagating electromagnetic waves, electromagnetic waves propagating through waveguides or other transmission media (for example, light pulses through fiber optic cables), or through wires Transmission of electrical signals.
这里所描述的计算机可读程序指令可以从计算机可读存储介质下载到各个计算/处理设备,或者通过网络、例如因特网、局域网、广域网和/或无线网下载到外部计算机或外部存储设备。网络可以包括铜传输电缆、光纤传输、无线传输、路由器、防火墙、交换机、网关计算机和/或边缘服务器。每个计算/处理设备中的网络适配卡或者网络接口从网络接收计算机可读程序指令,并转发该计算机可读程序指令,以供存储在各个计算/处理设备中的计算机可读存储介质中。The computer-readable program instructions described herein can be downloaded from a computer-readable storage medium to various computing/processing devices, or downloaded to an external computer or external storage device via a network, such as the Internet, a local area network, a wide area network, and/or a wireless network. The network may include copper transmission cables, optical fiber transmission, wireless transmission, routers, firewalls, switches, gateway computers, and/or edge servers. The network adapter card or network interface in each computing/processing device receives computer-readable program instructions from the network, and forwards the computer-readable program instructions for storage in the computer-readable storage medium in each computing/processing device .
用于执行本公开操作的计算机程序指令可以是汇编指令、指令集架构(ISA)指令、机器指令、机器相关指令、微代码、固件指令、状态设置数据、或者以一种或多种编程语言的任意组合编写的源代码或目标代码,所述编程语言包括面向对象的编程语言—诸如Smalltalk、C++等,以及常规的过程式编程语言—诸如“C”语言或类似的编程语言。计算机可读程序指令可以完全地在用户计算机上执行、部分地在用户计算机上执行、作为一个独立的软件包执行、部分在用户计算机上部分在远程计算机上执行、或者完全在远程计算机或服务器上执行。在涉及远程计算机的情形中,远程计算机可以通过任意种类的网络—包括局域网(LAN)或广域网(WAN)—连接到用户计算机,或者,可以连接到外部计算机(例如利用因特网服务提供商来通过因特网连接)。在一些实施例中, 通过利用计算机可读程序指令的状态信息来个性化定制电子电路,例如可编程逻辑电路、现场可编程门阵列(FPGA)或可编程逻辑阵列(PLA),该电子电路可以执行计算机可读程序指令,从而实现本公开的各个方面。The computer program instructions used to perform the operations of the present disclosure may be assembly instructions, instruction set architecture (ISA) instructions, machine instructions, machine-related instructions, microcode, firmware instructions, state setting data, or in one or more programming languages. Source code or object code written in any combination, the programming language includes object-oriented programming languages such as Smalltalk, C++, etc., and conventional procedural programming languages such as "C" language or similar programming languages. Computer-readable program instructions can be executed entirely on the user's computer, partly on the user's computer, executed as a stand-alone software package, partly on the user's computer and partly executed on a remote computer, or entirely on the remote computer or server implement. In the case of a remote computer, the remote computer can be connected to the user's computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or it can be connected to an external computer (for example, using an Internet service provider to access the Internet). connect). In some embodiments, an electronic circuit, such as a programmable logic circuit, a field programmable gate array (FPGA), or a programmable logic array (PLA), can be customized by using the status information of the computer-readable program instructions. The computer-readable program instructions are executed to realize various aspects of the present disclosure.
这里参照根据本公开实施例的方法、装置(系统)和计算机程序产品的流程图和/或框图描述了本公开的各个方面。应当理解,流程图和/或框图的每个方框以及流程图和/或框图中各方框的组合,都可以由计算机可读程序指令实现。Here, various aspects of the present disclosure are described with reference to flowcharts and/or block diagrams of methods, devices (systems) and computer program products according to embodiments of the present disclosure. It should be understood that each block of the flowcharts and/or block diagrams, and combinations of blocks in the flowcharts and/or block diagrams, can be implemented by computer-readable program instructions.
这些计算机可读程序指令可以提供给通用计算机、专用计算机或其它可编程数据处理装置的处理器,从而生产出一种机器,使得这些指令在通过计算机或其它可编程数据处理装置的处理器执行时,产生了实现流程图和/或框图中的一个或多个方框中规定的功能/动作的装置。也可以把这些计算机可读程序指令存储在计算机可读存储介质中,这些指令使得计算机、可编程数据处理装置和/或其他设备以特定方式工作,从而,存储有指令的计算机可读介质则包括一个制造品,其包括实现流程图和/或框图中的一个或多个方框中规定的功能/动作的各个方面的指令。These computer-readable program instructions can be provided to the processor of a general-purpose computer, a special-purpose computer, or other programmable data processing device, thereby producing a machine that makes these instructions when executed by the processor of the computer or other programmable data processing device , A device that implements the functions/actions specified in one or more blocks in the flowcharts and/or block diagrams is produced. It is also possible to store these computer-readable program instructions in a computer-readable storage medium. These instructions make computers, programmable data processing apparatuses, and/or other devices work in a specific manner, so that the computer-readable medium storing the instructions includes An article of manufacture, which includes instructions for implementing various aspects of the functions/actions specified in one or more blocks in the flowcharts and/or block diagrams.
也可以把计算机可读程序指令加载到计算机、其它可编程数据处理装置、或其它设备上,使得在计算机、其它可编程数据处理装置或其它设备上执行一系列操作步骤,以产生计算机实现的过程,从而使得在计算机、其它可编程数据处理装置、或其它设备上执行的指令实现流程图和/或框图中的一个或多个方框中规定的功能/动作。It is also possible to load computer-readable program instructions onto a computer, other programmable data processing device, or other equipment, so that a series of operation steps are executed on the computer, other programmable data processing device, or other equipment to produce a computer-implemented process , So that the instructions executed on the computer, other programmable data processing apparatus, or other equipment realize the functions/actions specified in one or more blocks in the flowcharts and/or block diagrams.
附图中的流程图和框图显示了根据本公开的多个实施例的系统、方法和计算机程序产品的可能实现的体系架构、功能和操作。在这点上,流程图或框图中的每个方框可以代表一个模块、程序段或指令的一部分,所述模块、程序段或指令的一部分包含一个或多个用于实现规定的逻辑功能的可执行指令。在有些作为替换的实现中,方框中所标注的功能也可以以不同于附图中所标注的顺序发生。例如,两个连续的方框实际上可以基本并行地执行,它们有时也可以按相反的顺序执行,这依所涉及的功能而定。也要注意的是,框图和/或流程图中的每个方框、以及框图和/或流程图中的方框的组合,可以用执行规定的功能或动作的专用的基于硬件的系统来实现,或者可以用专用硬件与计算机指令的组合来实现。The flowcharts and block diagrams in the accompanying drawings show the possible implementation architecture, functions, and operations of the system, method, and computer program product according to multiple embodiments of the present disclosure. In this regard, each block in the flowchart or block diagram may represent a module, program segment, or part of an instruction, and the module, program segment, or part of an instruction contains one or more components for realizing the specified logical function. Executable instructions. In some alternative implementations, the functions marked in the block may also occur in a different order from the order marked in the drawings. For example, two consecutive blocks can actually be executed substantially in parallel, or they can sometimes be executed in the reverse order, depending on the functions involved. It should also be noted that each block in the block diagram and/or flowchart, and the combination of the blocks in the block diagram and/or flowchart, can be implemented by a dedicated hardware-based system that performs the specified functions or actions Or it can be realized by a combination of dedicated hardware and computer instructions.
在不违背逻辑的情况下,本公开不同实施例之间可以相互结合,不同实施例描述有所侧重,为侧重描述的部分可以参见其他实施例的记载。Without violating logic, different embodiments of the present disclosure can be combined with each other, and the description of different embodiments is emphasized. For the part of the description, reference may be made to the records of other embodiments.
以上已经描述了本公开的各实施例,上述说明是示例性的,并非穷尽性的,并且也不限于所披露的各实施例。在不偏离所说明的各实施例的范围和精神的情况下,对于本技术领域的普通技术人员来说许多修改和变更都是显而易见的。本文中所用术语的选择,旨在最好地解释各实施例的原理、实际应用或对市场中技术的技术改进,或者使本技术领域的其它普通技术人员能理解本文披露的各实施例。The embodiments of the present disclosure have been described above, and the above description is exemplary, not exhaustive, and is not limited to the disclosed embodiments. Without departing from the scope and spirit of the illustrated embodiments, many modifications and changes are obvious to those of ordinary skill in the art. The choice of terms used herein is intended to best explain the principles, practical applications, or technical improvements of the technologies in the market, or to enable other ordinary skilled in the art to understand the embodiments disclosed herein.

Claims (15)

  1. 一种目标检测方法,包括:A target detection method includes:
    获取点云信息,所述点云信息至少包括目标对象及待检测对象对应的点云信息,其中,所述待检测对象为所述目标对象周围的人或物;Acquiring point cloud information, where the point cloud information includes at least a target object and point cloud information corresponding to an object to be detected, wherein the object to be detected is a person or thing around the target object;
    根据所述点云信息,得到网格信息,所述网格信息至少包括指示所述待检测对象的障碍点信息;Obtaining grid information according to the point cloud information, where the grid information includes at least obstacle point information indicating the object to be detected;
    根据所述网格信息,识别出所述待检测对象中对所述目标对象的移动造成影响的障碍物。According to the grid information, an obstacle in the object to be detected that affects the movement of the target object is identified.
  2. 根据权利要求1所述的方法,其特征在于,所述获取点云信息,包括:The method according to claim 1, wherein said acquiring point cloud information comprises:
    获取通过至少两个传感器分别扫描得到的多个待处理的点云信息;Acquiring multiple to-be-processed point cloud information scanned by at least two sensors;
    将所述多个待处理的点云信息进行拼接处理,得到所述点云信息。The multiple point cloud information to be processed are spliced to obtain the point cloud information.
  3. 根据权利要求1或2所述的方法,其特征在于,所述点云信息还包括传感器标识,所述根据所述点云信息,得到网格信息,包括:The method according to claim 1 or 2, wherein the point cloud information further includes a sensor identifier, and the obtaining grid information according to the point cloud information includes:
    对所述点云信息进行网格化处理,得到网格图,所述网格图包括多个网格区域,且每个所述网格区域对应的所述障碍点信息为第一值;Performing griding processing on the point cloud information to obtain a grid graph, the grid graph including a plurality of grid regions, and the obstacle point information corresponding to each grid region is a first value;
    针对每个所述网格区域,For each grid area,
    根据所述网格区域包括的像素点对应的传感器标识,确定所述网格区域中是否存在对应所述待检测对象的障碍点;Determining whether there is an obstacle point corresponding to the object to be detected in the grid area according to the sensor identifiers corresponding to the pixels included in the grid area;
    在所述网格区域中存在有所述障碍点的情况下,将所述网格信息中对应所述网格区域的所述障碍点信息更新为第二值。In a case where the obstacle point exists in the grid area, the obstacle point information corresponding to the grid area in the grid information is updated to a second value.
  4. 根据权利要求3所述的方法,其特征在于,所述根据所述网格区域包括的像素点对应的传感器标识,确定所述网格区域中是否存在对应所述待检测对象的障碍点,包括:The method according to claim 3, wherein the determining whether there are obstacle points corresponding to the object to be detected in the grid area according to the sensor identifiers corresponding to the pixels included in the grid area comprises :
    在所述网格区域中的至少两个像素点对应的传感器标识不同的情况下,确定所述网格区域中存在所述障碍点。In a case where the sensor identifiers corresponding to at least two pixel points in the grid area are different, it is determined that the obstacle point exists in the grid area.
  5. 根据权利要求3或4所述的方法,其特征在于,所述点云信息还包括高度信息,所述在所述目标网格区域中存在有所述障碍点的情况下,将所述网格信息中对应所述网格区域的所述障碍点信息更新为第二值,还包括:The method according to claim 3 or 4, wherein the point cloud information further includes height information, and when the obstacle point exists in the target grid area, the grid The update of the obstacle point information corresponding to the grid area to the second value in the information further includes:
    根据所述网格区域中的像素点的高度信息,确定所述网格区域中存在的所述障碍点的类别;Determine the category of the obstacle point existing in the grid area according to the height information of the pixel points in the grid area;
    根据所述障碍点的类别,更新所述网格信息中对应所述网格区域的所述障碍点信息。According to the category of the obstacle point, the obstacle point information corresponding to the grid area in the grid information is updated.
  6. 根据权利要求5所述的方法,其特征在于,所述根据所述网格区域中的像素点的高度信息,确定所述网格区域中存在的所述障碍点的类别,包括:The method according to claim 5, wherein the determining the category of the obstacle point existing in the grid area according to the height information of the pixel points in the grid area comprises:
    获取所述网格区域中至少两个像素点分别对应的传感器标识及高度信息;Acquiring sensor identifiers and height information respectively corresponding to at least two pixels in the grid area;
    将所述至少两个像素点中对应同一个传感器标识的像素点作为一组数据,得到多组像素点数据;Taking the pixel points corresponding to the same sensor identifier among the at least two pixel points as a set of data to obtain multiple sets of pixel point data;
    根据所述高度信息,确定每一组所述像素点数据中的最小高度值;Determining the minimum height value in each group of pixel data according to the height information;
    对所述多组像素点数据中的最小高度值进行归类处理,获得一个或多个最小高度类;Categorizing the minimum height values in the multiple sets of pixel data to obtain one or more minimum height categories;
    根据每个所述最小高度类所包括的高度值的数量及其中最小值,确定所述网格区域中存在的障碍点的类别。According to the number of height values included in each of the minimum height classes and the minimum value thereof, the types of obstacle points existing in the grid area are determined.
  7. 根据权利要求6所述的方法,其特征在于,所述根据每个所述最小高度类所包括的高度值的数量及其中最小值,确定所述网格区域中存在的障碍点的类别,包括:The method according to claim 6, wherein the determining the category of obstacle points existing in the grid area according to the number of height values included in each of the minimum height classes and the minimum value thereof includes :
    在所述一个或多个最小高度类中存在目标最小高度类的情况下,则确定所述网格区域中存在的障碍点对应障碍物,其中In the case where there is a target minimum height class in the one or more minimum height classes, it is determined that the obstacle points existing in the grid area correspond to the obstacle, where
    所述目标最小高度类所包括的高度值的数量大于或等于预设的数量阈值,The number of height values included in the target minimum height class is greater than or equal to a preset number threshold,
    所述目标最小高度类所包括的高度值中的最小值小于或等于预设的高度阈值;The minimum value of the height values included in the target minimum height class is less than or equal to a preset height threshold;
    在所述一个或多个最小高度类中不存在所述目标最小高度类的情况下,则确定所述网格区域中存在的障碍点对应非障碍物。In the case that the target minimum height class does not exist in the one or more minimum height classes, it is determined that the obstacle points existing in the grid area correspond to non-obstacles.
  8. 根据权利要求5至7中任一项所述的方法,其特征在于,所述障碍点的类别包括所述障碍点对应障碍物和所述障碍点对应非障碍物,所述根据所述障碍点的类别,更新所述网格信息中对应所述网格区域的所述障碍点信息,包括:The method according to any one of claims 5 to 7, wherein the categories of the obstacle points include obstacles corresponding to the obstacle points and non-obstacles corresponding to the obstacle points. The category of updating the obstacle point information corresponding to the grid area in the grid information includes:
    在所述障碍点的类别表示所述障碍点对应障碍物的情况下,将所述网格信息中对应所述网格区域的所述障碍点信息保持为所述第二值;In the case where the category of the obstacle point indicates that the obstacle point corresponds to an obstacle, maintaining the obstacle point information corresponding to the grid area in the grid information as the second value;
    在所述障碍点的类别表示所述障碍点对应非障碍物的情况下,将所述网格信息中对应所述网格区域的所述障碍点信息更新为所述第一值。When the category of the obstacle point indicates that the obstacle point corresponds to a non-obstacle, the obstacle point information corresponding to the grid area in the grid information is updated to the first value.
  9. 根据权利要求1至8中任一项所述的方法,其特征在于,所述根据所述网格信息,识别出所述待检测对象中对所述目标对象的移动造成影响的障碍物,包括:The method according to any one of claims 1 to 8, wherein the identifying an obstacle in the object to be detected that affects the movement of the target object according to the grid information comprises :
    根据所述网格信息中的所述障碍点信息进行连通区域分析,得到连通区域;Perform a connected area analysis according to the obstacle point information in the grid information to obtain a connected area;
    根据所述连通区域,识别出所述待检测对象中的所述障碍物。According to the connected area, the obstacle in the object to be detected is identified.
  10. 根据权利要求9所述的方法,还包括:The method according to claim 9, further comprising:
    获取所述连通区域的第一线段上的多个待处理点;Acquiring multiple points to be processed on the first line segment of the connected region;
    从所述多个待处理点中选取至少两个参考点;Selecting at least two reference points from the plurality of points to be processed;
    连接所述至少两个参考点得到第二线段,根据所述第二线段调整所述连通区域后得到第一区域。The second line segment is obtained by connecting the at least two reference points, and the first area is obtained after adjusting the connected area according to the second line segment.
  11. 根据权利要求1至10中任一项所述的方法,还包括:The method according to any one of claims 1 to 10, further comprising:
    从所述点云信息中提取所述目标对象对应的点云信息,根据所述目标对象对应的点云信息中像素点的坐标,得到所述目标对象对应的目标位置;Extracting the point cloud information corresponding to the target object from the point cloud information, and obtaining the target position corresponding to the target object according to the coordinates of the pixel points in the point cloud information corresponding to the target object;
    获取基于所述网格信息识别出的至少两个障碍物;Acquiring at least two obstacles identified based on the grid information;
    以所述目标位置的中心点为基准,根据预设角度发出的指引线得到扇形区域;Using the center point of the target position as a reference, obtain a fan-shaped area according to a guide line issued at a preset angle;
    在所述扇形区域覆盖第一障碍物和第二障碍物,且所述第二障碍物被所述第一障碍物遮挡的情况下,将所述第二障碍物的障碍点信息从所述网格信息中删除。In the case that the fan-shaped area covers the first obstacle and the second obstacle, and the second obstacle is blocked by the first obstacle, the obstacle point information of the second obstacle is obtained from the network Delete from the grid information.
  12. 一种目标检测装置,包括:A target detection device includes:
    获取单元,用于获取点云信息,所述点云信息至少包括目标对象及待检测对象对应的点云信息,其中,所述待检测对象为所述目标对象周围的人或物;An acquiring unit, configured to acquire point cloud information, the point cloud information includes at least a target object and point cloud information corresponding to an object to be detected, wherein the object to be detected is a person or thing around the target object;
    信息处理单元,用于根据所述点云信息,得到网格信息,所述网格信息至少包括指示所述待检测对象的障碍点信息;An information processing unit, configured to obtain grid information according to the point cloud information, where the grid information includes at least obstacle point information indicating the object to be detected;
    检测单元,用于根据所述网格信息,识别出所述待检测对象中对所述目标对象的移动造成影响的障碍物。The detection unit is configured to identify obstacles in the object to be detected that affect the movement of the target object according to the grid information.
  13. 一种电子设备,包括:An electronic device including:
    处理器;processor;
    用于存储处理器可执行指令的存储器;A memory for storing processor executable instructions;
    其中,所述处理器被配置为执行权利要求1至11中任意一项所述的方法。Wherein, the processor is configured to execute the method of any one of claims 1-11.
  14. 一种计算机可读存储介质,其上存储有计算机程序指令,所述计算机程序指令被处理器执行时实现权利要求1至11中任意一项所述的方法。A computer-readable storage medium having computer program instructions stored thereon, and when the computer program instructions are executed by a processor, the method according to any one of claims 1 to 11 is implemented.
  15. 一种计算机程序,所述计算机程序存储在存储介质中,当处理器执行所述计算机程序时,所述处理器用于执行权利要求1-11任一所述的目标检测方法。A computer program, the computer program is stored in a storage medium, and when a processor executes the computer program, the processor is used to execute the target detection method according to any one of claims 1-11.
PCT/CN2021/087424 2020-04-20 2021-04-15 Target detection method and apparatus, and electronic device, storage medium and program WO2021213241A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
KR1020217043313A KR20220016221A (en) 2020-04-20 2021-04-15 Target detection method and apparatus, electronic device, storage medium and program
JP2021577017A JP2022539093A (en) 2020-04-20 2021-04-15 Target detection method and device, electronic device, storage medium, and program

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202010314166.6A CN111507973B (en) 2020-04-20 2020-04-20 Target detection method and device, electronic equipment and storage medium
CN202010314166.6 2020-04-20

Publications (1)

Publication Number Publication Date
WO2021213241A1 true WO2021213241A1 (en) 2021-10-28

Family

ID=71878738

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/087424 WO2021213241A1 (en) 2020-04-20 2021-04-15 Target detection method and apparatus, and electronic device, storage medium and program

Country Status (4)

Country Link
JP (1) JP2022539093A (en)
KR (1) KR20220016221A (en)
CN (1) CN111507973B (en)
WO (1) WO2021213241A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117091516A (en) * 2022-05-12 2023-11-21 广州镭晨智能装备科技有限公司 Method, system and storage medium for detecting thickness of circuit board protective layer

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111507973B (en) * 2020-04-20 2024-04-12 上海商汤临港智能科技有限公司 Target detection method and device, electronic equipment and storage medium
CN112697188B (en) * 2020-12-08 2022-12-23 北京百度网讯科技有限公司 Detection system test method and device, computer equipment, medium and program product
CN113901970B (en) * 2021-12-08 2022-05-24 深圳市速腾聚创科技有限公司 Obstacle detection method and apparatus, medium, and electronic device
CN115330969A (en) * 2022-10-12 2022-11-11 之江实验室 Local static environment vectorization description method for ground unmanned vehicle

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102779280A (en) * 2012-06-19 2012-11-14 武汉大学 Traffic information extraction method based on laser sensor
CN105957145A (en) * 2016-04-29 2016-09-21 百度在线网络技术(北京)有限公司 Road barrier identification method and device
CN106951847A (en) * 2017-03-13 2017-07-14 百度在线网络技术(北京)有限公司 Obstacle detection method, device, equipment and storage medium
US20190035148A1 (en) * 2017-07-28 2019-01-31 The Boeing Company Resolution adaptive mesh that is generated using an intermediate implicit representation of a point cloud
CN109840448A (en) * 2017-11-24 2019-06-04 百度在线网络技术(北京)有限公司 Information output method and device for automatic driving vehicle
CN111507973A (en) * 2020-04-20 2020-08-07 上海商汤临港智能科技有限公司 Target detection method and device, electronic equipment and storage medium

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11144747B2 (en) * 2017-03-31 2021-10-12 Pioneer Corporation 3D data generating device, 3D data generating method, 3D data generating program, and computer-readable recording medium storing 3D data generating program
CN109145677A (en) * 2017-06-15 2019-01-04 百度在线网络技术(北京)有限公司 Obstacle detection method, device, equipment and storage medium
JP6969738B2 (en) * 2017-07-10 2021-11-24 株式会社Zmp Object detection device and method
JP7056842B2 (en) * 2018-03-23 2022-04-19 株式会社豊田中央研究所 State estimator and program
JP7128577B2 (en) * 2018-03-30 2022-08-31 セコム株式会社 monitoring device
JP2019207655A (en) * 2018-05-30 2019-12-05 株式会社Ihi Detection device and detection system
JP7479799B2 (en) * 2018-08-30 2024-05-09 キヤノン株式会社 Information processing device, information processing method, program, and system
CN110147706B (en) * 2018-10-24 2022-04-12 腾讯科技(深圳)有限公司 Obstacle recognition method and device, storage medium, and electronic device
CN109635685B (en) * 2018-11-29 2021-02-12 北京市商汤科技开发有限公司 Target object 3D detection method, device, medium and equipment

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102779280A (en) * 2012-06-19 2012-11-14 武汉大学 Traffic information extraction method based on laser sensor
CN105957145A (en) * 2016-04-29 2016-09-21 百度在线网络技术(北京)有限公司 Road barrier identification method and device
CN106951847A (en) * 2017-03-13 2017-07-14 百度在线网络技术(北京)有限公司 Obstacle detection method, device, equipment and storage medium
US20190035148A1 (en) * 2017-07-28 2019-01-31 The Boeing Company Resolution adaptive mesh that is generated using an intermediate implicit representation of a point cloud
CN109840448A (en) * 2017-11-24 2019-06-04 百度在线网络技术(北京)有限公司 Information output method and device for automatic driving vehicle
CN111507973A (en) * 2020-04-20 2020-08-07 上海商汤临港智能科技有限公司 Target detection method and device, electronic equipment and storage medium

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
SIHENG CHEN; BAOAN LIU; CHEN FENG; CARLOS VALLESPI-GONZALEZ; CARL WELLINGTON: "3D Point Cloud Processing and Learning for Autonomous Driving", ARXIV.ORG, CORNELL UNIVERSITY LIBRARY, 201 OLIN LIBRARY CORNELL UNIVERSITY ITHACA, NY 14853, 1 March 2020 (2020-03-01), 201 Olin Library Cornell University Ithaca, NY 14853 , XP081612080 *
XIN, YU ET AL.: "Dynamic Obstacle Detection and Representation Approach for Unmanned Vehicles Based on Laser Sensor", ROBOT, vol. 36, no. 6, 30 November 2014 (2014-11-30), pages 654 - 661, XP055861209, ISSN: 1002-0446 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117091516A (en) * 2022-05-12 2023-11-21 广州镭晨智能装备科技有限公司 Method, system and storage medium for detecting thickness of circuit board protective layer

Also Published As

Publication number Publication date
JP2022539093A (en) 2022-09-07
CN111507973A (en) 2020-08-07
CN111507973B (en) 2024-04-12
KR20220016221A (en) 2022-02-08

Similar Documents

Publication Publication Date Title
WO2021213241A1 (en) Target detection method and apparatus, and electronic device, storage medium and program
US11468581B2 (en) Distance measurement method, intelligent control method, electronic device, and storage medium
US11308809B2 (en) Collision control method and apparatus, and storage medium
US20210009080A1 (en) Vehicle door unlocking method, electronic device and storage medium
EP3252658B1 (en) Information processing apparatus and information processing method
US11301726B2 (en) Anchor determination method and apparatus, electronic device, and storage medium
CN111340766A (en) Target object detection method, device, equipment and storage medium
KR20180068578A (en) Electronic device and method for recognizing object by using a plurality of senses
CN106934347B (en) Obstacle identification method and device, computer equipment and readable medium
KR102129698B1 (en) Automatic fish counting system
KR20200081450A (en) Biometric detection methods, devices and systems, electronic devices and storage media
WO2021103423A1 (en) Method and apparatus for detecting pedestrian events, electronic device and storage medium
CN113064135B (en) Method and device for detecting obstacle in 3D radar point cloud continuous frame data
KR20220062107A (en) Light intensity control method, apparatus, electronic device and storage medium
CN109696173A (en) A kind of car body air navigation aid and device
US20220035003A1 (en) Method and apparatus for high-confidence people classification, change detection, and nuisance alarm rejection based on shape classifier using 3d point cloud data
CN116420058A (en) Replacing autonomous vehicle data
KR20210148134A (en) Object counting method, apparatus, electronic device, storage medium and program
KR20180125858A (en) Electronic device and method for controlling operation of vehicle
CN114332821A (en) Decision information acquisition method, device, terminal and storage medium
CN115641518A (en) View sensing network model for unmanned aerial vehicle and target detection method
CN110390252B (en) Obstacle detection method and device based on prior map information and storage medium
CN111860074B (en) Target object detection method and device, and driving control method and device
KR102120812B1 (en) Target recognition and classification system based on probability fusion of camera-radar and method thereof
CN113450459A (en) Method and device for constructing three-dimensional model of target object

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21792821

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 2021577017

Country of ref document: JP

Kind code of ref document: A

ENP Entry into the national phase

Ref document number: 20217043313

Country of ref document: KR

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21792821

Country of ref document: EP

Kind code of ref document: A1

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM1205 DATED 12.04.2023)

122 Ep: pct application non-entry in european phase

Ref document number: 21792821

Country of ref document: EP

Kind code of ref document: A1