WO2021213241A1

WO2021213241A1 - Target detection method and apparatus, and electronic device, storage medium and program

Info

Publication number: WO2021213241A1
Application number: PCT/CN2021/087424
Authority: WO
Inventors: 周辉; 洪方舟; 王哲; 石建萍
Original assignee: 上海商汤临港智能科技有限公司
Priority date: 2020-04-20
Filing date: 2021-04-15
Publication date: 2021-10-28
Also published as: JP2022539093A; CN111507973A; CN111507973B; KR20220016221A

Abstract

The present disclosure relates to a target detection method and apparatus, and an electronic device, a storage medium and a program. An example of the method comprises: obtaining point cloud information, the point cloud information at least comprising a target object and point cloud information corresponding to an object to be detected, and said object being a person or thing around the target object; obtaining grid information according to the point cloud information, the grid information at least comprising obstacle point information indicating said object; and identifying, according to the grid information, an obstacle in said object that affects the movement of the target object.

Description

Target detection method and device, electronic equipment, storage medium and program

Cross-references to related applications

This patent application claims priority to Chinese patent application No. 2020103141666, filed on April 20, 2020 and titled "target detection method and device, electronic equipment and storage medium", which is incorporated herein by reference.

Technical field

The present disclosure relates to the field of automatic driving technology, and in particular to a target detection method and device, electronic equipment, storage medium, and program.

Background technique

Detecting obstacles is an important part of ensuring safe driving in automatic driving. Target detection can use deep learning technology based on neural networks to predict the possible size and location of obstacles. However, the accuracy of such detection depends on the specific types of training data and on the quality of the training algorithm, which results in low detection accuracy for obstacles. The related art offers no effective solution to this problem.

Summary of the invention

The present disclosure proposes a technical solution for target detection.

According to an aspect of the present disclosure, there is provided a target detection method, the method including: acquiring point cloud information, the point cloud information including at least point cloud information corresponding to a target object and to an object to be detected, wherein the object to be detected is a person or thing around the target object; obtaining grid information according to the point cloud information, the grid information including at least obstacle point information indicating the object to be detected; and identifying, according to the grid information, obstacles in the object to be detected that affect the movement of the target object.

According to an aspect of the present disclosure, there is also provided a target detection device, including: an acquisition unit configured to acquire point cloud information, the point cloud information including at least point cloud information corresponding to a target object and to an object to be detected, wherein the object to be detected is a person or thing around the target object; an information processing unit configured to obtain grid information according to the point cloud information, the grid information including at least obstacle point information indicating the object to be detected; and a detection unit configured to identify, according to the grid information, obstacles in the object to be detected that affect the movement of the target object.

According to an aspect of the present disclosure, there is also provided an electronic device, including: a processor; and a memory for storing instructions executable by the processor. Wherein, the processor is configured to execute the above-mentioned target detection method.

According to an aspect of the present disclosure, there is also provided a computer-readable storage medium on which computer program instructions are stored, and the computer program instructions implement the above-mentioned target detection method when executed by a processor.

According to an aspect of the present disclosure, a computer program is also provided, the computer program is stored in a storage medium, and when a processor executes the computer program, the processor is used to execute the above-mentioned target detection method.

According to an example of the present disclosure, grid information is obtained according to point cloud information corresponding to at least the target object and the object to be detected, and the grid information includes at least obstacle point information indicating the object to be detected, so that obstacles in the object to be detected that affect the movement of the target object can be identified according to the grid information. Since the content of the point cloud information is relatively rich and is not limited to a specific type of object, such as a vehicle or a pedestrian, the technical solution of the present disclosure is suitable for more target detection scenarios. In addition, by identifying the obstacle in the object to be detected according to the grid information including the obstacle point information, the target detection accuracy for the obstacle is effectively improved.

The above general description and the following detailed description are only exemplary and explanatory, and do not limit the present disclosure.

According to the following detailed description of exemplary embodiments with reference to the accompanying drawings, other features and aspects of the present disclosure will become clear.

Description of the drawings

The drawings herein are incorporated into the specification and constitute a part of the specification. These drawings illustrate embodiments that conform to the present disclosure, and are used together with the specification to explain the technical solutions of the present disclosure.

Fig. 1 shows a flowchart of a target detection method according to an embodiment of the present disclosure.

Fig. 2 shows a schematic diagram of grid information according to an embodiment of the present disclosure.

Fig. 3 shows a schematic diagram of different ring IDs of pixel sources in a grid area according to an embodiment of the present disclosure.

Fig. 4 shows a schematic diagram of the source of pixels in the grid area with the same ring ID according to an embodiment of the present disclosure.

Fig. 5 shows a schematic diagram of obstacle point information in each grid area according to an embodiment of the present disclosure.

Figures 6a-6b show schematic diagrams of the connectivity modes of a connected area according to an embodiment of the present disclosure.

Fig. 7 shows a schematic diagram of an obstacle in a grid map according to an embodiment of the present disclosure.

Fig. 8 shows a schematic diagram of deleting obstructed obstacles in a grid map according to an embodiment of the present disclosure.

Fig. 9 shows a block diagram of a target detection device according to an embodiment of the present disclosure.

Fig. 10 shows a block diagram of an electronic device according to an embodiment of the present disclosure.

Fig. 11 shows a block diagram of an electronic device according to an embodiment of the present disclosure.

Detailed description

Various exemplary embodiments, features, and aspects of the present disclosure will be described in detail below with reference to the drawings. The same reference numerals in the drawings indicate elements with the same or similar functions. Although various aspects of the embodiments are shown in the drawings, unless otherwise noted, the drawings are not necessarily drawn to scale.

The dedicated word "exemplary" here means "serving as an example, embodiment, or illustration." Any embodiment described herein as "exemplary" need not be construed as being superior or better than other embodiments.

The term "and/or" in this document is merely an association relationship that describes the associated objects, which means that there can be three types of relationships. For example, A and/or B can mean: A alone exists, A and B exist at the same time, and B exists alone. In addition, the term "at least one" herein means any one of a plurality of types or any combination of at least two of the plurality of types. For example, including at least one of A, B, and C may mean including any one or more elements selected from the set consisting of A, B, and C.

In addition, numerous specific details are given in the following specific embodiments. Those skilled in the art should understand that the present disclosure can also be implemented without certain specific details. In some instances, the methods, means, elements, and circuits well known to those skilled in the art have not been described in detail, so as to highlight the gist of the present disclosure.

Target objects, such as vehicles or pedestrians in autonomous driving or unmanned driving scenes, can be detected by using deep learning technology based on neural networks.

On the one hand, the accuracy of target detection based on deep learning technology depends on specific types of training data, which limits the scenarios to which it applies. That is, a neural network trained with deep learning technology works for the specific scene related to the selected training data, but cannot be generalized to other scenes. For example, target detection of vehicles or pedestrians is a common scene, so a large amount of related data has been accumulated. When these data are used as a specific type of training data, the trained neural network looks for objects in the input data that match these types of features, which ensures the accuracy of target detection in that specific scene. However, uncommon objects, such as an irregularly shaped tree trunk or obstacles such as rocks, are not covered by the training data, so such obstacles are difficult to detect and the trained neural network is difficult to apply to arbitrary other scenes. In other words, a neural network trained in one scene performs poorly in a different type of scene, that is, its generalization ability is weak. Moreover, deep learning technology essentially fits a complex function (the hypothesis) to the given data (the expected target), so that data conforming to the same distribution can be input into the function to obtain correct results. To obtain this hypothesis, however, the training process may become overly complicated, which is prone to overfitting. Furthermore, if the input data does not conform to the distribution of the training data, the results may be inaccurate. Because it is difficult for training data to cover all possible road conditions, highly reliable results can only be given for the specific training data and the related specific scenes.

On the other hand, the accuracy of target detection based on deep learning technology also depends on the quality of the training algorithm. The behavior of deep learning is not completely controllable, that is, the prediction result for a given input cannot be fully anticipated, so it is difficult to achieve the ideal recall rate of 100%. Here, the recall rate refers to the number of objects identified through target detection divided by the number of actual objects. Generally speaking, in autonomous driving or unmanned driving scenarios, the higher the recall rate, the safer the driving.
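The recall rate defined above can be written as a one-line computation. The following is a minimal sketch; the function name and the example counts are illustrative assumptions, and the detected count is assumed to include only objects that actually exist.

```python
def recall(num_detected: int, num_actual: int) -> float:
    """Recall as defined in the text: identified objects / actual objects."""
    if num_actual <= 0:
        raise ValueError("num_actual must be positive")
    return num_detected / num_actual


# Hypothetical counts: 45 of 50 actual obstacles were identified.
print(recall(45, 50))
```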

In summary, deep learning technology for target detection in autonomous driving or unmanned driving scenarios is better suited to detecting target objects such as vehicles or pedestrians. For obstacles on the road that must be avoided to prevent collisions, it cannot reach the required detection accuracy. Accurate obstacle detection is an important part of ensuring safe driving in automatic driving: if obstacle detection fails to reach the required accuracy, the safety of autonomous driving or unmanned driving cannot be guaranteed.

Fig. 1 shows a flowchart of a target detection method according to an embodiment of the present disclosure. The method is applied to a target detection device. For example, the device can be deployed in a terminal device or a server or other processing equipment, and can perform processing such as target detection or target classification in automatic driving. Among them, the terminal device may be a user equipment (UE, User Equipment), mobile device, cellular phone, cordless phone, personal digital assistant (PDA, Personal Digital Assistant), handheld device, computing device, in-vehicle device, wearable device, etc. In some possible implementation manners, the method may be implemented by a processor invoking computer-readable instructions stored in the memory. As shown in Figure 1, the process includes:

Step S101: Obtain point cloud information, where the point cloud information includes at least the point cloud information corresponding to the target object and the object to be detected.

In an example, multiple pieces of to-be-processed point cloud information obtained through scanning by at least two sensors may be obtained, and the multiple pieces of to-be-processed point cloud information may be spliced to obtain the point cloud information. In addition, grid processing can be performed according to the point cloud information to obtain grid information.

In an example, the at least two sensors may be sensors with laser emitting and receiving functions in the lidar.

In an example, the target object may refer to a target device scanned by at least two sensors during the target detection process, such as a vehicle in an autonomous driving or unmanned driving scene. The target object in the present disclosure is not limited to the target device, and may also include pedestrians who guide the blind.

In an example, the object to be detected may refer to an object related to the target object in the target detection process. For example, if the target object is a vehicle in an autonomous driving or unmanned driving scene, for safe driving, the object to be detected may be stones, leaves, roadblocks, etc. on the driving route of the vehicle. The object to be detected may also refer to an object in the same observation frame as the target object during the target detection process. For example, again taking a vehicle as the target object, the object to be detected may be a roadside billboard or a tree and its canopy in the same observation frame as the vehicle.

Step S102: Obtain grid information according to the point cloud information. Wherein, the grid information includes at least obstacle point information indicating the object to be detected.

In an example, the point cloud information may include the point cloud information corresponding to the target object, such as the vehicle in an autonomous driving or unmanned driving scene, and the point cloud information corresponding to the object to be detected, such as pebbles, leaves, roadblocks, roadside billboards, and trees and their canopies. It should be pointed out that in autonomous driving or unmanned driving scenarios, the pebbles, leaves, and roadblocks among the objects to be detected are obstacles to be identified later, while roadside billboards and trees and their canopies are located outside the driving path of the vehicle and need not be considered obstacles. In this way, not only can the amount of calculation be reduced, but the detection accuracy of obstacles can also be improved.

In an example, the point cloud information can be gridded to obtain a grid map composed of multiple grid areas. Fig. 2 shows a schematic diagram of grid information according to an embodiment of the present disclosure. The grid information of the present disclosure may be implemented as a grid map or in other chart forms, which is not limited. In Fig. 2, the grid map contains multiple grid areas 11, and each grid area includes one or more pixel points (in Fig. 2, each grid area includes multiple pixel points as an example). For each grid area in the grid map, it is necessary to identify whether the grid area contains a pixel point corresponding to the object to be detected, or even to an obstacle (hereinafter referred to as an obstacle point), and to record the result as obstacle point information. The grid map obtained by the gridding process can thus be regarded as the initial grid map, in which the obstacle point information of every grid area is a first value representing "none", such as "0". The process of identifying obstacles can be regarded as updating the obstacle point information of each grid area in the grid map (specifically, updating the obstacle point information of a certain grid area from the first value to a second value, for example "1"), and at least the sensor identification (ring ID) in the point cloud information can be used as the basis for the update. For example, the obstacle point information can be marked in the grid area according to the ring ID. Fig. 5 shows a schematic diagram of obstacle point information in each grid area according to an embodiment of the present disclosure, taking the numbers "0" and "1" as the obstacle point information as an example. Marking a grid area with the first value "0" indicates that there are no obstacle points in the grid area, and marking it with the second value "1" indicates that there are obstacle points in the grid area. In this way, a grid map containing obstacle point information is obtained, so that the obstacles in the object to be detected can be identified according to it.
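The gridding step and the initial grid map can be sketched as follows. This is only an illustration under stated assumptions: the point layout `(x, y, z, ring_id)`, the square cell shape, the `cell_size` value, and the function names are hypothetical and are not specified in the disclosure.

```python
from collections import defaultdict
from math import floor


def build_grid(points, cell_size=0.5):
    """Group points into square grid areas keyed by (column, row).

    Each point is an assumed (x, y, z, ring_id) tuple; cell_size uses the
    same unit as x and y.
    """
    cells = defaultdict(list)
    for x, y, z, ring_id in points:
        key = (floor(x / cell_size), floor(y / cell_size))
        cells[key].append((x, y, z, ring_id))
    return cells


def initial_obstacle_marks(cells):
    """Initial grid map: every grid area starts at the first value 0 ("none")."""
    return {key: 0 for key in cells}


# Hypothetical points: two fall into the same grid area, one into another.
points = [(0.1, 0.2, 0.0, 3), (0.3, 0.4, 0.1, 4), (1.2, -0.3, 0.0, 3)]
cells = build_grid(points)
marks = initial_obstacle_marks(cells)
print(sorted(cells), marks)
```

The marks would later be updated from 0 to 1 for grid areas found to contain obstacle points.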

Step S103: Identify obstacles in the object to be detected that affect the movement of the target object according to the grid information.

In an example, the grid information may be a grid map containing obstacle point information. According to the grid map containing obstacle point information, obstacles in the object to be detected can be identified. For example, marking a grid area as "1" indicates that there are obstacle points in the grid area. By connecting multiple obstacle points, the connected area corresponding to these obstacle points can be obtained, from which the shape of the corresponding object to be detected, and even the shape of the obstacle, can be determined.

In the present disclosure, the target object is scanned by at least two sensors to obtain point cloud information corresponding to the target object and the object to be detected; grid information is obtained according to the point cloud information and includes at least obstacle point information indicating whether the object to be detected exists; and the obstacles in the object to be detected can be identified according to the obstacle point information contained in the grid information, which improves the accuracy of target detection for the obstacles.

In an example, in the process of scanning the target object by at least two sensors, the point cloud information can be obtained according to the scanning detection signal sent by the sensor and the return signal received. For example, the sensor transmits a scanning detection signal to the vehicle and its surroundings, receives the return signal reflected from the vehicle and its surrounding objects, and compares the return signal with the transmitted scanning detection signal to obtain parameters of the vehicle and its surrounding objects, such as position information, height information, distance information, speed information, posture information, and shape information, so that the vehicle and its surrounding objects can be tracked and identified based on these parameters.

It should be pointed out that the point cloud information of the present disclosure is a collection of massive points that express the spatial distribution and surface characteristics of objects in the target area under the same spatial reference system. Each point is recorded in the form of a pixel point as a combination of multiple items among its three-dimensional coordinates (where the X/Y coordinates give the position information among the above parameters and the third coordinate Z gives the height information), color information (RGB), laser reflection intensity (Intensity), and the like.

In an example, the ring ID of each pixel point can be obtained from the point cloud information, and whether there are obstacle points in a target grid area among the multiple grid areas can be determined according to the ring IDs of the pixel points included in that target grid area. Further, in the case that there are obstacle points in the target grid area, updating the grid information may include the following content:

In a case where the ring IDs corresponding to at least two pixel points in the target grid area are different, it is determined that there are obstacle points in the target grid area. Fig. 3 shows a schematic diagram of pixel points in a grid area originating from different ring IDs according to an embodiment of the present disclosure, including a sensor 21, a sensor 22, a sensor 23, an object to be detected 24, and a plurality of pixel points (labeled ①-⑥). It should be pointed out that the triangular shape of the object to be detected 24 is merely illustrative and is not intended to limit the actual shape of the object to be detected. The laser beams emitted by the sensor 21 and the sensor 22 would not originally fall into the target grid area where the object to be detected 24 is located; because the object to be detected 24 exists in the target grid area, these laser beams are reflected there. Specifically, when the sensor 21 scans to obtain point cloud information composed of multiple pixel points, the laser beam 211 emitted by the sensor 21 meets the object to be detected 24 and is reflected, and pixel point ① falls into the target grid area; when the sensor 22 scans, the laser beams 221 and 222 emitted by the sensor 22 meet the object to be detected 24 and are reflected, and pixel points ② and ③ fall into the target grid area; when the sensor 23 scans, the laser beams 231, 232, and 233 emitted by the sensor 23 do not encounter the object to be detected 24, and pixel points ④, ⑤, and ⑥ fall into the target grid area. It can be seen that the ring IDs corresponding to the pixel points ①-⑥ include different identifiers, which means that these pixel points were obtained by different sensors. At this time, it can be determined that there are pixel points corresponding to the object to be detected in the target grid area, that is, obstacle points, and the obstacle point information corresponding to the target grid area is updated from the initial first value to the second value to mark the existence of obstacle points.

It should be pointed out that the multiple sensors (sensor 21, sensor 22, and sensor 23) in Fig. 3 are not necessarily arranged separately in actual applications. They can also be arranged next to each other, or multiple sensors can even be arranged together and emit at different projection angles. In this example, the sensors are shown dispersed so that the recognition of the object to be detected, including obstacles, is more intuitive to explain. Sensor placements that can be conceived by those skilled in the art without creative work are within the protection scope of the present disclosure.

In a case where the ring IDs corresponding to at least two pixel points in the target grid area are the same, it is determined that there are no obstacle points in the target grid area. Fig. 4 shows a schematic diagram of pixel points in a grid area originating from the same ring ID according to an embodiment of the present disclosure, including a sensor 31 and a plurality of pixel points (labeled ⑦-⑩). When the sensor 31 scans to obtain point cloud information composed of multiple pixel points, the laser beams 311, 312, 313, and 314 emitted by the sensor 31 do not encounter obstacles, and pixel points ⑦, ⑧, ⑨, and ⑩ fall into the target grid area. It can be seen that the ring IDs corresponding to the pixel points ⑦-⑩ are the same identifier, which means that these pixel points were obtained by the same sensor. At this time, it can be determined that there are no obstacle points in the target grid area, and the obstacle point information corresponding to the target grid area is maintained as the initial first value.
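The two cases above (different ring IDs versus the same ring ID within one grid area) reduce to a simple check. A minimal sketch follows, in which the function names are hypothetical and the sensor reference numerals from Figs. 3 and 4 are reused as stand-in ring ID values purely for illustration.

```python
def has_obstacle_point(ring_ids):
    """Return 1 if the grid area holds points from at least two different
    ring IDs, otherwise 0 (the initial first value is kept)."""
    return 1 if len(set(ring_ids)) >= 2 else 0


def mark_cells(cell_ring_ids):
    """Apply the ring-ID rule to every grid area."""
    return {cell: has_obstacle_point(ids) for cell, ids in cell_ring_ids.items()}


cell_ring_ids = {
    (0, 0): [21, 22, 22, 23, 23, 23],  # like Fig. 3: points from three sensors
    (0, 1): [31, 31, 31, 31],          # like Fig. 4: points from one sensor
}
print(mark_cells(cell_ring_ids))  # {(0, 0): 1, (0, 1): 0}
```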

In autonomous driving or unmanned driving scenarios, the object to be detected included in the point cloud information may be an obstacle such as a pebble or leaf, or a non-obstacle such as a tree crown or sign. Therefore, on the basis of the obstacle point judgment based on the above ring ID, height information can be further added to verify the obstacle points determined by ring ID and to avoid possible misjudgments, such as recognizing tree crowns, signs, and other non-obstacles as obstacles. This is because, in the case of autonomous driving or unmanned driving, the target object is a vehicle, and overhead objects such as tree crowns and signboards should not be treated as obstacles and are usually much higher than obstacles such as stones and leaves. The height information of the pixel points in the point cloud information can thus be used to exclude non-obstacles such as tree crowns and signs from the grid area.

In an example, when the point cloud information further includes height information and there are obstacle points in the target grid area, updating the grid information further includes: determining, according to the height information, the category of the obstacle points existing in the target grid area; and updating the obstacle point information corresponding to the target grid area in the grid information according to the category of the obstacle points. For example, where the grid information is a grid map marked with obstacle point information, when it is determined that the obstacle points existing in the target grid area correspond to non-obstacles such as tree crowns, the obstacle point information corresponding to the target grid area is updated from the second value to the first value, which can effectively reduce the probability of the above-mentioned misjudgment. When it is determined that the obstacle points correspond to an obstacle such as a roadblock, the obstacle point information corresponding to the target grid area is maintained at the second value. In this way, after the grid information is updated, a more accurate grid map containing only the obstacle point information corresponding to obstacles can be obtained for subsequent target detection processing.

In an example, determining the category of the obstacle points existing in the target grid area according to the height information includes: obtaining the ring IDs and height information corresponding to at least two pixel points in the target grid area; dividing the at least two pixel points according to ring ID, with the pixel points corresponding to the same ring ID forming one group of data, to obtain multiple groups of pixel data; determining, according to the height information, the minimum height value in each group of pixel data; and classifying and counting the minimum height values of the multiple groups of pixel data to obtain one or more minimum height categories, and determining accordingly the number of height values included in each minimum height category and the minimum value among them. The category of the obstacle points in the target grid area, that is, whether the obstacle points are pixel points corresponding to an obstacle, can then be determined according to the number of height values included in each minimum height category corresponding to the target grid area and their minimum value. In an example, the number of height values included in each minimum height category can be compared with a number threshold (ring_count_th), and the minimum of the height values included in each minimum height category can be compared with a height threshold (height_th). If the one or more minimum height categories include a target minimum height category in which the number of height values is greater than or equal to the number threshold (ring_count_th) and the minimum of the height values is less than the height threshold (height_th), the obstacle points existing in the target grid area are considered to correspond to an obstacle. For example, ring_count_th may be set to 3, and height_th may be the height of the vehicle, for example, 2 m.
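The height check described above can be sketched as follows. The thresholds `ring_count_th` and `height_th` come from the text; how the minimum height values are classified is not specified there, so grouping them into fixed-width bins (`bin_size`) is an assumption made only for this illustration, as are the function name and the `(ring_id, height)` input layout.

```python
from collections import defaultdict
from math import floor


def cell_is_obstacle(points, ring_count_th=3, height_th=2.0, bin_size=0.5):
    """Decide whether the obstacle points of one grid area belong to an obstacle.

    points: iterable of (ring_id, height) pairs for one grid area.
    """
    # 1. Group by ring ID and keep the minimum height of each group.
    min_by_ring = {}
    for ring_id, z in points:
        if ring_id not in min_by_ring or z < min_by_ring[ring_id]:
            min_by_ring[ring_id] = z

    # 2. Classify the minimum heights (assumption: fixed-width height bins).
    classes = defaultdict(list)
    for z in min_by_ring.values():
        classes[floor(z / bin_size)].append(z)

    # 3. An obstacle needs one class with enough rings and a low enough minimum.
    return any(
        len(values) >= ring_count_th and min(values) < height_th
        for values in classes.values()
    )


roadblock = [(1, 0.1), (1, 0.6), (2, 0.2), (3, 0.3)]
tree_crown = [(1, 3.1), (2, 3.2), (3, 3.3)]
print(cell_is_obstacle(roadblock), cell_is_obstacle(tree_crown))  # True False
```

A grid area for which this check fails would have its obstacle point information reset from the second value to the first value.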

After the above ring-ID-based division has been performed and its results obtained, a connected area analysis can be performed on the grid map to obtain a connected area, and an obstacle in the object to be detected can be identified according to the connected area. The obstacle can be represented in the form of a polygon such as a concave polygon, a convex polygon, a rectangle, or a triangle, as long as the obstacle can be distinguished from other objects. In an example of the present disclosure, convex polygons are used. On the one hand, a convex polygon can have more sides than a rectangle or triangle, which makes it easier to represent the shape of an obstacle accurately; on the other hand, compared with concave polygons, convex polygons do not introduce redundant calculations, and the computational cost is moderate.

In an example, the above-mentioned connected area analysis searches, based on the obstacle point information marked as "0" or "1" in the grid areas as shown in Fig. 5, for grid areas whose obstacle point information is "1" and that are connected together, thereby forming a "connected area".

Figures 6a-6b show schematic diagrams of the connectivity modes of a connected area according to an embodiment of the present disclosure. The connected area calculation can be implemented by the Breadth First Search (BFS) algorithm. In an example, the smallest unit in an image is a pixel, each pixel has 8 adjacent pixels around it, and there are 2 types of adjacency: 4-adjacency (as shown in Figure 6a) and 8-adjacency (as shown in Figure 6b). In 4-adjacency, a pixel has 4 neighbors, namely the pixels above, below, to the left, and to the right of it. In 8-adjacency, the 4 pixels on the diagonals are also included, for a total of 8 neighbors. If a pixel point A and a pixel point B are adjacent, they are connected and form a region. The set of all pixels connected in this way is called a "connected area". Obstacles can be obtained through the connected area operation. Fig. 7 shows a schematic diagram of obstacles in a grid map according to an embodiment of the present disclosure. As shown in Fig. 7, the grid map contains multiple obstacles represented by convex polygons.
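A connected area search with BFS over a grid map marked with "0" and "1" can be sketched as below. The grid layout (a list of rows) and the names `NEIGHBORS_4`, `NEIGHBORS_8`, and `connected_regions` are illustrative assumptions, not part of the disclosure.

```python
from collections import deque

# Offsets for 4-adjacency (Fig. 6a) and 8-adjacency (Fig. 6b).
NEIGHBORS_4 = [(-1, 0), (1, 0), (0, -1), (0, 1)]
NEIGHBORS_8 = NEIGHBORS_4 + [(-1, -1), (-1, 1), (1, -1), (1, 1)]


def connected_regions(grid, neighbors=NEIGHBORS_8):
    """Return a list of regions, each a list of (row, col) cells marked 1."""
    rows = len(grid)
    cols = len(grid[0]) if rows else 0
    seen = set()
    regions = []
    for r in range(rows):
        for c in range(cols):
            if grid[r][c] != 1 or (r, c) in seen:
                continue
            # Breadth-first search starting from an unvisited obstacle cell.
            queue = deque([(r, c)])
            seen.add((r, c))
            region = []
            while queue:
                cr, cc = queue.popleft()
                region.append((cr, cc))
                for dr, dc in neighbors:
                    nr, nc = cr + dr, cc + dc
                    inside = 0 <= nr < rows and 0 <= nc < cols
                    if inside and grid[nr][nc] == 1 and (nr, nc) not in seen:
                        seen.add((nr, nc))
                        queue.append((nr, nc))
            regions.append(region)
    return regions


grid = [
    [1, 0, 0, 0],
    [0, 1, 0, 1],
    [0, 0, 0, 1],
]
print(len(connected_regions(grid, NEIGHBORS_8)))  # 2
print(len(connected_regions(grid, NEIGHBORS_4)))  # 3
```

In this example the two diagonal cells in the upper left merge into one region only under 8-adjacency, which is why the region count differs between the two modes.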

In an example, after identifying the obstacle in the object to be detected based on the above-mentioned connected area, the method further includes: acquiring a plurality of points to be processed on a first line segment of the connected area; selecting at least two reference points from the plurality of points to be processed; connecting the at least two reference points to obtain a second line segment; and adjusting the connected area according to the second line segment to obtain a first area. For example, the first area may be smaller than the connected area. If the obstacle is a convex polygon, this adjustment of the connected area can be called convex hull processing. For example, a certain line segment (called the first line segment) that constitutes a connected region has 10 points to be processed; 6 reference points are selected from the 10 points to be processed, and the 6 reference points are connected to obtain a line segment (called the second line segment). The first area is obtained after adjusting the connected area according to the second line segment, and the first area is smaller than the connected area. That is to say, after convex hull processing, the number of edges used to represent the obstacle is reduced (fewer points means fewer edges), and the convex polygon is smaller than its original shape. Convex hull processing can reduce the amount of calculation.

In an example, after the obstacle in the object to be detected is identified according to the connected area, the method further includes: extracting the point cloud information corresponding to the target object from the point cloud information, and obtaining the target position corresponding to the target object according to the coordinates of the pixel points in the point cloud information corresponding to the target object; obtaining at least two obstacles identified based on the grid information; taking the center point of the target position as a reference, obtaining a fan-shaped area according to guide lines emitted at a preset angle; and, when the fan-shaped area covers a first obstacle and a second obstacle and the second obstacle is blocked by the first obstacle, deleting the obstacle point information of the second obstacle from the grid information. FIG. 8 shows a schematic diagram of deleting occluded obstacles in a grid diagram according to an embodiment of the present disclosure. As shown in FIG. 8, the grid diagram contains a target object and at least two obstacles. The target object may be a vehicle 41, the first obstacle among the at least two obstacles may be a warning object 42, and the second obstacle among the at least two obstacles may be one or more stones 43. Taking the center point of the current position of the vehicle 41 as a reference, a fan-shaped area is obtained according to guide lines emitted at the preset angle α, and the warning object 42 and the one or more stones 43 are all covered by the fan-shaped area. However, since the stones 43 are blocked by the warning object 42, only the warning object 42 can be seen when observing from the current center position of the vehicle 41, and the stones 43 cannot be seen. In other words, the stones 43, as objects smaller than the warning object 42, can be ignored. Therefore, the one or more stones 43 can be deleted from the grid map. It should be pointed out that the second obstacle is not limited to small stones that are blocked, and can also be, for example, grass on the side of the road.

In an example, the method includes: sending a message that there is an obstacle on the navigation path to a target object (such as a vehicle), so that the target object performs obstacle avoidance processing and/or replans the navigation path in response to the message.

Application example:

An application example according to the above embodiment includes the following content:

1. Scan with multiple sensors in the lidar to obtain the point cloud information corresponding to the target object and the object to be detected. Then, based on the point cloud information, grid information containing at least obstacle point information indicating the object to be detected is obtained. For example, the grid information may be a grid graph marked with obstacle point information.

There can be multiple sensors in one or more lidars, and the multiple sensors jointly construct the point cloud information of the entire scene. The entire scanning area (the area covered by a group of point clouds scanned by each lidar at the same time or within the same period of time) corresponds to the grid map. Since the laser transmitters in each lidar point at different angles to the horizontal plane, every time the lidar scans, each sensor rotates through one full circle and scans the point cloud information at its own angle.

If there is no object such as an obstacle on a certain grid area in the grid map, the grid area should be a plane that almost matches the height of the ground. The laser light emitted by the sensors adjacent to the sensor corresponding to the grid area will not be blocked by the grid area, so the laser light striking the grid area comes from a single sensor. Therefore, if all pixels falling in a certain grid area originate from the same sensor, that is, the ring IDs corresponding to all pixels in the grid area are the same, it can be considered that there are no obstacle points that may correspond to obstacles in the grid area. However, if a certain grid area contains a raised object such as an obstacle, the laser light emitted by the sensors adjacent to the sensor corresponding to the grid area will be blocked and reflected by the raised object, so the laser light striking the grid area comes from different sensors. Therefore, if the pixels falling in a certain grid area correspond to multiple sensors, that is, the ring IDs corresponding to the pixels in the grid area are different, it can be considered that there are obstacle points that may correspond to obstacles in the grid area.
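The ring ID rule above can be illustrated with a short sketch. It assumes each point is given as a tuple (x, y, ring_id) in vehicle-centred coordinates and that cells are indexed by integer division of the coordinates; the function name `mark_obstacle_cells` and the dictionary output are hypothetical choices for illustration only.

```python
from collections import defaultdict


def mark_obstacle_cells(points, cell_size=0.1):
    """Map each occupied cell to 1 if its points carry different ring IDs, else 0.

    points: iterable of (x, y, ring_id) tuples (assumed layout).
    cell_size: side length of a grid area in metres.
    """
    rings_per_cell = defaultdict(set)
    for x, y, ring_id in points:
        cell = (int(x // cell_size), int(y // cell_size))
        # A set ensures that the same ring ID is not counted repeatedly.
        rings_per_cell[cell].add(ring_id)
    return {
        cell: 1 if len(rings) > 1 else 0
        for cell, rings in rings_per_cell.items()
    }
```

A cell hit only by ring 5 is marked "0", while a cell hit by rings 5 and 6 is marked "1" as containing a possible obstacle point.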

Further, the approach of using the number of ring IDs of the pixels falling in a grid area to determine whether there may be an obstacle in the grid area can be further optimized. In particular, for a grid area that includes objects in the air such as tree crowns or signboards, lasers belonging to multiple sensors will also strike the grid area, that is, the ring IDs corresponding to the pixels that fall into the grid area are different. However, when the target object is a vehicle, overhead objects such as tree crowns and signboards are not obstacles that the vehicle needs to pay attention to, and it is necessary to avoid identifying tree crowns and signboards as obstacles that the vehicle needs to avoid. Therefore, the height information of the pixels can be taken into consideration to check the possible obstacles obtained from the ring IDs and filter out objects higher than a certain height, thereby further improving the accuracy of obstacle detection.

It should be pointed out that if the input point cloud information is the fusion of multiple lidar scan results, an N×M grid map can be constructed for the point cloud information scanned by each lidar. The side length of each grid area can be preset to represent 0.1 m in reality, and the coordinates (N/2, M/2) are set as the center of the vehicle. If the input point cloud information is the result of a single lidar scan, one N×M grid map is constructed directly. Regardless of whether the point cloud information includes the fusion of multiple lidar scan results or one lidar scan result, the following obstacle identification method is used to judge the obstacles and obtain a grid map with obstacle point information.

In the process of judging whether there is an obstacle in a grid area according to the ring ID and height information, the pixels in the point cloud information scanned by a single lidar can be allocated to the grid areas according to their position information. For each grid area, the ring IDs of the pixels allocated to it are counted (the same ring ID is not counted repeatedly). Then, the pixels corresponding to the same ring ID are taken as one set of data, giving multiple sets of pixel data. Then, according to the height information, the minimum height value in each set of pixel data is determined, and the minimum height values of the multiple sets are classified and counted to obtain at least one minimum height class. For each minimum height class, the category of possible obstacles is determined according to the number of height values included in the class and the minimum of these height values. In one example, the category of the obstacle points existing in the grid area can be determined by comparing the number of height values included in each minimum height class with a number threshold, and comparing the minimum of the height values included in each minimum height class with a height threshold. If the at least one minimum height class contains a target minimum height class in which the number of height values is greater than or equal to the number threshold (ring_count_th) and the minimum of the height values is less than the height threshold (height_th), the obstacle points in the grid area are considered to correspond to an obstacle that will really affect the target object. The advantage of using classification statistics is that a segment of obstacle that is continuous in height is found, rather than a single point.
In the end, one grid map is obtained for each lidar, and the multiple grid maps are merged element by element with an "OR" operation to give the output, that is, the grid map with obstacle point information. An example of the "OR" operation is as follows: "1" in a grid map indicates an obstacle point and "0" indicates an obstacle-free point. Given two 1×3 grid maps, [1, 0, 0] and [0, 1, 0], the "OR" operation is applied to each pair of corresponding grid areas: if either or both of the grid areas are marked "1", the corresponding position of the merged grid map is marked "1". The result of the "OR" operation on these two grid maps is [1, 1, 0].
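The element-wise "OR" merge of the per-lidar grid maps can be sketched as below. The function name `merge_grids` is hypothetical, and the sketch assumes that all grid maps have the same dimensions and contain only 0 and 1.

```python
def merge_grids(grids):
    """Merge equally sized 0/1 grid maps with an element-wise OR."""
    # Copy the first grid so that the inputs are left unmodified.
    merged = [row[:] for row in grids[0]]
    for grid in grids[1:]:
        for r, row in enumerate(grid):
            for c, value in enumerate(row):
                merged[r][c] = merged[r][c] | value
    return merged
```

Applied to the two 1×3 grid maps from the text, `merge_grids([[[1, 0, 0]], [[0, 1, 0]]])` gives `[[1, 1, 0]]`.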

In the process of judging from the ring IDs that there may be obstacles, height information is added to verify whether there really are obstacles in a certain grid area. Compensation methods can be used to increase the detection distance while ensuring the detection quality. When counting the ring IDs of the pixels in each grid area, the point cloud information becomes sparse on distant objects, so the farther the grid area is from the center of the vehicle (distance), the more it is necessary to compensate by also counting the surrounding grid areas. The compensation method can be: also count the pixels within an n×n range centered on the grid area, where

n=around(1+a×distance),

The around function denotes rounding, and a is a small predetermined constant. When performing the classification statistics, all the height values in the minimum height value array can be sorted first; if the height difference between two consecutive items in the sorted sequence is greater than a certain threshold (gap_th), the sequence is split into two classes at that point.
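The classification statistics and the decision rule based on ring_count_th and height_th can be sketched as follows. This is an illustrative reading of the text, not the implementation of the disclosure: points are assumed to be (ring_id, height) pairs already allocated to one grid area (including any n×n compensation range), and the function names are hypothetical.

```python
def classify_min_heights(points, gap_th):
    """Group the per-ring minimum heights into classes separated by gaps > gap_th.

    points: iterable of (ring_id, height) pairs for one grid area (assumed layout).
    Returns a list of classes, each a sorted list of minimum height values.
    """
    min_by_ring = {}
    for ring_id, height in points:
        if ring_id not in min_by_ring or height < min_by_ring[ring_id]:
            min_by_ring[ring_id] = height
    classes = []
    for h in sorted(min_by_ring.values()):
        if classes and h - classes[-1][-1] <= gap_th:
            classes[-1].append(h)  # continuous in height: same class
        else:
            classes.append([h])    # gap larger than gap_th: start a new class
    return classes


def cell_has_real_obstacle(points, gap_th, ring_count_th, height_th):
    """True if some class has enough rings and starts below the height threshold."""
    return any(
        len(cls) >= ring_count_th and cls[0] < height_th
        for cls in classify_min_heights(points, gap_th)
    )
```

Under these assumptions, a grid area that only contains a tree crown (all minimum heights well above height_th) is rejected, while a grid area with three or more rings stacked continuously from near ground level is kept.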

Because the pixels reflected from distant objects are more scattered than those reflected from nearby objects, gap_th is given by a correction function, and its value can be corrected according to the distance between the grid area and the center of the vehicle (distance). For example, different compensation schemes can be adopted according to conditions such as the installation position and angle of the sensor and the sparseness of the point cloud. In an example,

gap_th=a×distance+b

Among them, the unit of the threshold gap_th is meters, and a and b are relatively small constants. The calculated gap_th is a small value, which can be 0.1m.

As for the value of the number threshold ring_count_th, compensation can be made according to the sparseness of the point cloud information. In an example, a fixed value may be used, for example, 3. As for the value of the height threshold height_th, since the sensor (the lidar containing the sensor can be installed on the vehicle) has a certain elevation angle, height_th cannot be set to a fixed value. Instead, an angle correction can be applied based on the distance between the grid area and the center of the vehicle (distance). For example, assuming that the tangent of the correction angle is a, let

height_th=1+a×distance

Among them, the unit of the height threshold height_th is meters. It should be noted that the values of the above-mentioned parameters can be set according to actual conditions, and the specific setting methods are not limited here.
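The three distance-dependent quantities above can be collected in a small sketch. Note that the text reuses the symbol a for three different constants; the sketch gives them separate, hypothetical parameter names (`a_n`, `a_gap`, `b_gap`, `tan_angle`), and any concrete values passed in are assumptions rather than values from the disclosure.

```python
def neighbourhood_size(distance, a_n):
    """n = around(1 + a * distance), the side of the n x n compensation range.

    Python's round() rounds exact halves to the nearest even integer.
    """
    return round(1 + a_n * distance)


def gap_threshold(distance, a_gap, b_gap):
    """gap_th = a * distance + b, in metres."""
    return a_gap * distance + b_gap


def height_threshold(distance, tan_angle):
    """height_th = 1 + a * distance, in metres, where a is the tangent of the correction angle."""
    return 1 + tan_angle * distance
```

For instance, with the assumed constants a_gap = 0.005 and b_gap = 0.05, a grid area 10 m from the vehicle center gets gap_th = 0.1 m, which matches the order of magnitude mentioned in the text.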

2. Analyze the connected area of the obstacle points in the grid graph to obtain the connected area. According to the connected area, the obstacle represented by the convex polygon can be obtained.
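As an illustration of obtaining a convex polygon from a connected area, the sketch below computes a convex hull with Andrew's monotone chain algorithm. The disclosure does not name a specific hull algorithm, so this choice, the function names, and the (x, y) tuple representation of grid cells are assumptions.

```python
def cross(o, a, b):
    """Z component of the cross product of vectors OA and OB."""
    return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])


def convex_hull(points):
    """Return the hull vertices in counter-clockwise order (monotone chain)."""
    pts = sorted(set(points))
    if len(pts) <= 2:
        return pts
    lower = []
    for p in pts:
        # Drop points that would make a clockwise or straight turn.
        while len(lower) >= 2 and cross(lower[-2], lower[-1], p) <= 0:
            lower.pop()
        lower.append(p)
    upper = []
    for p in reversed(pts):
        while len(upper) >= 2 and cross(upper[-2], upper[-1], p) <= 0:
            upper.pop()
        upper.append(p)
    # The last point of each chain is the first point of the other chain.
    return lower[:-1] + upper[:-1]
```

Interior cells of a connected area, such as the center of a filled square, are dropped, and only the corner points remain as vertices of the convex polygon.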

After the above grid map is obtained, the value of each grid area indicates whether the grid area contains an obstacle point corresponding to an obstacle. Because the point cloud information is sparse, some larger objects are split into many parts, so an image dilation algorithm can first be applied to the grid map to connect the parts of the same object. Next, connected area analysis is performed (each connected area can represent an object, such as an obstacle). For each connected region, its convex hull is calculated, and a convex hull operation such as the Ramer–Douglas–Peucker algorithm is then applied to each convex hull to reduce its number of edges and hence the amount of computation. Finally, field of view (FOV) analysis is performed to remove small obstacles that cannot be observed from the center of the vehicle.
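The dilation step that reconnects the parts of a sparsely sampled object can be sketched as below. The 3×3 square structuring element and the function name `dilate` are assumptions; the disclosure only states that an image dilation algorithm can be used.

```python
def dilate(grid):
    """Binary dilation of a 0/1 grid map with a 3 x 3 square structuring element."""
    rows = len(grid)
    cols = len(grid[0]) if rows else 0
    out = [[0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            if grid[r][c] != 1:
                continue
            # Mark the cell itself and its (up to) eight neighbours.
            for dr in (-1, 0, 1):
                for dc in (-1, 0, 1):
                    nr, nc = r + dr, c + dc
                    if 0 <= nr < rows and 0 <= nc < cols:
                        out[nr][nc] = 1
    return out
```

Two obstacle cells separated by a single empty cell become one connected run after dilation, so a later connected area analysis treats them as one object.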

An example of convex hull operation includes:

1. For a polyline that needs to be simplified, connect a straight line AB between the two points A and B at the beginning and end of the polyline;

2. Traverse to find the point C farthest from the line AB on the polyline, and calculate the distance between it and the line AB;

3. Compare the distance with a predetermined threshold. If the distance is less than or equal to the threshold, the straight line AB is taken as the approximation of this section of polyline, and the processing of this section of polyline is completed;

4. If the distance is greater than the threshold, use point C to divide the polyline into two sections, AC and CB, and perform the above-mentioned steps 1-4 on the sections AC and CB respectively;

5. When all the polylines have been processed, the polylines formed by connecting each dividing point in turn can be used as an approximation of the initial polylines to obtain the updated convex hull.
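The five steps above correspond to the Ramer–Douglas–Peucker simplification and can be sketched recursively as follows. The sketch treats the input as an open polyline of (x, y) tuples; how a closed convex hull is cut into polylines is not specified in the text and is left out here. The function names are hypothetical.

```python
import math


def point_line_distance(p, a, b):
    """Perpendicular distance from point p to the straight line through a and b."""
    dx, dy = b[0] - a[0], b[1] - a[1]
    length = math.hypot(dx, dy)
    if length == 0.0:
        # A and B coincide: fall back to the point-to-point distance.
        return math.hypot(p[0] - a[0], p[1] - a[1])
    return abs(dx * (p[1] - a[1]) - dy * (p[0] - a[0])) / length


def simplify(polyline, threshold):
    """Simplify a polyline following steps 1 to 5 of the convex hull operation."""
    if len(polyline) < 3:
        return list(polyline)
    a, b = polyline[0], polyline[-1]           # step 1: end points A and B
    index, farthest = 0, -1.0
    for i in range(1, len(polyline) - 1):      # step 2: farthest point C
        d = point_line_distance(polyline[i], a, b)
        if d > farthest:
            index, farthest = i, d
    if farthest <= threshold:                  # step 3: AB approximates the section
        return [a, b]
    left = simplify(polyline[:index + 1], threshold)   # step 4: section AC
    right = simplify(polyline[index:], threshold)      # step 4: section CB
    return left[:-1] + right                   # step 5: join at the dividing point
```

Points that deviate from the local chord by no more than the threshold are dropped, while pronounced corners are kept as dividing points.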

An example of FOV analysis includes: for every two convex hulls C1 and C2, it is necessary to detect whether the convex hull C1 can be observed from the position of the vehicle, such as the center point A of the vehicle, under the occlusion of the convex hull C2. Specifically, it can include:

1. For each point P on the convex hull C1, connect the center point A and point P.

2. Detect whether the straight line AP passes through the convex hull C2. In this step, a cross product operation can be used to check whether all the points on the convex hull C2 are on the same side of the straight line AP. If they are on the same side, it is considered that the straight line AP does not pass through the convex hull C2.

3. After traversing the points on the convex hull C1, the number n of points on the convex hull C1 that cannot be observed from the position of the vehicle is obtained. If n is greater than or equal to a certain threshold fov_th, the convex hull C1 is considered invisible, and the convex hull C1 is deleted.

Among them, the value of the threshold fov_th needs to be corrected according to the distance from the obstacle to the vehicle. An example of a correction is:

fov_th=min(1, ceil(convex_point_num×(1–distance/a))),

Among them, convex_point_num is the number of points on the corresponding convex hull; distance is the distance from the convex hull to the vehicle; a is a larger constant, which can be taken as the maximum perceivable distance value; ceil is the round-up (ceiling) function.
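The FOV analysis steps can be sketched as follows, implementing the same-side test exactly as described. The function names are hypothetical, hull points are assumed to be (x, y) tuples, and fov_th is passed in as a parameter rather than computed. Because the test uses the infinite straight line AP, it does not by itself check that C2 lies between the vehicle and P; a practical implementation would probably add such a distance check, which is an assumption beyond the text.

```python
def cross(o, a, b):
    """Z component of the cross product of vectors OA and OB."""
    return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])


def line_passes_through(a, p, hull):
    """True unless all hull points lie strictly on one side of the line AP."""
    signs = [cross(a, p, q) for q in hull]
    all_left = all(s > 0 for s in signs)
    all_right = all(s < 0 for s in signs)
    return not (all_left or all_right)


def count_hidden_points(a, hull_c1, hull_c2):
    """Number n of points P on C1 whose line AP passes through C2."""
    return sum(1 for p in hull_c1 if line_passes_through(a, p, hull_c2))


def is_invisible(a, hull_c1, hull_c2, fov_th):
    """C1 is considered invisible if n is greater than or equal to fov_th."""
    return count_hidden_points(a, hull_c1, hull_c2) >= fov_th
```

In use, a small hull lying directly behind a larger one as seen from the vehicle center has all of its points counted as hidden, while a hull off to the side has none.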

Those skilled in the art can understand that, in the above-mentioned methods of the specific implementations, the order in which the steps are written does not imply a strict execution order or constitute any limitation on the implementation process. The specific execution order of each step should be determined by its function and possible inner logic.

The foregoing method embodiments mentioned in the present disclosure can all be combined with each other to form combined embodiments without violating principle and logic. Due to space limitations, the details are not repeated in the present disclosure.

In addition, the present disclosure also provides a target detection device, electronic equipment, a computer-readable storage medium, and a program, all of which can be used to implement any target detection method provided in the present disclosure. For the corresponding technical solutions and descriptions, refer to the corresponding records in the method section, which will not be repeated here.

Fig. 9 shows a block diagram of a target detection device according to an embodiment of the present disclosure. As shown in Fig. 9, the device includes: an acquiring unit 51 configured to acquire point cloud information, where the point cloud information includes at least point cloud information corresponding to a target object and an object to be detected, the target object can move, and the object to be detected is a person or thing around the target object; an information processing unit 52 configured to obtain grid information according to the point cloud information, where the grid information includes at least obstacle point information indicating the object to be detected; and a detection unit 53 configured to identify, according to the grid information, an obstacle in the object to be detected that affects the movement of the target object.

In a possible implementation manner, the acquiring unit is configured to: acquire a plurality of to-be-processed point cloud information scanned by at least two sensors; and perform stitching processing on the plurality of to-be-processed point cloud information to obtain the point cloud information.

In a possible implementation manner, the point cloud information further includes a sensor identification (ring ID). The information processing unit is configured to: perform grid processing on the point cloud information to obtain a grid graph, where the grid graph includes a plurality of grid regions and the obstacle point information corresponding to each of the grid regions is a first value; for each grid area, determine, according to the ring IDs included in the grid area, whether there is an obstacle point corresponding to the object to be detected in the grid area; and if the obstacle point exists in the grid area, update the obstacle point information corresponding to the grid area in the grid information to a second value.

In a possible implementation manner, the information processing unit is configured to determine that the obstacle point exists in the grid area when the ring IDs corresponding to at least two pixel points in the grid area are different.

In a possible implementation manner, the point cloud information further includes height information, and the device further includes a category determining unit configured to: determine the category of the obstacle point existing in the grid area according to the height information; and update, according to the category of the obstacle point, the obstacle point information corresponding to the grid area in the grid information.

In a possible implementation manner, the category determining unit is configured to: obtain the ring IDs and height information respectively corresponding to at least two pixels in the grid area; take the pixels corresponding to the same ring ID among the at least two pixels as one set of data to obtain multiple sets of pixel data; determine, according to the height information, the minimum height value in each set of pixel data; perform classification statistics on the minimum height values of the multiple sets of pixel data to obtain one or more minimum height classes; and determine the category of the obstacle points existing in the grid area according to the number of height values included in each minimum height class and the minimum value thereof.

In a possible implementation manner, the detection unit is configured to: perform a connected area analysis according to the obstacle point information in the grid information to obtain a connected area; and identify, according to the connected area, the obstacle in the object to be detected.

In a possible implementation manner, the device further includes a connected area adjustment unit, configured to: obtain a plurality of points to be processed on a first line segment of the connected area; select at least two reference points from the plurality of points to be processed; connect the at least two reference points to obtain a second line segment; and adjust the connected area according to the second line segment to obtain a first area. In an example, the first area may be smaller than the connected area.

In a possible implementation manner, the device further includes an occlusion processing unit, configured to: extract the point cloud information corresponding to the target object from the point cloud information, and obtain the target position corresponding to the target object according to the coordinates of the pixel points in the point cloud information corresponding to the target object; obtain at least two obstacles identified based on the grid information; take the center point of the target position as a reference, and obtain a fan-shaped area according to guide lines emitted at a preset angle; and, in the case that the fan-shaped area covers a first obstacle and a second obstacle and the second obstacle is blocked by the first obstacle, delete the obstacle point information of the second obstacle from the grid information.

In a possible implementation manner, the device further includes a sending unit, configured to send a message that there is an obstacle on the navigation path to the target object, so that the target object performs obstacle avoidance processing and/or re-plans the navigation path in response to the message.

In some embodiments, the functions or modules contained in the device provided in the embodiments of the present disclosure can be used to execute the methods described in the above method embodiments. For specific implementation, refer to the description of the above method embodiments. For brevity, details are not repeated here.

The embodiments of the present disclosure also provide a computer-readable storage medium on which computer program instructions are stored, and the computer program instructions implement the above-mentioned method when executed by a processor. The computer-readable storage medium may be a non-volatile computer-readable storage medium.

An embodiment of the present disclosure also provides an electronic device, including: a processor; and a memory for storing instructions executable by the processor; wherein the processor is configured to execute the above-mentioned method.

The electronic device can be provided as a terminal, server or other form of device.

Fig. 10 is a block diagram showing an electronic device 800 according to an exemplary embodiment. For example, the electronic device 800 may be a mobile phone, a computer, a digital broadcasting terminal, a messaging device, a game console, a tablet device, a medical device, a fitness device, a personal digital assistant, and other terminals.

Referring to Fig. 10, the electronic device 800 may include one or more of the following components: a processing component 802, a memory 804, a power component 806, a multimedia component 808, an audio component 810, an input/output (I/O) interface 812, a sensor component 814, and a communication component 816.

The processing component 802 generally controls the overall operations of the electronic device 800, such as operations associated with display, telephone calls, data communications, camera operations, and recording operations. The processing component 802 may include one or more processors 820 to execute instructions to complete all or part of the steps of the foregoing method. In addition, the processing component 802 may include one or more modules to facilitate the interaction between the processing component 802 and other components. For example, the processing component 802 may include a multimedia module to facilitate the interaction between the multimedia component 808 and the processing component 802.

The memory 804 is configured to store various types of data to support operations in the electronic device 800. Examples of these data include instructions for any application or method to operate on the electronic device 800, contact data, phone book data, messages, pictures, videos, etc. The memory 804 can be implemented by any type of volatile or non-volatile storage device or a combination thereof, such as static random access memory (SRAM), electrically erasable programmable read-only memory (EEPROM), erasable programmable read-only memory (EPROM), programmable read-only memory (PROM), read-only memory (ROM), magnetic memory, flash memory, magnetic disk, or optical disk.

The power supply component 806 provides power for various components of the electronic device 800. The power supply component 806 may include a power management system, one or more power supplies, and other components associated with the generation, management, and distribution of power for the electronic device 800.

The multimedia component 808 includes a screen that provides an output interface between the electronic device 800 and the user. In some embodiments, the screen may include a liquid crystal display (LCD) and a touch panel (TP). If the screen includes a touch panel, the screen may be implemented as a touch screen to receive input signals from the user. The touch panel includes one or more touch sensors to sense touch, sliding, and gestures on the touch panel. The touch sensor may not only sense the boundary of a touch or slide action, but also detect the duration and pressure related to the touch or slide operation. In some embodiments, the multimedia component 808 includes a front camera and/or a rear camera. When the electronic device 800 is in an operation mode, such as a shooting mode or a video mode, the front camera and/or the rear camera can receive external multimedia data. Each front camera and rear camera can be a fixed optical lens system or have focal length and optical zoom capabilities.

The audio component 810 is configured to output and/or input audio signals. For example, the audio component 810 includes a microphone (MIC), and when the electronic device 800 is in an operation mode, such as a call mode, a recording mode, and a voice recognition mode, the microphone is configured to receive an external audio signal. The received audio signal may be further stored in the memory 804 or transmitted via the communication component 816. In some embodiments, the audio component 810 further includes a speaker for outputting audio signals.

An input/output (I/O) interface 812 provides an interface between the processing component 802 and a peripheral interface module. The peripheral interface module may be a keyboard, a click wheel, a button, and the like. These buttons may include, but are not limited to: home button, volume button, start button, and lock button.

The sensor component 814 includes one or more sensors for providing the electronic device 800 with various aspects of state evaluation. For example, the sensor component 814 can detect the on/off state of the electronic device 800 and the relative positioning of components, for example, the display and keypad of the electronic device 800. The sensor component 814 can also detect a position change of the electronic device 800 or of a component of the electronic device 800, the presence or absence of contact between the user and the electronic device 800, the position or acceleration/deceleration of the electronic device 800, and temperature changes of the electronic device 800. The sensor component 814 may include a proximity sensor configured to detect the presence of nearby objects when there is no physical contact. The sensor component 814 may also include a light sensor, such as a CMOS or CCD image sensor, for use in imaging applications. In some embodiments, the sensor component 814 may also include an acceleration sensor, a gyroscope sensor, a magnetic sensor, a pressure sensor, or a temperature sensor.

The communication component 816 is configured to facilitate wired or wireless communication between the electronic device 800 and other devices. The electronic device 800 can access a wireless network based on a communication standard, such as WiFi, 2G, or 3G, or a combination thereof. In an exemplary embodiment, the communication component 816 receives a broadcast signal or broadcast related information from an external broadcast management system via a broadcast channel. In an exemplary embodiment, the communication component 816 also includes a near field communication (NFC) module to facilitate short-range communication. For example, the NFC module can be implemented based on radio frequency identification (RFID) technology, infrared data association (IrDA) technology, ultra-wideband (UWB) technology, Bluetooth (BT) technology and other technologies.

In an exemplary embodiment, the electronic device 800 may be implemented by one or more application-specific integrated circuits (ASIC), digital signal processors (DSP), digital signal processing devices (DSPD), programmable logic devices (PLD), field-programmable gate arrays (FPGA), controllers, microcontrollers, microprocessors, or other electronic components to implement the above methods.

In an exemplary embodiment, there is also provided a non-volatile computer-readable storage medium, such as the memory 804 including computer program instructions, which can be executed by the processor 820 of the electronic device 800 to complete the foregoing method.

Fig. 11 is a block diagram showing an electronic device 900 according to an exemplary embodiment. For example, the electronic device 900 may be provided as a server. Referring to Fig. 11, the electronic device 900 includes a processing component 922, which further includes one or more processors, and a memory resource represented by a memory 932, for storing instructions that can be executed by the processing component 922, such as an application program. The application program stored in the memory 932 may include one or more modules each corresponding to a set of instructions. In addition, the processing component 922 is configured to execute instructions to perform the above-described methods.

The electronic device 900 may also include a power component 926 configured to perform power management of the electronic device 900, a wired or wireless network interface 950 configured to connect the electronic device 900 to a network, and an input/output (I/O) interface 958. The electronic device 900 can operate based on an operating system stored in the memory 932, such as Windows Server™, Mac OS X™, Unix™, Linux™, FreeBSD™ or the like.

In an exemplary embodiment, a non-volatile computer-readable storage medium is also provided, such as a memory 932 including computer program instructions, which can be executed by the processing component 922 of the electronic device 900 to complete the foregoing method.

The present disclosure may be a system, method and/or computer program product. The computer program product may include a computer-readable storage medium loaded with computer-readable program instructions for enabling a processor to implement various aspects of the present disclosure.

The computer-readable storage medium may be a tangible device that can hold and store instructions used by an instruction execution device. The computer-readable storage medium may be, for example, but is not limited to, an electrical storage device, a magnetic storage device, an optical storage device, an electromagnetic storage device, a semiconductor storage device, or any suitable combination of the foregoing. More specific examples (a non-exhaustive list) of computer-readable storage media include: a portable computer disk, a hard disk, random access memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM or flash memory), static random access memory (SRAM), portable compact disk read-only memory (CD-ROM), a digital versatile disk (DVD), a memory stick, a floppy disk, a mechanically encoded device such as a punch card or a raised structure in a groove with instructions stored thereon, and any suitable combination of the foregoing. The computer-readable storage medium used here is not to be interpreted as a transitory signal per se, such as radio waves or other freely propagating electromagnetic waves, electromagnetic waves propagating through waveguides or other transmission media (for example, light pulses through fiber optic cables), or electrical signals transmitted through wires.

The computer-readable program instructions described herein can be downloaded from a computer-readable storage medium to various computing/processing devices, or downloaded to an external computer or external storage device via a network, such as the Internet, a local area network, a wide area network, and/or a wireless network. The network may include copper transmission cables, optical fiber transmission, wireless transmission, routers, firewalls, switches, gateway computers, and/or edge servers. The network adapter card or network interface in each computing/processing device receives computer-readable program instructions from the network and forwards them for storage in the computer-readable storage medium in that computing/processing device.

The computer program instructions used to perform the operations of the present disclosure may be assembly instructions, instruction set architecture (ISA) instructions, machine instructions, machine-dependent instructions, microcode, firmware instructions, state-setting data, or source code or object code written in any combination of one or more programming languages, including object-oriented programming languages such as Smalltalk and C++, and conventional procedural programming languages such as the "C" language or similar programming languages. The computer-readable program instructions can be executed entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer, or entirely on a remote computer or server. In the case of a remote computer, the remote computer can be connected to the user's computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or it can be connected to an external computer (for example, through the Internet using an Internet service provider). In some embodiments, an electronic circuit, such as a programmable logic circuit, a field-programmable gate array (FPGA), or a programmable logic array (PLA), can be customized by using state information of the computer-readable program instructions, and the electronic circuit can execute the computer-readable program instructions to realize various aspects of the present disclosure.

Here, various aspects of the present disclosure are described with reference to flowcharts and/or block diagrams of methods, devices (systems) and computer program products according to embodiments of the present disclosure. It should be understood that each block of the flowcharts and/or block diagrams, and combinations of blocks in the flowcharts and/or block diagrams, can be implemented by computer-readable program instructions.

These computer-readable program instructions can be provided to the processor of a general-purpose computer, a special-purpose computer, or other programmable data processing device to produce a machine, such that the instructions, when executed by the processor of the computer or other programmable data processing device, produce a device that implements the functions/actions specified in one or more blocks in the flowcharts and/or block diagrams. These computer-readable program instructions can also be stored in a computer-readable storage medium. The instructions cause a computer, a programmable data processing device, and/or other equipment to work in a specific manner, so that the computer-readable medium storing the instructions includes an article of manufacture, which includes instructions for implementing various aspects of the functions/actions specified in one or more blocks in the flowcharts and/or block diagrams.

The computer-readable program instructions can also be loaded onto a computer, other programmable data processing device, or other equipment, so that a series of operation steps are executed on the computer, other programmable data processing device, or other equipment to produce a computer-implemented process, such that the instructions executed on the computer, other programmable data processing device, or other equipment implement the functions/actions specified in one or more blocks in the flowcharts and/or block diagrams.

The flowcharts and block diagrams in the accompanying drawings show the possible implementation architecture, functions, and operations of the system, method, and computer program product according to multiple embodiments of the present disclosure. In this regard, each block in the flowchart or block diagram may represent a module, program segment, or part of an instruction, which contains one or more executable instructions for realizing the specified logical function. In some alternative implementations, the functions marked in the blocks may also occur in an order different from that marked in the drawings. For example, two consecutive blocks can actually be executed substantially in parallel, or they can sometimes be executed in the reverse order, depending on the functions involved. It should also be noted that each block in the block diagram and/or flowchart, and each combination of blocks in the block diagram and/or flowchart, can be implemented by a dedicated hardware-based system that performs the specified functions or actions, or by a combination of dedicated hardware and computer instructions.

Provided that logic is not violated, different embodiments of the present disclosure can be combined with each other. The description of each embodiment has its own emphasis; for parts not described in detail in one embodiment, reference may be made to the descriptions of other embodiments.

The embodiments of the present disclosure have been described above. The above description is exemplary, not exhaustive, and is not limited to the disclosed embodiments. Many modifications and changes will be obvious to those of ordinary skill in the art without departing from the scope and spirit of the described embodiments. The terms used herein were chosen to best explain the principles of the embodiments, their practical applications, or their technical improvements over technologies in the market, or to enable others of ordinary skill in the art to understand the embodiments disclosed herein.

Claims

A target detection method includes:

Acquiring point cloud information, where the point cloud information includes at least point cloud information corresponding to a target object and to an object to be detected, wherein the object to be detected is a person or thing around the target object;

Obtaining grid information according to the point cloud information, where the grid information includes at least obstacle point information indicating the object to be detected;

Identifying, according to the grid information, an obstacle in the object to be detected that affects the movement of the target object.
The method according to claim 1, wherein said acquiring point cloud information comprises:

Acquiring multiple pieces of to-be-processed point cloud information scanned by at least two sensors;

Splicing the multiple pieces of to-be-processed point cloud information to obtain the point cloud information.
The method according to claim 1 or 2, wherein the point cloud information further includes a sensor identifier, and the obtaining grid information according to the point cloud information includes:

Performing gridding processing on the point cloud information to obtain a grid graph, the grid graph including a plurality of grid areas, and the obstacle point information corresponding to each grid area being a first value;

For each grid area,

Determining whether there is an obstacle point corresponding to the object to be detected in the grid area according to the sensor identifiers corresponding to the pixels included in the grid area;

In a case where the obstacle point exists in the grid area, the obstacle point information corresponding to the grid area in the grid information is updated to a second value.
The method according to claim 3, wherein the determining whether there is an obstacle point corresponding to the object to be detected in the grid area according to the sensor identifiers corresponding to the pixels included in the grid area comprises:

In a case where the sensor identifiers corresponding to at least two pixel points in the grid area are different, it is determined that the obstacle point exists in the grid area.
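
The gridding and sensor-identifier test recited in the two preceding claims can be illustrated with a minimal Python sketch. It is not the applicant's implementation: the function name `build_grid`, the `(x, y, sensor_id)` point layout, the square cell size, and the encoding of the first and second values as 0 and 1 are all assumptions made for illustration.

```python
from collections import defaultdict

FIRST_VALUE = 0   # assumed encoding: no obstacle point in the grid area
SECOND_VALUE = 1  # assumed encoding: an obstacle point exists in the grid area


def build_grid(points, cell_size, width, height):
    """Grid the points and mark grid areas seen by two or more sensors.

    points: iterable of (x, y, sensor_id) tuples (hypothetical layout).
    Returns a height x width list of lists holding FIRST_VALUE/SECOND_VALUE.
    """
    # Every grid area starts at the first value.
    grid = [[FIRST_VALUE] * width for _ in range(height)]

    # Collect the distinct sensor identifiers that fall into each grid area.
    sensors_in_cell = defaultdict(set)
    for x, y, sensor_id in points:
        col, row = int(x // cell_size), int(y // cell_size)
        if 0 <= row < height and 0 <= col < width:
            sensors_in_cell[(row, col)].add(sensor_id)

    # At least two different identifiers in one area: update to the second value.
    for (row, col), sensors in sensors_in_cell.items():
        if len(sensors) >= 2:
            grid[row][col] = SECOND_VALUE
    return grid


demo_points = [(0.2, 0.3, 0), (0.4, 0.1, 1), (1.5, 0.5, 0), (1.6, 0.6, 0)]
demo_grid = build_grid(demo_points, cell_size=1.0, width=2, height=1)
```

In this example the first grid area contains points from two different sensor identifiers and is updated, while the second contains points from a single identifier and keeps the first value.
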
The method according to claim 3 or 4, wherein the point cloud information further includes height information, and the updating, in a case where the obstacle point exists in the grid area, of the obstacle point information corresponding to the grid area in the grid information to the second value further includes:

Determining the category of the obstacle point existing in the grid area according to the height information of the pixel points in the grid area;

Updating, according to the category of the obstacle point, the obstacle point information corresponding to the grid area in the grid information.
The method according to claim 5, wherein the determining the category of the obstacle point existing in the grid area according to the height information of the pixel points in the grid area comprises:

Acquiring sensor identifiers and height information respectively corresponding to at least two pixels in the grid area;

Taking the pixel points corresponding to the same sensor identifier among the at least two pixel points as a set of data to obtain multiple sets of pixel point data;

Determining the minimum height value in each group of pixel data according to the height information;

Categorizing the minimum height values in the multiple sets of pixel data to obtain one or more minimum height classes;

Determining the category of the obstacle point existing in the grid area according to the number of height values included in each of the minimum height classes and the minimum value thereof.
The method according to claim 6, wherein the determining the category of the obstacle point existing in the grid area according to the number of height values included in each of the minimum height classes and the minimum value thereof includes:

In the case where there is a target minimum height class in the one or more minimum height classes, it is determined that the obstacle points existing in the grid area correspond to the obstacle, where

The number of height values included in the target minimum height class is greater than or equal to a preset number threshold,

The minimum value of the height values included in the target minimum height class is less than or equal to a preset height threshold;

In the case that the target minimum height class does not exist in the one or more minimum height classes, it is determined that the obstacle points existing in the grid area correspond to non-obstacles.
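
The height-based classification recited in the two preceding claims can be sketched as follows. The claims do not say how the minimum height values are categorized, so the grouping rule used here (sort the values and start a new class whenever the gap to the previous value exceeds `gap`) is an assumption, as are the function name `cell_is_obstacle`, the `(sensor_id, height)` input layout, and the threshold values in the example.

```python
def cell_is_obstacle(pixels, count_threshold, height_threshold, gap=0.1):
    """Decide whether the obstacle points of one grid area correspond to an obstacle.

    pixels: iterable of (sensor_id, height) pairs (hypothetical layout).
    gap: assumed tolerance for grouping neighbouring minimum heights.
    """
    # Group pixels by sensor identifier and keep the minimum height per group.
    min_by_sensor = {}
    for sensor_id, h in pixels:
        if sensor_id not in min_by_sensor or h < min_by_sensor[sensor_id]:
            min_by_sensor[sensor_id] = h

    # Categorize the minima: a simple 1-D gap-based grouping (assumption).
    classes = []
    for h in sorted(min_by_sensor.values()):
        if classes and h - classes[-1][-1] <= gap:
            classes[-1].append(h)
        else:
            classes.append([h])

    # A target class has enough members and a low enough minimum value.
    return any(
        len(c) >= count_threshold and min(c) <= height_threshold
        for c in classes
    )


ground_level = [(0, 0.05), (0, 0.5), (1, 0.10), (2, 0.12), (3, 2.0)]
overhanging = [(0, 2.0), (1, 2.05), (2, 2.1)]
```

With `count_threshold=3` and `height_threshold=0.2`, the `ground_level` area is classified as an obstacle, while the `overhanging` area, whose lowest returns all lie well above the height threshold, is classified as a non-obstacle.
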
The method according to any one of claims 5 to 7, wherein the category of the obstacle point includes the obstacle point corresponding to an obstacle and the obstacle point corresponding to a non-obstacle, and the updating, according to the category of the obstacle point, of the obstacle point information corresponding to the grid area in the grid information includes:

In the case where the category of the obstacle point indicates that the obstacle point corresponds to an obstacle, maintaining the obstacle point information corresponding to the grid area in the grid information as the second value;

When the category of the obstacle point indicates that the obstacle point corresponds to a non-obstacle, the obstacle point information corresponding to the grid area in the grid information is updated to the first value.
The method according to any one of claims 1 to 8, wherein the identifying an obstacle in the object to be detected that affects the movement of the target object according to the grid information comprises:

Performing a connected area analysis according to the obstacle point information in the grid information to obtain a connected area;

Identifying, according to the connected area, the obstacle in the object to be detected.
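
A connected area analysis over the obstacle point information can be sketched with a breadth-first flood fill. The claim does not specify the connectivity rule, so 4-connectivity is an assumption here, as are the function name `connected_areas` and the use of 1 as the value marking an obstacle point.

```python
from collections import deque


def connected_areas(grid, obstacle_value=1):
    """Return a list of connected areas, each a list of (row, col) cells.

    grid: list of equal-length lists; cells equal to obstacle_value are
    grouped using 4-connectivity (an assumption, not stated in the claims).
    """
    rows = len(grid)
    cols = len(grid[0]) if rows else 0
    seen = [[False] * cols for _ in range(rows)]
    areas = []
    for r in range(rows):
        for c in range(cols):
            if grid[r][c] != obstacle_value or seen[r][c]:
                continue
            # Breadth-first flood fill starting from an unvisited obstacle cell.
            queue = deque([(r, c)])
            seen[r][c] = True
            area = []
            while queue:
                cr, cc = queue.popleft()
                area.append((cr, cc))
                for nr, nc in ((cr - 1, cc), (cr + 1, cc), (cr, cc - 1), (cr, cc + 1)):
                    if (0 <= nr < rows and 0 <= nc < cols
                            and not seen[nr][nc]
                            and grid[nr][nc] == obstacle_value):
                        seen[nr][nc] = True
                        queue.append((nr, nc))
            areas.append(area)
    return areas


demo_areas = connected_areas([[1, 1, 0],
                              [0, 0, 0],
                              [0, 1, 1]])
```

Each returned area is one candidate obstacle; in the example the two marked cells in the top row and the two in the bottom row form separate areas because they are not adjacent.
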
The method according to claim 9, further comprising:

Acquiring multiple points to be processed on a first line segment of the connected area;

Selecting at least two reference points from the multiple points to be processed;

Obtaining a second line segment by connecting the at least two reference points, and adjusting the connected area according to the second line segment to obtain a first area.
The method according to any one of claims 1 to 10, further comprising:

Extracting the point cloud information corresponding to the target object from the point cloud information, and obtaining the target position corresponding to the target object according to the coordinates of the pixel points in the point cloud information corresponding to the target object;

Acquiring at least two obstacles identified based on the grid information;

Obtaining a fan-shaped area from guide lines emitted at a preset angle, with the center point of the target position as a reference;

In the case that the fan-shaped area covers a first obstacle and a second obstacle, and the second obstacle is blocked by the first obstacle, deleting the obstacle point information of the second obstacle from the grid information.
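
The fan-shaped occlusion filtering of the preceding claim can be approximated by dividing the plane around the target position into equal angular sectors and keeping only the nearest obstacle in each sector. This is a deliberately simplified sketch: representing each obstacle by a single point, treating "blocked" as "farther away in the same sector", and the names `remove_occluded` and `sector_deg` are all assumptions not found in the source.

```python
import math


def remove_occluded(center, obstacles, sector_deg):
    """Return the names of obstacles that are not blocked within any sector.

    center: (x, y) of the target position's center point.
    obstacles: dict mapping a name to one representative (x, y) point
               (hypothetical simplification of an obstacle region).
    sector_deg: assumed preset angle of each fan-shaped area, in degrees.
    """
    cx, cy = center
    nearest = {}  # sector index -> (distance, name) of the closest obstacle
    for name, (x, y) in obstacles.items():
        angle = math.degrees(math.atan2(y - cy, x - cx)) % 360.0
        sector = int(angle // sector_deg)
        dist = math.hypot(x - cx, y - cy)
        if sector not in nearest or dist < nearest[sector][0]:
            nearest[sector] = (dist, name)
    # Obstacles that are not the nearest in their sector are treated as blocked.
    return {name for _, name in nearest.values()}


kept = remove_occluded(
    center=(0.0, 0.0),
    obstacles={"a": (1.0, 0.1), "b": (5.0, 0.5), "c": (0.0, 3.0)},
    sector_deg=10.0,
)
```

Here obstacles "a" and "b" lie in the same 10-degree sector, so the farther one, "b", is treated as blocked and dropped, while "c" lies in a different sector and is kept.
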
A target detection device, comprising:

An acquiring unit, configured to acquire point cloud information, where the point cloud information includes at least point cloud information corresponding to a target object and to an object to be detected, wherein the object to be detected is a person or thing around the target object;

An information processing unit, configured to obtain grid information according to the point cloud information, where the grid information includes at least obstacle point information indicating the object to be detected;

A detection unit, configured to identify, according to the grid information, an obstacle in the object to be detected that affects the movement of the target object.
An electronic device including:

A processor;

A memory for storing processor executable instructions;

Wherein, the processor is configured to execute the method of any one of claims 1-11.
A computer-readable storage medium having computer program instructions stored thereon, and when the computer program instructions are executed by a processor, the method according to any one of claims 1 to 11 is implemented.
A computer program stored in a storage medium, wherein, when a processor executes the computer program, the processor performs the target detection method according to any one of claims 1-11.