WO2022193193A1 - Data processing method and device - Google Patents

Data processing method and device Download PDF

Info

Publication number
WO2022193193A1
WO2022193193A1 PCT/CN2021/081387 CN2021081387W WO2022193193A1 WO 2022193193 A1 WO2022193193 A1 WO 2022193193A1 CN 2021081387 W CN2021081387 W CN 2021081387W WO 2022193193 A1 WO2022193193 A1 WO 2022193193A1
Authority
WO
WIPO (PCT)
Prior art keywords
data
map
marked
movable platform
target data
Prior art date
Application number
PCT/CN2021/081387
Other languages
French (fr)
Chinese (zh)
Inventor
江灿森
陈琦
衡量
沈劭劼
Original Assignee
深圳市大疆创新科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳市大疆创新科技有限公司 filed Critical 深圳市大疆创新科技有限公司
Priority to PCT/CN2021/081387 priority Critical patent/WO2022193193A1/en
Priority to CN202180079742.6A priority patent/CN116762094A/en
Publication of WO2022193193A1 publication Critical patent/WO2022193193A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis

Definitions

  • the present application relates to the technical field of automatic driving, and in particular, to a data processing method and device.
  • High-precision map data is an important basis for vehicle autonomous driving and is of great significance to the development of intelligent vehicles.
  • HD map data is usually provided by map providers. Map suppliers usually only provide high-precision map data with relatively large usage, but do not provide high-precision map data with relatively small usage.
  • the embodiments of the present application provide a data processing method and device, aiming to provide a solution that can adapt to the personalized map requirements of different mobile platforms.
  • the present application provides a data processing method, comprising:
  • map metadata includes any one or a combination of three-dimensional feature points, texture data and semantic information
  • map data is generated according to the map metadata; wherein the map data is used to control the movable platform to move within the spatial scene.
  • control device comprising: a memory for storing instructions and a processor for executing the instructions stored in the memory, where the processor is used to specifically execute:
  • map metadata includes any one or a combination of three-dimensional feature points, texture data and semantic information
  • map data is generated according to the map metadata; wherein the map data is used to control the movable platform to move within the spatial scene.
  • the present application provides a movable platform including an image sensor and the data processing method involved in the second aspect.
  • the movable platform collects image data of a spatial scene when moving, generates map data based on the image data, and then uses the collected map data to control its movement, which can meet the requirements of the mobile platform.
  • Personalized map data requirements for mobile platforms To generate map data based on image data, the existing image sensors on the mobile platform can be used, and there is no need to configure high-cost sensors such as lidar to collect point clouds, reducing the cost of map construction.
  • the map data is generated according to the map metadata, so as to ensure the accuracy of the map data generated by the mobile platform.
  • map data is generated based on image data, and the storage of map data is lighter, which is very convenient for real-time update and maintenance of maps.
  • FIG. 1 is a schematic structural diagram of a movable platform according to an embodiment of the present application
  • FIG. 2 is a schematic flowchart of a data processing method provided by another embodiment of the present application.
  • FIG. 3 is a schematic flowchart of a data processing method provided by another embodiment of the present application.
  • FIG. 4 is a schematic flowchart of a data processing method provided by another embodiment of the present application.
  • FIG. 5 is a schematic flowchart of a data processing method provided by another embodiment of the present application.
  • FIG. 6 is a schematic structural diagram of a control device provided by another embodiment of the present application.
  • a component when referred to as being "fixed to" another component, it can be directly on the other component or there may also be a centered component.
  • a component When a component is considered to be “connected” to another component, it may be directly connected to the other component or there may be a co-existence of an intervening component.
  • the present application provides a data processing method and device.
  • the technical idea of the present application is: the image data of the space scene is collected by the movable platform while walking in the space scene, and the collected data is used to generate the map data, which can adapt to the personalized needs of the movable platform for map data.
  • map data Before generating map data, it is judged whether the generated map metadata meets the requirements of mapping quality, so as to ensure that the map data generated by the mobile platform can accurately reflect the spatial scene.
  • the map data is generated based on the image data collected by the image sensor, no high-cost image sensor is needed, and the cost of map construction is reduced.
  • an embodiment of the present application provides a movable platform 100 .
  • the movable platform 100 includes an image sensor 101 , a travel sensor (not shown) and a control device (not shown).
  • the image sensor 101 is used to collect the image data of the scene around the movable platform 100
  • the driving sensor is used to collect the driving data of the movable platform
  • the control device is used to execute the data processing method described below. Repeat.
  • This application can be used to solve the automatic parking problem in the automatic driving function, and can be used for map construction in the process of short-distance automatic parking, for example, within 300 meters.
  • the map is mainly used to record various landmarks in the parking lot, including parking spaces, traffic signs, road lane lines, landmark buildings, etc. After the map is constructed, it can assist in the realization of functions such as parking lot location recognition in the process of automatic parking, automatic search for parking spaces in the map area, and locating vehicles at any location in the map area.
  • the application provides a data processing method
  • the execution subject of the method is a control device
  • the method specifically includes the following steps:
  • control device controls the image sensor located on the movable platform to collect multiple frames of image data of the space scene.
  • the image sensor on the movable platform is controlled to work, the image sensor collects image data of the space scene, and the image sensor transmits the image data to the control device.
  • the driving sensor collects the position information of the movable platform, transmits the collected position information to the control device, and the control device determines whether to enter a certain space scene according to the position information, and controls the movable platform when it is determined to enter the designated space scene.
  • the upper image sensor works, and the image sensor transmits the collected multi-frame image data to the control device.
  • control device processes the multi-frame image data to obtain map metadata.
  • the map metadata is used to generate map data, and the map metadata includes any one or a combination of three-dimensional feature points, texture data, and semantic information.
  • the three-dimensional feature points are used to reflect the position and shape of objects in the scene space
  • the texture data are used to reflect the surface information of the objects in the scene space
  • the semantic information is used to reflect the categories of objects represented by the texture data and the three-dimensional feature points.
  • map metadata is obtained by extracting two-dimensional feature data from image data, matching two-dimensional feature data in multi-frame image data, and semantic recognition.
  • step S203 enumerates a specific implementation:
  • the control device If the map metadata meets the quality requirements for mapping, the control device generates map data according to the map metadata.
  • the mapping quality requirement is used to determine whether the above-mentioned map metadata is rich enough, that is, whether the quantity of map metadata is sufficient, and whether the data types of the map metadata are sufficient.
  • the quality of the map data constructed using the map metadata will be higher, that is, the map data will more accurately describe the spatial scene. If the data volume of the map metadata is small and the types are single, the quality of the map data constructed by using the map metadata is low, that is, the map data cannot accurately describe the spatial scene.
  • the map metadata is processed to obtain map data, for example, each layer is obtained by image processing of the map metadata.
  • the map data is used to control the movement of the movable platform within the spatial scene.
  • the control device can control the movable platform to move within the space scene at the current moment according to the map data generated at the previous moment.
  • the control device can also control the movable platform to move in the space scene according to the generated map data when entering the space scene again next time.
  • the movable platform collects image data during the walking process, generates map data based on the image data, and then uses the collected map data to control its walking, which can meet the personalized map data requirements of the movable platform, and can directly use
  • the existing image sensors of the mobile platform collect data, and there is no need to configure high-cost sensors, such as lidar.
  • map data is generated according to the map metadata to ensure the accuracy of the generated map data.
  • FIG. 3 another embodiment of the present application provides a data processing method, the execution subject of the method is a control device, and the method specifically includes the following steps:
  • control device controls the image sensor located on the movable platform to collect multiple frames of image data of the space scene.
  • a plurality of image sensors are arranged on the movable platform, and the plurality of image sensors are located around the movable platform and are used to collect image data in the space scene where the movable platform is located.
  • the image sensor for the image data.
  • control device processes the multi-frame image data to obtain map metadata.
  • control device uses the identification of the image sensor to mark the source of the map metadata, and obtain the marked map metadata
  • the obtained map metadata is marked with an identifier of an image sensor that collects the image data. That is to mark the data source of the map metadata.
  • the control device If the marked map metadata meets the quality requirements for mapping, the control device generates marked map data according to the marked map metadata.
  • the mapping quality requirement is used to judge whether the marked map data is rich enough.
  • the map metadata includes any one or a combination of three-dimensional feature points, texture data, and semantic information.
  • the mapping quality requirements include at least one of the following: the total number of three-dimensional feature points reaches the first threshold; there are at least two three-dimensional feature points with different components on the three coordinate axes; the total number of texture data reaches the second threshold. the number threshold; the number of types of texture data reaches a third number threshold; the total number of semantic information reaches a fourth number threshold; and the number of types of semantic information reaches a fifth number threshold.
  • the three-dimensional feature points are sufficiently abundant in number.
  • it is determined whether there are at least two three-dimensional feature points with different components on the three coordinate axes it is determined whether the three-dimensional feature points are sufficiently rich in type. If all 3D feature points are located on the same plane, that is, all 3D feature points have the same component on one of the coordinate axes, for example: all 3D feature points have the same component in the z-axis direction, that is, 3D feature points can only represent A plane cannot represent a rich three-dimensional space scene.
  • the second quantity threshold By judging whether the total quantity of texture data reaches the second quantity threshold, it is determined whether the quantity of texture data is sufficiently abundant. By judging whether the number of types of texture data reaches a third quantity threshold, it is determined whether the texture data is rich enough in type.
  • map metadata it is determined whether the map metadata meets the quality requirements of map construction in combination with its richness in quantity and type.
  • the map data generated based on the map metadata can accurately reflect the spatial scene.
  • map data is generated according to the marked map metadata.
  • the process of generating map data specifically includes at least one of the following:
  • a marked feature data layer is generated according to the marked three-dimensional feature points and marked texture data; and a marked semantic information layer is generated according to the marked semantic information.
  • image processing is performed on the marked three-dimensional features and the marked texture data to obtain a marked feature data layer.
  • the marked semantic information is imaged to obtain the marked semantic information layer.
  • the parking space layer can be generated according to the marked semantic information, which specifically includes: extracting semantic information representing the parking space from the marked semantic information, and performing image processing to generate the parking space according to the semantic information of the represented parking space.
  • the marked parking space layer can be generated according to the marked semantic information, which specifically includes: extracting semantic information representing the parking space from the marked semantic information, and performing image processing to generate the parking space according to the semantic information of the represented parking space.
  • the marked map data includes marking information used to indicate the source of the data.
  • the map data can be filtered according to the marker information and moving direction of the map data, and then the filtered map data can be used to control the movement of the movable platform, so as to reduce the amount of data processing in the process of using the map data, so that the movable platform can Generate control instructions from map data more quickly.
  • map metadata it is judged whether the map metadata meets the requirements of mapping quality according to the richness of each map metadata in quantity and type, and map data that accurately reflects the spatial scene can be obtained according to the map metadata.
  • map data that accurately reflects the spatial scene can be obtained according to the map metadata.
  • the source tagging process is performed on the map metadata, so that the obtained map data can also reflect the data source.
  • the data can be filtered according to the data source to reduce the data processing amount, and then the control instructions can be quickly generated according to the map data. To control the precise movement of the movable platform.
  • FIG. 4 another embodiment of the present application provides a data processing method, the execution subject of the method is a control device, and the method specifically includes the following steps:
  • control device controls the image sensor located on the movable platform to collect multiple frames of image data of the space scene.
  • control device processes the multi-frame image data to obtain map metadata.
  • control device uses the identification of the image sensor to mark the source of the map metadata, and obtains the marked map metadata
  • the control device If the marked map metadata meets the quality requirements for mapping, the control device generates marked map data according to the marked map metadata.
  • control device acquires the moving direction of the movable platform and the real-time data collected by the image sensor.
  • control device can control the movable platform to move in the space scene at the current moment according to the map data generated at the previous moment. It is also possible to control the movable platform to move in the space scene according to the generated map data when entering the space scene again next time.
  • the moving direction of the movable platform is collected by the driving sensor, and the real-time data of the space scene is collected by the image sensor.
  • the control device controls the movable platform to move in the space scene in real time based on the moving direction, real-time data and map data of the movable platform.
  • control device acquires target data matching the moving direction from the marked map data according to the marking information.
  • the marker information of the map data is used to reflect the data source of the map data, that is, the image sensor that collects the image data corresponding to the map data can also be determined.
  • the installation position of the image sensor on the movable platform is fixed, and then the position information of the image sensor that collects the image data corresponding to the map data can be determined according to the marker information.
  • the target data is selected from the map data in combination with the moving direction of the movable platform and the above-mentioned position information.
  • the map data derived from the image sensor installed in front of the movable platform is obtained from the map data as the target data. If the moving direction of the movable vehicle is to walk backward, the map data derived from the image sensor installed behind the movable platform is obtained from the map data as the target data.
  • control device generates a control instruction according to the real-time data and the target data.
  • the real-time data collected by the image sensor is also image data
  • the control device performs feature extraction on the image data, and determines the location information of the movable platform according to the processed real-time data and map data, and then generates a mobile platform based on the location information and map data.
  • the processed real-time data and the map data are matched to obtain a matching result, and the location information of the movable platform is determined according to the successfully matched map data.
  • the reliability value of the target data is set according to the matching result.
  • the reliability value of the target data is set to the first reliability value
  • the reliability value of the target data is set to the second reliability value.
  • the first reliability value is greater than the second reliability value.
  • the control device After obtaining the reliable value of the target data, the control device counts the reliable value of the target data to obtain the reliability statistical result. When the reliability statistical result meets the low reliability condition, the target data is deleted to realize the optimization of the map data.
  • the reliability statistics result is the average value of the reliability value
  • the low reliability condition is that the average value of the reliability value is smaller than the preset average value.
  • the control device after the control device generates the map data with the source mark, selects target data from the map data according to the mark information, and controls the movable platform to move according to the target data and the real-time data collected by the image sensor, and obtains by screening Target data, reducing the amount of data processing during the use of map data, the control device can generate control instructions more quickly, so that the movable device can move in the space scene reliably.
  • the control device matches the real-time data with the target data, the target data is marked with the matching result, so as to realize the optimization of the map data.
  • the data processing method provided by the present application is described below by taking the movable platform as an intelligent car as an example.
  • the execution subject of the method is a control device in the intelligent car, such as a trip computer, and the method specifically includes the following steps:
  • control device controls the image sensor located on the movable platform to collect multiple frames of image data of the space scene.
  • the smart car is equipped with a monocular camera, such as a driving recorder, and a fisheye camera installed around the smart car.
  • the above cameras are used to collect image data in a certain spatial scene, such as an underground parking lot.
  • Smart cars are also equipped with driving sensors, such as low-precision inertial navigation units, odometers, GPS, etc.
  • the data processing method provided by the present application does not require the smart car to add new sensors, and the above sensors can be used to generate map data and control the driving of the smart vehicle.
  • control device processes the multi-frame image data to obtain map metadata.
  • the control device processes the multi-frame image data to obtain map metadata. Specifically include the following steps:
  • the VIO and VO algorithms can be used to process the image data and the data collected by the driving sensor to estimate the frame-to-frame pose between two frames of image data.
  • the frame-to-frame pose serves as the basis for image data processing. It is also possible to estimate the frame-to-frame pose using driving sensors, such as integrating the data collected by the odometer and the inertial measurement unit to obtain the frame-to-frame pose.
  • feature extraction is performed on the image data collected by the monocular camera and the image data collected by the fisheye camera.
  • geometric features are extracted from the image data, such as: object edges, corners, planes, salient points, special textures, and the like. It also extracts texture data, gradient data, pixel color and other data in the image data.
  • These characteristic information have the characteristics of time stability, angle stability, scale stability, etc., and can be observed stably and consistently at different angles, distances, and time periods.
  • the extracted texture data, gradient data, pixel color, etc. are also used to encode the two-dimensional feature data, so as to perform feature matching and build a map dictionary.
  • the image data collected by the fisheye camera for example: convert the image data under the fisheye camera model to the image data under the pinhole camera model, and convert the two-dimensional feature data in the pinhole image data to the image data under the pinhole camera model.
  • the two-dimensional feature data in the resulting fisheye image data can be fused, which is conducive to feature matching processing.
  • Timing correlation methods include algorithms such as inter-frame correlation, window correlation, and loopback correlation.
  • the inter-frame correlation is mainly for two adjacent images, for example: an image with an acquisition time interval of 50 milliseconds, or an image with a displayed position interval of 20 cm. Usually, there will be more feature correlations in the inter-frame matching.
  • Window correlation mainly refers to correlating all features within a period of time or distance. Through the quantitative statistics of feature correlation, performance indicators such as feature stability and consistency can be measured.
  • the two-dimensional feature data when a two-dimensional feature data can be associated with a large number of images within a window, such as 30 frames of images, the two-dimensional feature data is high-quality two-dimensional feature data, and has better robustness to temporal and spatial changes sex.
  • the calculation of three-dimensional feature points is based on the three-point coplanarity assumption, using two images at different positions to observe the same object, and calculate the three-dimensional coordinates of the object.
  • the image feature data is triangulated by using the inter-frame poses and matching results of two frames of image data to obtain three-dimensional feature points.
  • semantic information is mainly to extract the information of objects with clear categories in the spatial scene, such as: ground lane lines, parking spaces, indicating arrows, etc., air collision bars, hanging signs, large walls, pillars, etc.
  • Semantic information is usually a relatively stable element. Usually, only when the environment changes, such as parking lot maintenance and reconstruction, will the semantic information fail, which can accurately reflect the spatial scene.
  • image data will be collected by using cameras located in different directions, for example, using monocular cameras and fisheye cameras to collect image data.
  • marking is performed according to the characteristics of image data collected by different cameras.
  • map data filter different map metadata for matching according to the driving direction of the intelligent vehicle.
  • the control device If the marked map metadata meets the quality requirements for mapping, the control device generates marked map data according to the marked map metadata.
  • map metadata meets the requirements of mapping quality through information such as the richness of spatial three-dimensional feature points, the richness of semantic information, and the richness of texture data. If it does not meet the requirements, the control device will send out warning information to warn you to choose a more suitable space scene and build a map in a more suitable time period.
  • the marked map data is generated. More specifically, image processing is performed on the marked three-dimensional features and the marked texture data to obtain a marked feature data layer.
  • the marked semantic information is imaged to obtain the marked semantic information layer.
  • the corresponding layers can be generated based on individual requirements for the autonomous driving of smart cars. For example: building a dictionary of keyframes.
  • the process of generating map data is described below by taking the generation of map data of the parking lot as an example: when the intelligent vehicle drives along the path in the figure, the fisheye camera and the front-view camera collect the image data in the parking lot, and stitch the image data collected by the fisheye camera. A look-around top view is formed, and then the images collected by the look-up top view and the monocular camera are spliced into a ground image, and then the deep learning method is used to realize the identification and extraction of parking spaces, mainly including lane lines, parking spaces, ground indicating arrows and other information, as semantics important part of the map.
  • the identification and storage of valid parking spaces, invalid parking spaces, exclusive parking spaces, parking space numbers and other information in the map can be realized, which can be used for interactive selection of customers during automatic parking.
  • the map data can be optimized. Specifically, it includes: the position of the 3D point, the pose of the camera, and the quality of the semantic map.
  • control device acquires the moving direction of the movable platform and the real-time data collected by the image sensor.
  • control device obtains the driving direction of the smart car, and obtains the image data collected by the fisheye camera and the monocular camera.
  • the monocular camera when the intelligent vehicle is driving forward, the monocular camera will be used to collect the map data corresponding to the image data for positioning, and when the intelligent vehicle is driving backward, the map data corresponding to the image data collected by the fisheye camera will be used for positioning.
  • the map data corresponding to the image data collected by the fisheye camera In order to make intelligent vehicles have faster and more powerful positioning capabilities.
  • the control device generates a control instruction according to the real-time data and the target data.
  • the control device matches the real-time data with the map data to obtain a matching result, determines the location information of the intelligent vehicle according to the successfully matched map data, and then controls the intelligent vehicle to drive according to the location information of the intelligent vehicle.
  • the control device after the control device generates the map data with the source mark, the target data is selected from the map data according to the mark information, and the driving of the vehicle is controlled according to the target data and the real-time data collected by the image sensor, so as to reduce the process of using the map data.
  • the driving of the smart car In order to realize the rapid positioning of the smart car, and then control the driving of the smart car more reliably.
  • FIG. 5 another embodiment of the present application provides a data processing method.
  • the execution body of the method is a control device in a smart car, such as a trip computer.
  • the method specifically includes the following steps:
  • the movable platform is a smart car
  • a driving recorder is installed in front of the smart car
  • a pinhole camera in the driving recorder is used as an image sensor.
  • a fisheye camera is installed around the smart car, which also acts as an image sensor. Pinhole cameras and fisheye cameras are used to collect image data of spatial scenes.
  • the smart car is also equipped with an odometer, an Inertial Measurement Unit (IMU) and GPS.
  • the inertial measurement unit, GPS and odometer are used to collect the driving data of the smart car, such as acceleration, speed, mileage, data such as driving location.
  • the inter-frame pose is estimated based on the multi-frame image data, the odometer and the driving data collected by the IMU.
  • VINS monocular visual inertial system
  • VIO visual inertial odometer
  • VO visual odometer
  • this step is mainly to perform feature extraction on the image data collected by the pinhole camera and the image data collected by the fisheye camera to obtain feature data.
  • the feature data is two-dimensional feature data.
  • Two-dimensional feature data includes geometric feature data, such as: object edges, corners, planes, salient points, special textures and other features. These characteristic information have the characteristics of time stability, angle stability, scale stability, etc., and are relatively stable at different angles, distances, and time periods, and maintain consistent observability.
  • the two-dimensional feature data is effectively expressed, using texture data, gradient data, pixel color data, etc. to encode the two-dimensional feature data, and the encoded two-dimensional feature data is used for feature matching and dictionary data.
  • the image data collected by the fisheye camera is corrected, so that the image data under the fisheye camera model is converted to the pinhole camera model.
  • the two-dimensional features of the image data collected by the pinhole camera The 2D feature data of the data and the image data collected by the fisheye camera can be fused to achieve feature matching with higher stability and consistency.
  • the semantic information processing part is mainly to process the objects with clear meaning in the image data of the spatial scene, such as the ground objects such as lane lines, parking spaces, indicating arrows, anti-collision bars, hanging signs, large walls, pillars and other spaces in the space.
  • Class recognition of objects is performed to obtain semantic information.
  • Semantic information is usually a relatively stable element, and it is usually only in the case of large-scale environmental changes, such as parking lot maintenance and reconstruction, that semantic information is unavailable.
  • the semantic information extraction of the look-up top view mainly includes the extraction of semantic information such as lane lines, parking spaces, ground indicating arrows, etc., which can be used as an important part of the semantic information layer.
  • the identification and storage of valid parking spaces, invalid parking spaces, exclusive parking spaces, parking space numbers and other information in the map can be realized, which can be used for interactive selection of customers during automatic parking.
  • the matching of two-dimensional and feature data is mainly to perform time series association on the extracted two-dimensional feature data.
  • Algorithms such as inter-frame association, window association, and loopback association can be used for matching.
  • the inter-frame correlation is mainly to correlate two adjacent images, for example, the interval is 50 milliseconds, or the interval is 20 centimeters. Usually, there will be more feature correlations in the inter-frame matching.
  • Window correlation is mainly to correlate all features within a fixed or non-fixed time or within a fixed or non-fixed distance range, and obtain performance indicators such as the stability and consistency of the feature by counting the number of feature correlations. For example, when a two-dimensional feature data can be associated with a large number of images within a window, such as 30 frames of images, it indicates that the two-dimensional feature data is of very high quality and has good robustness to temporal and spatial changes.
  • Loopback matching means that data may be collected multiple times in the same spatial scene.
  • loopback matching means that data may be collected multiple times in the same spatial scene.
  • it can not only identify whether a map has been constructed at the current location, but also effectively fuse and update the image data of the spatial scene observed multiple times.
  • S606 construct a dictionary of key frames in the map data based on the two-dimensional feature data extracted in S602.
  • constructing the key frame dictionary refers to clustering the features of the image data of the spatial scene, and expressing the current scene by using a combination of multiple two-dimensional feature data.
  • the construction of the key frame dictionary can use not only two-dimensional feature data, but also semantic information, deep learning descriptors, and so on.
  • the functions of the expression of the dictionary include: First, the expression of the richness of the scene. If the keyframe dictionary in the map data of a spatial scene is rich, the map data representing the spatial scene has rich texture data, geometric features, semantic information, etc. When the richness of the key frame dictionary in the map data of the spatial scene is low, it means that the quality of the map data of the spatial scene is poor, and the user cannot use the map to control the intelligent vehicle, such as automatic parking, which promotes the user's expected management. Second, the keyframe dictionary can be used for position recognition during parking relocation. During relocation initialization, the vehicle needs to find its current location on the map. Through the matching of the key frame dictionary, the approximate current position of the vehicle in the map can be quickly found, and then accurate position estimation can be achieved through the matching of semantic information and two-dimensional feature data.
  • the splicing of the image data collected by looking around the fisheye and the image data collected by the pinhole camera can reflect the complete ground image at a long distance.
  • the deep learning method is used to realize the identification and extraction of parking spaces, and the deep learning method will not be repeated here.
  • obtaining three-dimensional feature points according to the feature matching result is to realize triangulation of feature data, that is, to convert two-dimensional feature data into three-dimensional feature data.
  • the triangulation step is mainly based on the three-point coplanarity assumption, and the three-dimensional coordinates of the object are calculated by using the images obtained by observing the same object at two different positions.
  • the traditional SFM technology needs to estimate the pose between the cameras and the three-dimensional point cloud at the same time.
  • the two-dimensional feature data is matched based on the inter-frame pose, and then the feature data is triangulated quickly based on the matching result.
  • the map metadata refers to the three-dimensional feature points, semantic information and texture data obtained in the above steps. Through the number of 3D feature points, the richness of 3D feature points in space, the richness of semantic features, the richness of scene texture, the environmental illumination and other information, comprehensively judge the construction quality of map data constructed based on the currently obtained map metadata. Whether it meets the quality requirements of the drawing.
  • the specific mapping quality requirements have been described in detail in the foregoing embodiments, and will not be repeated this time.
  • control device will issue a warning message to warn the user to change the spatial scene, better weather, and better time to build the map.
  • the map data constructed based on the map metadata can meet the needs of automatic parking when it is used for automatic parking control, and the quality of the map data will be optimized uniformly.
  • the optimization contents include: the position of 3D feature points, the pose of the camera, and the quality of the semantic layer. For example: optimization of the position, angle, size, category, etc. of semantic elements.
  • the map data is stored in different layers, such as: feature data layer, semantic information layer, navigation data layer, key frame dictionary layer, etc. Different layers can be used to provide data for the automatic parking location function.
  • the map metadata of the image data collected by different cameras will be encoded, and the map metadata of the image data collected by different cameras will be used for matching in different positioning stages. For example, when the vehicle is moving forward, the map metadata of the image data collected by the front-view camera is used for matching and positioning, and when the vehicle is moving backward, the map metadata of the image data collected by the rear-view fisheye camera is used for matching and positioning to achieve more Fast and powerful map building capabilities.
  • the control device When using map data for automatic parking, the control device will increase the weight of the matched map metadata, while the unmatched map metadata will reduce the weight, and add a certain amount of high-quality map metadata to the map to realize the map.
  • the timely update of map metadata ensures the long-term timeliness and high quality of the map.
  • pure visual lightweight map data is constructed, and complex and manual intervention of high-precision map construction is avoided.
  • the constructed map data has extremely low hardware cost requirements, and can quickly realize high-quality map data construction. .
  • the constructed map data can be effectively used for positioning problems in the process of automatic parking. Through a lot of learning, it can realize functions such as exclusive parking space selection, selecting any parking space for parking, parking lot cruise parking and other functions, which can help people very well. It also provides map navigation and real-time path planning for functions such as valet parking and memory parking.
  • control device 700 specifically includes: a memory 701 for storing instructions and a processor 702 for executing the instructions stored in the memory, and the processor 702 is used for specific execution :
  • map metadata includes any one or a combination of three-dimensional feature points, texture data and semantic information
  • map data is generated according to the map metadata; wherein the map data is used to control the movable platform to move within the spatial scene.
  • mapping quality requirements include at least one of the following:
  • the total number of three-dimensional feature points reaches the first number threshold
  • the total amount of texture data reaches the second amount threshold
  • the number of types of texture data reaches the third number threshold
  • the total quantity of semantic information reaches the fourth quantity threshold
  • the number of kinds of semantic information reaches the fifth number threshold.
  • processor 702 is configured to specifically execute:
  • map metadata including:
  • the marked map data includes marking information used to indicate the source of the data.
  • the processor 702 is configured to specifically execute at least one of the following:
  • the marked feature data layer is generated
  • processor 702 is configured to specifically execute:
  • control instructions Generate control instructions according to real-time data and target data; wherein, the control instructions are used to control the movement of the movable platform.
  • processor 702 is configured to specifically execute:
  • the map data from the image sensor installed in front of the movable platform is obtained from the map data as the target data;
  • the map data derived from the image sensor installed behind the movable platform is obtained from the map data as the target data.
  • processor 702 is configured to specifically execute:
  • Control commands are generated according to the position information of the movable platform.
  • processor 702 is configured to specifically execute:
  • processor 702 is configured to specifically execute:
  • the first reliability value is greater than the second reliability value.
  • processor 702 is configured to specifically execute:
  • the image sensor includes a monocular camera and/or a fisheye camera.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Image Analysis (AREA)
  • Image Processing (AREA)

Abstract

A data processing method and a device. The method comprises: when a movable platform moves in a space scene, controlling an image sensor, which is located on the movable platform, to capture multi-frame image data of the space scene; processing the multi-frame image data to obtain map metadata, wherein the map metadata comprises any one or a combination of a three-dimensional feature point, texture data, and semantic information; determining whether the map metadata meets a mapping quality requirement; and if the map metadata meets the mapping quality requirement, generating map data according to the map metadata, wherein the map data is used for controlling the movable platform to move in the space scene. By means of this solution, a customization requirement of a mobile platform for map data can be met, and the accuracy of the generated map data can also be ensured.

Description

数据处理方法和设备Data processing method and device 技术领域technical field
本申请涉及自动驾驶技术领域,尤其涉及一种数据处理方法和设备。The present application relates to the technical field of automatic driving, and in particular, to a data processing method and device.
背景技术Background technique
高精地图数据是车辆自动驾驶的重要基础,对智能汽车发展意义重大。High-precision map data is an important basis for vehicle autonomous driving and is of great significance to the development of intelligent vehicles.
高精地图数据通常是由地图供应商提供。地图供应商通常只能提供使用量比较大的高精地图数据,而不会提供使用量比较小的高精地图数据。HD map data is usually provided by map providers. Map suppliers usually only provide high-precision map data with relatively large usage, but do not provide high-precision map data with relatively small usage.
然而,智能汽车对高精地图的需求是不同的,现有的高精地图供给方式无法适应智能汽车的个性化的地图需求。However, smart cars have different requirements for high-precision maps, and the existing high-precision map supply methods cannot adapt to the personalized map requirements of smart cars.
发明内容SUMMARY OF THE INVENTION
本申请实施例提供一种数据处理方法和设备,旨在提供一种可适应不同的可移动平台的个性化地图需求的方案。The embodiments of the present application provide a data processing method and device, aiming to provide a solution that can adapt to the personalized map requirements of different mobile platforms.
第一方面,本申请提供一种数据处理方法,包括:In a first aspect, the present application provides a data processing method, comprising:
当可移动平台在空间场景内移动时,控制位于可移动平台上的图像传感器采集空间场景的多帧图像数据;When the movable platform moves in the space scene, control the image sensor located on the movable platform to collect multiple frames of image data of the space scene;
对多帧图像数据进行处理获得地图元数据;其中,地图元数据包括三维特征点、纹理数据以及语义信息中任意一种或多种组合;Process the multi-frame image data to obtain map metadata; wherein, the map metadata includes any one or a combination of three-dimensional feature points, texture data and semantic information;
判断所述地图元数据是否满足建图质量要求;Judging whether the map metadata meets the quality requirements for mapping;
若地图元数据满足建图质量要求,根据地图元数据生成地图数据;其中,地图数据用于控制可移动平台在空间场景内移动。If the map metadata meets the mapping quality requirements, map data is generated according to the map metadata; wherein the map data is used to control the movable platform to move within the spatial scene.
第二方面,本申请提供一种控制设备,包括:用于存储指令的存储器和用于执行存储在存储器中指令的处理器,处理器用于具体执行:In a second aspect, the present application provides a control device, comprising: a memory for storing instructions and a processor for executing the instructions stored in the memory, where the processor is used to specifically execute:
当可移动平台在空间场景内移动时,控制位于可移动平台上的图像传感器采集空间场景的多帧图像数据;When the movable platform moves in the space scene, control the image sensor located on the movable platform to collect multiple frames of image data of the space scene;
对多帧图像数据进行处理获得地图元数据;其中,地图元数据包括三维特征点、纹理数据以及语义信息中任意一种或多种组合;Process the multi-frame image data to obtain map metadata; wherein, the map metadata includes any one or a combination of three-dimensional feature points, texture data and semantic information;
判断所述地图元数据是否满足建图质量要求;Judging whether the map metadata meets the quality requirements for mapping;
若地图元数据满足所述建图质量要求,根据地图元数据生成地图数据;其中,地图数据用于控制可移动平台在空间场景内移动。If the map metadata meets the mapping quality requirements, map data is generated according to the map metadata; wherein the map data is used to control the movable platform to move within the spatial scene.
第三方面,本申请提供一种可移动平台,包括图像传感器以及第二方面所涉及的数据处理方法。In a third aspect, the present application provides a movable platform including an image sensor and the data processing method involved in the second aspect.
综上所述,本申请实施例提供数据处理方法和设备,可移动平台在移动时采集空间场景的图像数据,并基于图像数据生成地图数据,再使用所采集地图数据控制其运动,可满足可移动平台个性化地图数据需求。基于图像数据生成地图数据,可使用可移动平台上已有图象传感器,无需再配置激光雷达等高成本传感器采集点云,降低地图建构成本。并且,在判断地图元数据满足建图质量要求后再根据地图元数据生成地图数据,保证可移动平台所生成地图数据的准确度。此外,本申请中基于图像数据生成地图数据,地图数据存储更轻量,对于地图的实时更新、维护极具便利。To sum up, the embodiments of the present application provide a data processing method and device. The movable platform collects image data of a spatial scene when moving, generates map data based on the image data, and then uses the collected map data to control its movement, which can meet the requirements of the mobile platform. Personalized map data requirements for mobile platforms. To generate map data based on image data, the existing image sensors on the mobile platform can be used, and there is no need to configure high-cost sensors such as lidar to collect point clouds, reducing the cost of map construction. Moreover, after judging that the map metadata meets the quality requirements for mapping, the map data is generated according to the map metadata, so as to ensure the accuracy of the map data generated by the mobile platform. In addition, in this application, map data is generated based on image data, and the storage of map data is lighter, which is very convenient for real-time update and maintenance of maps.
附图说明Description of drawings
为了更清楚地说明本申请实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图是本申请的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to more clearly illustrate the embodiments of the present application or the technical solutions in the prior art, the following briefly introduces the accompanying drawings that need to be used in the description of the embodiments or the prior art. Obviously, the drawings in the following description are For some embodiments of the present application, for those of ordinary skill in the art, other drawings can also be obtained according to these drawings without any creative effort.
图1为本申请一实施例提供的可移动平台的结构示意图;FIG. 1 is a schematic structural diagram of a movable platform according to an embodiment of the present application;
图2为本申请另一实施例提供的数据处理方法的流程示意图;2 is a schematic flowchart of a data processing method provided by another embodiment of the present application;
图3为本申请另一实施例提供的数据处理方法的流程示意图;3 is a schematic flowchart of a data processing method provided by another embodiment of the present application;
图4为本申请另一实施例提供的数据处理方法的流程示意图;4 is a schematic flowchart of a data processing method provided by another embodiment of the present application;
图5为本申请另一实施例提供的数据处理方法的流程示意图;5 is a schematic flowchart of a data processing method provided by another embodiment of the present application;
图6为本申请另一实施例提供的控制设备的结构示意图。FIG. 6 is a schematic structural diagram of a control device provided by another embodiment of the present application.
具体实施方式Detailed ways
为使本申请实施例的目的、技术方案和优点更加清楚,下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本申请一部分实施例,而不是全部的实施例。基于 本申请中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。In order to make the purposes, technical solutions and advantages of the embodiments of the present application clearer, the technical solutions in the embodiments of the present application will be described clearly and completely below with reference to the drawings in the embodiments of the present application. Obviously, the described embodiments It is a part of the embodiments of the present application, but not all of the embodiments. Based on the embodiments in this application, all other embodiments obtained by those of ordinary skill in the art without creative work fall within the protection scope of this application.
需要说明的是,当组件被称为“固定于”另一个组件,它可以直接在另一个组件上或者也可以存在居中的组件。当一个组件被认为是“连接”另一个组件,它可以是直接连接到另一个组件或者可能同时存在居中组件。It should be noted that when a component is referred to as being "fixed to" another component, it can be directly on the other component or there may also be a centered component. When a component is considered to be "connected" to another component, it may be directly connected to the other component or there may be a co-existence of an intervening component.
除非另有定义,本文所使用的所有的技术和科学术语与属于本申请的技术领域的技术人员通常理解的含义相同。本文中在本申请的说明书中所使用的术语只是为了描述具体的实施例的目的,不是旨在于限制本申请。本文所使用的术语“及/或”包括一个或多个相关的所列项目的任意的和所有的组合。Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the technical field to which this application belongs. The terms used herein in the specification of the application are for the purpose of describing specific embodiments only, and are not intended to limit the application. As used herein, the term "and/or" includes any and all combinations of one or more of the associated listed items.
下面结合附图,对本申请的一些实施方式作详细说明。在不冲突的情况下,下述的实施例及实施例中的特征可以相互组合。Some embodiments of the present application will be described in detail below with reference to the accompanying drawings. The embodiments described below and features in the embodiments may be combined with each other without conflict.
现有的高精地图供给方式无法适应智能汽车的个性化的地图需求。为解决上述技术问题,本申请提供一种数据处理方法和设备。本申请的技术构思为:由可移动平台在空间场景内行走时采集空间场景的图像数据,并使用采集的数据生成该地图数据,可以适应可移动平台的对地图数据的个性化需求。在生成地图数据前判断用于生成的地图元数据是否满足建图质量要求,以保证可移动平台生成的地图数据可以准确反应空间场景。且基于图像传感器采集的图像数据生成地图数据,无需高成本的图像传感器,降低地图构建成本。The existing high-precision map supply methods cannot meet the personalized map requirements of smart cars. To solve the above technical problems, the present application provides a data processing method and device. The technical idea of the present application is: the image data of the space scene is collected by the movable platform while walking in the space scene, and the collected data is used to generate the map data, which can adapt to the personalized needs of the movable platform for map data. Before generating map data, it is judged whether the generated map metadata meets the requirements of mapping quality, so as to ensure that the map data generated by the mobile platform can accurately reflect the spatial scene. And the map data is generated based on the image data collected by the image sensor, no high-cost image sensor is needed, and the cost of map construction is reduced.
如图1所示,本申请一实施例提供一种可移动平台100,可移动平台100包括图像传感器101、行驶传感器(图未示)以及控制设备(图未示)。图像传感器101用于采集可移动平台100周围场景的图像数据,行驶传感器用于采集可移动平台的行驶数据,控制设备用于执行如下所描述的数据处理方法,详细信息参考如下描述,此处不再赘述。As shown in FIG. 1 , an embodiment of the present application provides a movable platform 100 . The movable platform 100 includes an image sensor 101 , a travel sensor (not shown) and a control device (not shown). The image sensor 101 is used to collect the image data of the scene around the movable platform 100, the driving sensor is used to collect the driving data of the movable platform, and the control device is used to execute the data processing method described below. Repeat.
本申请可以用于解决自动驾驶功能中的自动泊车问题,可以用于短距离自动泊车过程中的地图构建,例如:300米以内。该地图主要用于记录停车场内各种标志物,包括停车位、交通标志、道路车道线、地标建筑物等。在地图构建之后,可辅助实现自动泊车过程中的停车场位置识别、地图区域内停车位自动搜索、定位地图区域内任意位置车辆等功能的实现。This application can be used to solve the automatic parking problem in the automatic driving function, and can be used for map construction in the process of short-distance automatic parking, for example, within 300 meters. The map is mainly used to record various landmarks in the parking lot, including parking spaces, traffic signs, road lane lines, landmark buildings, etc. After the map is constructed, it can assist in the realization of functions such as parking lot location recognition in the process of automatic parking, automatic search for parking spaces in the map area, and locating vehicles at any location in the map area.
如图2所示,本申请提供一种数据处理方法,该方法的执行主体为控制 设备,该方法具体包括如下步骤:As shown in Figure 2, the application provides a data processing method, the execution subject of the method is a control device, and the method specifically includes the following steps:
S201、当可移动平台在空间场景内移动时,控制设备控制位于可移动平台上的图像传感器采集空间场景的多帧图像数据。S201. When the movable platform moves in the space scene, the control device controls the image sensor located on the movable platform to collect multiple frames of image data of the space scene.
其中,当可移动平台进入某一空间场景时,控制可移动平台上图像传感器工作,图像传感器采集空间场景的图像数据,图像传感器将图像数据传输至控制设备。Wherein, when the movable platform enters a certain space scene, the image sensor on the movable platform is controlled to work, the image sensor collects image data of the space scene, and the image sensor transmits the image data to the control device.
优选地,行驶传感器采集可移动平台的位置信息,将采集到的位置信息传输至控制设备,控制设备根据位置信息判断是否进入某一空间场景内,在确定进入指定空间场景时,控制可移动平台上图像传感器工作,图像传感器将采集到的多帧图像数据传输至控制设备。Preferably, the driving sensor collects the position information of the movable platform, transmits the collected position information to the control device, and the control device determines whether to enter a certain space scene according to the position information, and controls the movable platform when it is determined to enter the designated space scene. The upper image sensor works, and the image sensor transmits the collected multi-frame image data to the control device.
S202、控制设备对多帧图像数据进行处理获得地图元数据。S202, the control device processes the multi-frame image data to obtain map metadata.
其中,地图元数据用于生成地图数据,地图元数据包括三维特征点、纹理数据以及语义信息中任意一种或多种组合。The map metadata is used to generate map data, and the map metadata includes any one or a combination of three-dimensional feature points, texture data, and semantic information.
三维特征点用于反映场景空间内物体的位置和形状,纹理数据用于反映场景空间内物体的表面信息,语义信息用于反映纹理数据和三维特征点所表示物体的类别。The three-dimensional feature points are used to reflect the position and shape of objects in the scene space, the texture data are used to reflect the surface information of the objects in the scene space, and the semantic information is used to reflect the categories of objects represented by the texture data and the three-dimensional feature points.
通过对图像数据进行提取二维特征数据、多帧图像数据中二维特征数据匹配、以及语义识别等处理,获得上述地图元数据。The above-mentioned map metadata is obtained by extracting two-dimensional feature data from image data, matching two-dimensional feature data in multi-frame image data, and semantic recognition.
进一步的,判断所述地图元数据是否满足建图质量要求,步骤S203列举了一种具体的实施方式:Further, to determine whether the map metadata meets the quality requirements for mapping, step S203 enumerates a specific implementation:
S203、若地图元数据满足建图质量要求,控制设备根据地图元数据生成地图数据。S203. If the map metadata meets the quality requirements for mapping, the control device generates map data according to the map metadata.
其中,建图质量要求用于确定上述地图元数据是否足够丰富,也就是地图元数据的数量是否足够多,以及地图元数据的数据种类是否足够多。The mapping quality requirement is used to determine whether the above-mentioned map metadata is rich enough, that is, whether the quantity of map metadata is sufficient, and whether the data types of the map metadata are sufficient.
若地图元数据足够丰富,使用该地图元数据构建的地图数据的质量越高,也就是地图数据更加准确描述该空间场景。若地图元数据的数据量小且种类单一,使用该地图元数据构建的地图数据的质量较低,也就是地图数据无法准确描述该空间场景。If the map metadata is rich enough, the quality of the map data constructed using the map metadata will be higher, that is, the map data will more accurately describe the spatial scene. If the data volume of the map metadata is small and the types are single, the quality of the map data constructed by using the map metadata is low, that is, the map data cannot accurately describe the spatial scene.
当地图元数据满足建图质量要求时,对地图元数据进行处理获得地图数据,例如:对地图元数据进行图像化处理获得各个图层。When the map metadata meets the mapping quality requirements, the map metadata is processed to obtain map data, for example, each layer is obtained by image processing of the map metadata.
地图数据用于控制可移动平台在空间场景内移动。控制设备可以根据上一时刻生成的地图数据控制可移动平台在当前时刻在空间场景内移动。控制设备也可以在下一次再次进入空间场景时,根据所生成的地图数据控制可移动平台在空间场景内移动。The map data is used to control the movement of the movable platform within the spatial scene. The control device can control the movable platform to move within the space scene at the current moment according to the map data generated at the previous moment. The control device can also control the movable platform to move in the space scene according to the generated map data when entering the space scene again next time.
在上述技术方案中,由可移动平台采集行走过程中图像数据,并根据图像数据生成的地图数据,再使用所采集地图数据控制其行走,可满足可移动平台个性化地图数据需求,且直接利用可移动平台已有的图像传感器采集数据,无需配置高成本的传感器,例如:激光雷达等。另外,在判断地图元数据满足建图质量要求后,再根据地图元数据生成地图数据,保证所生成地图数据的准确度。In the above technical solution, the movable platform collects image data during the walking process, generates map data based on the image data, and then uses the collected map data to control its walking, which can meet the personalized map data requirements of the movable platform, and can directly use The existing image sensors of the mobile platform collect data, and there is no need to configure high-cost sensors, such as lidar. In addition, after judging that the map metadata meets the quality requirements for mapping, map data is generated according to the map metadata to ensure the accuracy of the generated map data.
如图3所示,本申请另一实施例提供一种数据处理方法,该方法的执行主体为控制设备,该方法具体包括如下步骤:As shown in FIG. 3 , another embodiment of the present application provides a data processing method, the execution subject of the method is a control device, and the method specifically includes the following steps:
S301、当可移动平台在空间场景内移动时,控制设备控制位于可移动平台上的图像传感器采集空间场景的多帧图像数据。S301. When the movable platform moves in the space scene, the control device controls the image sensor located on the movable platform to collect multiple frames of image data of the space scene.
其中,可移动平台上布置有多个图像传感器,多个图像传感器位于可移动平台的四周,用于采集可移动平台所处空间场景内的图像数据,控制设备在获取图像数据时,同时获取采集该图像数据的图像传感器。Among them, a plurality of image sensors are arranged on the movable platform, and the plurality of image sensors are located around the movable platform and are used to collect image data in the space scene where the movable platform is located. The image sensor for the image data.
S302、控制设备对多帧图像数据进行处理获得地图元数据。S302, the control device processes the multi-frame image data to obtain map metadata.
其中,该步骤已经在上述实施例中详细说明,此处不再赘述。Wherein, this step has been described in detail in the above embodiments, and will not be repeated here.
S303、控制设备使用图像传感器的标识对地图元数据进行来源标记,获得标记后的地图元数据;S303, the control device uses the identification of the image sensor to mark the source of the map metadata, and obtain the marked map metadata;
其中,在对多帧图像数据进行处理获得地图元数据后,使用采集该图像数据的图像传感器的标识对获得的地图元数据进行标记。也就是对地图元数据进行数据来源标记。Wherein, after multiple frames of image data are processed to obtain map metadata, the obtained map metadata is marked with an identifier of an image sensor that collects the image data. That is to mark the data source of the map metadata.
S304、若标记后的地图元数据满足建图质量要求,控制设备根据标记后的地图元数据生成标记后的地图数据。S304. If the marked map metadata meets the quality requirements for mapping, the control device generates marked map data according to the marked map metadata.
其中,建图质量要求用于判断标记后的地图数据是否足够丰富。地图元数据包括三维特征点、纹理数据以及语义信息中任意一种或多种组合。Among them, the mapping quality requirement is used to judge whether the marked map data is rich enough. The map metadata includes any one or a combination of three-dimensional feature points, texture data, and semantic information.
建图质量要求包括以下至少一项:三维特征点的总数量达到第一数量阈 值;至少存在两个在三个坐标轴上的分量均不相同的三维特征点;纹理数据的总数量达到第二数量阈值;纹理数据的种类数量达到第三数量阈值;语义信息的总数量达到第四数量阈值;以及语义信息的种类数量达到第五数量阈值。The mapping quality requirements include at least one of the following: the total number of three-dimensional feature points reaches the first threshold; there are at least two three-dimensional feature points with different components on the three coordinate axes; the total number of texture data reaches the second threshold. the number threshold; the number of types of texture data reaches a third number threshold; the total number of semantic information reaches a fourth number threshold; and the number of types of semantic information reaches a fifth number threshold.
通过判断三维特征点的总数量是否达到第一数量阈值,确定三维特征点在数量上是否足够丰富。通过判断是否至少存在两个在三个坐标轴上的分量均不相同的三维特征点,确定三维特征点在类型上是否足够丰富。若所有三维特征点都位于同一个平面,也就是所有三维特征点在其中一个坐标轴上的分量相同,例如:所有三维特征点的z轴方向上的分量相同,也就是三维特征点仅能表示一个平面,而无法表示丰富的三维空间场景。By judging whether the total number of three-dimensional feature points reaches the first number threshold, it is determined whether the three-dimensional feature points are sufficiently abundant in number. By judging whether there are at least two three-dimensional feature points with different components on the three coordinate axes, it is determined whether the three-dimensional feature points are sufficiently rich in type. If all 3D feature points are located on the same plane, that is, all 3D feature points have the same component on one of the coordinate axes, for example: all 3D feature points have the same component in the z-axis direction, that is, 3D feature points can only represent A plane cannot represent a rich three-dimensional space scene.
通过判断纹理数据的总数量是否达到第二数量阈值,确定纹理数据在数量上是否足够丰富。通过判断纹理数据的种类数量是否达到第三数量阈值,确定纹理数据在类型上是否足够丰富。By judging whether the total quantity of texture data reaches the second quantity threshold, it is determined whether the quantity of texture data is sufficiently abundant. By judging whether the number of types of texture data reaches a third quantity threshold, it is determined whether the texture data is rich enough in type.
通过判断语义信息的总数量是否达到第四数量阈值,确定纹理数据在数量上是否足够丰富。通过判断语义信息的种类数量是否达到第五数量阈值,确定语义信息在类型上是否足够丰富。By judging whether the total quantity of semantic information reaches the fourth quantity threshold, it is determined whether the texture data is sufficiently abundant in quantity. By judging whether the number of types of semantic information reaches the fifth quantity threshold, it is determined whether the types of semantic information are rich enough.
针对上述地图元数据,结合其在数量和类型上的丰富程度,确定地图元数据是否满足建图质量要求,基于该地图元数据生成的地图数据可以准确反应空间场景。For the above-mentioned map metadata, it is determined whether the map metadata meets the quality requirements of map construction in combination with its richness in quantity and type. The map data generated based on the map metadata can accurately reflect the spatial scene.
在得到满足建图质量要求的地图元数据后,根据标记后的地图元数据生成地图数据。生成地图数据的过程具体包括如下至少一项:After obtaining map metadata that meets the requirements of mapping quality, map data is generated according to the marked map metadata. The process of generating map data specifically includes at least one of the following:
根据标记后的三维特征点和标记后的纹理数据生成标记后的特征数据图层;以及根据标记后的语义信息生成标记后的语义信息图层。A marked feature data layer is generated according to the marked three-dimensional feature points and marked texture data; and a marked semantic information layer is generated according to the marked semantic information.
优选地,对标记后的三维特点和标记后的纹理数据进行图像化处理,获得标记后的特征数据图层。对标记后的语义信息进行图像化处理,获得标记后的语义信息图层。Preferably, image processing is performed on the marked three-dimensional features and the marked texture data to obtain a marked feature data layer. The marked semantic information is imaged to obtain the marked semantic information layer.
在其他实施例中,可根据标记后的语义信息生成停车位图层,具体包括:从标记后的语义信息中提取表示停车位的语义信息,根据所表示停车位的语义信息进行图像化处理生成标记后的停车位图层。In other embodiments, the parking space layer can be generated according to the marked semantic information, which specifically includes: extracting semantic information representing the parking space from the marked semantic information, and performing image processing to generate the parking space according to the semantic information of the represented parking space. The marked parking space layer.
其中,标记后的地图数据包括用于指示数据来源的标记信息。在使用地 图数据时,可以根据地图数据的标记信息和移动方向筛选地图数据,再使用筛选得到的地图数据控制可移动平台移动,实现减少地图数据使用过程的数据处理量,以使可移动平台可以更快速地根据地图数据生成控制指令。Wherein, the marked map data includes marking information used to indicate the source of the data. When using map data, the map data can be filtered according to the marker information and moving direction of the map data, and then the filtered map data can be used to control the movement of the movable platform, so as to reduce the amount of data processing in the process of using the map data, so that the movable platform can Generate control instructions from map data more quickly.
在上述技术方案中,结合各个地图元数据在数量和类型上的丰富程度判断地图元数据是否满足建图质量要求,可根据该地图元数据获得准确反应空间场景的地图数据。此外,对地图元数据进行来源标记处理,使得所获得的地图数据也可以体现数据来源,在使用地图数据时可以根据数据来源筛选数据,减少数据处理量,进而可根据地图数据快速生成控制指令,以控制可移动平台精准移动。In the above technical solution, it is judged whether the map metadata meets the requirements of mapping quality according to the richness of each map metadata in quantity and type, and map data that accurately reflects the spatial scene can be obtained according to the map metadata. In addition, the source tagging process is performed on the map metadata, so that the obtained map data can also reflect the data source. When using the map data, the data can be filtered according to the data source to reduce the data processing amount, and then the control instructions can be quickly generated according to the map data. To control the precise movement of the movable platform.
如图4所示,本申请另一实施例提供一种数据处理方法,该方法的执行主体为控制设备,该方法具体包括如下步骤:As shown in FIG. 4 , another embodiment of the present application provides a data processing method, the execution subject of the method is a control device, and the method specifically includes the following steps:
S401、当可移动平台在空间场景内移动时,控制设备控制位于可移动平台上的图像传感器采集空间场景的多帧图像数据。S401. When the movable platform moves in the space scene, the control device controls the image sensor located on the movable platform to collect multiple frames of image data of the space scene.
S402、控制设备对多帧图像数据进行处理获得地图元数据。S402, the control device processes the multi-frame image data to obtain map metadata.
S403、控制设备使用图像传感器的标识对地图元数据进行来源标记,获得标记后的地图元数据、S403, the control device uses the identification of the image sensor to mark the source of the map metadata, and obtains the marked map metadata,
S404、若标记后的地图元数据满足建图质量要求,控制设备根据标记后的地图元数据生成标记后的地图数据。S404. If the marked map metadata meets the quality requirements for mapping, the control device generates marked map data according to the marked map metadata.
其中,S401至S404已经在上述实施例中详细说明,此处不再赘述。Among them, S401 to S404 have been described in detail in the above embodiments, and will not be repeated here.
S405、控制设备获取可移动平台的移动方向和图像传感器采集的实时数据。S405, the control device acquires the moving direction of the movable platform and the real-time data collected by the image sensor.
其中,控制设备在生成标记后的地图数据后,控制设备可以根据上一时刻生成的地图数据控制可移动平台在当前时刻在空间场景内移动。也可以在下一次再次进入空间场景时,根据所生成的地图数据控制可移动平台在空间场景内移动。Wherein, after the control device generates the marked map data, the control device can control the movable platform to move in the space scene at the current moment according to the map data generated at the previous moment. It is also possible to control the movable platform to move in the space scene according to the generated map data when entering the space scene again next time.
可移动平台在该空间场景内移动时,由行驶传感器采集可移动平台的移动方向,由图像传感器采集空间场景的实时数据。控制设备基于可移动平台的移动方向、实时数据以及地图数据实时控制可移动平台在空间场景内移动。When the movable platform moves in the space scene, the moving direction of the movable platform is collected by the driving sensor, and the real-time data of the space scene is collected by the image sensor. The control device controls the movable platform to move in the space scene in real time based on the moving direction, real-time data and map data of the movable platform.
S406、控制设备根据标记信息从标记后的地图数据中获取与移动方向匹 配的目标数据。S406, the control device acquires target data matching the moving direction from the marked map data according to the marking information.
其中,地图数据的标记信息用于反映地图数据的数据来源,也就是也可以确定采集地图数据所对应的图像数据的图像传感器。而图像传感器在可移动平台上的安装位置是固定的,进而可以根据标记信息确定采集该地图数据所对应的图像数据的图像传感器的位置信息。再结合可移动平台的移动方向和上述位置信息从地图数据中选择目标数据。The marker information of the map data is used to reflect the data source of the map data, that is, the image sensor that collects the image data corresponding to the map data can also be determined. The installation position of the image sensor on the movable platform is fixed, and then the position information of the image sensor that collects the image data corresponding to the map data can be determined according to the marker information. The target data is selected from the map data in combination with the moving direction of the movable platform and the above-mentioned position information.
更具体地,若可移动车辆移动方向为向前行走,从地图数据中获取来源于安装在可移动平台前方的图像传感器的地图数据作为目标数据。若可移动车辆的移动方向为向后行走,从地图数据中获取来源于安装在可移动平台后方的图像传感器的地图数据作为目标数据。More specifically, if the moving direction of the movable vehicle is forward walking, the map data derived from the image sensor installed in front of the movable platform is obtained from the map data as the target data. If the moving direction of the movable vehicle is to walk backward, the map data derived from the image sensor installed behind the movable platform is obtained from the map data as the target data.
S407、控制设备根据实时数据和目标数据生成控制指令。S407, the control device generates a control instruction according to the real-time data and the target data.
其中,图像传感器采集的实时数据也为图像数据,控制设备对图像数据进行特征提取等处理,并根据处理后的实时数据与地图数据确定可移动平台的位置信息,再根据位置信息和地图数据生成用于控制可移动平台移动的控制指令,以使可移动平台在控制指令的控制下在空间场景内移动。Among them, the real-time data collected by the image sensor is also image data, and the control device performs feature extraction on the image data, and determines the location information of the movable platform according to the processed real-time data and map data, and then generates a mobile platform based on the location information and map data. Control instructions for controlling the movement of the movable platform, so that the movable platform moves within the spatial scene under the control of the control instructions.
在根据处理后的实时数据与地图数据确定可移动平台的位置信息时,将处理后的实时数据与地图数据进行匹配获得匹配结果,并根据匹配成功的地图数据确定可移动平台的位置信息。When the location information of the movable platform is determined according to the processed real-time data and the map data, the processed real-time data and the map data are matched to obtain a matching result, and the location information of the movable platform is determined according to the successfully matched map data.
在另一实施例中,控制设备将实时数据与目标数据进行匹配获得匹配结果后,根据匹配结果设置目标数据的可靠值。In another embodiment, after the control device matches the real-time data with the target data to obtain a matching result, the reliability value of the target data is set according to the matching result.
更具体地,若匹配结果为匹配成功,设置目标数据的可靠值为第一可靠值,若匹配结果为匹配失败,设置目标数据的可靠值为第二可靠值。其中,第一可靠值大于第二可靠值。More specifically, if the matching result is that the matching is successful, the reliability value of the target data is set to the first reliability value, and if the matching result is that the matching fails, the reliability value of the target data is set to the second reliability value. Wherein, the first reliability value is greater than the second reliability value.
在获得目标数据的可靠值后,控制设备统计目标数据的可靠值获得可靠性统计结果,当可靠性统计结果满足低可靠性条件时,删除目标数据,以实现对地图数据的优化。After obtaining the reliable value of the target data, the control device counts the reliable value of the target data to obtain the reliability statistical result. When the reliability statistical result meets the low reliability condition, the target data is deleted to realize the optimization of the map data.
在对目标数据的可靠值进行统计时,若可靠性统计结果为可靠值均值,低可靠性条件为可靠值均值小于预设均值。When the reliability value of the target data is counted, if the reliability statistics result is the average value of the reliability value, the low reliability condition is that the average value of the reliability value is smaller than the preset average value.
在上述技术方案中,在控制设备生成带来源标记的地图数据后,根据标记信息从地图数据中选择目标数据,并根据目标数据和图像传感器采集到的 实时数据控制可移动平台移动,通过筛选获得目标数据,减少地图数据使用过程中的数据处理量,控制设备可以更快速生成控制指令,使可移动设备可靠地在空间场景内移动。另外,在控制设备将实时数据和目标数据进行匹配时,用匹配结果标记目标数据,实现对地图数据的优化。In the above technical solution, after the control device generates the map data with the source mark, selects target data from the map data according to the mark information, and controls the movable platform to move according to the target data and the real-time data collected by the image sensor, and obtains by screening Target data, reducing the amount of data processing during the use of map data, the control device can generate control instructions more quickly, so that the movable device can move in the space scene reliably. In addition, when the control device matches the real-time data with the target data, the target data is marked with the matching result, so as to realize the optimization of the map data.
下面以可移动平台为智能汽车为例说明本申请提供的数据处理方法,该方法的执行主体为智能汽车内的控制设备,例如:行车电脑,该方法具体包括如下步骤:The data processing method provided by the present application is described below by taking the movable platform as an intelligent car as an example. The execution subject of the method is a control device in the intelligent car, such as a trip computer, and the method specifically includes the following steps:
S501、当可移动平台在空间场景内移动时,控制设备控制位于可移动平台上的图像传感器采集空间场景的多帧图像数据。S501. When the movable platform moves in the space scene, the control device controls the image sensor located on the movable platform to collect multiple frames of image data of the space scene.
其中,智能汽车上设有单目相机,例如:如行车记录仪,以及安装在智能汽车车内四周的鱼眼相机,上述相机用于采集某一空间场景内的图像数据,例如:地下停车场。智能汽车上还安装有行驶传感器,例如:低精度惯性导航单元、里程计、GPS等。Among them, the smart car is equipped with a monocular camera, such as a driving recorder, and a fisheye camera installed around the smart car. The above cameras are used to collect image data in a certain spatial scene, such as an underground parking lot. . Smart cars are also equipped with driving sensors, such as low-precision inertial navigation units, odometers, GPS, etc.
本申请提供的数据处理方法无需智能汽车在新增传感器,使用上述传感器即可生成地图数据以及控制智能车辆行驶。The data processing method provided by the present application does not require the smart car to add new sensors, and the above sensors can be used to generate map data and control the driving of the smart vehicle.
S502、控制设备对多帧图像数据进行处理获得地图元数据。S502, the control device processes the multi-frame image data to obtain map metadata.
其中,控制设备在获取上述相机采集的多帧图像数据,以及上述行驶传感器采集的行驶数据后,对多帧图像数据进行处理获得地图元数据。具体包括如下步骤:Wherein, after acquiring the multi-frame image data collected by the camera and the driving data collected by the driving sensor, the control device processes the multi-frame image data to obtain map metadata. Specifically include the following steps:
S5001、利用上述行驶数据计算两帧图像数据之间帧间位姿。S5001 , using the above driving data to calculate an inter-frame pose between two frames of image data.
可以使用VIO、VO算法对图像数据和行驶传感器采集数据进行处理,估算出两帧图像数据之间帧间位姿。帧间位姿作为图像数据处理的基础。也可以使用行驶传感器估算帧间位姿,例如:对里程计和惯性测量单元采集数据进行积分获得帧间位姿。The VIO and VO algorithms can be used to process the image data and the data collected by the driving sensor to estimate the frame-to-frame pose between two frames of image data. The frame-to-frame pose serves as the basis for image data processing. It is also possible to estimate the frame-to-frame pose using driving sensors, such as integrating the data collected by the odometer and the inertial measurement unit to obtain the frame-to-frame pose.
S5002、提取多帧图像数据中的二维特征数据。S5002 , extracting two-dimensional feature data in the multi-frame image data.
其中,对单目相机采集的图像数据和鱼眼相机采集的图像数据进行特征提取。优选地,从图像数据中提取得到几何特征,例如:物体边缘、角点、平面、显著点、特殊纹理等。还提取图像数据中纹理数据、梯度数据、像素色彩等数据。这些特征信息具有时间稳定、角度稳定、尺度稳定等特性,可 以在不同的角度、不同距离、不同时间段都能稳定、一致的观测性。Among them, feature extraction is performed on the image data collected by the monocular camera and the image data collected by the fisheye camera. Preferably, geometric features are extracted from the image data, such as: object edges, corners, planes, salient points, special textures, and the like. It also extracts texture data, gradient data, pixel color and other data in the image data. These characteristic information have the characteristics of time stability, angle stability, scale stability, etc., and can be observed stably and consistently at different angles, distances, and time periods.
在对图像数据进行特征提取获得二维特征数据时,也利用提取到纹理数据、梯度数据、像素色彩等对二维特征数据进行编码,以便进行特征匹配以及构建地图词典。When the feature extraction is performed on the image data to obtain the two-dimensional feature data, the extracted texture data, gradient data, pixel color, etc. are also used to encode the two-dimensional feature data, so as to perform feature matching and build a map dictionary.
在特征提取的时候,需要对鱼眼相机采集的图像数据进行校正处理,例如:将鱼眼相机模型下图像数据转换为针孔相机模型下图像数据,针孔图像数据中二维特征数据和转换后的鱼眼图像数据中的二维特征数据可以融合,有利于进行特征匹配处理。During feature extraction, it is necessary to correct the image data collected by the fisheye camera, for example: convert the image data under the fisheye camera model to the image data under the pinhole camera model, and convert the two-dimensional feature data in the pinhole image data to the image data under the pinhole camera model. The two-dimensional feature data in the resulting fisheye image data can be fused, which is conducive to feature matching processing.
S5003、对帧间位姿和二维特征数据进行特征匹配。S5003. Perform feature matching on the pose between frames and the two-dimensional feature data.
其中,主要是对所提取的特征进行时序关联。常见的时序关联方法包括帧间关联、窗口关联、回环关联等算法。Among them, it is mainly to perform time series correlation on the extracted features. Common timing correlation methods include algorithms such as inter-frame correlation, window correlation, and loopback correlation.
帧间关联主要为对相邻两个图像,例如:采集时间间隔50毫秒的图像,或,所显示位置间隔20厘米的图像,通常帧间匹配会有较多的特征关联。The inter-frame correlation is mainly for two adjacent images, for example: an image with an acquisition time interval of 50 milliseconds, or an image with a displayed position interval of 20 cm. Usually, there will be more feature correlations in the inter-frame matching.
窗口关联主要是指对一段时间或距离范围内的所有特征进行关联,通过特征关联的数量统计,可以得到特征的稳定性、一致性等性能指标的测量。Window correlation mainly refers to correlating all features within a period of time or distance. Through the quantitative statistics of feature correlation, performance indicators such as feature stability and consistency can be measured.
例如,当一个二维特征数据在一个窗口内能关联到大量图像,例如:如30帧图像,该二维特征数据是高质量的二维特征数据,对时间和空间变化有较好的鲁棒性。For example, when a two-dimensional feature data can be associated with a large number of images within a window, such as 30 frames of images, the two-dimensional feature data is high-quality two-dimensional feature data, and has better robustness to temporal and spatial changes sex.
S5004、根据特征匹配结果计算出三维特征点。S5004. Calculate three-dimensional feature points according to the feature matching result.
其中,计算三维特征点是基于三点共面假设,利用两个不同位置的图像对相同物体的观测,并计算出物体的三维坐标。在本申请中,使用两帧图像数据的帧间位姿和匹配结果实现图像特征数据的三角化,得到三维特征点。Among them, the calculation of three-dimensional feature points is based on the three-point coplanarity assumption, using two images at different positions to observe the same object, and calculate the three-dimensional coordinates of the object. In the present application, the image feature data is triangulated by using the inter-frame poses and matching results of two frames of image data to obtain three-dimensional feature points.
S5005、提取多帧图像数据的语义信息。S5005. Extract the semantic information of the multi-frame image data.
其中,提取语义信息主要是将提取空间场景中具有明确类别的物体信息,例如:地面车道线、停车位、指示箭头等,空中防撞条、悬挂指示牌、大块墙壁、柱子等。语义信息通常是比较稳定的元素,通常只有在环境变化时,例如:停车场维修重建等,才会出现语义信息失效,可以准确反应空间场景。Among them, the extraction of semantic information is mainly to extract the information of objects with clear categories in the spatial scene, such as: ground lane lines, parking spaces, indicating arrows, etc., air collision bars, hanging signs, large walls, pillars, etc. Semantic information is usually a relatively stable element. Usually, only when the environment changes, such as parking lot maintenance and reconstruction, will the semantic information fail, which can accurately reflect the spatial scene.
S503、使用图像传感器的标识对地图元数据进行来源标记,获得标记后的地图元数据。S503. Use the identifier of the image sensor to mark the source of the map metadata, and obtain the marked map metadata.
其中,为了实现一次性构建地图数据,会使用位于不同方位的相机采集 图像数据,例如:使用单目相机和鱼眼相机采集图像数据。在采集图像数据时,根据对不同相机所采集图像数据的特征进行标记。在使用地图数据时,根据智能车辆的行驶方向筛选不同的地图元数据进行匹配。Among them, in order to realize the one-time construction of map data, image data will be collected by using cameras located in different directions, for example, using monocular cameras and fisheye cameras to collect image data. When collecting image data, marking is performed according to the characteristics of image data collected by different cameras. When using map data, filter different map metadata for matching according to the driving direction of the intelligent vehicle.
S504、若标记后的地图元数据满足建图质量要求,控制设备根据标记后的地图元数据生成标记后的地图数据。S504. If the marked map metadata meets the quality requirements for mapping, the control device generates marked map data according to the marked map metadata.
其中,通过对空间三维特征点的丰富性、语义信息的丰富程度以及纹理数据的丰富程度等信息,判断所获得的地图元数据是否符合建图质量要求。如果不符合要求,控制设备发出警示信息,以警示选择更合适的空间场景,更合适的时间段进行地图构建等。Among them, it is judged whether the obtained map metadata meets the requirements of mapping quality through information such as the richness of spatial three-dimensional feature points, the richness of semantic information, and the richness of texture data. If it does not meet the requirements, the control device will send out warning information to warn you to choose a more suitable space scene and build a map in a more suitable time period.
若所获得的地图元数据符合建图质量要求,则生成标记后的地图数据。更具体地,对标记后的三维特点和标记后的纹理数据进行图像化处理,获得标记后的特征数据图层。对标记后的语义信息进行图像化处理,获得标记后的语义信息图层。If the obtained map metadata meets the requirements of mapping quality, the marked map data is generated. More specifically, image processing is performed on the marked three-dimensional features and the marked texture data to obtain a marked feature data layer. The marked semantic information is imaged to obtain the marked semantic information layer.
在生成特征数据图层和语义信息图层后,可以基于个性化需求生成相应图层,以供智能汽车自动驾驶。例如:构建关键帧词典。After the feature data layer and the semantic information layer are generated, the corresponding layers can be generated based on individual requirements for the autonomous driving of smart cars. For example: building a dictionary of keyframes.
下面以生成停车场的地图数据为例说明生成地图数据的过程:智能车辆沿着图中路径行驶时,鱼眼相机和前视相机采集停车场内图像数据,将鱼眼相机采集的图像数据拼接成环视俯视图,再将环视俯视图和单目相机采集的图像拼接成地面图像,然后利用深度学习的方法实现停车位的识别和提取,主要包括车道线、停车位、地面指示箭头等信息,作为语义地图的重要组成部分。The process of generating map data is described below by taking the generation of map data of the parking lot as an example: when the intelligent vehicle drives along the path in the figure, the fisheye camera and the front-view camera collect the image data in the parking lot, and stitch the image data collected by the fisheye camera. A look-around top view is formed, and then the images collected by the look-up top view and the monocular camera are spliced into a ground image, and then the deep learning method is used to realize the identification and extraction of parking spaces, mainly including lane lines, parking spaces, ground indicating arrows and other information, as semantics important part of the map.
通过停车位的识别,可以实现地图中的有效车位、无效车位、专属车位、车位号码等信息的识别与存储,用于客户进行自动泊车时的交互选择。Through the identification of parking spaces, the identification and storage of valid parking spaces, invalid parking spaces, exclusive parking spaces, parking space numbers and other information in the map can be realized, which can be used for interactive selection of customers during automatic parking.
在停车位的检测结果中,可能会出现噪声、错误检测等,需要对多次观测的停车位进行位置、类型、尺寸等信息的融合滤波获得语义图层,再结合停车场的特征数据图层,最后形成停车场的地图数据。In the detection results of parking spaces, noise, false detection, etc. may appear. It is necessary to fuse and filter the location, type, size and other information of the parking spaces observed for multiple times to obtain a semantic layer, and then combine the feature data layer of the parking lot. , and finally form the map data of the parking lot.
在生成地图数据后,可以对地图数据进行优化。具体包括:三维点的位置、相机的位姿以及语义地图质量。After the map data is generated, the map data can be optimized. Specifically, it includes: the position of the 3D point, the pose of the camera, and the quality of the semantic map.
S505、控制设备获取可移动平台的移动方向和图像传感器采集的实时数据。S505, the control device acquires the moving direction of the movable platform and the real-time data collected by the image sensor.
其中,控制设备获取智能汽车的行驶方向,并获取鱼眼相机和单目相机采集的图像数据。Among them, the control device obtains the driving direction of the smart car, and obtains the image data collected by the fisheye camera and the monocular camera.
S506、根据所述标记信息从所述标记后的地图数据中获取与所述移动方向匹配的目标数据。S506. Acquire target data matching the moving direction from the marked map data according to the marked information.
其中,在智能车辆行驶方向向前行驶时,会利用单目相机采集图像数据对应的地图数据进行定位,在智能车辆向后行驶,会采用利用鱼眼相机采集图像数据对应的地图数据进行定位,以使智能车辆拥有更加快速和更加强大的定位能力。Among them, when the intelligent vehicle is driving forward, the monocular camera will be used to collect the map data corresponding to the image data for positioning, and when the intelligent vehicle is driving backward, the map data corresponding to the image data collected by the fisheye camera will be used for positioning. In order to make intelligent vehicles have faster and more powerful positioning capabilities.
S507、控制设备根据所述实时数据和所述目标数据生成控制指令。S507. The control device generates a control instruction according to the real-time data and the target data.
其中,控制设备将实时数据与地图数据进行匹配获得匹配结果,并根据匹配成功的地图数据确定智能车辆的位置信息,再根据智能车辆的位置信息控制智能车辆行驶。The control device matches the real-time data with the map data to obtain a matching result, determines the location information of the intelligent vehicle according to the successfully matched map data, and then controls the intelligent vehicle to drive according to the location information of the intelligent vehicle.
在上述技术方案中,在控制设备生成带来源标记的地图数据后,根据标记信息从地图数据中选择目标数据,并根据目标数据和图像传感器采集到的实时数据控制车辆行驶,减少地图数据使用过程中的数据处理量,以实现智能汽车的快速定位,进而更可靠地控制智能汽车行驶。In the above technical solution, after the control device generates the map data with the source mark, the target data is selected from the map data according to the mark information, and the driving of the vehicle is controlled according to the target data and the real-time data collected by the image sensor, so as to reduce the process of using the map data. In order to realize the rapid positioning of the smart car, and then control the driving of the smart car more reliably.
如图5所示,本申请另一实施例提供一种数据处理方法,该方法的执行主体为智能汽车内的控制设备,例如:行车电脑,该方法具体包括如下步骤:As shown in FIG. 5 , another embodiment of the present application provides a data processing method. The execution body of the method is a control device in a smart car, such as a trip computer. The method specifically includes the following steps:
S601、根据可移动平台上传感器采集的多帧数据获得帧间位姿。S601. Obtain an inter-frame pose according to multi-frame data collected by a sensor on a movable platform.
其中,可移动平台为智能汽车,在智能汽车前方安装有行车记录仪,该行车记录仪中针孔相机作为图像传感器。在智能汽车的四周安装有鱼眼相机,该鱼眼相机也作为图像传感器。针孔相机和鱼眼相机用于采集空间场景的图像数据。Among them, the movable platform is a smart car, a driving recorder is installed in front of the smart car, and a pinhole camera in the driving recorder is used as an image sensor. A fisheye camera is installed around the smart car, which also acts as an image sensor. Pinhole cameras and fisheye cameras are used to collect image data of spatial scenes.
智能汽车上还安装有里程计、惯性测量单元(Inertial Measurement Unit,简称:IMU)以及GPS,惯性测量单元、GPS以及里程计用于采集智能汽车的行驶数据,例如:加速度、速度、行驶里程、行驶位置等数据。The smart car is also equipped with an odometer, an Inertial Measurement Unit (IMU) and GPS. The inertial measurement unit, GPS and odometer are used to collect the driving data of the smart car, such as acceleration, speed, mileage, data such as driving location.
在使用针孔相机采集到多帧图像数据和使用里程计和IMU采集到智能汽车的行驶数据后,根据多帧图像数据、里程计以及IMU采集的行驶数据估算帧间位姿。After using the pinhole camera to collect multi-frame image data and using the odometer and IMU to collect the driving data of the smart car, the inter-frame pose is estimated based on the multi-frame image data, the odometer and the driving data collected by the IMU.
在估算帧间位姿时,可使用单目视觉惯性系统(visual inertial system简称:VINS)算法,也可以使用视觉惯性里程计(visual inertial odometer,简称:VIO)算法或者惯性里程计(visual odometer,简称:VO)算法,也可以基于轮速和IMU输出行驶数据的积分估算帧间位姿。When estimating the pose between frames, the monocular visual inertial system (VINS) algorithm, the visual inertial odometer (VIO) algorithm or the visual odometer (visual odometer, Abbreviation: VO) algorithm, which can also estimate the pose between frames based on the integration of wheel speed and IMU output driving data.
估算帧间位姿作为整个数据处理方法的重要的系统基础,其质量会直接影响后续地图数据优化步骤的计算耗时。Estimating the frame-to-frame pose is an important system basis for the entire data processing method, and its quality will directly affect the computational time-consuming of subsequent map data optimization steps.
S602、提取多帧图像数据中的二维特征数据。S602. Extract two-dimensional feature data in the multi-frame image data.
其中,该步骤主要是对针孔相机采集的图像数据以及鱼眼相机采集的图像数据进行特征提取获得特征数据。该特征数据为二维特征数据。Among them, this step is mainly to perform feature extraction on the image data collected by the pinhole camera and the image data collected by the fisheye camera to obtain feature data. The feature data is two-dimensional feature data.
二维特征数据包括几何特征数据,例如:物体边缘、角点、平面、显著点、特殊纹理等特征。这些特征信息具有时间稳定、角度稳定、尺度稳定等特性,在不同的角度、不同距离、不同时间段比较稳定,且保持一致的观测性。同时,对二维特征数据进行有效的表达,利用纹理数据、梯度数据、像素色彩数据等对二维特征数据进行编码,编码后的二维特征数据用于特征匹配和词典数据。Two-dimensional feature data includes geometric feature data, such as: object edges, corners, planes, salient points, special textures and other features. These characteristic information have the characteristics of time stability, angle stability, scale stability, etc., and are relatively stable at different angles, distances, and time periods, and maintain consistent observability. At the same time, the two-dimensional feature data is effectively expressed, using texture data, gradient data, pixel color data, etc. to encode the two-dimensional feature data, and the encoded two-dimensional feature data is used for feature matching and dictionary data.
在进行特征提取时,对鱼眼相机采集的图像数据进行校正处理,实现鱼眼相机模型下的图像数据被转换到针孔相机模型下,转换后,针孔相机采集的图像数据的二维特征数据和鱼眼相机采集的图像数据的二维特征数据可以融合,以实现稳定性和一致性更高的特征匹配。During feature extraction, the image data collected by the fisheye camera is corrected, so that the image data under the fisheye camera model is converted to the pinhole camera model. After conversion, the two-dimensional features of the image data collected by the pinhole camera The 2D feature data of the data and the image data collected by the fisheye camera can be fused to achieve feature matching with higher stability and consistency.
S603、对针孔相机采集的图像数据进行语义信息的提取。S603, extracting semantic information on the image data collected by the pinhole camera.
其中,语义信息处理部分主要是将空间场景的图像数据中具有明确含义的物体,如车道线、停车位、指示箭头等地面物体,防撞条、悬挂指示牌、大块墙壁、柱子等空间中的物体进行类别识别,获得语义信息。语义信息通常是比较稳定的元素,通常只有在大范围的环境变化,如停车场维修重建等的情况才会出现语义信息不可用的情况。Among them, the semantic information processing part is mainly to process the objects with clear meaning in the image data of the spatial scene, such as the ground objects such as lane lines, parking spaces, indicating arrows, anti-collision bars, hanging signs, large walls, pillars and other spaces in the space. Class recognition of objects is performed to obtain semantic information. Semantic information is usually a relatively stable element, and it is usually only in the case of large-scale environmental changes, such as parking lot maintenance and reconstruction, that semantic information is unavailable.
S604、将鱼眼相机采集图像拼接成环视俯视图。S604 , stitching the images collected by the fisheye camera into a look-around top view.
其中,具体拼接方式采用现有技术,此次不再赘述。在获得环视俯视图后,通过对环视俯视图进行语义信息提取,主要包括车道线、停车位、地面指示箭头等语义信息提取,可以作为语义信息图层的重要组成部分。The specific splicing method adopts the prior art, which will not be repeated here. After obtaining the look-around top view, the semantic information extraction of the look-up top view mainly includes the extraction of semantic information such as lane lines, parking spaces, ground indicating arrows, etc., which can be used as an important part of the semantic information layer.
通过对环视俯视图中停车位的识别,可以实现地图中的有效车位、无效 车位、专属车位、车位号码等信息的识别与存储,用于客户进行自动泊车时的交互选择。Through the identification of parking spaces in the top view, the identification and storage of valid parking spaces, invalid parking spaces, exclusive parking spaces, parking space numbers and other information in the map can be realized, which can be used for interactive selection of customers during automatic parking.
S605、基于S601中估算的帧间位姿对S602中提取的二维特征数据进行匹配。S605. Match the two-dimensional feature data extracted in S602 based on the poses between frames estimated in S601.
其中二维,特征数据的匹配主要是对所提取的二维特征数据进行时序关联。可以使用帧间关联、窗口关联、回环关联等算法进行匹配。Among them, the matching of two-dimensional and feature data is mainly to perform time series association on the extracted two-dimensional feature data. Algorithms such as inter-frame association, window association, and loopback association can be used for matching.
帧间关联主要为对相邻两个图像进行关联,例如,间隔50毫秒,或间隔20厘米,通常帧间匹配会有较多的特征关联。The inter-frame correlation is mainly to correlate two adjacent images, for example, the interval is 50 milliseconds, or the interval is 20 centimeters. Usually, there will be more feature correlations in the inter-frame matching.
窗口关联主要是对在一个固定或非固定的时间或固定或非固定距离范围内的所有特征进行关联,通过对特征关联的数量统计得到该特征的稳定性、一致性等性能指标值。例如,当一个二维特征数据在一个窗口内能关联到大量图像,如30帧图像,那表明该二维特征数据质量非常高,对时间和空间变化有较好的鲁棒性。Window correlation is mainly to correlate all features within a fixed or non-fixed time or within a fixed or non-fixed distance range, and obtain performance indicators such as the stability and consistency of the feature by counting the number of feature correlations. For example, when a two-dimensional feature data can be associated with a large number of images within a window, such as 30 frames of images, it indicates that the two-dimensional feature data is of very high quality and has good robustness to temporal and spatial changes.
回环匹配是指在同一个空间场景可能会采集多次数据。这种情况下,通过把二维特征数据关联到不同时段的图像数据,既能识别出当前位置是否已经构建地图,也可以对多次观测的空间场景的图像数据进行有效的融合和更新。Loopback matching means that data may be collected multiple times in the same spatial scene. In this case, by correlating the two-dimensional feature data with the image data of different time periods, it can not only identify whether a map has been constructed at the current location, but also effectively fuse and update the image data of the spatial scene observed multiple times.
S606、基于S602中提取二维的特征数据构建地图数据中的关键帧词典。S606, construct a dictionary of key frames in the map data based on the two-dimensional feature data extracted in S602.
其中,构建关键帧词典是指对空间场景的图像数据的特征进行聚类,利用多个二维特征数据的组合来表达当前场景。关键帧词典的构造可以采用二维特征数据之外,也可以使用语义信息、深度学习描述子等等。Wherein, constructing the key frame dictionary refers to clustering the features of the image data of the spatial scene, and expressing the current scene by using a combination of multiple two-dimensional feature data. The construction of the key frame dictionary can use not only two-dimensional feature data, but also semantic information, deep learning descriptors, and so on.
词典的表达的作用包括:第一,对场景的丰富程度的表达。如果一个空间场景的地图数据中关键帧词典比较丰富,表示这个空间场景的地图数据具有丰富的纹理数据、几何特征、语义信息等。当空间场景的地图数据中关键帧词典丰富程度较低,表示该空间场景的地图数据质量差,用户无法使用该地图对智能车辆控制,例如:自动泊车,对用户的预期管理有促进作用。第二,关键帧词典可以用于泊车重定位时的位置识别。在重定位初始化时,车辆需要找到当前在地图中的位置。通过关键帧词典的匹配,快速找到车辆当前在地图中的大概位置,再通过语义信息和二维特征数据的匹配,就可以实现精确的位置估计。The functions of the expression of the dictionary include: First, the expression of the richness of the scene. If the keyframe dictionary in the map data of a spatial scene is rich, the map data representing the spatial scene has rich texture data, geometric features, semantic information, etc. When the richness of the key frame dictionary in the map data of the spatial scene is low, it means that the quality of the map data of the spatial scene is poor, and the user cannot use the map to control the intelligent vehicle, such as automatic parking, which promotes the user's expected management. Second, the keyframe dictionary can be used for position recognition during parking relocation. During relocation initialization, the vehicle needs to find its current location on the map. Through the matching of the key frame dictionary, the approximate current position of the vehicle in the map can be quickly found, and then accurate position estimation can be achieved through the matching of semantic information and two-dimensional feature data.
S607、提取环视俯视图和针孔相机采集的图像数据中停车位信息。S607 , extracting parking space information from the image data collected by the look-ahead top view and the pinhole camera.
其中,环视鱼眼采集的图像数据的拼接而成的环视俯视图和针孔相机采集的图像数据可以反映远距离的完整地面图像。使用深度学习的方法实现停车位的识别和提取,深度学习方法此处不再赘述。Among them, the splicing of the image data collected by looking around the fisheye and the image data collected by the pinhole camera can reflect the complete ground image at a long distance. The deep learning method is used to realize the identification and extraction of parking spaces, and the deep learning method will not be repeated here.
在停车位的识别结果中可能会出现噪声、错误识别等问题,需要对多次采集的图像数据进行停车位的位置、类型、尺寸等信息的融合滤波,得出多个连续稳定高质量的停车位图层。In the recognition results of parking spaces, problems such as noise and misrecognition may occur. It is necessary to fuse and filter the location, type, size and other information of parking spaces on the image data collected multiple times to obtain multiple continuous, stable and high-quality parking spaces. bit layer.
S608、根据特征匹配结果获得三维特征点。S608. Obtain three-dimensional feature points according to the feature matching result.
其中,根据特征匹配结果获得三维特征点是实现特征数据的三角化,也就是将二维特征数据转换为三维特征数据。Wherein, obtaining three-dimensional feature points according to the feature matching result is to realize triangulation of feature data, that is, to convert two-dimensional feature data into three-dimensional feature data.
三角化步骤主要是基于三点共面假设,利用对相同物体在两个不同位置观测获得的图像计算出物体的三维坐标,传统的SFM技术需要同时估计相机之间的位姿和点云的三维坐标,本申请中在估算出帧间位姿后,基于帧间位姿对二维特征数据进行匹配,再基于匹配结果快速的实现特征数据的三角化。The triangulation step is mainly based on the three-point coplanarity assumption, and the three-dimensional coordinates of the object are calculated by using the images obtained by observing the same object at two different positions. The traditional SFM technology needs to estimate the pose between the cameras and the three-dimensional point cloud at the same time. In this application, after the inter-frame pose is estimated, the two-dimensional feature data is matched based on the inter-frame pose, and then the feature data is triangulated quickly based on the matching result.
S609、判断地图元数据是否满足建图质量要求。S609. Determine whether the map metadata meets the quality requirements for mapping.
其中,地图元数据是指在上述步骤中获得的三维特征点、语义信息以及纹理数据。通过对三维特征点的数量、三维特征点在空间的丰富性、语义特征的丰富程度、场景纹理的丰富程度、环境照度等信息,综合判断基于当前所获得地图元数据构建的地图数据的构建质量是否符合建图质量要求。具体建图质量要求已经在上述实施例中详细说明,此次不再赘述。The map metadata refers to the three-dimensional feature points, semantic information and texture data obtained in the above steps. Through the number of 3D feature points, the richness of 3D feature points in space, the richness of semantic features, the richness of scene texture, the environmental illumination and other information, comprehensively judge the construction quality of map data constructed based on the currently obtained map metadata. Whether it meets the quality requirements of the drawing. The specific mapping quality requirements have been described in detail in the foregoing embodiments, and will not be repeated this time.
如果不符合建图质量要求,控制设备会发出警示信息,以警示用户更换空间场景、更好的天气、更好时间点进行地图构建。If it does not meet the quality requirements of the map, the control device will issue a warning message to warn the user to change the spatial scene, better weather, and better time to build the map.
S610、根据地图元数据生成地图数据,并对地图数据进行优化。S610. Generate map data according to the map metadata, and optimize the map data.
其中,在满足建图质量要求时,基于地图元数据构建的地图数据用于自动泊车控制时可以满足自动泊车需求,则会对地图数据进行统一的质量优化。优化内容包括:三维特征点的位置、相机的位姿、以及语义图层的质量。例如:语义元素的位置、角度、大小、类别等等优化。Among them, when the quality requirements for mapping are met, the map data constructed based on the map metadata can meet the needs of automatic parking when it is used for automatic parking control, and the quality of the map data will be optimized uniformly. The optimization contents include: the position of 3D feature points, the pose of the camera, and the quality of the semantic layer. For example: optimization of the position, angle, size, category, etc. of semantic elements.
S611、将地图数据按照不同图层进行存储。S611. Store the map data according to different layers.
其中,当地图数据构建完成后,将地图数据进行不同图层的存储,例如:特征数据图层、语义信息图层、导航数据图层、关键帧词典图层等。不同图 层可以用于为自动泊车定位功能提供数据。Among them, after the map data is constructed, the map data is stored in different layers, such as: feature data layer, semantic information layer, navigation data layer, key frame dictionary layer, etc. Different layers can be used to provide data for the automatic parking location function.
此外,为了实现一次性构建地图数据,会对不同相机采集的图像数据的地图元数据进行编码,在不同定位阶段会采用不同相机采集的图像数据的地图元数据进行匹配。例如,在车辆前进时,利用前视相机采集的图像数据的地图元数据做匹配和定位,在车辆后退时,采用后视鱼眼相机采集的图像数据的地图元数据进行匹配和定位,实现更加快速和强大的地图构建能力。In addition, in order to realize the one-time construction of map data, the map metadata of the image data collected by different cameras will be encoded, and the map metadata of the image data collected by different cameras will be used for matching in different positioning stages. For example, when the vehicle is moving forward, the map metadata of the image data collected by the front-view camera is used for matching and positioning, and when the vehicle is moving backward, the map metadata of the image data collected by the rear-view fisheye camera is used for matching and positioning to achieve more Fast and powerful map building capabilities.
在使用地图数据进行自动泊车时,控制设备将匹配上的地图元数据增加权重,而没有匹配上的地图元数据减少权重,并将一定数量的高质量地图元数据添加到地图中,实现地图地图元数据的及时更新,保证地图的长久时效性和高质量。When using map data for automatic parking, the control device will increase the weight of the matched map metadata, while the unmatched map metadata will reduce the weight, and add a certain amount of high-quality map metadata to the map to realize the map. The timely update of map metadata ensures the long-term timeliness and high quality of the map.
在上述技术方案中,构建纯视觉的轻量级地图数据,避免复杂且需人工干预的高精地图构建,所构建地图数据具有极低的硬件成本要求,且能快速实现高质量的地图数据构建。经大量测试,所构建地图数据能有效用于自动泊车过程中的定位问题,通过大量学习实现专属车位选择、选定任意车位泊车、停车场巡航泊车等功能,能很好的辅助人机交互,并为代客泊车、记忆泊车等功能提供地图导航、实时路径规划等提供帮助。In the above technical solution, pure visual lightweight map data is constructed, and complex and manual intervention of high-precision map construction is avoided. The constructed map data has extremely low hardware cost requirements, and can quickly realize high-quality map data construction. . After a lot of tests, the constructed map data can be effectively used for positioning problems in the process of automatic parking. Through a lot of learning, it can realize functions such as exclusive parking space selection, selecting any parking space for parking, parking lot cruise parking and other functions, which can help people very well. It also provides map navigation and real-time path planning for functions such as valet parking and memory parking.
如图6所示,本申请提供一种控制设备700,该控制设备700具体包括:用于存储指令的存储器701和用于执行存储在存储器中指令的处理器702,处理器702用于具体执行:As shown in FIG. 6, the present application provides a control device 700, the control device 700 specifically includes: a memory 701 for storing instructions and a processor 702 for executing the instructions stored in the memory, and the processor 702 is used for specific execution :
当可移动平台在空间场景内移动时,控制位于可移动平台上的图像传感器采集空间场景的多帧图像数据;When the movable platform moves in the space scene, control the image sensor located on the movable platform to collect multiple frames of image data of the space scene;
对多帧图像数据进行处理获得地图元数据;其中,地图元数据包括三维特征点、纹理数据以及语义信息中任意一种或多种组合;Process the multi-frame image data to obtain map metadata; wherein, the map metadata includes any one or a combination of three-dimensional feature points, texture data and semantic information;
判断所述地图元数据是否满足建图质量要求;Judging whether the map metadata meets the quality requirements for mapping;
若地图元数据满足建图质量要求,根据地图元数据生成地图数据;其中,地图数据用于控制可移动平台在空间场景内移动。If the map metadata meets the mapping quality requirements, map data is generated according to the map metadata; wherein the map data is used to control the movable platform to move within the spatial scene.
可选地,建图质量要求包括以下至少一项:Optionally, the mapping quality requirements include at least one of the following:
三维特征点的总数量达到第一数量阈值;The total number of three-dimensional feature points reaches the first number threshold;
至少存在两个在三个坐标轴上的分量均不相同的三维特征点;There are at least two three-dimensional feature points with different components on the three coordinate axes;
纹理数据的总数量达到第二数量阈值;The total amount of texture data reaches the second amount threshold;
纹理数据的种类数量达到第三数量阈值;The number of types of texture data reaches the third number threshold;
语义信息的总数量达到第四数量阈值;The total quantity of semantic information reaches the fourth quantity threshold;
语义信息的种类数量达到第五数量阈值。The number of kinds of semantic information reaches the fifth number threshold.
可选地,处理器702用于具体执行:Optionally, the processor 702 is configured to specifically execute:
使用图像传感器的标识对地图元数据进行来源标记,获得标记后的地图元数据;Use the identity of the image sensor to mark the source of the map metadata, and obtain the marked map metadata;
根据地图元数据生成地图数据,具体包括:Generate map data based on map metadata, including:
根据标记后的地图元数据生成标记后的地图数据;Generate marked map data according to the marked map metadata;
其中,标记后的地图数据包括用于指示数据来源的标记信息。Wherein, the marked map data includes marking information used to indicate the source of the data.
可选地,处理器702用于具体执行以下至少一项:Optionally, the processor 702 is configured to specifically execute at least one of the following:
根据标记后的三维特征点和标记后的纹理数据,生成标记后的特征数据图层;According to the marked 3D feature points and the marked texture data, the marked feature data layer is generated;
根据标记后的语义信息生成标记后的语义信息图层。Generate a marked semantic information layer according to the marked semantic information.
可选地,处理器702用于具体执行:Optionally, the processor 702 is configured to specifically execute:
获取可移动平台的移动方向和图像传感器采集的实时数据;Obtain the moving direction of the movable platform and real-time data collected by the image sensor;
根据标记信息从标记后的地图数据中获取与移动方向匹配的目标数据;Obtain target data matching the moving direction from the marked map data according to the marked information;
根据实时数据和目标数据生成控制指令;其中,控制指令用于控制可移动平台移动。Generate control instructions according to real-time data and target data; wherein, the control instructions are used to control the movement of the movable platform.
可选地,处理器702用于具体执行:Optionally, the processor 702 is configured to specifically execute:
若移动方向为向前移动,从地图数据中获取来源于安装在可移动平台前方的图像传感器的地图数据作为目标数据;If the movement direction is forward movement, the map data from the image sensor installed in front of the movable platform is obtained from the map data as the target data;
若移动方向为向后移动,从地图数据中获取来源于安装在可移动平台后方的图像传感器的地图数据作为目标数据。If the moving direction is backward movement, the map data derived from the image sensor installed behind the movable platform is obtained from the map data as the target data.
可选地,处理器702用于具体执行:Optionally, the processor 702 is configured to specifically execute:
将实时数据和目标数据进行匹配获得匹配结果;Match real-time data and target data to obtain matching results;
根据匹配成功的目标数据获得可移动平台的位置信息;Obtain the position information of the movable platform according to the successfully matched target data;
根据可移动平台的位置信息生成控制指令。Control commands are generated according to the position information of the movable platform.
可选地,处理器702用于具体执行:Optionally, the processor 702 is configured to specifically execute:
根据匹配结果设置目标数据的可靠值。Set the reliable value of the target data according to the matching result.
可选地,处理器702用于具体执行:Optionally, the processor 702 is configured to specifically execute:
若匹配结果为匹配成功,设置目标数据的可靠值为第一可靠值;If the matching result is that the matching is successful, set the reliable value of the target data as the first reliable value;
若匹配结果为匹配失败,设置目标数据的可靠值为第二可靠值;If the matching result is that the matching fails, set the reliable value of the target data to the second reliable value;
其中,第一可靠值大于第二可靠值。Wherein, the first reliability value is greater than the second reliability value.
可选地,处理器702用于具体执行:Optionally, the processor 702 is configured to specifically execute:
统计目标数据的可靠值,获得可靠性统计结果;Count the reliable values of the target data and obtain the reliability statistical results;
当可靠性统计结果满足低可靠性条件时,删除目标数据。When the reliability statistics result meets the low reliability condition, delete the target data.
可选地,图像传感器包括单目相机和/或鱼眼相机。Optionally, the image sensor includes a monocular camera and/or a fisheye camera.
本领域普通技术人员可以理解:实现上述方法实施例的全部或部分步骤可以通过程序指令相关的硬件来完成,前述的程序可以存储于一计算机可读取存储介质中,该程序在执行时,执行包括上述方法实施例的步骤;而前述的存储介质包括:只读内存(Read-Only Memory,ROM)、随机存取存储器(Random Access Memory,RAM)、磁碟或者光盘等各种可以存储程序代码的介质。Those of ordinary skill in the art can understand that all or part of the steps of implementing the above method embodiments can be completed by program instructions related to hardware, the aforementioned program can be stored in a computer-readable storage medium, and when the program is executed, execute Including the steps of the above-mentioned method embodiments; and the aforementioned storage medium includes: read-only memory (Read-Only Memory, ROM), random access memory (Random Access Memory, RAM), magnetic disks or optical disks and other various programs that can store program codes medium.
最后应说明的是:以上各实施例仅用以说明本申请的技术方案,而非对其限制;尽管参照前述各实施例对本申请进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分或者全部技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本申请各实施例技术方案的范围。Finally, it should be noted that the above embodiments are only used to illustrate the technical solutions of the present application, but not to limit them; although the present application has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that: The technical solutions described in the foregoing embodiments can still be modified, or some or all of the technical features thereof can be equivalently replaced; and these modifications or replacements do not make the essence of the corresponding technical solutions deviate from the technical solutions of the embodiments of the present application. scope.

Claims (23)

  1. 一种数据处理方法,其特征在于,包括:A data processing method, comprising:
    当可移动平台在空间场景内移动时,控制位于所述可移动平台上的图像传感器采集所述空间场景的多帧图像数据;When the movable platform moves in the space scene, controlling the image sensor located on the movable platform to collect multiple frames of image data of the space scene;
    对所述多帧图像数据进行处理获得地图元数据;其中,所述地图元数据包括三维特征点、纹理数据以及语义信息中任意一种或多种组合;Process the multi-frame image data to obtain map metadata; wherein, the map metadata includes any one or a combination of three-dimensional feature points, texture data and semantic information;
    判断所述地图元数据是否满足建图质量要求;Judging whether the map metadata meets the quality requirements for mapping;
    若所述地图元数据满足所述建图质量要求,根据所述地图元数据生成地图数据;其中,所述地图数据用于控制所述可移动平台在所述空间场景内移动。If the map metadata meets the mapping quality requirement, map data is generated according to the map metadata; wherein the map data is used to control the movable platform to move within the spatial scene.
  2. 根据权利要求1所述的方法,其特征在于,所述建图质量要求包括以下至少一项:The method according to claim 1, wherein the mapping quality requirements include at least one of the following:
    所述三维特征点的总数量达到第一数量阈值;The total number of the three-dimensional feature points reaches a first number threshold;
    至少存在两个在三个坐标轴上的分量均不相同的三维特征点;There are at least two three-dimensional feature points with different components on the three coordinate axes;
    所述纹理数据的总数量达到第二数量阈值;The total quantity of the texture data reaches a second quantity threshold;
    所述纹理数据的种类数量达到第三数量阈值;The number of types of the texture data reaches a third number threshold;
    所述语义信息的总数量达到第四数量阈值;The total quantity of the semantic information reaches a fourth quantity threshold;
    所述语义信息的种类数量达到第五数量阈值。The number of kinds of the semantic information reaches a fifth number threshold.
  3. 根据权利要求1或2所述的方法,其特征在于,在对所述多帧图像数据进行处理获得地图元数据之后,所述方法还包括:The method according to claim 1 or 2, wherein after the multi-frame image data is processed to obtain map metadata, the method further comprises:
    使用所述图像传感器的标识对所述地图元数据进行来源标记,获得所述标记后的地图元数据;Using the identifier of the image sensor to mark the source of the map metadata to obtain the marked map metadata;
    根据所述地图元数据生成地图数据,具体包括:Generate map data according to the map metadata, which specifically includes:
    根据所述标记后的地图元数据生成标记后的地图数据;generating marked map data according to the marked map metadata;
    其中,所述标记后的地图数据包括用于指示数据来源的标记信息。Wherein, the marked map data includes marked information for indicating the source of the data.
  4. 根据权利要求3所述的方法,其特征在于,根据所述标记后的地图元数据生成所述地图数据,包括以下至少一项:The method according to claim 3, wherein generating the map data according to the marked map metadata includes at least one of the following:
    根据标记后的三维特征点和标记后的纹理数据,生成标记后的特征数据图层;According to the marked 3D feature points and the marked texture data, the marked feature data layer is generated;
    根据标记后的语义信息生成标记后的语义信息图层。Generate a marked semantic information layer according to the marked semantic information.
  5. 根据权利要求3或4所述的方法,其特征在于,在根据所述标记后的地图元数据生成标记后的地图数据之后,所述方法还包括:The method according to claim 3 or 4, wherein after generating the marked map data according to the marked map metadata, the method further comprises:
    获取所述可移动平台的移动方向和所述图像传感器采集的实时数据;acquiring the moving direction of the movable platform and real-time data collected by the image sensor;
    根据所述标记信息从所述标记后的地图数据中获取与所述移动方向匹配的目标数据;Obtain target data matching the moving direction from the marked map data according to the marked information;
    根据所述实时数据和所述目标数据生成控制指令;其中,所述控制指令用于控制所述可移动平台移动。A control instruction is generated according to the real-time data and the target data; wherein, the control instruction is used to control the movement of the movable platform.
  6. 根据权利要求5所述的方法,其特征在于,根据所述标记信息从所述标记后的地图数据中获取与所述移动方向匹配的目标数据,具体包括:The method according to claim 5, wherein acquiring target data matching the moving direction from the marked map data according to the marked information specifically includes:
    若所述移动方向为向前移动,从所述地图数据中获取来源于安装在所述可移动平台前方的图像传感器的地图数据作为所述目标数据;If the moving direction is forward movement, obtain map data from an image sensor installed in front of the movable platform as the target data from the map data;
    若所述移动方向为向后移动,从所述地图数据中获取来源于安装在所述可移动平台后方的图像传感器的地图数据作为所述目标数据。If the moving direction is backward movement, map data derived from an image sensor installed behind the movable platform is obtained from the map data as the target data.
  7. 根据权利要求5所述的方法,其特征在于,根据所述实时数据和所述目标数据生成控制指令,具体包括:The method according to claim 5, wherein generating a control instruction according to the real-time data and the target data specifically includes:
    将所述实时数据和所述目标数据进行匹配获得匹配结果;Matching the real-time data and the target data to obtain a matching result;
    根据匹配成功的目标数据获得所述可移动平台的位置信息;Obtain the position information of the movable platform according to the successfully matched target data;
    根据所述可移动平台的位置信息生成所述控制指令。The control instruction is generated according to the position information of the movable platform.
  8. 根据权利要求7所述的方法,其特征在于,在将所述实时数据和所述目标数据进行匹配获得匹配结果之后,所述方法还包括:The method according to claim 7, wherein after matching the real-time data and the target data to obtain a matching result, the method further comprises:
    根据所述匹配结果设置所述目标数据的可靠值。The reliability value of the target data is set according to the matching result.
  9. 根据权利要求8所述的方法,其特征在于,根据所述匹配结果设置所述目标数据的可靠值,具体包括:The method according to claim 8, wherein setting the reliability value of the target data according to the matching result specifically includes:
    若所述匹配结果为匹配成功,设置所述目标数据的可靠值为第一可靠值;If the matching result is that the matching is successful, set the reliable value of the target data as the first reliable value;
    若所述匹配结果为匹配失败,设置所述目标数据的可靠值为第二可靠值;If the matching result is a matching failure, setting the reliable value of the target data to a second reliable value;
    其中,所述第一可靠值大于所述第二可靠值。Wherein, the first reliability value is greater than the second reliability value.
  10. 根据权利要求8所述的方法,其特征在于,在根据所述匹配结果设置所述目标数据的可靠值之后,所述方法还包括:The method according to claim 8, wherein after setting the reliability value of the target data according to the matching result, the method further comprises:
    统计所述目标数据的可靠值,获得可靠性统计结果;Count the reliability values of the target data to obtain reliability statistics results;
    当所述可靠性统计结果满足低可靠性条件时,删除所述目标数据。When the reliability statistics result satisfies the low reliability condition, the target data is deleted.
  11. 根据权利要求1至10中任意一项所述的方法,其特征在于,所述图像传感器包括单目相机和/或鱼眼相机。The method according to any one of claims 1 to 10, wherein the image sensor comprises a monocular camera and/or a fisheye camera.
  12. 一种控制设备,其特征在于,包括:用于存储指令的存储器和用于执行存储在存储器中指令的处理器,所述处理器用于具体执行:A control device, characterized in that it comprises: a memory for storing instructions and a processor for executing instructions stored in the memory, wherein the processor is used to specifically execute:
    当可移动平台在空间场景内移动时,控制位于所述可移动平台上的图像传感器采集所述空间场景的多帧图像数据;When the movable platform moves in the space scene, controlling the image sensor located on the movable platform to collect multiple frames of image data of the space scene;
    对所述多帧图像数据进行处理获得地图元数据;其中,所述地图元数据包括三维特征点、纹理数据以及语义信息中任意一种或多种组合;Process the multi-frame image data to obtain map metadata; wherein, the map metadata includes any one or a combination of three-dimensional feature points, texture data and semantic information;
    判断所述地图元数据是否满足建图质量要求;Judging whether the map metadata meets the quality requirements for mapping;
    若所述地图元数据满足所述建图质量要求,根据所述地图元数据生成地图数据;其中,所述地图数据用于控制所述可移动平台在所述空间场景内移动。If the map metadata meets the mapping quality requirement, map data is generated according to the map metadata; wherein the map data is used to control the movable platform to move within the spatial scene.
  13. 根据权利要求12所述的控制设备,其特征在于,所述建图质量要求包括以下至少一项:The control device according to claim 12, wherein the mapping quality requirements include at least one of the following:
    所述三维特征点的总数量达到第一数量阈值;The total number of the three-dimensional feature points reaches a first number threshold;
    至少存在两个在三个坐标轴上的分量均不相同的三维特征点;There are at least two three-dimensional feature points with different components on the three coordinate axes;
    所述纹理数据的总数量达到第二数量阈值;The total quantity of the texture data reaches a second quantity threshold;
    所述纹理数据的种类数量达到第三数量阈值;The number of types of the texture data reaches a third number threshold;
    所述语义信息的总数量达到第四数量阈值;The total quantity of the semantic information reaches a fourth quantity threshold;
    所述语义信息的种类数量达到第五数量阈值。The number of kinds of the semantic information reaches a fifth number threshold.
  14. 根据权利要求12或13所述的控制设备,其特征在于,所述处理器用于具体执行:The control device according to claim 12 or 13, wherein the processor is configured to specifically execute:
    使用所述图像传感器的标识对所述地图元数据进行来源标记,获得所述标记后的地图元数据;Using the identifier of the image sensor to mark the source of the map metadata to obtain the marked map metadata;
    根据所述地图元数据生成地图数据,具体包括:Generate map data according to the map metadata, which specifically includes:
    根据所述标记后的地图元数据生成标记后的地图数据;generating marked map data according to the marked map metadata;
    其中,所述标记后的地图数据包括用于指示数据来源的标记信息。Wherein, the marked map data includes marked information for indicating the source of the data.
  15. 根据权利要求14所述的控制设备,其特征在于,所述处理器用于具体执行以下至少一项:The control device according to claim 14, wherein the processor is configured to specifically execute at least one of the following:
    根据标记后的三维特征点和标记后的纹理数据,生成标记后的特征数据 图层;According to the marked three-dimensional feature points and the marked texture data, the marked feature data layer is generated;
    根据标记后的语义信息生成标记后的语义信息图层。Generate a marked semantic information layer according to the marked semantic information.
  16. 根据权利要求14或15所述的控制设备,其特征在于,所述处理器用于具体执行:The control device according to claim 14 or 15, wherein the processor is configured to specifically execute:
    获取所述可移动平台的移动方向和所述图像传感器采集的实时数据;acquiring the moving direction of the movable platform and real-time data collected by the image sensor;
    根据所述标记信息从所述标记后的地图数据中获取与所述移动方向匹配的目标数据;Obtain target data matching the moving direction from the marked map data according to the marked information;
    根据所述实时数据和所述目标数据生成控制指令;其中,所述控制指令用于控制所述可移动平台移动。A control instruction is generated according to the real-time data and the target data; wherein, the control instruction is used to control the movement of the movable platform.
  17. 根据权利要求16所述的控制设备,其特征在于,所述处理器用于具体执行:The control device according to claim 16, wherein the processor is configured to specifically execute:
    若所述移动方向为向前移动,从所述地图数据中获取来源于安装在所述可移动平台前方的图像传感器的地图数据作为所述目标数据;If the moving direction is forward movement, obtain map data from an image sensor installed in front of the movable platform as the target data from the map data;
    若所述移动方向为向后移动,从所述地图数据中获取来源于安装在所述可移动平台后方的图像传感器的地图数据作为所述目标数据。If the moving direction is backward movement, map data derived from an image sensor installed behind the movable platform is obtained from the map data as the target data.
  18. 根据权利要求16所述的控制设备,其特征在于,所述处理器用于具体执行:The control device according to claim 16, wherein the processor is configured to specifically execute:
    将所述实时数据和所述目标数据进行匹配获得匹配结果;Matching the real-time data and the target data to obtain a matching result;
    根据匹配成功的目标数据获得所述可移动平台的位置信息;Obtain the position information of the movable platform according to the successfully matched target data;
    根据所述可移动平台的位置信息生成控制指令。Control commands are generated according to the position information of the movable platform.
  19. 根据权利要求18所述的控制设备,其特征在于,所述处理器用于具体执行:The control device according to claim 18, wherein the processor is configured to specifically execute:
    根据所述匹配结果设置所述目标数据的可靠值。The reliability value of the target data is set according to the matching result.
  20. 根据权利要求19所述的控制设备,其特征在于,所述处理器用于具体执行:The control device according to claim 19, wherein the processor is configured to specifically execute:
    若所述匹配结果为匹配成功,设置所述目标数据的可靠值为第一可靠值;If the matching result is that the matching is successful, set the reliable value of the target data as the first reliable value;
    若所述匹配结果为匹配失败,设置所述目标数据的可靠值为第二可靠值;If the matching result is a matching failure, setting the reliable value of the target data to a second reliable value;
    其中,所述第一可靠值大于所述第二可靠值。Wherein, the first reliability value is greater than the second reliability value.
  21. 根据权利要求19所述的控制设备,其特征在于,所述处理器用于具体执行:The control device according to claim 19, wherein the processor is configured to specifically execute:
    统计所述目标数据的可靠值,获得可靠性统计结果;Count the reliability values of the target data to obtain reliability statistics results;
    当所述可靠性统计结果满足低可靠性条件时,删除所述目标数据。When the reliability statistics result satisfies the low reliability condition, the target data is deleted.
  22. 根据权利要求12至21中任意一项所述的控制设备,其特征在于,所述图像传感器包括单目相机和/或鱼眼相机。The control device according to any one of claims 12 to 21, wherein the image sensor comprises a monocular camera and/or a fisheye camera.
  23. 一种可移动平台,其特征在于,包括图像传感器以及如权利要求12至22中任意一项所述的数据处理方法。A movable platform is characterized by comprising an image sensor and the data processing method according to any one of claims 12 to 22.
PCT/CN2021/081387 2021-03-17 2021-03-17 Data processing method and device WO2022193193A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
PCT/CN2021/081387 WO2022193193A1 (en) 2021-03-17 2021-03-17 Data processing method and device
CN202180079742.6A CN116762094A (en) 2021-03-17 2021-03-17 Data processing method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2021/081387 WO2022193193A1 (en) 2021-03-17 2021-03-17 Data processing method and device

Publications (1)

Publication Number Publication Date
WO2022193193A1 true WO2022193193A1 (en) 2022-09-22

Family

ID=83321621

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/081387 WO2022193193A1 (en) 2021-03-17 2021-03-17 Data processing method and device

Country Status (2)

Country Link
CN (1) CN116762094A (en)
WO (1) WO2022193193A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117194704A (en) * 2023-11-07 2023-12-08 航天宏图信息技术股份有限公司 Method, device and equipment for inquiring attribute of component-level live-action three-dimensional model

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111325843A (en) * 2020-03-09 2020-06-23 北京航空航天大学 Real-time semantic map construction method based on semantic inverse depth filtering
CN111368759A (en) * 2020-03-09 2020-07-03 河海大学常州校区 Monocular vision-based semantic map construction system for mobile robot
US10794710B1 (en) * 2017-09-08 2020-10-06 Perceptin Shenzhen Limited High-precision multi-layer visual and semantic map by autonomous units
WO2021018690A1 (en) * 2019-07-31 2021-02-04 Continental Automotive Gmbh Method for determining an environmental model of a scene
CN112348921A (en) * 2020-11-05 2021-02-09 上海汽车集团股份有限公司 Mapping method and system based on visual semantic point cloud

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10794710B1 (en) * 2017-09-08 2020-10-06 Perceptin Shenzhen Limited High-precision multi-layer visual and semantic map by autonomous units
WO2021018690A1 (en) * 2019-07-31 2021-02-04 Continental Automotive Gmbh Method for determining an environmental model of a scene
CN111325843A (en) * 2020-03-09 2020-06-23 北京航空航天大学 Real-time semantic map construction method based on semantic inverse depth filtering
CN111368759A (en) * 2020-03-09 2020-07-03 河海大学常州校区 Monocular vision-based semantic map construction system for mobile robot
CN112348921A (en) * 2020-11-05 2021-02-09 上海汽车集团股份有限公司 Mapping method and system based on visual semantic point cloud

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117194704A (en) * 2023-11-07 2023-12-08 航天宏图信息技术股份有限公司 Method, device and equipment for inquiring attribute of component-level live-action three-dimensional model
CN117194704B (en) * 2023-11-07 2024-02-06 航天宏图信息技术股份有限公司 Method, device and equipment for inquiring attribute of component-level live-action three-dimensional model

Also Published As

Publication number Publication date
CN116762094A (en) 2023-09-15

Similar Documents

Publication Publication Date Title
KR102125958B1 (en) Method and apparatus for fusing point cloud data
CN107709928B (en) Method and device for real-time mapping and positioning
CN107025662B (en) Method, server, terminal and system for realizing augmented reality
WO2022110049A1 (en) Navigation method, apparatus, and system
CN110617821B (en) Positioning method, positioning device and storage medium
JP5802279B2 (en) Autonomous mobile system
CN109887053A (en) A kind of SLAM map joining method and system
CN108406731A (en) A kind of positioning device, method and robot based on deep vision
KR20180079428A (en) Apparatus and method for automatic localization
CN105973236A (en) Indoor positioning or navigation method and device, and map database generation method
CN111242994B (en) Semantic map construction method, semantic map construction device, robot and storage medium
WO2022193508A1 (en) Method and apparatus for posture optimization, electronic device, computer-readable storage medium, computer program, and program product
JP2015148601A (en) System and method for mapping, localization and pose correction
CN111986506A (en) Mechanical parking space parking method based on multi-vision system
CN103941746A (en) System and method for processing unmanned aerial vehicle polling image
CN110073362A (en) System and method for lane markings detection
WO2005088971A1 (en) Image generation device, image generation method, and image generation program
CN103680291A (en) Method for realizing simultaneous locating and mapping based on ceiling vision
CN113916243A (en) Vehicle positioning method, device, equipment and storage medium for target scene area
CN208323361U (en) A kind of positioning device and robot based on deep vision
WO2022062480A1 (en) Positioning method and positioning apparatus of mobile device
CN113903011A (en) Semantic map construction and positioning method suitable for indoor parking lot
JPWO2020039937A1 (en) Position coordinate estimation device, position coordinate estimation method and program
WO2021017211A1 (en) Vehicle positioning method and device employing visual sensing, and vehicle-mounted terminal
WO2022193193A1 (en) Data processing method and device

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21930786

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 202180079742.6

Country of ref document: CN

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21930786

Country of ref document: EP

Kind code of ref document: A1