WO2023016082A1 - Three-dimensional reconstruction method and apparatus, and electronic device and storage medium - Google Patents

Three-dimensional reconstruction method and apparatus, and electronic device and storage medium Download PDF

Info

Publication number
WO2023016082A1
WO2023016082A1 PCT/CN2022/098993 CN2022098993W WO2023016082A1 WO 2023016082 A1 WO2023016082 A1 WO 2023016082A1 CN 2022098993 W CN2022098993 W CN 2022098993W WO 2023016082 A1 WO2023016082 A1 WO 2023016082A1
Authority
WO
WIPO (PCT)
Prior art keywords
image
vehicle
model
target
group
Prior art date
Application number
PCT/CN2022/098993
Other languages
French (fr)
Chinese (zh)
Inventor
张保成
Original Assignee
北京迈格威科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京迈格威科技有限公司 filed Critical 北京迈格威科技有限公司
Publication of WO2023016082A1 publication Critical patent/WO2023016082A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T17/00Three dimensional [3D] modelling, e.g. data description of 3D objects
    • G06T3/06
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/80Analysis of captured images to determine intrinsic or extrinsic camera parameters, i.e. camera calibration

Definitions

  • the present disclosure relates to the technical field of image processing, and in particular, to a three-dimensional reconstruction method, device, electronic equipment, and storage medium.
  • the purpose of the embodiments of the present disclosure is to provide a three-dimensional reconstruction method, device, electronic equipment and storage medium to solve the above problems.
  • an embodiment of the present disclosure provides a three-dimensional reconstruction method, the method comprising: acquiring a plurality of monitoring image data; each of the plurality of monitoring image data includes image data of a vehicle; according to the plurality of monitoring image data, Determine the target image group of the target model vehicle; the target image group includes: images of the target model vehicle at different viewing angles; obtain the calibration results of the cameras corresponding to each image in the target image group; according to the The target image group and the calibration results of the cameras corresponding to the images in the target image group obtain the three-dimensional model of the vehicle of the target model.
  • the images of the vehicle of the target model under different viewing angles are determined, and then the three-dimensional model of the vehicle of the target model is obtained according to the calibration results of each image and the corresponding camera.
  • Manual hand-held 3D scanning equipment scans the vehicle, and the construction of the vehicle 3D model can be realized based on the monitoring image data, which is more efficient; secondly, since there is no need for 3D scanning equipment and physical vehicles, the cost is lower and the implementation is easier.
  • the determining the target image group of the vehicle of the target model according to the plurality of monitoring image data includes: detecting the vehicle in the plurality of monitoring image data , to obtain a plurality of vehicle images; group the plurality of vehicle images according to the model of the vehicle to obtain at least one image group corresponding to at least one model of the vehicle, each image group includes the corresponding model of the vehicle under different viewing angles a plurality of vehicle images; determining an image group from the at least one image group as the target image group.
  • multiple vehicle images are obtained by detecting vehicles in multiple monitoring image data, and then multiple vehicle images are grouped according to vehicle models to prevent subsequent use of A-type vehicle images from Model B vehicles perform 3D reconstruction to improve the accuracy of 3D reconstruction; secondly, when grouping, because only the images of each vehicle need to be processed, there is no need to process the images of other objects in the surveillance video, and avoid the distortion of other objects.
  • the image interferes with the grouping, thereby reducing the complexity of the grouping, and improving the grouping efficiency and grouping accuracy.
  • the method before determining an image group from the at least one image group as the target image group, the method further includes: for each of the at least one image group An image group, and the vehicle images belonging to the same viewing angle in the image group are deduplicated.
  • the vehicle images belonging to the same viewing angle in the image group are deduplicated, so as to reduce the complexity of 3D reconstruction using the image group and improve the efficiency of 3D reconstruction.
  • obtaining the 3D model of the vehicle of the target model includes: obtaining the The key point information group corresponding to each image in the target image group; the key point information group includes: the position of a plurality of two-dimensional key points representing the outline of the vehicle in the corresponding image in the image; according to the target image group The key point information group corresponding to each image in the image and the calibration result of the camera that captured each image are used to obtain the 3D model of the vehicle of the target model.
  • the key point information group includes: the position of multiple two-dimensional key points representing the outline of the vehicle in the corresponding image in the image; and then according to each of the target image group
  • the key point information group corresponding to the image and the calibration result of the camera that took each image can obtain the 3D model of the vehicle of the target model, without using the position information of all points in the target image group in the image, and then improve the efficiency of 3D reconstruction.
  • the 3D model of the vehicle of the target model is obtained according to the key point information group corresponding to each image in the target image group and the calibration result of the camera that captures each image , comprising: determining the initial three-dimensional model of the vehicle of the target model, the initial three-dimensional model comprising: each three-dimensional key point constituting the three-dimensional model, and the initial coordinates of each three-dimensional key point in the model coordinate system; for the target image For each image in the group, determine the initial pose of the vehicle in the image in the world coordinate system when the image is taken; according to the key point information group corresponding to each image in the target image group and capture the respective images Based on the calibration result of the camera, the initial pose of the vehicle in each image and the initial coordinates of the three-dimensional key points of the initial three-dimensional model are optimized by using the bundle adjustment method to obtain the three-dimensional model of the vehicle of the target model.
  • the camera calibration results of each image optimize the initial pose of the vehicle in each image and the initial coordinates of the 3D key points of the initial 3D model to reduce the impact of noise on the 3D reconstruction results and improve the accuracy of the 3D reconstruction.
  • the camera calibration result corresponding to the image, and the initial pose determine the position of the initial projection point corresponding to the image in the image coordinate system, and then according to the initial projection point and the position difference between the corresponding two-dimensional key points, determine the first loss value corresponding to the image, and according to the first loss value of each image, the initial coordinates of the three-dimensional key points of the initial three-dimensional model and corresponding to each image.
  • the initial pose is optimized until the new loss value determined by using the optimized 3D model and the optimized pose meets the preset conditions, then the optimization is stopped, and then the accuracy of the final 3D model can be guaranteed.
  • obtaining the calibration result of the camera corresponding to each image in the target image group includes: determining each image in the target image group according to the plurality of surveillance image data The calibration results of the corresponding cameras.
  • the monitoring image data is used to determine the calibration result of the camera, so as to ensure that the subsequent three-dimensional reconstruction of the vehicle of the target model can be performed according to the calibration result of the camera.
  • an embodiment of the present disclosure provides a three-dimensional reconstruction device, the device comprising: an acquisition unit configured to acquire a plurality of monitoring image data; the plurality of monitoring image data all include image data of a vehicle; An image group determining unit, configured to determine a target image group of a target model vehicle according to the plurality of monitoring image data; the target image group includes: images of the target model vehicle under different viewing angles; calibration result acquisition A unit configured to obtain the calibration result of the camera corresponding to each image in the target image group; a three-dimensional model obtaining unit configured to obtain the calibration result of the camera corresponding to each image in the target image group and the target image group A 3D model of the vehicle of the target model.
  • the image group determination unit includes: a detection unit configured to detect vehicles in the plurality of monitoring image data to obtain a plurality of vehicle images
  • the grouping unit may be configured to group the plurality of vehicle images according to the model of the vehicle to obtain at least one image group corresponding to at least one model of the vehicle, each image group including the vehicle of the corresponding model in A plurality of vehicle images under different viewing angles;
  • the selecting unit may be configured to determine one image group from the at least one image group as the target image group.
  • the device further includes: a deduplication unit configured to, for each image group in the at least one image group, The image of the vehicle under the perspective is deduplicated.
  • the 3D model obtaining unit includes: an information group obtaining unit configured to obtain the key point information group corresponding to each image in the target image group;
  • the key point information group includes: the position of multiple two-dimensional key points representing the outline of the vehicle in the corresponding image in the image;
  • the three-dimensional model obtaining subunit can be configured to The corresponding key point information group and the calibration results of the cameras that capture the images are used to obtain the three-dimensional model of the vehicle of the target model.
  • the 3D model obtaining subunit includes: an initial model determining unit configured to determine the initial 3D model of the vehicle of the target model, the initial 3D model
  • the model includes: each 3D key point constituting the 3D model, and the initial coordinates of each 3D key point in the model coordinate system;
  • the initial pose determination unit may be configured to, for each image in the target image group, Determining the initial pose of the vehicle in the image in the world coordinate system when the image is captured;
  • the optimization unit may be configured to set and capture the key point information corresponding to each image in the target image group
  • the camera calibration results of each image are optimized for the initial pose of the vehicle in each image and the initial coordinates of the 3D key points of the initial 3D model to obtain the 3D model of the vehicle of the target model.
  • the optimization unit includes: a projection unit configured to, for each image in the target image group, according to the initial three-dimensional model, the image Corresponding to the camera calibration result and the initial pose, determine the position of the initial projection point corresponding to the image in the image coordinate system; the initial projection point includes the three-dimensional corresponding to the two-dimensional key point of the image in the initial three-dimensional model The key point is projected to a point in the image coordinate system under the initial pose corresponding to the image; the loss determination unit may be configured to be based on the relationship between each initial projected point corresponding to the image and the corresponding two-dimensional key point Determine the first loss value corresponding to the image according to the position difference between them; the optimization subunit may be configured to calculate the initial coordinates and Optimizing the initial poses corresponding to each image until a new loss value determined using the optimized three-dimensional model and the optimized pose meets the preset condition; the optimized three-dimensional model is the vehicle of the target model 3D model of .
  • the calibration result acquisition unit may be configured to determine the calibration of the camera corresponding to each image in the target image group according to the plurality of surveillance image data result.
  • an embodiment of the present disclosure provides an electronic device, including a processor and a memory connected to the processor, where a computer program is stored in the memory, and when the computer program is executed by the processor, the The electronic device executes the method described in the first aspect.
  • an embodiment of the present disclosure provides a storage medium, where a computer program is stored in the storage medium, and when the computer program runs on a computer, the computer executes the method described in the first aspect.
  • FIG. 1 is a schematic flowchart of a three-dimensional reconstruction method provided by an embodiment of the present disclosure.
  • FIG. 2 is a schematic structural diagram of a three-dimensional reconstruction device provided by an embodiment of the present disclosure.
  • FIG. 3 is a schematic structural diagram of an electronic device provided by an embodiment of the present disclosure.
  • Icons 200-three-dimensional reconstruction device; 210-acquisition unit; 220-image group determination unit; 230-calibration result acquisition unit; 240-three-dimensional model acquisition unit; 300-electronic equipment; 301-processor; 302-memory; 303- Communication Interface.
  • FIG. 1 is a flow chart of a three-dimensional reconstruction method provided by an embodiment of the present disclosure. The process shown in FIG. 1 will be described in detail below, and the method includes steps: S11-S14.
  • S11 Acquire a plurality of monitoring image data; the plurality of monitoring image data all include image data of a vehicle.
  • S12 Determine a target image group of the target model vehicle according to the plurality of monitoring image data; the target image group includes: images of the target model vehicle under different viewing angles.
  • S11 Acquire a plurality of monitoring image data; the plurality of monitoring image data all include image data of a vehicle.
  • S11 may be implemented in the following manner, acquiring a plurality of monitoring image data captured by at least one camera from a third party, wherein the plurality of monitoring image data all include vehicle image data.
  • multiple monitoring image data can be acquired without directly communicating with at least one camera, and the complexity of communication connection is low, especially when the number of at least one camera is large, the effect is more obvious.
  • S11 can be implemented in the following manner, acquiring multiple monitoring image data captured by at least one camera within a specified time period after the current moment, so as to ensure that the acquired multiple monitoring image data contains the latest vehicle model Image.
  • the latest vehicle model may be a model of a vehicle produced within one year before the current moment, and in other embodiments, the latest vehicle model may also be a model of a vehicle produced within half a year before the current moment.
  • S11 may be implemented in the following manner, acquiring a plurality of monitoring image data sent by at least one camera.
  • the image may be a picture or a video.
  • each camera in the at least two cameras can be set at different geographical locations or different angles respectively, so as to ensure that images from different angles can be taken; in the above-mentioned
  • the camera may be a camera that is installed on roads inside the park, parking lots inside the company, etc., and can capture images of vehicles.
  • S12 Determine a target image group of the target model vehicle according to the plurality of monitoring image data; the target image group includes: images of the target model vehicle under different viewing angles.
  • the target model is the model of the vehicle whose three-dimensional model needs to be established.
  • S12 can be implemented in the following manner, based on a predetermined target model, using a pre-trained vehicle model recognition model to identify a plurality of images of vehicles belonging to the target model from a plurality of monitoring image data, And divide the multiple images of the vehicle of the target model into the target image group.
  • the target image group needs to contain images of the vehicle under multiple viewing angles.
  • the target image group needs to include images of the front surface of the target model vehicle, including the target model vehicle
  • the target image group may not include images under the viewing angles corresponding to this part of the viewing angles. For example, if the target image group already includes the image of the left side of the vehicle, the image of the right side of the vehicle may not be included.
  • the specific implementation manner of identifying which surface image of the vehicle the image belongs to may be: according to the image of the vehicle, determine the information of a plurality of two-dimensional key points that characterize the outline of the vehicle in the image, and then According to the information of the two-dimensional key points, it can be determined which surface image of the vehicle the image belongs to. For example, according to the image of the vehicle, it is determined that a plurality of two-dimensional key points representing the outline of the vehicle in the image include the key points of the left window, the key points of the left front wheel, the key points of the left rear wheel, etc., then it can be determined that the image is Left view.
  • the origin of the image coordinate system corresponding to the image can be the center of the image or the vertex of the image, one of the u axis and the v axis of the image coordinate system is parallel to the upper edge of the image, and the image coordinate The v-axis of the system, the other of the v-axis is parallel to the lower edge of the image.
  • S12 can be implemented in the following manner. For the image of each vehicle in a plurality of surveillance video data, input the image of the vehicle into the pre-trained vehicle model recognition model to obtain the vehicle model, and use the same The images captured by the vehicle model under different viewing angles are divided into an image group, and then from at least one of the divided image groups, one image group is sequentially or randomly determined as the target image group; or, considering that in order to construct the target model For the three-dimensional model of the vehicle, it is necessary to ensure that the target image group includes images of each surface of the target model vehicle.
  • the number of images meets a certain requirement (for example, greater than or equal to 200) or
  • the image group required by the angle distribution (eg, distribution at certain specific angles) is the target image group. It can be understood that, because the larger the number of images in the target image group, the more ensured that the images of the various surfaces of the vehicle of the target model can be determined from the target image group.
  • S12 includes steps: A1-A3.
  • A1 Detecting vehicles in the multiple monitoring image data to obtain multiple vehicle images.
  • A1 can be implemented in the following manner. For each image in multiple monitoring image data, use a pre-trained vehicle detection model to detect the vehicle in the image to obtain the position of the vehicle detection frame. The vehicle image can be cropped according to the position of the vehicle detection frame.
  • step A2 is performed.
  • A2 Group the vehicle images of each vehicle according to the vehicle model to obtain at least one more image group corresponding to at least one model of vehicle, and each image group includes images of the corresponding model of the vehicle under different viewing angles.
  • A2 can be implemented in the following manner. For each vehicle image, use the pre-trained vehicle brand recognition model to identify the vehicle brand in the vehicle image, wherein the vehicle image can include : The vehicle logo or logo of the vehicle; after determining the brand of the vehicle in the vehicle image, use the pre-trained vehicle model detection model corresponding to the brand of the vehicle to group the images of vehicles belonging to the same brand , dividing vehicle images of vehicles of the same model under various viewing angles into an image group to obtain at least one image group corresponding to at least one type of vehicle one-to-one.
  • the number of images in the image group needs to meet a certain Requirements (for example, the number of images in the group is greater than or equal to 200, or 3000), to ensure that the group of images includes vehicle images of vehicles of the corresponding model at various angles of view, and then ensure that the three-dimensionality of the vehicle can be accurately determined in the future Model.
  • a certain Requirements for example, the number of images in the group is greater than or equal to 200, or 3000
  • A2 can be implemented in the following manner. For each vehicle image, input the vehicle image into a pre-trained vehicle model recognition model to obtain the vehicle model, and then divide the vehicle images of the same model into As an image group, at least one image group corresponding to at least one model of vehicle is obtained.
  • step A3 may be performed.
  • A3 Determine an image group from the at least one image group as the target image group.
  • A3 may be implemented in the following manner, one image group is sequentially determined from the at least one image group as the target image group.
  • A3 may be implemented in the following manner, randomly determining an image group from the at least one image group as the target image group.
  • A3 may be implemented in the following manner, using an image group whose number of images satisfies the requirement as a target image group.
  • A3 may be implemented in the following manner. First, the target model is determined, and then the image group corresponding to the target model is used as the target image group.
  • each image group in the remaining image groups can be sequentially As the target image group, S13-S14 is then executed using the image group as the target image group, so as to determine the three-dimensional models of vehicles of multiple models.
  • step S13 is executed.
  • the camera calibration result may include the internal reference matrix K of the camera, the external reference matrix P of the camera, or the matrix S obtained by multiplying the internal reference matrix K of the camera and the external reference matrix P of the camera.
  • the specific calibration method may use any camera calibration method, which is not limited here.
  • the third category is based on the target calibration captured by the camera.
  • a specific method of camera calibration based on the target captured by the camera is as follows:
  • the 3D model of the known target (the 3D model of the known target can be reconstructed by methods such as manual scanning, and the 3D model includes multiple 3D key points); obtain the pose of the known target in the world coordinate system (so it can be determined The position of multiple 3D key points in the world coordinate system and the attitude of the 3D model in the world coordinate system); use the camera to be calibrated to shoot the known target to obtain the calibration image; key the known target in the calibration image Point detection, to obtain each two-dimensional key point of the known target and the position (u, v) of the two-dimensional key point in the image coordinate system; according to the corresponding relationship between the two-dimensional key point and the three-dimensional key point in the three-dimensional model (for For any 2D key point of the target in any calibration image, it is bound to be able to find the 3D key point corresponding to the position of the 2D key point from the 3D model of the target.
  • any 3D key point in the 3D model of the target key point there may not be a two-dimensional key point corresponding to the position of the three-dimensional key point in the calibration image;
  • the 3D key point corresponding to the 2D key point in the upper left corner of the front door is found, and the corresponding 3D key point is the point in the upper corner of the front door in the 3D model of the vehicle), and the known target can be determined
  • the position (x m , y m , z m ) of the 3D key point corresponding to the 2D key point in the 3D model of the The position (u, v) of the 2D key point in the image coordinate system, the position (x m , y m , z m ) of the 3D key point corresponding to the 2D key point in the 3D model in the world coordinate system, and the position of the 3D key point in the world coordinate system
  • the pose of the model in the world coordinate system when the calibration image is taken (the position is represented by (X
  • is a constant
  • the xoy plane of the world coordinate system may overlap with the road where the vehicle is located.
  • the origin of the world coordinate system may be a projection point corresponding to the center of the camera on the road where the vehicle is located.
  • S13 includes: determining a calibration result of a camera corresponding to each image in the target image group according to the plurality of surveillance image data.
  • the multiple surveillance image data include multiple models of vehicles, some of which have unknown 3D models of vehicles, and some of which have known 3D models of vehicles. Therefore, automatic calibration of the camera can be performed based on the vehicle whose 3D model is known.
  • S13 can be implemented in the following manner: use the cameras corresponding to each image in the target image group as the camera to be calibrated sequentially; find out the monitoring image data captured by the camera to be calibrated from a plurality of monitoring image data, From the monitoring image data captured by the camera to be calibrated, it is determined that there are multiple images of a target with a known 3D model (for example, if the 3D model of a certain type of vehicle is known, then the target of the known 3D model is the vehicle of this type) , determine the coordinates of the 3D key points representing the target according to the known 3D model, and detect the coordinates of the 2D key points representing the target in the multiple images according to the multiple images captured by the camera, and then according to the target’s
  • the three-dimensional coordinates of each three-dimensional key point and the two-dimensional coordinates of the corresponding two-dimensional key point determine the calibration result of the camera to be calibrated.
  • the corresponding relationship between the camera identification and the camera calibration result is stored.
  • S13 can be implemented in the following manner. Obtain the identifier of the camera corresponding to each image in the target image group, and then, for the identifier of the camera corresponding to each image, obtain the corresponding relationship between the camera identifier and the camera calibration result stored in advance. In , find out the camera calibration result corresponding to the identity of the camera corresponding to the image.
  • the three-dimensional model of the vehicle of the target model includes: three-dimensional key points representing the profile of the vehicle of the target model, and the relative positional relationship of each three-dimensional key point (for example, a certain three-dimensional key point can be the origin, parallel to the plane where the chassis of the vehicle is located)
  • the plane is the coordinate plane to establish a model coordinate system, and the coordinate values of each three-dimensional key point in the model coordinate system can represent the relative positional relationship between each three-dimensional key point); the three-dimensional key points contained in the three-dimensional model of each type of vehicle
  • the type and quantity of the model can be consistent, and the 3D key points involved in each 3D model can include: each car light, each window, each door, each wheel, the front surface of the car, the rear surface of the car, the roof surface, etc. point and so on.
  • the model coordinate system can be a three-dimensional coordinate system, and the model coordinate system and the world coordinate system can only have a translation relationship without a rotation or scaling relationship;
  • the origin of the model coordinate system can be a vehicle in an image in the target image group
  • the z-axis of the model coordinate system can be perpendicular to the road where the vehicle is located.
  • the x-axis of the model coordinate system can be parallel to the central axis of the vehicle. axis is perpendicular to the z-axis.
  • the images of the vehicle of the target model under different viewing angles are determined, and then the three-dimensional model of the vehicle of the target model is obtained according to the calibration results of each image and the corresponding camera.
  • Manual hand-held 3D scanning equipment scans the vehicle, and the construction of the vehicle 3D model can be realized based on the monitoring image data, which is more efficient; secondly, since there is no need for 3D scanning equipment and physical vehicles, the cost is lower and the implementation is easier.
  • S14 includes steps: B1-B2.
  • the key point information group includes: the positions of multiple two-dimensional key points representing the outline of the vehicle in the corresponding image in the image.
  • B1 can directly acquire multiple two-dimensional key points that characterize the outline of the vehicle in the image determined in the aforementioned step S12 for each image in the target image in the following manner: position; wherein, the positions of multiple two-dimensional key points in the image constitute the key point information group of the image. It can be understood that in some vehicle detection or vehicle model detection models, the vehicle key point detection is performed while the vehicle detection or vehicle model detection is performed, so that in step S12, multiple two-dimensional key points have been determined in the image position in .
  • B1 can be implemented in the following manner. For each image in the target image group, use the pre-trained vehicle key point extraction model to extract the key points of the image, so as to determine the representative vehicle from the image The positions of multiple 2D keypoints of the contour of the image in the image. It can be understood that in some vehicle detection or vehicle model detection models, vehicle key point detection is performed without vehicle detection or vehicle model detection, so additional steps are required to determine the positions of multiple two-dimensional key points in the image.
  • B2 Obtain the 3D model of the vehicle of the target model according to the key point information group corresponding to each image in the target image group and the calibration result of the camera that captures each image.
  • the key point information group corresponding to each image in the target image group includes: the positions of multiple two-dimensional key points representing the outline of the vehicle in the corresponding image in the image; then according to each of the target image group
  • the key point information group corresponding to the image and the calibration result of the camera that captures each image can obtain the 3D model of the vehicle of the target model.
  • the three-dimensional reconstruction of the car models contained in the images can be automatically performed through a large number of multi-angle images, and then the efficiency and accuracy of the three-dimensional reconstruction can be improved.
  • B2 includes steps B21-B23.
  • B21 Determine the initial three-dimensional model of the vehicle of the target model, the initial three-dimensional model includes: each three-dimensional key point constituting the three-dimensional model, and the initial coordinates of each three-dimensional key point in the model coordinate system.
  • the model coordinate system is a three-dimensional coordinate system, which is established according to user requirements without limitation.
  • 3D key points contained in the initial 3D models of vehicles of various models may be consistent, and the 3D key points involved in each 3D model may include: each lamp, each door, each window, each wheel, vehicle Points on the periphery of the front surface, rear surface, roof surface, etc.
  • a 3D model of a vehicle with a known model can be used as the initial 3D model, or a unified initial 3D model can be specified for a certain type of vehicle.
  • the initial 3D models of vehicles belonging to the same class can be the same by default, for example, cars belong to the same class, trucks belong to the same class, and non-motor vehicles belong to the same class.
  • B22 For each image in the target image group, determine the initial pose of the vehicle in the image in the world coordinate system when the image is captured.
  • the initial pose can be determined according to the two-dimensional key points detected in each image combined with camera calibration information, or can be determined according to the pose of the same vehicle in the image captured at the associated time, and can also be a default value determined based on experience.
  • the two-dimensional key points detected in the first image are key points on the left rear wheel, left window, left front glass, left rear glass, etc.
  • the camera can be determined according to the camera calibration results of the camera that captured the first image. is erected parallel to the road, an initial pose can be estimated.
  • the second image also includes vehicle A
  • the second image is captured by the same camera as the first image, and the shooting time interval is less than a certain length of time (for example, 3s)
  • the first The pose of vehicle A in the image is estimated for the pose of the vehicle in the second image, for example, the pose of vehicle A in the first image can be used as the initial pose of vehicle A in the second image.
  • the initial pose of the vehicle in the world coordinate system when the image was taken in each image can be set to the same default value.
  • the method for optimizing the initial pose of the vehicle in each image and the initial coordinates of the 3D key points of the initial 3D model may be bundle adjustment.
  • B23 includes steps: B231-B233.
  • the initial projection point includes a point where a 3D key point corresponding to a 2D key point of the image in the initial 3D model is projected into the image coordinate system at an initial pose corresponding to the image.
  • the initial pose of the vehicle is the initial pose of the 3D model corresponding to the vehicle in the world coordinate system.
  • B231 can be implemented in the following manner. For each image, according to the corresponding two-dimensional key point information group of the image, from the initial three-dimensional model, determine the corresponding two-dimensional key points in the image The initial coordinates of the three-dimensional key points, for each determined three-dimensional key point, the initial coordinates (x, y, z) of the three-dimensional key point in the model, the camera calibration result S and the initial pose T corresponding to the image, Input to projection expression In , the position (u', v') of the initial projection point corresponding to the 3D key point in the image coordinate system corresponding to the image is obtained.
  • B232 Determine the first loss value corresponding to the image for the position difference between each initial projection point corresponding to the image and the corresponding dimensional key point.
  • B232 can be implemented in the following manner, for each initial projection point corresponding to the image, determine the position (u', v') of the initial projection point in the image coordinate system corresponding to the image and the The distance between the positions (u, v) of the two-dimensional key points corresponding to the three-dimensional key points corresponding to the initial projection point is determined as the first loss value as the sum of the respective distances corresponding to the image.
  • the optimized three-dimensional model is the three-dimensional model of the vehicle of the target model.
  • the initial coordinates of the 3D key points of the initial 3D model and the initial pose corresponding to each image are optimized, which can be based on the sum of the first loss values corresponding to each image, for The initial coordinates of the 3D key points of the initial 3D model and the initial poses corresponding to each image are optimized.
  • the preset condition can be one of the following: the sum of the loss values corresponding to each image converges; the sum of the loss values corresponding to each image is the minimum value in previous iterations; the loss value corresponding to each image is less than the target loss value; the corresponding loss value of each image The sum of the loss values of is less than the preset value; the number of iterations reaches the preset number.
  • B233 can be implemented in the following manner.
  • the initial coordinates of the 3D key points of the initial 3D model and the initial poses corresponding to each image The parameters are optimized to obtain the optimized 3D model and the optimized pose; for each image in the target image group, according to the corresponding 2D key point information group of the image, from the optimized 3D model, determine Get the coordinates of the three-dimensional key points corresponding to the two-dimensional key points in the image, and for each three-dimensional key point, the coordinates (x, y, z) of the three-dimensional key points, the camera calibration result S corresponding to the image and
  • the optimized pose T is input to the projection expression , the position (u', v') of the optimized projection point corresponding to the 3D key point in the image coordinate system corresponding to the image is obtained, according to each optimized projection point corresponding to the image and the corresponding 2D
  • the camera calibration result corresponding to the image, and the initial pose determine the position of the initial projection point corresponding to the image in the image coordinate system, and then according to the initial projection point and the position difference between the corresponding two-dimensional key points, determine the first loss value corresponding to the image, and according to the first loss value of each image, the initial coordinates and initial pose of the three-dimensional key points of the initial three-dimensional model Optimization, until the new loss value determined by using the optimized 3D model and the optimized pose meets the preset condition, the optimization is stopped, and then the accuracy of the final 3D model can be guaranteed.
  • the method further includes: for each image group in the at least one image group, deduplicating the vehicle images belonging to the same viewing angle in the image group.
  • the preset threshold may be any value in 79%-90%.
  • the vehicle images belonging to the same viewing angle in the image group are deduplicated, so as to reduce the complexity of 3D reconstruction using the image group and improve the efficiency of 3D reconstruction.
  • FIG. 2 is a structural block diagram of a three-dimensional reconstruction apparatus 200 provided by an embodiment of the present disclosure.
  • the structural block diagram shown in Figure 2 will be described below, and the shown devices include:
  • the obtaining unit 210 may be configured to obtain a plurality of monitoring image data; the plurality of monitoring image data all include image data of a vehicle.
  • the image group determining unit 220 may be configured to determine the target image group of the target model vehicle according to the plurality of surveillance image data; the target image group includes: the target model vehicle under different viewing angles image.
  • the calibration result obtaining unit 230 may be configured to obtain a calibration result of the camera corresponding to each image in the target image group.
  • the 3D model obtaining unit 240 may be configured to obtain the 3D model of the vehicle of the target model according to the target image group and the calibration results of the cameras corresponding to the images in the target image group.
  • the image group determination unit 220 includes: a detection unit configured to detect vehicles in the plurality of monitoring image data to obtain a plurality of vehicle images; a grouping unit may be It is configured to group the plurality of vehicle images according to the model of the vehicle to obtain at least one image group corresponding to at least one model of the vehicle, and each image group includes a plurality of images of the corresponding model of the vehicle under different viewing angles.
  • the vehicle image; selecting unit may be configured to determine one image group from the at least one image group as the target image group.
  • the device further includes: a deduplication unit, which may be configured to, for each image group in the at least one image group, deduplicate the vehicle images belonging to the same viewing angle in the image group. Heavy.
  • a deduplication unit which may be configured to, for each image group in the at least one image group, deduplicate the vehicle images belonging to the same viewing angle in the image group. Heavy.
  • the 3D model obtaining unit 240 includes: an information group obtaining unit configured to obtain a key point information group corresponding to each image in the target image group; the key point information group includes : The position of multiple two-dimensional key points representing the outline of the vehicle in the corresponding image in the image; the three-dimensional model obtaining subunit can be configured to set the key point information corresponding to each image in the target image group and the calibration results of the cameras that capture the respective images to obtain a three-dimensional model of the vehicle of the target model.
  • the 3D model obtaining subunit includes: an initial model determining unit configured to determine the initial 3D model of the vehicle of the target model, and the initial 3D model includes: Each three-dimensional key point, and the initial coordinates of each three-dimensional key point in the model coordinate system;
  • the initial pose determination unit may be configured to determine the position of the vehicle in the image for each image in the target image group The initial pose of the image in the world coordinate system when the image is taken;
  • the optimization unit may be configured to be based on the key point information group corresponding to each image in the target image group and the calibration result of the camera that captures each image , optimizing the initial pose of the vehicle in each image and the initial coordinates of the 3D key points of the initial 3D model to obtain the 3D model of the vehicle of the target model.
  • the optimization unit includes: a projection unit, configured to, for each image in the target image group, according to the initial 3D model, the camera calibration result corresponding to the image, and the initial Pose, determine the position of the initial projection point corresponding to the image in the image coordinate system;
  • the initial projection point includes the three-dimensional key point corresponding to the two-dimensional key point of the image in the initial three-dimensional model in the corresponding position of the image Points projected into the image coordinate system under the initial pose;
  • the loss determination unit may be configured to determine the corresponding The first loss value of the image;
  • the optimization subunit is used to optimize the initial coordinates of the three-dimensional key points of the initial three-dimensional model and the initial pose corresponding to each image according to the first loss value corresponding to each image, Until the new loss value determined by using the optimized three-dimensional model and the optimized pose satisfies the preset condition;
  • the optimized three-dimensional model is the three-dimensional model of the vehicle of the target model.
  • the calibration result acquisition unit 230 may be configured to determine, according to the plurality of monitoring image data, the calibration results of the cameras corresponding to the images in the target image group.
  • FIG. 3 is a schematic structural diagram of an electronic device 300 provided by an embodiment of the present disclosure.
  • the electronic device 300 may be a personal computer, a tablet computer, a smart phone, a personal digital assistant (personal digital assistant, PDA) and the like.
  • PDA personal digital assistant
  • the electronic device 300 may include: a memory 302, a processor 301, a communication interface 303, and a communication bus, and the communication bus is used to implement connection and communication of these components.
  • the memory 302 is used to store various data such as calculation program instructions corresponding to the three-dimensional reconstruction method and device provided by the embodiments of the present disclosure, wherein the memory 302 may be, but not limited to, random access memory, read only memory (Read Only Memory, ROM), Programmable Read-Only Memory (Programmable Read-Only Memory, PROM), Erasable Programmable Read-Only Memory (EPROM), Electric Erasable Programmable Read-Only Memory (Electric Erasable Programmable Read-Only Memory, Only Memory, EEPROM), etc.
  • the processor 301 is used to read and run the computer program instructions corresponding to the three-dimensional reconstruction method and device stored in the memory to obtain a plurality of monitoring image data; the plurality of monitoring image data includes image data of vehicles; according to the A plurality of monitoring image data, determine the target image group of the target model vehicle; the target image group includes: the images of the target model vehicle at different angles of view; obtain the corresponding camera of each image in the target image group The calibration result of the target image group and the calibration results of the cameras corresponding to the images in the target image group to obtain a three-dimensional model of the vehicle of the target model.
  • processor 301 may be an integrated circuit chip, which has a signal processing capability.
  • processor 301 can be general purpose processor, comprises CPU, network processor (Network Processor, NP) etc.; Can also be digital signal processor (DSP), application specific integrated circuit (ASIC), field programmable gate array (FPGA) ) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components.
  • DSP digital signal processor
  • ASIC application specific integrated circuit
  • FPGA field programmable gate array
  • a general-purpose processor may be a microprocessor, or the processor may be any conventional processor, or the like.
  • the communication interface 303 is used for receiving or sending data.
  • an embodiment of the present disclosure also provides a storage medium, in which a computer program is stored, and when the computer program is run on a computer, the computer is made to execute the method provided by any one of the embodiments of the present disclosure. method.
  • the 3D reconstruction method, device, electronic device, and storage medium proposed by the various embodiments of the present disclosure determine the images of the target model vehicle under different viewing angles based on a plurality of monitoring image data, and then, according to each image and The 3D model of the vehicle of the target model is obtained from the calibration result of the corresponding camera.
  • This method does not need to manually scan the vehicle with a 3D scanning device, and the construction of the 3D model of the vehicle can be realized based on the monitoring image data, which is more efficient. Since there is no need for 3D scanning equipment and physical vehicles, the cost is lower and the implementation is easier.
  • each block in a flowchart or block diagram may represent a module, program segment, or part of code that includes one or more Executable instructions.
  • the functions noted in the block may occur out of the order noted in the figures. For example, two blocks in succession may, in fact, be executed substantially concurrently, or they may sometimes be executed in the reverse order, depending upon the functionality involved.
  • each block of the block diagrams and/or flowchart illustrations, and combinations of blocks in the block diagrams and/or flowchart illustrations can be implemented by a dedicated hardware-based device that performs the specified function or action , or may be implemented by a combination of dedicated hardware and computer instructions.
  • each functional module in each embodiment of the present disclosure may be integrated together to form an independent part, each module may exist independently, or two or more modules may be integrated to form an independent part.
  • the present disclosure provides a three-dimensional reconstruction method, device, electronic equipment, and storage medium.
  • the method includes: acquiring a plurality of monitoring image data; the plurality of monitoring image data includes image data of a vehicle; The target image group of the target model vehicle; the target image group includes the images of the target model vehicle under different viewing angles; obtain the calibration results of the cameras corresponding to each image in the target image group; according to the target image group and each image in the target image group The calibration result of the camera corresponding to the image is used to obtain the 3D model of the vehicle of the target model.
  • This method does not need to scan the vehicle with a hand-held 3D scanning device, and can realize the construction of a 3D model of the vehicle based on the monitoring image data, which is more efficient; easy.
  • the three-dimensional reconstruction method, device, electronic device and storage medium of the present disclosure are reproducible and can be used in various industrial applications.
  • the three-dimensional reconstruction method, device, electronic equipment, and storage medium of the present disclosure can be used in the technical field of image processing.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Computer Graphics (AREA)
  • Geometry (AREA)
  • Software Systems (AREA)
  • Image Analysis (AREA)
  • Length Measuring Devices By Optical Means (AREA)

Abstract

Provided in the present disclosure are a three-dimensional reconstruction method and apparatus, and an electronic device and a storage medium. The method comprises: acquiring a plurality of pieces of monitoring image data, wherein the plurality of pieces of monitoring image data each comprise image data of a vehicle; determining a target image group of a vehicle of a target model according to the plurality of pieces of monitoring image data, wherein the target image group comprises images of the vehicle of the target model at different viewing angles; acquiring a calibration result of a camera corresponding to each image in the target image group; and obtaining a three-dimensional model of the vehicle of the target model according to the target image group and the calibration result of the camera corresponding to each image in the target image group. By means of the method, there is no need to manually hold a three-dimensional scanning device to scan a vehicle, and a three-dimensional vehicle model can be constructed on the basis of monitoring image data, such that the efficiency is higher; moreover, there is no need to use the three-dimensional scanning device and a physical vehicle, such that the cost is lower, and the implementation is easier.

Description

三维重建方法、装置、电子设备及存储介质Three-dimensional reconstruction method, device, electronic equipment and storage medium
相关申请的交叉引用Cross References to Related Applications
本公开要求于2021年08月13日提交中国国家知识产权局的申请号为202110931999.1、名称为“三维重建方法、装置、电子设备及存储介质”的中国专利申请的优先权,其全部内容通过引用结合在本公开中。This disclosure claims the priority of the Chinese patent application with application number 202110931999.1 and titled "Three-dimensional reconstruction method, device, electronic equipment and storage medium" filed with the State Intellectual Property Office of China on August 13, 2021, the entire contents of which are incorporated by reference incorporated in this disclosure.
技术领域technical field
本公开涉及图像处理技术领域,具体而言,涉及一种三维重建方法、装置、电子设备及存储介质。The present disclosure relates to the technical field of image processing, and in particular, to a three-dimensional reconstruction method, device, electronic equipment, and storage medium.
背景技术Background technique
在智能交通领域中,为了准确地确定出真实车辆的位姿,需要对真实的车辆进行三维重建,以得到车辆的三维模型。In the field of intelligent transportation, in order to accurately determine the pose of a real vehicle, it is necessary to perform 3D reconstruction of the real vehicle to obtain a 3D model of the vehicle.
在相关领域的车辆三维重建技术中,针对每种型号的车辆,需要将该型号的真实车辆放置在一确定位置上,并依靠人工手持三维扫描设备对该车辆进行扫描,才能得到该种型号的车辆的三维模型。In the vehicle 3D reconstruction technology in the related field, for each type of vehicle, it is necessary to place the real vehicle of this type at a certain position, and rely on manual hand-held 3D scanning equipment to scan the vehicle, in order to obtain the model of the vehicle. 3D model of the vehicle.
由于上述方法需要借助人工和三维扫描设备,因此,上述方法效率较低,且三维扫描设备的使用一定程度上增加了车辆三维重建的成本;另外,由于上述方法需要基于实体车辆进行,在车辆型号众多的情况下,该方法的实现难度较大。Since the above method requires manual labor and 3D scanning equipment, the efficiency of the above method is low, and the use of 3D scanning equipment increases the cost of 3D reconstruction of the vehicle to a certain extent; In many cases, the implementation of this method is difficult.
发明内容Contents of the invention
鉴于此,本公开实施例的目的在于提供一种三维重建方法、装置、电子设备及存储介质,以解决上述问题。In view of this, the purpose of the embodiments of the present disclosure is to provide a three-dimensional reconstruction method, device, electronic equipment and storage medium to solve the above problems.
第一方面,本公开实施例提供一种三维重建方法,所述方法包括:获取多个监控图像数据;所述多个监控图像数据均包括车辆的图像数据;根据所述多个监控图像数据,确定出目标型号的车辆的目标图像组;所述目标图像组包括:所述目标型号的车辆在不同视角下的图像;获取所述目标图像组中各个图像对应的相机的标定结果;根据所述目标图像组和所述目标图像组中各个图像对应的相机的标定结果,得到所述目标型号的车辆的三维模型。In a first aspect, an embodiment of the present disclosure provides a three-dimensional reconstruction method, the method comprising: acquiring a plurality of monitoring image data; each of the plurality of monitoring image data includes image data of a vehicle; according to the plurality of monitoring image data, Determine the target image group of the target model vehicle; the target image group includes: images of the target model vehicle at different viewing angles; obtain the calibration results of the cameras corresponding to each image in the target image group; according to the The target image group and the calibration results of the cameras corresponding to the images in the target image group obtain the three-dimensional model of the vehicle of the target model.
在上述实现过程中,根据多个监控图像数据,确定出目标型号的车辆在不同视角下 的图像,继而根据各个图像和对应的相机的标定结果,得到目标型号的车辆的三维模型,该方法无需人工手持三维扫描设备对车辆进行扫描,通过基于监控图像数据便可实现车辆三维模型的构建,效率更高;其次,由于无需借助三维扫描设备及实体车辆,成本更低,且实现更容易。In the above implementation process, according to multiple monitoring image data, the images of the vehicle of the target model under different viewing angles are determined, and then the three-dimensional model of the vehicle of the target model is obtained according to the calibration results of each image and the corresponding camera. Manual hand-held 3D scanning equipment scans the vehicle, and the construction of the vehicle 3D model can be realized based on the monitoring image data, which is more efficient; secondly, since there is no need for 3D scanning equipment and physical vehicles, the cost is lower and the implementation is easier.
基于第一方面,在一种可能的设计中,所述根据所述多个监控图像数据,确定出目标型号的车辆的目标图像组,包括:对所述多个监控图像数据中的车辆进行检测,得到多个车辆图像;按照车辆的型号对所述多个车辆图像进行分组,得到与至少一个型号的车辆一一对应的至少一个图像组,每个图像组包括对应型号的车辆在不同视角下的多个车辆图像;从所述至少一个图像组中确定一个图像组作为所述目标图像组。Based on the first aspect, in a possible design, the determining the target image group of the vehicle of the target model according to the plurality of monitoring image data includes: detecting the vehicle in the plurality of monitoring image data , to obtain a plurality of vehicle images; group the plurality of vehicle images according to the model of the vehicle to obtain at least one image group corresponding to at least one model of the vehicle, each image group includes the corresponding model of the vehicle under different viewing angles a plurality of vehicle images; determining an image group from the at least one image group as the target image group.
在上述实现过程中,通过对多个监控图像数据中的车辆进行检测,以得到多个车辆图像,继而按照车辆的型号对多个车辆图像进分组,以防止后续利用A型号的车辆的图像对B型号的车辆进行三维重建,提高三维重建的准确度;其次,在分组时,由于只需要对各个车辆图像进行处理,无需对监控视频中的其他物体的图像进行处理,且避免了其他物体的图像对分组的干扰,继而降低分组复杂度,提高分组效率和分组准确度。In the above-mentioned implementation process, multiple vehicle images are obtained by detecting vehicles in multiple monitoring image data, and then multiple vehicle images are grouped according to vehicle models to prevent subsequent use of A-type vehicle images from Model B vehicles perform 3D reconstruction to improve the accuracy of 3D reconstruction; secondly, when grouping, because only the images of each vehicle need to be processed, there is no need to process the images of other objects in the surveillance video, and avoid the distortion of other objects. The image interferes with the grouping, thereby reducing the complexity of the grouping, and improving the grouping efficiency and grouping accuracy.
基于第一方面,在一种可能的设计中,在从所述至少一个图像组中确定一个图像组作为所述目标图像组之前,所述方法还包括:针对所述至少一个图像组中的每个图像组,将该图像组中属于同一视角下的车辆图像进行去重。Based on the first aspect, in a possible design, before determining an image group from the at least one image group as the target image group, the method further includes: for each of the at least one image group An image group, and the vehicle images belonging to the same viewing angle in the image group are deduplicated.
在上述实现过程中,针对每个图像组,将该图像组中属于同一视角下的车辆图像进行去重,降低利用图像组进行三维重建的复杂度,提高三维重建效率。In the above implementation process, for each image group, the vehicle images belonging to the same viewing angle in the image group are deduplicated, so as to reduce the complexity of 3D reconstruction using the image group and improve the efficiency of 3D reconstruction.
基于第一方面,在一种可能的设计中,根据所述目标图像组和所述目标图像组中各个图像对应的相机的标定结果,得到所述目标型号的车辆的三维模型,包括:获取所述目标图像组中各个图像对应的关键点信息组;所述关键点信息组包括:表征对应图像中车辆的轮廓的多个二维关键点在所处图像中的位置;根据所述目标图像组中各个图像对应的关键点信息组和拍摄所述各个图像的相机的标定结果,得到所述目标型号的车辆的三维模型。Based on the first aspect, in a possible design, according to the target image group and the calibration results of the cameras corresponding to the images in the target image group, obtaining the 3D model of the vehicle of the target model includes: obtaining the The key point information group corresponding to each image in the target image group; the key point information group includes: the position of a plurality of two-dimensional key points representing the outline of the vehicle in the corresponding image in the image; according to the target image group The key point information group corresponding to each image in the image and the calibration result of the camera that captured each image are used to obtain the 3D model of the vehicle of the target model.
在上述实现过程中,考虑到车辆图像中表征车辆轮廓的多个二维关键点在所处图像中的位置、图像对应的相机标定结果,以及车辆的三维模型这三者存在映射关系,因此,通过获取目标图像组中各个图像对应的关键点信息组;关键点信息组包括:表征对应图像中车辆的轮廓的多个二维关键点在所处图像中的位置;继而根据目标图像组中各个图像对应的关键点信息组和拍摄各个图像的相机的标定结果,得到目标型号的车辆的三维模型,无需利用目标图像组中的全部点在所处图像中的位置信息,继而提高三维重建效 率。In the above implementation process, considering the position of multiple two-dimensional key points in the vehicle image that characterize the vehicle outline in the image, the camera calibration results corresponding to the image, and the three-dimensional model of the vehicle, there is a mapping relationship. Therefore, By obtaining the key point information group corresponding to each image in the target image group; the key point information group includes: the position of multiple two-dimensional key points representing the outline of the vehicle in the corresponding image in the image; and then according to each of the target image group The key point information group corresponding to the image and the calibration result of the camera that took each image can obtain the 3D model of the vehicle of the target model, without using the position information of all points in the target image group in the image, and then improve the efficiency of 3D reconstruction.
基于第一方面,在一种可能的设计中,根据所述目标图像组中各个图像对应的关键点信息组和拍摄所述各个图像的相机的标定结果,得到所述目标型号的车辆的三维模型,包括:确定所述目标型号的车辆的初始三维模型,所述初始三维模型包括:构成三维模型的各个三维关键点,以及各个三维关键点在模型坐标系中的初始坐标;针对所述目标图像组中的每个图像,确定该图像中的车辆在该图像被拍摄时在世界坐标系中的初始位姿;根据所述目标图像组中各个图像对应的关键点信息组和拍摄所述各个图像的相机的标定结果,利用光束平差法对各个图像中的车辆的初始位姿,以及所述初始三维模型的三维关键点的初始坐标进行优化,得到所述目标型号的车辆的三维模型。Based on the first aspect, in a possible design, the 3D model of the vehicle of the target model is obtained according to the key point information group corresponding to each image in the target image group and the calibration result of the camera that captures each image , comprising: determining the initial three-dimensional model of the vehicle of the target model, the initial three-dimensional model comprising: each three-dimensional key point constituting the three-dimensional model, and the initial coordinates of each three-dimensional key point in the model coordinate system; for the target image For each image in the group, determine the initial pose of the vehicle in the image in the world coordinate system when the image is taken; according to the key point information group corresponding to each image in the target image group and capture the respective images Based on the calibration result of the camera, the initial pose of the vehicle in each image and the initial coordinates of the three-dimensional key points of the initial three-dimensional model are optimized by using the bundle adjustment method to obtain the three-dimensional model of the vehicle of the target model.
在上述实现过程中,在确定出目标型号的车辆的初始三维模型,以及各个图像中的车辆在世界坐标系中的初始位姿态之后,根据目标图像组中各个图像对应的关键点信息组和拍摄各个图像的相机的标定结果,对各个图像中的车辆的初始位姿,以及初始三维模型的三维关键点的初始坐标进行优化,以降低噪声对三维重建结果的影响,提高三维重建的准确度。In the above implementation process, after determining the initial 3D model of the vehicle of the target model and the initial position and posture of the vehicle in the world coordinate system in each image, according to the key point information group and the shooting data corresponding to each image in the target image group The camera calibration results of each image optimize the initial pose of the vehicle in each image and the initial coordinates of the 3D key points of the initial 3D model to reduce the impact of noise on the 3D reconstruction results and improve the accuracy of the 3D reconstruction.
基于第一方面,在一种可能的设计中,根据所述目标图像组中各个图像对应的关键点信息组和拍摄所述各个图像的相机的标定结果,对各个图像中的车辆的初始位姿,以及所述初始三维模型的三维关键点的初始坐标进行优化,得到所述目标型号的车辆的三维模型,包括:针对所述目标图像组中的每个图像,根据所述初始三维模型、该图像对应的相机标定结果和初始位姿,确定与该图像对应的初始投影点在图像坐标系中的位置;所述初始投影点包括所述初始三维模型中与该图像的二维关键点对应的三维关键点在该图像对应的初始位姿下投影至所述图像坐标系中的点;根据对应于该图像的各初始投影点和对应的二维关键点之间的位置差异,确定对应于该图像的第一损失值;根据各个图像对应的第一损失值,对所述初始三维模型的三维关键点的初始坐标和各个初始位姿进行优化,直至利用优化后的三维模型和优化后的位姿确定出的新的损失值满足预设条件;所述优化后的三维模型为所述目标型号的车辆的三维模型。Based on the first aspect, in a possible design, according to the key point information group corresponding to each image in the target image group and the calibration result of the camera that captured each image, the initial pose of the vehicle in each image , and the initial coordinates of the three-dimensional key points of the initial three-dimensional model are optimized to obtain the three-dimensional model of the vehicle of the target model, including: for each image in the target image group, according to the initial three-dimensional model, the The camera calibration result and initial pose corresponding to the image, determine the position of the initial projection point corresponding to the image in the image coordinate system; the initial projection point includes the initial three-dimensional model corresponding to the two-dimensional key point of the image The three-dimensional key point is projected to the point in the image coordinate system under the initial pose corresponding to the image; according to the position difference between each initial projection point corresponding to the image and the corresponding two-dimensional key point, determine the corresponding The first loss value of the image; according to the first loss value corresponding to each image, the initial coordinates and each initial pose of the 3D key point of the initial 3D model are optimized until the optimized 3D model and the optimized position are used The new loss value determined by the attitude satisfies the preset condition; the optimized three-dimensional model is the three-dimensional model of the vehicle of the target model.
在上述实现过程中,针对每个图像,根据初始三维模型、该图像对应的相机标定结果和初始位姿,确定与该图像对应的初始投影点在图像坐标系中的位置,接着根据初始投影点和对应的二维关键点之间的位置差异,确定对应于该图像的第一损失值,并根据各个图像的第一损失值,对初始三维模型的三维关键点的初始坐标和对应于各个图像的初始位姿进行优化,直至利用优化后的三维模型和优化后的位姿确定出的新的损失值满足预设条件时,才停止优化,继而可以保证最终得到的三维模型的准确性。In the above implementation process, for each image, according to the initial three-dimensional model, the camera calibration result corresponding to the image, and the initial pose, determine the position of the initial projection point corresponding to the image in the image coordinate system, and then according to the initial projection point and the position difference between the corresponding two-dimensional key points, determine the first loss value corresponding to the image, and according to the first loss value of each image, the initial coordinates of the three-dimensional key points of the initial three-dimensional model and corresponding to each image The initial pose is optimized until the new loss value determined by using the optimized 3D model and the optimized pose meets the preset conditions, then the optimization is stopped, and then the accuracy of the final 3D model can be guaranteed.
基于第一方面,在一种可能的设计中,获取所述目标图像组中各个图像对应的相机的标定结果,包括:根据所述多个监控图像数据,确定出所述目标图像组中各个图像对应的相机的标定结果。Based on the first aspect, in a possible design, obtaining the calibration result of the camera corresponding to each image in the target image group includes: determining each image in the target image group according to the plurality of surveillance image data The calibration results of the corresponding cameras.
在上述实现过程中,利用监控图像数据来确定相机的标定结果,保证后续能够根据相机的标定结果对目标型号的车辆进行三维重建。In the above implementation process, the monitoring image data is used to determine the calibration result of the camera, so as to ensure that the subsequent three-dimensional reconstruction of the vehicle of the target model can be performed according to the calibration result of the camera.
第二方面,本公开实施例提供一种三维重建装置,所述装置包括:获取单元,可以被配置成用于获取多个监控图像数据;所述多个监控图像数据均包括车辆的图像数据;图像组确定单元,用于根据所述多个监控图像数据,确定出目标型号的车辆的目标图像组;所述目标图像组包括:所述目标型号的车辆在不同视角下的图像;标定结果获取单元,用于获取所述目标图像组中各个图像对应的相机的标定结果;三维模型获得单元,用于根据所述目标图像组和所述目标图像组中各个图像对应的相机的标定结果,得到所述目标型号的车辆的三维模型。In a second aspect, an embodiment of the present disclosure provides a three-dimensional reconstruction device, the device comprising: an acquisition unit configured to acquire a plurality of monitoring image data; the plurality of monitoring image data all include image data of a vehicle; An image group determining unit, configured to determine a target image group of a target model vehicle according to the plurality of monitoring image data; the target image group includes: images of the target model vehicle under different viewing angles; calibration result acquisition A unit configured to obtain the calibration result of the camera corresponding to each image in the target image group; a three-dimensional model obtaining unit configured to obtain the calibration result of the camera corresponding to each image in the target image group and the target image group A 3D model of the vehicle of the target model.
基于第二方面,在一种可能的设计中,所述图像组确定单元,包括:检测单元,可以被配置成用于对所述多个监控图像数据中的车辆进行检测,得到多个车辆图像;分组单元,可以被配置成用于按照车辆的型号对所述多个车辆图像进行分组,得到与至少一个型号的车辆一一对应的至少一个图像组,每个图像组包括对应型号的车辆在不同视角下的多个车辆图像;选取单元,可以被配置成用于从所述至少一个图像组中确定一个图像组作为所述目标图像组。Based on the second aspect, in a possible design, the image group determination unit includes: a detection unit configured to detect vehicles in the plurality of monitoring image data to obtain a plurality of vehicle images The grouping unit may be configured to group the plurality of vehicle images according to the model of the vehicle to obtain at least one image group corresponding to at least one model of the vehicle, each image group including the vehicle of the corresponding model in A plurality of vehicle images under different viewing angles; the selecting unit may be configured to determine one image group from the at least one image group as the target image group.
基于第二方面,在一种可能的设计中,所述装置还包括:去重单元,可以被配置成用于针对所述至少一个图像组中的每个图像组,将该图像组中属于同一视角下的车辆图像进行去重。Based on the second aspect, in a possible design, the device further includes: a deduplication unit configured to, for each image group in the at least one image group, The image of the vehicle under the perspective is deduplicated.
基于第二方面,在一种可能的设计中,所述三维模型获得单元,包括:信息组获取单元,可以被配置成用于获取所述目标图像组中各个图像对应的关键点信息组;所述关键点信息组包括:表征对应图像中车辆的轮廓的多个二维关键点在所处图像中的位置;三维模型获得子单元,可以被配置成用于根据所述目标图像组中各个图像对应的关键点信息组和拍摄所述各个图像的相机的标定结果,得到所述目标型号的车辆的三维模型。Based on the second aspect, in a possible design, the 3D model obtaining unit includes: an information group obtaining unit configured to obtain the key point information group corresponding to each image in the target image group; The key point information group includes: the position of multiple two-dimensional key points representing the outline of the vehicle in the corresponding image in the image; the three-dimensional model obtaining subunit can be configured to The corresponding key point information group and the calibration results of the cameras that capture the images are used to obtain the three-dimensional model of the vehicle of the target model.
基于第二方面,在一种可能的设计中,所述三维模型获得子单元,包括:初始模型确定单元,可以被配置成用于确定所述目标型号的车辆的初始三维模型,所述初始三维模型包括:构成三维模型的各个三维关键点,以及各个三维关键点在模型坐标系中的初始坐标;初始位姿确定单元,可以被配置成用于针对所述目标图像组中的每个图像,确定该图像中的车辆在该图像被拍摄时在世界坐标系中的初始位姿;优化单元,可以被配 置成用于根据所述目标图像组中各个图像对应的关键点信息组和拍摄所述各个图像的相机的标定结果,对各个图像中的车辆的初始位姿,以及所述初始三维模型的三维关键点的初始坐标进行优化,得到所述目标型号的车辆的三维模型。Based on the second aspect, in a possible design, the 3D model obtaining subunit includes: an initial model determining unit configured to determine the initial 3D model of the vehicle of the target model, the initial 3D model The model includes: each 3D key point constituting the 3D model, and the initial coordinates of each 3D key point in the model coordinate system; the initial pose determination unit may be configured to, for each image in the target image group, Determining the initial pose of the vehicle in the image in the world coordinate system when the image is captured; the optimization unit may be configured to set and capture the key point information corresponding to each image in the target image group The camera calibration results of each image are optimized for the initial pose of the vehicle in each image and the initial coordinates of the 3D key points of the initial 3D model to obtain the 3D model of the vehicle of the target model.
基于第二方面,在一种可能的设计中,所述优化单元,包括:投影单元,可以被配置成用于针对所述目标图像组中的每个图像,根据所述初始三维模型、该图像对应的相机标定结果和初始位姿,确定与该图像对应的初始投影点在图像坐标系中的位置;所述初始投影点包括所述初始三维模型中与该图像的二维关键点对应的三维关键点在该图像对应的初始位姿下投影至所述图像坐标系中的点;损失确定单元,可以被配置成用于根据对应于该图像的各初始投影点和对应的二维关键点之间的位置差异,确定对应于该图像的第一损失值;优化子单元,可以被配置成用于根据各个图像对应的第一损失值,对所述初始三维模型的三维关键点的初始坐标和对应于各个图像的初始位姿进行优化,直至利用优化后的三维模型和优化后的位姿确定出的新的损失值满足预设条件;所述优化后的三维模型为所述目标型号的车辆的三维模型。Based on the second aspect, in a possible design, the optimization unit includes: a projection unit configured to, for each image in the target image group, according to the initial three-dimensional model, the image Corresponding to the camera calibration result and the initial pose, determine the position of the initial projection point corresponding to the image in the image coordinate system; the initial projection point includes the three-dimensional corresponding to the two-dimensional key point of the image in the initial three-dimensional model The key point is projected to a point in the image coordinate system under the initial pose corresponding to the image; the loss determination unit may be configured to be based on the relationship between each initial projected point corresponding to the image and the corresponding two-dimensional key point Determine the first loss value corresponding to the image according to the position difference between them; the optimization subunit may be configured to calculate the initial coordinates and Optimizing the initial poses corresponding to each image until a new loss value determined using the optimized three-dimensional model and the optimized pose meets the preset condition; the optimized three-dimensional model is the vehicle of the target model 3D model of .
基于第二方面,在一种可能的设计中,所述标定结果获取单元,可以被配置成用于根据所述多个监控图像数据,确定出所述目标图像组中各个图像对应的相机的标定结果。Based on the second aspect, in a possible design, the calibration result acquisition unit may be configured to determine the calibration of the camera corresponding to each image in the target image group according to the plurality of surveillance image data result.
第三方面,本公开实施例提供一种电子设备,包括处理器以及与所述处理器连接的存储器,所述存储器内存储计算机程序,当所述计算机程序被所述处理器执行时,使得所述电子设备执行第一方面所述的方法。In a third aspect, an embodiment of the present disclosure provides an electronic device, including a processor and a memory connected to the processor, where a computer program is stored in the memory, and when the computer program is executed by the processor, the The electronic device executes the method described in the first aspect.
第四方面,本公开实施例提供一种存储介质,所述存储介质中存储有计算机程序,当所述计算机程序在计算机上运行时,使得所述计算机执行第一方面所述的方法。In a fourth aspect, an embodiment of the present disclosure provides a storage medium, where a computer program is stored in the storage medium, and when the computer program runs on a computer, the computer executes the method described in the first aspect.
本公开的其他特征和优点将在随后的说明书阐述,并且,部分地从说明书中变得显而易见,或者通过实施本公开实施例了解。Additional features and advantages of the disclosure will be set forth in the description which follows, and in part will be apparent from the description, or may be learned by practice of the disclosed embodiments.
附图说明Description of drawings
为了更清楚地说明本公开实施例的技术方案,下面将对实施例中所需要使用的附图作简单地介绍,应当理解,以下附图仅示出了本公开的某些实施例,因此不应被看作是对范围的限定,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他相关的附图。In order to more clearly illustrate the technical solutions of the embodiments of the present disclosure, the following will briefly introduce the accompanying drawings used in the embodiments. It should be understood that the following drawings only show some embodiments of the present disclosure, and therefore are not It should be regarded as a limitation on the scope, and those skilled in the art can also obtain other related drawings based on these drawings without creative work.
图1为本公开实施例提供的三维重建方法的流程示意图。FIG. 1 is a schematic flowchart of a three-dimensional reconstruction method provided by an embodiment of the present disclosure.
图2为本公开实施例提供的三维重建装置的结构示意图。FIG. 2 is a schematic structural diagram of a three-dimensional reconstruction device provided by an embodiment of the present disclosure.
图3为本公开实施例提供的电子设备的结构示意图。FIG. 3 is a schematic structural diagram of an electronic device provided by an embodiment of the present disclosure.
图标:200-三维重建装置;210-获取单元;220-图像组确定单元;230-标定结果获取单元;240-三维模型获得单元;300-电子设备;301-处理器;302-存储器;303-通信接口。Icons: 200-three-dimensional reconstruction device; 210-acquisition unit; 220-image group determination unit; 230-calibration result acquisition unit; 240-three-dimensional model acquisition unit; 300-electronic equipment; 301-processor; 302-memory; 303- Communication Interface.
具体实施方式Detailed ways
下面将结合本公开实施例中附图,对本公开实施例中的技术方案进行描述。The technical solutions in the embodiments of the present disclosure will be described below with reference to the drawings in the embodiments of the present disclosure.
应注意到:相似的标号和字母在下面的附图中表示类似项,因此,一旦某一项在一个附图中被定义,则在随后的附图中不需要对其进行进一步定义和解释。同时,在本公开的描述中,术语“第一”、“第二”等仅用于区分描述,而不能理解为指示或暗示相对重要性。It should be noted that like numerals and letters denote similar items in the following figures, therefore, once an item is defined in one figure, it does not require further definition and explanation in subsequent figures. Meanwhile, in the description of the present disclosure, the terms "first", "second", etc. are only used to distinguish descriptions, and cannot be understood as indicating or implying relative importance.
请参照图1,图1为本公开实施例提供的一种三维重建方法的流程图,下面将对图1所示的流程进行详细阐述,所述方法包括步骤:S11-S14。Please refer to FIG. 1 . FIG. 1 is a flow chart of a three-dimensional reconstruction method provided by an embodiment of the present disclosure. The process shown in FIG. 1 will be described in detail below, and the method includes steps: S11-S14.
S11:获取多个监控图像数据;所述多个监控图像数据均包括车辆的图像数据。S11: Acquire a plurality of monitoring image data; the plurality of monitoring image data all include image data of a vehicle.
S12:根据所述多个监控图像数据,确定出目标型号的车辆的目标图像组;所述目标图像组包括:所述目标型号的车辆在不同视角下的图像。S12: Determine a target image group of the target model vehicle according to the plurality of monitoring image data; the target image group includes: images of the target model vehicle under different viewing angles.
S13:获取所述目标图像组中各个图像对应的相机的标定结果。S13: Obtain a calibration result of the camera corresponding to each image in the target image group.
S14:根据所述目标图像组和所述目标图像组中各个图像对应的相机的标定结果,得到所述目标型号的车辆的三维模型。S14: According to the target image group and the calibration results of the cameras corresponding to the images in the target image group, obtain a three-dimensional model of the vehicle of the target model.
下面对上述方法进行详细介绍。The above method will be described in detail below.
S11:获取多个监控图像数据;所述多个监控图像数据均包括车辆的图像数据。S11: Acquire a plurality of monitoring image data; the plurality of monitoring image data all include image data of a vehicle.
在实际实施过程中,S11可以按照如下方式实施,从第三方获取至少一个相机拍摄到的多个监控图像数据,其中,所述多个监控图像数据均包括车辆的图像数据。In an actual implementation process, S11 may be implemented in the following manner, acquiring a plurality of monitoring image data captured by at least one camera from a third party, wherein the plurality of monitoring image data all include vehicle image data.
在上述实现过程中,由于无需直接与至少一个相机进行通信连接,就能获取到多个监控图像数据,通信连接复杂度低,特别地,在至少一个相机的数量较多时,效果更为明显。In the above implementation process, multiple monitoring image data can be acquired without directly communicating with at least one camera, and the complexity of communication connection is low, especially when the number of at least one camera is large, the effect is more obvious.
作为一种实施方式,S11可以按照如下方式实施,获取至少一个相机从当前时刻开始之后的指定时长内拍摄到的多个监控图像数据,以保证获取到的多个监控图像数据中包含最新车辆型号的图像。As an implementation, S11 can be implemented in the following manner, acquiring multiple monitoring image data captured by at least one camera within a specified time period after the current moment, so as to ensure that the acquired multiple monitoring image data contains the latest vehicle model Image.
其中,在本实施例中,最新车辆型号可以为在当前时刻之前的一年内生产的车辆的型号,在其他实施例中,最新型号也可以为在当前时刻之前的半年内生产的车辆的型号。Wherein, in this embodiment, the latest vehicle model may be a model of a vehicle produced within one year before the current moment, and in other embodiments, the latest vehicle model may also be a model of a vehicle produced within half a year before the current moment.
作为一种实施方式,S11可以按照如下方式实施,获取至少一个相机发送的多个监控图像数据。As an implementation manner, S11 may be implemented in the following manner, acquiring a plurality of monitoring image data sent by at least one camera.
其中,图像可以为图片、视频。在上述各个实施例中,若相机的数量为至少两个,则至少两个相机中的各个相机可以分别设置于不同的地理位置或不同角度,以确保能够拍摄到不同多角度的图像;在上述各个实施例中,相机可以为设置于园区内部道路、公司内部停车场等可以拍摄到车辆的图像的相机。Wherein, the image may be a picture or a video. In each of the above-mentioned embodiments, if the number of cameras is at least two, each camera in the at least two cameras can be set at different geographical locations or different angles respectively, so as to ensure that images from different angles can be taken; in the above-mentioned In various embodiments, the camera may be a camera that is installed on roads inside the park, parking lots inside the company, etc., and can capture images of vehicles.
S12:根据所述多个监控图像数据,确定出目标型号的车辆的目标图像组;所述目标图像组包括:所述目标型号的车辆在不同视角下的图像。S12: Determine a target image group of the target model vehicle according to the plurality of monitoring image data; the target image group includes: images of the target model vehicle under different viewing angles.
其中,目标型号是需要被建立三维模型的车辆的型号。Wherein, the target model is the model of the vehicle whose three-dimensional model needs to be established.
在实际实施过程中,S12可以按照如下方式实施,基于预先确定的目标型号,利用预先训练好的车辆型号识别模型,从多个监控图像数据中识别出属于该目标型号的车辆的多个图像,并将目标型号的车辆的多个图像划分为该目标图像组。In the actual implementation process, S12 can be implemented in the following manner, based on a predetermined target model, using a pre-trained vehicle model recognition model to identify a plurality of images of vehicles belonging to the target model from a plurality of monitoring image data, And divide the multiple images of the vehicle of the target model into the target image group.
其中,为了实现对车辆的三维重建,目标图像组中需要包含车辆在多个视角下的图像,具体地,例如目标图像组中需要包括目标型号的车辆的前表面的图像,包含目标型号的车辆的左侧面的图像、包含目标型号的车辆的右侧面的图像、包含目标型号的车辆的后表面的图像、包含目标型号的车辆的车顶的图像。可以理解的是,考虑到车辆的对称性,在目标图像组中已包含部分视角下的图像的情况下,目标图像组可不包含与这部分视角对应的视角下的图像。例如,如果目标图像组中已包含车辆的左侧面的图像,可以不包含车辆的右侧面的图像。Among them, in order to realize the three-dimensional reconstruction of the vehicle, the target image group needs to contain images of the vehicle under multiple viewing angles. Specifically, for example, the target image group needs to include images of the front surface of the target model vehicle, including the target model vehicle The image of the left side of the vehicle, the image of the right side of the vehicle including the target model, the image of the rear surface of the vehicle including the target model, and the image of the roof of the vehicle including the target model. It can be understood that, considering the symmetry of the vehicle, in the case that the target image group already includes images under some viewing angles, the target image group may not include images under the viewing angles corresponding to this part of the viewing angles. For example, if the target image group already includes the image of the left side of the vehicle, the image of the right side of the vehicle may not be included.
具体地,针对车辆的图像,识别该图像属于车辆的哪个表面的图像的具体实施方式可以为:根据车辆的图像,确定出表征图像中的车辆的轮廓的多个二维关键点的信息,继而可以根据二维关键点的信息,确定出该图像具体属于车辆的哪个表面的图像。例如,根据车辆的图像,确定出表征图像中的车辆的轮廓的多个二维关键点包括左车窗关键点、左前轮关键点、左后轮关键点等,则可确定出该图像为左视图。Specifically, for an image of a vehicle, the specific implementation manner of identifying which surface image of the vehicle the image belongs to may be: according to the image of the vehicle, determine the information of a plurality of two-dimensional key points that characterize the outline of the vehicle in the image, and then According to the information of the two-dimensional key points, it can be determined which surface image of the vehicle the image belongs to. For example, according to the image of the vehicle, it is determined that a plurality of two-dimensional key points representing the outline of the vehicle in the image include the key points of the left window, the key points of the left front wheel, the key points of the left rear wheel, etc., then it can be determined that the image is Left view.
其中,针对每个图像,与该图像对应的图像坐标系的原点可以为该图像的中心或者该图像的顶点,图像坐标系的u轴、v轴之一与该图像的上边沿平行,图像坐标系的v轴、v轴中的另一个与该图像的下边沿平行。Wherein, for each image, the origin of the image coordinate system corresponding to the image can be the center of the image or the vertex of the image, one of the u axis and the v axis of the image coordinate system is parallel to the upper edge of the image, and the image coordinate The v-axis of the system, the other of the v-axis is parallel to the lower edge of the image.
作为一种实施方式,S12可以按照如下方式实施,针对多个监控视频数据中的每个车辆的图像,将该车辆的图像输入预先训练好的车辆型号识别模型中,得到车辆型号,并将同一型号的车辆在不同视角下的拍摄到的图像划分为一个图像组,继而从划分得到的至少一个图像组中,依次或者随机地确定一个图像组为目标图像组;或者,考虑到为了构建目标型号的车辆的三维模型,需要保证目标图像组中包括目标型号的车辆的各个表面的图像,因此,从划分得到的至少一个图像组中,将图像数量满足一定要求(例如, 大于等于200张)或角度分布要求(例如在某些特定角度有分布)的图像组为目标图像组。可以理解的是,因为目标图像组中的图像的数量越多,越能保证目标型号的车辆的各个表面的图像能从目标图像组中确定出。As an implementation, S12 can be implemented in the following manner. For the image of each vehicle in a plurality of surveillance video data, input the image of the vehicle into the pre-trained vehicle model recognition model to obtain the vehicle model, and use the same The images captured by the vehicle model under different viewing angles are divided into an image group, and then from at least one of the divided image groups, one image group is sequentially or randomly determined as the target image group; or, considering that in order to construct the target model For the three-dimensional model of the vehicle, it is necessary to ensure that the target image group includes images of each surface of the target model vehicle. Therefore, from at least one image group obtained by dividing, the number of images meets a certain requirement (for example, greater than or equal to 200) or The image group required by the angle distribution (eg, distribution at certain specific angles) is the target image group. It can be understood that, because the larger the number of images in the target image group, the more ensured that the images of the various surfaces of the vehicle of the target model can be determined from the target image group.
作为一种实施方式,S12包括步骤:A1-A3。As an implementation manner, S12 includes steps: A1-A3.
A1:对所述多个监控图像数据中的车辆进行检测,得到多个车辆图像。A1: Detecting vehicles in the multiple monitoring image data to obtain multiple vehicle images.
在实际实施过程中,A1可以按照如下方式实施,针对多个监控图像数据中的每个图像,利用预先训练好的车辆检测模型对该图像中的车辆进行检测,得到车辆检测框的位置。根据车辆检测框的位置即可裁剪出车辆图像。In the actual implementation process, A1 can be implemented in the following manner. For each image in multiple monitoring image data, use a pre-trained vehicle detection model to detect the vehicle in the image to obtain the position of the vehicle detection frame. The vehicle image can be cropped according to the position of the vehicle detection frame.
在获取到各个车辆的车辆图像之后,执行步骤A2。After the vehicle images of each vehicle are acquired, step A2 is performed.
A2:按照车辆的型号对各个车辆的车辆图像进行分组,得到与至少一个型号的车辆一一对应的多至少一个图像组,每个图像组包括对应型号的车辆在不同视角下的图像。A2: Group the vehicle images of each vehicle according to the vehicle model to obtain at least one more image group corresponding to at least one model of vehicle, and each image group includes images of the corresponding model of the vehicle under different viewing angles.
在实际实施过程中,A2可以按照如下方式实施,针对每张车辆图像,利用预先训练好的车辆品牌识别模型,对该张车辆图像的中车辆品牌进行识别,其中,该张车辆图像中可以包括:车辆的车标或者logo;在确定出该张车辆图像中的车辆的品牌之后,利用预先训练好的与该车辆的品牌对应的车辆型号检测模型,对属于同一品牌的车辆的各张图像分组,将同一型号的车辆在各个视角下的车辆图像划分为一个图像组,得到与至少一个型号的车辆一一对应的至少一个图像组。In the actual implementation process, A2 can be implemented in the following manner. For each vehicle image, use the pre-trained vehicle brand recognition model to identify the vehicle brand in the vehicle image, wherein the vehicle image can include : The vehicle logo or logo of the vehicle; after determining the brand of the vehicle in the vehicle image, use the pre-trained vehicle model detection model corresponding to the brand of the vehicle to group the images of vehicles belonging to the same brand , dividing vehicle images of vehicles of the same model under various viewing angles into an image group to obtain at least one image group corresponding to at least one type of vehicle one-to-one.
值得一提的是,针对每个图像组,由于该图像组中的图像的数量越多,越能保证该图像组中涉及的视角越多,因此,该图像组中的图像数量需要满足一定的要求(比如,该图像组中的数量大于等于200张,或者3000张),以保证该图像组中包括对应型号的车辆在各个视角下的车辆图像,继而保证后续能够准确地确定出车辆的三维模型。It is worth mentioning that for each image group, since the larger the number of images in the image group, the more perspectives involved in the image group can be ensured. Therefore, the number of images in the image group needs to meet a certain Requirements (for example, the number of images in the group is greater than or equal to 200, or 3000), to ensure that the group of images includes vehicle images of vehicles of the corresponding model at various angles of view, and then ensure that the three-dimensionality of the vehicle can be accurately determined in the future Model.
作为一种实施方式,A2可以按照如下方式实施,针对每张车辆图像,将该张车辆图像输入到预先训练好的车辆型号识别模型中,得到车辆型号,继而将相同型号的车辆的车辆图像划分为一个图像组,得到与至少一个型号的车辆一一对应的至少一个图像组。As an implementation, A2 can be implemented in the following manner. For each vehicle image, input the vehicle image into a pre-trained vehicle model recognition model to obtain the vehicle model, and then divide the vehicle images of the same model into As an image group, at least one image group corresponding to at least one model of vehicle is obtained.
其中,车辆品牌模型和车辆型号的检测模型的训练方法为本领域熟知技术,因此,在此不再赘述。Wherein, the training methods of the vehicle brand model and the vehicle model detection model are well-known techniques in the art, so details will not be repeated here.
在得到至少一个图像组之后,可以执行步骤A3。After obtaining at least one image group, step A3 may be performed.
A3:从所述至少一个图像组中确定一个图像组作为所述目标图像组。A3: Determine an image group from the at least one image group as the target image group.
在实际实施过程中,A3可以按照如下方式实施,依次从所述至少一个图像组中确定一个图像组作为所述目标图像组。In an actual implementation process, A3 may be implemented in the following manner, one image group is sequentially determined from the at least one image group as the target image group.
作为一种实施方式,A3可以按照如下方式实施,随机地从所述至少一个图像组中 确定一个图像组作为所述目标图像组。As an implementation manner, A3 may be implemented in the following manner, randomly determining an image group from the at least one image group as the target image group.
作为一种实施方式,A3可以按照如下方式实施,将图像数量满足要求的图像组作为目标图像组。As an implementation manner, A3 may be implemented in the following manner, using an image group whose number of images satisfies the requirement as a target image group.
作为一种实施方式,A3可以按照如下方式实施,先确定目标型号,再将目标型号对应的图像组作为所述目标图像组。As an implementation manner, A3 may be implemented in the following manner. First, the target model is determined, and then the image group corresponding to the target model is used as the target image group.
值得一提的是,在从至少一个图像组中确定出一个图像组作为目标图像组,并利用该目标图像组执行完步骤S13-S14之后,可以分别将剩余的图像组中的各个图像组依次作为目标图像组,继而利用被作为目标图像组的图像组执行S13-S14,从而确定多个型号的车辆的三维模型。It is worth mentioning that after determining an image group from at least one image group as the target image group, and using the target image group to perform steps S13-S14, each image group in the remaining image groups can be sequentially As the target image group, S13-S14 is then executed using the image group as the target image group, so as to determine the three-dimensional models of vehicles of multiple models.
在确定出目标图像组之后,执行步骤S13。After the target image group is determined, step S13 is executed.
S13:获取所述目标图像组中各个图像对应的相机的标定结果。S13: Obtain a calibration result of the camera corresponding to each image in the target image group.
其中,相机标定结果可以包括机的内参矩阵K,相机的外参矩阵P,也可以包括由相机的内参矩阵K,相机的外参矩阵P相乘所得到的矩阵S。Wherein, the camera calibration result may include the internal reference matrix K of the camera, the external reference matrix P of the camera, or the matrix S obtained by multiplying the internal reference matrix K of the camera and the external reference matrix P of the camera.
具体的标定方法可采用任何相机标定方法,在此不做限定。相机标定方法主要有三类:第一类相机标定方法需要依靠放置的标定物;第二类相机标定方法主要利用相机的运动信息来对相机进行标定,该方法不需要依靠标定物,但需要控制相机做某些特殊运动,并且该方法不适用于运动信息未知或无法控制相机移动的场景(例如,安防监控场景)。第三类是根据相机拍摄的目标标定。The specific calibration method may use any camera calibration method, which is not limited here. There are three main types of camera calibration methods: the first type of camera calibration method needs to rely on the placed calibration object; the second type of camera calibration method mainly uses the camera’s motion information to calibrate the camera, this method does not need to rely on the calibration object, but needs to control the camera Do some special sports, and this method is not suitable for scenes where the motion information is unknown or the camera movement cannot be controlled (for example, security monitoring scenes). The third category is based on the target calibration captured by the camera.
一个具体的根据相机拍摄的目标进行相机标定的方法如下:A specific method of camera calibration based on the target captured by the camera is as follows:
获取已知目标的三维模型(已知目标的三维模型可以通过例如人工扫描的方法进行重建,三维模型包括多个三维关键点);获取已知目标在世界坐标系中的位姿(因此能够确定多个三维关键点在世界坐标系中的位置以及三维模型在世界坐标系中的姿态);使用待标定的相机对已知目标进行拍摄,得到标定图像;在标定图像中对已知目标进行关键点检测,得到已知目标的每个二维关键点及该二维关键点在图像坐标系中的位置(u,v);根据二维关键点与三维模型中三维关键点的对应关系(针对任一标定图像中目标的任意一个二维关键点,势必能从该目标的三维模型中查找出与该二维关键点的位置对应的三维关键点,针对该目标的三维模型中的任意一个三维关键点,该标定图像中不一定存在与该三维关键点的位置对应的二维关键点;(例如,针对标定图像中的车辆的前车门左上角的二维关键点,势必能从该车辆的三维模型中查找出与前车门左上角的二维关键点对应的三维关键点,该对应的三维关键点为该车辆的三维模型中前车门上角的点)),也就能够确定已知目标的三维模型中与该二维关键点对应的三维关键点在世界坐 标系中的位置(x m,y m,z m);由于相机的标定结果、已知目标的二维关键点及检测出的二维关键点在图像坐标系中的位置(u,v)、三维模型中与二维关键点对应的三维关键点在世界坐标系中的位置(x m,y m,z m)、三维模型在标定图像被拍摄时在世界坐标系中的位姿(位置用(X,Y)表示,姿态可用航向角θ表示)满足如下关系表达式: Obtain the 3D model of the known target (the 3D model of the known target can be reconstructed by methods such as manual scanning, and the 3D model includes multiple 3D key points); obtain the pose of the known target in the world coordinate system (so it can be determined The position of multiple 3D key points in the world coordinate system and the attitude of the 3D model in the world coordinate system); use the camera to be calibrated to shoot the known target to obtain the calibration image; key the known target in the calibration image Point detection, to obtain each two-dimensional key point of the known target and the position (u, v) of the two-dimensional key point in the image coordinate system; according to the corresponding relationship between the two-dimensional key point and the three-dimensional key point in the three-dimensional model (for For any 2D key point of the target in any calibration image, it is bound to be able to find the 3D key point corresponding to the position of the 2D key point from the 3D model of the target. For any 3D key point in the 3D model of the target key point, there may not be a two-dimensional key point corresponding to the position of the three-dimensional key point in the calibration image; In the 3D model, the 3D key point corresponding to the 2D key point in the upper left corner of the front door is found, and the corresponding 3D key point is the point in the upper corner of the front door in the 3D model of the vehicle), and the known target can be determined The position (x m , y m , z m ) of the 3D key point corresponding to the 2D key point in the 3D model of the The position (u, v) of the 2D key point in the image coordinate system, the position (x m , y m , z m ) of the 3D key point corresponding to the 2D key point in the 3D model in the world coordinate system, and the position of the 3D key point in the world coordinate system The pose of the model in the world coordinate system when the calibration image is taken (the position is represented by (X, Y), and the attitude can be represented by the heading angle θ) satisfies the following relational expression:
Figure PCTCN2022098993-appb-000001
其中,λ为常数,
Figure PCTCN2022098993-appb-000002
表征已知目标在世界坐标系中的位姿。
Figure PCTCN2022098993-appb-000001
Among them, λ is a constant,
Figure PCTCN2022098993-appb-000002
Characterize the pose of a known object in the world coordinate system.
可通过上述表达式确定出K和P或二者的乘积。K and P or the product of the two can be determined by the above expressions.
其中,当已知三维模型为车辆的三维模型时,世界坐标系的xoy平面可与车辆所处路面重叠。世界坐标系的原点可以为对应相机的中心在车辆所处路面上的投影点。Wherein, when the known 3D model is the 3D model of the vehicle, the xoy plane of the world coordinate system may overlap with the road where the vehicle is located. The origin of the world coordinate system may be a projection point corresponding to the center of the camera on the road where the vehicle is located.
如此,可通过人工扫描建立少量三维模型后得到相机标定结果,以便后续通过相机标定结果自动化的重建大量三维模型。In this way, a small amount of 3D models can be established through manual scanning to obtain camera calibration results, so that a large number of 3D models can be automatically reconstructed through the camera calibration results.
作为一种实施方式,S13包括:根据所述多个监控图像数据,确定出所述目标图像组中各个图像对应的相机的标定结果。As an implementation manner, S13 includes: determining a calibration result of a camera corresponding to each image in the target image group according to the plurality of surveillance image data.
可以理解的是,多个监控图像数据包含多个型号的车辆,其中有些型号的车辆三维模型未知,有些型号的车辆三维模型是已知的。因此,可根据三维模型已知的车辆对相机进行自动标定。It can be understood that the multiple surveillance image data include multiple models of vehicles, some of which have unknown 3D models of vehicles, and some of which have known 3D models of vehicles. Therefore, automatic calibration of the camera can be performed based on the vehicle whose 3D model is known.
在实际实施过程中,S13可以按照如下方式实施:将目标图像组中各个图像对应的相机依次作为待标定相机;从多个监控图像数据中,查找出待标定相机所拍摄到的监控图像数据,从待标定相机所拍摄到的监控图像数据中确定出存在已知三维模型的目标(例如某型号的车辆三维模型已知,则已知三维模型的目标即为该型号的车辆)的多张图像,根据已知三维模型确定表征该目标的三维关键点的坐标,根据该相机所拍摄到的多张图像,检测出多张图像中表征该目标的二维关键点的坐标,继而根据该目标的每个三维关键点的三维坐标和对应的二维关键点的二维坐标,确定出待标定相机的标定结果。In the actual implementation process, S13 can be implemented in the following manner: use the cameras corresponding to each image in the target image group as the camera to be calibrated sequentially; find out the monitoring image data captured by the camera to be calibrated from a plurality of monitoring image data, From the monitoring image data captured by the camera to be calibrated, it is determined that there are multiple images of a target with a known 3D model (for example, if the 3D model of a certain type of vehicle is known, then the target of the known 3D model is the vehicle of this type) , determine the coordinates of the 3D key points representing the target according to the known 3D model, and detect the coordinates of the 2D key points representing the target in the multiple images according to the multiple images captured by the camera, and then according to the target’s The three-dimensional coordinates of each three-dimensional key point and the two-dimensional coordinates of the corresponding two-dimensional key point determine the calibration result of the camera to be calibrated.
如此,在通过人工扫描等方式确定少量三维模型后,即可实现相机的自动标定以及大量三维模型的自动重建。In this way, after a small number of 3D models are determined by means of manual scanning, automatic camera calibration and automatic reconstruction of a large number of 3D models can be realized.
在对多个相机进行了标定后,则将相机标识与相机标定结果的对应关系进行存储。After the multiple cameras are calibrated, the corresponding relationship between the camera identification and the camera calibration result is stored.
作为一种实施方式,S13可以按照如下方式实施,获取目标图像组中各个图像对应的相机的标识,继而针对每个图像对应的相机的标识,从预先存储的相机标识与相机标定结果的对应关系中,查找出与该图像对应的相机的标识对应的相机标定结果。As an implementation manner, S13 can be implemented in the following manner. Obtain the identifier of the camera corresponding to each image in the target image group, and then, for the identifier of the camera corresponding to each image, obtain the corresponding relationship between the camera identifier and the camera calibration result stored in advance. In , find out the camera calibration result corresponding to the identity of the camera corresponding to the image.
S14:根据所述目标图像组和所述目标图像组中各个图像对应的相机的标定结果,得到所述目标型号的车辆的三维模型。S14: According to the target image group and the calibration results of the cameras corresponding to the images in the target image group, obtain a three-dimensional model of the vehicle of the target model.
其中,目标型号的车辆的三维模型包括:表征目标型号的车辆的轮廓的三维关键点,以及各三维关键点的相对位置关系(例如可以某个三维关键点为原点,与车辆的底盘所在平面平行的平面为坐标平面建立模型坐标系,各三维关键点在模型坐标系中的坐标值即可表示各三维关键点之间的相对位置关系);各个型号的车辆的三维模型所包含的三维关键点的类型和数量可以是一致的,各个三维模型所涉及的三维关键点可以包括:各个车灯、各个车窗、各个车门、各个车轮、车前表面、车后表面、车顶面等的四周上的点等。Wherein, the three-dimensional model of the vehicle of the target model includes: three-dimensional key points representing the profile of the vehicle of the target model, and the relative positional relationship of each three-dimensional key point (for example, a certain three-dimensional key point can be the origin, parallel to the plane where the chassis of the vehicle is located) The plane is the coordinate plane to establish a model coordinate system, and the coordinate values of each three-dimensional key point in the model coordinate system can represent the relative positional relationship between each three-dimensional key point); the three-dimensional key points contained in the three-dimensional model of each type of vehicle The type and quantity of the model can be consistent, and the 3D key points involved in each 3D model can include: each car light, each window, each door, each wheel, the front surface of the car, the rear surface of the car, the roof surface, etc. point and so on.
可以理解的是,模型坐标系可以为三维坐标系,模型坐标系与世界坐标系可以只有平移关系而没有旋转、缩放关系;模型坐标系的原点可以为目标图像组中的图像中的一车辆的车头中心点在所处路面上的投影点,模型坐标系的z轴可以与该车辆所处路面垂直,模型坐标系的x轴可与车辆的中轴线平行,模型坐标系的y轴分别与x轴和z轴垂直。It can be understood that the model coordinate system can be a three-dimensional coordinate system, and the model coordinate system and the world coordinate system can only have a translation relationship without a rotation or scaling relationship; the origin of the model coordinate system can be a vehicle in an image in the target image group The projection point of the center point of the front of the vehicle on the road where the vehicle is located. The z-axis of the model coordinate system can be perpendicular to the road where the vehicle is located. The x-axis of the model coordinate system can be parallel to the central axis of the vehicle. axis is perpendicular to the z-axis.
在上述实现过程中,根据多个监控图像数据,确定出目标型号的车辆在不同视角下的图像,继而根据各个图像和对应的相机的标定结果,得到目标型号的车辆的三维模型,该方法无需人工手持三维扫描设备对车辆进行扫描,通过基于监控图像数据便可实现车辆三维模型的构建,效率更高;其次,由于无需借助三维扫描设备及实体车辆,成本更低,且实现更容易。In the above implementation process, according to multiple monitoring image data, the images of the vehicle of the target model under different viewing angles are determined, and then the three-dimensional model of the vehicle of the target model is obtained according to the calibration results of each image and the corresponding camera. Manual hand-held 3D scanning equipment scans the vehicle, and the construction of the vehicle 3D model can be realized based on the monitoring image data, which is more efficient; secondly, since there is no need for 3D scanning equipment and physical vehicles, the cost is lower and the implementation is easier.
作为一种实施方式,S14包括步骤:B1-B2。As an implementation manner, S14 includes steps: B1-B2.
B1:获取所述目标图像组中各个图像对应的关键点信息组;所述关键点信息组包括:表征对应图像中车辆的轮廓的多个二维关键点在所处图像中的位置。B1: Obtain the key point information group corresponding to each image in the target image group; the key point information group includes: the positions of multiple two-dimensional key points representing the outline of the vehicle in the corresponding image in the image.
在实际实施过程中,B1可以按照如下方式,针对所述目标图像中的各个图像,直接获取前述步骤S12中确定出的表征该图像中的车辆的轮廓的多个二维关键点在该图像中的位置;其中,该图像中的多个二维关键点的位置构成了该图像的关键点信息组。可以理解的是,某些车辆检测或车辆型号检测模型中,进行车辆检测或车辆型号检测的同时进行车辆关键点检测,如此在步骤S12中就已经确定出了多个二维关键点在该图像中的位置。In the actual implementation process, B1 can directly acquire multiple two-dimensional key points that characterize the outline of the vehicle in the image determined in the aforementioned step S12 for each image in the target image in the following manner: position; wherein, the positions of multiple two-dimensional key points in the image constitute the key point information group of the image. It can be understood that in some vehicle detection or vehicle model detection models, the vehicle key point detection is performed while the vehicle detection or vehicle model detection is performed, so that in step S12, multiple two-dimensional key points have been determined in the image position in .
作为一种实施方式,B1可以按照如下方式实施,针对目标图像组中的各个图像,利用预先训练好的车辆关键点提取模型,对该图像进行关键点提取,以从该图像中确定出表征车辆的轮廓的多个二维关键点在该图像中的位置。可以理解的是,某些车辆检测 或车辆型号检测模型中,未进行车辆检测或车辆型号检测的同时进行车辆关键点检测,因此需要额外步骤确定多个二维关键点在该图像中的位置。As an implementation, B1 can be implemented in the following manner. For each image in the target image group, use the pre-trained vehicle key point extraction model to extract the key points of the image, so as to determine the representative vehicle from the image The positions of multiple 2D keypoints of the contour of the image in the image. It can be understood that in some vehicle detection or vehicle model detection models, vehicle key point detection is performed without vehicle detection or vehicle model detection, so additional steps are required to determine the positions of multiple two-dimensional key points in the image.
B2:根据所述目标图像组中各个图像对应的关键点信息组和拍摄所述各个图像的相机的标定结果,得到所述目标型号的车辆的三维模型。B2: Obtain the 3D model of the vehicle of the target model according to the key point information group corresponding to each image in the target image group and the calibration result of the camera that captures each image.
在上述实现过程中,考虑到车辆图像中表征车辆轮廓的多个二维关键点在所处图像中的位置、图像对应的相机标定结果,以及车辆的三维模型这三者存在映射关系,因此,先获取目标图像组中各个图像对应的关键点信息组;关键点信息组包括:表征对应图像中车辆的轮廓的多个二维关键点在所处图像中的位置;继而根据目标图像组中各个图像对应的关键点信息组和拍摄各个图像的相机的标定结果,即可得到目标型号的车辆的三维模型。In the above implementation process, considering the position of multiple two-dimensional key points in the vehicle image that characterize the vehicle outline in the image, the camera calibration results corresponding to the image, and the three-dimensional model of the vehicle, there is a mapping relationship. Therefore, First obtain the key point information group corresponding to each image in the target image group; the key point information group includes: the positions of multiple two-dimensional key points representing the outline of the vehicle in the corresponding image in the image; then according to each of the target image group The key point information group corresponding to the image and the calibration result of the camera that captures each image can obtain the 3D model of the vehicle of the target model.
如此,可通过多角度的海量图像对图像中所包含的车型自动化的进行三维重建,继而提高三维重建效率和准确率。In this way, the three-dimensional reconstruction of the car models contained in the images can be automatically performed through a large number of multi-angle images, and then the efficiency and accuracy of the three-dimensional reconstruction can be improved.
作为一种实施方式,B2包括步骤B21-B23。As an implementation manner, B2 includes steps B21-B23.
B21:确定所述目标型号的车辆的初始三维模型,所述初始三维模型包括:构成三维模型的各个三维关键点,以及各个三维关键点在模型坐标系中的初始坐标。B21: Determine the initial three-dimensional model of the vehicle of the target model, the initial three-dimensional model includes: each three-dimensional key point constituting the three-dimensional model, and the initial coordinates of each three-dimensional key point in the model coordinate system.
其中,模型坐标系为三维坐标系,根据用户需求建立,不做限制。Among them, the model coordinate system is a three-dimensional coordinate system, which is established according to user requirements without limitation.
各个型号的车辆的初始三维模型所包含的三维关键点的类型和数量可以是一致的,各个三维模型所涉及的三维关键点可以包括:各个车灯、各个车门、各个车窗、各个车轮、车前表面、车后表面、车顶面等的四周上的点。The types and quantities of 3D key points contained in the initial 3D models of vehicles of various models may be consistent, and the 3D key points involved in each 3D model may include: each lamp, each door, each window, each wheel, vehicle Points on the periphery of the front surface, rear surface, roof surface, etc.
可以将一个已知模型的车型的三维模型作为初始三维模型,也可以对某类型的车辆指定统一的初始三维模型。例如,属于同一类的车辆的初始三维模型默认可以相同,例如,轿车属于同一类,货车属于一类,非机动车属于一类。A 3D model of a vehicle with a known model can be used as the initial 3D model, or a unified initial 3D model can be specified for a certain type of vehicle. For example, the initial 3D models of vehicles belonging to the same class can be the same by default, for example, cars belong to the same class, trucks belong to the same class, and non-motor vehicles belong to the same class.
B22:针对所述目标图像组中的每个图像,确定该图像中的车辆在该图像被拍摄时在世界坐标系中的初始位姿。B22: For each image in the target image group, determine the initial pose of the vehicle in the image in the world coordinate system when the image is captured.
其中,初始位姿可以根据各个图像中检测出的二维关键点结合相机标定信息确定,也可根据关联时刻拍摄到的图像中相同车辆的位姿确定,还可以为根据经验确定的默认值。Among them, the initial pose can be determined according to the two-dimensional key points detected in each image combined with camera calibration information, or can be determined according to the pose of the same vehicle in the image captured at the associated time, and can also be a default value determined based on experience.
例如,第一图像中检测到的二维关键点为左后轮、左侧车窗、左前玻璃、左后玻璃等上的关键点,且根据拍摄第一图像的相机的相机标定结果可确定相机是平行于道路架设,则可估计一初始位姿。For example, the two-dimensional key points detected in the first image are key points on the left rear wheel, left window, left front glass, left rear glass, etc., and the camera can be determined according to the camera calibration results of the camera that captured the first image. is erected parallel to the road, an initial pose can be estimated.
例如,已知第一图像中车辆A的位姿,第二图像同样包括车辆A,且第二图像与第 一图像拍摄相机相同、拍摄时间间隔小于一定时长(例如3s),则可根据第一图像中车辆A的位姿估计第二图像中车辆位姿,例如可以将第一图像中车辆A的位姿作为第二图像中车辆A的初始位姿。For example, if the pose of vehicle A in the first image is known, the second image also includes vehicle A, and the second image is captured by the same camera as the first image, and the shooting time interval is less than a certain length of time (for example, 3s), then the first The pose of vehicle A in the image is estimated for the pose of the vehicle in the second image, for example, the pose of vehicle A in the first image can be used as the initial pose of vehicle A in the second image.
例如,可将各个图像中的车辆在该图像被拍摄时在世界坐标系中的初始位姿设置为相同的默认值。For example, the initial pose of the vehicle in the world coordinate system when the image was taken in each image can be set to the same default value.
B23:根据所述目标图像组中各个图像对应的关键点信息组和拍摄所述各个图像的相机的标定结果,对各个图像中的车辆的初始位姿,以及所述初始三维模型的三维关键点的初始坐标进行优化,得到所述目标型号的车辆的三维模型。B23: According to the key point information group corresponding to each image in the target image group and the calibration result of the camera that captured each image, the initial pose of the vehicle in each image, and the 3D key points of the initial 3D model The initial coordinates are optimized to obtain the 3D model of the vehicle of the target model.
对各个图像中的车辆的初始位姿,以及所述初始三维模型的三维关键点的初始坐标进行优化的方法可以为光束平差法。The method for optimizing the initial pose of the vehicle in each image and the initial coordinates of the 3D key points of the initial 3D model may be bundle adjustment.
在确定出目标型号的车辆的初始三维模型,以及各个图像中的车辆在世界坐标系中的初始位姿态之后,需要对各个图像中的车辆的初始位姿,以及初始三维模型的三维关键点的初始坐标进行优化,可以对初始位姿和初始模型进行修正,从而得到精确的三维模型。After determining the initial 3D model of the vehicle of the target model and the initial pose of the vehicle in the world coordinate system in each image, it is necessary to determine the initial pose of the vehicle in each image and the 3D key points of the initial 3D model. The initial coordinates are optimized, and the initial pose and initial model can be corrected to obtain an accurate 3D model.
作为一种实施方式,B23包括步骤:B231-B233。As an implementation manner, B23 includes steps: B231-B233.
B231:针对所述目标图像组中的每个图像,根据所述初始三维模型、该图像对应的相机标定结果和初始位姿,确定与该图像对应的初始投影点在图像坐标系中的位置;所述初始投影点包括所述初始三维模型中与该图像的二维关键点对应的三维关键点在该图像对应的初始位姿下投影至所述图像坐标系中的点。B231: For each image in the target image group, according to the initial three-dimensional model, the camera calibration result corresponding to the image, and the initial pose, determine the position of the initial projection point corresponding to the image in the image coordinate system; The initial projection point includes a point where a 3D key point corresponding to a 2D key point of the image in the initial 3D model is projected into the image coordinate system at an initial pose corresponding to the image.
可以理解的是,车辆的初始位姿,也就是车辆所对应的三维模型在世界坐标系下的初始位姿。It can be understood that the initial pose of the vehicle is the initial pose of the 3D model corresponding to the vehicle in the world coordinate system.
实际实施过程中,B231可以按照如下方式实施,针对每个图像,根据该图像所对应的二维关键点信息组,从初始三维模型中,确定出与该图像中的二维关键点所对应的三维关键点的初始坐标,针对确定出的每个三维关键点,将该三维关键点在模型中的初始坐标(x,y,z)、该图像对应的相机标定结果S和初始位姿T,输入至投影表达式
Figure PCTCN2022098993-appb-000003
中,得到该三维关键点对应的初始投影点在该图像对应的图像坐标系中的位置(u',v')。
In the actual implementation process, B231 can be implemented in the following manner. For each image, according to the corresponding two-dimensional key point information group of the image, from the initial three-dimensional model, determine the corresponding two-dimensional key points in the image The initial coordinates of the three-dimensional key points, for each determined three-dimensional key point, the initial coordinates (x, y, z) of the three-dimensional key point in the model, the camera calibration result S and the initial pose T corresponding to the image, Input to projection expression
Figure PCTCN2022098993-appb-000003
In , the position (u', v') of the initial projection point corresponding to the 3D key point in the image coordinate system corresponding to the image is obtained.
B232:针对对应于该图像的各初始投影点和对应的维关键点之间的位置差异,确定对应于该图像的第一损失值。B232: Determine the first loss value corresponding to the image for the position difference between each initial projection point corresponding to the image and the corresponding dimensional key point.
在实际实施过程中,B232可以按照如下方式实施,针对对应于该图像的每个初始 投影点,确定该初始投影点在该图像对应的图像坐标系中的位置(u',v')和该初始投影点对应的三维关键点所对应的二维关键点的位置(u,v)之间的距离,将对应于该图像的各个距离之和确定为第一损失值。In the actual implementation process, B232 can be implemented in the following manner, for each initial projection point corresponding to the image, determine the position (u', v') of the initial projection point in the image coordinate system corresponding to the image and the The distance between the positions (u, v) of the two-dimensional key points corresponding to the three-dimensional key points corresponding to the initial projection point is determined as the first loss value as the sum of the respective distances corresponding to the image.
B233:根据各个图像对应的第一损失值,对所述初始三维模型的三维关键点的初始坐标和对应于各个图像的初始位姿进行优化,直至利用优化后的三维模型和优化后的位姿确定出的新的损失值满足预设条件;所述优化后的三维模型为所述目标型号的车辆的三维模型。B233: According to the first loss value corresponding to each image, optimize the initial coordinates of the 3D key points of the initial 3D model and the initial pose corresponding to each image, until the optimized 3D model and the optimized pose are used The determined new loss value satisfies a preset condition; the optimized three-dimensional model is the three-dimensional model of the vehicle of the target model.
根据各个图像对应的第一损失值,对所述初始三维模型的三维关键点的初始坐标和对应于各个图像的初始位姿进行优化,可以为根据各个图像对应的第一损失值之和,对所述初始三维模型的三维关键点的初始坐标和对应于各个图像的初始位姿进行优化。According to the first loss value corresponding to each image, the initial coordinates of the 3D key points of the initial 3D model and the initial pose corresponding to each image are optimized, which can be based on the sum of the first loss values corresponding to each image, for The initial coordinates of the 3D key points of the initial 3D model and the initial poses corresponding to each image are optimized.
预设条件可以为以下之一:各图像对应的损失值之和呈收敛;各图像对应的损失值之和是历次迭代中的最小值;各图像对应的损失值小于目标损失值;各图像对应的损失值之和小于预设值;迭代次数达到预设次数。The preset condition can be one of the following: the sum of the loss values corresponding to each image converges; the sum of the loss values corresponding to each image is the minimum value in previous iterations; the loss value corresponding to each image is less than the target loss value; the corresponding loss value of each image The sum of the loss values of is less than the preset value; the number of iterations reaches the preset number.
在实际实施过程中,B233可以按照如下方式实施,根据目标图像组中的各个图像对应的第一损失值,对初始三维模型的三维关键点的初始坐标和各个对应于各个图像的初始位姿中的参数进行优化,得到优化后的三维模型和优化后的位姿;针对目标图像组中的每个图像,根据该图像所对应的二维关键点信息组,从优化后的三维模型中,确定出与该图像中的二维关键点所对应的三维关键点的坐标,针对每个三维关键点,将该三维关键点的坐标(x,y,z)、该图像对应的相机标定结果S和优化后的位姿T,输入至投影表达式
Figure PCTCN2022098993-appb-000004
中,得到该三维关键点对应的优化后的投影点在该图像对应的图像坐标系中的位置(u',v'),根据对应于该图像的各优化后的投影点和对应的二维关键点之间的位置差异,确定对应于该图像的第二损失值,在确定各个图像对应的第二损失值之和小于等于预设值时,停止优化,其中,优化后的三维模型为目标型号的车辆的三维模型。
In the actual implementation process, B233 can be implemented in the following manner. According to the first loss value corresponding to each image in the target image group, the initial coordinates of the 3D key points of the initial 3D model and the initial poses corresponding to each image The parameters are optimized to obtain the optimized 3D model and the optimized pose; for each image in the target image group, according to the corresponding 2D key point information group of the image, from the optimized 3D model, determine Get the coordinates of the three-dimensional key points corresponding to the two-dimensional key points in the image, and for each three-dimensional key point, the coordinates (x, y, z) of the three-dimensional key points, the camera calibration result S corresponding to the image and The optimized pose T is input to the projection expression
Figure PCTCN2022098993-appb-000004
, the position (u', v') of the optimized projection point corresponding to the 3D key point in the image coordinate system corresponding to the image is obtained, according to each optimized projection point corresponding to the image and the corresponding 2D The position difference between the key points, determine the second loss value corresponding to the image, and stop the optimization when the sum of the second loss values corresponding to each image is determined to be less than or equal to the preset value, wherein the optimized three-dimensional model is the target A 3D model of the vehicle.
在上述实现过程中,针对每个图像,根据初始三维模型、该图像对应的相机标定结果和初始位姿,确定与该图像对应的初始投影点在图像坐标系中的位置,接着根据初始投影点和对应的二维关键点之间的位置差异,确定对应于该图像的第一损失值,并根据各个图像的第一损失值,对初始三维模型的三维关键点的初始坐标和初始位姿进行优化,直至利用优化后的三维模型和优化后的位姿确定出的新的损失值满足预设条件时,才停止优化,继而可以保证最终得到的三维模型的准确性。In the above implementation process, for each image, according to the initial three-dimensional model, the camera calibration result corresponding to the image, and the initial pose, determine the position of the initial projection point corresponding to the image in the image coordinate system, and then according to the initial projection point and the position difference between the corresponding two-dimensional key points, determine the first loss value corresponding to the image, and according to the first loss value of each image, the initial coordinates and initial pose of the three-dimensional key points of the initial three-dimensional model Optimization, until the new loss value determined by using the optimized 3D model and the optimized pose meets the preset condition, the optimization is stopped, and then the accuracy of the final 3D model can be guaranteed.
作为一种实施方式,在步骤A3之前,所述方法还包括:针对所述至少一个图像组中的每个图像组,将该图像组中属于同一视角下的车辆图像进行去重。As an implementation manner, before step A3, the method further includes: for each image group in the at least one image group, deduplicating the vehicle images belonging to the same viewing angle in the image group.
具体地,针对每个图像组,计算出该图像组中的任意两张车辆图像中的车辆轮廓的相似度;在确定相似度大于预设阈值时,确定这两张车辆图像属于同一个视角下的车辆图像;否则,确定这两张图像属于不同视角下的车辆图像;继而将图像组中属于同一视角下的车辆图像进行去重。其中;在本实施例中,所述预设阈值可以为79%-90%中的任意值。Specifically, for each image group, calculate the similarity of the vehicle outline in any two vehicle images in the image group; when it is determined that the similarity is greater than the preset threshold, it is determined that the two vehicle images belong to the same viewing angle Otherwise, it is determined that the two images belong to vehicle images under different viewing angles; and then the vehicle images belonging to the same viewing angle in the image group are deduplicated. Wherein; in this embodiment, the preset threshold may be any value in 79%-90%.
在上述实现过程中,针对每个图像组,将该图像组中属于同一视角下的车辆图像进行去重,降低利用图像组进行三维重建的复杂度,提高三维重建效率。In the above implementation process, for each image group, the vehicle images belonging to the same viewing angle in the image group are deduplicated, so as to reduce the complexity of 3D reconstruction using the image group and improve the efficiency of 3D reconstruction.
请参照图2,图2是本公开实施例提供的一种三维重建装置200的结构框图。下面将对图2所示的结构框图进行阐述,所示装置包括:Please refer to FIG. 2 . FIG. 2 is a structural block diagram of a three-dimensional reconstruction apparatus 200 provided by an embodiment of the present disclosure. The structural block diagram shown in Figure 2 will be described below, and the shown devices include:
获取单元210,可以被配置成用于获取多个监控图像数据;所述多个监控图像数据均包括车辆的图像数据。The obtaining unit 210 may be configured to obtain a plurality of monitoring image data; the plurality of monitoring image data all include image data of a vehicle.
图像组确定单元220,可以被配置成用于根据所述多个监控图像数据,确定出目标型号的车辆的目标图像组;所述目标图像组包括:所述目标型号的车辆在不同视角下的图像。The image group determining unit 220 may be configured to determine the target image group of the target model vehicle according to the plurality of surveillance image data; the target image group includes: the target model vehicle under different viewing angles image.
标定结果获取单元230,可以被配置成用于获取所述目标图像组中各张图像对应的相机的标定结果。The calibration result obtaining unit 230 may be configured to obtain a calibration result of the camera corresponding to each image in the target image group.
三维模型获得单元240,可以被配置成用于根据所述目标图像组和所述目标图像组中各张图像对应的相机的标定结果,得到所述目标型号的车辆的三维模型。The 3D model obtaining unit 240 may be configured to obtain the 3D model of the vehicle of the target model according to the target image group and the calibration results of the cameras corresponding to the images in the target image group.
作为一种实施方式,所述图像组确定单元220,包括:检测单元,可以被配置成用于对所述多个监控图像数据中的车辆进行检测,得到多个车辆图像;分组单元,可以被配置成用于按照车辆的型号对所述多个车辆图像进行分组,得到与至少一个型号的车辆一一对应的至少一个图像组,每个图像组包括对应型号的车辆在不同视角下的多个车辆图像;选取单元,可以被配置成用于从所述至少一个图像组中确定一个图像组作为所述目标图像组。As an implementation manner, the image group determination unit 220 includes: a detection unit configured to detect vehicles in the plurality of monitoring image data to obtain a plurality of vehicle images; a grouping unit may be It is configured to group the plurality of vehicle images according to the model of the vehicle to obtain at least one image group corresponding to at least one model of the vehicle, and each image group includes a plurality of images of the corresponding model of the vehicle under different viewing angles. The vehicle image; selecting unit may be configured to determine one image group from the at least one image group as the target image group.
作为一种实施方式,所述装置还包括:去重单元,可以被配置成用于针对所述至少一个图像组中的每个图像组,将该图像组中属于同一视角下的车辆图像进行去重。As an implementation manner, the device further includes: a deduplication unit, which may be configured to, for each image group in the at least one image group, deduplicate the vehicle images belonging to the same viewing angle in the image group. Heavy.
作为一种实施方式,所述三维模型获得单元240,包括:信息组获取单元,可以被配置成用于获取所述目标图像组中各个图像对应的关键点信息组;所述关键点信息组包括:表征对应图像中车辆的轮廓的多个二维关键点在所处图像中的位置;三维模型获得 子单元,可以被配置成用于根据所述目标图像组中各个图像对应的关键点信息组和拍摄所述各个图像的相机的标定结果,得到所述目标型号的车辆的三维模型。As an implementation manner, the 3D model obtaining unit 240 includes: an information group obtaining unit configured to obtain a key point information group corresponding to each image in the target image group; the key point information group includes : The position of multiple two-dimensional key points representing the outline of the vehicle in the corresponding image in the image; the three-dimensional model obtaining subunit can be configured to set the key point information corresponding to each image in the target image group and the calibration results of the cameras that capture the respective images to obtain a three-dimensional model of the vehicle of the target model.
作为一种实施方式,所述三维模型获得子单元,包括:初始模型确定单元,可以被配置成用于确定所述目标型号的车辆的初始三维模型,所述初始三维模型包括:构成三维模型的各个三维关键点,以及各个三维关键点在模型坐标系中的初始坐标;初始位姿确定单元,可以被配置成用于针对所述目标图像组中的每个图像,确定该图像中的车辆在该图像被拍摄时在世界坐标系中的初始位姿;优化单元,可以被配置成用于根据所述目标图像组中各个图像对应的关键点信息组和拍摄所述各个图像的相机的标定结果,对各个图像中的车辆的初始位姿,以及所述初始三维模型的三维关键点的初始坐标进行优化,得到所述目标型号的车辆的三维模型。As an implementation manner, the 3D model obtaining subunit includes: an initial model determining unit configured to determine the initial 3D model of the vehicle of the target model, and the initial 3D model includes: Each three-dimensional key point, and the initial coordinates of each three-dimensional key point in the model coordinate system; the initial pose determination unit may be configured to determine the position of the vehicle in the image for each image in the target image group The initial pose of the image in the world coordinate system when the image is taken; the optimization unit may be configured to be based on the key point information group corresponding to each image in the target image group and the calibration result of the camera that captures each image , optimizing the initial pose of the vehicle in each image and the initial coordinates of the 3D key points of the initial 3D model to obtain the 3D model of the vehicle of the target model.
作为一种实施方式,所述优化单元,包括:投影单元,可以被配置成用于针对所述目标图像组中的每个图像,根据所述初始三维模型、该图像对应的相机标定结果和初始位姿,确定与该图像对应的初始投影点在图像坐标系中的位置;所述初始投影点包括所述初始三维模型中与该图像的二维关键点对应的三维关键点在该图像对应的初始位姿下投影至所述图像坐标系中的点;损失确定单元,可以被配置成用于根据对应于该图像的各初始投影点和对应的二维关键点之间的位置差异,确定对应于该图像的第一损失值;优化子单元,用于根据各个图像对应的第一损失值,对所述初始三维模型的三维关键点的初始坐标和对应于各个图像的初始位姿进行优化,直至利用优化后的三维模型和优化后的位姿确定出的新的损失值满足预设条件;所述优化后的三维模型为所述目标型号的车辆的三维模型。As an implementation manner, the optimization unit includes: a projection unit, configured to, for each image in the target image group, according to the initial 3D model, the camera calibration result corresponding to the image, and the initial Pose, determine the position of the initial projection point corresponding to the image in the image coordinate system; the initial projection point includes the three-dimensional key point corresponding to the two-dimensional key point of the image in the initial three-dimensional model in the corresponding position of the image Points projected into the image coordinate system under the initial pose; the loss determination unit may be configured to determine the corresponding The first loss value of the image; the optimization subunit is used to optimize the initial coordinates of the three-dimensional key points of the initial three-dimensional model and the initial pose corresponding to each image according to the first loss value corresponding to each image, Until the new loss value determined by using the optimized three-dimensional model and the optimized pose satisfies the preset condition; the optimized three-dimensional model is the three-dimensional model of the vehicle of the target model.
作为一种实施方式,所述标定结果获取单元230,可以被配置成用于根据所述多个监控图像数据,确定出所述目标图像组中各个图像对应的相机的标定结果。As an implementation manner, the calibration result acquisition unit 230 may be configured to determine, according to the plurality of monitoring image data, the calibration results of the cameras corresponding to the images in the target image group.
本实施例对的各功能单元实现各自功能的过程,请参见上述图1所示实施例中描述的内容,此处不再赘述。Refer to the content described in the embodiment shown in FIG. 1 above for the process of realizing the respective functions of each functional unit in this embodiment, and details are not repeated here.
请参照图3,图3为本公开实施例提供的一种电子设备300的结构示意图,电子设备300可以是个人电脑、平板电脑、智能手机、个人数字助理(personal digital assistant,PDA)等。Please refer to FIG. 3. FIG. 3 is a schematic structural diagram of an electronic device 300 provided by an embodiment of the present disclosure. The electronic device 300 may be a personal computer, a tablet computer, a smart phone, a personal digital assistant (personal digital assistant, PDA) and the like.
电子设备300可以包括:存储器302、处理器301、通信接口303和通信总线,通信总线用于实现这些组件的连接通信。The electronic device 300 may include: a memory 302, a processor 301, a communication interface 303, and a communication bus, and the communication bus is used to implement connection and communication of these components.
所述存储器302用于存储本公开实施例提供的三维重建方法和装置对应的计算程序指令等各种数据,其中,存储器302可以是,但不限于,随机存取存储器,只读存储器 (Read Only Memory,ROM),可编程只读存储器(Programmable Read-Only Memory,PROM),可擦除只读存储器(Erasable Programmable Read-Only Memory,EPROM),电可擦除只读存储器(Electric Erasable Programmable Read-Only Memory,EEPROM)等。The memory 302 is used to store various data such as calculation program instructions corresponding to the three-dimensional reconstruction method and device provided by the embodiments of the present disclosure, wherein the memory 302 may be, but not limited to, random access memory, read only memory (Read Only Memory, ROM), Programmable Read-Only Memory (Programmable Read-Only Memory, PROM), Erasable Programmable Read-Only Memory (EPROM), Electric Erasable Programmable Read-Only Memory (Electric Erasable Programmable Read-Only Memory, Only Memory, EEPROM), etc.
处理器301用于读取并运行存储于存储器中的三维重建方法和装置对应的计算机程序指令,以获取多个监控图像数据;所述多个监控图像数据均包括车辆的图像数据;根据所述多个监控图像数据,确定出目标型号的车辆的目标图像组;所述目标图像组包括:所述目标型号的车辆在不同视角下的图像;获取所述目标图像组中各张图像对应的相机的标定结果;根据所述目标图像组和所述目标图像组中各张图像对应的相机的标定结果,得到所述目标型号的车辆的三维模型。The processor 301 is used to read and run the computer program instructions corresponding to the three-dimensional reconstruction method and device stored in the memory to obtain a plurality of monitoring image data; the plurality of monitoring image data includes image data of vehicles; according to the A plurality of monitoring image data, determine the target image group of the target model vehicle; the target image group includes: the images of the target model vehicle at different angles of view; obtain the corresponding camera of each image in the target image group The calibration result of the target image group and the calibration results of the cameras corresponding to the images in the target image group to obtain a three-dimensional model of the vehicle of the target model.
其中,处理器301可能是一种集成电路芯片,具有信号的处理能力。上述的处理器301可以是通用处理器,包括CPU、网络处理器(Network Processor,NP)等;还可以是数字信号处理器(DSP)、专用集成电路(ASIC)、现场可编程门阵列(FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件。可以实现或者执行本公开实施例中的公开的各方法、步骤及逻辑框图。通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。Wherein, the processor 301 may be an integrated circuit chip, which has a signal processing capability. Above-mentioned processor 301 can be general purpose processor, comprises CPU, network processor (Network Processor, NP) etc.; Can also be digital signal processor (DSP), application specific integrated circuit (ASIC), field programmable gate array (FPGA) ) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components. Various methods, steps and logic block diagrams disclosed in the embodiments of the present disclosure may be implemented or executed. A general-purpose processor may be a microprocessor, or the processor may be any conventional processor, or the like.
通信接口303,用于接收或者发送数据。The communication interface 303 is used for receiving or sending data.
此外,本公开实施例还提供了一种存储介质,在该存储介质中存储有计算机程序,当所述计算机程序在计算机上运行时,使得所述计算机执行本公开任一项实施方式所提供的方法。In addition, an embodiment of the present disclosure also provides a storage medium, in which a computer program is stored, and when the computer program is run on a computer, the computer is made to execute the method provided by any one of the embodiments of the present disclosure. method.
综上所述,本公开各实施例提出的三维重建方法、装置、电子设备及存储介质,根据多个监控图像数据,确定出目标型号的车辆在不同视角下的图像,继而根据各张图像和对应的相机的标定结果,得到目标型号的车辆的三维模型,该方法无需人工手持三维扫描设备对车辆进行扫描,通过基于监控图像数据便可实现车辆三维模型的构建,效率更高。由于无需借助三维扫描设备及实体车辆,成本更低,且实现更容易。To sum up, the 3D reconstruction method, device, electronic device, and storage medium proposed by the various embodiments of the present disclosure determine the images of the target model vehicle under different viewing angles based on a plurality of monitoring image data, and then, according to each image and The 3D model of the vehicle of the target model is obtained from the calibration result of the corresponding camera. This method does not need to manually scan the vehicle with a 3D scanning device, and the construction of the 3D model of the vehicle can be realized based on the monitoring image data, which is more efficient. Since there is no need for 3D scanning equipment and physical vehicles, the cost is lower and the implementation is easier.
在本公开所提供的实施例中,应该理解到,所揭露的装置和方法,也可以通过其它的方式实现。以上所描述的装置实施例仅仅是示意性的,例如,附图中的流程图和框图显示了根据本公开的多个实施例的装置、方法和计算机程序产品的可能实现的体系架构、功能和操作。在这点上,流程图或框图中的每个方框可以代表一个模块、程序段或代码的一部分,所述模块、程序段或代码的一部分包含一个或多个用于实现规定的逻辑功能的可执行指令。也应当注意,在有些作为替换的实现方式中,方框中所标注的功能也可以以不同于附图中所标注的顺序发生。例如,两个连续的方框实际上可以基本并行地执 行,它们有时也可以按相反的顺序执行,这依所涉及的功能而定。也要注意的是,框图和/或流程图中的每个方框、以及框图和/或流程图中的方框的组合,可以用执行规定的功能或动作的专用的基于硬件的装置来实现,或者可以用专用硬件与计算机指令的组合来实现。In the embodiments provided in the present disclosure, it should be understood that the disclosed devices and methods may also be implemented in other ways. The device embodiments described above are only illustrative. For example, the flowcharts and block diagrams in the accompanying drawings show the architecture, functions and possible implementations of devices, methods and computer program products according to multiple embodiments of the present disclosure. operate. In this regard, each block in a flowchart or block diagram may represent a module, program segment, or part of code that includes one or more Executable instructions. It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks in succession may, in fact, be executed substantially concurrently, or they may sometimes be executed in the reverse order, depending upon the functionality involved. It should also be noted that each block of the block diagrams and/or flowchart illustrations, and combinations of blocks in the block diagrams and/or flowchart illustrations, can be implemented by a dedicated hardware-based device that performs the specified function or action , or may be implemented by a combination of dedicated hardware and computer instructions.
另外,在本公开各个实施例中的各功能模块可以集成在一起形成一个独立的部分,也可以是各个模块单独存在,也可以两个或两个以上模块集成形成一个独立的部分。In addition, each functional module in each embodiment of the present disclosure may be integrated together to form an independent part, each module may exist independently, or two or more modules may be integrated to form an independent part.
工业实用性Industrial Applicability
本公开提供了一种三维重建方法、装置、电子设备及存储介质,该方法包括:获取多个监控图像数据;多个监控图像数据均包括车辆的图像数据;根据多个监控图像数据,确定出目标型号的车辆的目标图像组;目标图像组包括目标型号的车辆在不同视角下的图像;获取目标图像组中各张图像对应的相机的标定结果;根据目标图像组和目标图像组中各张图像对应的相机的标定结果,得到目标型号的车辆的三维模型。该方法无需人工手持三维扫描设备对车辆进行扫描,通过基于监控图像数据便可实现车辆三维模型的构建,效率更高;其次,由于无需借助三维扫描设备及实体车辆,成本更低,且实现更容易。The present disclosure provides a three-dimensional reconstruction method, device, electronic equipment, and storage medium. The method includes: acquiring a plurality of monitoring image data; the plurality of monitoring image data includes image data of a vehicle; The target image group of the target model vehicle; the target image group includes the images of the target model vehicle under different viewing angles; obtain the calibration results of the cameras corresponding to each image in the target image group; according to the target image group and each image in the target image group The calibration result of the camera corresponding to the image is used to obtain the 3D model of the vehicle of the target model. This method does not need to scan the vehicle with a hand-held 3D scanning device, and can realize the construction of a 3D model of the vehicle based on the monitoring image data, which is more efficient; easy.
此外,可以理解的是,本公开的三维重建方法、装置、电子设备及存储介质是可以重现的,并且可以用在多种工业应用中。例如,本公开的三维重建方法、装置、电子设备及存储介质可以用于图像处理技术领域。In addition, it can be understood that the three-dimensional reconstruction method, device, electronic device and storage medium of the present disclosure are reproducible and can be used in various industrial applications. For example, the three-dimensional reconstruction method, device, electronic equipment, and storage medium of the present disclosure can be used in the technical field of image processing.

Claims (16)

  1. 一种三维重建方法,其特征在于,所述方法包括:A three-dimensional reconstruction method, characterized in that the method comprises:
    获取多个监控图像数据;所述多个监控图像数据均包括车辆的图像数据;Obtaining a plurality of monitoring image data; the plurality of monitoring image data all include image data of the vehicle;
    根据所述多个监控图像数据,确定出目标型号的车辆的目标图像组;所述目标图像组包括:所述目标型号的车辆在不同视角下的图像;According to the plurality of monitoring image data, a target image group of the target model vehicle is determined; the target image group includes: images of the target model vehicle under different viewing angles;
    获取所述目标图像组中各个图像对应的相机的标定结果;Acquiring calibration results of cameras corresponding to each image in the target image group;
    根据所述目标图像组和所述目标图像组中各个图像对应的相机的标定结果,得到所述目标型号的车辆的三维模型。According to the target image group and the calibration results of the cameras corresponding to the images in the target image group, a three-dimensional model of the vehicle of the target model is obtained.
  2. 根据权利要求1所述的方法,其特征在于,所述根据所述多个监控图像数据,确定出目标型号的车辆的目标图像组,包括:The method according to claim 1, wherein the determining the target image group of the vehicle of the target model according to the plurality of monitoring image data includes:
    对所述多个监控图像数据中的车辆进行检测,得到多个车辆图像;Detecting vehicles in the plurality of monitoring image data to obtain a plurality of vehicle images;
    按照车辆的型号对所述多个车辆图像进行分组,得到与至少一个型号的车辆一一对应的至少一个图像组,每个图像组包括对应型号的车辆在不同视角下的多个车辆图像;Grouping the plurality of vehicle images according to the vehicle type to obtain at least one image group corresponding to at least one type of vehicle, each image group including a plurality of vehicle images of the corresponding type of vehicle under different viewing angles;
    从所述至少一个图像组中确定一个图像组作为所述目标图像组。A group of images is determined from the at least one group of images as the target group of images.
  3. 根据权利要求2所述的方法,其特征在于,在从所述至少一个图像组中确定一个图像组作为所述目标图像组之前,所述方法还包括:The method according to claim 2, wherein before determining an image group from the at least one image group as the target image group, the method further comprises:
    针对所述至少一个图像组中的每个图像组,将该图像组中属于同一视角下的车辆图像进行去重。For each image group in the at least one image group, the vehicle images belonging to the same viewing angle in the image group are deduplicated.
  4. 根据权利要求1-3中任一权项所述的方法,其特征在于,根据所述目标图像组和所述目标图像组中各个图像对应的相机的标定结果,得到所述目标型号的车辆的三维模型,包括:The method according to any one of claims 1-3, characterized in that, according to the target image group and the calibration results of the cameras corresponding to each image in the target image group, the vehicle of the target model is obtained. 3D models, including:
    获取所述目标图像组中各个图像对应的关键点信息组;所述关键点信息组包括:表征对应图像中车辆的轮廓的多个二维关键点在所处图像中的位置;Obtain the key point information group corresponding to each image in the target image group; the key point information group includes: the positions of multiple two-dimensional key points representing the outline of the vehicle in the corresponding image in the image;
    根据所述目标图像组中各个图像对应的关键点信息组和拍摄所述各个图像的相机的标定结果,得到所述目标型号的车辆的三维模型。According to the key point information group corresponding to each image in the target image group and the calibration result of the camera that captures each image, a three-dimensional model of the vehicle of the target model is obtained.
  5. 根据权利要求4所述的方法,其特征在于,根据所述目标图像组中各个图像对应的关键点信息组和拍摄所述各个图像的相机的标定结果,得到所述目标型号的车辆的三维模型,包括:The method according to claim 4, wherein the three-dimensional model of the vehicle of the target model is obtained according to the key point information group corresponding to each image in the target image group and the calibration result of the camera that captures each image ,include:
    确定所述目标型号的车辆的初始三维模型,所述初始三维模型包括:构成三维模型的各个三维关键点,以及各个三维关键点在模型坐标系中的初始坐标;determining the initial three-dimensional model of the vehicle of the target model, the initial three-dimensional model including: each three-dimensional key point constituting the three-dimensional model, and the initial coordinates of each three-dimensional key point in the model coordinate system;
    针对所述目标图像组中的每个图像,确定该图像中的车辆在该图像被拍摄时在世界坐标系中的初始位姿;For each image in the target image group, determine the initial pose of the vehicle in the image in the world coordinate system when the image is taken;
    根据所述目标图像组中各个图像对应的关键点信息组和拍摄所述各个图像的相机的标定结果,对各个图像中的车辆的初始位姿,以及所述初始三维模型的三维关键点的初始坐标进行优化,得到所述目标型号的车辆的三维模型。According to the key point information group corresponding to each image in the target image group and the calibration result of the camera that captured each image, the initial pose of the vehicle in each image and the initial three-dimensional key point of the initial three-dimensional model The coordinates are optimized to obtain a three-dimensional model of the vehicle of the target model.
  6. 根据权利要求5所述的方法,其特征在于,根据所述目标图像组中各个图像对应的关键点信息组和拍摄所述各个图像的相机的标定结果,对各个图像中的车辆的初始位姿,以及所述初始三维模型的三维关键点的初始坐标进行优化,得到所述目标型号的车辆的三维模型,包括:The method according to claim 5, characterized in that, according to the key point information group corresponding to each image in the target image group and the calibration result of the camera that took the each image, the initial pose of the vehicle in each image , and the initial coordinates of the three-dimensional key points of the initial three-dimensional model are optimized to obtain the three-dimensional model of the vehicle of the target model, including:
    针对所述目标图像组中的每个图像,根据所述初始三维模型、该图像对应的相机标定结果和初始位姿,确定与该图像对应的初始投影点在图像坐标系中的位置;所述初始投影点包括所述初始三维模型中与该图像的二维关键点对应的三维关键点在该图像对应的初始位姿下投影至所述图像坐标系中的点;For each image in the target image group, according to the initial three-dimensional model, the camera calibration result corresponding to the image, and the initial pose, determine the position of the initial projection point corresponding to the image in the image coordinate system; The initial projected point includes a point in the initial three-dimensional model where the three-dimensional key point corresponding to the two-dimensional key point of the image is projected into the image coordinate system at the initial pose corresponding to the image;
    根据对应于该图像的各初始投影点和对应的二维关键点之间的位置差异,确定对应于该图像的第一损失值;Determining a first loss value corresponding to the image according to the position difference between each initial projection point corresponding to the image and the corresponding two-dimensional key point;
    根据各个图像对应的第一损失值,对所述初始三维模型的三维关键点的初始坐标和对应于各个图像的初始位姿进行优化,直至利用优化后的三维模型和优化后的位姿确定出的新的损失值满足预设条件;所述优化后的三维模型为所述目标型号的车辆的三维模型。According to the first loss value corresponding to each image, the initial coordinates of the 3D key points of the initial 3D model and the initial pose corresponding to each image are optimized until the optimized 3D model and the optimized pose are used to determine The new loss value satisfies the preset condition; the optimized three-dimensional model is the three-dimensional model of the vehicle of the target model.
  7. 根据权利要求1至6中任一权项所述的方法,其特征在于,获取所述目标图像组中各个图像对应的相机的标定结果,包括:The method according to any one of claims 1 to 6, wherein obtaining the calibration result of the camera corresponding to each image in the target image group includes:
    根据所述多个监控图像数据,确定出所述目标图像组中各个图像对应的相机的标定结果。According to the plurality of monitoring image data, the calibration result of the camera corresponding to each image in the target image group is determined.
  8. 一种三维重建装置,其特征在于,所述装置包括:A three-dimensional reconstruction device, characterized in that the device comprises:
    获取单元,被配置成用于获取多个监控图像数据;所述多个监控图像数据包括车辆的图像数据;an acquisition unit configured to acquire a plurality of monitoring image data; the plurality of monitoring image data includes image data of a vehicle;
    图像组确定单元,被配置成用于根据所述多个监控图像数据,确定出目标型号的车辆的目标图像组;所述目标图像组包括:所述目标型号的车辆在不同视角下的图像;The image group determining unit is configured to determine a target image group of a target model vehicle according to the plurality of monitoring image data; the target image group includes: images of the target model vehicle under different viewing angles;
    标定结果获取单元,被配置成用于获取所述目标图像组中各个图像对应的相机的标定结果;A calibration result acquisition unit configured to acquire a calibration result of the camera corresponding to each image in the target image group;
    三维模型获得单元,被配置成用于根据所述目标图像组和所述目标图像组中各个图 像对应的相机的标定结果,得到所述目标型号的车辆的三维模型。The three-dimensional model obtaining unit is configured to obtain the three-dimensional model of the vehicle of the target model according to the target image group and the calibration result of the camera corresponding to each image in the target image group.
  9. 根据权利要求8所述的三维重建装置,其特征在于,所述图像组确定单元包括:检测单元、分组单元和选取单元,其中,The three-dimensional reconstruction device according to claim 8, wherein the image group determination unit comprises: a detection unit, a grouping unit, and a selection unit, wherein,
    所述检测单元被配置成用于对所述多个监控图像数据中的车辆进行检测,得到多个车辆图像;The detection unit is configured to detect vehicles in the plurality of monitoring image data to obtain a plurality of vehicle images;
    所述分组单元被配置成用于按照车辆的型号对所述多个车辆图像进行分组,得到与至少一个型号的车辆一一对应的至少一个图像组,每个图像组包括对应型号的车辆在不同视角下的多个车辆图像;The grouping unit is configured to group the plurality of vehicle images according to the model of the vehicle to obtain at least one image group corresponding to at least one model of the vehicle, and each image group includes the corresponding model of the vehicle in different Multiple vehicle images from perspective;
    所述选取单元被配置成用于从所述至少一个图像组中确定一个图像组作为所述目标图像组。The selecting unit is configured to determine one image group from the at least one image group as the target image group.
  10. 根据权利要求9所述的三维重建装置,其特征在于,所述三维重建装置还包括:去重单元,所述去重单元被配置成用于:针对所述至少一个图像组中的每个图像组,将该图像组中属于同一视角下的车辆图像进行去重。The three-dimensional reconstruction device according to claim 9, characterized in that the three-dimensional reconstruction device further comprises: a deduplication unit configured to: for each image in the at least one image group group, deduplicating the vehicle images belonging to the same viewing angle in the image group.
  11. 根据权利要求8至10中任一权项所述的三维重建装置,其特征在于,所述三维模型获得单元包括信息组获取单元和三维模型获得子单元,The 3D reconstruction device according to any one of claims 8 to 10, wherein the 3D model obtaining unit includes an information group obtaining unit and a 3D model obtaining subunit,
    其中,所述信息组获取单元被配置成用于:获取所述目标图像组中各个图像对应的关键点信息组;所述关键点信息组包括:表征对应图像中车辆的轮廓的多个二维关键点在所处图像中的位置;Wherein, the information group acquisition unit is configured to: acquire the key point information group corresponding to each image in the target image group; the key point information group includes: a plurality of two-dimensional information representing the outline of the vehicle in the corresponding image The position of the key point in the image;
    其中,所述三维模型获得子单元被配置成用于:根据所述目标图像组中各个图像对应的关键点信息组和拍摄所述各个图像的相机的标定结果,得到所述目标型号的车辆的三维模型。Wherein, the three-dimensional model obtaining subunit is configured to: obtain the vehicle of the target model according to the key point information group corresponding to each image in the target image group and the calibration result of the camera that captures each image. 3D model.
  12. 根据权利要求11所述的三维重建装置,其特征在于,所述三维模型获得子单元,包括:初始模型确定单元、初始位姿确定单元和优化单元,The 3D reconstruction device according to claim 11, wherein the 3D model obtaining subunit comprises: an initial model determination unit, an initial pose determination unit and an optimization unit,
    其中,所述初始模型确定单元被配置成用于:确定所述目标型号的车辆的初始三维模型,所述初始三维模型包括:构成三维模型的各个三维关键点,以及各个三维关键点在模型坐标系中的初始坐标;Wherein, the initial model determination unit is configured to: determine the initial three-dimensional model of the vehicle of the target model, the initial three-dimensional model includes: each three-dimensional key point constituting the three-dimensional model, and each three-dimensional key point in the model coordinates initial coordinates in the system;
    其中,所述初始位姿确定单元被配置成用于:针对所述目标图像组中的每个图像,确定该图像中的车辆在该图像被拍摄时在世界坐标系中的初始位姿;Wherein, the initial pose determining unit is configured to: for each image in the target image group, determine the initial pose of the vehicle in the image in the world coordinate system when the image is captured;
    其中,所述优化单元被配置成用于:根据所述目标图像组中各个图像对应的关键点信息组和拍摄所述各个图像的相机的标定结果,对各个图像中的车辆的初始位姿,以及所述初始三维模型的三维关键点的初始坐标进行优化,得到所述目标型号的车辆的三维 模型。Wherein, the optimization unit is configured to: according to the key point information group corresponding to each image in the target image group and the calibration result of the camera that captures each image, for the initial pose of the vehicle in each image, And the initial coordinates of the 3D key points of the initial 3D model are optimized to obtain the 3D model of the vehicle of the target model.
  13. 根据权利要求12所述的三维重建装置,其特征在于,所述优化单元包括:投影单元、损失确定单元和优化子单元,The three-dimensional reconstruction device according to claim 12, wherein the optimization unit comprises: a projection unit, a loss determination unit, and an optimization subunit,
    其中,所述投影单元被配置成用于:针对所述目标图像组中的每个图像,根据所述初始三维模型、该图像对应的相机标定结果和初始位姿,确定与该图像对应的初始投影点在图像坐标系中的位置;所述初始投影点包括所述初始三维模型中与该图像的二维关键点对应的三维关键点在该图像对应的初始位姿下投影至所述图像坐标系中的点;Wherein, the projection unit is configured to: for each image in the target image group, according to the initial three-dimensional model, the camera calibration result corresponding to the image and the initial pose, determine the initial The position of the projection point in the image coordinate system; the initial projection point includes the projection of the three-dimensional key points corresponding to the two-dimensional key points of the image in the initial three-dimensional model to the image coordinates under the initial pose corresponding to the image points in the system;
    其中,所述损失确定单元被配置成用于:根据对应于该图像的各初始投影点和对应的二维关键点之间的位置差异,确定对应于该图像的第一损失值;Wherein, the loss determination unit is configured to: determine the first loss value corresponding to the image according to the position difference between each initial projection point corresponding to the image and the corresponding two-dimensional key point;
    其中,所述优化子单元被配置成用于:根据各个图像对应的第一损失值,对所述初始三维模型的三维关键点的初始坐标和对应于各个图像的初始位姿进行优化,直至利用优化后的三维模型和优化后的位姿确定出的新的损失值满足预设条件;所述优化后的三维模型为所述目标型号的车辆的三维模型。Wherein, the optimization subunit is configured to: optimize the initial coordinates of the 3D key points of the initial 3D model and the initial poses corresponding to each image according to the first loss value corresponding to each image, until using The optimized three-dimensional model and the new loss value determined by the optimized pose satisfy a preset condition; the optimized three-dimensional model is the three-dimensional model of the vehicle of the target model.
  14. 根据权利要求8至13中任一权项所述的三维重建装置,其特征在于,所述标定结果获取单元被配置成用于:根据所述多个监控图像数据,确定出所述目标图像组中各个图像对应的相机的标定结果。The three-dimensional reconstruction device according to any one of claims 8 to 13, wherein the calibration result acquisition unit is configured to: determine the target image group according to the plurality of monitoring image data The calibration results of the cameras corresponding to each image in .
  15. 一种电子设备,其特征在于,包括存储器以及处理器,所述存储器中存储有计算机程序指令,所述计算机程序指令被所述处理器读取并运行时,执行根据权利要求1至7中任一项所述的方法。An electronic device, characterized by comprising a memory and a processor, wherein computer program instructions are stored in the memory, and when the computer program instructions are read and executed by the processor, any one of the methods described.
  16. 一种存储介质,其特征在于,所述存储介质上存储有计算机程序指令,所述计算机程序指令被计算机读取并运行时,执行根据权利要求1至7中任一项所述的方法。A storage medium, wherein computer program instructions are stored on the storage medium, and when the computer program instructions are read and executed by a computer, the method according to any one of claims 1 to 7 is executed.
PCT/CN2022/098993 2021-08-13 2022-06-15 Three-dimensional reconstruction method and apparatus, and electronic device and storage medium WO2023016082A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202110931999.1 2021-08-13
CN202110931999.1A CN113793413A (en) 2021-08-13 2021-08-13 Three-dimensional reconstruction method and device, electronic equipment and storage medium

Publications (1)

Publication Number Publication Date
WO2023016082A1 true WO2023016082A1 (en) 2023-02-16

Family

ID=79181817

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/098993 WO2023016082A1 (en) 2021-08-13 2022-06-15 Three-dimensional reconstruction method and apparatus, and electronic device and storage medium

Country Status (2)

Country Link
CN (1) CN113793413A (en)
WO (1) WO2023016082A1 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113793413A (en) * 2021-08-13 2021-12-14 北京迈格威科技有限公司 Three-dimensional reconstruction method and device, electronic equipment and storage medium
CN115620094B (en) * 2022-12-19 2023-03-21 南昌虚拟现实研究院股份有限公司 Key point marking method and device, electronic equipment and storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20190141310A1 (en) * 2018-12-28 2019-05-09 Intel Corporation Real-time, three-dimensional vehicle display
CN109816704A (en) * 2019-01-28 2019-05-28 北京百度网讯科技有限公司 The 3 D information obtaining method and device of object
CN112652056A (en) * 2020-12-25 2021-04-13 北京奇艺世纪科技有限公司 3D information display method and device
CN112902874A (en) * 2021-01-19 2021-06-04 中国汽车工程研究院股份有限公司 Image acquisition device and method, image processing method and device and image processing system
CN113793413A (en) * 2021-08-13 2021-12-14 北京迈格威科技有限公司 Three-dimensional reconstruction method and device, electronic equipment and storage medium

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103996220A (en) * 2014-05-26 2014-08-20 江苏大学 Three-dimensional reconstruction method and system in intelligent transportation
CN104346833A (en) * 2014-10-28 2015-02-11 燕山大学 Vehicle restructing algorithm based on monocular vision
CN110517349A (en) * 2019-07-26 2019-11-29 电子科技大学 A kind of 3D vehicle target detection method based on monocular vision and geometrical constraint
CN111145238B (en) * 2019-12-12 2023-09-22 中国科学院深圳先进技术研究院 Three-dimensional reconstruction method and device for monocular endoscopic image and terminal equipment
CN111476798B (en) * 2020-03-20 2023-05-16 上海遨遥人工智能科技有限公司 Vehicle space morphology recognition method and system based on contour constraint
CN111462249B (en) * 2020-04-02 2023-04-18 北京迈格威科技有限公司 Traffic camera calibration method and device
CN112489126B (en) * 2020-12-10 2023-09-19 浙江商汤科技开发有限公司 Vehicle key point information detection method, vehicle control method and device and vehicle
CN112541460B (en) * 2020-12-21 2022-05-13 山东师范大学 Vehicle re-identification method and system
CN112750203B (en) * 2021-01-21 2023-10-31 脸萌有限公司 Model reconstruction method, device, equipment and storage medium

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20190141310A1 (en) * 2018-12-28 2019-05-09 Intel Corporation Real-time, three-dimensional vehicle display
CN109816704A (en) * 2019-01-28 2019-05-28 北京百度网讯科技有限公司 The 3 D information obtaining method and device of object
CN112652056A (en) * 2020-12-25 2021-04-13 北京奇艺世纪科技有限公司 3D information display method and device
CN112902874A (en) * 2021-01-19 2021-06-04 中国汽车工程研究院股份有限公司 Image acquisition device and method, image processing method and device and image processing system
CN113793413A (en) * 2021-08-13 2021-12-14 北京迈格威科技有限公司 Three-dimensional reconstruction method and device, electronic equipment and storage medium

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
TANG XINYAO, SONG HUANSHENG, WANG WEI, ZHANG CHAOYANG, CUI HUA: "3D Vehicle Information Recognition Algorithm of Monocular Camera Based onSelf-Calibration in Traffic Scene", JOURNAL OF COMPUTER-AIDED DESIGN & COMPUTER GRAPHICS, vol. 32, no. 8, 1 August 2020 (2020-08-01), CN , pages 1305 - 1314, XP093034238, ISSN: 1003-9775, DOI: 10.3724/SP.J.1089.2020.18041 *

Also Published As

Publication number Publication date
CN113793413A (en) 2021-12-14

Similar Documents

Publication Publication Date Title
WO2023016082A1 (en) Three-dimensional reconstruction method and apparatus, and electronic device and storage medium
Taneja et al. Image based detection of geometric changes in urban environments
US9483703B2 (en) Online coupled camera pose estimation and dense reconstruction from video
EP3502621B1 (en) Visual localisation
CN110176032B (en) Three-dimensional reconstruction method and device
US9091553B2 (en) Systems and methods for matching scenes using mutual relations between features
WO2018120027A1 (en) Method and apparatus for detecting obstacles
CN110879994A (en) Three-dimensional visual inspection detection method, system and device based on shape attention mechanism
WO2021143935A1 (en) Detection method, device, electronic apparatus, and storage medium
US10872246B2 (en) Vehicle lane detection system
CN111837158A (en) Image processing method and device, shooting device and movable platform
CN111928842B (en) Monocular vision based SLAM positioning method and related device
WO2023284358A1 (en) Camera calibration method and apparatus, electronic device, and storage medium
WO2023016182A1 (en) Pose determination method and apparatus, electronic device, and readable storage medium
CN110673607A (en) Feature point extraction method and device in dynamic scene and terminal equipment
Kume et al. Bundle adjustment using aerial images with two-stage geometric verification
CN116823966A (en) Internal reference calibration method and device for camera, computer equipment and storage medium
CN116051736A (en) Three-dimensional reconstruction method, device, edge equipment and storage medium
CN114898321A (en) Method, device, equipment, medium and system for detecting road travelable area
CN112598736A (en) Map construction based visual positioning method and device
CN116721109B (en) Half global matching method for binocular vision images
CN113240734B (en) Vehicle cross-position judging method, device, equipment and medium based on aerial view
Ding et al. Stereovision based generic obstacle detection and motion estimation using v-stxiel algorithm
CN115147803A (en) Object detection method and device
Fathi et al. Machine vision-based infrastructure as-built documentation using edge points

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22855067

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE