CN106910253B

CN106910253B - Stereo image cloning method based on different camera distances

Info

Publication number: CN106910253B
Application number: CN201710097015.8A
Authority: CN
Inventors: 侯春萍; 刘琦; 阎维青; 陈华; 王致芃
Original assignee: Tianjin University
Current assignee: Tianjin University
Priority date: 2017-02-22
Filing date: 2017-02-22
Publication date: 2020-02-18
Anticipated expiration: 2037-02-22
Also published as: CN106910253A

Abstract

The invention belongs to the technical field of image processing, and provides a stereo image cloning algorithm for inconsistent camera distances. The technical scheme adopted by the invention is that the method for cloning the three-dimensional image based on different camera distances comprises the steps of firstly carrying out image preprocessing, then processing the outline drawn by a user so as to carry out subsequent grid deformation, then adjusting the shape and size of a cloning region of the three-dimensional image by adjusting parallax, and finally fusing the adjusted cloning region of the three-dimensional image into a target image. The invention is mainly applied to the image processing occasion.

Description

Stereo image cloning method based on different camera distances

Technical Field

The invention belongs to the technical field of image processing, and particularly relates to a stereo image cloning algorithm based on different camera distances.

Background

With the increase of 3D media such as 3D movies and televisions, 3D contents have received wide attention from users. Therefore, once a user has convenient access to a 3D camera and a 3D display device, it is desirable to be able to edit a 3D image as if it were editing a 2D image, however, applying the method of 2D image editing directly to a 3D image often does not produce a comfortable and pleasant visual experience.

Currently, there are many mature algorithms for 2D image cloning, however, 3D image cloning faces many new challenges: 1) the 3D image increases depth constraint compared to the 2D image, and therefore, in order to ensure depth consistency in the stereoscopic image cloning process, the size and shape of the cloning region need to be adjusted. 2) For a comfortable visual experience, it is necessary to maintain the correspondence between the left view image and the right view image.

At present, some researchers have conducted studies on stereo image cloning. Lo et al propose a 3D copy and paste method that requires accurate segmentation of the object to be cloned in the source image and pasting it into the destination image. Luo et al propose a seamless stereoscopic image cloning algorithm based on perspective perception deformation, which does not require accurate segmentation of a region to be cloned in a source image and can process cloned objects with fuzzy boundaries and difficult segmentation. However, in the process of stereoscopic image cloning, the above studies assume that the camera pitches of the stereoscopic cameras that capture the source image and the target image are identical. However, in practice, the camera pitches of the stereo cameras for capturing the source image and the target image often do not coincide, and therefore the problem of cloning the stereo images when the camera pitches do not coincide is urgently solved.

Disclosure of Invention

In order to overcome the defects of the prior art, the invention aims to provide a stereo image cloning algorithm aiming at the inconsistent camera spacing. The technical scheme adopted by the invention is that the method for cloning the three-dimensional image based on different camera distances comprises the steps of firstly carrying out image preprocessing, then processing the outline drawn by a user so as to carry out subsequent grid deformation, then adjusting the shape and size of a cloning region of the three-dimensional image by adjusting parallax, and finally fusing the adjusted cloning region of the three-dimensional image into a target image.

The image preprocessing comprises the specific steps of calculating disparity maps of a source image and a target image, and constructing quadrilateral grids for the source image and the target image in the following construction mode: first using a uniform grid M^l＝(V^l,E^l,F^l) A left viewpoint image representing a source image, wherein V^lMesh vertices representing left viewpoint image, E^lMesh edge representing left viewpoint image, F^lThe grid quadrangle of the left viewpoint image is represented, and then the grid M of the right viewpoint image of the source image is constructed according to the disparity map of the source image^r＝(V^r,E^r,F^r) Wherein V is^rMesh vertices representing right viewpoint image, E^rMesh edge, F, representing right view image^rA grid quadrangle representing the right view image.

The contour processing method comprises the specific steps of adjusting any contour drawn by a user by combining an initial uniform grid of a left viewpoint image of a source image to enable a new contour to be a minimum rectangular contour containing the contour drawn by the user, wherein four corner points of the rectangular contour are grid vertexes in the initial uniform grid of the left viewpoint image of the source image, after the new contour of the left viewpoint image of the source image is obtained, the new contour of a right viewpoint image of the source image is obtained according to a source image parallax image, and a contour surrounding area is a source image cloning area.

Iteratively adjusting the shape and size of the stereoscopic image clone region by adjusting the parallax, namely iteratively optimizing deformation,

1) parallax adjustment

The adjustment mode is as follows: adopting the parallax map gradient field of the source image clone region as a guide field G for solving the following formula₀Meanwhile, the parallax of the edge of the adjusted source image cloning region is forced to be the same as the parallax of the edge of the pasting region of the target stereo image, and the parallax image of the adjusted source image cloning region is obtained by solving the minimum value of the weighted Poisson equation

Wherein

Is a weight function formulated for the source image clone region for penalizing a smoother region, D_TA disparity map of the target image is represented,

represents the outline of the source image clone region,

a gradient field of a disparity map representing the adjusted source image clone region;

2) optimized deformation based on quadrilateral grids

2.1 relationship between scene object size and disparity

Assuming that the cameras of the stereo camera are arranged in parallel, and deducing the relation between the depth z of the scene object and the distance b between the cameras according to the geometric relation and the similar triangle theorem as follows:

where b denotes a stereo camera pitch, f denotes a camera focal length, z denotes a depth of a scene object, and d ═ x_L-x_RRepresenting image parallax, x_LAnd x_RCorresponding abscissas of the same scene object in the left viewpoint image and the right viewpoint image are represented;

according to the geometric relation and the similar triangle theorem, the relation between the size of the scene object on the stereo image plane and the depth of the scene object is deduced as

Wherein L represents the size of the scene object, x represents the size of the scene object on the image plane, and the relationship between the size x of the scene object on the stereoscopic image plane and the image parallax d is derived as follows:

as can be seen from the above formula, the size x of the scene object L on the stereo image is related to the image parallax d and the stereo camera distance b, but is not related to the stereo camera focal length f;

for the same scene object L₁With a size x on the source image and on the target image, respectively₁And x₂Parallax is d₁And d₂The distance between the binocular stereo cameras is b₁And b₂The following formula is obtained:

according to the two formulas, when the distances between the cameras are inconsistent, the scaling relationship of the same scene object on the source image and the target image is as follows:

thus, the size of a scene object on a stereo image is related not only to image parallax, but also to the camera pitch of the stereo camera;

2.2 optimization of the deformation

Guiding quadrilateral mesh deformation by adopting weight combination of perspective scaling item, straight line constraint item, parallax consistent item, vertical alignment item and position fixing item, wherein V is^lAnd V^rRepresenting the left viewpoint image mesh vertices and the right viewpoint image mesh vertices before the optimization deformation,

andrepresenting the mesh vertex of the left viewpoint image and the mesh vertex of the right viewpoint image after the optimization deformation;

(1) the perspective zoom term defines a perspective zoom factor of

Wherein the content of the first and second substances,

and

respectively representing the adjusted parallax and the parallax before the adjustment of the clone region of the source image, b_SAnd b_TRespectively representing the space between the stereo cameras for shooting the source images and the space between the stereo cameras for shooting the target images;

for any grid edge of left viewpoint image of source image clone regionWherein, V_i ^lAnd

respectively representing the ith and jth vertex coordinates of the left viewpoint image mesh before the optimization deformation, and the scaling factor thereof is defined as

Wherein the content of the first and second substances,

and

respectively representing perspective scaling factors of the ith and jth vertexes of the left viewpoint image grid before the optimization deformation;

perspective zoom term is defined as

Wherein E is^lAnd E^rRespectively representing the sets of left viewpoint image grid edges and right viewpoint image grid edges of a source image clone area; v_i ^lAnd

respectively representing the ith and jth vertex coordinates, V, of the left viewpoint image mesh before the optimization deformation_i ^rAnd

respectively representing the ith and jth vertex coordinates of the right viewpoint image grid before the optimization deformation;

and

respectively represent the ith and jth vertex coordinates of the left viewpoint image mesh after the optimized transformation,

and

respectively representing the ith and jth vertex coordinates of the optimized and deformed right viewpoint image grid; s_ijA scaling factor representing a grid edge;

(2) linear constraint term

Defining the length ratio of the deformed grid edge to the deformed grid edge of the left viewpoint image of the source image as

Wherein, V_i ^lAnd

respectively represent the ith and jth vertex coordinates of the left viewpoint image mesh before the optimization transformation,

and

respectively representing the ith and jth vertex coordinates of the optimized and deformed left viewpoint image grid;

defining the length ratio of the right viewpoint image of the source image by the same method

The linear constraint term is defined as

Wherein E is^lAnd E^rRespectively representing the sets of left viewpoint image grid edges and right viewpoint image grid edges of a source image clone area, V_i ^lAnd

andrespectively represent the ith and jth vertex coordinates of the left viewpoint image mesh after the optimized transformation,

and

respectively representing the ith and jth vertex coordinates of the optimized and deformed right viewpoint image grid;

(3) disparity coincidence term

For each set of horizontal coordinate components to be calculated

The parallax should be consistent with the adjusted parallax so as to make

Indicating the adjusted disparity. The disparity-consistent term is defined as

Wherein the content of the first and second substances,

represents the horizontal coordinates of the left viewpoint image to be calculated,

representing a horizontal coordinate to be calculated of the right viewpoint image;

(4) vertical alignment item

For each set of vertical coordinate components to be calculatedShould be adjusted to approach 0 to eliminate the effect of vertical parallax, the vertical alignment term is defined as

Wherein the content of the first and second substances,

indicates the vertical coordinate of the left view image to be calculated,

representing a vertical coordinate to be calculated of the right viewpoint image;

(5) position fixing item

To prevent the central position of the source image cloning region from changing, a position fixing item is introduced to fix the left of the source image cloning regionThe center position of the gridIndicates the center position of the initial left cell, where V^lRepresenting the coordinates of the initial grid vertices of the left viewpoint image, V_i ^lThe ith vertex coordinate representing the initial left viewpoint image mesh. The position fixed item is defined as

Wherein the content of the first and second substances,

representing the coordinates of the mesh vertexes of the left viewpoint image after the optimization deformation,

representing the ith vertex coordinate of the optimized and deformed left viewpoint image grid;

the total energy of the optimization deformation is defined as the weighted sum of the above five energy terms:

Φ＝ω_sΦ_s+ω_lΦ_l+ω_dΦ_d+ω_vΦ_v+ω_dΦ_d

wherein phi_s、Φ_l、Φ_d、Φ_vAnd phi_dRespectively representing perspective zoom energy, straight line constraint energy, parallax coincidence energy, vertical alignment energy and position fixing energy. Omega_s、ω_l、ω_d、ω_vAnd ω_dThe weight coefficient corresponding to the energy;

an optimized deformation grid is obtained by solving the minimum of the energy function.

The image fusion method specifically comprises the steps of obtaining an optimized deformed grid of a source image clone region, obtaining the deformed source image clone region through bilinear interpolation, and then respectively fusing a left viewpoint image and a right viewpoint image of the deformed source image clone region to a left viewpoint image and a right viewpoint image of a target image by adopting a plane Poisson fusion method to obtain a final stereo image fusion result.

The invention has the characteristics and beneficial effects that:

the stereo image cloning algorithm based on different camera pitches provided by the invention realizes the cloning of stereo images under the condition that the stereo camera pitch for shooting a source image is different from the stereo camera pitch for shooting a target image, and compared with the algorithm provided by the predecessor, the algorithm provided by the invention realizes the cloning of more accurate stereo images.

Description of the drawings:

figure 1 gives a schematic view of the profile modification.

In the figure, a source image left view image outline and b source image right view image outline.

Fig. 2 shows the relationship between the size of the scene object and the parallax.

In the figure, a is a top view, b is a right side view

Fig. 3 comparison of the present invention and a case where the camera pitch is not considered.

a, a source image left viewpoint image;

b, a left viewpoint image of the target image;

c, obtaining a left viewpoint image of the stereo clone image without considering the camera distance;

d, taking the camera distance into consideration to obtain a left viewpoint image of the stereo clone image;

e, a source image right viewpoint image;

f, a right viewpoint image of the target image;

g, a right viewpoint image of the stereo clone image obtained without considering the camera distance;

h a right viewpoint image of the stereoscopic clone image obtained while considering the camera pitch.

Detailed Description

At present, a plurality of stereo image cloning algorithms exist, the existing algorithms all assume that the stereo camera distances of a shooting source image and a shooting target image are consistent, and actually, the camera distances have great influence on the cloning of the stereo images.

The invention provides a stereo image cloning algorithm based on different camera distances. Firstly, image preprocessing is carried out, then, the contour drawn by a user is processed so as to carry out subsequent grid deformation, then, the shape and the size of a stereoscopic image clone area are adjusted by adjusting parallax, and finally, the adjusted stereoscopic image clone area is fused into a target image. The specific technical scheme of the invention comprises the following steps:

1. image pre-processing

In the image preprocessing stage, disparity maps of a source image and a target image need to be calculated, and quadrilateral grids need to be constructed for the source image and the target image, wherein the construction mode is as follows: first using a uniform grid M^l＝(V^l,E^l,F^l) A left view point diagram representing a source image, wherein V^lMesh vertices representing left viewpoint image, E^lMesh edge representing left viewpoint image, F^lThe grid quadrangle of the left viewpoint image is represented, and then the grid M of the source image right viewpoint image is constructed according to the disparity map of the source image^r＝(V^r,E^r,F^r) In which V is^rMesh vertices representing right viewpoint image, E^rMesh edge, F, representing right view image^rA grid quadrangle representing the right view image.

2. Contour processing

In order to facilitate the operation of the user, the user is allowed to draw an arbitrary outline for the left viewpoint image of the source image. Then, in order to perform optimization iterative adjustment, any contour drawn by the user is adjusted by combining the initial uniform grid of the left viewpoint image of the source image. And enabling the new contour to be a minimum rectangular contour containing the user drawing contour, wherein four corner points of the rectangular contour are all grid vertexes in an initial uniform grid of the left viewpoint image of the source image. And after obtaining the new contour of the left viewpoint image of the source image, obtaining the new contour of the right viewpoint image of the source image according to the source image disparity map. The outline modification schematic is shown in fig. 1. And the contour surrounding area is the source image clone area.

3. Iterative optimization of deformation

3.1 parallax adjustment

In stereo image cloning, source image and target imageThe images are usually shot by different stereo cameras, so that the parallax of a source image clone area and the parallax of a target image are usually different to a certain extent, and the phenomenon of parallax mutation of the cloned stereo image is avoided. The parallax of the source image clone area needs to be adjusted in the following manner: adopting the parallax map gradient field of the source image clone region as a guide field G for solving the following formula₀Meanwhile, the parallax of the edge of the adjusted source image cloning region is forced to be the same as the parallax of the edge of the pasting region of the target stereo image, and the parallax image of the adjusted source image cloning region is obtained by solving the minimum value of the weighted Poisson equation

WhereinIs a weight function formulated for the source image clone region for penalizing a smoother region, D_TA disparity map of the target image is represented,

represents the outline of the source image clone region,

a gradient field of the disparity map representing the adjusted region of the source image clone.

3.2 optimized deformation based on quadrilateral meshes

3.2.1 relationship between scene object size and disparity

Generally, as the distance from the object to the viewer increases, the object becomes gradually smaller. The stereo image has depth information, the size of objects in the stereo image is affected by the perceived depth, and the perceived depth of the stereo image is affected by image parallax. Therefore, the size of an object in a stereoscopic image depends on the parallax of the object. Fig. 2 gives a schematic diagram of the relationship between object size and parallax, assuming here that the stereo camera cameras are arranged in parallel, which is a common arrangement of stereo cameras.

According to the geometric relationship and the theorem of similar triangles in fig. 1(a), it can be derived that the relationship between the depth z of the scene object and the camera distance b is:

where b denotes a stereo camera pitch, f denotes a camera focal length, z denotes a depth of a scene object, and d ═ x_L-x_RRepresenting image parallax, x_LAnd x_RAnd corresponding abscissas of objects of the same scene in the left view image and the right view image are represented.

According to the geometric relationship and the theorem of similar triangles in FIG. 1(b), it can be deduced that the relationship between the size of the scene object on the stereoscopic image plane and the depth of the scene object is

Where L represents the size of the scene object and x represents the size of the scene object on the image plane.

It can thus be derived that the relationship between the size x of the scene object in the stereoscopic image plane and the image disparity d is

As can be seen from the above formula, the size x of the scene object L on the stereo image is related to the image parallax d and the stereo camera pitch b, and is not related to the stereo camera focal length f.

it can be seen that the size of the scene object on the stereoscopic image is related to not only the image parallax but also the camera pitch of the stereoscopic camera.

3.2.2 optimization of deformation

In order to realize the cloning of stereo images under different stereo camera pitches, the invention adopts the weight combination of a perspective zooming item, a straight line constraint item, a parallax consistency item, a vertical alignment item and a position fixing item to guide the deformation of a quadrilateral mesh, wherein V is^lAnd V^rRepresenting the left viewpoint image mesh vertices and the right viewpoint image mesh vertices before the optimization deformation,

and

and representing the vertices of the left viewpoint image mesh and the vertices of the right viewpoint image mesh after the optimization deformation.

(1) Perspective zoom item

According to the introduction 3.1.1, the size of scene objects on a stereo image is dependent on image parallax and stereo camera distance. The invention defines a perspective zoom factor as

Wherein the content of the first and second substances,

andrespectively representing the adjusted parallax and the parallax before the adjustment of the clone region of the source image, b_SAnd b_TRepresenting respectively the stereo camera pitch of the source image and the stereo camera pitch of the target image, V_i ^lThe ith vertex coordinate of the left viewpoint image mesh before the optimization deformation is represented.

For any grid edge of left viewpoint image of source image clone region

Wherein, V_i ^lAndrespectively representing the ith and jth vertex coordinates of the left viewpoint image mesh before the optimization deformation, and the scaling factor thereof is defined as

Wherein the content of the first and second substances,

and

respectively representing perspective scaling factors of ith and jth vertexes of the left viewpoint image mesh before the optimization deformation.

Perspective zoom term is defined as

andrespectively represent the ith and jth vertex coordinates of the left viewpoint image mesh after the optimized transformation,and

respectively representing the ith and jth vertex coordinates of the optimized and deformed right viewpoint image grid; s_ijRepresenting the scaling factor of the grid edge.

(2) Straight line zoom item

In the process of optimizing deformation, in order to avoid structural distortion of a cloned object of a source image and force the grid edge to zoom along the original direction, the length ratio of the deformed grid edge to the deformed grid edge of a left viewpoint image of the source image is defined as

Wherein, V_i ^lAnd

respectively representing left viewpoint image net before optimized deformationThe ith and jth vertex coordinates of the lattice,

and

respectively representing the ith and jth vertex coordinates of the left viewpoint image mesh after the optimized deformation.

The linear constraint term is defined as

respectively representing the ith and jth vertex coordinates of the right viewpoint image grid before the optimization deformation;and

and

respectively representing the ith and jth vertex coordinates of the optimally deformed right viewpoint image mesh.

(3) Disparity coincidence term

For each set of horizontal coordinate components to be calculated

The parallax should be consistent with the adjusted parallax so as to make

Indicating the adjusted disparity. The disparity-consistent term is defined as

Wherein the content of the first and second substances,

indicating the horizontal coordinates of the right viewpoint image to be calculated.

(4) Vertical alignment item

For each set of vertical coordinate components to be calculated

Should be adjusted to approach 0 to eliminate the effect of vertical parallax, the vertical alignment term is defined as

Wherein the content of the first and second substances,

indicates the vertical coordinate of the left view image to be calculated,

indicating the vertical coordinate of the right viewpoint image to be calculated.

(5) Position fixing item

In order to prevent the central position of the source image cloning region from changing, a position fixing item is introduced to fix the central position of a left grid of the source image cloning region, and the central position of the left grid is adjusted

Denotes the center position of the initial left grid, where V^lRepresenting the coordinates of the initial grid vertices of the left viewpoint image, V_i ^lThe ith vertex coordinate representing the initial left viewpoint image mesh. The position fixed item is defined as

Wherein the content of the first and second substances,

and (4) representing the ith vertex coordinate of the left viewpoint image grid after the optimized deformation.

Φ＝ω_sΦ_s+ω_lΦ_l+ω_dΦ_d+ω_vΦ_v+ω_dΦ_d

wherein phi_s、Φ_l、Φ_d、Φ_vAnd phi_dRespectively representing perspective zoom energy, straight line constraint energy, parallax coincidence energy, vertical alignment energy and position fixing energy. Omega_s、ω_l、ω_d、ω_vAnd ω_dThe weight coefficient is corresponding to the energy.

4. Image fusion

After the optimized deformation grid of the source image cloning region is obtained, the deformed source image cloning region is obtained through bilinear interpolation, and then a plane Poisson fusion method is adopted to fuse the left viewpoint image and the right viewpoint image of the deformed source image cloning region to the left viewpoint image and the right viewpoint image of the target image respectively, so that the final three-dimensional image fusion result is obtained.

In order to make the objects, technical solutions and advantages of the present invention more apparent, specific implementation processes of the present invention are described below.

The invention carries out simulation based on MATLAB platform, and the specific technical scheme is divided into the following steps:

1. image pre-processing

In the image preprocessing stage, disparity maps of a source image and a target image need to be calculated, and quadrilateral grids need to be constructed for the source image and the target image, wherein the construction mode is as follows: first using a uniform grid M^l＝(V^l,E^l,F^l) A left viewpoint image representing a source image, wherein V^lMesh vertices representing left viewpoint image, E^lMesh edge representing left viewpoint image, F^lThe grid quadrangle of the left viewpoint image is represented, and then the grid M of the right viewpoint image of the source image is constructed according to the disparity map of the source image^r＝(V^r,E^r,F^r) Wherein V is^rMesh vertices representing right viewpoint image, E^rMesh edge, F, representing right view image^rA grid quadrangle representing the right view image.

2. Contour processing

In order to facilitate the operation of the user, the user is allowed to draw an arbitrary outline for the left viewpoint image of the source image. Then, in order to perform optimization iterative adjustment, any contour drawn by the user is adjusted by combining the initial uniform grid of the left viewpoint image of the source image. And enabling the new contour to be a minimum rectangular contour containing any contour drawn by a user, wherein four corner points of the rectangular contour are all grid vertexes in an initial uniform grid of the left viewpoint image of the source image. And after obtaining the new contour of the left view diagram of the source image, obtaining the new contour of the right view diagram of the source image according to the source image disparity map. The outline modification schematic is shown in fig. 1. And the contour surrounding area is the source image clone area.

3. Iterative optimization of deformation

3.1 parallax adjustment

When the stereo images are cloned, a source image and a target image are often shot by different stereo cameras, so that the parallax of a source image cloning region and the parallax of the target image often have certain difference, and the phenomenon of parallax mutation of the cloned stereo images is avoided. The parallax of the source image clone area needs to be adjusted in the following manner: adopting the parallax map gradient field of the source image clone region as a guide field G for solving the following formula₀Meanwhile, the parallax of the edge of the adjusted source image cloning region is forced to be the same as the parallax of the edge of the pasting region of the target stereo image, and the parallax image of the adjusted source image cloning region is obtained by solving the minimum value of the weighted Poisson equation

Wherein

represents the outline of the source image clone region,

representing disparity maps of adjusted source image clone regionsA gradient field.

3.2 optimized deformation based on quadrilateral meshes

3.2.1 relationship between scene object size and disparity

3.2.2 optimization of deformation

and

representing the sum of the vertices of the left viewpoint image mesh after the optimization deformationAnd (4) right viewpoint image grid vertex.

(1) Perspective zoom item

According to the introduction 3.1.1, the size of the scene object on the stereo image is dependent on the image disparity and the camera pitch of the stereo camera. The invention defines a perspective zoom factor as

Wherein the content of the first and second substances,and

respectively representing the adjusted parallax and the parallax before the adjustment of the clone region of the source image, b_SAnd b_TRepresenting respectively the stereo camera pitch of the source image and the stereo camera pitch of the target image, V_i ^lThe ith vertex coordinate of the left viewpoint image mesh before the optimization deformation is represented.

For any grid edge of left viewpoint image of source image clone region

Wherein, V_i ^lAnd

Wherein the content of the first and second substances,

and

respectively represent the left side before the optimized deformationPerspective scaling factors for the ith and jth vertices of the viewpoint image grid.

Perspective zoom term is defined as

and

(2) Linear constraint term

Wherein, V_i ^lAnd

and

The linear constraint term is defined as

and

andrespectively representing the ith and jth vertex coordinates of the optimally deformed right viewpoint image mesh.

(3) Disparity coincidence term

For each set of horizontal coordinate components to be calculated

The parallax should be consistent with the adjusted parallax, so that

Indicating the adjusted disparity. The disparity-consistent term is defined as

Wherein the content of the first and second substances,represents the horizontal coordinates of the left viewpoint image to be calculated,

(4) Vertical alignment item

For each set of vertical coordinate components to be calculated

Wherein the content of the first and second substances,

indicates the vertical coordinate of the left view image to be calculated,

(5) Position fixing item

In order to prevent the central position of the source image cloning region from changing, a position fixing item is introduced to fix the central position of a left grid of the source image cloning region, and the central position of the left grid is adjustedIndicates the center position of the initial left cell, where V^lRepresenting the coordinates of the initial grid vertices of the left viewpoint image, V_i ^lThe ith vertex coordinate representing the initial left viewpoint image mesh. The position fixed item is defined as

Wherein the content of the first and second substances,

Φ＝ω_sΦ_s+ω_lΦ_l+ω_dΦ_d+ω_vΦ_v+ω_dΦ_d

An optimized deformation grid is obtained by solving the minimum of the energy function. The weighting factors employed herein are each ω_s＝50，ω_l＝50，ω_d＝50，ω_v100 and ω_p1. The total energy Φ is a quadratic optimization function and is programmed to the final solution based on Matlab R2013 b.

4. Image fusion

5. Experiment and evaluation of Effect

In order to test the performance of the stereo image cloning algorithm provided by the invention, the invention displays the result of the stereo image cloning. The input source stereo image is shot by a JVC Erio 3D high-definition video camera with the camera interval of 35mm, and the target stereo image is shot by a FUJIFILM REAL 3D W1 camera with the camera interval of 77 mm.

Fig. 3 shows the comparison of the results of the stereo image cloning method with and without taking the camera pitch into account. In fig. 3, (a) and (e) are a left viewpoint image and a right viewpoint image of a source image, respectively; (b) and (f) a left viewpoint image and a right viewpoint image representing the target image; (c) and (g) a left viewpoint image and a right viewpoint image representing a stereoscopic clone image obtained without considering a camera pitch, that is, assuming that camera pitches of a captured source image and a target image are identical; (d) and (g) a left viewpoint image and a right viewpoint image representing a stereoscopic clone image obtained while considering a camera pitch. The existing algorithms assume that the stereo camera distances of the shooting source image and the shooting target image are consistent, however, in the case, the obtained stereo image cloning result is not accurate. When the camera pitch is not considered, i.e., only the influence of parallax on the shape and size of the source image clone area is considered, the source image clone area is enlarged as shown in (c) and (g). However, the source image clone area is reduced when the camera pitch is considered, as in figures (d) and (g). Therefore, the algorithm provided by the invention can realize more accurate scaling.

Claims

1. A stereoscopic image cloning method based on different camera distances is characterized in that image preprocessing is firstly carried out, then a contour drawn by a user is processed so as to carry out subsequent grid deformation, then the shape and the size of a stereoscopic image cloning region are adjusted by adjusting parallax, finally the adjusted stereoscopic image cloning region is fused into a target image, wherein the shape and the size of the stereoscopic image cloning region are iteratively adjusted by adjusting the parallax, namely iterative optimization deformation is carried out, the specific steps are,

1) parallax adjustment

WhereinIs a clone region of a source imageA domain-specific weighting function for penalizing smooth regions, D_TA disparity map of the target image is represented,

represents the outline of the source image clone region,

2) optimized deformation based on quadrilateral grids

2.1 relationship between scene object size and disparity

2.2 optimization of the deformation

and

representing the mesh vertex of the left viewpoint image and the mesh vertex of the right viewpoint image after the optimization deformation;

(1) the perspective zoom term defines a perspective zoom factor of

Wherein the content of the first and second substances,

and

respectively representing the adjusted parallax and the parallax before the adjustment of the clone region of the source image, b_SAnd b_TRepresenting respectively the stereo camera pitch of the source image and the stereo camera pitch of the target image, V_i ^lRepresenting the ith vertex coordinate of the left viewpoint image mesh before the optimization deformation;

for any grid edge of left viewpoint image of source image clone region

Wherein the content of the first and second substances,

and

perspective zoom term is defined as

and

and

(2) linear constraint term

Wherein, V_i ^lAnd

and

The linear constraint term is defined as

respectively represent the ith and jth vertex coordinates of the left viewpoint image mesh after the optimized transformation,andrespectively representing the ith and jth vertex coordinates of the optimized and deformed right viewpoint image grid;

(3) disparity coincidence term

For each set of horizontal coordinate components to be calculated

The parallax should be consistent with the adjusted parallax so as to make

Indicating the adjusted parallax, and defining the parallax consistent item as

Wherein the content of the first and second substances,

(4) vertical alignment item

For each set of vertical coordinate components to be calculated

Wherein the content of the first and second substances,

indicates the vertical coordinate of the left view image to be calculated,

(5) position fixing item

Indicates the center position of the initial left cell, where V^lRepresenting the coordinates of the initial grid vertices of the left viewpoint image, V_i ^lThe ith vertex coordinate representing the initial left viewpoint image mesh, and the position fixing term is defined as

Wherein the content of the first and second substances,

the total energy of the optimized deformation is defined as the weighted sum of five energy terms, wherein the five energy terms are a perspective scaling term, a straight line constraint term, a parallax consistency term, a vertical alignment term and a position fixing term, and the weighted sum is expressed as:

Φ＝ω_sΦ_s+ω_lΦ_l+ω_dΦ_d+ω_vΦ_v+ω_dΦ_d

wherein phi_s、Φ_l、Φ_d、Φ_vAnd phi_dRespectively representing perspective zoom energy, straight line constraint energy, parallax consistent energy, vertical alignment energy and position fixing energy, omega_s、ω_l、ω_d、ω_vAnd ω_dThe weight coefficient corresponding to the energy; and obtaining the optimized deformation grid by solving the minimum value of the total energy function.

2. The method for cloning stereo images based on different camera pitches as claimed in claim 1, wherein the image preprocessing comprises the specific steps of calculating disparity maps of the source image and the target image, and constructing quadrilateral meshes for the source image and the target image simultaneously in the following manner: first using a uniform grid M^l＝(V^l,E^l,F^l) A left viewpoint image representing a source image, wherein V^lMesh vertices representing left viewpoint image, E^lMesh edge representing left viewpoint image, F^lA grid quadrangle representing the left viewpoint image is constructed, and then a grid Mr (V) of the right viewpoint image of the source image is constructed according to the disparity map of the source image^r,E^r,F^r) Wherein V is^rMesh vertices representing right viewpoint image, E^rMesh edge, F, representing right view image^rA grid quadrangle representing the right view image.

3. The stereoscopic image cloning method based on different camera pitches as claimed in claim 1, wherein the contour processing specific steps are that any contour drawn by a user is adjusted by combining with an initial uniform grid of a left viewpoint image of a source image, so that a new contour is a minimum rectangular contour containing the contour drawn by the user, wherein four corner points of the rectangular contour are grid vertices in the initial uniform grid of the left viewpoint image of the source image, after the new contour of the left viewpoint image of the source image is obtained, the new contour of the right viewpoint image of the source image is obtained according to a source image disparity map, and a contour surrounding area is a source image cloning area.

4. The method for cloning stereo images based on different camera distances as claimed in claim 1, wherein the image fusion specific steps are, after obtaining the optimized deformed grid of the source image cloning region, obtaining the deformed source image cloning region by bilinear interpolation, and then adopting a planar Poisson fusion method to fuse the left viewpoint image and the right viewpoint image of the deformed source image cloning region to the left viewpoint image and the right viewpoint image of the target image respectively to obtain the final stereo image fusion result.