CN114463179A - Closed space multi-camera image splicing method for sound source positioning judgment - Google Patents

Closed space multi-camera image splicing method for sound source positioning judgment

Info

Publication number
CN114463179A
CN114463179A
Authority
CN
China
Prior art keywords
camera
coordinate system
sound source
microphone array
closed space
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210125158.6A
Other languages
Chinese (zh)
Inventor
毛宏宇
牛锋
齐共金
何龙标
王栋杰
赵玉莹
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Aeronautical Engineering Technology Research Center
National Institute of Metrology
Original Assignee
Beijing Aeronautical Engineering Technology Research Center
National Institute of Metrology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Aeronautical Engineering Technology Research Center, National Institute of Metrology filed Critical Beijing Aeronautical Engineering Technology Research Center
Priority to CN202210125158.6A
Publication of CN114463179A
Pending legal-status Critical Current

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T3/00Geometric image transformation in the plane of the image
    • G06T3/40Scaling the whole image or part thereof
    • G06T3/4038Scaling the whole image or part thereof for image mosaicing, i.e. plane images composed of plane sub-images
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T5/00Image enhancement or restoration
    • G06T5/50Image enhancement or restoration by the use of more than one image, e.g. averaging, subtraction
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10016Video; Image sequence
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20212Image combination
    • G06T2207/20221Image fusion; Image merging

Abstract

The invention relates to the technical field of acoustic sound source localization, and in particular to a method for splicing images from multiple cameras in a closed space for sound source localization judgment. The method acquires the spatial information of a noise source through a microphone array, calibrates the spatial relationship between the microphone array and the cameras, and acquires images on the upper, lower, left and right sides of the cameras. Combining the noise source spatial position, the rotation matrix and translation vector between the microphone array coordinate system and the camera coordinate system, and the camera calibration result, the pixel coordinates of the noise source on the image are derived and displayed on the multi-camera image splicing result. The position is transmitted to the nearest camera, and the noise position is displayed in the fused image.

Description

Closed space multi-camera image splicing method for sound source positioning judgment
Technical Field
The invention relates to the technical field of acoustic sound source positioning, in particular to a method for splicing images of multiple cameras in a closed space for sound source positioning judgment.
Background
Noise not only damages human hearing but also adversely affects the physiological functions of the human body; long-term exposure to a strong-noise environment can reduce a person's overall health and induce various chronic diseases.
In the prior art, methods for judging the position of a noise source in a closed space are inadequate: the noise in the closed space cannot be accurately localized or visually inspected, which hinders noise troubleshooting.
Therefore, a multi-camera image splicing method for sound source localization judgment in a closed space needs to be designed, one that computes the coordinate transformation between the sound source position and the image and maps the sound source position onto the spliced multi-camera image plane. Such a method provides a basis for investigating noise sources in closed spaces and has important practical reference value.
Disclosure of Invention
The invention aims to overcome the defects of the prior art and provides a method for splicing images of multiple cameras in a closed space for sound source positioning judgment.
In order to achieve the above object, the present invention provides a method for stitching closed-space multi-camera images for sound source localization determination, comprising the following steps:
s1: acquiring noise sound source space information of a closed space through a microphone array;
the sound source space information is obtained by calculating the distance d between two adjacent microphones and the time difference delta t of signals received by the two adjacent microphones;
calculating L-v-delta t according to the propagation velocity v of sound in the air; l is the sound path difference;
thus, the angle theta of the microphone receiving the noise signal is calculated to be arcsin (L/d), and the position of the noise source is determined;
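A minimal numeric sketch of this angle calculation is given below, assuming a single far-field source and one pair of microphones with a known time difference of arrival; the function name, variable names and example values are illustrative and not taken from the patent.

```python
import numpy as np

def source_angle(delta_t, d, v=343.0):
    """Incidence angle theta (radians) of a far-field source at a microphone pair.

    delta_t : time difference of arrival between the two microphones, in seconds
    d       : spacing between the two adjacent microphones, in metres
    v       : propagation speed of sound in air, in metres per second
    """
    L = v * delta_t                      # sound path difference, L = v * delta_t
    ratio = np.clip(L / d, -1.0, 1.0)    # guard against |L/d| > 1 caused by measurement noise
    return np.arcsin(ratio)              # theta = arcsin(L / d)

# Example: microphones 0.1 m apart, 50 microsecond delay -> roughly 9.9 degrees
theta = source_angle(delta_t=50e-6, d=0.1)
print(np.degrees(theta))
```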
s2: calibrating spatial information between the microphone array and the camera system;
the world coordinate system coordinate of a certain point P in the closed space is PwThe coordinate of P in the camera coordinate system is PcAnd the coordinate of P in the coordinate system of the microphone array is Pm(ii) a Then P isw、Pc、PmThe coordinate transformation relation of (2) is as follows:
Pc=R1Pw+t1
Pm=R2Pw+t2
(R1,t1) As rotation matrix and translation vector between world coordinate system and camera coordinate system, (R)2,t2) A rotation matrix and a translation vector between a world coordinate system and a microphone array coordinate system are obtained; according to the above two formulas, the transformation matrix relation of a certain point P in the closed space between the camera coordinate system and the microphone array coordinate system is calculated as follows:
Figure BDA0003500150310000021
Figure BDA0003500150310000022
the rotation matrix and the translation vector between the microphone array coordinate system and the camera coordinate system are obtained;
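The relation above can be sketched in a few lines, assuming the world-to-camera extrinsics (R1, t1) and the world-to-microphone-array extrinsics (R2, t2) are already available from calibration; the function name is illustrative, not part of the patent.

```python
import numpy as np

def mic_to_camera_extrinsics(R1, t1, R2, t2):
    """Rotation R and translation t mapping microphone-array coordinates to camera coordinates.

    From Pc = R1 @ Pw + t1 and Pm = R2 @ Pw + t2 it follows that
    Pc = R1 @ R2^-1 @ (Pm - t2) + t1, so R = R1 @ R2^-1 and
    t = t1 - R @ t2 (for a rotation matrix, R2^-1 equals R2.T).
    """
    R = R1 @ R2.T
    t = t1 - R @ t2
    return R, t

# Example: identity camera extrinsics, microphone array shifted 1 m along x
R1, t1 = np.eye(3), np.zeros(3)
R2, t2 = np.eye(3), np.array([1.0, 0.0, 0.0])
R, t = mic_to_camera_extrinsics(R1, t1, R2, t2)
Pm = np.array([0.5, 0.0, 2.0])   # sound source in microphone-array coordinates
Pc = R @ Pm + t                  # same point expressed in camera coordinates
```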
s3: an image splicing method based on multiple cameras in a closed space;
the images acquired by the camera are acquired at the upper, lower, left and right sides of the position of the camera;
forming a display interface of the spliced image at the periphery between the camera and the image;
the upper, lower, left and right points of the camera and the 4 connecting lines corresponding to the upper, lower, left and right points of the display interface are the joints of the image splicing area;
a triangular area formed by the display interface, the 4 connecting lines and the camera is represented by a scene from a long scene to a short scene under different scale factors, and the long scene or the short-field scene from the camera is transformed into a unified coordinate system by applying different scale factors;
s4: and deducing pixel coordinates of the noise sound source position on the image according to the spatial position information of the noise sound source acquired in S1 and S2, the rotation matrix and the translation vector between the microphone array coordinate system and the camera coordinate system and the calibration result of the camera, and displaying the pixel coordinates on the image splicing result of the multi-camera image according to S3.
Compared with the prior art, the invention captures images of a closed space with multiple cameras and fuses them into one view. The spatial position of the noise is obtained by a sound source localization method and, together with the information in the images from the corresponding cameras, is used to compute the corresponding transformation matrix, which provides an image marker for the localized sound source: the position is transmitted to the nearest camera, and the noise position is displayed in the fused image.
Drawings
FIG. 1 is a schematic diagram of the system of the present invention.
FIG. 2 is a schematic diagram of the system of the present invention for locating a sound source.
FIG. 3 is a schematic diagram of image stitching with multiple cameras according to the present invention.
Detailed Description
The invention will now be further described with reference to the accompanying drawings.
Referring to fig. 1 to 3, the invention provides a method for splicing images of multiple cameras in a closed space for sound source localization judgment, which comprises the following steps:
s1: acquiring noise sound source space information of a closed space through a microphone array;
the sound source space information is obtained by calculating the distance d between two adjacent microphones and the time difference delta t of signals received by the two adjacent microphones;
calculating L-v-delta t according to the propagation velocity v of sound in the air; l is the sound path difference;
thus, the angle theta of the microphone receiving the noise signal is calculated to be arcsin (L/d), and the position of the noise source is determined;
s2: calibrating spatial information between the microphone array and the camera system;
the world coordinate system coordinate of a certain point P in the closed space is PwThe coordinate of P in the camera coordinate system is PcAnd the coordinate of P in the coordinate system of the microphone array is Pm(ii) a Then P isw、Pc、PmThe coordinate transformation relation of (2) is as follows:
Pc=R1Pw+t1
Pm=R2Pw+t2
(R1,t1) As rotation matrix and translation vector between world coordinate system and camera coordinate system, (R)2,t2) A rotation matrix and a translation vector between a world coordinate system and a microphone array coordinate system are obtained; according to the above two formulas, the transformation matrix relation of a certain point P in the closed space between the camera coordinate system and the microphone array coordinate system is calculated as follows:
Figure BDA0003500150310000041
Figure BDA0003500150310000042
the rotation matrix and the translation vector between the microphone array coordinate system and the camera coordinate system are obtained;
s3: an image splicing method based on multiple cameras in a closed space;
the images acquired by the camera are acquired at the upper, lower, left and right sides of the position of the camera;
forming a display interface of the spliced image at the periphery between the camera and the image;
the upper, lower, left and right points of the camera and the 4 connecting lines corresponding to the upper, lower, left and right points of the display interface are the joints of the image splicing area;
a triangular area formed by the display interface, the 4 connecting lines and the camera is represented by a scene from a long scene to a short scene under different scale factors, and the long scene or the short-field scene from the camera is transformed into a unified coordinate system by applying different scale factors;
s4: and deducing pixel coordinates of the noise sound source position on the image according to the spatial position information of the noise sound source acquired in S1 and S2, the rotation matrix and the translation vector between the microphone array coordinate system and the camera coordinate system and the calibration result of the camera, and displaying the pixel coordinates on the image splicing result of the multi-camera image according to S3.
Example (b):
step 1: as shown in fig. 2, noise source space information of a closed space is acquired by a microphone array, wherein the noise source space information includes a distance d between two adjacent microphones; the time difference Δ t of the signals received by two adjacent microphones is calculated according to the propagation speed v of sound in the air, so that L is v · Δ t; l is the difference of the sound path, so that the angle θ of the noise signal received by the microphone is calculated as arcsin (L/d), thereby determining the position of the noise source.
Step 2: calibrating spatial information between the microphone array and the camera system, if the coordinate of the world coordinate system of a certain point P in the closed space is PwIts coordinate in the camera coordinate system is PcAnd the coordinate of the point in the coordinate system of the microphone array is Pm. Then P isw、Pc、PmThe coordinate transformation relation of (2) is as follows:
Pc=R1Pw+t1
Pm=R2Pw+t2
wherein (R)1,t1) As rotation matrix and translation vector between world coordinate system and camera coordinate system, (R)2,t2) For a rotation matrix and a translation between a world coordinate system and a microphone array coordinate systemAmount of the compound (A). According to the above two formulas, the transformation matrix relation of a certain point P in the closed space between the camera coordinate system and the microphone array coordinate system can be estimated as follows:
Figure BDA0003500150310000061
wherein
Figure BDA0003500150310000062
Representing the rotation matrix and translation vector between the microphone array coordinate system and the camera coordinate system.
Step 3: as shown in FIG. 3, in the multi-camera image splicing method for the closed space, the black square marked 0123 represents the device on which the cameras are installed in the closed space, and the four ellipses represent the images acquired by the front, rear, left and right cameras of the device. Region 4567 represents the display interface of the entire spliced image, and the four diagonal lines 04, 15, 26 and 37 represent the seams between the splicing regions. In the four regions 0154, 1265, 2376 and 0374, the corresponding images are rendered with scenes ranging from far to near under different scale factors; the aim is to apply different scale factors so that scenes far from or near to the camera are transformed into a unified coordinate system, as sketched below.
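The following sketch illustrates the scale-factor idea for one of the four regions. It resamples a single camera image into the trapezoidal region between the central device square and the outer display border, shrinking rows that are farther from the camera; the layout constants and the linear scale model are assumptions for illustration, not taken from the patent.

```python
import numpy as np

def warp_top_region(img, canvas, inner, border):
    """Resample one camera image into the top trapezoid of the spliced-image canvas.

    img    : H x W x 3 camera image (near scene in the bottom rows, far scene at the top)
    canvas : Hc x Wc x 3 canvas of the spliced image, modified in place
    inner  : canvas row of the top edge of the central device square
    border : canvas row of the top display border (border < inner)
    """
    h, w = img.shape[:2]
    wc = canvas.shape[1]
    cx = wc / 2.0
    for yc in range(border, inner):
        alpha = (inner - yc) / float(inner - border)   # 0 near the device square, 1 at the border
        scale = 1.0 - 0.5 * alpha                      # assumed model: far rows are shrunk more
        row = int((1.0 - alpha) * (h - 1))             # source row chosen by distance from camera
        half = scale * wc / 2.0
        for xc in range(int(cx - half), int(cx + half)):
            u = (xc - (cx - half)) / (2.0 * half) * (w - 1)
            canvas[yc, xc] = img[row, int(max(u, 0.0))]

# Example: warp a 120 x 160 camera image into a 480 x 480 canvas
cam_img = np.zeros((120, 160, 3), dtype=np.uint8)
canvas = np.zeros((480, 480, 3), dtype=np.uint8)
warp_top_region(cam_img, canvas, inner=160, border=0)
```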
Step 4: according to the noise source spatial position obtained in step 1, the rotation matrix and translation vector between the microphone array coordinate system and the camera coordinate system obtained in step 2, and the camera calibration result, the pixel coordinates of the noise source position on the image can be derived; these pixel coordinates are then displayed on the multi-camera image splicing result according to step 3.
The above are only preferred embodiments of the present invention and serve merely to help understand the method and its core idea; the scope of the invention is not limited to these embodiments, and all technical solutions embodying the idea of the invention fall within its scope. It should be noted that those skilled in the art may make modifications and refinements without departing from the principle of the invention, and such modifications are likewise considered to be within its scope.
The invention as a whole solves the technical problems that, in the prior art, methods for judging the position of a noise source in a closed space are inadequate and the noise in the closed space cannot be accurately localized or visually inspected, which hinders noise troubleshooting.

Claims (1)

1. A closed space multi-camera image splicing method for sound source positioning judgment is characterized by comprising the following steps:
s1: acquiring noise sound source space information of a closed space through a microphone array;
the sound source space information is obtained by calculating the distance d between two adjacent microphones and the time difference delta t of signals received by the two adjacent microphones;
calculating L-v-delta t according to the propagation velocity v of sound in the air; l is the sound path difference;
thus, the angle theta of the microphone receiving the noise signal is calculated to be arcsin (L/d), and the position of the noise source is determined;
s2: calibrating spatial information between the microphone array and the camera system;
the world coordinate system coordinate of a certain point P in the closed space is PwThe coordinate of the P in the camera coordinate system is PcThe coordinate of the P in the coordinate system of the microphone array is Pm(ii) a Then P isw,Pc、PmThe coordinate transformation relation of (2) is as follows:
Pc=R1Pw+t1
Pm=R2Pw+t2
said (R)1,t1) Is a rotation matrix and a translation vector between a world coordinate system and a camera coordinate system, the (R)2,t2) A rotation matrix and a translation vector between a world coordinate system and a microphone array coordinate system are obtained; according to the above two formulas, the relation of a transformation matrix of a certain point P in the closed space between a camera coordinate system and a microphone array coordinate system is calculated as follows:
Figure FDA0003500150300000011
the above-mentioned
Figure FDA0003500150300000021
The rotation matrix and the translation vector between the microphone array coordinate system and the camera coordinate system are obtained;
s3: an image splicing method based on multiple cameras in a closed space;
the images acquired by the camera are acquired at the upper, lower, left and right sides of the position of the camera;
forming a display interface of a spliced image at the periphery between the camera and the image;
the upper, lower, left and right points of the camera and the 4 connecting lines corresponding to the upper, lower, left and right points of the display interface are the joints of the image splicing area;
a triangular area formed by the display interface, the 4 connecting lines and the camera is represented by a scene from a long scene to a short scene under different scale factors of the image, and the long scene or the short-field scene from the camera is transformed into a unified coordinate system by applying different scale factors;
s4: and deducing pixel coordinates of the noise sound source position on the image according to the spatial position information of the noise sound source acquired in the steps S1 and S2, and the rotation matrix and the translation vector between the microphone array coordinate system and the camera coordinate system and by combining a camera calibration result, and displaying the pixel coordinates on a multi-camera image splicing result according to the step S3.
CN202210125158.6A 2022-02-10 2022-02-10 Closed space multi-camera image splicing method for sound source positioning judgment Pending CN114463179A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210125158.6A CN114463179A (en) 2022-02-10 2022-02-10 Closed space multi-camera image splicing method for sound source positioning judgment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210125158.6A CN114463179A (en) 2022-02-10 2022-02-10 Closed space multi-camera image splicing method for sound source positioning judgment

Publications (1)

Publication Number Publication Date
CN114463179A true CN114463179A (en) 2022-05-10

Family

ID=81412689

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210125158.6A Pending CN114463179A (en) 2022-02-10 2022-02-10 Closed space multi-camera image splicing method for sound source positioning judgment

Country Status (1)

Country Link
CN (1) CN114463179A (en)

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102543099A (en) * 2010-12-24 2012-07-04 索尼公司 Sound information display device, sound information display method, and program
CN102629372A (en) * 2012-02-22 2012-08-08 北京工业大学 360 degree panoramic aerial view generation method used for assisting vehicle driving
CN103871071A (en) * 2014-04-08 2014-06-18 北京经纬恒润科技有限公司 Method for camera external reference calibration for panoramic parking system
US20170099460A1 (en) * 2015-10-05 2017-04-06 Polycom, Inc. Optimizing panoramic image composition
CN109982038A (en) * 2019-03-15 2019-07-05 深圳市沃特沃德股份有限公司 Show the method, apparatus and computer equipment of sound source position

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
温俊杰 et al.: "Research on calibration of the spatial relationship of the sound-image array in a sound field visualization system" (声场可视化系统中声像阵列空间关系标定研究), 《仪器仪表学报》 (Chinese Journal of Scientific Instrument) *

Similar Documents

Publication Publication Date Title
US11928838B2 (en) Calibration system and method to align a 3D virtual scene and a 3D real world for a stereoscopic head-mounted display
US11632536B2 (en) Method and apparatus for generating three-dimensional (3D) road model
CN106679651B (en) Sound localization method, device and electronic equipment
JP4757142B2 (en) Imaging environment calibration method and information processing apparatus
US7657065B2 (en) Marker placement information estimating method and information processing device
JP4245963B2 (en) Method and system for calibrating multiple cameras using a calibration object
US7529387B2 (en) Placement information estimating method and information processing device
WO2017092631A1 (en) Image distortion correction method for fisheye image, and calibration method for fisheye camera
EP1521215A1 (en) Image display method and image display system
KR102036545B1 (en) Digital camera with audio, visual and motion analysis
WO2017098966A1 (en) Point group data acquisition system and method therefor
JP7300948B2 (en) Survey data processing device, survey data processing method, program for survey data processing
JP2004287699A (en) Image composition device and method
JP2019083402A (en) Image processing apparatus, image processing system, image processing method, and program
CN111461963B (en) Fisheye image stitching method and device
CN111243034A (en) Panoramic auxiliary parking calibration method, device, equipment and storage medium
KR101816068B1 (en) Detection System for Vehicle Surroundings and Detection Method for Vehicle Surroundings Using thereof
CN114463179A (en) Closed space multi-camera image splicing method for sound source positioning judgment
CN115327480A (en) Sound source positioning method and system
CN111080718A (en) Camera module calibration method and device for 720-degree environment detection
JP2008140047A (en) Information processing method and information processor
JP7286388B2 (en) Position estimation system, position estimation device, position estimation method, and program
WO2022091811A1 (en) Image processing device, image processing method, and image processing system
TWI793584B (en) Mapping and localization system for automated valet parking and method thereof
CN116309071A (en) Panoramic image seamless splicing method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination