CN114554154A - Audio and video pickup position selection method and system, audio and video acquisition terminal and storage medium - Google Patents

Audio and video pickup position selection method and system, audio and video acquisition terminal and storage medium Download PDF

Info

Publication number
CN114554154A
CN114554154A CN202210174618.4A CN202210174618A CN114554154A CN 114554154 A CN114554154 A CN 114554154A CN 202210174618 A CN202210174618 A CN 202210174618A CN 114554154 A CN114554154 A CN 114554154A
Authority
CN
China
Prior art keywords
audio
camera
video
pickup
picture
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210174618.4A
Other languages
Chinese (zh)
Inventor
仇波
余求是
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shibang Communication Co ltd
Original Assignee
Shibang Communication Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shibang Communication Co ltd filed Critical Shibang Communication Co ltd
Priority to CN202210174618.4A priority Critical patent/CN114554154A/en
Publication of CN114554154A publication Critical patent/CN114554154A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/18Closed-circuit television [CCTV] systems, i.e. systems in which the video signal is not broadcast
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/60Control of cameras or camera modules
    • H04N23/67Focus control based on electronic image sensor signals
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00Details of transducers, loudspeakers or microphones
    • H04R1/20Arrangements for obtaining desired frequency or directional characteristics
    • H04R1/32Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only
    • H04R1/326Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only for microphones

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Otolaryngology (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Studio Devices (AREA)

Abstract

The invention discloses a method and a system for selecting the position of an audio and video pickup, an audio and video acquisition terminal and a storage medium, wherein the method comprises the steps of acquiring an initial audio and video picture; establishing a three-dimensional coordinate system, and acquiring coordinates corresponding to each pixel point in an initial audio and video picture in the three-dimensional coordinate system and coordinates corresponding to the position of each sound pickup in the three-dimensional coordinate system; defining a virtual camera, and acquiring the position coordinate of the virtual camera in a three-dimensional coordinate system after the camera is zoomed; and acquiring the position coordinate of the sound pickup matched with the shot picture after the camera zooms. According to the invention, when the camera zooms, the audio signal of the sound pickup in the adaptive three-dimensional space can be selected, and the audio signal of the sound pickup can be better selected to be matched with the audio and video, so that the audio playing effect of the audio and video is improved.

Description

Audio and video pickup position selection method and system, audio and video acquisition terminal and storage medium
Technical Field
The invention relates to the technical field of audio and video acquisition, in particular to a method and a system for selecting the position of an audio and video pickup, an audio and video acquisition terminal and a storage medium.
Background
When the audio and video are actually collected, the site environment represented by the audio and video picture is planar, and the position of the sound pickup is a two-dimensional coordinate taking the audio and video picture as a reference. When the video is zoomed, the two-dimensional distance between each sound pick-up and the central point of the audio and video picture is calculated to select the sound collected by the corresponding sound pick-up as the audio content matched with the audio and video picture.
However, the sound pick-up devices of the actual video monitoring field environment are arranged in all directions of the space where the camera is located, so that if a plurality of sound pick-up devices are distributed along the normal direction of the audio and video picture shot by the camera, a proper sound pick-up device cannot be obtained by the method.
Therefore, a method for selecting the position of the audio/video pickup is needed to solve the above-mentioned problems.
Disclosure of Invention
Based on this, it is necessary to provide a method and a system for selecting a position of an audio/video pickup, an audio/video capture terminal, and a storage medium, which can better select an audio signal of a suitable pickup to match with audio/video after zooming a camera, so as to improve an audio playing effect of audio/video, in view of the defects of the prior art.
In order to solve the technical problems, the technical scheme adopted by the invention is as follows:
in a first aspect, a method for selecting a position of an audio/video pickup is provided, which includes the following steps:
acquiring an initial audio and video picture; the content of the picture presented in the initial audio/video picture is a scene between a camera shot by the camera at an initial focal length and a plane where a preset monitoring boundary is located, and the plane where the preset monitoring boundary is located is vertical to the axial direction of a lens of the camera;
establishing a three-dimensional coordinate system, and acquiring coordinates corresponding to each pixel point in an initial audio and video picture in the three-dimensional coordinate system and coordinates corresponding to the position of each sound pickup in the three-dimensional coordinate system;
defining a virtual camera, and acquiring the position coordinate of the virtual camera in a three-dimensional coordinate system after the camera is zoomed;
and acquiring a pickup position coordinate matched with a shot picture after the camera is zoomed.
In a second aspect, there is provided an audio-video pick-up location selection system comprising:
the audio/video picture acquisition module is used for acquiring an initial audio/video picture;
the three-dimensional coordinate system establishing module is used for establishing a three-dimensional coordinate system and acquiring coordinates corresponding to each pixel point in an initial audio and video picture in the three-dimensional coordinate system and coordinates corresponding to the position of each sound pickup in the three-dimensional coordinate system;
the virtual camera module is used for defining a virtual camera and acquiring the position coordinate of the virtual camera in a three-dimensional coordinate system after the camera is zoomed;
and the pickup position acquisition module is used for acquiring pickup position coordinates matched with the shot picture after the camera zooms.
In a third aspect, an audio/video collecting terminal is provided, which includes a memory and a processor, where the memory stores a computer program, and the processor implements the method for selecting the position of the audio/video pickup when executing the computer program.
In a fourth aspect, there is provided a storage medium storing a computer program comprising program instructions which, when executed, implement the audio/video pick-up position selection method described above.
In summary, the audio and video pickup position selection method and system, the audio and video acquisition terminal and the storage medium of the invention can select the audio signal of the pickup in the adaptive three-dimensional space when the camera zooms, and can better select the appropriate audio signal of the pickup to match with the audio and video, so as to improve the audio playing effect of the audio and video.
Drawings
Fig. 1 is a schematic flowchart of a first method for selecting a position of an audio/video pickup according to an embodiment of the present invention;
fig. 2 is a block diagram of a structure of an audio/video pickup position selection system according to an embodiment of the present invention;
fig. 3 is a block diagram of an internal structure of an audio/video acquisition terminal according to an embodiment of the present invention;
fig. 4 is a schematic diagram illustrating a relative position relationship between a camera and a sound pickup in the y-axis direction according to an embodiment of the present invention;
fig. 5 is a schematic diagram illustrating a relative position relationship between a camera and a sound pickup in an x-axis direction according to an embodiment of the present invention;
fig. 6 is a schematic diagram of the position of a sound pickup in an initial audio/video picture according to an embodiment of the present invention;
fig. 7 is a schematic diagram illustrating a relative position relationship between a virtual camera and a camera entity in a z-axis direction according to an embodiment of the present invention;
fig. 8 is a schematic diagram illustrating a principle of a similar triangle formed by a virtual camera and a camera entity when the camera performs k-times zooming according to an embodiment of the present invention.
Detailed Description
For further understanding of the features and technical means of the present invention, as well as the specific objects and functions attained by the present invention, the present invention will be described in further detail with reference to the accompanying drawings and detailed description.
The method and the system for selecting the position of the audio and video pickup, the audio and video acquisition terminal and the storage medium provided by the embodiment of the invention can be particularly applied to the audio and video acquisition terminal, and can better select a proper pickup to be matched with an audio and video picture in the normal direction of a picture shot by a camera, so that the signal of the pickup is obtained, and an optimal audio and video matching signal is formed. The following describes in detail a method and a system for selecting a position of an audio/video pickup, an audio/video capture terminal, and a storage medium according to the present embodiment.
Fig. 1 is a schematic flow chart of a first method for selecting a position of an audio/video pickup according to an embodiment of the present invention, and as shown in fig. 1, the method for selecting a position of an audio/video pickup includes steps S110 to S140, which are specifically as follows:
step S110, obtaining an initial audio and video picture; the content of the image presented in the initial audio/video image is the image between the camera shot by the camera with the initial focal length and the plane where the preset monitoring boundary is located, and the plane where the preset monitoring boundary is located is perpendicular to the axial direction of the lens of the camera.
Step S120, establishing a three-dimensional coordinate system and obtaining three dimensionsCoordinates (x) corresponding to each pixel point in initial audio/video picture under coordinate system0,y0,d0) And coordinates (x) corresponding to the position of each sound pickup in the three-dimensional coordinate system1,y1,z1),(x2,y2,z2),……(xN,yN,zN) (ii) a Wherein d is0The distance between a monitoring boundary and the camera is preset in the axial direction of a lens of the camera, the normal direction of an initial audio and video picture is parallel to the z axis of a three-dimensional coordinate system, the edge line of the initial audio and video picture is parallel to the x axis and the y axis of the three-dimensional coordinate system respectively, the position of the camera is set as the origin of the three-dimensional coordinate system, namely the initial position coordinate of the camera is (0, 0, 0), and the relative position relationship between the camera and the sound pick-up can refer to fig. 4-6.
Step S130, defining a virtual camera, and acquiring the position coordinate (x) of the virtual camera in a three-dimensional coordinate system after the camera is zoomed0,y0,z0);
The method of step S130 specifically includes:
by the formula
Figure BDA0003518581500000041
Acquiring the position coordinate (x) of the virtual camera in the three-dimensional coordinate system after the camera zooms0,y0,z0) (ii) a The virtual camera position is obtained after the camera entity position moves along the z-axis direction, in this embodiment, the virtual camera position is on the z-axis of the three-dimensional coordinate system, and dxExpressed as the distance between the virtual camera and the preset monitoring boundary in the axial direction of the lens, k is the zoom multiple of the lens of the camera, x0、y0Respectively carrying out zoom operation on the camera and then shooting the coordinate values of the center point of the picture in the x axis and the y axis of the initial audio and video picture; d0To preset values by adjusting the parameter d0The size of the camera is adjusted to correct the audio and video picture effect after the camera zooms; in this example, d0The range of (1) is 3 to 50 m.
Specifically, when the camera is zoomed, the zoom is takenIn this embodiment, when the camera performs k times of zooming, the area of the zoomed photographed image is enlarged by k times of the area of the initial audio/video image2The effect of the image is similar to the effect of a shot image after the camera is pushed forwards along the axial distance of the lens by the initial focal length; setting a virtual camera, wherein the distance between the virtual camera and the preset monitoring boundary is closer, and the distance between the virtual camera and the preset monitoring boundary is set as dxThen z is0=d0-dxThat is, the coordinate of the virtual camera in the three-dimensional coordinate system is (x)0,y0,z0)=(x0,y0,d0-dx)。
As shown in fig. 7 and 8, the size of the initial audio/video picture shot by the camera is 1, and when the initial audio/video picture is enlarged by k2When the time is doubled, the scene display range in the zoomed audio and video picture is actually 1/k of the scene display range in the initial audio and video picture2(ii) a Assuming that the visual angle of the camera is unchanged, k is amplified when the initial audio/video picture is enlarged2During doubling, the distance d between the virtual camera and the preset monitoring boundary is calculated according to the similar triangle by taking the preset monitoring boundary as reference, which is equivalent to the audio and video picture shot after the camera is pushed forwardsxSatisfy the formula
Figure BDA0003518581500000051
Step S140, acquiring a pickup position coordinate matched with a shooting picture after the camera zooms; after the camera is zoomed, the audio signal of the sound pickup can be better selected to be matched with the shot picture, so that the audio playing effect of audio and video is improved.
The method of step S140 specifically includes:
by the formula
Figure BDA0003518581500000052
Selection of diThe position coordinate of the pickup corresponding to the minimum value, namely the pickup of the position coordinate is the zoom of the cameraThe pickup adapted to the photographed picture (x)i,yi,zi) Expressed as the position coordinate of the ith pickup, i is more than or equal to 1 and less than or equal to N.
In other embodiments, the method of step S140 specifically includes:
by the formula
Figure BDA0003518581500000053
Selection of diA sound pickup position coordinate corresponding to the minimum value, that is, a sound pickup adapted to a shot image after the zoom of the camera, (x)i,yi,zi) Expressed as the position coordinates of the ith sound pickup, i is more than or equal to 1 and less than or equal to N, wherein,
Figure BDA0003518581500000054
Figure BDA0003518581500000055
in particular behind the virtual camera (i.e. z)i<z0) The audio signal of a sound pick-up is less important than the audio signal located in front of the virtual camera (i.e. z)i>z0) Importance of an audio signal of a microphone, and therefore, when calculating a distance between the microphone and the virtual camera in the z-axis direction, a distance weight of the microphone located in the z-axis direction of the virtual camera can be reduced, where a weighting factor γ is introduced for adjusting the distance weight of the microphone located in front of or behind the virtual camera; in this embodiment, the direction of the virtual camera facing the preset monitoring boundary is located in front of the virtual camera, and the direction of the virtual camera facing away from the preset monitoring boundary is located behind the virtual camera.
According to the audio and video pickup position selection method, when the camera zooms, the audio signal of the pickup in the adaptive three-dimensional space can be selected, the defects that the existing audio and video acquisition terminal selects the adaptive pickup through two-dimensional coordinates and cannot select the pickup in the axial direction of the lens of the camera are overcome, and therefore the audio signal of the appropriate pickup can be better selected to be matched with the audio and video, and the audio playing effect of the audio and video is improved.
Fig. 2 shows a structural block diagram of an audio/video pickup position selection system according to the present invention, and as shown in fig. 2, the present invention further provides an audio/video pickup position selection system corresponding to the audio/video pickup position selection method, where the audio/video pickup position selection system includes a module for executing the audio/video pickup position selection method, and the audio/video pickup position selection system may be configured on an audio/video acquisition terminal.
Specifically, as shown in fig. 2, the audio/video pickup position selection system includes an audio/video picture acquisition module 110, a three-dimensional coordinate system establishment module 120, a virtual camera module 130, and a pickup position acquisition module 140.
An audio/video picture acquisition module 110, configured to acquire an initial audio/video picture; in this embodiment, the audio/video image acquisition module 110 is a camera;
a three-dimensional coordinate system establishing module 120, configured to establish a three-dimensional coordinate system, and obtain coordinates corresponding to each pixel point in an initial audio/video picture in the three-dimensional coordinate system and coordinates corresponding to a position of each sound pickup in the three-dimensional coordinate system;
the virtual camera module 130 is used for defining a virtual camera and acquiring the position coordinates of the virtual camera in a three-dimensional coordinate system after the camera is zoomed;
and a pickup position acquiring module 140 for acquiring pickup position coordinates adapted to the zoom-photographed image of the camera.
In an embodiment, the virtual camera module 130 defines a virtual camera, acquires position coordinates of the virtual camera in a three-dimensional coordinate system after the camera is zoomed, and is specifically configured to perform the following steps:
by the formula
Figure BDA0003518581500000071
Acquiring the position coordinate (x) of the virtual camera in the three-dimensional coordinate system after the camera zooms0,y0,z0) (ii) a Wherein d isxExpressed as the distance between the virtual camera and the preset monitoring boundary in the axial direction of the lens, k is the zoom multiple of the lens of the camera, x0、y0Respectively carrying out zoom operation on the camera and then shooting the coordinate values of the center point of the picture in the x axis and the y axis of the initial audio and video picture, and the coordinate value of the center point of the picture in the initial audio and video picture in the z axis0=d0-dx
In one embodiment, the pickup position obtaining module 140 obtains pickup position coordinates adapted to a zoom-shot picture of a camera, and is specifically configured to perform the following steps:
by the formula
Figure BDA0003518581500000072
Selection of diThe position coordinate of the sound pick-up corresponding to the minimum value is the sound pick-up matched with the shot picture after the camera is zoomed, (x)i,yi,zi) Expressed as the position coordinates of the ith pickup, i is more than or equal to 1 and less than or equal to N.
In another embodiment, the pickup position obtaining module 140 obtains pickup position coordinates adapted to a zoom-shot picture of the camera, and is specifically configured to perform the following steps:
by the formula
Figure BDA0003518581500000073
Selection of diThe position coordinate of the sound pick-up corresponding to the minimum value is the sound pick-up matched with the shot picture after the camera is zoomed, (x)i,yi,xi) Expressed as the position coordinates of the ith microphone, i is more than or equal to 1 and less than or equal to N, wherein,
Figure BDA0003518581500000074
Figure BDA0003518581500000075
it should be noted that, as can be clearly understood by those skilled in the art, the specific implementation process of the audio/video sound pickup position selection system and each module may refer to the corresponding description in the foregoing method embodiment, and for convenience and conciseness of description, no further description is provided herein.
Fig. 3 is a block diagram of an internal structure of an audio/video acquisition terminal according to an embodiment of the present invention, and as shown in fig. 3, the audio/video acquisition terminal according to the present invention includes a memory and a processor connected by a system bus, and connected by the system bus; the storage is stored with computer programs, the processor is used for providing calculation and control capacity to support the operation of the whole terminal, the processor executes the computer programs, the method for selecting the position of the audio and video pickup is realized, when the camera zooms, the audio signals of the pickup in the adaptive three-dimensional space can be selected, the defects that the adaptive pickup is selected through two-dimensional coordinates and the pickup cannot be selected in the axial direction of the lens of the camera in the existing audio and video acquisition terminal are overcome, and therefore the audio signals of the appropriate pickup can be better selected to be matched with the audio and video, and the audio playing effect of the audio and video is improved.
The memory may include a non-volatile storage medium having an operating system stored thereon and a computer program stored thereon, which, when executed by the processor, causes the processor to implement the audio/video pickup position selection method.
The internal memory may also have a computer program stored therein, which when executed by the processor, causes the processor to perform the audio/video pickup position selection method. Those skilled in the art will appreciate that the configuration shown in fig. 3 is a block diagram of only a portion of the configuration associated with the present application and does not constitute a limitation on the application of the present application to other devices, and that a particular device may include more or less components than those shown, or may combine certain components, or have a different arrangement of components.
In one embodiment, the audio/video pickup position selection method provided by the present application may be implemented as a computer program, and the computer program may be run on an audio/video capture terminal as shown in fig. 3. The memory of the audio/video capture terminal can store various program modules constituting the audio/video pickup position selection system, such as the audio/video picture acquisition module 110, the three-dimensional coordinate system establishment module 120, the virtual camera module 130, and the pickup position acquisition module 140 shown in fig. 2. The computer program constituted by the respective program modules causes the processor to execute the steps of the audio-visual sound pickup position selection system of the respective embodiments of the present application described in the present specification. For example, the audio/video capture terminal shown in fig. 3 may obtain an initial audio/video image through the audio/video image obtaining module 110 in the audio/video pickup position selection system shown in fig. 2; the three-dimensional coordinate system establishing module 120 establishes a three-dimensional coordinate system, and obtains coordinates corresponding to each pixel point in an initial audio and video picture in the three-dimensional coordinate system and coordinates corresponding to the position of each pickup in the three-dimensional coordinate system; the virtual camera module 130 defines a virtual camera, and obtains the position coordinates of the virtual camera in the three-dimensional coordinate system after the camera is zoomed; the microphone position acquisition module 140 acquires microphone position coordinates that are adapted to a photographed picture after the camera is zoomed.
It should be understood that in the embodiments of the present Application, the Processor may be a Central Processing Unit (CPU), and the Processor may also be other general purpose processors, Digital Signal Processors (DSPs), Application Specific Integrated Circuits (ASICs), Field Programmable Gate Arrays (FPGAs) or other Programmable logic devices, discrete Gate or transistor logic devices, discrete hardware components, and the like. Wherein a general purpose processor may be a microprocessor or the processor may be any conventional processor or the like.
It will be understood by those skilled in the art that all or part of the flow of the method implementing the above embodiments may be implemented by a computer program instructing associated hardware. The computer program includes program instructions, and the computer program may be stored in a storage medium, which is a computer-readable storage medium. The program instructions are executed by at least one processor in the computer system to implement the flow steps of the embodiments of the method described above.
Accordingly, the present invention also provides a storage medium. The storage medium may be a computer-readable storage medium. The storage medium stores a computer program, wherein the computer program comprises program instructions. The program instructions, when executed by the processor, cause the processor to perform the steps of:
step S110, obtaining an initial audio and video picture;
step S120, establishing a three-dimensional coordinate system, and acquiring coordinates corresponding to each pixel point in an initial audio and video picture in the three-dimensional coordinate system and coordinates corresponding to the position of each sound pickup in the three-dimensional coordinate system;
step S130, defining a virtual camera, and acquiring the position coordinate of the virtual camera in a three-dimensional coordinate system after the camera is zoomed;
step S140, a pickup position coordinate adapted to a shot picture after the zoom of the camera is acquired.
The storage medium may be a usb disk, a removable hard disk, a Read-Only Memory (ROM), a magnetic disk, or an optical disk, which can store various computer readable storage media.
In summary, according to the audio/video pickup position selection method and system, the audio/video acquisition terminal, and the storage medium of the present invention, when the camera zooms, the audio signal of the pickup in the adaptive three-dimensional space can be selected, and the audio signal of the pickup can be better selected to match with the audio/video, so as to improve the audio playing effect of the audio/video.
Those of ordinary skill in the art will appreciate that the various illustrative components and algorithm steps described in connection with the embodiments disclosed herein may be implemented as electronic hardware, computer software, or combinations of both, and that the components and steps of the various examples have been described above generally in terms of their functionality in order to clearly illustrate this interchangeability of hardware and software. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the implementation. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present invention.
In the embodiments provided in the present invention, it should be understood that the disclosed system and method can be implemented in other ways. For example, the system embodiments described above are merely illustrative. For example, the division of each unit is only one logic function division, and there may be another division manner in actual implementation. For example, various elements or components may be combined or may be integrated in another system or some features may be omitted, or not implemented.
The steps in the method of the embodiment of the invention can be sequentially adjusted, combined and deleted according to actual needs. The units in the device of the embodiment of the invention can be merged, divided and deleted according to actual needs. In addition, functional units in the embodiments of the present invention may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit. The integrated unit, if implemented in the form of a software functional unit and sold or used as a stand-alone product, may be stored in a storage medium. Based on such understanding, the technical solution of the present invention essentially or partially contributes to the prior art, or all or part of the technical solution can be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing an apparatus (which may be a personal computer, a terminal, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention.
The above examples are merely illustrative of several embodiments of the present invention, and the description thereof is more specific and detailed, but not to be construed as limiting the scope of the invention. It should be noted that, for a person skilled in the art, several variations and modifications can be made without departing from the inventive concept, which falls within the scope of the present invention. Therefore, the protection scope of the present invention should be subject to the appended claims.

Claims (10)

1. A method for selecting the position of an audio and video pickup is characterized by comprising the following steps:
acquiring an initial audio and video picture; the content of the picture presented in the initial audio/video picture is a scene between a camera shot by the camera at an initial focal length and a plane where a preset monitoring boundary is located, and the plane where the preset monitoring boundary is located is vertical to the axial direction of a lens of the camera;
establishing a three-dimensional coordinate system, and acquiring coordinates corresponding to each pixel point in an initial audio and video picture in the three-dimensional coordinate system and coordinates corresponding to the position of each sound pickup in the three-dimensional coordinate system;
defining a virtual camera, and acquiring the position coordinate of the virtual camera in a three-dimensional coordinate system after the camera is zoomed;
and acquiring the position coordinate of the sound pickup matched with the shot picture after the camera zooms.
2. The method for selecting the position of the audio/video pickup according to claim 1, wherein the step of defining the virtual camera and the step of obtaining the position coordinates of the virtual camera in the three-dimensional coordinate system after the camera is zoomed comprises the following specific operations:
by the formula
Figure FDA0003518581490000011
Acquiring the position coordinate (x) of the virtual camera in the three-dimensional coordinate system after the camera zooms0,y0,z0) (ii) a Wherein d is0Expressed as the distance between the preset monitoring boundary and the camera in the lens axial direction of the camera, dxExpressed as the distance between the virtual camera and the preset monitoring boundary in the axial direction of the lens, k is the zoom multiple of the lens of the camera, x0、y0Respectively carrying out zooming operation on the camera to shoot picturesCoordinate values of the central point in the x axis and the y axis of the initial audio and video picture, z0=d0-dx
3. The method for selecting the position of the audio/video pickup according to claim 1, wherein the step of obtaining the pickup position coordinates adapted to the zoom-captured picture of the camera comprises the following specific operations:
by the formula
Figure FDA0003518581490000012
Selection of diA sound pickup position coordinate corresponding to the minimum value, that is, a sound pickup adapted to a shot image after the zoom of the camera, (x)i,yi,zi) Expressed as the position coordinate of the ith pickup, i is more than or equal to 1 and less than or equal to N.
4. The method for selecting the position of the audio/video pickup according to claim 1, wherein the step of obtaining the pickup position coordinates adapted to the zoom-captured picture of the camera comprises the following specific operations:
by the formula
Figure FDA0003518581490000021
Selection of diThe position coordinate of the sound pick-up corresponding to the minimum value is the sound pick-up matched with the shot picture after the camera is zoomed, (x)i,yi,zi) Expressed as the position coordinates of the ith microphone, i is more than or equal to 1 and less than or equal to N, wherein,
Figure FDA0003518581490000022
γ0∈(0,1)。
5. an audio-video pickup position selection system, comprising:
the audio/video picture acquisition module is used for acquiring an initial audio/video picture;
the three-dimensional coordinate system establishing module is used for establishing a three-dimensional coordinate system and acquiring coordinates corresponding to each pixel point in an initial audio and video picture in the three-dimensional coordinate system and coordinates corresponding to the position of each sound pickup in the three-dimensional coordinate system;
the virtual camera module is used for defining a virtual camera and acquiring the position coordinates of the virtual camera in a three-dimensional coordinate system after the camera is zoomed;
and the pickup position acquisition module is used for acquiring pickup position coordinates matched with the shot picture after the camera is zoomed.
6. The audio-video trigger position selection system of claim 5, wherein: the virtual camera module defines a virtual camera, acquires the position coordinate of the virtual camera in a three-dimensional coordinate system after the camera is zoomed, and is specifically used for executing the following steps:
by the formula
Figure FDA0003518581490000023
Acquiring the position coordinate (x) of the virtual camera in the three-dimensional coordinate system after the camera zooms0,y0,z0) (ii) a Wherein d isxExpressed as the distance between the virtual camera and the preset monitoring boundary in the axial direction of the lens, k is the zoom multiple of the lens of the camera, x0、y0Respectively carrying out zoom operation on the camera and then shooting the coordinate values of the center point of the picture in the x axis and the y axis of the initial audio and video picture, and the coordinate value of the center point of the picture in the initial audio and video picture in the z axis0=d0-dx
7. The audio-video trigger position selection system of claim 5, wherein: the pickup position acquisition module acquires pickup position coordinates matched with a shooting picture after the camera zooms, and is specifically used for executing the following steps:
by the formula
Figure FDA0003518581490000031
Selection of diMinimum value pairThe corresponding sound pick-up position coordinate is the sound pick-up matched with the shot picture after the camera is zoomed, (x)i,yi,zi) Expressed as the position coordinate of the ith pickup, i is more than or equal to 1 and less than or equal to N.
8. The audio-video trigger position selection system of claim 5, wherein: the pickup position acquisition module acquires pickup position coordinates adapted to a shot picture after zooming of the camera, and is specifically used for executing the following steps:
by the formula
Figure FDA0003518581490000032
Selection of diA sound pickup position coordinate corresponding to the minimum value, that is, a sound pickup adapted to a shot image after the zoom of the camera, (x)i,yi,zi) Expressed as the position coordinates of the ith sound pickup, i is more than or equal to 1 and less than or equal to N, wherein,
Figure FDA0003518581490000033
γ0∈(0,1)。
9. an audio and video acquisition terminal is characterized in that: the audio and video acquisition terminal comprises a memory and a processor, wherein a computer program is stored on the memory, and the processor realizes the audio and video pickup position selection method according to any one of claims 1 to 5 when executing the computer program.
10. A storage medium, characterized by: the storage medium stores a computer program comprising program instructions which, when executed, implement an audio-visual pick-up location selection method as claimed in any one of claims 1 to 5.
CN202210174618.4A 2022-02-24 2022-02-24 Audio and video pickup position selection method and system, audio and video acquisition terminal and storage medium Pending CN114554154A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210174618.4A CN114554154A (en) 2022-02-24 2022-02-24 Audio and video pickup position selection method and system, audio and video acquisition terminal and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210174618.4A CN114554154A (en) 2022-02-24 2022-02-24 Audio and video pickup position selection method and system, audio and video acquisition terminal and storage medium

Publications (1)

Publication Number Publication Date
CN114554154A true CN114554154A (en) 2022-05-27

Family

ID=81679652

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210174618.4A Pending CN114554154A (en) 2022-02-24 2022-02-24 Audio and video pickup position selection method and system, audio and video acquisition terminal and storage medium

Country Status (1)

Country Link
CN (1) CN114554154A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115134499A (en) * 2022-06-28 2022-09-30 世邦通信股份有限公司 Audio and video monitoring method and system

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115134499A (en) * 2022-06-28 2022-09-30 世邦通信股份有限公司 Audio and video monitoring method and system
CN115134499B (en) * 2022-06-28 2024-02-02 世邦通信股份有限公司 Audio and video monitoring method and system

Similar Documents

Publication Publication Date Title
JP6246246B2 (en) Device with adaptive camera array
JP5596972B2 (en) Control device and control method of imaging apparatus
US11258940B2 (en) Imaging apparatus
CN108605087B (en) Terminal photographing method and device and terminal
JP5799863B2 (en) Image processing apparatus, image processing method, and program
WO2016184131A1 (en) Image photographing method and apparatus based on dual cameras and computer storage medium
CN110189269B (en) Correction method, device, terminal and storage medium for 3D distortion of wide-angle lens
JP5846172B2 (en) Image processing apparatus, image processing method, program, and imaging system
CN111246080B (en) Control apparatus, control method, image pickup apparatus, and storage medium
JP2018037857A5 (en) Image processing apparatus, image processing method, computer program, and storage medium
JP6665917B2 (en) Image processing device
CN114554154A (en) Audio and video pickup position selection method and system, audio and video acquisition terminal and storage medium
JP6222205B2 (en) Image processing device
JP4877154B2 (en) Image processing apparatus, imaging apparatus, image processing method, and program
JP7424076B2 (en) Image processing device, image processing system, imaging device, image processing method and program
WO2020143090A1 (en) Image acquisition method, apparatus and device based on multiple cameras
JP5954987B2 (en) Imaging apparatus and processing method thereof
WO2022007681A1 (en) Photographing control method, mobile terminal, and computer readable storage medium
JP2018157314A (en) Information processing system, information processing method and program
JP6439845B2 (en) Image processing device
JP6579934B2 (en) Image processing apparatus, imaging apparatus, image processing method, program, and storage medium
US11445106B2 (en) Imaging apparatus
TWI621100B (en) Image processing method and portable eletronic device
JP6036934B2 (en) Image processing apparatus, image processing method, and program
KR100539774B1 (en) Wide angle photographing method for digital video system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination