WO2019174442A1 - Adapterization equipment, voice output method, device, storage medium and electronic device - Google Patents

Adapterization equipment, voice output method, device, storage medium and electronic device Download PDF

Info

Publication number
WO2019174442A1
WO2019174442A1 PCT/CN2019/075489 CN2019075489W WO2019174442A1 WO 2019174442 A1 WO2019174442 A1 WO 2019174442A1 CN 2019075489 W CN2019075489 W CN 2019075489W WO 2019174442 A1 WO2019174442 A1 WO 2019174442A1
Authority
WO
WIPO (PCT)
Prior art keywords
sound
mic
main
main mic
pickup
Prior art date
Application number
PCT/CN2019/075489
Other languages
French (fr)
Chinese (zh)
Inventor
李保民
任鹏
蔡成亮
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司 filed Critical 中兴通讯股份有限公司
Publication of WO2019174442A1 publication Critical patent/WO2019174442A1/en

Links

Images

Classifications

    • GPHYSICS
    • G02OPTICS
    • G02BOPTICAL ELEMENTS, SYSTEMS OR APPARATUS
    • G02B27/00Optical systems or apparatus not provided for by any of the groups G02B1/00 - G02B26/00, G02B30/00
    • G02B27/01Head-up displays
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00Details of transducers, loudspeakers or microphones
    • H04R1/20Arrangements for obtaining desired frequency or directional characteristics
    • H04R1/32Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00Circuits for transducers, loudspeakers or microphones
    • H04R3/12Circuits for transducers, loudspeakers or microphones for distributing signals to two or more loudspeakers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R2430/00Signal processing covered by H04R, not provided for in its groups

Definitions

  • the present disclosure relates to sound processing technology, and in particular to a sound pickup device, a sound output method, a device, a storage medium, and an electronic device.
  • AR/VR Augmented Reality/Virtual Reality
  • 3GPP 3rd Generation Partnership Project
  • eMBB Enhance Mobile Broadband
  • the overall solution to the "real-time viewing” process is "virtual reality” + “live perception”, that is, placing the scene on the scene in front of the user, and implementing stereo sound effects through two wireless earbuds. So that users can hear more powerful audio effects than usual when watching the game.
  • the video part of this technology mainly uses real-world cameras, including a series of sensitive components such as 1080p HD camera, infrared camera and infrared laser projector to capture 3D images.
  • the audio part of this technology uses a real-world camera to output X frames per second to achieve the purpose of simulating the sound field. In short, the camera will record the audio and video of the scene at a certain fixed point, and then increase the delay delay.
  • the disadvantage of adopting this technique is that the location of the acquisition is fixed from the implementation method and the achieved goal; then the user can only experience the sense of play at a certain specific position even if worn, and the viewing angle is single. Especially in sports such as football and marathon, it is impossible to achieve multi-camera viewing.
  • the sound field environment is extremely noisy. Even with a real-life camera, it is impossible to avoid the interference of other people's noise on the viewers. If the noise reduction algorithm is used, the algorithm operation is very complicated in the extremely noisy sound field environment.
  • the noise reduction process will filter out a part of the sound with a small sound. If the user is at a more remote viewing point, the method will filter the effective sounds such as the game in the field, even if The user can't hear the sound when he views the point. The noise reduction process is also unable to shield or extract the sound of the viewer in the direction of the view.
  • the embodiment of the present disclosure provides a sound collecting device, a sound output method, a device, and a storage medium, so as to at least solve the problem that the user can only experience the sense of play of a certain specific position and is affected by the surrounding noise. problem.
  • a sound pickup apparatus configured to be connected to an augmented reality AR or a virtual reality VR device, comprising: a main microphone Mic and a processor, wherein the main Mic And the processor is connected to the main Mic, and is configured to output the sound acquired by the main Mic to the AR or VR.
  • a sound output method comprising: acquiring a sound of a first sound source in a main Mic corresponding angle of view by using a main microphone Mic in a sound pickup device; acquiring the main Mic The sound is output to an augmented reality AR or virtual reality VR device connected to the sound pickup device.
  • a sound output device comprising: an acquisition module configured to acquire a sound of a first sound source at a corresponding angle of view of the main Mic by using a main microphone Mic in the sound pickup device; And an output module configured to output the sound acquired by the main Mic to an augmented reality AR or virtual reality VR device connected to the sound pickup device.
  • a storage medium having stored therein a computer program, wherein the computer program is configured to execute the steps of any one of the method embodiments described above at runtime.
  • an electronic device comprising a memory and a processor, wherein the memory stores a computer program, the processor being configured to execute the computer program to perform any of the above Said method.
  • the main Mic in the sound pickup device can only be set to acquire the sound on its corresponding viewing angle, the sound on the non-corresponding viewing angle is not acquired, thereby effectively shielding the sound on the non-corresponding viewing angle, and due to the main Mic
  • the sound of the corresponding angle of view is collected. Therefore, when the sound pickup device rotates, the angle of view that the main Mic is facing will change accordingly, so the sound collected by the main Mic will also change, realizing the position in real time.
  • the sound, and eliminates the sound of the main Mic non-positive perspective increases the user's deep immersion to the sound, and solves the problem that the user in the related art can only experience the sense of play at a certain position, and is subject to ambient noise. The impact of the problem.
  • FIG. 1 is a block diagram showing a hardware configuration of a mobile terminal of a sound output method according to an embodiment of the present disclosure
  • FIG. 2 is a flowchart of a sound output method according to an embodiment of the present disclosure
  • FIG. 3 is an overall flow chart of a sound output method according to an embodiment of the present disclosure
  • FIG. 4 is a schematic diagram of a free sound field in accordance with an embodiment of the present disclosure.
  • FIG. 5 is a schematic diagram of a sound collecting hole according to an embodiment of the present disclosure.
  • FIG. 6 is a schematic view of a convex surface in accordance with an embodiment of the present disclosure.
  • FIG. 7 is a schematic diagram of a current aperture receiving plane wave in a one-dimensional space according to an embodiment of the present disclosure
  • Figure 8 is a schematic perspective view of an embodiment of the present disclosure.
  • Figure 9 is a perspective view of an embodiment of the present disclosure.
  • FIG. 10 is a schematic diagram of a relationship between frequency and beam width in accordance with an embodiment of the present disclosure.
  • FIG. 11 is a schematic diagram of polar coordinates in a horizontal direction according to an embodiment of the present disclosure.
  • Figure 12 is a test chart 1 in accordance with an embodiment of the present disclosure.
  • FIG. 13 is a schematic diagram of a head and shoulder simulator in accordance with an embodiment of the present disclosure.
  • Figure 15 is a test chart three in accordance with an embodiment of the present disclosure.
  • FIG. 17 is a structural block diagram of a sound output device according to an embodiment of the present disclosure.
  • FIG. 1 is a hardware structural block diagram of a mobile terminal of a sound output method according to an embodiment of the present disclosure.
  • mobile terminal 10 may include one or more (only one shown in FIG. 1) processor 102 (processor 102 may include, but is not limited to, a processing device such as a microprocessor MCU or a programmable logic device FPGA. ), a memory 104 configured to store data, and a transmission device 106 configured as a communication function.
  • processor 102 may include, but is not limited to, a processing device such as a microprocessor MCU or a programmable logic device FPGA.
  • memory 104 configured to store data
  • a transmission device 106 configured as a communication function.
  • the structure shown in FIG. 1 is merely illustrative and does not limit the structure of the above electronic device.
  • the mobile terminal 10 may also include more or fewer components than those shown in FIG. 1, or have a different configuration than that shown in FIG.
  • the memory 104 may be configured as a software program and a module for storing application software, such as program instructions/modules corresponding to the sound output method in the embodiment of the present disclosure, and the processor 102 executes each by executing a software program and a module stored in the memory 104.
  • a functional application and data processing, that is, the above method is implemented.
  • Memory 104 may include high speed random access memory, and may also include non-volatile memory such as one or more magnetic storage devices, flash memory, or other non-volatile solid state memory.
  • memory 104 may further include memory remotely located relative to processor 102, which may be connected to mobile terminal 10 over a network. Examples of such networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.
  • Transmission device 106 is arranged to receive or transmit data via a network.
  • the above-described network specific example may include a wireless network provided by a communication provider of the mobile terminal 10.
  • the transmission device 106 includes a Network Interface Controller (NIC) that can be connected to other network devices through a base station to communicate with the Internet.
  • the transmission device 106 can be a Radio Frequency (RF) module configured to communicate with the Internet wirelessly.
  • NIC Network Interface Controller
  • RF Radio Frequency
  • the terminal may be a VR/AR device.
  • the user can sense the sound effect of the position in real time when the position of the head is rotated in any desired occasion, and eliminate the sound in the non-optical direction.
  • the depth immersion is increased; in the embodiment of the present disclosure, the gear position selectable by the user can also be provided, so that the user can achieve an excellent experience.
  • FIG. 2 is a flowchart of a sound output method according to an embodiment of the present disclosure. As shown in FIG. 2, the flow includes the following steps:
  • Step S202 acquiring, by using the main microphone Mic in the sound collecting device, the sound of the first sound source in the main Mic corresponding angle of view;
  • Step S204 outputting the sound acquired by the main Mic to the augmented reality AR or the virtual reality VR device connected to the sound collecting device.
  • the terminal performing the above operation may be the above-mentioned sound pickup device, and the sound pickup device may be connected to the AR or VR device as a part of the AR or the VR, wherein the main Mic in the sound pickup device is used to acquire the main Mic corresponding perspective. sound.
  • the number of master Mic in the above-described sound pickup device may be one or more.
  • the main Mic in the sound pickup device can only be set to acquire the sound on its corresponding viewing angle, the sound on the non-corresponding viewing angle is not acquired, thereby effectively shielding the sound on the non-corresponding viewing angle, and Mic collects the sound at the corresponding angle of view. Therefore, when the pickup device rotates, the angle of view that the main Mic is facing will change accordingly, so the sound collected by the main Mic will also change, realizing the real-time perception.
  • the sound of the position, and the sound of the Mic non-corresponding angle is excluded, which increases the depth immersion of the user, and solves the problem that the user can only experience the sense of play at a certain position and suffers from surrounding noise. The problem of impact.
  • the method before outputting the sound acquired by the main Mic to the augmented reality AR or the virtual reality VR device connected to the sound collecting device, the method further includes: acquiring the auxiliary Mic by using the auxiliary Mic in the sound collecting device. Corresponding to the sound of the second sound source in the viewing angle, wherein the auxiliary Mic is disposed around the main Mic; outputting the sound acquired by the main Mic to the augmented reality AR or the virtual reality VR device connected to the sound collecting device comprises: acquiring the main Mic The sound is synthesized with the sound acquired by the auxiliary Mic, and the synthesized sound is output to the AR or VR device connected to the sound pickup device.
  • the two operations of acquiring the sound of the auxiliary Mic corresponding angle of view by using the auxiliary Mic in the above-mentioned sound pickup device are not in a proper order. It should be noted that, in actual application, only the main Mic may be set, and the purpose of setting the auxiliary Mic is to better sense the sound at a specific position around, and to increase the stereoscopic effect of the sound to a certain extent.
  • acquiring the sound of the first sound source in the main Mic corresponding angle of view by using the main Mic in the sound collecting device comprises: configuring an aperture size of the sound collecting hole of the main Mic and/or a sound collecting hole depth; The sound of the first sound source at the main Mic corresponding angle of view is acquired by the main Mic configured with the aperture size and/or the depth of the sound collecting hole.
  • the aperture size of the sound collecting hole of the main Mic is adjustable, and the depth of the sound collecting hole is also adjustable.
  • the aperture size of the sound collecting holes of different specifications can be configured for the main Mic.
  • the configuration of 24mm, 12mm, 6mm, 3mm, or the main Mic configuration can continuously adjust the aperture size of the pickup hole
  • the specific configuration can be determined according to the specific circumstances.
  • it may be manually adjusted by the user, or automatically adjusted according to the size of the sound of the sound source corresponding to the main Mic, or according to configuration information input from the user, or configuration information from other devices.
  • the other device may be a device connected to the sound collecting device, for example, the sound collecting device is a device disposed in a field, the other device is used by the user in the room and wirelessly connected to the sound collecting device The controller is connected in a manner so that remote control of the sound pickup device can be realized.
  • configuring the aperture size and/or the pickup depth of the sound collecting hole of the main Mic includes: configuring the main Mic pickup according to the configuration information when receiving the input configuration information.
  • the aperture size of the aperture and/or the depth of the pickup hole; in the case where the input configuration information is not received, the aperture size of the pickup hole of the main Mic and/or the depth of the pickup hole are configured according to a preset value, for example, preset
  • the default aperture size is 12mm, and the preset aperture size can be freely set by the user.
  • the method before acquiring the sound of the first sound source in the main Mic corresponding viewing angle by using the main Mic in the sound collecting device, the method further includes: determining a direction and a range of the sound pickup device that needs to be rotated. Using the determined pickup device, the direction and amplitude of rotation are required to control the pickup device to rotate.
  • the sound collecting device is freely rotatable.
  • the sound collecting device is a wearable device of the user's head
  • the sound collecting device rotates with the rotation of the user's head
  • the sound collecting device is disposed at The equipment in the field, while the user is indoors
  • the user can control the rotation of the sound collecting device through the control device connected to the sound collecting device, and the embodiment is mainly directed to the remote control of the sound collecting device, thereby realizing the picking up
  • the sound device moves according to the user's control, so that the user can sense the sound in any direction and improve the user experience.
  • the auxiliary Mic and the main Mic are similar, and the configuration of the pickup hole size can also be performed, and details are not described herein again.
  • the method before acquiring the sound of the first sound source in the main Mic corresponding viewing angle by using the main Mic in the sound collecting device, the method further includes: determining a direction in which the main Mic needs to rotate and Amplitude; controlling the main Mic to rotate by using the determined direction and amplitude of the main Mic to be rotated.
  • the pickup device can be configured to be stationary, but the main Mic can be flexibly rotated.
  • FIG. 3 is an overall flowchart of a sound output method according to an embodiment of the present disclosure. As shown in FIG. 3, the method includes the following steps:
  • the device is the pickup device
  • the integrated main Mic (corresponding to the main Mic described above) starts to move with the head and shoulders, and records to determine whether to adopt the default gear position (assuming the default gear position is D gear, different gear positions correspond to different main Mic
  • the aperture size of the sound collecting hole in this embodiment is assumed to have four apertures of the sound hole), if yes, go to step [004], if not, go to step [005];
  • the dual-auxiliary Mic located at the ears of the head and shoulders is recorded; and is transmitted to the user's earphone via an electro-acoustic signal;
  • the other channels continue to record the sound and correspond to the corresponding path, that is, when the user does not select the default gear position (the process of selecting other gear positions at this time) In the middle), the four positions of the ABCD will correspond to the four channels of the ABCD, and the sound recording will be performed simultaneously in each channel;
  • the system After the user selects other gear positions, the system automatically switches to the path of the corresponding gear position, and establishes a connection with the earphone path for sound transmission. Other non-user-selected gear positions close the path to the headset.
  • Sound waves are transmitted in free space, that is, in wireless ideal media. Because the boundary is infinite, we can regard the sound field we solve as a directional spherical body.
  • Figure 4 shows a spherical plane, where 'O' is the location of the person, 'A, B, C' are three different sound sources. People do perspective motion on the horizontal plane.
  • the problem we need to solve is how to determine the 'A, B, C' sound sources, and thus experience the hearing at different positions of 'A, B, C'. effect.
  • Mic's performance can be described by a series of objective parameters, including sensitivity, flatness, equivalent noise level, directivity, dynamic range, and more.
  • the lowest to highest sound pressure level of the general competition venue is generally between 60dB and 110dB.
  • the traditional Mic diameter is generally 24mm, 12mm, 6mm, 3mm, and the frequency response is 20Hz ⁇ 40kHz.
  • the approximation can be regarded as omnidirectional, and the measurement range of the sound pressure level is 30 dB to 140 dB; the ambient sound pressure level of the sound pickup device in the embodiment of the present disclosure may be within the range of the Mic measurement sound pressure level.
  • the sound hole (also known as the sound hole or the sound channel) is an important part of acquiring an external sound source. See Figure 5 for details.
  • the problem of entering the sound of the channel and shielding other non-opposing positions can be solved by plane rotation, increasing the aperture and length of the sound collecting hole.
  • the method of contrast stripping is adopted, that is, the sound is collected in real time by adding two omnidirectional Mic as the auxiliary Mic, and then the main Mic is rotated and collected by widening the aperture and length of the sound collecting hole, and then synthesized to obtain the final result. sound.
  • a sound pickup device configured to be connected to an augmented reality AR or a virtual reality VR device, including: a main Mic and a processor, wherein The main Mic is set to acquire the sound of the first sound source at the main Mic corresponding angle of view; the above processor is connected to the main Mic and the auxiliary Mic, and is arranged to output the sound acquired by the main Mic to the AR or VR.
  • the main Mic in the sound pickup device can only be set to acquire the sound on the corresponding viewing angle, the sound on the non-corresponding viewing angle is not acquired, thereby effectively shielding the sound on the non-corresponding viewing angle, and The main Mic is to collect the sound in the opposite direction.
  • the pickup device rotates, the direction that the main Mic is facing will change accordingly, so the sound collected by the main Mic will also change, realizing the real-time perception.
  • the sound of the position, and the sound of the Mic non-positive direction is excluded, which increases the user's deep immersion to the sound, and solves the problem that the user in the related art can only experience the sense of play at a certain position and is subject to ambient noise. The impact of the problem.
  • the sound collecting device further includes a secondary Mic, wherein the auxiliary Mic is disposed around the main Mic, and is configured to acquire a sound of the second sound source corresponding to the auxiliary Mic; the processor is further configured to Connected to the auxiliary Mic, it is set to synthesize the sound acquired by the main Mic and the sound acquired by the auxiliary Mic and output the synthesized sound to the AR or VR.
  • the processor is further configured to configure the aperture size of the pickup aperture of the main Mic and/or the depth of the pickup aperture.
  • the aperture size of the pickup hole of the main Mic is adjustable, and the depth of the pickup hole is also adjustable.
  • the aperture of the pickup hole of the auxiliary Mic can also be set to be adjustable, and the configuration thereof is similar to that of the main Mic, and will not be described again here.
  • the processor may configure the aperture size of the sound hole of the main Mic and/or the depth of the sound collection hole by: configuring the configuration information according to the configuration information when the input configuration information is received.
  • the processor is further configured to: determine a direction and an amplitude of the sound pickup device to be rotated; and use the determined sound pickup device to control the rotation direction and amplitude to control the sound pickup device to rotate.
  • the processor is further configured to: determine a direction and an amplitude of the main Mic to be rotated; and control the main Mic to rotate by using the determined direction and amplitude of the main Mic to be rotated.
  • the main Mic when the sound collecting device is a wearable device of a user's head, the main Mic includes a main Mic located at a front end of the wearable device, wherein the front end is set to be the forehead of the wearer Corresponding.
  • the number of the secondary Mic may be two, respectively located at a position of the left ear of the corresponding user of the wearable device and a position corresponding to the right ear of the user. It should be noted that setting the main Mic at the forehead and setting the auxiliary Mic at the ears is a more preferable setting method, and the specific reasons are as follows:
  • the main Mic is designed as a protruding surface.
  • the principle of convex reflection almost all convex surfaces have scattering effects, and they are important reflecting surfaces as diffusing bodies because for convex surfaces, r is always negative (as shown in Figure 6).
  • the first step in the purpose of the embodiment of the present disclosure is to define the main Mic as a device that can be rotated for the main-pair video source acquisition, the next step The aperture widening process will be performed on the pickup hole of the main Mic.
  • the aperture here is expressed as an electroacoustic sensor (Mic) that converts an acoustic signal into an electrical signal.
  • A(f,r) is the aperture function
  • the aperture function can be used to know the corresponding function reflected by the aperture in different spatial sizes.
  • FIG. 7 shows the signal of the linear aperture receiving plane wave in the one-dimensional space.
  • the response of the aperture is a function of the frequency of the signal entering the aperture and the direction of incidence. It can be derived by solving the wave equation that the aperture response and the aperture function are in the presence of a Fourier transform.
  • the far field condition is expressed by the response function of the aperture:
  • Fr ⁇ . ⁇ is a three-dimensional Fourier transform
  • the coordinates shown in FIG. 8 can be simplified to a one-dimensional linear aperture along the X-axis direction, and the aperture length is L, as shown in FIG.
  • the aperture response is simplified to:
  • the range of directionality can be derived from the range between: - ⁇ / L ⁇ ⁇ x ⁇ ⁇ / L
  • the area between the areas is called the main lobe, and the range is the beam width. Therefore, for a fixed aperture length, the higher the frequency, the narrower the beam width, as shown in FIG.
  • the length of the caliber response can be expressed in the horizontal direction as:
  • the linear aperture characteristics can be obtained by the calculations of the above formulas (1) to (7).
  • the angle formed by the main axis is exactly the angle of the main axis shown in FIG.
  • the ⁇ H (fixed bracket international general angle) formed by the test is the same as the relevant laboratory test of China Mobile Lab.
  • test strategy is as follows (see Figure 15 for details):
  • the two auxiliary Mics are placed in the "head and shoulder simulator" binaural position in the embodiment of the present disclosure.
  • test environment is built (see Figure 15):
  • the main Mic in the forehead portion in the embodiment of the present disclosure.
  • the main Mic further includes a main Mic located at the top end of the wearable device, wherein the top end is configured to correspond to the top of the wearer's head.
  • Stereoscopic sound collection can be achieved by setting the main Mic at the position of the corresponding overhead, so that the user can feel a more stereoscopic sound. It should be noted that, in the present disclosure, the position of the main Mic can be freely set according to actual needs, and the number of main Mic can be freely adjusted.
  • the auxiliary Mic when the above-mentioned sound pickup device is a wearable device of a user's head, the auxiliary Mic includes a secondary Mic located on both sides of the wearable device, and both sides are set to be in contact with the wearer's ears. correspond.
  • module may implement a combination of software and/or hardware of a predetermined function.
  • apparatus described in the following embodiments is preferably implemented in software, hardware, or a combination of software and hardware, is also possible and contemplated.
  • the device includes:
  • the obtaining module 172 is configured to acquire the sound of the first sound source in the main Mic corresponding angle of view by using the main Mic in the sound collecting device; the output module 174 is connected to the obtaining module 172, and is configured to output the sound obtained by the main Mic to the sound An augmented reality AR or virtual reality VR device connected to the above-described sound pickup device.
  • the apparatus is further configured to: use the auxiliary Mic in the sound pickup device before outputting the sound acquired by the main Mic to the augmented reality AR or the virtual reality VR device connected to the sound pickup device Obtaining a sound of the second sound source corresponding to the auxiliary Mic, wherein the auxiliary Mic is disposed around the main Mic; and outputting the sound acquired by the main Mic to the augmented reality AR or the virtual reality VR device connected to the sound collecting device: The sound acquired by Mic and the sound acquired by the auxiliary Mic are combined, and the synthesized sound is output to an AR or VR device connected to the pickup device.
  • the obtaining module 172 may acquire the sound of the first sound source in the main Mic corresponding perspective by using the main Mic in the sound collecting device by configuring the aperture of the sound collecting hole of the main Mic. Size and/or pickup hole depth; the main Mic configured with the aperture size and/or the pickup aperture depth is used to acquire the sound of the first sound source at the main Mic corresponding angle of view.
  • the obtaining module 172 may configure the aperture size and/or the depth of the sound collecting hole of the main Mic by the following manner: in case the input configuration information is received, according to the configuration The information configures the aperture size of the pickup hole of the main Mic and/or the depth of the pickup hole; if the input configuration information is not received, the aperture size and/or the pickup of the pickup hole of the main Mic is configured according to the preset value. Hole depth.
  • the sound output device is further configured to determine a direction and an amplitude of the sound pickup device to be rotated before acquiring the sound in the direction of the main Mic by using the main Mic in the sound pickup device;
  • the pickup device needs the direction and amplitude of the rotation to control the pickup device to rotate.
  • the apparatus is further configured to determine a direction in which the main Mic needs to be rotated before acquiring the sound of the first sound source in the main Mic corresponding viewing angle by using the main Mic in the sound collecting device. And amplitude; controlling the main Mic to rotate by using the determined direction and amplitude of the main Mic to be rotated.
  • each of the above modules may be implemented by software or hardware.
  • the foregoing may be implemented by, but not limited to, the foregoing modules are all located in the same processor; or, the above modules are in any combination.
  • the forms are located in different processors.
  • Embodiments of the present disclosure also provide a storage medium having stored therein a computer program, wherein the computer program is configured to execute the steps of any one of the method embodiments described above.
  • the foregoing storage medium may include, but is not limited to, a USB flash drive, a Read-Only Memory (ROM), and a Random Access Memory (RAM).
  • ROM Read-Only Memory
  • RAM Random Access Memory
  • Embodiments of the present disclosure also provide an electronic device including a memory and a processor having stored therein a computer program configured to execute a computer program to perform the steps of any of the above method embodiments.
  • the electronic device may further include a transmission device and an input and output device, wherein the transmission device is connected to the processor, and the input and output device is connected to the processor.
  • the basic core of the present disclosure is to define and layout Mic on the AR/VR product by using the AR/VR product.
  • the Mic can be temporarily set to have four specifications of 24mm, 12mm, 6mm, and 3mm.
  • the Mic is integrated into an acquisition fan, and the fan can rotate with the user's head at the same angle.
  • the structure through the plane rotation, increases the aperture length of the widening sound hole to solve the sound entering the channel, and shields the sound source entry of other non-opposing positions, and adds other auxiliary Mic to match.
  • the original Mic is a sound pickup device that can perform fan rotation.
  • the four specifications (four specifications are only exemplary descriptions, other specifications can also be used) are integrated, and the integrated Mic module is set to collect the user in the viewing direction.
  • the auxiliary Mic is two omnidirectional auxiliary Mic, and the sound next to the position where the user is located can be collected in real time, and can be rotated with the user's head, or can be set to not rotate with the user's head.
  • the integrated main Mic starts to work.
  • the device defaults to any one of A, B, C, and D (each gear corresponds to one specification). The default is D gear.
  • Each gear corresponds to one path; two auxiliary Mic Start working, collecting the ambient sound of the left and right sides of the user in real time; when the user is facing a certain area, the integrated main Mic position changes with the angle as the steering angle changes; at this time, the integrated Mic converts the electroacoustic signal.
  • the two auxiliary Mic convert the sounds on the left and right sides of the head through the electro-acoustic signal, and transmit them to the user through the earphone; when the user feels that the feeling at this time cannot meet the demand of the live viewing,
  • the adjustment button switches the gear position and opens the passage corresponding to the selection to achieve better results.
  • the method adopted is to redefine the Mic and solve the problem of entering the channel sound and shielding other non-user-to-view source by the "plane rotation", "increasing or widening the aperture” and the length of the sound hole.
  • the problem is that hardware modification is performed to perform sound source localization.
  • Such an acquisition surface Mic allocation layout strategy and usage rules are described in the embodiments of the present disclosure.
  • a plurality of main Mic may be set (the aperture size of each Mic's sound collecting hole is different, or Set a main Mic, and set the aperture size specifications of the various pickup holes for the one main Mic.
  • the number of main Mic added is set to three, which are A, B, and C respectively.
  • the three Mic apertures B and C are a mm, b mm, and c mm, and the orientations are a°, b°, and c° with respect to a certain horizontal position.
  • the "positioning method in the horizontal direction" described in the embodiments of the present disclosure is intended to better explain the problem, is one of the basic embodiments of the core of the present disclosure, and does not represent the complete and unique embodiment described in the present disclosure.
  • the acquisition of the stereoscopic space can be achieved by adding a device that uses the same technical means and is perpendicular to its basic embodiment as described in the present disclosure.
  • a complete embodiment of the present disclosure includes, but is not limited to, sound source localization in a horizontal direction, as described in this paragraph, when two or more techniques, such as those described in the present disclosure, are added, omnidirectional acquisition can be achieved.
  • Embodiments of the present disclosure provide a stereo sensing technology that deploys the device described in this patent at a specific location according to the size of the space, and can achieve all-round, stereo space sound source collection.
  • the user After wearing the above-mentioned sound collecting device, the user can experience the feeling of watching the established position at any point in the local (home, field), thereby increasing the user's selectivity and immersion.
  • the user's location does not need to change. For example, when the user is at the B point position, the head is turned to the left, and the user's selection can sense the equivalent effect of any point on the scene, such as the head of point A turning to the left.
  • Each of the received sound sources described in the embodiments of the present disclosure includes position information, and the position information is already included in the sound source at the beginning of the collection, that is, the position information is the initial sound source position. Therefore, when it is restored to the user, it not only includes the sound, but also allows the user to feel the change of the location.
  • Embodiments of the present disclosure also describe such an audio and video matching synthesis rule.
  • the prior art when the matching audio and video is restored, all parameters are adjusted and changed based on time (frame), and the method described in the patent is to add a position coordinate axis by rotating (Pan) and vertical.
  • rotating (Tilt) two parameters you can match the sound and the orientation of the view, and combine when the time and position coordinates match at the same time.
  • Embodiments of the present disclosure describe a frame structure of an audio format.
  • the existing audio file includes simple left and right channels, audio tracks (sounds), and the like.
  • position information is added.
  • the specific description is that the origin is a distance source, and the origin is a distance source.
  • the relative position; similarly, the collection point also performs video acquisition, which is also the origin of the video collection point.
  • the position information described in this embodiment is the relative position of all sound sources relative to the collection point.
  • Embodiments of the present disclosure describe a rule for audio reproduction when a position (coordinate point) changes.
  • the sound restored to the user described in this embodiment does not change the size of the sound restored to the user, but changes the proportion of the loudness that is restored to the user for each utterance point within the range of the collected sound field. Therefore, in several factors that affect the user experience, such as video, audio, body, touch light content, this embodiment mainly describes the restoration of the audio ratio.
  • Embodiments of the present disclosure describe a user self-selection mode.
  • the main Mic is described as four default gear positions.
  • the other gear positions can be arbitrarily switched according to the user's own needs, and the other channels continue to record the sound, and corresponding to the corresponding The channel is fed back to the user to increase the user experience.
  • the main Mic is the four default gear positions, which is one of the basic functional characteristics of the core of the patented invention, and does not represent the complete uniqueness described in the present disclosure. implementation plan. Therefore, a complete embodiment of the present disclosure includes, but is not limited to, four default gear positions.
  • Embodiments of the present disclosure also describe such an adaptive scheme based on multiplex matching.
  • the sound source localization method described in the present disclosure after adding the number, specification, and position of the main Mic that matches the post-design requirements, as described above, the sound source positioning of the full-field multi-plane dimension can be realized, and the main Mic acquisition surface is collected. The source is recorded. For example, there are two users A and B facing two angles of view at different angles a° and b° at the same time, and the sound source recorded in the whole field is ⁇ . At this time, the system automatically matches the angle A of the user A with the position information contained in the alpha source. When the matching is successful, the scanned source that has been successfully matched is transmitted to the user A.
  • the system automatically matches the viewing angle b° of the user B with the position information contained in the alpha sound source, and when the matching is successful, transmits the scanned matching sound source to the user B. It should be noted that what is described in the examples is one of the basic functional characteristics of the core of the present disclosure, and does not represent the complete and unique embodiment described in the present disclosure.
  • Embodiments of the present disclosure also describe such a local based non-real time multiplexing acquisition scheme. After adopting the scheme that matches the requirements of the post-design, the system automatically collects the full-field sound for recording, and stores the recorded information locally. When the user uses the above-mentioned multiplexing matching experience scheme, the non-real-time multi-user can be realized. Perception of sound source localization from different perspectives.
  • the inventive embodiment provides such a solution to the device layout.
  • the device described in the present disclosure can be placed at any point in the field, including but not limited to one terminal device.
  • the solution in the embodiment of the present disclosure is to realize the hardware modification manner of the sound source entering the channel and shielding other non-opposing positions by planar rotation, increasing the aperture and length of the sound collecting hole, and the like.
  • the existing patents are implemented by rendering and encoding the audio data collected by the N audio collection devices mentioned.
  • the method described in the embodiments of the present disclosure differs from the method described in the prior art in the implementation of “sound source localization” in the “sound source localization”.
  • the present disclosure describes the initial acquisition process.
  • the relevant sound source (including the perceived position) information has been entered, and no later processing is required on the implementation method that achieves the basic effect.
  • the existing patent is to input the sound source, and then pass the sound data device corresponding to the determined audio data; and use the determined Q speaker devices to render the M channel audio data.
  • the intervention point is the rendering stage of the M-channel audio data corresponding to the Q speakers in the later stage.
  • the acquisition method described in the present disclosure is achieved by adding two (including but not limited to two) auxiliary Mic, and redefining the integrated main Mic of four specifications (including but not limited to four).
  • the device can achieve multi-directional sound source acquisition and positioning in a single location.
  • the prior art utilizes the set orientation of each video capture device, and then sets N corresponding audio capture devices.
  • the main content of the existing patents is VR audio and video corresponding rendering, and selection and restoration of video and audio at any position in the virtual scene, and there is no distinction and collection of different azimuth sound sources in the stereo space, but the collection in the embodiment of the present disclosure
  • the method can effectively distinguish and collect sound sources in different directions in the three-dimensional space.
  • the method described in the embodiment of the present disclosure records the related sound source (including the perceived position) information in the initial collection process, and does not need to be processed later in the implementation manner that achieves the basic effect.
  • the present disclosure describes a rule for audio reproduction when positional changes (coordinate points). Instead of changing the size of the sound restored to the user, the ratio of the sound level of each sound point within the range of the collected sound field is changed.
  • the prior art does not have this function, but is simply restored to the user after rendering.
  • the present disclosure implements a mode of user self-selection, the described main Mic has four default gear positions (including but not limited to four), and when the user does not select the default gear position, any other gear position can be switched, other The path continues to record the sound and responds to the corresponding path, giving feedback to the user, increasing the user experience.
  • the present disclosure establishes such a stereo sensing technology, and according to the size of the site and its space, deploying the device described in this patent at a specific location, can achieve all-dimensional, three-dimensional space acquisition.
  • the user can observe the perception of the established position at any point in the local (at home, at any point in the stadium), increase the user's selectivity and immersion, and the user's location does not need to change.
  • the solution in the embodiment of the present disclosure can perform gear position adjustment on a device, and the user independently selects the aperture size to increase the user's selectivity.
  • the solution in the embodiment of the present disclosure can be applied to an existing smart phone product, and when the device is used, when the user visits the scene and turns the position of the head, the sound effect of the position can be sensed in real time, and the depth immersion is increased.
  • Sense also provides a variety of optional gears for users to achieve a better experience.
  • the disclosed patents can be well used in urban planning and urban modeling.
  • urban planning users can conduct on-site inspections, and after wearing the terminal, they can perceive the rationality of the set model;
  • Public patents can be well used in geography, and comprehensively use geographic information such as 3D GIS to achieve the perception of certain geographical types, providing reliable reference data and user perception; this patent can be exercised in disaster relief.
  • 3D GIS geographic information
  • this patent can be exercised in disaster relief.
  • real-time perception of the internal structure and perception information of the building and other information affecting rescue work accurately locate the best rescue route, select the best rescue means, and greatly improve the rescue efficiency.
  • the image returned from the scene is tracked and the rescue command is issued in real time.
  • patented invention can also be applied to various fields such as military, industrial, electronic cruise, education, and the like.
  • modules or steps of the present disclosure described above can be implemented by a general-purpose computing device that can be centralized on a single computing device or distributed across a network of multiple computing devices. Alternatively, they may be implemented by program code executable by the computing device such that they may be stored in the storage device by the computing device and, in some cases, may be different from the order herein.
  • the steps shown or described are performed, or they are separately fabricated into individual integrated circuit modules, or a plurality of modules or steps thereof are fabricated as a single integrated circuit module. As such, the disclosure is not limited to any specific combination of hardware and software.
  • the sound collecting device, the sound output method, the device, the storage medium, and the electronic device provided by the embodiments of the present invention have the following beneficial effects: the user in the related art can only understand the viewing of a specific location. Feeling, and will be affected by the surrounding noise.

Landscapes

  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Otolaryngology (AREA)
  • Engineering & Computer Science (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • General Health & Medical Sciences (AREA)
  • General Physics & Mathematics (AREA)
  • Optics & Photonics (AREA)
  • Stereophonic System (AREA)
  • Circuit For Audible Band Transducer (AREA)

Abstract

The present disclosure provides adapterization equipment, a voice output method, a device, a storage medium and an electronic device, wherein the voice output method comprises: acquiring, by using a main Mic in the adapterization equipment, the voice of a first voice source at a corresponding viewing angle of the main Mic; and outputting the voice acquired by the described main Mic to an AR or VR device connected to the adapterization equipment. According to the present disclosure, the problem that a user in the related art can only experience the game watching at a certain specific location and will be affected by the surrounding noise is solved.

Description

拾音设备、声音输出方法、装置、存储介质及电子装置Sound pickup device, sound output method, device, storage medium, and electronic device 技术领域Technical field
本公开涉及声音处理技术,具体而言,涉及一种拾音设备、声音输出方法、装置、存储介质及电子装置。The present disclosure relates to sound processing technology, and in particular to a sound pickup device, a sound output method, a device, a storage medium, and an electronic device.
背景技术Background technique
目前对增强现实/虚拟现实(Augmented Reality/Virtual Reality,简称为AR/VR)的研究已经相当广泛和深入,同时各大科技媒体、网站都在热议AR/VR,AR/VR产品可以被称为智能手机之后的又一个颠覆性科技产品。人们在日常生活中与现实世界的事物互动的要更多一些,而AR/VR产品的核心理念就是通过在现实环境中加载虚拟信息,来帮助人类完成工作。特别是随着第五代移动通信技术(the 5th Generation mobile communication technology,简称为5G)时代的到来,第三代合作项目组织(The 3rd Generation Partnership Project,简称为3GPP)所定义的增强移动宽带(Enhance Mobile Broadband,简称为eMBB)更是其中的重要场景之一,其所对应的3D/超高清视频等大流量移动宽带业务更加速了VA/VR技术的发展和应用。At present, the research on Augmented Reality/Virtual Reality (AR/VR) has been extensive and in-depth. At the same time, major technology media and websites are hot on AR/VR, AR/VR products can be called Another disruptive technology product after the smartphone. People interact more with real-world things in their daily lives, and the core idea of AR/VR products is to help humans get the job done by loading virtual information in real-world environments. Especially with the advent of the 5th Generation mobile communication technology (5G) era, the enhanced mobile broadband defined by The 3rd Generation Partnership Project (3GPP) Enhance Mobile Broadband (abbreviated as eMBB) is one of the important scenarios. The high-traffic mobile broadband services such as 3D/Ultra HD video accelerate the development and application of VA/VR technology.
现在很多AR/VR厂商,甚至一些智能电视厂商都推出了如“实时观赛”的功能,其目的就是为了让每一位用户可以通过佩戴它们的产品进行“实时观赛”,从而为观众带来全新的体验。Now many AR/VR vendors, and even some smart TV manufacturers, have launched functions such as “real-time viewing”, which is designed to allow each user to “live view” by wearing their products, thus bringing the audience Come to a new experience.
在相关技术中,对于“实时观赛”处理,其总体的解决思路在于“虚拟现实”+“现场感知”,也就是将现场的画面投放在用户面前,通过两个无线耳塞,实现立体声音效等,这样用户在看球赛的时候能够听到比平时更加给力的音频效果。这种技术的视频部分主要采用实感摄像头,包括1080p高清摄像头、红外摄像头以及红外镭射投影仪等在内的一系列敏感元件捕捉3D画面。这种技术的音频部分是用实感摄像头,进行每秒X帧的输出,从而达到模拟声场的目的。简而言之就是用摄像头在某个定点的 位置将现场的音视频录制下来,再增加延时delay。In the related art, the overall solution to the "real-time viewing" process is "virtual reality" + "live perception", that is, placing the scene on the scene in front of the user, and implementing stereo sound effects through two wireless earbuds. So that users can hear more powerful audio effects than usual when watching the game. The video part of this technology mainly uses real-world cameras, including a series of sensitive components such as 1080p HD camera, infrared camera and infrared laser projector to capture 3D images. The audio part of this technology uses a real-world camera to output X frames per second to achieve the purpose of simulating the sound field. In short, the camera will record the audio and video of the scene at a certain fixed point, and then increase the delay delay.
采用这种技术的缺点在于,从实现方法来和所达到的目的来看,采集的地点是固定的;那么用户即便佩戴也只能体验某一个特定位置的观赛感,观赛视角单一。尤其是在足球运动、马拉松等场地较大的运动项目,无法达到多机位观赛。The disadvantage of adopting this technique is that the location of the acquisition is fixed from the implementation method and the achieved goal; then the user can only experience the sense of play at a certain specific position even if worn, and the viewing angle is single. Especially in sports such as football and marathon, it is impossible to achieve multi-camera viewing.
其次,对于橄榄球、篮球、棒球等场内气氛活跃的运动,场地声场环境极嘈杂,即便采用实感摄像头都无法避免旁人噪音对观赛者的干扰。如采用降噪算法处理,在极为嘈杂的声场环境下,算法运算又很复杂。Secondly, for sports such as rugby, basketball, and baseball, the sound field environment is extremely noisy. Even with a real-life camera, it is impossible to avoid the interference of other people's noise on the viewers. If the noise reduction algorithm is used, the algorithm operation is very complicated in the extremely noisy sound field environment.
再次,从技术角度看,降噪处理会滤掉一部分声音较小的声音,如果此时用户处在较为偏远的观赛点,则该方法又会将场内正在比赛等部分有效声音过滤,即便用户对视该点,也无法听到其声音。降噪处理也不能够自行屏蔽或提取观赛者对视方向上的声音。Again, from a technical point of view, the noise reduction process will filter out a part of the sound with a small sound. If the user is at a more remote viewing point, the method will filter the effective sounds such as the game in the field, even if The user can't hear the sound when he views the point. The noise reduction process is also unable to shield or extract the sound of the viewer in the direction of the view.
最后,因为设备录音模式的局限性,用户所体验的现场声音效果较为单一,无法进行选择,即便添加所谓的“3D”、“杜比”等,也只是在原有录音基础上添加。Finally, because of the limitations of the device recording mode, the live sound effect experienced by the user is relatively simple and cannot be selected. Even if the so-called "3D" or "Dolby" is added, it is only added on the basis of the original recording.
针对相关技术中存在的用户只能体会某一个特定位置的观赛感,且会受到周围噪声的影响的问题,目前尚未提出有效的解决方案。For the problem that the user in the related art can only experience the sense of watching at a certain specific location and is affected by the surrounding noise, an effective solution has not been proposed yet.
发明内容Summary of the invention
本公开实施例提供了一种拾音设备、声音输出方法、装置及存储介质,以至少解决相关技术中存在的用户只能体会某一个特定位置的观赛感,且会受到周围噪声的影响的问题。The embodiment of the present disclosure provides a sound collecting device, a sound output method, a device, and a storage medium, so as to at least solve the problem that the user can only experience the sense of play of a certain specific position and is affected by the surrounding noise. problem.
根据本公开的一个实施例,提供了一种拾音设备,所述拾音设备被设置为与增强现实AR或虚拟现实VR设备连接,包括:主麦克风Mic以及处理器,其中,所述主Mic设置为获取所述主Mic对应视角上的第一声音源的声音;所述处理器连接至所述主Mic,设置为将所述主Mic获取的声音输出给所述AR或VR。According to an embodiment of the present disclosure, there is provided a sound pickup apparatus configured to be connected to an augmented reality AR or a virtual reality VR device, comprising: a main microphone Mic and a processor, wherein the main Mic And the processor is connected to the main Mic, and is configured to output the sound acquired by the main Mic to the AR or VR.
根据本公开的另一方面,还提供了一种声音输出方法,包括:利用拾音设备中的主麦克风Mic获取所述主Mic对应视角上的第一声音源的声音;将所述主Mic获取的声音输出给与所述拾音设备连接的增强现实AR或虚拟现实VR设备。According to another aspect of the present disclosure, there is also provided a sound output method comprising: acquiring a sound of a first sound source in a main Mic corresponding angle of view by using a main microphone Mic in a sound pickup device; acquiring the main Mic The sound is output to an augmented reality AR or virtual reality VR device connected to the sound pickup device.
根据本公开的另一个实施例,还提供了一种声音输出装置,包括:获取模块,设置为利用拾音设备中的主麦克风Mic获取所述主Mic对应视角上的第一声音源的声音;输出模块,设置为将所述主Mic获取的声音输出给与所述拾音设备连接的增强现实AR或虚拟现实VR设备。According to another embodiment of the present disclosure, there is also provided a sound output device, comprising: an acquisition module configured to acquire a sound of a first sound source at a corresponding angle of view of the main Mic by using a main microphone Mic in the sound pickup device; And an output module configured to output the sound acquired by the main Mic to an augmented reality AR or virtual reality VR device connected to the sound pickup device.
根据本公开的另一个实施例,还提供了一种存储介质,所述存储介质中存储有计算机程序,其中,所述计算机程序被设置为运行时执行上述任一项方法实施例中的步骤。According to another embodiment of the present disclosure, there is also provided a storage medium having stored therein a computer program, wherein the computer program is configured to execute the steps of any one of the method embodiments described above at runtime.
根据本公开的另一个实施例,还提供了一种电子装置,包括存储器和处理器,其中,该存储器中存储有计算机程序,该处理器被设置为运行所述计算机程序以执行上述任一项所述的方法。According to another embodiment of the present disclosure, there is also provided an electronic device comprising a memory and a processor, wherein the memory stores a computer program, the processor being configured to execute the computer program to perform any of the above Said method.
通过本公开,由于拾音设备中的主Mic只能设置为获取其对应视角上的声音,因此,不会获取非对应视角上的声音,从而有效屏蔽非对应视角上的声音,并且由于主Mic是采集对应视角上的声音的,因此,当拾音设备转动后,主Mic所正对的视角会随之发生变化,因此主Mic所采集的声音也会发生变化,实现了实时感知所处位置的声音,并且排除了主Mic非正对视角上的声音,增加用户对声音的深度沉浸感,解决了相关技术中存在的用户只能体会某一个特定位置的观赛感,且会受到周围噪声的影响的问题。Through the present disclosure, since the main Mic in the sound pickup device can only be set to acquire the sound on its corresponding viewing angle, the sound on the non-corresponding viewing angle is not acquired, thereby effectively shielding the sound on the non-corresponding viewing angle, and due to the main Mic The sound of the corresponding angle of view is collected. Therefore, when the sound pickup device rotates, the angle of view that the main Mic is facing will change accordingly, so the sound collected by the main Mic will also change, realizing the position in real time. The sound, and eliminates the sound of the main Mic non-positive perspective, increases the user's deep immersion to the sound, and solves the problem that the user in the related art can only experience the sense of play at a certain position, and is subject to ambient noise. The impact of the problem.
附图说明DRAWINGS
此处所说明的附图用来提供对本公开的进一步理解,构成本申请的一部分,本公开的示意性实施例及其说明用于解释本公开,并不构成对本公开的不当限定。在附图中:The drawings described herein are provided to provide a further understanding of the present disclosure, which is a part of the present disclosure, and the description of the present disclosure and the description thereof are not intended to limit the disclosure. In the drawing:
图1是本公开实施例的声音输出方法的移动终端的硬件结构框图;1 is a block diagram showing a hardware configuration of a mobile terminal of a sound output method according to an embodiment of the present disclosure;
图2是根据本公开实施例的声音输出方法的流程图;2 is a flowchart of a sound output method according to an embodiment of the present disclosure;
图3是根据本公开实施例的声音输出方法的整体流程图;3 is an overall flow chart of a sound output method according to an embodiment of the present disclosure;
图4是根据本公开实施例的自由声场示意图;4 is a schematic diagram of a free sound field in accordance with an embodiment of the present disclosure;
图5是根据本公开实施例的拾音孔的示意图;FIG. 5 is a schematic diagram of a sound collecting hole according to an embodiment of the present disclosure; FIG.
图6是根据本公开实施例的凸面示意图;6 is a schematic view of a convex surface in accordance with an embodiment of the present disclosure;
图7是根据本公开实施例的一维空间内现行孔径接收平面波示意图;7 is a schematic diagram of a current aperture receiving plane wave in a one-dimensional space according to an embodiment of the present disclosure;
图8是根据本公开实施例的角度示意图一;Figure 8 is a schematic perspective view of an embodiment of the present disclosure;
图9是根据本公开实施例的角度示意图二;Figure 9 is a perspective view of an embodiment of the present disclosure;
图10是根据本公开实施例的频率与波束宽度的关系示意图;10 is a schematic diagram of a relationship between frequency and beam width in accordance with an embodiment of the present disclosure;
图11是根据本公开实施例的水平方向上的极坐标示意图;11 is a schematic diagram of polar coordinates in a horizontal direction according to an embodiment of the present disclosure;
图12是根据本公开实施例的测试图一;Figure 12 is a test chart 1 in accordance with an embodiment of the present disclosure;
图13是根据本公开实施例的头肩模拟器示意图;13 is a schematic diagram of a head and shoulder simulator in accordance with an embodiment of the present disclosure;
图14是根据本公开实施例的测试图二;14 is a test chart 2 in accordance with an embodiment of the present disclosure;
图15是根据本公开实施例的测试图三;Figure 15 is a test chart three in accordance with an embodiment of the present disclosure;
图16是根据本公开实施例的测试图四;16 is a test chart 4 in accordance with an embodiment of the present disclosure;
图17是根据本公开实施例的声音输出装置的结构框图。17 is a structural block diagram of a sound output device according to an embodiment of the present disclosure.
具体实施方式detailed description
下文中将参考附图并结合实施例来详细说明本公开。需要说明的是,在不冲突的情况下,本申请中的实施例及实施例中的特征可以相互组合。The present disclosure will be described in detail below with reference to the drawings in conjunction with the embodiments. It should be noted that the embodiments in the present application and the features in the embodiments may be combined with each other without conflict.
需要说明的是,本公开的说明书和权利要求书及上述附图中的术语“第一”、“第二”等是用于区别类似的对象,而不必用于描述特定的顺序或先后次序。It is to be understood that the terms "first", "second", and the like in the specification and claims of the present disclosure are used to distinguish similar objects, and are not necessarily used to describe a particular order or order.
本申请实施例一所提供的方法实施例可以在终端,例如,移动终端、计算机终端或者类似的运算装置中执行。以运行在移动终端上为例,图1 是本公开实施例的声音输出方法的移动终端的硬件结构框图。如图1所示,移动终端10可以包括一个或多个(图1中仅示出一个)处理器102(处理器102可以包括但不限于微处理器MCU或可编程逻辑器件FPGA等的处理装置)、设置为存储数据的存储器104、以及设置为通信功能的传输装置106。本领域普通技术人员可以理解,图1所示的结构仅为示意,其并不对上述电子装置的结构造成限定。例如,移动终端10还可包括比图1中所示更多或者更少的组件,或者具有与图1所示不同的配置。The method embodiment provided in Embodiment 1 of the present application can be executed in a terminal, for example, a mobile terminal, a computer terminal, or the like. Taking a mobile terminal as an example, FIG. 1 is a hardware structural block diagram of a mobile terminal of a sound output method according to an embodiment of the present disclosure. As shown in FIG. 1, mobile terminal 10 may include one or more (only one shown in FIG. 1) processor 102 (processor 102 may include, but is not limited to, a processing device such as a microprocessor MCU or a programmable logic device FPGA. ), a memory 104 configured to store data, and a transmission device 106 configured as a communication function. It will be understood by those skilled in the art that the structure shown in FIG. 1 is merely illustrative and does not limit the structure of the above electronic device. For example, the mobile terminal 10 may also include more or fewer components than those shown in FIG. 1, or have a different configuration than that shown in FIG.
存储器104可设置为存储应用软件的软件程序以及模块,如本公开实施例中的声音输出方法对应的程序指令/模块,处理器102通过运行存储在存储器104内的软件程序以及模块,从而执行各种功能应用以及数据处理,即实现上述的方法。存储器104可包括高速随机存储器,还可包括非易失性存储器,如一个或者多个磁性存储装置、闪存、或者其他非易失性固态存储器。在一些实例中,存储器104可进一步包括相对于处理器102远程设置的存储器,这些远程存储器可以通过网络连接至移动终端10。上述网络的实例包括但不限于互联网、企业内部网、局域网、移动通信网及其组合。The memory 104 may be configured as a software program and a module for storing application software, such as program instructions/modules corresponding to the sound output method in the embodiment of the present disclosure, and the processor 102 executes each by executing a software program and a module stored in the memory 104. A functional application and data processing, that is, the above method is implemented. Memory 104 may include high speed random access memory, and may also include non-volatile memory such as one or more magnetic storage devices, flash memory, or other non-volatile solid state memory. In some examples, memory 104 may further include memory remotely located relative to processor 102, which may be connected to mobile terminal 10 over a network. Examples of such networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.
传输装置106设置为经由一个网络接收或者发送数据。上述的网络具体实例可包括移动终端10的通信供应商提供的无线网络。在一个实例中,传输装置106包括一个网络适配器(Network Interface Controller,简称为NIC),其可通过基站与其他网络设备相连从而可与互联网进行通讯。在一个实例中,传输装置106可以为射频(Radio Frequency,简称为RF)模块,其设置为通过无线方式与互联网进行通讯。Transmission device 106 is arranged to receive or transmit data via a network. The above-described network specific example may include a wireless network provided by a communication provider of the mobile terminal 10. In one example, the transmission device 106 includes a Network Interface Controller (NIC) that can be connected to other network devices through a base station to communicate with the Internet. In one example, the transmission device 106 can be a Radio Frequency (RF) module configured to communicate with the Internet wirelessly.
上述的终端可以是VR/AR设备,利用本公开实施例中的终端,用户可在任意所需场合,转动头的位置时,实时感知所处位置的声音效果,排除非对视方向上的声音,增加深度沉浸感;在本公开实施例中还可以提供可为用户选择的档位,让用户所达到极佳的体验。The terminal may be a VR/AR device. With the terminal in the embodiment of the present disclosure, the user can sense the sound effect of the position in real time when the position of the head is rotated in any desired occasion, and eliminate the sound in the non-optical direction. The depth immersion is increased; in the embodiment of the present disclosure, the gear position selectable by the user can also be provided, so that the user can achieve an excellent experience.
在本实施例中提供了一种运行于上述移动终端的声音输出方法,图2 是根据本公开实施例的声音输出方法的流程图,如图2所示,该流程包括如下步骤:In this embodiment, a sound output method running on the mobile terminal is provided. FIG. 2 is a flowchart of a sound output method according to an embodiment of the present disclosure. As shown in FIG. 2, the flow includes the following steps:
步骤S202,利用拾音设备中的主麦克风Mic获取所述主Mic对应视角上的第一声音源的声音;Step S202, acquiring, by using the main microphone Mic in the sound collecting device, the sound of the first sound source in the main Mic corresponding angle of view;
步骤S204,将上述主Mic获取的声音输出给与拾音设备连接的增强现实AR或虚拟现实VR设备。Step S204, outputting the sound acquired by the main Mic to the augmented reality AR or the virtual reality VR device connected to the sound collecting device.
其中,执行上述操作的终端可以是上述拾音设备该拾音设备可以与AR或VR设备连接以作为AR或VR的一部分,其中,利用拾音设备中的主Mic获取该主Mic对应视角上的声音。上述拾音设备中的主Mic的数量可以是一个或多个。The terminal performing the above operation may be the above-mentioned sound pickup device, and the sound pickup device may be connected to the AR or VR device as a part of the AR or the VR, wherein the main Mic in the sound pickup device is used to acquire the main Mic corresponding perspective. sound. The number of master Mic in the above-described sound pickup device may be one or more.
通过上述实施例,由于拾音设备中的主Mic只能设置为获取其对应视角上的声音,因此,不会获取非对应视角上的声音,从而有效屏蔽非对应视角上的声音,并且由于主Mic是采集对应视角上的声音的,因此,当拾音设备转动后,主Mic所正对的视角会随之发生变化,因此主Mic所采集的声音也会发生变化,实现了实时感知所处位置的声音,并且排除了Mic非对应视角上的声音,增加用户对声音的深度沉浸感,解决了相关技术中存在的用户只能体会某一个特定位置的观赛感,且会受到周围噪声的影响的问题。With the above embodiment, since the main Mic in the sound pickup device can only be set to acquire the sound on its corresponding viewing angle, the sound on the non-corresponding viewing angle is not acquired, thereby effectively shielding the sound on the non-corresponding viewing angle, and Mic collects the sound at the corresponding angle of view. Therefore, when the pickup device rotates, the angle of view that the main Mic is facing will change accordingly, so the sound collected by the main Mic will also change, realizing the real-time perception. The sound of the position, and the sound of the Mic non-corresponding angle is excluded, which increases the depth immersion of the user, and solves the problem that the user can only experience the sense of play at a certain position and suffers from surrounding noise. The problem of impact.
在一个可选的实施例中,在将主Mic获取的声音输出给与拾音设备连接的增强现实AR或虚拟现实VR设备之前,上述方法还包括:利用拾音设备中的辅Mic获取辅Mic对应视角上的第二声音源的声音,其中,辅Mic设置在主Mic的周围;将主Mic获取的声音输出给与拾音设备连接的增强现实AR或虚拟现实VR设备包括:将主Mic获取的声音和辅Mic获取的声音进行合成,并将合成后的声音输出给与拾音设备连接的AR或VR设备。在本实施例中,利用上述拾音设备中的辅Mic获取所述辅Mic对应视角上的声音两个操作是没有必然的先后顺序的。需要说明的是,在实际应用时,也可以只设置主Mic,设置辅助Mic的目的是为了更好的感 知周围特定位置上的声音,在一定程度上增加声音的立体感。In an optional embodiment, before outputting the sound acquired by the main Mic to the augmented reality AR or the virtual reality VR device connected to the sound collecting device, the method further includes: acquiring the auxiliary Mic by using the auxiliary Mic in the sound collecting device. Corresponding to the sound of the second sound source in the viewing angle, wherein the auxiliary Mic is disposed around the main Mic; outputting the sound acquired by the main Mic to the augmented reality AR or the virtual reality VR device connected to the sound collecting device comprises: acquiring the main Mic The sound is synthesized with the sound acquired by the auxiliary Mic, and the synthesized sound is output to the AR or VR device connected to the sound pickup device. In this embodiment, the two operations of acquiring the sound of the auxiliary Mic corresponding angle of view by using the auxiliary Mic in the above-mentioned sound pickup device are not in a proper order. It should be noted that, in actual application, only the main Mic may be set, and the purpose of setting the auxiliary Mic is to better sense the sound at a specific position around, and to increase the stereoscopic effect of the sound to a certain extent.
在一个可选的实施例中,利用拾音设备中的主Mic获取主Mic对应视角上的第一声音源的声音包括:配置主Mic的拾音孔的孔径大小和/或拾音孔深度;利用配置了所述孔径大小和/或拾音孔的深度的主Mic获取所述主Mic对应视角上的第一声音源的声音。在本实施例中,主Mic的拾音孔的孔径大小是可调的,拾音孔的深度也是可调的,优选地,可以为主Mic配置几种不同规格的拾音孔的孔径大小,例如,配置24mm、12mm、6mm、3mm,或者为主Mic配置可以连续调节孔径大小的拾音孔,具体配置可以视具体情况而定。此外,在具体配置时,也可以由使用者手动调节,或者根据主Mic对应视角上的声音源的声音的大小来自动进行调节,或者根据来自用户输入的配置信息,或者来自其他设备的配置信息来进行调节,其中,该其他设备可以是与该拾音设备连接的设备,例如,拾音设备为设置在赛场的设备,该其他设备为室内的由用户使用的且与该拾音设备通过无线方式连接的控制器,从而可以实现对拾音设备的远程控制。In an optional embodiment, acquiring the sound of the first sound source in the main Mic corresponding angle of view by using the main Mic in the sound collecting device comprises: configuring an aperture size of the sound collecting hole of the main Mic and/or a sound collecting hole depth; The sound of the first sound source at the main Mic corresponding angle of view is acquired by the main Mic configured with the aperture size and/or the depth of the sound collecting hole. In this embodiment, the aperture size of the sound collecting hole of the main Mic is adjustable, and the depth of the sound collecting hole is also adjustable. Preferably, the aperture size of the sound collecting holes of different specifications can be configured for the main Mic. For example, the configuration of 24mm, 12mm, 6mm, 3mm, or the main Mic configuration can continuously adjust the aperture size of the pickup hole, the specific configuration can be determined according to the specific circumstances. In addition, in a specific configuration, it may be manually adjusted by the user, or automatically adjusted according to the size of the sound of the sound source corresponding to the main Mic, or according to configuration information input from the user, or configuration information from other devices. To adjust, wherein the other device may be a device connected to the sound collecting device, for example, the sound collecting device is a device disposed in a field, the other device is used by the user in the room and wirelessly connected to the sound collecting device The controller is connected in a manner so that remote control of the sound pickup device can be realized.
正如上述实施例中所陈述的,在对主Mic的拾音孔的孔径大小进行配置时,可以利用输入的配置信息进行配置,若在一定时间内未收到该配置信息,则可以采用默认的配置方式,在本实施例中,配置上述主Mic的拾音孔的孔径大小和/或拾音孔深度包括:在接收到输入的配置信息的情况下,根据上述配置信息配置主Mic的拾音孔的孔径大小和/或拾音孔深度;在未接收到输入的配置信息的情况下,根据预设值配置主Mic的拾音孔的孔径大小和/或拾音孔深度,例如,预先设定默认的孔径大小为12mm,该预设的孔径大小是可以由用户自由设置的。As stated in the above embodiment, when the aperture size of the pickup hole of the main Mic is configured, the configuration information of the input can be used for configuration. If the configuration information is not received within a certain period of time, the default configuration can be adopted. In this embodiment, configuring the aperture size and/or the pickup depth of the sound collecting hole of the main Mic includes: configuring the main Mic pickup according to the configuration information when receiving the input configuration information. The aperture size of the aperture and/or the depth of the pickup hole; in the case where the input configuration information is not received, the aperture size of the pickup hole of the main Mic and/or the depth of the pickup hole are configured according to a preset value, for example, preset The default aperture size is 12mm, and the preset aperture size can be freely set by the user.
在一个可选的实施例中,在利用拾音设备中的主Mic获取所述主Mic对应视角上的第一声音源的声音之前,上述方法还包括:确定拾音设备需要转动的方向及幅度;利用确定的拾音设备需要转动方向及幅度控制拾音设备进行转动。在本实施例中,拾音设备是可以自由转动的,若拾音设备是用户头部的可穿戴设备,那么拾音设备会随着用户头部的转动而转动,若拾音设备是设置在赛场中的设备,而用户在室内时,用户可以通过与该 拾音设备连接的控制设备控制该拾音设备的转动,本实施例主要针对的是对拾音设备的远程控制,从而实现了拾音设备根据用户的控制来进行移动的目的,从而使得用户可以感知任何方向上的声音,提高用户体验度。同样地,辅Mic和主Mic是类似的,也可以进行拾音孔大小的配置,在此,不再赘述。In an optional embodiment, before acquiring the sound of the first sound source in the main Mic corresponding viewing angle by using the main Mic in the sound collecting device, the method further includes: determining a direction and a range of the sound pickup device that needs to be rotated. Using the determined pickup device, the direction and amplitude of rotation are required to control the pickup device to rotate. In this embodiment, the sound collecting device is freely rotatable. If the sound collecting device is a wearable device of the user's head, the sound collecting device rotates with the rotation of the user's head, if the sound collecting device is disposed at The equipment in the field, while the user is indoors, the user can control the rotation of the sound collecting device through the control device connected to the sound collecting device, and the embodiment is mainly directed to the remote control of the sound collecting device, thereby realizing the picking up The sound device moves according to the user's control, so that the user can sense the sound in any direction and improve the user experience. Similarly, the auxiliary Mic and the main Mic are similar, and the configuration of the pickup hole size can also be performed, and details are not described herein again.
在一个可选地实施例中,在利用拾音设备中的主Mic获取所述主Mic对应视角上的第一声音源的声音之前,上述方法还包括:确定所述主Mic需要转动的方向及幅度;利用确定的所述主Mic需要转动的方向及幅度控制所述主Mic进行转动。在本实施例中,可以配置拾音设备是不动的,但是主Mic可以灵活转动。In an optional embodiment, before acquiring the sound of the first sound source in the main Mic corresponding viewing angle by using the main Mic in the sound collecting device, the method further includes: determining a direction in which the main Mic needs to rotate and Amplitude; controlling the main Mic to rotate by using the determined direction and amplitude of the main Mic to be rotated. In this embodiment, the pickup device can be configured to be stationary, but the main Mic can be flexibly rotated.
下面以拾音设备是可穿戴设备为例,结合附图对本公开中的声音输出方法进行整体说明:Hereinafter, the sound output method in the present disclosure will be generally described with reference to the accompanying drawings, taking the sound pickup device as a wearable device as an example:
图3是根据本公开实施例的声音输出方法的整体流程图,如图3所示,该方法包括如下步骤:FIG. 3 is an overall flowchart of a sound output method according to an embodiment of the present disclosure. As shown in FIG. 3, the method includes the following steps:
【001】用户开始使用设备(该设备即为拾音设备);[001] The user starts using the device (the device is the pickup device);
【002】集成主Mic(对应于上述的主Mic)开始随头肩移动,并进行录音,判断是否采用默认档位(假设默认档位是D档,不同的档位对应于不同的主Mic的拾音孔的孔径大小,在本实施例中是假设有四种孔径大小的拾音孔),若是,则转至步骤【004】,若否,则转至步骤【005】;[002] The integrated main Mic (corresponding to the main Mic described above) starts to move with the head and shoulders, and records to determine whether to adopt the default gear position (assuming the default gear position is D gear, different gear positions correspond to different main Mic The aperture size of the sound collecting hole, in this embodiment is assumed to have four apertures of the sound hole), if yes, go to step [004], if not, go to step [005];
【003】位于头肩双耳处的双辅Mic进行录音;并经过电声信号传递至用户耳机;[003] The dual-auxiliary Mic located at the ears of the head and shoulders is recorded; and is transmitted to the user's earphone via an electro-acoustic signal;
【004】经过【002】集成主Mic录制后,用户选择D默认档位;则经过电声信号传递至用户耳机;[004] After the [002] integrated main Mic is recorded, the user selects the D default gear position; then the electroacoustic signal is transmitted to the user's earphone;
【005】经过【002】集成主Mic录制后,用户不选择D默认档位;[005] After the [002] integrated main Mic is recorded, the user does not select the D default gear position;
【006】当用户不选则默认档位时,其它通路继续进行声音的录制,并对应相应地通路,也就是说,当用户不选择默认档位时(此时也没有选择其他档位的过程中),ABCD四个档位会分别会与ABCD四个通路一 一对应,各个通路会同时进行声音的录制;[006] When the user does not select the default gear position, the other channels continue to record the sound and correspond to the corresponding path, that is, when the user does not select the default gear position (the process of selecting other gear positions at this time) In the middle), the four positions of the ABCD will correspond to the four channels of the ABCD, and the sound recording will be performed simultaneously in each channel;
【007】用户选择其它档位后,系统自动切换到对应档位的通路,并与耳机通路建立连接,进行声音传输。其它非用户选择档位关闭与耳机的通路。[007] After the user selects other gear positions, the system automatically switches to the path of the corresponding gear position, and establishes a connection with the earphone path for sound transmission. Other non-user-selected gear positions close the path to the headset.
在上述实施例中,主要对声音的录制和输出进行了说明,下面对上述的拾音设备进行详细说明:In the above embodiment, the recording and output of the sound are mainly explained. The above-mentioned pickup device will be described in detail below:
首先,对本公开实施例的实施原理进行说明:First, the implementation principle of the embodiment of the present disclosure is explained:
声波是在自由空间中传播的,也就是在无线理想媒质中传播的,由于其边界是无限的,所以可以把我们解决的声场看成一个定向球面体。Sound waves are transmitted in free space, that is, in wireless ideal media. Because the boundary is infinite, we can regard the sound field we solve as a directional spherical body.
如果建立以球面中心为原点的坐标系,如图4所示:If you establish a coordinate system with the spherical center as the origin, as shown in Figure 4:
则有:x=r*cosA(A为通过该目标点的半径与x轴夹角)Then there is: x=r*cosA (A is the angle between the radius of the target point and the x-axis)
y=r*cosB(B为通过该目标点的半径与y轴夹角)y=r*cosB (B is the angle between the radius of the target point and the y-axis)
z=r*cosC(C为通过该目标点的半径与z轴夹角)z=r*cosC (C is the angle between the radius of the target point and the z-axis)
首先利用一个平面进行说明(另外两个平面采用相同的方法加入两个变量),以自由声场为例,图4所示为一个球状的一个平面,其中‘O’为人所在的位置,‘A、B、C’为三个不同的音源,人在水平面上做视角运动,我们需要解决的问题就是如何确定‘A、B、C’音源,从而体验‘A、B、C’不同位置上的听觉效果。First, use a plane to illustrate (the other two planes use the same method to add two variables), taking the free sound field as an example. Figure 4 shows a spherical plane, where 'O' is the location of the person, 'A, B, C' are three different sound sources. People do perspective motion on the horizontal plane. The problem we need to solve is how to determine the 'A, B, C' sound sources, and thus experience the hearing at different positions of 'A, B, C'. effect.
Mic的性能可以用一系列客观参数进行描述,包括灵敏度、平率相应、等效噪声级、指向性、动态范围等内容。以应用环境是比赛场地为例,一般比赛场地最低至最高的声压级一般在60dB~110dB之间,传统的Mic直径一般为24mm、12mm、6mm、3mm四种,其频响为20Hz~40kHz,近似的可以看成全指向性,声压级的测量范围为30dB~140dB;本公开实施例中的拾音设备所处的环境声压级可以是在Mic测量声压级范围之内。Mic's performance can be described by a series of objective parameters, including sensitivity, flatness, equivalent noise level, directivity, dynamic range, and more. Taking the application environment as the competition venue as an example, the lowest to highest sound pressure level of the general competition venue is generally between 60dB and 110dB. The traditional Mic diameter is generally 24mm, 12mm, 6mm, 3mm, and the frequency response is 20Hz~40kHz. The approximation can be regarded as omnidirectional, and the measurement range of the sound pressure level is 30 dB to 140 dB; the ambient sound pressure level of the sound pickup device in the embodiment of the present disclosure may be within the range of the Mic measurement sound pressure level.
在Mic中,拾音孔(也可称为进音孔或进音通道)是十分重要的获取外部声源的重要部分,具体可参见图5所示。在本实施例中可以通过平面 旋转、增加拓宽拾音孔的孔径及长度来解决进入该通道声音以及屏蔽其它非对视位置的音源进入问题。此外,在本公开中,采用了对比剥离的方法,即通过添加2个全向Mic作为辅Mic实时采集声音,然后主Mic通过拓宽拾音孔孔径及长度进行旋转采集,然后进行合成得到最终的声音。In Mic, the sound hole (also known as the sound hole or the sound channel) is an important part of acquiring an external sound source. See Figure 5 for details. In this embodiment, the problem of entering the sound of the channel and shielding other non-opposing positions can be solved by plane rotation, increasing the aperture and length of the sound collecting hole. In addition, in the present disclosure, the method of contrast stripping is adopted, that is, the sound is collected in real time by adding two omnidirectional Mic as the auxiliary Mic, and then the main Mic is rotated and collected by widening the aperture and length of the sound collecting hole, and then synthesized to obtain the final result. sound.
基于上述目的,在本公开的一个实施例中,提供了一种拾音设备,该拾音设备被设置为与增强现实AR或虚拟现实VR设备连接,包括:主Mic以及处理器,其中,该主Mic设置为获取主Mic对应视角上的第一声音源的声音;上述处理器连接至主Mic和辅Mic,设置为将主Mic获取的声音输出给AR或VR。在本实施例中,由于拾音设备中的主Mic只能设置为获取其对应视角上的声音,因此,不会获取非对应视角上的声音,从而有效屏蔽非对应视角上的声音,并且由于主Mic是采集正对方向上的声音的,因此,当拾音设备转动后,主Mic所正对的方向会随之发生变化,因此主Mic所采集的声音也会发生变化,实现了实时感知所处位置的声音,并且排除了Mic非正对方向上的声音,增加用户对声音的深度沉浸感,解决了相关技术中存在的用户只能体会某一个特定位置的观赛感,且会受到周围噪声的影响的问题。Based on the above object, in an embodiment of the present disclosure, a sound pickup device is provided, the sound pickup device being configured to be connected to an augmented reality AR or a virtual reality VR device, including: a main Mic and a processor, wherein The main Mic is set to acquire the sound of the first sound source at the main Mic corresponding angle of view; the above processor is connected to the main Mic and the auxiliary Mic, and is arranged to output the sound acquired by the main Mic to the AR or VR. In this embodiment, since the main Mic in the sound pickup device can only be set to acquire the sound on the corresponding viewing angle, the sound on the non-corresponding viewing angle is not acquired, thereby effectively shielding the sound on the non-corresponding viewing angle, and The main Mic is to collect the sound in the opposite direction. Therefore, when the pickup device rotates, the direction that the main Mic is facing will change accordingly, so the sound collected by the main Mic will also change, realizing the real-time perception. The sound of the position, and the sound of the Mic non-positive direction is excluded, which increases the user's deep immersion to the sound, and solves the problem that the user in the related art can only experience the sense of play at a certain position and is subject to ambient noise. The impact of the problem.
在一个可选地实施例中,上述拾音设备还包括辅Mic,其中,辅Mic设置在主Mic的周围,设置为获取辅Mic对应视角上的第二声音源的声音;处理器还设置为连接至辅Mic,设置为将主Mic获取的声音和辅Mic获取的声音进行合成并输出合成后的声音给AR或VR。In an optional embodiment, the sound collecting device further includes a secondary Mic, wherein the auxiliary Mic is disposed around the main Mic, and is configured to acquire a sound of the second sound source corresponding to the auxiliary Mic; the processor is further configured to Connected to the auxiliary Mic, it is set to synthesize the sound acquired by the main Mic and the sound acquired by the auxiliary Mic and output the synthesized sound to the AR or VR.
在一个可选地实施例中,上述处理器还设置为配置主Mic的拾音孔的孔径大小和/或拾音孔的深度。正如前述所陈述的,主Mic的拾音孔的孔径大小是可调的,拾音孔的深度也是可调的,具体内容可参见前述方法实施例,同样地,辅Mic的拾音孔的孔径大小和拾音孔的深度也可以设置为可调的,其配置方式与主Mic的类似,在此,不再赘述。In an alternative embodiment, the processor is further configured to configure the aperture size of the pickup aperture of the main Mic and/or the depth of the pickup aperture. As stated above, the aperture size of the pickup hole of the main Mic is adjustable, and the depth of the pickup hole is also adjustable. For details, refer to the foregoing method embodiment, and similarly, the aperture of the pickup hole of the auxiliary Mic. The size and the depth of the pickup hole can also be set to be adjustable, and the configuration thereof is similar to that of the main Mic, and will not be described again here.
在一个可选地实施例中,上述处理器可以通过如下方式配置主Mic的拾音孔的孔径大小和/或拾音孔的深度:在接收到输入的配置信息的情况 下,根据配置信息配置主Mic的拾音孔的孔径大小和/或拾音孔深度;在未接收到输入的配置信息的情况下,根据预设值配置所述主Mic的拾音孔的孔径大小和/或拾音孔深度。In an optional embodiment, the processor may configure the aperture size of the sound hole of the main Mic and/or the depth of the sound collection hole by: configuring the configuration information according to the configuration information when the input configuration information is received. The aperture size of the pickup hole of the main Mic and/or the depth of the pickup hole; in the case where the input configuration information is not received, the aperture size and/or the pickup of the pickup hole of the main Mic is configured according to a preset value. Hole depth.
在一个可选地实施例中,上述处理器还设置为:确定拾音设备需要转动的方向及幅度;利用确定的上述拾音设备需要转动方向及幅度控制拾音设备进行转动。In an alternative embodiment, the processor is further configured to: determine a direction and an amplitude of the sound pickup device to be rotated; and use the determined sound pickup device to control the rotation direction and amplitude to control the sound pickup device to rotate.
在一个可选地实施例中,上述处理器还设置为:确定主Mic需要转动的方向及幅度;利用确定的主Mic需要转动的方向及幅度控制主Mic进行转动。In an optional embodiment, the processor is further configured to: determine a direction and an amplitude of the main Mic to be rotated; and control the main Mic to rotate by using the determined direction and amplitude of the main Mic to be rotated.
在一个可选地实施例中,当上述拾音设备为用户头部的可穿戴设备时,上述主Mic包括位于可穿戴设备的前端的主Mic,其中,该前端被设置为与佩戴者的前额相对应。在本实施例中,辅Mic的数量可以是两个,分别位于可穿戴设备的对应用户左耳的位置和对应用户右耳的位置。需要说明的是,将主Mic设置在额头处并且将辅Mic设置在双耳处是较为优选的设置方式,具体原因如下所示:In an optional embodiment, when the sound collecting device is a wearable device of a user's head, the main Mic includes a main Mic located at a front end of the wearable device, wherein the front end is set to be the forehead of the wearer Corresponding. In this embodiment, the number of the secondary Mic may be two, respectively located at a position of the left ear of the corresponding user of the wearable device and a position corresponding to the right ear of the user. It should be noted that setting the main Mic at the forehead and setting the auxiliary Mic at the ears is a more preferable setting method, and the specific reasons are as follows:
A、主Mic的实施设计算法A, the main Mic implementation design algorithm
1.在本实施例中将主Mic作为一个突出面来设计,利用凸面反射的原理,几乎所有的凸面都具有散射的作用,它们是作为扩散体的重要反射面,因为对于凸面而言,其r永远是负值的(如图6所示)。1. In the present embodiment, the main Mic is designed as a protruding surface. With the principle of convex reflection, almost all convex surfaces have scattering effects, and they are important reflecting surfaces as diffusing bodies because for convex surfaces, r is always negative (as shown in Figure 6).
如果以负值带入到凸面方程:
Figure PCTCN2019075489-appb-000001
中,那么b也会是负值,各参数已标注在图6上。综上,Q1为用户所在的位置,S为主Mic位置,则Q2声源传递的声波会进入S主Mic的通道内,其它波形会在球面进行反射,如A、B点。相反的,若Q2不在图6所示的位置上,而在其它位置的话,其波形传递必定形成散射,也就是说可以进行屏蔽。
If you take a negative value into the convex equation:
Figure PCTCN2019075489-appb-000001
In the case, then b will also be a negative value, and each parameter has been marked on Figure 6. In summary, Q1 is the location of the user, and S is the main Mic position. The sound waves transmitted by the Q2 sound source will enter the channel of the S main Mic, and other waveforms will be reflected on the spherical surface, such as points A and B. Conversely, if Q2 is not in the position shown in Figure 6, but in other positions, the waveform transmission must form a scattering, that is, it can be shielded.
2.当确认了上述‘1’中所描述的现象后,因为本公开实施例的目的中 的第一个步骤是就将主Mic定义为可旋转进行主对视音源采集的器件,所以下一步进行将对主Mic的拾音孔进行孔径拓宽处理。2. After confirming the phenomenon described in the above '1', since the first step in the purpose of the embodiment of the present disclosure is to define the main Mic as a device that can be rotated for the main-pair video source acquisition, the next step The aperture widening process will be performed on the pickup hole of the main Mic.
3.这里的孔径表示为将一个声信号转换为电信号的电声传感器(Mic)。3. The aperture here is expressed as an electroacoustic sensor (Mic) that converts an acoustic signal into an electrical signal.
4.为了更好地说明问题,可以增加几个变量,即体积为V的Mic接收孔径,考虑一体积为V的接收孔径,x(t,r)表示在时间t和空间r处信号的值。接收孔径在r处的一个无限小的体积dV的冲击响应为a(t,r),那么接收到的信号可以用卷积表示:4. To better illustrate the problem, several variables can be added, namely the Mic receiving aperture of volume V, considering a receiving aperture of volume V, and x(t,r) representing the value of the signal at time t and space r . The impulse response of an infinitesimal volume dV at the receiving aperture at a is a(t,r), then the received signal can be represented by convolution:
x R(t,r)=∫x(i,r)a(t-i,r)di x R (t,r)=∫x(i,r)a(ti,r)di
或用其频域表示X R(f,r)=X(f,r)A(f,r)  (1) Or use its frequency domain to represent X R (f,r)=X(f,r)A(f,r) (1)
其中A(f,r)是孔径函数,可以通过该孔径函数知道孔径在不同空间大小下所反映的相应函数。Where A(f,r) is the aperture function, and the aperture function can be used to know the corresponding function reflected by the aperture in different spatial sizes.
5.因为接收孔径对于不同方向传来的信号而言,该接收孔径所张开的立体角是不同的,如图7所示,图7表示的是一维空间内线性孔径接收平面波的信号。5. Because the receiving aperture is different for the signals transmitted in different directions, the solid angle of the receiving aperture is different. As shown in FIG. 7, FIG. 7 shows the signal of the linear aperture receiving plane wave in the one-dimensional space.
孔径的响应是进入孔径的信号的频率和入射方向的函数,通过求解波动方程可以推导出,孔径响应与孔径函数是存在傅里叶变换的关系。特别的,在本公开实施例中所描述的场景中(以比赛场地为例)为远场条件,其孔径的响应函数的表示方式为:The response of the aperture is a function of the frequency of the signal entering the aperture and the direction of incidence. It can be derived by solving the wave equation that the aperture response and the aperture function are in the presence of a Fourier transform. In particular, in the scenario described in the embodiment of the present disclosure (taking the game venue as an example), the far field condition is expressed by the response function of the aperture:
Figure PCTCN2019075489-appb-000002
Figure PCTCN2019075489-appb-000002
Fr{.}是三维的傅里叶变换,Fr{.} is a three-dimensional Fourier transform,
Figure PCTCN2019075489-appb-000003
是一个点在孔径上的空间位置。
Figure PCTCN2019075489-appb-000003
Is a spatial location of the point on the aperture.
Figure PCTCN2019075489-appb-000004
是波的方向矢量,θ和φ可以参见图8。
Figure PCTCN2019075489-appb-000004
It is the direction vector of the wave, and θ and φ can be seen in Fig. 8.
为了更简单的说明问题以及获得孔径响应,可以将图8所示的坐标简化为沿着X轴方向的一维线性孔径,孔径长度为L,如图9所示。In order to explain the problem more simply and obtain the aperture response, the coordinates shown in FIG. 8 can be simplified to a one-dimensional linear aperture along the X-axis direction, and the aperture length is L, as shown in FIG.
在图9所示的情况下:In the case shown in Figure 9:
Figure PCTCN2019075489-appb-000005
Figure PCTCN2019075489-appb-000005
一维线性孔径,那么在这种情况下One-dimensional linear aperture, then in this case
孔径响应简化为:The aperture response is simplified to:
Figure PCTCN2019075489-appb-000006
Figure PCTCN2019075489-appb-000006
其中:among them:
Figure PCTCN2019075489-appb-000007
Figure PCTCN2019075489-appb-000007
如果将上面式子用θ和φ来表达的话,则为:If the above expression is expressed by θ and φ, then:
Figure PCTCN2019075489-appb-000008
Figure PCTCN2019075489-appb-000008
上面的算法是在平面波假设的条件下得到的,所以只适用于远场的条件。对于线性的孔径,应该满足下面的条件时可以认为满足远场条件。The above algorithm is obtained under the assumption of the plane wave, so it is only applicable to the conditions of the far field. For a linear aperture, the following conditions should be met to satisfy the far field condition.
Figure PCTCN2019075489-appb-000009
Figure PCTCN2019075489-appb-000009
考虑一个特定的情况,如果线性孔径,孔径函数不随着频率位置变化,那么,孔径函数可以写为:Consider a specific case, if the linear aperture does not change with the frequency position, then the aperture function can be written as:
A R(x α)=rect(x α/L)    (5) A R (x α )=rect(x α /L) (5)
傅里叶变换结果为:The result of the Fourier transform is:
D R(f,a x)=Lsinc(α x/L)}  
Figure PCTCN2019075489-appb-000010
D R (f, a x )=Lsinc(α x /L)}
Figure PCTCN2019075489-appb-000010
综上,通过计算均匀孔径函数和相应地方向性孔径函数得出图形(如图10所示),从图10中可以看出方向性孔径函数的零点分布在α x=mλ/L,其中m为整数。方向性范围就可以得出,范围是:-λ/L≤α x≤λ/L之间的区域之间的区域被称为主瓣,其范围就是作波束宽度。因此,对于一个固定的孔径长度,频率越高,波束宽度越窄,如图10所示。 In summary, the figure is obtained by calculating the uniform aperture function and the corresponding directional aperture function (as shown in Fig. 10). It can be seen from Fig. 10 that the zero point distribution of the directional aperture function is α x = mλ/L, where m Is an integer. The range of directionality can be derived from the range between: - λ / L α x ≤ λ / L The area between the areas is called the main lobe, and the range is the beam width. Therefore, for a fixed aperture length, the higher the frequency, the narrower the beam width, as shown in FIG.
根据上述计算可知,对于一个固定的孔径长度,频率越高,波束宽度越窄,归一化的口径响应长度为:According to the above calculation, for a fixed aperture length, the higher the frequency, the narrower the beam width, and the normalized aperture response length is:
口径响应长度在水平方向上可以表示为:The length of the caliber response can be expressed in the horizontal direction as:
Figure PCTCN2019075489-appb-000011
Figure PCTCN2019075489-appb-000011
由(7)公式可以得出在水平方向上的极坐标的表示方式,那么在L/λ的条件下极坐标如图11所示,分别为L/λ=0.5、1、2、4四个不同数值。From the formula (7), the representation of the polar coordinates in the horizontal direction can be obtained. Then, under the condition of L/λ, the polar coordinates are as shown in Fig. 11, which are respectively L/λ=0.5, 1, 2, and 4. Different values.
通过上述公式(1)~(7)的计算,可以得出线性孔径特性。The linear aperture characteristics can be obtained by the calculations of the above formulas (1) to (7).
所以,主Mic的实施设计算法所描述的线性孔径特性,结合水平方向上的线性孔径特征公式:Therefore, the main Mic implementation design algorithm describes the linear aperture characteristics, combined with the linear aperture characteristic formula in the horizontal direction:
Figure PCTCN2019075489-appb-000012
Figure PCTCN2019075489-appb-000012
Figure PCTCN2019075489-appb-000013
Figure PCTCN2019075489-appb-000013
其中,w n(f)是传感器的权重参数,C是在推导公式时,将原有的 λ明确写为f后变为的C,原有公式可参见公式(9)。 Where w n (f) is the weight parameter of the sensor, and C is C when the original λ is explicitly written as f when deriving the formula. The original formula can be found in equation (9).
得出,无论是线性的Mic或者等间距分布的Mic阵列下的孔径特征,都取决于以下几个条件:It is concluded that either the linear Mic or the aperture characteristics under the equally spaced Mic array depend on the following conditions:
传感器的数量N;The number of sensors N;
传感器间的间距d;The spacing d between the sensors;
声波的频率f;The frequency of the sound wave f;
由于离散传感器阵列是连续孔径的一种近似。有一点需要注意的是,传感器阵列的有效长度定义为相应的连续孔径的长度,为L=Nd,而传感器阵列的实际长度是d(N-1)。Since the discrete sensor array is an approximation of the continuous aperture. It should be noted that the effective length of the sensor array is defined as the length of the corresponding continuous aperture, L = Nd, and the actual length of the sensor array is d (N-1).
通过图6和图7所描述的散射和频率和入射方向的函数,就可以较为准确的识别到对视方向上的音源。By means of the scattering and frequency and incident direction functions described in Figures 6 and 7, the source in the opposite direction can be more accurately identified.
下面结合仿真结果对本公开进行说明:The present disclosure will be described below in conjunction with simulation results:
为了便于理解,以及确认本公开实施例中对主Mic和双辅Mic的摆放位置。所以在音频实验室做如下实验室论证,在本次试验中引入了符合国际标准的B&K“头肩模拟器”(HATS)、测试环境为标准全消音室。(测试图可参见图12)In order to facilitate understanding, and to confirm the placement position of the main Mic and the dual auxiliary Mic in the embodiment of the present disclosure. Therefore, the following laboratory demonstrations were made in the audio laboratory. In this test, the B&K "Head and Shoulder Simulator" (HATS) conforming to international standards was introduced, and the test environment was a standard silencer. (Test chart can be seen in Figure 12)
A、两个固定辅Mic测试A, two fixed auxiliary Mic test
a.进行实验室测试时,采用标准“头肩模拟器”(丹麦4128C型头与躯干模拟器),测试图可参见图13。a. For laboratory testing, the standard "head and shoulder simulator" (Danish 4128C head and torso simulator) is used. The test chart can be seen in Figure 13.
由图13所示,如果将两个辅Mic放置于头肩模拟器人工耳位置时,由于人耳耳廓朝向,所形成的主轴线夹角角度恰为图13中所表示的主轴线夹角,且根据以往对终端产品音频测试经验,放在此处,所形成的∠H(固定支架国际通用角度)所测试的指标与中移动实验室其相关局方实验室测试相同。As shown in FIG. 13, if two auxiliary Mic are placed in the head and shoulder simulator artificial ear position, since the human ear auricle is oriented, the angle formed by the main axis is exactly the angle of the main axis shown in FIG. According to the previous experience of audio testing of terminal products, the ∠H (fixed bracket international general angle) formed by the test is the same as the relevant laboratory test of China Mobile Lab.
b.为了进一步论证,进行如下实验,所搭建的环境如图14所示。在本 次试验中可以将“人工耳”近似的看成辅Mic(左右耳各一个),使用标准音箱播放P.501音源信号/标准英文对话。b. For further demonstration, the following experiment was carried out, and the constructed environment is shown in Fig. 14. In this test, the “artificial ear” can be approximated as the auxiliary Mic (one for each of the left and right ears), and the P.501 source signal/standard English dialogue can be played using a standard speaker.
测试策略如下(具体可参见附图15):The test strategy is as follows (see Figure 15 for details):
a、放置在本公开实施例中所设计的位置,测试1020Hz,8个频点的接收失真情况;a, placed in the position designed in the embodiment of the present disclosure, testing 1020Hz, 8 frequency points of receiving distortion;
b、按照标准头肩模拟器的刻度,将两侧的辅Mic上移6cm,测试1020Hz,8个频点的接收失真情况;b. According to the scale of the standard head and shoulder simulator, move the auxiliary Mic on both sides up by 6cm, and test the receiving distortion of 1020Hz and 8 frequency points;
测试结论详见表1:The test conclusions are detailed in Table 1:
表1Table 1
方案Program 次数frequency 测试结论Test conclusion
a.本专利设计位置a. The design position of this patent 5050 PassPass
b.上移6cm位置b. Move up 6cm position 5050 FailFail
综上可知,本公开实施例中将两个辅Mic放置在“头肩模拟器”双耳位置是较优的。In summary, it is preferred that the two auxiliary Mics are placed in the "head and shoulder simulator" binaural position in the embodiment of the present disclosure.
B、两个固定辅Mic测试B, two fixed auxiliary Mic test
测试环境搭建(见图15):The test environment is built (see Figure 15):
在本实验中,将一根标准1/4Mic视为可随头肩模拟器移动的主Mic(B&K 2670)测试策略如下:In this experiment, a standard 1/4Mic is considered as the main Mic (B&K 2670) test strategy that can be moved with the head and shoulder simulator as follows:
a、将Mic置于唇环中心点位置(A点),测试1020Hz,8个频点的接收失真情况;a. Place Mic at the center point of the lip ring (point A), and test the receiving distortion of 1020 Hz and 8 frequency points;
b、将Mic置于唇环中心点上端鼻子处(B点)测试1020Hz,8个频点的接收失真情况;b. Place Mic at the upper end of the center of the lip ring (point B) to test 1020 Hz, receiving distortion at 8 frequency points;
c、将Mic置于唇环中心点前额处(C点)测试1020Hz,8个频点的接收失真情况;c. Place Mic at the forehead of the center of the lip ring (point C) to test the reception distortion of 1020 Hz and 8 frequency points;
测试结论详见表2:The test conclusions are detailed in Table 2:
表2Table 2
方案Program 次数frequency 测试结论Test conclusion
a、唇环中心位置a, the center of the lip ring 5050 Fail(0%次)-3个点不过Fail (0% times) - 3 points but
b、鼻子b, nose 5050 Fail(0%次)-4个点不过Fail (0% times) - 4 points but
c、前额c, forehead 5050 Fail(0%次)-1个点不过Fail (0% times) - 1 point but
所以,在本公开实施例中将主Mic放置于前额部位是较优的。Therefore, it is preferred to place the main Mic in the forehead portion in the embodiment of the present disclosure.
在一个可选的实施例中,上述主Mic还包括位于可穿戴设备的顶端的主Mic,其中,该顶端被设置为与佩戴者的头顶相对应。通过在对应头顶的位置上设置主Mic可以实现立体空间声音采集,从而使用户能够感受到更为立体真实的声音。需要说明的是,在本公开中,可以根据实际需求自由设置主Mic的位置,并且自由调整主Mic的数量。In an alternative embodiment, the main Mic further includes a main Mic located at the top end of the wearable device, wherein the top end is configured to correspond to the top of the wearer's head. Stereoscopic sound collection can be achieved by setting the main Mic at the position of the corresponding overhead, so that the user can feel a more stereoscopic sound. It should be noted that, in the present disclosure, the position of the main Mic can be freely set according to actual needs, and the number of main Mic can be freely adjusted.
在一个可选的实施例中,当上述拾音设备为用户头部的可穿戴设备时,辅Mic包括位于可穿戴设备的两侧的辅Mic,两侧被设置为与佩戴者的双耳相对应。In an optional embodiment, when the above-mentioned sound pickup device is a wearable device of a user's head, the auxiliary Mic includes a secondary Mic located on both sides of the wearable device, and both sides are set to be in contact with the wearer's ears. correspond.
通过以上的实施方式的描述,本领域的技术人员可以清楚地了解到根据上述实施例的方法可借助软件加必需的通用硬件平台的方式来实现,当然也可以通过硬件,但很多情况下前者是更佳的实施方式。基于这样的理解,本公开的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质(如ROM/RAM、磁碟、光盘)中,包括若干指令用以使得一台终端设备(可以是手机,计算机,服务器,或者网络设备等)执行本公开各个实施例所述的方法。Through the description of the above embodiments, those skilled in the art can clearly understand that the method according to the above embodiment can be implemented by means of software plus a necessary general hardware platform, and of course, by hardware, but in many cases, the former is A better implementation. Based on such understanding, portions of the technical solutions of the present disclosure that contribute substantially or to the prior art may be embodied in the form of a software product stored in a storage medium (eg, ROM/RAM, disk, The optical disc includes a number of instructions for causing a terminal device (which may be a cell phone, a computer, a server, or a network device, etc.) to perform the methods described in various embodiments of the present disclosure.
在本实施例中还提供了一种声音输出装置,该装置用于实现上述实施 例及优选实施方式,已经进行过说明的不再赘述。如以下所使用的,术语“模块”可以实现预定功能的软件和/或硬件的组合。尽管以下实施例所描述的装置较佳地以软件来实现,但是硬件,或者软件和硬件的组合的实现也是可能并被构想的。Also provided in the present embodiment is a sound output device for implementing the above-described embodiments and preferred embodiments, which have not been described again. As used below, the term "module" may implement a combination of software and/or hardware of a predetermined function. Although the apparatus described in the following embodiments is preferably implemented in software, hardware, or a combination of software and hardware, is also possible and contemplated.
图17是根据本公开实施例的声音输出装置的结构框图,如图17所示,该装置包括:17 is a structural block diagram of a sound output device according to an embodiment of the present disclosure. As shown in FIG. 17, the device includes:
获取模块172,设置为利用拾音设备中的主Mic获取主Mic对应视角上的第一声音源的声音;输出模块174,连接至上述获取模块172,设置为将上述主Mic获取的声音输出给与上述拾音设备连接的增强现实AR或虚拟现实VR设备。The obtaining module 172 is configured to acquire the sound of the first sound source in the main Mic corresponding angle of view by using the main Mic in the sound collecting device; the output module 174 is connected to the obtaining module 172, and is configured to output the sound obtained by the main Mic to the sound An augmented reality AR or virtual reality VR device connected to the above-described sound pickup device.
在一个可选的实施例中,该装置还设置为:在将主Mic获取的声音输出给与所述拾音设备连接的增强现实AR或虚拟现实VR设备之前,利用拾音设备中的辅Mic获取辅Mic对应视角上的第二声音源的声音,其中,辅Mic设置在主Mic的周围;将主Mic获取的声音输出给与拾音设备连接的增强现实AR或虚拟现实VR设备:将主Mic获取的声音和辅Mic获取的声音进行合成,并将合成后的声音输出给与拾音设备连接的AR或VR设备。In an optional embodiment, the apparatus is further configured to: use the auxiliary Mic in the sound pickup device before outputting the sound acquired by the main Mic to the augmented reality AR or the virtual reality VR device connected to the sound pickup device Obtaining a sound of the second sound source corresponding to the auxiliary Mic, wherein the auxiliary Mic is disposed around the main Mic; and outputting the sound acquired by the main Mic to the augmented reality AR or the virtual reality VR device connected to the sound collecting device: The sound acquired by Mic and the sound acquired by the auxiliary Mic are combined, and the synthesized sound is output to an AR or VR device connected to the pickup device.
在一个可选的实施例中,上述获取模块172可以通过如下方式利用拾音设备中的主Mic获取所述主Mic对应视角上的第一声音源的声音:配置主Mic的拾音孔的孔径大小和/或拾音孔深度;利用配置了孔径大小和/或拾音孔深度的主Mic获取主Mic对应视角上的第一声音源的声音。In an optional embodiment, the obtaining module 172 may acquire the sound of the first sound source in the main Mic corresponding perspective by using the main Mic in the sound collecting device by configuring the aperture of the sound collecting hole of the main Mic. Size and/or pickup hole depth; the main Mic configured with the aperture size and/or the pickup aperture depth is used to acquire the sound of the first sound source at the main Mic corresponding angle of view.
在一个可选的实施例中,上述获取模块172可以通过如下方式配置上述主Mic的拾音孔的孔径大小和/或拾音孔深度:在接收到输入的配置信息的情况下,根据该配置信息配置主Mic的拾音孔的孔径大小和/或拾音孔深度;在未接收到输入的配置信息的情况下,根据预设值配置主Mic的拾音孔的孔径大小和/或拾音孔深度。In an optional embodiment, the obtaining module 172 may configure the aperture size and/or the depth of the sound collecting hole of the main Mic by the following manner: in case the input configuration information is received, according to the configuration The information configures the aperture size of the pickup hole of the main Mic and/or the depth of the pickup hole; if the input configuration information is not received, the aperture size and/or the pickup of the pickup hole of the main Mic is configured according to the preset value. Hole depth.
在一个可选的实施例中,上述声音输出装置还设置为在利用拾音设备 中的主Mic获取主Mic正对方向上的声音之前,确定上述拾音设备需要转动的方向及幅度;利用确定的拾音设备需要转动的方向及幅度控制拾音设备进行转动。In an optional embodiment, the sound output device is further configured to determine a direction and an amplitude of the sound pickup device to be rotated before acquiring the sound in the direction of the main Mic by using the main Mic in the sound pickup device; The pickup device needs the direction and amplitude of the rotation to control the pickup device to rotate.
在一个可选的实施例中,该装置还设置为:在利用拾音设备中的主Mic获取所述主Mic对应视角上的第一声音源的声音之前,确定所述主Mic需要转动的方向及幅度;利用确定的所述主Mic需要转动的方向及幅度控制所述主Mic进行转动。In an optional embodiment, the apparatus is further configured to determine a direction in which the main Mic needs to be rotated before acquiring the sound of the first sound source in the main Mic corresponding viewing angle by using the main Mic in the sound collecting device. And amplitude; controlling the main Mic to rotate by using the determined direction and amplitude of the main Mic to be rotated.
需要说明的是,上述各个模块是可以通过软件或硬件来实现的,对于后者,可以通过以下方式实现,但不限于此:上述模块均位于同一处理器中;或者,上述各个模块以任意组合的形式分别位于不同的处理器中。It should be noted that each of the above modules may be implemented by software or hardware. For the latter, the foregoing may be implemented by, but not limited to, the foregoing modules are all located in the same processor; or, the above modules are in any combination. The forms are located in different processors.
本公开的实施例还提供了一种存储介质,该存储介质中存储有计算机程序,其中,上述计算机程序被设置为运行时执行上述任一项方法实施例中的步骤。Embodiments of the present disclosure also provide a storage medium having stored therein a computer program, wherein the computer program is configured to execute the steps of any one of the method embodiments described above.
可选地,在本实施例中,上述存储介质可以包括但不限于:U盘、只读存储器(Read-Only Memory,简称为ROM)、随机存取存储器(Random Access Memory,简称为RAM)、移动硬盘、磁碟或者光盘等各种可以存储程序代码的介质。Optionally, in the embodiment, the foregoing storage medium may include, but is not limited to, a USB flash drive, a Read-Only Memory (ROM), and a Random Access Memory (RAM). A variety of media that can store program code, such as a hard disk, a disk, or an optical disk.
本公开的实施例还提供了一种电子装置,包括存储器和处理器,该存储其中存储有计算机程序,该处理器被设置为运行计算机程序以执行上述任一项方法实施例中的步骤。Embodiments of the present disclosure also provide an electronic device including a memory and a processor having stored therein a computer program configured to execute a computer program to perform the steps of any of the above method embodiments.
可选地,上述电子装置还可以包括传输设备以及输入输出设备,其中,该传输设备和上述处理器连接,该输入输出设备和上述处理器连接。Optionally, the electronic device may further include a transmission device and an input and output device, wherein the transmission device is connected to the processor, and the input and output device is connected to the processor.
可选地,本实施例中的具体示例可以参考上述实施例及可选实施方式中所描述的示例,本实施例在此不再赘述。For example, the specific examples in this embodiment may refer to the examples described in the foregoing embodiments and the optional embodiments, and details are not described herein again.
由上述实施例可知,本公开的基本核心是在AR/VR产品上,通过在AR/VR产品上进行Mic的定义及布局。为了便于理解和更好的阐明本公开所描述的内容,可以暂设定Mic有24mm、12mm、6mm、3mm四种规 格,该Mic集成为一个采集扇面,扇面可随用户头部进行同角度旋转结构,通过平面旋转、增加拓宽拾音孔的孔径长度来解决进入该通道声音,以及屏蔽其它非对视位置的音源进入,并增加其它辅助Mic与之匹配。It can be seen from the above embodiments that the basic core of the present disclosure is to define and layout Mic on the AR/VR product by using the AR/VR product. In order to facilitate understanding and better clarify the content described in the present disclosure, the Mic can be temporarily set to have four specifications of 24mm, 12mm, 6mm, and 3mm. The Mic is integrated into an acquisition fan, and the fan can rotate with the user's head at the same angle. The structure, through the plane rotation, increases the aperture length of the widening sound hole to solve the sound entering the channel, and shields the sound source entry of other non-opposing positions, and adds other auxiliary Mic to match.
其中,原有Mic为可以进行扇面旋转的拾音设备,将四种规格(四种规格仅是示例性说明,也可以采用其他的规格)进行集成,集成Mic模块设置为采集用户对视方向上的声音;在用户佩戴上述拾音设备后,辅Mic为2个全向辅Mic,实时采集用户所处位置旁边的声音,可以随着用户头进行转动,也可以设置为不随着用户头进行转动。集成主Mic开始工作,设备默认A、B、C、D中任意一个档位(每个档位分别对应一种规格),默认为D档位,每个档位对应一个通路;两个辅Mic开始工作,实时采集用户左侧和右侧的环境声音;当用户朝向某个区域时,随着转向角度变化,集成主Mic位置同角度发生变化;此时,集成Mic将电声信号进行转换,通过耳机传递给用户;同样的,两个辅Mic将头左右两侧的声音通过电声信号转换,通过耳机传递给用户;当用户感觉此时的感受不能够满足现场观赛感需求时,可以调节按钮切换档位,并开启与之选择相对应的通路,从而达到更好的效果。Among them, the original Mic is a sound pickup device that can perform fan rotation. The four specifications (four specifications are only exemplary descriptions, other specifications can also be used) are integrated, and the integrated Mic module is set to collect the user in the viewing direction. After the user wears the above-mentioned pickup device, the auxiliary Mic is two omnidirectional auxiliary Mic, and the sound next to the position where the user is located can be collected in real time, and can be rotated with the user's head, or can be set to not rotate with the user's head. . The integrated main Mic starts to work. The device defaults to any one of A, B, C, and D (each gear corresponds to one specification). The default is D gear. Each gear corresponds to one path; two auxiliary Mic Start working, collecting the ambient sound of the left and right sides of the user in real time; when the user is facing a certain area, the integrated main Mic position changes with the angle as the steering angle changes; at this time, the integrated Mic converts the electroacoustic signal. Through the earphones to the user; similarly, the two auxiliary Mic convert the sounds on the left and right sides of the head through the electro-acoustic signal, and transmit them to the user through the earphone; when the user feels that the feeling at this time cannot meet the demand of the live viewing, The adjustment button switches the gear position and opens the passage corresponding to the selection to achieve better results.
在本公开实施例中,所采用的方法是,重新定义Mic并通过“平面旋转”、“增加或拓宽拾音孔”的孔径和长度来解决进入通道声音并且屏蔽其他非用户对视点的音源进入问题,进行了硬件修改等方式,从而进行音源定位。In the embodiment of the present disclosure, the method adopted is to redefine the Mic and solve the problem of entering the channel sound and shielding other non-user-to-view source by the "plane rotation", "increasing or widening the aperture" and the length of the sound hole. The problem is that hardware modification is performed to perform sound source localization.
在本公开实施例中描述了这样一种采集面Mic分配布局策略和使用规则。例如:由于本公开专利核心之一为“增加或拓宽拾音孔”,故当为了实现某一方向上的音源采集,可以设置多个主Mic(每个Mic的拾音孔的孔径大小不同,或者设置一个主Mic,并为该一个主Mic设置多种拾音孔的孔径大小规格),例如设定添加的主Mic数量为3个,分别为A、B、C三种,所采用的A、B、C三种Mic孔径为a毫米、b毫米、c毫米,朝向相对于某水平位置为a°、b°、c°,由于其本身结构特性和位置特性,所表征的含义为各个角度朝向的所带给用户感受。需要说明的是,上文所 举例描述的是本公开核心的基本实施实例之一,并不代表本公开所描述的完整唯一实施方案。本公开实施例根据后期设计的需求确定Mic的数量、规格、朝向等内容。Such an acquisition surface Mic allocation layout strategy and usage rules are described in the embodiments of the present disclosure. For example, since one of the cores of the present disclosure is "increasing or widening the sound collecting hole", when the sound source is collected in a certain direction, a plurality of main Mic may be set (the aperture size of each Mic's sound collecting hole is different, or Set a main Mic, and set the aperture size specifications of the various pickup holes for the one main Mic. For example, the number of main Mic added is set to three, which are A, B, and C respectively. The three Mic apertures B and C are a mm, b mm, and c mm, and the orientations are a°, b°, and c° with respect to a certain horizontal position. Due to their structural and positional characteristics, the meanings are expressed in various angle orientations. Brought to the user experience. It should be noted that the above description is one of the basic embodiments of the core of the present disclosure, and does not represent the complete and unique embodiment described in the present disclosure. The embodiments of the present disclosure determine the number, specification, orientation, and the like of the Mic according to the requirements of the post-design.
本公开实施例所描述的“水平方向上的定位方法”目的是为了更好地说明问题,是本公开核心的基本实施实例之一,并不代表本公开所描述的完整唯一实施方案。当再添加一个与本公开所描述的基本实施例采用相同技术手段、且垂直于它的设备之后,就可以达到立体空间的采集。所以,本公开完整的实施方案包括但不限于一个水平方向上的音源定位,如本段文字所述,当添加两个或两个以上等本公开描述的技术后,可以达到全向采集。The "positioning method in the horizontal direction" described in the embodiments of the present disclosure is intended to better explain the problem, is one of the basic embodiments of the core of the present disclosure, and does not represent the complete and unique embodiment described in the present disclosure. The acquisition of the stereoscopic space can be achieved by adding a device that uses the same technical means and is perpendicular to its basic embodiment as described in the present disclosure. Thus, a complete embodiment of the present disclosure includes, but is not limited to, sound source localization in a horizontal direction, as described in this paragraph, when two or more techniques, such as those described in the present disclosure, are added, omnidirectional acquisition can be achieved.
本公开实施例提供了这样的一种立体声感知技术,即根据空间大小,在特定位置部署本专利所描述的设备,可以达到全方位,立体空间的音源采集。佩戴了上述拾音设备后,用户可以体验到在本地(家中、赛场)的任意一点观察感知所建立机位的观赛感觉,从而增加用户的可选择性和沉浸感。且用户所在位置无需发生变化。例如:当用户处于B点位置时,头部向左转动,通过用户的选择可以感知到现场任意一点如A点的头部向左转动的等同效果。Embodiments of the present disclosure provide a stereo sensing technology that deploys the device described in this patent at a specific location according to the size of the space, and can achieve all-round, stereo space sound source collection. After wearing the above-mentioned sound collecting device, the user can experience the feeling of watching the established position at any point in the local (home, field), thereby increasing the user's selectivity and immersion. And the user's location does not need to change. For example, when the user is at the B point position, the head is turned to the left, and the user's selection can sense the equivalent effect of any point on the scene, such as the head of point A turning to the left.
本公开实施例所描述的接收到的每一个音源,都包含位置信息,且该位置信息在采集之初,就已经包含在音源之中,也就是说该位置信息为初始发声源位置。所以还原给用户时,不仅包括声音,同时也能让用户感觉到所处位置的变化。Each of the received sound sources described in the embodiments of the present disclosure includes position information, and the position information is already included in the sound source at the beginning of the collection, that is, the position information is the initial sound source position. Therefore, when it is restored to the user, it not only includes the sound, but also allows the user to feel the change of the location.
本公开实施例还描述了这样的一种音视频匹配合成规则。就现有技术而言,在还原匹配音视频时,所有参数的调整和变化都是基于时间(帧)的,而专利所描述的方法是,增加一个位置坐标轴线,通过转动(Pan)和垂直转动(Tilt)两个参数,就能匹配到视角上的声音和朝向画面,当时间、位置坐标二者同时匹配时进行合成。Embodiments of the present disclosure also describe such an audio and video matching synthesis rule. As far as the prior art is concerned, when the matching audio and video is restored, all parameters are adjusted and changed based on time (frame), and the method described in the patent is to add a position coordinate axis by rotating (Pan) and vertical. By rotating (Tilt) two parameters, you can match the sound and the orientation of the view, and combine when the time and position coordinates match at the same time.
本公开实施例描述了一种音频格式的框架结构。现有的音频文件包括 简单的左右声道、音轨(声部)等信息,而本公开实施例中,添加了位置信息,具体的描述是,其以采集点为原点,该原点是距离音源的相对位置;同样的,采集点同时也进行视频采集,也同为视频采集点的原点。很明显的,由于视频和音频属于同一个坐标系,在位置信息都一样的情况下,可以进行匹配。综上,本实施例所描述的位置信息是所有发声源相对于采集点的相对位置。Embodiments of the present disclosure describe a frame structure of an audio format. The existing audio file includes simple left and right channels, audio tracks (sounds), and the like. In the embodiment of the present disclosure, position information is added. The specific description is that the origin is a distance source, and the origin is a distance source. The relative position; similarly, the collection point also performs video acquisition, which is also the origin of the video collection point. Obviously, since the video and audio belong to the same coordinate system, the matching can be performed if the position information is the same. In summary, the position information described in this embodiment is the relative position of all sound sources relative to the collection point.
本公开实施例描述了一种当位置(坐标点)变动时音频还原的一种规则。本实施例所描述的还原给用户的声音不是改变还原给用户声音的大小,而是改变所采集的声场范围内每个发声点还原给用户的响度比例。所以,在影响用户体验的几个因素中,诸如视频、音频、体感、触摸灯内容中,本实施主要描述的是对音频比例的还原更改。Embodiments of the present disclosure describe a rule for audio reproduction when a position (coordinate point) changes. The sound restored to the user described in this embodiment does not change the size of the sound restored to the user, but changes the proportion of the loudness that is restored to the user for each utterance point within the range of the collected sound field. Therefore, in several factors that affect the user experience, such as video, audio, body, touch light content, this embodiment mainly describes the restoration of the audio ratio.
本公开实施例描述了一种用户自我选择模式。本公开实施例中描述了主Mic为四个默认档位,当用户不选择默认档位时,可以根据用户自身需求,任意切换其他档位,且其他通路继续进行声音的录制,并对应相应的通路反馈给用户,增加用户体验。在此需要重点说明的是,本专利为了更好的说明问题,描述了主Mic为四个默认档位,是本专利发明核心的基本功能特性之一,并不代表本公开所描述的完整唯一实施方案。所以,本公开完整的实施方案包括但不限于四个默认档位。Embodiments of the present disclosure describe a user self-selection mode. In the embodiment of the present disclosure, the main Mic is described as four default gear positions. When the user does not select the default gear position, the other gear positions can be arbitrarily switched according to the user's own needs, and the other channels continue to record the sound, and corresponding to the corresponding The channel is fed back to the user to increase the user experience. It should be noted here that in order to better illustrate the problem, this patent describes that the main Mic is the four default gear positions, which is one of the basic functional characteristics of the core of the patented invention, and does not represent the complete uniqueness described in the present disclosure. implementation plan. Therefore, a complete embodiment of the present disclosure includes, but is not limited to, four default gear positions.
本公开实施例还描述了这样一种基于复用匹配的体验方案。采用本公开所描述音源定位方式,当添加与后期设计需求匹配的主Mic数量、规格、位置后,如上文描述,可以实现全场多平面维度的音源定位,此时主Mic采集面采集全场的音源进行录制。例如:有两个用户A、B同时朝向不同的角度a°、b°两个视角,全场录制的音源为α。此时,系统自动将用户A的视角角度a°与α音源中所包含的位置信息进行匹配,当匹配成功时,将所扫描到的匹配成功的音源传递给用户A。同样的,系统自动将用户B的视角角度b°与α音源中所包含的位置信息进行匹配,当匹配成功时,将所扫描到的匹配成功的音源传递给用户B。需要说明的是,举例中所描述的是本公开核心的基本功能特性之一,并不代表本公开所描述的完整唯 一实施方案。Embodiments of the present disclosure also describe such an adaptive scheme based on multiplex matching. With the sound source localization method described in the present disclosure, after adding the number, specification, and position of the main Mic that matches the post-design requirements, as described above, the sound source positioning of the full-field multi-plane dimension can be realized, and the main Mic acquisition surface is collected. The source is recorded. For example, there are two users A and B facing two angles of view at different angles a° and b° at the same time, and the sound source recorded in the whole field is α. At this time, the system automatically matches the angle A of the user A with the position information contained in the alpha source. When the matching is successful, the scanned source that has been successfully matched is transmitted to the user A. Similarly, the system automatically matches the viewing angle b° of the user B with the position information contained in the alpha sound source, and when the matching is successful, transmits the scanned matching sound source to the user B. It should be noted that what is described in the examples is one of the basic functional characteristics of the core of the present disclosure, and does not represent the complete and unique embodiment described in the present disclosure.
本公开实施例还描述了这样一种基于本地的非实时复用采集方案。当采用与后期设计需求向匹配的方案后,系统自动采集全场声音进行录制,将录制信息存储在本地,当用户使用时,采用前述的复用匹配体验方案,可以实现非实时的多用户多不同视角的音源定位的感知。Embodiments of the present disclosure also describe such a local based non-real time multiplexing acquisition scheme. After adopting the scheme that matches the requirements of the post-design, the system automatically collects the full-field sound for recording, and stores the recorded information locally. When the user uses the above-mentioned multiplexing matching experience scheme, the non-real-time multi-user can be realized. Perception of sound source localization from different perspectives.
发明实施例对设备布局提供了这样一种解决方案,本公开专利所描述的设备,可以在场内任意一点放置,包括但不限于一台终端设备。The inventive embodiment provides such a solution to the device layout. The device described in the present disclosure can be placed at any point in the field, including but not limited to one terminal device.
本公开实施例中所描述的方法和现有技术提到的方法在实现手段上是存在如下差异的:The method described in the embodiments of the present disclosure and the method mentioned in the prior art have the following differences in the implementation means:
A、现有专利主要内容是VR音频和视频对应渲染,以及在虚拟场景中任意位置视频和音频的选择还原,并没有对立体空间中不同方位音源区分和采集,最重要的是该专利的实现基础是本专利的采集方法,否则,现有专利技术无法实现。A. The main content of the existing patents is the corresponding rendering of VR audio and video, and the selection and restoration of video and audio at any position in the virtual scene. There is no distinction and collection of different azimuth sources in the three-dimensional space. The most important thing is the realization of the patent. The basis is the collection method of this patent. Otherwise, the existing patent technology cannot be realized.
B、本公开实施例中的方案是通过上述推论,通过平面旋转、增加拓宽拾音孔的孔径及长度来解决进入该通道声音以及屏蔽其它非对视位置的音源进入等硬件修改方式达到本专利所提到的目的。现有专利是通过渲染、将其所提到的N个音频采集设备采集的音频数据进行编码实现的。B. The solution in the embodiment of the present disclosure is to realize the hardware modification manner of the sound source entering the channel and shielding other non-opposing positions by planar rotation, increasing the aperture and length of the sound collecting hole, and the like. The purpose mentioned. The existing patents are implemented by rendering and encoding the audio data collected by the N audio collection devices mentioned.
C、本公开实施例中所描述的方法与现有技术所描述的方法在“音源定位”时,实现“音源定位”的介入点存在不同,本公开所描述的是在初期的采集过程中就已经将相关音源(含感知位置)信息录入,在达到基本效果的实施方式上不需要后期进行处理。而现有专利是在录入音源,后期再通过所确定音频数据对应的扬声设备;利用确定的Q个扬声设备,渲染M路音频数据。介入点是在后期对Q个扬声器所对应的M路音频数据的渲染阶段。C. The method described in the embodiments of the present disclosure differs from the method described in the prior art in the implementation of “sound source localization” in the “sound source localization”. The present disclosure describes the initial acquisition process. The relevant sound source (including the perceived position) information has been entered, and no later processing is required on the implementation method that achieves the basic effect. The existing patent is to input the sound source, and then pass the sound data device corresponding to the determined audio data; and use the determined Q speaker devices to render the M channel audio data. The intervention point is the rendering stage of the M-channel audio data corresponding to the Q speakers in the later stage.
D、采集方式不同,本公开所描述的采集方式是,通过添加两个(包括但不限于两个)辅Mic,以及重新定义四种规格(包括但不限于四个)的集成主Mic来实现的,该设备在某单个机位可以实现多向音源采集和定 位。而现有技术则是利用针对每个视频采集设备的设置方位,再在其对应设置的N个音频采集设备。D. Different acquisition methods, the acquisition method described in the present disclosure is achieved by adding two (including but not limited to two) auxiliary Mic, and redefining the integrated main Mic of four specifications (including but not limited to four). The device can achieve multi-directional sound source acquisition and positioning in a single location. The prior art utilizes the set orientation of each video capture device, and then sets N corresponding audio capture devices.
现有专利主要内容是VR音频和视频对应渲染,以及在虚拟场景中任意位置视频和音频的选择还原,并没有对立体空间中不同方位音源的区分和采集,而通过本公开实施例中的采集方式能够有效对立体空间中不同方位音源进行区分和采集。The main content of the existing patents is VR audio and video corresponding rendering, and selection and restoration of video and audio at any position in the virtual scene, and there is no distinction and collection of different azimuth sound sources in the stereo space, but the collection in the embodiment of the present disclosure The method can effectively distinguish and collect sound sources in different directions in the three-dimensional space.
本公开实施例中所描述的方法,是在初期的采集过程中就已经将相关音源(含感知位置)信息录入,在达到基本效果的实施方式上不需要后期进行处理。The method described in the embodiment of the present disclosure records the related sound source (including the perceived position) information in the initial collection process, and does not need to be processed later in the implementation manner that achieves the basic effect.
本公开描述了一种当位置变动(坐标点)时,音频还原的一种规则。不采用改变还原给用户声音的大小,而是改变所采集的声场范围内每个发声点声音大小的比例。现有技术无此功能,只是单纯的渲染后还原给用户。The present disclosure describes a rule for audio reproduction when positional changes (coordinate points). Instead of changing the size of the sound restored to the user, the ratio of the sound level of each sound point within the range of the collected sound field is changed. The prior art does not have this function, but is simply restored to the user after rendering.
本公开实现了一种用户自我选择的模式,所描述的主Mic有四个默认档位(包括但不限于四个),当用户不选则默认档位时,可以切换任意其它档位,其它通路继续进行声音的录制,并对应相应地通路,反馈给用户,增加用户的体验。The present disclosure implements a mode of user self-selection, the described main Mic has four default gear positions (including but not limited to four), and when the user does not select the default gear position, any other gear position can be switched, other The path continues to record the sound and responds to the corresponding path, giving feedback to the user, increasing the user experience.
本公开建立了这样的一种立体声感知技术,根据场地及其空间大小,在特定位置部署本专利所描述的设备,可以达到全方位,立体的空间采集。用户可以在本地(家中、赛场任意一点)任意一点观察感知所建立机位的观赛感觉,增加用户的可选择性和沉浸感,且用户所在位置无需发生变化。The present disclosure establishes such a stereo sensing technology, and according to the size of the site and its space, deploying the device described in this patent at a specific location, can achieve all-dimensional, three-dimensional space acquisition. The user can observe the perception of the established position at any point in the local (at home, at any point in the stadium), increase the user's selectivity and immersion, and the user's location does not need to change.
本公开的有益效果总结如下:The beneficial effects of the present disclosure are summarized as follows:
本公开实施例中的方案实施简单,便捷,不需要做太大的改动;The solution in the embodiment of the present disclosure is simple and convenient to implement, and does not need to be changed too much;
本公开实施例中的方案可以较好的规避已有技术中降噪技术的不足;The solution in the embodiment of the present disclosure can better avoid the deficiencies of the noise reduction technology in the prior art;
本公开实施例中的方案可以在一种设备上进行档位调节,由用户自主选择孔径大小,增加用户的可选择性。The solution in the embodiment of the present disclosure can perform gear position adjustment on a device, and the user independently selects the aperture size to increase the user's selectivity.
下面对本公开的具体应用场景进行说明:The specific application scenarios of the present disclosure are described below:
本公开实施例中的方案可以运用在已有的智能手机产品上,可以达到使用该设备时,用户亲临现场观赛,转动头的位置时,可以实时感知所处位置的声音效果,增加深度沉浸感;还为提供了多种可选择档位,让用户所达到更佳的体验。The solution in the embodiment of the present disclosure can be applied to an existing smart phone product, and when the device is used, when the user visits the scene and turns the position of the head, the sound effect of the position can be sensed in real time, and the depth immersion is increased. Sense; also provides a variety of optional gears for users to achieve a better experience.
在其他领域种,本公开专利在城市规划、城市建模中可以得到很好的运用,例如:在城市规划中,用户可以进行现场巡视,佩戴终端后,可以感知所设置模型的合理性;本公开专利在地理学科中可以得到很好的运用,并综合利用三维GIS等地理信息,来实现某些特定地域类型的感知,提供可靠的参考数据和用户感知;本公开专利可以运动在抢险救灾中,当重大事故发生时,可实时感知查看建筑内部构造和感知信息等其他影响救援工作信息,精准定位最佳救援路线,选择最优救援手段,极大提高救援效率。并对现场传回的图像进行跟踪,实时下达救援命令。In other fields, the disclosed patents can be well used in urban planning and urban modeling. For example, in urban planning, users can conduct on-site inspections, and after wearing the terminal, they can perceive the rationality of the set model; Public patents can be well used in geography, and comprehensively use geographic information such as 3D GIS to achieve the perception of certain geographical types, providing reliable reference data and user perception; this patent can be exercised in disaster relief. When a major accident occurs, real-time perception of the internal structure and perception information of the building and other information affecting rescue work, accurately locate the best rescue route, select the best rescue means, and greatly improve the rescue efficiency. The image returned from the scene is tracked and the rescue command is issued in real time.
需要说明的是,本专利发明也可以运用到如军事、工业、电子巡航、教育等多个领域。It should be noted that the patented invention can also be applied to various fields such as military, industrial, electronic cruise, education, and the like.
显然,本领域的技术人员应该明白,上述的本公开的各模块或各步骤可以用通用的计算装置来实现,它们可以集中在单个的计算装置上,或者分布在多个计算装置所组成的网络上,可选地,它们可以用计算装置可执行的程序代码来实现,从而,可以将它们存储在存储装置中由计算装置来执行,并且在某些情况下,可以以不同于此处的顺序执行所示出或描述的步骤,或者将它们分别制作成各个集成电路模块,或者将它们中的多个模块或步骤制作成单个集成电路模块来实现。这样,本公开不限制于任何特定的硬件和软件结合。It will be apparent to those skilled in the art that the various modules or steps of the present disclosure described above can be implemented by a general-purpose computing device that can be centralized on a single computing device or distributed across a network of multiple computing devices. Alternatively, they may be implemented by program code executable by the computing device such that they may be stored in the storage device by the computing device and, in some cases, may be different from the order herein. The steps shown or described are performed, or they are separately fabricated into individual integrated circuit modules, or a plurality of modules or steps thereof are fabricated as a single integrated circuit module. As such, the disclosure is not limited to any specific combination of hardware and software.
以上所述仅为本公开的优选实施例而已,并不用于限制本公开,对于本领域的技术人员来说,本公开可以有各种更改和变化。凡在本公开的原则之内,所作的任何修改、等同替换、改进等,均应包含在本公开的保护范围之内。The above description is only a preferred embodiment of the present disclosure, and is not intended to limit the disclosure, and various changes and modifications may be made to the present disclosure. Any modifications, equivalent substitutions, improvements, etc., made within the scope of the present disclosure are intended to be included within the scope of the present disclosure.
工业实用性Industrial applicability
如上所述,本发明实施例提供的一种拾音设备、声音输出方法、装置、存储介质及电子装置具有以下有益效果:解决了相关技术中存在的用户只能体会某一个特定位置的观赛感,且会受到周围噪声的影响的问题。As described above, the sound collecting device, the sound output method, the device, the storage medium, and the electronic device provided by the embodiments of the present invention have the following beneficial effects: the user in the related art can only understand the viewing of a specific location. Feeling, and will be affected by the surrounding noise.

Claims (19)

  1. 一种拾音设备,所述拾音设备被设置为与增强现实AR或虚拟现实VR设备连接,包括:主麦克风Mic以及处理器,其中,A sound pickup device, the sound pickup device being configured to be connected to an augmented reality AR or a virtual reality VR device, comprising: a main microphone Mic and a processor, wherein
    所述主Mic设置为获取所述主Mic对应视角上的第一声音源的声音;The main Mic is configured to acquire a sound of a first sound source in a corresponding perspective of the main Mic;
    所述处理器连接至所述主Mic,设置为将所述主Mic获取的声音输出给所述AR或VR。The processor is coupled to the main Mic and configured to output the sound acquired by the main Mic to the AR or VR.
  2. 根据权利要求1所述的拾音设备,其中,所述拾音设备还包括辅Mic,其中,The sound pickup device according to claim 1, wherein said sound pickup device further comprises a secondary Mic, wherein
    所述辅Mic设置在所述主Mic的周围,设置为获取所述辅Mic对应视角上的第二声音源的声音;The auxiliary Mic is disposed around the main Mic, and is configured to acquire a sound of a second sound source corresponding to the auxiliary Mic;
    所述处理器还设置为连接至所述辅Mic,设置为将所述主Mic获取的声音和所述辅Mic获取的声音进行合成并输出合成后的声音给所述AR或VR。The processor is further configured to be connected to the secondary Mic, configured to combine the sound acquired by the primary Mic and the sound acquired by the secondary Mic, and output the synthesized sound to the AR or VR.
  3. 根据权利要求1所述的拾音设备,其中,所述处理器还设置为配置所述主Mic的拾音孔的孔径大小和/或拾音孔的深度。The sound pickup apparatus according to claim 1, wherein said processor is further configured to configure an aperture size of said sound hole of said main Mic and/or a depth of a sound pickup hole.
  4. 根据权利要求3所述的拾音设备,其中,所述处理器通过如下方式配置所述主Mic的拾音孔的孔径大小和/或拾音孔的深度:The sound pickup apparatus according to claim 3, wherein said processor configures an aperture size of said sound pickup hole of said main Mic and/or a depth of a sound pickup hole by:
    在接收到输入的配置信息的情况下,根据所述配置信息配置所述主Mic的拾音孔的孔径大小和/或拾音孔深度;When receiving the input configuration information, configuring an aperture size and/or a pickup depth of the sound collecting hole of the main Mic according to the configuration information;
    在未接收到输入的配置信息的情况下,根据预设的孔径大小配置所述主Mic的拾音孔的孔径大小和/或拾音孔深度。In the case where the input configuration information is not received, the aperture size and/or the pickup hole depth of the pickup hole of the main Mic are configured according to a preset aperture size.
  5. 根据权利要求1所述的拾音设备,其中,所述处理器还设置为:The sound pickup device of claim 1, wherein the processor is further configured to:
    确定所述拾音设备需要转动的方向及幅度;Determining a direction and a range in which the pickup device needs to be rotated;
    利用确定的所述拾音设备需要转动方向及幅度控制所述拾音设 备进行转动。The pickup device is controlled to rotate by the determined direction and magnitude of rotation of the pickup device.
  6. 根据权利要求1所述的拾音设备,其中,所述处理器还设置为:The sound pickup device of claim 1, wherein the processor is further configured to:
    确定所述主Mic需要转动的方向及幅度;Determining the direction and magnitude of rotation of the main Mic;
    利用确定的所述主Mic需要转动的方向及幅度控制所述主Mic进行转动。The main Mic is controlled to rotate by using the determined direction and amplitude of the main Mic to be rotated.
  7. 根据权利要求1至6中任一项所述的拾音设备,其中,当所述拾音设备为用户头部的可穿戴设备时,所述主Mic包括位于所述可穿戴设备前端的主Mic,其中,所述前端被设置为与佩戴者的前额相对应。The sound pickup device according to any one of claims 1 to 6, wherein when the sound pickup device is a wearable device of a user's head, the main Mic includes a main Mic located at a front end of the wearable device Wherein the front end is arranged to correspond to a wearer's forehead.
  8. 根据权利要求7所述的拾音设备,其中,所述主Mic还包括位于所述可穿戴设备的顶端的主Mic,其中,所述顶端被设置为与佩戴者的头顶相对应。The sound pickup apparatus according to claim 7, wherein said main Mic further comprises a main Mic located at a top end of said wearable device, wherein said top end is disposed to correspond to a top of the wearer's head.
  9. 根据权利要求2所述的拾音设备,其中,当所述拾音设备为用户头部的可穿戴设备时,所述辅Mic包括位于所述可穿戴设备的两侧的辅Mic,所述两侧被设置为与佩戴者的双耳相对应。The sound pickup device according to claim 2, wherein when the sound pickup device is a wearable device of a user's head, the auxiliary Mic includes a secondary Mic located on both sides of the wearable device, the two The sides are arranged to correspond to the wearer's ears.
  10. 一种声音输出方法,包括:A method of sound output, comprising:
    利用拾音设备中的主麦克风Mic获取所述主Mic对应视角上的第一声音源的声音;Acquiring, by using the main microphone Mic in the sound collecting device, the sound of the first sound source on the main Mic corresponding angle of view;
    将所述主Mic获取的声音输出给与所述拾音设备连接的增强现实AR或虚拟现实VR设备。The sound acquired by the main Mic is output to an augmented reality AR or virtual reality VR device connected to the sound pickup device.
  11. 根据权利要求10所述的方法,其中,The method of claim 10, wherein
    在将所述主Mic获取的声音输出给与所述拾音设备连接的增强现实AR或虚拟现实VR设备之前,所述方法还包括:利用所述拾音设备中的辅Mic获取所述辅Mic对应视角上的第二声音源的声音,其中,所述辅Mic设置在所述主Mic的周围;Before outputting the sound acquired by the main Mic to the augmented reality AR or the virtual reality VR device connected to the sound collecting device, the method further comprises: acquiring the auxiliary Mic by using the auxiliary Mic in the sound collecting device Corresponding to the sound of the second sound source in the viewing angle, wherein the auxiliary Mic is disposed around the main Mic;
    将所述主Mic获取的声音输出给与所述拾音设备连接的增强现实AR或虚拟现实VR设备包括:将所述主Mic获取的声音和所述辅Mic获取的声音进行合成,并将合成后的声音输出给与所述拾音设备连接的所述AR或所述VR设备。Outputting the sound acquired by the main Mic to the augmented reality AR or the virtual reality VR device connected to the sound pickup device includes synthesizing the sound acquired by the main Mic and the sound acquired by the auxiliary Mic, and synthesizing The subsequent sound is output to the AR or the VR device connected to the pickup device.
  12. 根据权利要求10所述的声音输出方法,其中,利用拾音设备中的主Mic获取所述主Mic对应视角上的第一声音源的声音包括:The sound output method according to claim 10, wherein the acquiring the sound of the first sound source in the main Mic corresponding angle of view by using the main Mic in the sound pickup device comprises:
    配置所述主Mic的拾音孔的孔径大小和/或拾音孔深度;Configuring a hole size of the sound hole of the main Mic and/or a depth of the sound hole;
    利用配置了所述孔径大小和/或拾音孔深度的主Mic获取所述主Mic对应视角上的第一声音源的声音。The sound of the first sound source at the main Mic corresponding angle of view is acquired by the main Mic configured with the aperture size and/or the pickup hole depth.
  13. 根据权利要求12所述的声音输出方法,其中,配置所述主Mic的拾音孔的孔径大小和/或拾音孔深度包括:The sound output method according to claim 12, wherein arranging the aperture size and/or the pickup depth of the sound collecting hole of the main Mic comprises:
    在接收到输入的配置信息的情况下,根据所述配置信息配置所述主Mic的拾音孔的孔径大小和/或拾音孔深度;When receiving the input configuration information, configuring an aperture size and/or a pickup depth of the sound collecting hole of the main Mic according to the configuration information;
    在未接收到输入的配置信息的情况下,根据预设值配置所述主Mic的拾音孔的孔径大小和/或拾音孔深度。In the case where the input configuration information is not received, the aperture size and/or the pickup hole depth of the pickup hole of the main Mic are configured according to a preset value.
  14. 根据权利要求10所述的声音输出方法,其中,在利用拾音设备中的主Mic获取所述主Mic对应视角上的第一声音源的声音之前,所述方法还包括:The sound output method according to claim 10, wherein the method further comprises: before acquiring the sound of the first sound source in the main Mic corresponding angle of view by using the main Mic in the sound pickup device, the method further comprising:
    确定所述拾音设备需要转动的方向及幅度;Determining a direction and a range in which the pickup device needs to be rotated;
    利用确定的所述拾音设备需要转动的方向及幅度控制所述拾音设备进行转动。The pickup device is controlled to rotate by using the determined direction and amplitude of the rotation of the pickup device.
  15. 根据权利要求10所述的声音输出方法,其中,在利用拾音设备中的主Mic获取所述主Mic对应视角上的第一声音源的声音之前,所述方法还包括:The sound output method according to claim 10, wherein the method further comprises: before acquiring the sound of the first sound source in the main Mic corresponding angle of view by using the main Mic in the sound pickup device, the method further comprising:
    确定所述主Mic需要转动的方向及幅度;Determining the direction and magnitude of rotation of the main Mic;
    利用确定的所述主Mic需要转动的方向及幅度控制所述主Mic进行转动。The main Mic is controlled to rotate by using the determined direction and amplitude of the main Mic to be rotated.
  16. 一种声音输出装置,包括:A sound output device comprising:
    获取模块,设置为利用拾音设备中的主麦克风Mic获取所述主Mic对应视角上的第一声音源的声音;Obtaining a module, configured to acquire, by using the main microphone Mic in the sound collecting device, a sound of the first sound source in the main Mic corresponding angle of view;
    输出模块,设置为将所述主Mic获取的声音输出给与所述拾音设备连接的增强现实AR或虚拟现实VR设备。And an output module configured to output the sound acquired by the main Mic to an augmented reality AR or virtual reality VR device connected to the sound pickup device.
  17. 根据权利要求16所述的声音输出装置,其中,所述获取模块通过如下方式利用拾音设备中的主Mic获取所述主Mic对应视角上的第一声音源的声音:The sound output device according to claim 16, wherein the acquisition module acquires the sound of the first sound source at the main Mic corresponding angle of view by using the main Mic in the sound pickup device as follows:
    配置所述主Mic的拾音孔的孔径大小和/或拾音孔深度;Configuring a hole size of the sound hole of the main Mic and/or a depth of the sound hole;
    利用配置了所述孔径大小和/或拾音孔深度的主Mic获取所述主Mic对应视角上的第一声音源的声音。The sound of the first sound source at the main Mic corresponding angle of view is acquired by the main Mic configured with the aperture size and/or the pickup hole depth.
  18. 一种存储介质,所述存储介质中存储有计算机程序,其中,所述计算机程序被设置为运行时执行权利要求10至15中任一项所述的方法。A storage medium having stored therein a computer program, wherein the computer program is arranged to perform the method of any one of claims 10 to 15 at runtime.
  19. 一种电子装置,包括存储器和处理器,所述存储器中存储有计算机程序,所述处理器被设置为运行所述计算机程序以执行权利要求10至15中任一项所述的方法。An electronic device comprising a memory and a processor, the memory storing a computer program, the processor being arranged to run the computer program to perform the method of any one of claims 10 to 15.
PCT/CN2019/075489 2018-03-13 2019-02-19 Adapterization equipment, voice output method, device, storage medium and electronic device WO2019174442A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201810206356.9A CN110278512A (en) 2018-03-13 2018-03-13 Pick up facility, method of outputting acoustic sound, device, storage medium and electronic device
CN201810206356.9 2018-03-13

Publications (1)

Publication Number Publication Date
WO2019174442A1 true WO2019174442A1 (en) 2019-09-19

Family

ID=67907383

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/075489 WO2019174442A1 (en) 2018-03-13 2019-02-19 Adapterization equipment, voice output method, device, storage medium and electronic device

Country Status (2)

Country Link
CN (1) CN110278512A (en)
WO (1) WO2019174442A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110716178A (en) * 2019-09-17 2020-01-21 苏宁智能终端有限公司 Full sound field oriented sound source positioning method and device

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20070027379A (en) * 2005-09-06 2007-03-09 엘지전자 주식회사 Microphone and the control apparatus and the method thereof
CN103916723A (en) * 2013-01-08 2014-07-09 联想(北京)有限公司 Sound acquisition method and electronic equipment
CN104065951A (en) * 2013-12-26 2014-09-24 北京金山网络科技有限公司 Video shooting method, video playing method and intelligent glasses
CN106162206A (en) * 2016-08-03 2016-11-23 北京疯景科技有限公司 Panorama recording, player method and device
CN106486147A (en) * 2015-08-26 2017-03-08 华为终端(东莞)有限公司 The directivity way of recording, device and sound pick-up outfit
CN106657769A (en) * 2016-11-02 2017-05-10 刘宏晋 Photographing method and system for virtual reality videos

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9313599B2 (en) * 2010-11-19 2016-04-12 Nokia Technologies Oy Apparatus and method for multi-channel signal playback
CN206532076U (en) * 2017-02-22 2017-09-29 上海金蓝络科技信息系统股份有限公司 A kind of intelligent VR pan-shots machine
CN106878877A (en) * 2017-03-23 2017-06-20 南京邮电大学 The method and system of surround sound are provided the user under VR experience scenes

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20070027379A (en) * 2005-09-06 2007-03-09 엘지전자 주식회사 Microphone and the control apparatus and the method thereof
CN103916723A (en) * 2013-01-08 2014-07-09 联想(北京)有限公司 Sound acquisition method and electronic equipment
CN104065951A (en) * 2013-12-26 2014-09-24 北京金山网络科技有限公司 Video shooting method, video playing method and intelligent glasses
CN106486147A (en) * 2015-08-26 2017-03-08 华为终端(东莞)有限公司 The directivity way of recording, device and sound pick-up outfit
CN106162206A (en) * 2016-08-03 2016-11-23 北京疯景科技有限公司 Panorama recording, player method and device
CN106657769A (en) * 2016-11-02 2017-05-10 刘宏晋 Photographing method and system for virtual reality videos

Also Published As

Publication number Publication date
CN110278512A (en) 2019-09-24

Similar Documents

Publication Publication Date Title
US11706582B2 (en) Calibrating listening devices
US10939225B2 (en) Calibrating listening devices
US9706292B2 (en) Audio camera using microphone arrays for real time capture of audio images and method for jointly processing the audio images with video images
CN106134223B (en) Reappear the audio signal processing apparatus and method of binaural signal
US10257630B2 (en) Computer program and method of determining a personalized head-related transfer function and interaural time difference function
CN111294724B (en) Spatial repositioning of multiple audio streams
US20150189455A1 (en) Transformation of multiple sound fields to generate a transformed reproduced sound field including modified reproductions of the multiple sound fields
US11109177B2 (en) Methods and systems for simulating acoustics of an extended reality world
US10652686B2 (en) Method of improving localization of surround sound
US11871210B2 (en) Sharing locations where binaural sound externally localizes
Braasch et al. A binaural model that analyses acoustic spaces and stereophonic reproduction systems by utilizing head rotations
US11979735B2 (en) Apparatus, method, sound system
WO2019174442A1 (en) Adapterization equipment, voice output method, device, storage medium and electronic device
Geronazzo et al. Acoustic selfies for extraction of external ear features in mobile audio augmented reality
CN110620982A (en) Method for audio playback in a hearing aid
CN111246345B (en) Method and device for real-time virtual reproduction of remote sound field
CN108574925A (en) The method and apparatus that audio signal output is controlled in virtual auditory environment
RU2721571C1 (en) Method of receiving, displaying and reproducing data and information
WO2023061130A1 (en) Earphone, user device and signal processing method
CN110716178A (en) Full sound field oriented sound source positioning method and device
US20220312144A1 (en) Sound signal generation circuitry and sound signal generation method
Atbas Real-Time Immersive Audio Featuring Facial Recognition and Tracking

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19766454

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS (EPO FORM 1205A DATED 25.01.2021)

122 Ep: pct application non-entry in european phase

Ref document number: 19766454

Country of ref document: EP

Kind code of ref document: A1