WO2017185663A1 - Method and device for increasing reverberation - Google Patents

Method and device for increasing reverberation Download PDF

Info

Publication number
WO2017185663A1
WO2017185663A1 PCT/CN2016/102026 CN2016102026W WO2017185663A1 WO 2017185663 A1 WO2017185663 A1 WO 2017185663A1 CN 2016102026 W CN2016102026 W CN 2016102026W WO 2017185663 A1 WO2017185663 A1 WO 2017185663A1
Authority
WO
WIPO (PCT)
Prior art keywords
signal
reverberation
target
rir
module
Prior art date
Application number
PCT/CN2016/102026
Other languages
French (fr)
Chinese (zh)
Inventor
徐德著
赵翔宇
霍夫曼帕布洛
Original Assignee
华为技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 华为技术有限公司 filed Critical 华为技术有限公司
Publication of WO2017185663A1 publication Critical patent/WO2017185663A1/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S5/00Pseudo-stereo systems, e.g. in which additional channel signals are derived from monophonic signals by means of phase shifting, time delay or reverberation 
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S7/00Indicating arrangements; Control arrangements, e.g. balance control
    • H04S7/30Control circuits for electronic adaptation of the sound field
    • H04S7/305Electronic adaptation of stereophonic audio signals to reverberation of the listening space
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10KSOUND-PRODUCING DEVICES; METHODS OR DEVICES FOR PROTECTING AGAINST, OR FOR DAMPING, NOISE OR OTHER ACOUSTIC WAVES IN GENERAL; ACOUSTICS NOT OTHERWISE PROVIDED FOR
    • G10K15/00Acoustics not otherwise provided for
    • G10K15/08Arrangements for producing a reverberation or echo sound
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2420/00Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/01Enhancing the perception of the sound image or of the spatial distribution using head related transfer functions [HRTF's] or equivalents thereof, e.g. interaural time difference [ITD] or interaural level difference [ILD]

Definitions

  • the present invention relates to the field of communications technologies, and in particular, to a method and apparatus for increasing reverberation.
  • the VR audio effect mainly uses the head related transfer function HRTF (Head Related Transfer Function) technology combined with the head tracking to realize the sound source does not follow the head effect, and simulates the real environment audio experience.
  • HRTF Head Related Transfer Function
  • the frequency domain signal can be obtained by the fast Fourier transform FFT, that is, the HRTF signal, and the HRTF signal can be obtained by the IFFT signal.
  • FFT Fast Fourier transform
  • the HRTF signal can be obtained by the IFFT signal.
  • the VR audio system generally includes an HRIR or HRTF database, such as a well-known database such as CIPIC (Center for Image Processing and Integrated Computing), which includes HRTF/HRIR data corresponding to different orientations on the three-dimensional spherical surface.
  • the sound wave emitted by the sound source is diffracted by the head, the auricle, the trunk, etc., and the physical process can be regarded as a linear time-invariant acoustic filtering system. Its characteristics can be described by the frequency domain transfer function of the system, which is defined as Head related transfer function. It reflects the filtering effect of the outer ear, head and torso on the sound signal when the orientation of the sound signal entering the human ear is different. Therefore the HRTF data contains direction information.
  • FIG. 1 A common audio processing technique is shown in FIG. 1.
  • the HRIR database is converted into an HRTF database in advance, and the corresponding HRTF data is selected according to the orientation information obtained by the head-tracking processing.
  • the HRTF data is multiplied by the FFT transformation result of the input time domain audio signal such as the video track, and then the IFFT is converted into the time domain signal and the time domain signal is used in the earphone. Play to the user.
  • this kind of processing technology only realizes the sound with a sense of direction, and the effects of the head effect and the sound source distance sense inherent in the HRTF technology are still defective.
  • the so-called head-in effect is specifically when listening to music. As if vocals and instruments exist in the brain rather than in space, this effect should be minimized or avoided.
  • the head external effect means that when the user listens to music with headphones, the listener seems to be in the center of a concert hall, and he is surrounded by the sounds of various instruments around him. This effect should be Enhanced.
  • the head effect and the head external effect are the same, such as increasing the head external effect and reducing the head effect. On this basis, users also need to be able to feel the proximity of different sound sources, and this effect should also be enhanced.
  • a short acoustic test signal is generally transmitted through the speaker indoors. After a series of reflections, the sound is picked up by the microphone, and the indoor space can be obtained by specific processing for different types of test signals.
  • the time domain signal of the reverb is generally transmitted through the speaker indoors.
  • the Dolby processing is a process of adding late reverberation based on the technique shown in FIG. 1.
  • Dolby uses the well-known FDN (Feedback Delay Network) method to add post-reverberation solution to the head effect.
  • FDN Field Delay Network
  • HRTF convolution processing is performed on each channel of the multi-channel input, while the next reverberation is performed.
  • the channel does at least one FDN processing. A specific example is shown in FIG.
  • the input signal is a multi-channel time domain signal such as a video track; and the signal of each channel in the input signal is convoluted by the direct response pre-reflection processing module 200 and the HRTF to obtain a corresponding
  • the time domain two-channel output signal directly adds the outputs of the different channels to obtain the first two-channel output signal; adds the different channel signals in the input signal, and reverberates through the module under the reverberation module 201 Into a mono signal, the mono signal is processed by the FDN module (220) to obtain a second two-channel output signal; the first two-channel output signal and the second two-channel output signal are mixed
  • the ringing module 210 adds reverberation to obtain the final output signal, and feeds back to the user with the earphone.
  • FDN For the FDN module (220), at least one FDN module operation is required, and each FDN module needs all-pass filtering, multiple delay processing, and each audio time domain sample needs to perform a matrix operation (see patent WO 2015/102920 A1). Therefore, the calculation modules and parameter settings involved are very In addition, FDN is artificial reverberation, different scenes need to set different FDN parameters, so it can not be flexibly applied to a variety of application scenarios; the effect is difficult to be consistent with the real environment reverberation; and the control function of distance sense is not realized.
  • embodiments of the present invention provide a method and apparatus for increasing reverberation.
  • an embodiment of the present invention provides a method for generating reverberation: acquiring a room impulse response RIR of an indoor environment, the RIR characterizing time domain information of the reverberation in the indoor; and determining a target head related impulse response HRIR in the audio system Data, and calculating energy and length of the target HRIR data; normalizing the RIR according to the energy of the target HRIR data to obtain a first signal; and performing fade-in processing on the first signal according to the length of the target HRIR data Two signals; determining distance information, wherein the distance information is used to represent a distance of the virtual sound source from the user; and the second signal is energy-adjusted according to the distance information to generate a reverberation signal.
  • the device can easily acquire the RIR through its own speaker and microphone, eliminating the need for external equipment; processing the resulting sampled reverb, adding energy normalization and gain control combined with distance information, can enhance the head in the audio system External sense, and different distance senses are generated by distance information.
  • an embodiment of the present invention provides a method for adding reverberation: the foregoing method is applied to a VR device, where the VR device includes a head sensor, where the head sensor is used to obtain head orientation sensing information; : acquiring a room impulse response RIR of the indoor environment, the RIR characterizing the time domain information of the reverberation in the indoor; determining the target head related impulse response HRIR data in the audio system, and calculating the energy and length of the target HRIR data; according to the target HRIR The energy of the data is normalized to obtain the first signal; the first signal is fade-in processed according to the length of the target HRIR data to obtain a second signal; the distance information is determined, and the distance information is used to represent the virtual sound source distance.
  • the VR device can easily obtain the RIR through the speaker and microphone provided by the smart terminal or the smart terminal, and no external device is needed; the obtained sample reverberation is processed, and the energy normalization and the gain control of the combined distance information are added.
  • the head external sense in the VR audio system can be enhanced, and different distance senses are generated by the distance information.
  • the obtaining the room impulse response RIR of the current indoor environment comprises: transmitting an acoustic reference test signal, collecting the reflected signal of the acoustic reference test signal; and performing the reflected signal Sampling to obtain the above RIR.
  • the emission test signal may be a preset empirical reference test signal. This makes it easier to get reverb information from the room.
  • the determining the target head related impulse response HRIR data in the audio system comprises: selecting the largest absolute value of the sample points in all HRIR data waveforms in the audio system HRIR database.
  • the HRIR data is used as the target head related impulse response HRIR data. Because the HRIR is for energy normalization, selecting the largest absolute value of the sampling point normalizes the RIR to an energy level that best matches the HRIR database.
  • an embodiment of the present invention provides an apparatus for generating reverberation, the apparatus comprising: an acquisition module, configured to acquire a room impulse response RIR of an indoor environment, where the RIR characterizes time domain information of the reverberation in the indoor; a determining module, configured to determine a target head related impulse response HRIR data in the audio system, and calculate energy and length of the target HRIR data; a first calculating module, configured to The energy of the target HRIR data calculated by the module is normalized to obtain the first signal by the RIR obtained by the obtaining module, and the second calculating module is configured to calculate the length of the target HRIR data calculated by the first determining module.
  • the first signal is subjected to the fade-in processing to obtain the second signal; the second determining module is configured to determine the distance information, where the distance information is used to represent the distance of the virtual sound source from the user; and the third calculating module is configured to determine according to the second determining module.
  • the distance information is energy-adjusted to the second signal calculated by the second calculation module to generate a reverberation signal.
  • the device can easily acquire the RIR through its own speaker and microphone, eliminating the need for external equipment; processing the resulting sampled reverb, adding energy normalization and gain control combined with distance information, can enhance the head in the audio system External sense, and different distance senses are generated by distance information.
  • an embodiment of the present invention provides a device for generating reverberation, the device being applied to a VR device, where the VR device includes a head sensor, where the head sensor is used to obtain head orientation sensing information; And an obtaining module, configured to obtain a room impulse response RIR of the indoor environment, wherein the RIR characterizes time domain information of the reverberation in the indoor; the first determining module is configured to determine a target head related impulse response HRIR data in the audio system, and calculate the above a first calculation module, configured to normalize the RIR obtained by the obtaining module according to the energy of the target HRIR data calculated by the first determining module to obtain a first signal; and the second computing module And performing a fade-in process on the first signal according to the length of the target HRIR data calculated by the first determining module to obtain a second signal; and a second determining module, configured to determine distance information, where the distance information is used to represent the virtual sound source a distance from the user; a third calculation module, configured to
  • the acquiring module is specifically configured to: transmit an acoustic reference test signal, collect a reflected signal of the acoustic reference test signal; and sample the reflected signal to obtain the RIR.
  • the emission test signal may be a preset empirical reference test signal. This makes it easier to get reverb information from the room.
  • the first determining module is specifically configured to: select, as the target, the maximum HRIR data of the absolute values of the samples in all HRIR data waveforms in the audio system HRIR database.
  • the head related impulse responds to HRIR data.
  • an embodiment of the present invention provides an apparatus for enhancing reverberation, where the apparatus includes: a processor, a memory, a transceiver, and a bus system; wherein the processor and the memory and the transceiver are connected by using the bus system,
  • the above memory is used for storing programs and instructions
  • the transceiver is for transmitting and receiving audio signals
  • the processor executes any of the above possible implementation methods by calling the programs and instructions in the above memory.
  • the RIR of the current room can be easily obtained by using the speaker and the microphone of the device without using other audio collection and testing equipment, and the mathematical method of the RIR is processed to obtain an audio system.
  • a reverberation signal that can be used to convolute the input signal of the audio so that the output signal can increase the head external effect of the user, mitigate the effect in the head, and create artificial reverberation without additionally setting complicated parameter information.
  • the invention can bring a better audio experience to the user.
  • FIG. 2 is a schematic diagram of signal processing of a late reverberation
  • FIG. 3 is a schematic diagram of an application device according to an embodiment of the present invention.
  • FIG. 4 is a schematic diagram of an application scenario combined with specific hardware according to an embodiment of the present invention.
  • FIG. 5 is a schematic diagram of an audio processing process for adding reverberation according to an embodiment of the present invention
  • FIG. 6 is a flowchart of a method for generating a reverberation signal according to an embodiment of the present invention
  • FIG. 7 is a schematic diagram of an apparatus for generating reverberation according to an embodiment of the present invention.
  • FIG. 8 is a schematic diagram of an apparatus for increasing reverberation according to an embodiment of the present invention.
  • FIG. 9 is a schematic structural diagram of an apparatus for generating reverberation according to an embodiment of the present invention.
  • Embodiments of the present invention provide a method and apparatus for increasing reverberation.
  • the application scenario of the present invention is a smart wearable device, a smart audio and video system, and the like, such as a headband VR device, which includes a smart terminal device, such as a smart terminal such as a mobile phone or a tablet (hereinafter, a mobile phone is used as an example, if the VR device includes a function)
  • a smart terminal device such as a smart terminal such as a mobile phone or a tablet
  • Other terminals equivalent to the mobile phone should also belong to the protection scope of the present invention.
  • the reverberation command can be issued through the mobile phone to obtain the reverberation information of the current indoor space, and according to the current indoor
  • the reverberation information is post-processed to the sound in the VR device, and the processed sound is returned to the user through the earphone.
  • the device to which the present invention is applied is a smart terminal combined with the VR device 300, such as a smart phone 304.
  • the smart phone includes a speaker 306 and a microphone 305, and the mobile phone is inserted into the VR device.
  • a VR display device As well as an audio signal processing device, a pair of connected VR devices
  • the earphone device 302 is configured to play a virtual audio signal to the user.
  • the smart phone also includes a storage unit 307 and a computing unit 308.
  • the control unit 301 in the VR device accepts a user command. When the user triggers the update of the reverberation command, the smart phone 304 reverbers to the real environment room through the microphone 305 and the speaker 306.
  • the measurement is performed and processed by the calculation unit 308 to obtain a late reverberation signal, which is stored in the storage unit 307 together with the HRTF database.
  • computing unit 308 is responsible for the operation of the VR audio system, and the output time domain audio signal is ultimately played back to the user with headset device 302.
  • the VR device 300 and the smart phone 304 can be connected by wire.
  • the VR device 300 and the smart phone 304 are integrated in this case; the VR device 300 and the smart phone 304 can also pass Bluetooth, Wifi,
  • the wireless communication method such as infrared performs wireless connection communication.
  • the VR device 300 and the smart phone 304 are separated in this case.
  • the present invention utilizes a speaker and a microphone on a smart phone that is used with a VR device, and does not require additional test equipment. It is possible to quickly measure and update the current real environment reverberation information without having to make a lot of reverberation parameter settings. After obtaining the real room reverberation information, a series of processes in the present invention are performed to obtain a usable late reverberation signal, and a usable late reverberation signal is obtained, and further gain control is combined with distance information for use in VR audio processing.
  • FIG. 4 A schematic diagram of an application scenario combining specific hardware is shown in FIG. 4: 10 is a smart phone with a signal control and processing unit, 11 is a microphone on a smart phone, 12 is a speaker on a smart phone, and 13 is a head mounted VR device.
  • the input signal is a multi-channel time domain signal such as a video track; the processing flow is as follows:
  • the output signal S1 and the output signal S3 are summed and reverberated by the reverberation module 504 to obtain a final output signal S4, which is fed back to the user with the earphone.
  • step 2) is a key step of the present invention, and how to obtain the late reverberation signal S2 will be described below in a more detailed embodiment. See Figure 6 for the method steps for obtaining the late reverberation signal S2. Specific steps are as follows:
  • the VR device is used to transmit an acoustic reference test signal, such as a test signal S610 such as TSP/MLS, with the speaker of the mobile phone; and the test signal is used to measure the room reverberation.
  • an acoustic reference test signal such as a test signal S610 such as TSP/MLS
  • the TSP signal refers to a well-known signal that responds to the measurement signal, and can be stored in a storage device in a format such as wav, and is played by a smartphone speaker, and the sound wave is reflected by various obstacles in the room, thereby reflecting the reverberation information of the room.
  • test signal is not limited to the TSP signal, and may be a maximum length sequence (MLS) or a sine sweep signal; the issuance of the test signal is controlled by the user.
  • MLS maximum length sequence
  • sine sweep signal the issuance of the test signal is controlled by the user.
  • step 620 the VR device is used to match the microphone of the mobile phone or the microphone of the VR device to pick up the acoustic feedback signal corresponding to the microphone pickup test signal S610 provided by the acoustic echo signal device, and the room impulse response S620 is calculated by the calculation unit.
  • the above acoustic feedback signal includes signals that the sound waves are reflected by different obstacles and different times.
  • the RIR room impulse response is calculated by the calculation unit, which represents the time domain waveform of the room reverberation information.
  • the RIR of different rooms have different room impulse responses, which can be recorded as R(n), where n is [0, N], N
  • the length of the sample of the finally selected RIR, such as N 4096; this calculation method is a well-known method.
  • a two-dimensional waveform represents the RIR signal, the abscissa is the sample point, and the ordinate is the sample point value.
  • the sample points represent the sampling points of the audio signal. Taking 48000 Hz sampling as an example, the interval between samples is 1/48000 seconds, that is, there are 48000 samples in 1 second.
  • this step it is not limited to one microphone recording. If there are more than two available microphones, the two microphones whose materials are selected and are far away are preferentially selected.
  • the two microphone recording data is processed. When there is only one microphone, only one available post-reverberation signal can be obtained.
  • the VR audio processing can be applied to the left and right channels; when there are two microphones, there are two available post-reverberation signals. No., respectively, they are applied to the left and right channels, because the signals collected by the two microphones are not completely the same, the correlation of the reverberation is reduced, and the overall stereoscopic effect will be enhanced, but this will require two steps of the S630-S650 calculation in this scheme. . For three microphones, it is not necessary to perform a three-way operation to select two microphone signals with the closest performance and the farthest distance from the microphone.
  • Step 630 Perform energy normalization processing on the room impulse response S620 by the calculating unit according to the HRIR data energy used by the VR audio system to obtain a signal S630.
  • the target HRIR data of the maximum value of the absolute values of the samples in all HRIR data waveforms in the HRIR database is statistically obtained, and the sample with the largest absolute value of the sample points in the target HRIR data.
  • the sampled position N2 corresponding to the maximum value of the absolute value of the sampled impulse response of the RIR room is statistically recorded, and the recording response peak point is aligned according to the peak sample of the HRIR data.
  • the front (N2-N1) samples of the RIR data need to be deleted to ensure time synchronization.
  • the aligned RIR is:
  • R 1 (n) R(n+N2-N1) n ⁇ [1,N]
  • the HRIR after alignment processing and the RIR after alignment processing samples of the same length are intercepted, and the length of the intercept depends on the length used in the HRIR database of the VR audio system.
  • the HRIR samples [N1, N1+511] are intercepted, and the RIR samples [N2', N2'+511] are intercepted, that is, 512 samples are respectively intercepted, and the HRIR and the RIR intercepted portions are respectively calculated.
  • the energy normalization is performed by multiplying the aligned RIR by E1/E2. among them,
  • R 2 (n) (E1/E2)*R 1 (n) n ⁇ [1,N]
  • R 2 (n) is the signal S630.
  • Step 640 according to the HRIR data length adopted by the VR audio system, the calculation unit performs a fade-in process on the signal in the signal S630 to obtain a first late reverberation signal S640.
  • the purpose of this processing is to fade in, delete the direct sound and the pre-reflection part of the measurement reverb, leaving only the first late reverberation signal.
  • the fade in window function is:
  • LR(n) is the signal S640.
  • Step 650 Perform energy adjustment on the first late reverberation signal S640 signal by the calculating unit according to the distance information set by the distance sensor or the user, to obtain a processed second, late reverberation signal S650 available to the system, where the signal S650 is The above signal S2 is used for subsequent arithmetic processing by the VR audio system.
  • the unit is (m)
  • the distance of the virtual sound source can be set for the user, or the sound source distance information obtained by the distance sensor in the VR device.
  • the late reverberation signal gain is This gain is directly loaded on the late reverb signal.
  • LR 1 (n) is the signal S650.
  • the algorithm of each step has several forms of deformation, and those skilled in the art should implement the signal processing method based on the prior art and the equivalent algorithm without any creative improvement. It is within the scope of protection of the present invention. It is not enumerated in the present invention.
  • the RIR of the current room can be easily obtained by using the speaker and the microphone of the device without using other audio collection and testing equipment.
  • the RIR is processed by a certain mathematical method to obtain a reverberation signal usable by the audio system, and the reverberation signal can perform convolution operation on the input signal of the audio, so that the output signal can increase the head external effect of the user and reduce the head. Effect, no need to set complex parameter information to create artificial reverberation.
  • FIG. 7 is a diagram of a device for generating reverberation according to an embodiment of the present invention, which may be an intelligent device.
  • the terminal can also be a wearable device such as a VR device, or an audio processing module integrated in the product.
  • the device 700 includes:
  • the obtaining module 701 is configured to obtain a room impulse response RIR of the indoor environment, and the RIR represents time domain information of the reverberation in the current indoor.
  • the acquisition module can be implemented by a common wireless transceiver module, and the corresponding wireless transceiver mode can receive and transmit audio signals, which can be controlled by a specific application.
  • the first determining module 702 is configured to determine target head related impulse response HRIR data in the audio system, and calculate energy and length of the target HRIR data.
  • the HRIR data can be stored by a memory, and the first determining module 702 can be implemented in a processor.
  • the first calculating module 703 is configured to perform normalization processing on the RIR acquired by the obtaining module 701 according to the energy of the target HRIR data determined by the first determining module 702 to obtain a first signal.
  • the first computing module can be implemented in a processor.
  • the second calculating module 704 is configured to perform a fade-in process on the first signal according to the length of the target HRIR data calculated by the first calculating module 703 to obtain a second signal.
  • the second computing module can be implemented in the processor.
  • the second determining module 705 is configured to determine distance information, where the distance information is used to represent a distance of the virtual sound source from the user; and the second determining module 705 can be implemented by a processor.
  • the third calculating module 706 is configured to perform energy adjustment on the second signal calculated by the second calculating module 704 according to the distance information determined by the second determining module 705 to generate a reverberation signal.
  • the third calculation module 706 can be implemented by a processor.
  • FIG. 8 is a device for adding reverberation according to an embodiment of the present invention.
  • the device may be an intelligent terminal, a wearable device such as a VR device, or a set of audio systems.
  • the device 800 includes: an obtaining module 801, configured to acquire a room impulse response RIR of the indoor environment, and the RIR characterizes time domain information of the reverberation in the room; the acquiring module can be implemented by a common wireless transceiver module, and the corresponding wireless transceiver The mode can receive and transmit audio signals, which can be controlled by a specific application.
  • the first determining module 802 is configured to determine target head related impulse response HRIR data in the audio system, and calculate energy and length of the target HRIR data; the HRIR data may be stored by the memory.
  • the first calculating module 803 is configured to perform normalization processing on the RIR acquired by the obtaining module 801 according to the energy of the target HRIR data calculated by the first determining module 802 to obtain a first signal;
  • the second calculating module 804 is configured to perform a fade-in process on the first signal according to the length of the target HRIR data calculated by the first determining module 802 to obtain a second signal;
  • a second determining module 805, configured to determine distance information, where the distance information is used to represent a distance of the virtual sound source from the user;
  • the third calculating module 806 is configured to perform energy adjustment on the second signal calculated by the second calculating module 804 according to the distance information determined by the second determining module 805, to generate a reverberation signal;
  • the fourth calculating module 807 is configured to process the audio input signal according to the head related transfer function HRTF to obtain a first audio signal, where the HRTF is determined by the head orientation sensing information;
  • a fifth calculating module 808, configured to process the audio input signal according to the reverberation signal calculated by the third calculating module 806, to obtain a second audio signal;
  • the sixth calculation module 809 is configured to perform reverberation on the first audio signal calculated by the fourth calculation module 807 and the second audio signal calculated by the fifth calculation module 808 to obtain an output audio signal.
  • the above modules 802-809 can be implemented by the processor and complete the corresponding functions.
  • the acquiring module 701/801 is specifically configured to: transmit an acoustic reference test signal, and collect a reflected signal of the acoustic reference test signal; sample the reflected signal to obtain an RIR.
  • the first determining module 702/802 is specifically configured to: select, in the audio system HRIR database, the largest HRIR data of the absolute values of the sample points in all HRIR data waveforms as the target head related impulse response HRIR data.
  • the technical solution of the present invention provides a device for increasing reverberation, the device comprising the above modules 701-706, Or the above modules 801-808; the device is used to implement the implementation of the above steps 601-605 and an equivalent alternative implementation.
  • the device can easily access the RIR through its own speaker and microphone, eliminating the need for external equipment; processing the resulting sampled reverb, adding energy normalization and gain control combined with distance information, can enhance the head external sense in the audio system And generate a different sense of distance through the distance information.
  • FIG. 9 is a schematic structural diagram of an apparatus for generating reverberation according to an embodiment of the present invention.
  • the device 900 includes:
  • the processor 901 is configured to generate a corresponding operation control signal, send the corresponding component to the processing processing device, read and process the data in the software, and in particular, read and process the data and the program in the memory 902, so that each of the components
  • the function module performs the corresponding function, thereby controlling the corresponding component to act according to the requirements of the instruction.
  • the memory 902 is used to store programs and various data, and mainly stores software units such as operating systems, applications, and function instructions, or a subset thereof, or an extended set thereof.
  • a non-volatile random access memory may also be included, providing hardware, software, and data resources in the management computing processing device to the processor 901, supporting control software and applications.
  • the transceiver 903 is configured to collect, acquire or transmit information, especially an audio signal.
  • Each of the above hardware units can communicate via a bus connection.
  • the processor 901 controls the transceiver 903 to acquire the room impulse response RIR of the indoor environment by calling a program or instruction stored in the memory 902, and the processor 301 executes steps 610-650 in the above embodiment to generate an RIR signal.
  • Reverb signal available in the audio system.
  • the implementation of the device 900 may be a smart terminal, or may be an intelligent audio system such as a VR device or a smart headset, a smart audiovisual system, a wearable home theater, and the like.

Abstract

Provided in embodiments of the present invention are a method and device for generating reverberation. A room impulse response (RIR) of an interior environment is acquired, the RIR expressing time-domain information of reverberation of the interior; target head-related impulse response (HRIR) data in an audio system is determined, and the energy and length of the target HRIR data are calculated; the RIR is normalized on the basis of the energy of the target HRIR data to produce a first signal; the first signal is faded in on the basis of the length of the target HRIR data to produce a second signal; distance information is determined, the distance information being used for expressing the distance at which a virtual audio source is distanced from a user; and the energy of the second signal is adjusted on the basis of the distance information to produce a late reverberation signal. Facilitated is the production of true sampling reverberation in a timely manner; at the same time, the sampling reverberation produced is processed, gain control that is energy normalized and combined with distance information is added, extracephalic sensations in an audio system can be intensified, and different perceptions of distance can be produced with the distance information.

Description

一种增加混响的方法与装置Method and device for increasing reverberation 技术领域Technical field
本发明涉及通信技术领域,尤其涉及一种增加混响的方法与装置。The present invention relates to the field of communications technologies, and in particular, to a method and apparatus for increasing reverberation.
背景技术Background technique
当前虚拟现实领域研究非常广泛,其中虚拟音频方面的体验也越来越被重视。目前VR音频效果主要是利用头头相关传递函数HRTF(Head Related Transfer Function)技术结合头部跟踪来实现音源不随头动效果,模拟真实环境音频体验。The current research in the field of virtual reality is very extensive, and the experience of virtual audio is getting more and more attention. At present, the VR audio effect mainly uses the head related transfer function HRTF (Head Related Transfer Function) technology combined with the head tracking to realize the sound source does not follow the head effect, and simulates the real environment audio experience.
对于一个时域信号-头相关脉冲响应HRIR(Head Related Impulse Response),经快速傅里叶变换FFT就可以得到频域信号,也即HRTF信号,反之HRTF信号经过IFFT就可以得到HRIR信号,两者可视为等价关系。VR音频系统中一般都包含一个HRIR或者HRTF数据库,如CIPIC(Center for Image Processing and Integrated Computing)等公知数据库,它包含了三维球面上不同方位对应的HRTF/HRIR数据。For a time domain signal-head related impulse response HRIR (Head Related Impulse Response), the frequency domain signal can be obtained by the fast Fourier transform FFT, that is, the HRTF signal, and the HRTF signal can be obtained by the IFFT signal. Can be regarded as an equivalence relationship. The VR audio system generally includes an HRIR or HRTF database, such as a well-known database such as CIPIC (Center for Image Processing and Integrated Computing), which includes HRTF/HRIR data corresponding to different orientations on the three-dimensional spherical surface.
声源发出的声波经头部、耳廓、躯干等衍射后到达双耳,其中的物理过程可视为一个线性时不变的声滤波系统,其特性可由系统的频域传输函数描述,定义为头部相关的传递函数。它反映了进入人耳的声音信号的方位不同时,外耳、头部、躯干对声音信号的滤波作用。因此HRTF数据包含了方向信息。The sound wave emitted by the sound source is diffracted by the head, the auricle, the trunk, etc., and the physical process can be regarded as a linear time-invariant acoustic filtering system. Its characteristics can be described by the frequency domain transfer function of the system, which is defined as Head related transfer function. It reflects the filtering effect of the outer ear, head and torso on the sound signal when the orientation of the sound signal entering the human ear is different. Therefore the HRTF data contains direction information.
一种常见的音频处理技术如图1所示,预先将HRIR数据库使用FFT转换为HRTF数据库,根据头部跟踪处理(Head-tracking processing)得到的方位信息,选取对应的HRTF数据。根据时域卷积等价于频域乘积的原理,将HRTF数据与视频音轨等输入时域音频信号的FFT变换结果相乘,再进行IFFT变换为时域信号并将此时域信号用耳机播放给用户。A common audio processing technique is shown in FIG. 1. The HRIR database is converted into an HRTF database in advance, and the corresponding HRTF data is selected according to the orientation information obtained by the head-tracking processing. According to the principle that the time domain convolution is equivalent to the frequency domain product, the HRTF data is multiplied by the FFT transformation result of the input time domain audio signal such as the video track, and then the IFFT is converted into the time domain signal and the time domain signal is used in the earphone. Play to the user.
然而,这类处理技术仅仅实现了具有方向感的声音,HRTF技术固有的头中效应、声源距离感等方面效果还存在缺陷。所谓头中效应具体表现就是听音乐时, 仿佛人声、乐器存在于脑中而非空间中,这种效应应当尽量减小或避免。除了头中效应还有头外部效应,头外部效应即表示用户用耳机听音乐时,听者仿佛置身于一个音乐厅的中心位置,自己被周围(外界)的各个乐器声包围,这种效应应当增强。相对应的,头中效应与头外部效应是此消彼长的,如提高头外部效应同时也会降低头中效应。在此基础上,用户还需要能感觉到不同声源的远近,这种效应也是应当增强的。However, this kind of processing technology only realizes the sound with a sense of direction, and the effects of the head effect and the sound source distance sense inherent in the HRTF technology are still defective. The so-called head-in effect is specifically when listening to music. As if vocals and instruments exist in the brain rather than in space, this effect should be minimized or avoided. In addition to the head effect and the head external effect, the head external effect means that when the user listens to music with headphones, the listener seems to be in the center of a concert hall, and he is surrounded by the sounds of various instruments around him. This effect should be Enhanced. Correspondingly, the head effect and the head external effect are the same, such as increasing the head external effect and reducing the head effect. On this basis, users also need to be able to feel the proximity of different sound sources, and this effect should also be enhanced.
声波在室内传播时,要被墙壁、天花板、地板等障碍物反射,每反射一次都要被障碍物吸收一些。这样,当声源停止发声后,声波在室内要经过多次反射和吸收,最后才消失,我们就感觉到声源停止发声后声音还继续一段时间。这种现象叫做混响。混响包含了室内的空间信息,因此添加混响处理能够增强头外部效应,削弱头中效应。When sound waves are transmitted indoors, they must be reflected by obstacles such as walls, ceilings, and floors. Each reflection must be absorbed by obstacles. In this way, when the sound source stops sounding, the sound wave is reflected and absorbed multiple times in the room, and finally disappears. We feel that the sound continues for a while after the sound source stops sounding. This phenomenon is called reverberation. Reverb contains spatial information in the room, so adding reverberation can enhance the head external effect and weaken the head effect.
对于一个室内混响的测量,一般是在室内通过扬声器发射一个短暂的声学测试信号,声音经过一系列的反射后,被麦克风拾取,针对不同类型的测试信号经过特定处理就可以得到这个室内空间的混响的时域信号。For the measurement of an indoor reverberation, a short acoustic test signal is generally transmitted through the speaker indoors. After a series of reflections, the sound is picked up by the microphone, and the indoor space can be obtained by specific processing for different types of test signals. The time domain signal of the reverb.
Dolby处理方式为在图1所示技术基础之上添加了后期混响的处理。Dolby采用公知的FDN(Feedback delay network)方法添加后期混响解决头中效应,在一些实例中,对多声道输入的每一个声道都进行HRTF卷积处理,同时对下混响后的单声道做至少一次FDN处理。一种具体实例如图2所示:输入信号为视频音轨等多声道时域信号;将输入信号中每一声道的信号通过直接响应前期反射处理模块200与HRTF进行卷积,得到对应的时域双声道输出信号,将不同声道的输出直接相加得到第一路双声道输出信号;将输入信号中的不同声道信号相加,通过模块下混响模块201下混响成一个单声道信号,此单声道信号经过FDN模块(220)处理,得到第二路双声道输出信号;将第一路双声道输出信号和第二路双声道输出信号用混响模块210相加混响得到最终输出信号,并用耳机反馈给用户。The Dolby processing is a process of adding late reverberation based on the technique shown in FIG. 1. Dolby uses the well-known FDN (Feedback Delay Network) method to add post-reverberation solution to the head effect. In some instances, HRTF convolution processing is performed on each channel of the multi-channel input, while the next reverberation is performed. The channel does at least one FDN processing. A specific example is shown in FIG. 2: the input signal is a multi-channel time domain signal such as a video track; and the signal of each channel in the input signal is convoluted by the direct response pre-reflection processing module 200 and the HRTF to obtain a corresponding The time domain two-channel output signal directly adds the outputs of the different channels to obtain the first two-channel output signal; adds the different channel signals in the input signal, and reverberates through the module under the reverberation module 201 Into a mono signal, the mono signal is processed by the FDN module (220) to obtain a second two-channel output signal; the first two-channel output signal and the second two-channel output signal are mixed The ringing module 210 adds reverberation to obtain the final output signal, and feeds back to the user with the earphone.
其中,对于FDN模块(220),需要进行至少一个FDN模块的运算,每个FDN模块均需要全通滤波,多路延时处理,每个音频时域样点需要进行一个矩阵运算(可参见专利WO 2015/102920 A1)。因此涉及到的计算模块和参数设置都很 复杂;此外,FDN为人工混响,不同场景需要设置不同的FDN参数,因此不能灵活适用多种应用场景;效果难以与真实环境混响一致;并且没有实现对距离感的控制功能。For the FDN module (220), at least one FDN module operation is required, and each FDN module needs all-pass filtering, multiple delay processing, and each audio time domain sample needs to perform a matrix operation (see patent WO 2015/102920 A1). Therefore, the calculation modules and parameter settings involved are very In addition, FDN is artificial reverberation, different scenes need to set different FDN parameters, so it can not be flexibly applied to a variety of application scenarios; the effect is difficult to be consistent with the real environment reverberation; and the control function of distance sense is not realized.
发明内容Summary of the invention
有鉴于此,本发明实施例提供了一种增加混响的方法与装置。In view of this, embodiments of the present invention provide a method and apparatus for increasing reverberation.
第一方面,本发明实施例提供了一种生成混响的方法:获取室内环境的房间脉冲响应RIR,上述RIR表征上述室内的混响的时域信息;确定音频系统中目标头相关脉冲响应HRIR数据,并计算上述目标HRIR数据的能量和长度;根据上述目标HRIR数据的能量对上述RIR进行归一化处理得到第一信号;根据上述目标HRIR数据的长度对上述第一信号进行淡入处理得到第二信号;确定距离信息,上述距离信息用于表征虚拟声源距离用户的距离;根据上述距离信息对上述第二信号进行能量调整,生成混响信号。通常,设备可以通过自带的扬声器和麦克风轻松获取到RIR,无需外部设备;对得到的采样混响进行处理,加入了能量归一化与结合距离信息的增益控制,可以增强音频系统中的头外部感,并且通过距离信息产生不同的距离感。In a first aspect, an embodiment of the present invention provides a method for generating reverberation: acquiring a room impulse response RIR of an indoor environment, the RIR characterizing time domain information of the reverberation in the indoor; and determining a target head related impulse response HRIR in the audio system Data, and calculating energy and length of the target HRIR data; normalizing the RIR according to the energy of the target HRIR data to obtain a first signal; and performing fade-in processing on the first signal according to the length of the target HRIR data Two signals; determining distance information, wherein the distance information is used to represent a distance of the virtual sound source from the user; and the second signal is energy-adjusted according to the distance information to generate a reverberation signal. In general, the device can easily acquire the RIR through its own speaker and microphone, eliminating the need for external equipment; processing the resulting sampled reverb, adding energy normalization and gain control combined with distance information, can enhance the head in the audio system External sense, and different distance senses are generated by distance information.
第二方面,本发明实施例提供了一种增加混响的方法:上述方法应用于VR设备,上述VR设备包括头部传感器,上述头部传感器用于获得头部方位传感信息;上述方法包括:获取室内环境的房间脉冲响应RIR,上述RIR表征上述室内的混响的时域信息;确定音频系统中目标头相关脉冲响应HRIR数据,并计算上述目标HRIR数据的能量和长度;根据上述目标HRIR数据的能量对上述RIR进行归一化处理得到第一信号;根据上述目标HRIR数据的长度对上述第一信号进行淡入处理得到第二信号;确定距离信息,上述距离信息用于表征虚拟声源距离用 户的距离;根据上述距离信息对上述第二信号进行能量调整,生成混响信号;获得目标头部方位传感信息,确定上述目标头部方位传感信息对应的头相关传递函数HRTF;根据上述HRTF对音频输入信号进行处理,得到第一路音频信号;根据上述混响信号对上述音频输入信号进行处理,得到第二路音频信号;对上述第一路音频信号和上述第二路音频信号进行混响得到输出音频信号。通常,VR设备可以通过自带的或者智能终端自带的扬声器和麦克风轻松获取到RIR,无需外部设备;对得到的采样混响进行处理,加入了能量归一化与结合距离信息的增益控制,可以增强VR音频系统中的头外部感,并且通过距离信息产生不同的距离感。In a second aspect, an embodiment of the present invention provides a method for adding reverberation: the foregoing method is applied to a VR device, where the VR device includes a head sensor, where the head sensor is used to obtain head orientation sensing information; : acquiring a room impulse response RIR of the indoor environment, the RIR characterizing the time domain information of the reverberation in the indoor; determining the target head related impulse response HRIR data in the audio system, and calculating the energy and length of the target HRIR data; according to the target HRIR The energy of the data is normalized to obtain the first signal; the first signal is fade-in processed according to the length of the target HRIR data to obtain a second signal; the distance information is determined, and the distance information is used to represent the virtual sound source distance. use a distance of the household; performing energy adjustment on the second signal according to the distance information to generate a reverberation signal; obtaining target head orientation sensing information, and determining a head related transfer function HRTF corresponding to the target head orientation sensing information; The HRTF processes the audio input signal to obtain a first audio signal; and processes the audio input signal according to the reverberation signal to obtain a second audio signal; and performs the first audio signal and the second audio signal The reverb produces an output audio signal. Generally, the VR device can easily obtain the RIR through the speaker and microphone provided by the smart terminal or the smart terminal, and no external device is needed; the obtained sample reverberation is processed, and the energy normalization and the gain control of the combined distance information are added. The head external sense in the VR audio system can be enhanced, and different distance senses are generated by the distance information.
根据第一方面或第二方面,在一个可能的设计中,上述获取当前室内环境的房间脉冲响应RIR包括:发射一个声学参考测试信号,收集上述声学参考测试信号的反射信号;对上述反射信号进行采样,得到上述RIR。其中,发射测试信号可以是预设的经验参考测试信号。这样更容易获取到房间的混响信息。According to the first aspect or the second aspect, in a possible design, the obtaining the room impulse response RIR of the current indoor environment comprises: transmitting an acoustic reference test signal, collecting the reflected signal of the acoustic reference test signal; and performing the reflected signal Sampling to obtain the above RIR. The emission test signal may be a preset empirical reference test signal. This makes it easier to get reverb information from the room.
根据第一方面或第二方面,在一个可能的设计中,上述确定音频系统中目标头相关脉冲响应HRIR数据包括:在音频系统HRIR数据库中选出所有HRIR数据波形中样点绝对值的最大的HRIR数据作为上述目标头相关脉冲响应HRIR数据。因为该HRIR是用于能量归一化的,选取样点绝对值最大的能够使RIR归一化到一个与HRIR数据库最匹配的能量水平。According to the first aspect or the second aspect, in a possible design, the determining the target head related impulse response HRIR data in the audio system comprises: selecting the largest absolute value of the sample points in all HRIR data waveforms in the audio system HRIR database. The HRIR data is used as the target head related impulse response HRIR data. Because the HRIR is for energy normalization, selecting the largest absolute value of the sampling point normalizes the RIR to an energy level that best matches the HRIR database.
第三方面,本发明实施例提供了一种生成混响的装置,该装置包括:获取模块,用于获取室内环境的房间脉冲响应RIR,上述RIR表征上述室内的混响的时域信息;第一确定模块,用于确定音频系统中目标头相关脉冲响应HRIR数据,并计算上述目标HRIR数据的能量和长度;第一计算模块,用于根据上述第一确 定模块计算出的目标HRIR数据的能量对上述获取模块获取的RIR进行归一化处理得到第一信号;第二计算模块,用于根据上述第一确定模块计算出的目标HRIR数据的长度对上述第一信号进行淡入处理得到第二信号;第二确定模块,用于确定距离信息,上述距离信息用于表征虚拟声源距离用户的距离;第三计算模块,用于根据上述第二确定模块确定的距离信息对上述第二计算模块计算出的第二信号进行能量调整,生成混响信号。通常,设备可以通过自带的扬声器和麦克风轻松获取到RIR,无需外部设备;对得到的采样混响进行处理,加入了能量归一化与结合距离信息的增益控制,可以增强音频系统中的头外部感,并且通过距离信息产生不同的距离感。In a third aspect, an embodiment of the present invention provides an apparatus for generating reverberation, the apparatus comprising: an acquisition module, configured to acquire a room impulse response RIR of an indoor environment, where the RIR characterizes time domain information of the reverberation in the indoor; a determining module, configured to determine a target head related impulse response HRIR data in the audio system, and calculate energy and length of the target HRIR data; a first calculating module, configured to The energy of the target HRIR data calculated by the module is normalized to obtain the first signal by the RIR obtained by the obtaining module, and the second calculating module is configured to calculate the length of the target HRIR data calculated by the first determining module. The first signal is subjected to the fade-in processing to obtain the second signal; the second determining module is configured to determine the distance information, where the distance information is used to represent the distance of the virtual sound source from the user; and the third calculating module is configured to determine according to the second determining module. The distance information is energy-adjusted to the second signal calculated by the second calculation module to generate a reverberation signal. In general, the device can easily acquire the RIR through its own speaker and microphone, eliminating the need for external equipment; processing the resulting sampled reverb, adding energy normalization and gain control combined with distance information, can enhance the head in the audio system External sense, and different distance senses are generated by distance information.
第四方面,本发明实施例提供了一种生成混响的装置,该装置应用于VR设备,上述VR设备包括头部传感器,上述头部传感器用于获得头部方位传感信息;上述装置包括:获取模块,用于获取室内环境的房间脉冲响应RIR,上述RIR表征上述室内的混响的时域信息;第一确定模块,用于确定音频系统中目标头相关脉冲响应HRIR数据,并计算上述目标HRIR数据的能量和长度;第一计算模块,用于根据上述第一确定模块计算出的目标HRIR数据的能量对上述获取模块获取的RIR进行归一化处理得到第一信号;第二计算模块,用于根据上述第一确定模块计算出的目标HRIR数据的长度对上述第一信号进行淡入处理得到第二信号;第二确定模块,用于确定距离信息,上述距离信息用于表征虚拟声源距离用户的距离;第三计算模块,用于根据上述第二确定模块确定的距离信息对上述第二计算模块计算出的第二信号进行能量调整,生成混响信号;第四计算模块,用于根据头相关传递函数HRTF对音频输入信号进行处理,得到第一路音频信号,其中,所述HRTF由所述头部方位传感信息决定;第五计算模块,用于根据上述第三计 算模块计算出的混响信号对上述音频输入信号进行处理,得到第二路音频信号;第六计算模块,用于对上述第四计算模块计算出来的第一路音频信号和上述第五计算模块计算出的第二路音频信号进行混响得到输出音频信号。In a fourth aspect, an embodiment of the present invention provides a device for generating reverberation, the device being applied to a VR device, where the VR device includes a head sensor, where the head sensor is used to obtain head orientation sensing information; And an obtaining module, configured to obtain a room impulse response RIR of the indoor environment, wherein the RIR characterizes time domain information of the reverberation in the indoor; the first determining module is configured to determine a target head related impulse response HRIR data in the audio system, and calculate the above a first calculation module, configured to normalize the RIR obtained by the obtaining module according to the energy of the target HRIR data calculated by the first determining module to obtain a first signal; and the second computing module And performing a fade-in process on the first signal according to the length of the target HRIR data calculated by the first determining module to obtain a second signal; and a second determining module, configured to determine distance information, where the distance information is used to represent the virtual sound source a distance from the user; a third calculation module, configured to determine the distance information according to the second determining module The second signal calculated by the second calculation module performs energy adjustment to generate a reverberation signal; and a fourth calculation module is configured to process the audio input signal according to the head related transfer function HRTF to obtain a first audio signal, where The HRTF is determined by the head orientation sensing information; and the fifth calculation module is configured to calculate according to the third The reverberation signal calculated by the calculation module processes the audio input signal to obtain a second audio signal, and the sixth calculation module is configured to calculate the first audio signal and the fifth calculation module calculated by the fourth calculation module. The calculated second audio signal is reverberated to obtain an output audio signal.
根据第三方面或第四方面,在一个可能的设计中,上述获取模块具体用于:发射一个声学参考测试信号,收集上述声学参考测试信号的反射信号;对上述反射信号进行采样,得到上述RIR。其中,发射测试信号可以是预设的经验参考测试信号。这样更容易获取到房间的混响信息。According to the third aspect or the fourth aspect, in a possible design, the acquiring module is specifically configured to: transmit an acoustic reference test signal, collect a reflected signal of the acoustic reference test signal; and sample the reflected signal to obtain the RIR. . The emission test signal may be a preset empirical reference test signal. This makes it easier to get reverb information from the room.
根据第三方面或第四方面,在一个可能的设计中,上述第一确定模块具体用于:在音频系统HRIR数据库中选出所有HRIR数据波形中样点绝对值的最大的HRIR数据作为上述目标头相关脉冲响应HRIR数据。According to the third aspect or the fourth aspect, in a possible design, the first determining module is specifically configured to: select, as the target, the maximum HRIR data of the absolute values of the samples in all HRIR data waveforms in the audio system HRIR database. The head related impulse responds to HRIR data.
第五方面,本发明实施例提供了一种增强混响的设备,设备包括:处理器、存储器、收发器和总线系统;其中,上述处理器和上述存储器、上述收发器通过上述总线系统相连,上述存储器用于存储程序和指令,上述收发器用于收发音频信号,上述处理器通过调用上述存储器中的程序和指令,执行如上述任一一种可能的实现方法。In a fifth aspect, an embodiment of the present invention provides an apparatus for enhancing reverberation, where the apparatus includes: a processor, a memory, a transceiver, and a bus system; wherein the processor and the memory and the transceiver are connected by using the bus system, The above memory is used for storing programs and instructions, the transceiver is for transmitting and receiving audio signals, and the processor executes any of the above possible implementation methods by calling the programs and instructions in the above memory.
根据本发明实施例提供的技术方案,可以无需其他音频采集和测试设备,利用设备自身的扬声器和话筒就可以轻松获得当前房间的RIR,并对该RIR进行一定的数学方法处理,得到一个音频系统可用的混响信号,该混响信号可以对音频的输入信号进行卷积运算,使得输出信号可以增加用户的头外部效应、减轻头中效应,无需额外设置复杂的参数信息制造人工混响,总而言之,本发明能够给用户带来更好的音频体验。 According to the technical solution provided by the embodiment of the present invention, the RIR of the current room can be easily obtained by using the speaker and the microphone of the device without using other audio collection and testing equipment, and the mathematical method of the RIR is processed to obtain an audio system. A reverberation signal that can be used to convolute the input signal of the audio so that the output signal can increase the head external effect of the user, mitigate the effect in the head, and create artificial reverberation without additionally setting complicated parameter information. The invention can bring a better audio experience to the user.
附图说明DRAWINGS
图1为现有技术中一种常见的音频处理技术;1 is a common audio processing technique in the prior art;
图2为一种后期混响的信号处理示意图;2 is a schematic diagram of signal processing of a late reverberation;
图3为本发明实施例中一种的应用设备示意图;3 is a schematic diagram of an application device according to an embodiment of the present invention;
图4为本发明实施例中一个结合具体硬件的应用场景示意图;FIG. 4 is a schematic diagram of an application scenario combined with specific hardware according to an embodiment of the present invention; FIG.
图5为本发明实施例中一种增加混响的音频处理流程示意图;FIG. 5 is a schematic diagram of an audio processing process for adding reverberation according to an embodiment of the present invention; FIG.
图6为本发明实施例中一种生成混响信号的方法流程图;6 is a flowchart of a method for generating a reverberation signal according to an embodiment of the present invention;
图7为本发明实施例中一种生成混响的装置示意图;FIG. 7 is a schematic diagram of an apparatus for generating reverberation according to an embodiment of the present invention; FIG.
图8为本发明实施例中一种增加混响的装置示意图;FIG. 8 is a schematic diagram of an apparatus for increasing reverberation according to an embodiment of the present invention; FIG.
图9为本发明实施例中一种生成混响的设备结构示意图。FIG. 9 is a schematic structural diagram of an apparatus for generating reverberation according to an embodiment of the present invention.
具体实施方式Detailed ways
本发明实施例提供了一种增加混响的方法与装置。Embodiments of the present invention provide a method and apparatus for increasing reverberation.
下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本发明一部分优选实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。The technical solutions in the embodiments of the present invention are clearly and completely described in the following with reference to the drawings in the embodiments of the present invention. It is obvious that the described embodiments are only a part of preferred embodiments of the present invention, but not all embodiments. All other embodiments obtained by those skilled in the art based on the embodiments of the present invention without creative efforts are within the scope of the present invention.
本发明的应用场景为智能穿戴设备、智能影音系统等,如头带式VR设备,该设备包含智能终端设备,如手机、平板等智能终端(下文中以手机进行举例,若VR设备中包含功能与手机等同的其他终端,也应属于本发明保护范围内),用户在不同的室内空间时,可以通过手机下发更新混响的命令,以便获得当前室内空间的混响信息,并根据当前室内的混响信息对VR设备中的声音进行后期处理,并通过耳机将处理后的声音返回给用户。The application scenario of the present invention is a smart wearable device, a smart audio and video system, and the like, such as a headband VR device, which includes a smart terminal device, such as a smart terminal such as a mobile phone or a tablet (hereinafter, a mobile phone is used as an example, if the VR device includes a function) Other terminals equivalent to the mobile phone should also belong to the protection scope of the present invention. When the user is in different indoor spaces, the reverberation command can be issued through the mobile phone to obtain the reverberation information of the current indoor space, and according to the current indoor The reverberation information is post-processed to the sound in the VR device, and the processed sound is returned to the user through the earphone.
请参阅图3,在一种具体实现过程中,本发明所应用的设备为一个与VR设备300结合的智能终端,如智能手机304,智能手机中含有扬声器306与麦克风305,手机插在VR设备中作为VR显示设备、以及音频信号处理设备,一副连接VR设 备的耳机设备302,用于播放虚拟音频信号给用户。智能手机中还包含存储单元307与计算单元308,VR设备中的控制单元301接受用户命令,当用户触发更新混响命令时,智能手机304通过麦克风305与扬声器306对所处真实环境房间混响进行测量,并通过计算单元308进行处理后得到后期混响信号,将该信号与HRTF数据库一起存放在存储单元307中。当用户体验VR音频时,计算单元308负责VR音频系统的运算,输出时域音频信号最终用耳机设备302回放给用户。Referring to FIG. 3, in a specific implementation process, the device to which the present invention is applied is a smart terminal combined with the VR device 300, such as a smart phone 304. The smart phone includes a speaker 306 and a microphone 305, and the mobile phone is inserted into the VR device. As a VR display device, as well as an audio signal processing device, a pair of connected VR devices The earphone device 302 is configured to play a virtual audio signal to the user. The smart phone also includes a storage unit 307 and a computing unit 308. The control unit 301 in the VR device accepts a user command. When the user triggers the update of the reverberation command, the smart phone 304 reverbers to the real environment room through the microphone 305 and the speaker 306. The measurement is performed and processed by the calculation unit 308 to obtain a late reverberation signal, which is stored in the storage unit 307 together with the HRTF database. When the user experiences VR audio, computing unit 308 is responsible for the operation of the VR audio system, and the output time domain audio signal is ultimately played back to the user with headset device 302.
其中,VR设备300与智能手机304的既可以通过有线的方式连接,通常这种情形下VR设备300与智能手机304是集成在一体的;VR设备300与智能手机304也可以通过蓝牙、Wifi、红外等无线通讯方式进行无线连接通讯,通常这种情形下VR设备300与智能手机304是分立的。The VR device 300 and the smart phone 304 can be connected by wire. Generally, the VR device 300 and the smart phone 304 are integrated in this case; the VR device 300 and the smart phone 304 can also pass Bluetooth, Wifi, The wireless communication method such as infrared performs wireless connection communication. Usually, the VR device 300 and the smart phone 304 are separated in this case.
本发明利用了VR设备搭配使用的智能手机上的扬声器与麦克风,不需要额外的测试设备。可以快速地测量获得以及更新当前真实环境混响信息,无需进行大量混响参数设定。在获得真实房间混响信息后进行本发明中的一系列处理即可得到可用的后期混响信号,得到可用的后期混响信号,并且结合距离信息进一步增益控制,在VR音频处理中使用。The present invention utilizes a speaker and a microphone on a smart phone that is used with a VR device, and does not require additional test equipment. It is possible to quickly measure and update the current real environment reverberation information without having to make a lot of reverberation parameter settings. After obtaining the real room reverberation information, a series of processes in the present invention are performed to obtain a usable late reverberation signal, and a usable late reverberation signal is obtained, and further gain control is combined with distance information for use in VR audio processing.
一个结合具体硬件的应用场景示意图如图4所示:10为含信号控制与处理单元的智能手机,11为智能手机上的麦克风,12为智能手机上的扬声器,13为头戴式VR设备。A schematic diagram of an application scenario combining specific hardware is shown in FIG. 4: 10 is a smart phone with a signal control and processing unit, 11 is a microphone on a smart phone, 12 is a speaker on a smart phone, and 13 is a head mounted VR device.
更为形象的音频处理流程可参阅图5。其中,输入信号为视频音轨等多声道时域信号;处理流程如下:A more visual audio processing flow can be seen in Figure 5. The input signal is a multi-channel time domain signal such as a video track; the processing flow is as follows:
1)根据VR设备中的头部传感器确定出头部方位传感信息,根据头部方位传感信息在HRTF数据库中匹配出该头部方位传感信息所对应的HRTF函数;将输入信号中每一声道的信号通过HRTF双耳音频处理模块500与上述匹配出来的HRTF函数进行卷积,得到对应的时域双声道输出信号S1,将不同声道的输出直接相加得到输出信号S1;1) determining head orientation sensing information according to the head sensor in the VR device, and matching the HRTF function corresponding to the head orientation sensing information in the HRTF database according to the head orientation sensing information; One channel of the signal is convoluted by the HRTF binaural audio processing module 500 and the matched HRTF function to obtain a corresponding time domain two-channel output signal S1, and the outputs of the different channels are directly added to obtain an output signal S1;
2)通过后期混响处理模块503获取后期混响信号S2;2) obtaining the late reverberation signal S2 through the late reverberation processing module 503;
3)将输入信号中的不同声道信号相加,通过模块下混响模块501下混响成一 个单声道信号,对该单声道信号与后期混响信号S2通过FFT快速卷积模块502进行FFT快速卷积,得到输出信号S3;3) Adding different channel signals in the input signal, and reverberating into one by the module under reverberation module 501 a mono signal, the mono signal and the late reverberation signal S2 are FFT fast convolved by the FFT fast convolution module 502 to obtain an output signal S3;
4)将输出信号S1和输出信号S3用混响模块504相加混响得到最终输出信号S4,并用耳机反馈给用户。4) The output signal S1 and the output signal S3 are summed and reverberated by the reverberation module 504 to obtain a final output signal S4, which is fed back to the user with the earphone.
以上步骤中,步骤2)为本发明的关键步骤,下面将以更详细的实施例来描述如何获取后期混响信号S2。获取后期混响信号S2的方法步骤请参阅图6。具体步骤如下:Among the above steps, step 2) is a key step of the present invention, and how to obtain the late reverberation signal S2 will be described below in a more detailed embodiment. See Figure 6 for the method steps for obtaining the late reverberation signal S2. Specific steps are as follows:
步骤610,使用VR设备搭配手机的扬声器发射一个声学参考测试信号,如TSP/MLS等测试信号S610;并使用测试信号对房间混响进行测量。In step 610, the VR device is used to transmit an acoustic reference test signal, such as a test signal S610 such as TSP/MLS, with the speaker of the mobile phone; and the test signal is used to measure the room reverberation.
TSP信号是指一种公知的响应测量信号的信号,可以用wav等格式存储于存储设备,使用智能手机扬声器播放,产生声波经房间内各种障碍物反射,进而反应出房间的混响信息。The TSP signal refers to a well-known signal that responds to the measurement signal, and can be stored in a storage device in a format such as wav, and is played by a smartphone speaker, and the sound wave is reflected by various obstacles in the room, thereby reflecting the reverberation information of the room.
另外,测试信号不限于TSP信号,也可以是最大长度序列(MLS)、正弦扫频信号;测试信号的发出由用户进行控制。In addition, the test signal is not limited to the TSP signal, and may be a maximum length sequence (MLS) or a sine sweep signal; the issuance of the test signal is controlled by the user.
步骤620,使用VR设备搭配手机的麦克风或者VR设备自带的麦克风拾取声学回声信号设备自带的麦克风拾取测试信号S610对应的声学反馈信号,由计算单元计算得出房间脉冲响应S620。In step 620, the VR device is used to match the microphone of the mobile phone or the microphone of the VR device to pick up the acoustic feedback signal corresponding to the microphone pickup test signal S610 provided by the acoustic echo signal device, and the room impulse response S620 is calculated by the calculation unit.
上述声学反馈信号包含了声波经不同障碍物、不同次反射的信号。由计算单元计算得出RIR房间脉冲响应,它表示房间混响信息的时域波形,不同房间的RIR均不同房间脉冲响应,可记为R(n),其中n为[0,N],N为最终选取的RIR的样点长度,如N=4096;该计算方法为公知方法。例如一个二维波形表示的RIR信号,横坐标为样点,纵坐标为样点值。样点表示音频信号采样点,以48000Hz采样为例,样点间间隔为1/48000秒,也即1秒中有48000个样点。The above acoustic feedback signal includes signals that the sound waves are reflected by different obstacles and different times. The RIR room impulse response is calculated by the calculation unit, which represents the time domain waveform of the room reverberation information. The RIR of different rooms have different room impulse responses, which can be recorded as R(n), where n is [0, N], N The length of the sample of the finally selected RIR, such as N = 4096; this calculation method is a well-known method. For example, a two-dimensional waveform represents the RIR signal, the abscissa is the sample point, and the ordinate is the sample point value. The sample points represent the sampling points of the audio signal. Taking 48000 Hz sampling as an example, the interval between samples is 1/48000 seconds, that is, there are 48000 samples in 1 second.
作为可选的,本步骤中不限于一个麦克风录制,若有多于两个可用的麦克风,则优先选取选材接近、距离较远的两个麦克风。对这两个麦克风录制数据进行处理。只有一个麦克风时,则只能得到一个可用的后期混响信号,这时VR音频处理中应用到左右两个声道即可;当有两个麦克风时,则有两个可用的后期混响信 号,将它们分别应用于左右声道,因为两麦克风采集信号不完全相同,后期混响相关性降低,整体立体感会得到一定增强,但是这样会需要两路的本方案中S630-S650计算步骤。对于三个麦克风,则没有必要进行三路运算,选取麦克风性能最近、距离最远的两个麦克风信号。Optionally, in this step, it is not limited to one microphone recording. If there are more than two available microphones, the two microphones whose materials are selected and are far away are preferentially selected. The two microphone recording data is processed. When there is only one microphone, only one available post-reverberation signal can be obtained. In this case, the VR audio processing can be applied to the left and right channels; when there are two microphones, there are two available post-reverberation signals. No., respectively, they are applied to the left and right channels, because the signals collected by the two microphones are not completely the same, the correlation of the reverberation is reduced, and the overall stereoscopic effect will be enhanced, but this will require two steps of the S630-S650 calculation in this scheme. . For three microphones, it is not necessary to perform a three-way operation to select two microphone signals with the closest performance and the farthest distance from the microphone.
步骤630,根据VR音频系统采用的HRIR数据能量,由计算单元对房间脉冲响应S620进行能量归一化处理得到信号S630。Step 630: Perform energy normalization processing on the room impulse response S620 by the calculating unit according to the HRIR data energy used by the VR audio system to obtain a signal S630.
首先,统计得出HRIR数据库(与HRTF数据库等价,为时域信号)中所有HRIR数据波形中样点绝对值的最大值的目标HRIR数据,以及目标HRIR数据中样点绝对值最大的样点位置(时间点)N1。First, the target HRIR data of the maximum value of the absolute values of the samples in all HRIR data waveforms in the HRIR database (equivalent to the HRTF database) is statistically obtained, and the sample with the largest absolute value of the sample points in the target HRIR data. Position (time point) N1.
其次,统计录制的RIR房间脉冲响应的样点绝对值最大值对应的样点位置N2,将录制响应峰值点按HRIR数据峰值样点进行对齐处理。此时需要将RIR数据的前(N2-N1)个样点删除,保证时间同步。对齐后的RIR为:Secondly, the sampled position N2 corresponding to the maximum value of the absolute value of the sampled impulse response of the RIR room is statistically recorded, and the recording response peak point is aligned according to the peak sample of the HRIR data. At this time, the front (N2-N1) samples of the RIR data need to be deleted to ensure time synchronization. The aligned RIR is:
R1(n)=R(n+N2-N1)      n∈[1,N]R 1 (n)=R(n+N2-N1) n∈[1,N]
对对齐处理后的HRIR以及对齐处理后的RIR截取相同长度的样点,截取长度取决于VR音频系统HRIR数据库中所使用的长度。以长度512为示例,截取HRIR样点[N1,N1+511],截取RIR样点[N2’,N2’+511]部分,也即分别截取512个样点,分别计算HRIR以及RIR截取部分的RMS均方根能量E1,E2。对对齐处理后的RIR乘以E1/E2,进行能量归一化处理。其中,For the HRIR after alignment processing and the RIR after alignment processing, samples of the same length are intercepted, and the length of the intercept depends on the length used in the HRIR database of the VR audio system. Taking the length 512 as an example, the HRIR samples [N1, N1+511] are intercepted, and the RIR samples [N2', N2'+511] are intercepted, that is, 512 samples are respectively intercepted, and the HRIR and the RIR intercepted portions are respectively calculated. RMS rms energy E1, E2. The energy normalization is performed by multiplying the aligned RIR by E1/E2. among them,
Figure PCTCN2016102026-appb-000001
Figure PCTCN2016102026-appb-000001
R2(n)=(E1/E2)*R1(n)       n∈[1,N]R 2 (n)=(E1/E2)*R 1 (n) n∈[1,N]
R2(n)即信号S630。R 2 (n) is the signal S630.
步骤640,根据VR音频系统采用的HRIR数据长度,由计算单元对信号S630中所述信号进行淡入处理,获得第一后期混响信号S640。 Step 640, according to the HRIR data length adopted by the VR audio system, the calculation unit performs a fade-in process on the signal in the signal S630 to obtain a first late reverberation signal S640.
该处理目的是淡入处理,删除测量混响中的直达声及前期反射部分,只剩下第一后期混响信号。以HRIR长度为M个样点为例(M取值一般可为256、512等), 淡入窗函数为:The purpose of this processing is to fade in, delete the direct sound and the pre-reflection part of the measurement reverb, leaving only the first late reverberation signal. Take the HRIR length as M samples (M can generally be 256, 512, etc.). The fade in window function is:
Figure PCTCN2016102026-appb-000002
Figure PCTCN2016102026-appb-000002
LR(n)=W(n)*R2(n)LR(n)=W(n)*R 2 (n)
LR(n)即信号S640。LR(n) is the signal S640.
步骤650,根据距离传感器或者用户设定的距离信息,由计算单元对第一后期混响信号S640信号进行能量调整,得到经处理的、系统可用的第二后期混响信号S650,该信号S650即上述信号S2,供VR音频系统后续进行运算处理。Step 650: Perform energy adjustment on the first late reverberation signal S640 signal by the calculating unit according to the distance information set by the distance sensor or the user, to obtain a processed second, late reverberation signal S650 available to the system, where the signal S650 is The above signal S2 is used for subsequent arithmetic processing by the VR audio system.
具体为,对于用户设定距离r(r≥1),单位为(米),可为用户设定虚拟声源的距离,或者可为VR设备中距离传感器获得的声源距离信息。后期混响信号增益为
Figure PCTCN2016102026-appb-000003
该增益直接加载在后期混响信号上,
Specifically, for the user to set the distance r (r ≥ 1), the unit is (m), the distance of the virtual sound source can be set for the user, or the sound source distance information obtained by the distance sensor in the VR device. The late reverberation signal gain is
Figure PCTCN2016102026-appb-000003
This gain is directly loaded on the late reverb signal.
Figure PCTCN2016102026-appb-000004
Figure PCTCN2016102026-appb-000004
LR1(n)即信号S650。LR 1 (n) is the signal S650.
以上实现过程中,步骤610-650中,每一个步骤的算法都存在着若干形式的变形,本领域技术人员基于现有技术中的信号处理方法、以及无需经过任何创造性改进的同等算法实现都应属于本发明所保护的范围内。本发明中不予以一一列举。In the above implementation process, in the steps 610-650, the algorithm of each step has several forms of deformation, and those skilled in the art should implement the signal processing method based on the prior art and the equivalent algorithm without any creative improvement. It is within the scope of protection of the present invention. It is not enumerated in the present invention.
本发明实施例中提供了一种增加混响的方法,据本发明实施例提供的技术方案,可以无需其他音频采集和测试设备,利用设备自身的扬声器和话筒就可以轻松获得当前房间的RIR,并对该RIR进行一定的数学方法处理,得到一个音频系统可用的混响信号,该混响信号可以对音频的输入信号进行卷积运算,使得输出信号可以增加用户的头外部效应、减轻头中效应,无需额外设置复杂的参数信息制造人工混响。In the embodiment of the present invention, a method for adding reverberation is provided. According to the technical solution provided by the embodiment of the present invention, the RIR of the current room can be easily obtained by using the speaker and the microphone of the device without using other audio collection and testing equipment. And the RIR is processed by a certain mathematical method to obtain a reverberation signal usable by the audio system, and the reverberation signal can perform convolution operation on the input signal of the audio, so that the output signal can increase the head external effect of the user and reduce the head. Effect, no need to set complex parameter information to create artificial reverberation.
请参阅图7,图7为本发明实施例中生成混响的装置,该装置可以是一个智能 终端,也可以是可穿戴设备如VR设备,也可以是集成在产品中的一个音频处理模块。该装置700包括:Please refer to FIG. 7. FIG. 7 is a diagram of a device for generating reverberation according to an embodiment of the present invention, which may be an intelligent device. The terminal can also be a wearable device such as a VR device, or an audio processing module integrated in the product. The device 700 includes:
获取模块701,用于获取室内环境的房间脉冲响应RIR,RIR表征当前室内的混响的时域信息。该获取模块可以由常见的无线收发模块来实现,相应的无线收发模式可以接收和发送音频信号,可由特定应用来控制实现。The obtaining module 701 is configured to obtain a room impulse response RIR of the indoor environment, and the RIR represents time domain information of the reverberation in the current indoor. The acquisition module can be implemented by a common wireless transceiver module, and the corresponding wireless transceiver mode can receive and transmit audio signals, which can be controlled by a specific application.
第一确定模块702,用于确定音频系统中目标头相关脉冲响应HRIR数据,并计算目标HRIR数据的能量和长度。HRIR数据可以由存储器来存储,第一确定模块702可以在处理器中实现。The first determining module 702 is configured to determine target head related impulse response HRIR data in the audio system, and calculate energy and length of the target HRIR data. The HRIR data can be stored by a memory, and the first determining module 702 can be implemented in a processor.
第一计算模块703,用于根据第一确定模块702确定出的目标HRIR数据的能量对获取模块701获取的RIR进行归一化处理得到第一信号。第一计算模块可在处理器中实现。The first calculating module 703 is configured to perform normalization processing on the RIR acquired by the obtaining module 701 according to the energy of the target HRIR data determined by the first determining module 702 to obtain a first signal. The first computing module can be implemented in a processor.
第二计算模块704,用于根据第一计算模块703计算出的目标HRIR数据的长度对第一信号进行淡入处理得到第二信号。第二计算模块可在处理器中实现。The second calculating module 704 is configured to perform a fade-in process on the first signal according to the length of the target HRIR data calculated by the first calculating module 703 to obtain a second signal. The second computing module can be implemented in the processor.
第二确定模块705,用于确定距离信息,该距离信息用于表征虚拟声源距离用户的距离;第二确定模块705可以由处理器来实现。The second determining module 705 is configured to determine distance information, where the distance information is used to represent a distance of the virtual sound source from the user; and the second determining module 705 can be implemented by a processor.
第三计算模块706,用于根据第二确定模块705确定的距离信息对第二计算模块704计算出的第二信号进行能量调整,生成混响信号。第三计算模块706可以由处理器来实现。The third calculating module 706 is configured to perform energy adjustment on the second signal calculated by the second calculating module 704 according to the distance information determined by the second determining module 705 to generate a reverberation signal. The third calculation module 706 can be implemented by a processor.
请参阅图8,图8为本发明实施例中增加混响的装置,该装置可以是一个智能终端,也可以是可穿戴设备如VR设备,或者是一套音频系统。该装置800包括:获取模块801,用于获取室内环境的房间脉冲响应RIR,RIR表征所述室内的混响的时域信息;该获取模块可以由常见的无线收发模块来实现,相应的无线收发模式可以接收和发送音频信号,可由特定应用来控制实现。 Please refer to FIG. 8. FIG. 8 is a device for adding reverberation according to an embodiment of the present invention. The device may be an intelligent terminal, a wearable device such as a VR device, or a set of audio systems. The device 800 includes: an obtaining module 801, configured to acquire a room impulse response RIR of the indoor environment, and the RIR characterizes time domain information of the reverberation in the room; the acquiring module can be implemented by a common wireless transceiver module, and the corresponding wireless transceiver The mode can receive and transmit audio signals, which can be controlled by a specific application.
第一确定模块802,用于确定音频系统中目标头相关脉冲响应HRIR数据,并计算目标HRIR数据的能量和长度;HRIR数据可以由存储器来存储。The first determining module 802 is configured to determine target head related impulse response HRIR data in the audio system, and calculate energy and length of the target HRIR data; the HRIR data may be stored by the memory.
第一计算模块803,用于根据第一确定模块802计算出的目标HRIR数据的能量对获取模块801获取的RIR进行归一化处理得到第一信号;The first calculating module 803 is configured to perform normalization processing on the RIR acquired by the obtaining module 801 according to the energy of the target HRIR data calculated by the first determining module 802 to obtain a first signal;
第二计算模块804,用于根据第一确定模块802计算出的目标HRIR数据的长度对第一信号进行淡入处理得到第二信号;The second calculating module 804 is configured to perform a fade-in process on the first signal according to the length of the target HRIR data calculated by the first determining module 802 to obtain a second signal;
第二确定模块805,用于确定距离信息,该距离信息用于表征虚拟声源距离用户的距离;a second determining module 805, configured to determine distance information, where the distance information is used to represent a distance of the virtual sound source from the user;
第三计算模块806,用于根据第二确定模块805确定的距离信息对第二计算模块804计算出的第二信号进行能量调整,生成混响信号;The third calculating module 806 is configured to perform energy adjustment on the second signal calculated by the second calculating module 804 according to the distance information determined by the second determining module 805, to generate a reverberation signal;
第四计算模块807,用于根据头相关传递函数HRTF对音频输入信号进行处理,得到第一路音频信号,其中,该HRTF由所述头部方位传感信息决定;The fourth calculating module 807 is configured to process the audio input signal according to the head related transfer function HRTF to obtain a first audio signal, where the HRTF is determined by the head orientation sensing information;
第五计算模块808,用于根据第三计算模块806计算出的混响信号对音频输入信号进行处理,得到第二路音频信号;a fifth calculating module 808, configured to process the audio input signal according to the reverberation signal calculated by the third calculating module 806, to obtain a second audio signal;
第六计算模块809,用于对第四计算模块807计算出来的第一路音频信号和第五计算模块808计算出的第二路音频信号进行混响得到输出音频信号。The sixth calculation module 809 is configured to perform reverberation on the first audio signal calculated by the fourth calculation module 807 and the second audio signal calculated by the fifth calculation module 808 to obtain an output audio signal.
以上模块中802-809均可由处理器进行实现并完成相应的功能。The above modules 802-809 can be implemented by the processor and complete the corresponding functions.
在具体实现过程中,所述获取模块701/801具体用于:发射一个声学参考测试信号,并收集声学参考测试信号的反射信号;对该反射信号进行采样,得到RIR。第一确定模块702/802具体用于:在音频系统HRIR数据库中选出所有HRIR数据波形中样点绝对值的最大的HRIR数据作为目标头相关脉冲响应HRIR数据In a specific implementation process, the acquiring module 701/801 is specifically configured to: transmit an acoustic reference test signal, and collect a reflected signal of the acoustic reference test signal; sample the reflected signal to obtain an RIR. The first determining module 702/802 is specifically configured to: select, in the audio system HRIR database, the largest HRIR data of the absolute values of the sample points in all HRIR data waveforms as the target head related impulse response HRIR data.
本发明技术方案提供了一种增加混响的装置,该装置包括上述模块701-706, 或上述模块801-808;该装置用于实现上述步骤601-605的实现方式以及可等同替换的实现方式。设备可以通过自带的扬声器和麦克风轻松获取到RIR,无需外部设备;对得到的采样混响进行处理,加入了能量归一化与结合距离信息的增益控制,可以增强音频系统中的头外部感,并且通过距离信息产生不同的距离感。The technical solution of the present invention provides a device for increasing reverberation, the device comprising the above modules 701-706, Or the above modules 801-808; the device is used to implement the implementation of the above steps 601-605 and an equivalent alternative implementation. The device can easily access the RIR through its own speaker and microphone, eliminating the need for external equipment; processing the resulting sampled reverb, adding energy normalization and gain control combined with distance information, can enhance the head external sense in the audio system And generate a different sense of distance through the distance information.
请参阅图9,图9为本发明实施例中一种生成混响的设备结构示意图。该设备900包括:Please refer to FIG. 9. FIG. 9 is a schematic structural diagram of an apparatus for generating reverberation according to an embodiment of the present invention. The device 900 includes:
处理器901,用于产生相应的操作控制信号,发给计算处理设备相应的部件,读取以及处理软件中的数据,尤其是读取和处理存储器902中的数据和程序,以使其中的各个功能模块执行相应的功能,从而控制相应的部件按指令的要求进行动作。The processor 901 is configured to generate a corresponding operation control signal, send the corresponding component to the processing processing device, read and process the data in the software, and in particular, read and process the data and the program in the memory 902, so that each of the components The function module performs the corresponding function, thereby controlling the corresponding component to act according to the requirements of the instruction.
存储器902,用于存储程序和各种数据,主要存储操作系统、应用和功能指令等软件单元、或者他们的子集、或者他们的扩展集。还可以包括非易失性随机存取存储器,向处理器901提供包括管理计算处理设备中的硬件、软件及数据资源,支持控制软件和应用。The memory 902 is used to store programs and various data, and mainly stores software units such as operating systems, applications, and function instructions, or a subset thereof, or an extended set thereof. A non-volatile random access memory may also be included, providing hardware, software, and data resources in the management computing processing device to the processor 901, supporting control software and applications.
收发器903,用于采集、获取或发送信息,尤其是音频信号。The transceiver 903 is configured to collect, acquire or transmit information, especially an audio signal.
上述各个硬件单元可以通过总线连接进行通信。Each of the above hardware units can communicate via a bus connection.
如此一来,处理器901通过调用存储器902存储的程序或指令,控制收发器903获取室内环境的房间脉冲响应RIR,处理器301执行上述实施例中步骤610-650,将RIR信号处理生成一个在音频系统中可用的混响信号。In this way, the processor 901 controls the transceiver 903 to acquire the room impulse response RIR of the indoor environment by calling a program or instruction stored in the memory 902, and the processor 301 executes steps 610-650 in the above embodiment to generate an RIR signal. Reverb signal available in the audio system.
设备900的实现形式可以是智能终端,也可以是智能音频系统,如VR设备或智能耳麦、智能视听系统、可穿戴家庭影院等等。The implementation of the device 900 may be a smart terminal, or may be an intelligent audio system such as a VR device or a smart headset, a smart audiovisual system, a wearable home theater, and the like.
本领域普通技术人员可知,上述方法中的全部或部分步骤可以通过程序指令相关的硬件完成,该程序可以存储于一计算机可读存储介质中。通过以上的实施方式的描述,所属领域的技术人员可以清楚地了解到本发明可以用硬件实现,或固件实现,或它们的组合方式来实现。It will be apparent to those skilled in the art that all or part of the above steps may be performed by hardware associated with program instructions, which may be stored in a computer readable storage medium. Through the description of the above embodiments, those skilled in the art can clearly understand that the present invention can be implemented in hardware, firmware implementation, or a combination thereof.
以上实施例仅为本发明技术方案的较佳实施例而已,并非用于限定本发明的 保护范围。凡在本发明的精神和原则之内,所作的任何修改、等同替换、改进等,均应包含在本发明的保护范围。 The above embodiments are merely preferred embodiments of the technical solutions of the present invention, and are not intended to limit the present invention. protected range. Any modifications, equivalent substitutions, improvements, etc., made within the spirit and scope of the invention are intended to be included within the scope of the invention.

Claims (13)

  1. 一种生成混响的方法,其特征在于,所述方法包括:A method of generating reverberation, characterized in that the method comprises:
    获取室内环境的房间脉冲响应RIR,所述RIR表征所述室内的混响的时域信息;Acquiring a room impulse response RIR of the indoor environment, the RIR characterizing time domain information of the reverberation in the room;
    确定音频系统中目标头相关脉冲响应HRIR数据,并计算所述目标HRIR数据的能量和长度;Determining a target head related impulse response HRIR data in the audio system and calculating an energy and length of the target HRIR data;
    根据所述目标HRIR数据的能量对所述RIR进行归一化处理得到第一信号;Normalizing the RIR according to the energy of the target HRIR data to obtain a first signal;
    根据所述目标HRIR数据的长度对所述第一信号进行淡入处理得到第二信号;Performing a fade-in process on the first signal according to a length of the target HRIR data to obtain a second signal;
    确定距离信息,所述距离信息用于表征虚拟声源距离用户的距离;Determining distance information, the distance information being used to represent a distance of the virtual sound source from the user;
    根据所述距离信息对所述第二信号进行能量调整,生成混响信号。And performing energy adjustment on the second signal according to the distance information to generate a reverberation signal.
  2. 如权利要求1所述方法,其特征在于,所述获取当前室内环境的房间脉冲响应RIR包括:The method of claim 1 wherein said obtaining a room impulse response RIR of the current indoor environment comprises:
    发射一个声学参考测试信号,收集所述声学参考测试信号的反射信号;Transmitting an acoustic reference test signal, collecting a reflected signal of the acoustic reference test signal;
    对所述反射信号进行采样,得到所述RIR。The reflected signal is sampled to obtain the RIR.
  3. 如权利要求2所述的方法,其特征在于,所述确定音频系统中目标头相关脉冲响应HRIR数据包括:The method of claim 2 wherein said determining a target header related impulse response HRIR data in the audio system comprises:
    在音频系统HRIR数据库中选出所有HRIR数据波形中样点绝对值的最大的HRIR数据作为所述目标头相关脉冲响应HRIR数据。The largest HRIR data of the absolute values of the samples in all HRIR data waveforms is selected in the audio system HRIR database as the target head related impulse response HRIR data.
  4. 一种增加混响的方法,其特征在于,所述方法应用于VR设备,所述VR设备包括头部传感器,所述头部传感器用于获得头部方位传感信息;所述方法包括:获取室内环境的房间脉冲响应RIR,所述RIR表征所述室内的混响的时域信息;确定音频系统中目标头相关脉冲响应HRIR数据,并计算所述目标HRIR数据的能量和长度; A method for increasing reverberation, characterized in that the method is applied to a VR device, the VR device comprising a head sensor for obtaining head orientation sensing information; the method comprising: acquiring a room impulse response RIR of the indoor environment, the RIR characterizing time domain information of the reverberation in the room; determining target head related impulse response HRIR data in the audio system, and calculating energy and length of the target HRIR data;
    根据所述目标HRIR数据的能量对所述RIR进行归一化处理得到第一信号;Normalizing the RIR according to the energy of the target HRIR data to obtain a first signal;
    根据所述目标HRIR数据的长度对所述第一信号进行淡入处理得到第二信号;Performing a fade-in process on the first signal according to a length of the target HRIR data to obtain a second signal;
    确定距离信息,所述距离信息用于表征虚拟声源距离用户的距离;Determining distance information, the distance information being used to represent a distance of the virtual sound source from the user;
    根据所述距离信息对所述第二信号进行能量调整,生成混响信号;Performing energy adjustment on the second signal according to the distance information to generate a reverberation signal;
    获得目标头部方位传感信息,确定所述目标头部方位传感信息对应的头相关传递函数HRTF;Obtaining target head orientation sensing information, determining a head related transfer function HRTF corresponding to the target head orientation sensing information;
    根据所述HRTF对音频输入信号进行处理,得到第一路音频信号;Processing the audio input signal according to the HRTF to obtain a first audio signal;
    根据所述混响信号对所述音频输入信号进行处理,得到第二路音频信号;Processing the audio input signal according to the reverberation signal to obtain a second audio signal;
    对所述第一路音频信号和所述第二路音频信号进行混响得到输出音频信号。The first audio signal and the second audio signal are reverberated to obtain an output audio signal.
  5. 如权利要求4所述方法,其特征在于,所述获取当前室内环境的房间脉冲响应RIR包括:The method of claim 4 wherein said obtaining a room impulse response RIR of the current indoor environment comprises:
    发射一个声学参考测试信号,收集所述声学参考测试信号的反射信号;Transmitting an acoustic reference test signal, collecting a reflected signal of the acoustic reference test signal;
    对所述反射信号进行采样,得到所述RIR。The reflected signal is sampled to obtain the RIR.
  6. 如权利要求4所述的方法,其特征在于,所述确定音频系统中目标头相关脉冲响应HRIR数据包括:The method of claim 4 wherein said determining a target header related impulse response HRIR data in the audio system comprises:
    在音频系统HRIR数据库中选出所有HRIR数据波形中样点绝对值的最大的HRIR数据作为所述目标头相关脉冲响应HRIR数据。The largest HRIR data of the absolute values of the samples in all HRIR data waveforms is selected in the audio system HRIR database as the target head related impulse response HRIR data.
  7. 一种生成混响的装置,其特征在于,所述装置包括:A device for generating reverberation, characterized in that the device comprises:
    获取模块,用于获取室内环境的房间脉冲响应RIR,所述RIR表征所述室内的混响的时域信息;An acquisition module, configured to acquire a room impulse response RIR of the indoor environment, the RIR characterizing time domain information of the reverberation in the room;
    第一确定模块,用于确定音频系统中目标头相关脉冲响应HRIR数据,并计算所述目标HRIR数据的能量和长度; a first determining module, configured to determine target head related impulse response HRIR data in the audio system, and calculate energy and length of the target HRIR data;
    第一计算模块,用于根据所述第一确定模块计算出的目标HRIR数据的能量对所述获取模块获取的RIR进行归一化处理得到第一信号;a first calculating module, configured to perform normalization processing on the RIR acquired by the acquiring module according to the energy of the target HRIR data calculated by the first determining module to obtain a first signal;
    第二计算模块,用于根据所述第一确定模块计算出的目标HRIR数据的长度对所述第一信号进行淡入处理得到第二信号;a second calculating module, configured to perform a fade-in process on the first signal according to a length of the target HRIR data calculated by the first determining module to obtain a second signal;
    第二确定模块,用于确定距离信息,所述距离信息用于表征虚拟声源距离用户的距离;a second determining module, configured to determine distance information, where the distance information is used to represent a distance of the virtual sound source from the user;
    第三计算模块,用于根据所述第二确定模块确定的距离信息对所述第二计算模块计算出的第二信号进行能量调整,生成混响信号。And a third calculating module, configured to perform energy adjustment on the second signal calculated by the second calculating module according to the distance information determined by the second determining module, to generate a reverberation signal.
  8. 如权利要求7所述装置,其特征在于,所述获取模块具体用于:The device according to claim 7, wherein the obtaining module is specifically configured to:
    发射一个声学参考测试信号,收集所述声学参考测试信号的反射信号;Transmitting an acoustic reference test signal, collecting a reflected signal of the acoustic reference test signal;
    对所述反射信号进行采样,得到所述RIR。The reflected signal is sampled to obtain the RIR.
  9. 如权利要求7所述装置,其特征在于,所述第一确定模块具体用于:The device according to claim 7, wherein the first determining module is specifically configured to:
    在音频系统HRIR数据库中选出所有HRIR数据波形中样点绝对值的最大的HRIR数据作为所述目标头相关脉冲响应HRIR数据。The largest HRIR data of the absolute values of the samples in all HRIR data waveforms is selected in the audio system HRIR database as the target head related impulse response HRIR data.
  10. 一种增加混响的装置,其特征在于,所述装置应用于VR设备,所述VR设备包括头部传感器,所述头部传感器用于获得头部方位传感信息;所述装置包括:获取模块,用于获取室内环境的房间脉冲响应RIR,所述RIR表征所述室内的混响的时域信息;An apparatus for increasing reverberation, wherein the apparatus is applied to a VR device, the VR device includes a head sensor for obtaining head orientation sensing information; and the apparatus includes: acquiring a module for acquiring a room impulse response RIR of an indoor environment, the RIR characterizing time domain information of the reverberation in the room;
    第一确定模块,用于确定音频系统中目标头相关脉冲响应HRIR数据,并计算所述目标HRIR数据的能量和长度;a first determining module, configured to determine target head related impulse response HRIR data in the audio system, and calculate energy and length of the target HRIR data;
    第一计算模块,用于根据所述第一确定模块计算出的目标HRIR数据的能量对所述获取模块获取的RIR进行归一化处理得到第一信号; a first calculating module, configured to perform normalization processing on the RIR acquired by the acquiring module according to the energy of the target HRIR data calculated by the first determining module to obtain a first signal;
    第二计算模块,用于根据所述第一确定模块计算出的目标HRIR数据的长度对所述第一信号进行淡入处理得到第二信号;a second calculating module, configured to perform a fade-in process on the first signal according to a length of the target HRIR data calculated by the first determining module to obtain a second signal;
    第二确定模块,用于确定距离信息,所述距离信息用于表征虚拟声源距离用户的距离;a second determining module, configured to determine distance information, where the distance information is used to represent a distance of the virtual sound source from the user;
    第三计算模块,用于根据所述第二确定模块确定的距离信息对所述第二计算模块计算出的第二信号进行能量调整,生成混响信号;a third calculation module, configured to perform energy adjustment on the second signal calculated by the second calculation module according to the distance information determined by the second determining module, to generate a reverberation signal;
    第四计算模块,用于根据头相关传递函数HRTF对音频输入信号进行处理,得到第一路音频信号,其中,所述HRTF由所述头部方位传感信息决定;a fourth calculating module, configured to process the audio input signal according to the head related transfer function HRTF to obtain a first audio signal, wherein the HRTF is determined by the head orientation sensing information;
    第五计算模块,用于根据所述第三计算模块计算出的混响信号对所述音频输入信号进行处理,得到第二路音频信号;a fifth calculating module, configured to process the audio input signal according to the reverberation signal calculated by the third calculating module, to obtain a second audio signal;
    第六计算模块,用于对所述第四计算模块计算出来的第一路音频信号和所述第五计算模块计算出的第二路音频信号进行混响得到输出音频信号。And a sixth calculating module, configured to perform reverberation on the first audio signal calculated by the fourth computing module and the second audio signal calculated by the fifth computing module to obtain an output audio signal.
  11. 如权利要求10所述装置,其特征在于,所述获取模块具体用于:The device of claim 10, wherein the obtaining module is specifically configured to:
    发射一个声学参考测试信号,收集所述声学参考测试信号的反射信号;Transmitting an acoustic reference test signal, collecting a reflected signal of the acoustic reference test signal;
    对所述反射信号进行采样,得到所述RIR。The reflected signal is sampled to obtain the RIR.
  12. 如权利要求10所述装置,其特征在于,所述第一确定模块具体用于:The device of claim 10, wherein the first determining module is specifically configured to:
    在音频系统HRIR数据库中选出所有HRIR数据波形中样点绝对值的最大的HRIR数据作为所述目标头相关脉冲响应HRIR数据。The largest HRIR data of the absolute values of the samples in all HRIR data waveforms is selected in the audio system HRIR database as the target head related impulse response HRIR data.
  13. 一种增加混响的设备,其特征在于,所述设备包括:处理器、存储器、收发器和总线系统;其中,所述处理器和所述存储器、所述收发器通过所述总线系统相连,所述存储器用于存储程序和指令,所述收发器用于收发音频信号,所述处理器通过调用所述存储器中的程序和指令,执行如权利要求1-6任一项所述方法。 An apparatus for increasing reverberation, the apparatus comprising: a processor, a memory, a transceiver, and a bus system; wherein the processor and the memory, the transceiver are connected by the bus system, The memory is for storing programs and instructions, the transceiver is for transceiving audio signals, and the processor performs the method of any of claims 1-6 by calling programs and instructions in the memory.
PCT/CN2016/102026 2016-04-27 2016-10-13 Method and device for increasing reverberation WO2017185663A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201610269909.6 2016-04-27
CN201610269909.6A CN105792090B (en) 2016-04-27 2016-04-27 A kind of method and apparatus for increasing reverberation

Publications (1)

Publication Number Publication Date
WO2017185663A1 true WO2017185663A1 (en) 2017-11-02

Family

ID=56398794

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2016/102026 WO2017185663A1 (en) 2016-04-27 2016-10-13 Method and device for increasing reverberation

Country Status (2)

Country Link
CN (1) CN105792090B (en)
WO (1) WO2017185663A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2020057727A1 (en) 2018-09-18 2020-03-26 Huawei Technologies Co., Ltd. Device and method for adaptation of virtual 3d audio to a real room
CN114040319A (en) * 2021-11-17 2022-02-11 青岛海信移动通信技术股份有限公司 Method, device, equipment and medium for optimizing external playback quality of terminal equipment
WO2023208333A1 (en) 2022-04-27 2023-11-02 Huawei Technologies Co., Ltd. Devices and methods for binaural audio rendering

Families Citing this family (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105792090B (en) * 2016-04-27 2018-06-26 华为技术有限公司 A kind of method and apparatus for increasing reverberation
CN109642818B (en) * 2016-08-29 2022-04-26 哈曼国际工业有限公司 Apparatus and method for generating a virtual venue for a listening room
CN106528038B (en) * 2016-10-25 2019-09-06 三星电子(中国)研发中心 The method, system and device of audio frequency effect are adjusted under a kind of virtual reality scenario
US10158963B2 (en) * 2017-01-30 2018-12-18 Google Llc Ambisonic audio with non-head tracked stereo based on head position and time
CN106954127B (en) * 2017-04-07 2019-10-29 南京邮电大学 A kind of VR scene subaudio frequency immersion control method
CN107231599A (en) * 2017-06-08 2017-10-03 北京奇艺世纪科技有限公司 A kind of 3D sound fields construction method and VR devices
CN107358962B (en) * 2017-06-08 2018-09-04 腾讯科技(深圳)有限公司 Audio-frequency processing method and apparatus for processing audio
CN107820158B (en) * 2017-07-07 2020-09-29 大连理工大学 Three-dimensional audio generation device based on head-related impulse response
CN109286889A (en) * 2017-07-21 2019-01-29 华为技术有限公司 A kind of audio-frequency processing method and device, terminal device
CN107708013B (en) * 2017-10-19 2020-04-10 上海交通大学 Immersive experience earphone system based on VR technique
JP7294135B2 (en) * 2017-10-20 2023-06-20 ソニーグループ株式会社 SIGNAL PROCESSING APPARATUS AND METHOD, AND PROGRAM
US10458840B2 (en) * 2017-11-08 2019-10-29 Harman International Industries, Incorporated Location classification for intelligent personal assistant
CN110164464A (en) * 2018-02-12 2019-08-23 北京三星通信技术研究有限公司 Audio-frequency processing method and terminal device
CN112205005B (en) * 2018-05-23 2022-06-24 皇家Kpn公司 Adapting acoustic rendering to image-based objects
CN109104687B (en) * 2018-09-25 2021-04-13 Oppo广东移动通信有限公司 Sound effect processing method and related product
CN108924705B (en) * 2018-09-25 2021-07-02 Oppo广东移动通信有限公司 3D sound effect processing method and related product
US11503423B2 (en) * 2018-10-25 2022-11-15 Creative Technology Ltd Systems and methods for modifying room characteristics for spatial audio rendering over headphones
CN110312198B (en) * 2019-07-08 2021-04-20 雷欧尼斯(北京)信息技术有限公司 Virtual sound source repositioning method and device for digital cinema
CN112672259B (en) * 2019-10-16 2023-03-10 北京地平线机器人技术研发有限公司 Loudspeaker control method and device
CN110972053B (en) * 2019-11-25 2021-06-25 腾讯音乐娱乐科技(深圳)有限公司 Method and related apparatus for constructing a listening scene
CN111372167B (en) * 2020-02-24 2021-10-26 Oppo广东移动通信有限公司 Sound effect optimization method and device, electronic equipment and storage medium
CN112351379B (en) * 2020-10-28 2021-07-30 歌尔光学科技有限公司 Control method of audio component and intelligent head-mounted device
CN113889140A (en) * 2021-09-24 2022-01-04 北京有竹居网络技术有限公司 Audio signal playing method and device and electronic equipment

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120201405A1 (en) * 2007-02-02 2012-08-09 Logitech Europe S.A. Virtual surround for headphones and earbuds headphone externalization system
CN102665156A (en) * 2012-03-27 2012-09-12 中国科学院声学研究所 Virtual 3D replaying method based on earphone
CN104768121A (en) * 2014-01-03 2015-07-08 杜比实验室特许公司 Generating binaural audio in response to multi-channel audio using at least one feedback delay network
CN105432097A (en) * 2013-05-29 2016-03-23 高通股份有限公司 Filtering with binaural room impulse responses with content analysis and weighting
CN105792090A (en) * 2016-04-27 2016-07-20 华为技术有限公司 Method and device of increasing reverberation

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1953620B (en) * 2006-09-05 2014-04-02 华南理工大学 A method to process virtual surround sound signal of 5.1 access
WO2009046223A2 (en) * 2007-10-03 2009-04-09 Creative Technology Ltd Spatial audio analysis and synthesis for binaural reproduction and format conversion
CN102572676B (en) * 2012-01-16 2016-04-13 华南理工大学 A kind of real-time rendering method for virtual auditory environment
EP3062535B1 (en) * 2013-10-22 2019-07-03 Industry-Academic Cooperation Foundation, Yonsei University Method and apparatus for processing audio signal
CN104240695A (en) * 2014-08-29 2014-12-24 华南理工大学 Optimized virtual sound synthesis method based on headphone replay
CN105142094B (en) * 2015-09-16 2018-07-13 华为技术有限公司 A kind for the treatment of method and apparatus of audio signal

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120201405A1 (en) * 2007-02-02 2012-08-09 Logitech Europe S.A. Virtual surround for headphones and earbuds headphone externalization system
CN102665156A (en) * 2012-03-27 2012-09-12 中国科学院声学研究所 Virtual 3D replaying method based on earphone
CN105432097A (en) * 2013-05-29 2016-03-23 高通股份有限公司 Filtering with binaural room impulse responses with content analysis and weighting
CN104768121A (en) * 2014-01-03 2015-07-08 杜比实验室特许公司 Generating binaural audio in response to multi-channel audio using at least one feedback delay network
CN105792090A (en) * 2016-04-27 2016-07-20 华为技术有限公司 Method and device of increasing reverberation

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2020057727A1 (en) 2018-09-18 2020-03-26 Huawei Technologies Co., Ltd. Device and method for adaptation of virtual 3d audio to a real room
US11668600B2 (en) 2018-09-18 2023-06-06 Huawei Technologies Co., Ltd. Device and method for adaptation of virtual 3D audio to a real room
CN114040319A (en) * 2021-11-17 2022-02-11 青岛海信移动通信技术股份有限公司 Method, device, equipment and medium for optimizing external playback quality of terminal equipment
CN114040319B (en) * 2021-11-17 2023-11-14 青岛海信移动通信技术有限公司 Method, device, equipment and medium for optimizing playback quality of terminal equipment
WO2023208333A1 (en) 2022-04-27 2023-11-02 Huawei Technologies Co., Ltd. Devices and methods for binaural audio rendering

Also Published As

Publication number Publication date
CN105792090B (en) 2018-06-26
CN105792090A (en) 2016-07-20

Similar Documents

Publication Publication Date Title
WO2017185663A1 (en) Method and device for increasing reverberation
CN109644314B (en) Method of rendering sound program, audio playback system, and article of manufacture
Algazi et al. Headphone-based spatial sound
US8520871B2 (en) Method of and device for generating and processing parameters representing HRTFs
CN102804814B (en) Multichannel sound reproduction method and equipment
CN113170271B (en) Method and apparatus for processing stereo signals
US10341799B2 (en) Impedance matching filters and equalization for headphone surround rendering
GB2543276A (en) Distributed audio capture and mixing
US20150092965A1 (en) Method of improving externalization of virtual surround sound
CN106134223A (en) Reappear audio signal processing apparatus and the method for binaural signal
US9769585B1 (en) Positioning surround sound for virtual acoustic presence
CN110677802B (en) Method and apparatus for processing audio
US10652686B2 (en) Method of improving localization of surround sound
CA2744429C (en) Converter and method for converting an audio signal
Lee et al. A real-time audio system for adjusting the sweet spot to the listener's position
Kan et al. A psychophysical evaluation of near-field head-related transfer functions synthesized using a distance variation function
US7327848B2 (en) Visualization of spatialized audio
Martellotta Optimizing stepwise rotation of dodecahedron sound source to improve the accuracy of room acoustic measures
Grimm et al. Virtual acoustic environments for comprehensive evaluation of model-based hearing devices
CN112954579B (en) Method and device for reproducing on-site listening effect
US20230005464A1 (en) Live data distribution method, live data distribution system, and live data distribution apparatus
Tan Binaural recording methods with analysis on inter-aural time, level, and phase differences
Fodde Spatial Comparison of Full Sphere Panning Methods
JP2024007669A (en) Sound field reproduction program using sound source and position information of sound-receiving medium, device, and method
CN116193196A (en) Virtual surround sound rendering method, device, equipment and storage medium

Legal Events

Date Code Title Description
NENP Non-entry into the national phase

Ref country code: DE

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16900171

Country of ref document: EP

Kind code of ref document: A1

122 Ep: pct application non-entry in european phase

Ref document number: 16900171

Country of ref document: EP

Kind code of ref document: A1