WO2018036194A1 - Sound signal processing method, terminal, and computer storage medium - Google Patents

Sound signal processing method, terminal, and computer storage medium Download PDF

Info

Publication number
WO2018036194A1
WO2018036194A1 PCT/CN2017/082940 CN2017082940W WO2018036194A1 WO 2018036194 A1 WO2018036194 A1 WO 2018036194A1 CN 2017082940 W CN2017082940 W CN 2017082940W WO 2018036194 A1 WO2018036194 A1 WO 2018036194A1
Authority
WO
WIPO (PCT)
Prior art keywords
sound
hrtf
human head
head model
transmitted
Prior art date
Application number
PCT/CN2017/082940
Other languages
French (fr)
Chinese (zh)
Inventor
卢好峰
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司 filed Critical 中兴通讯股份有限公司
Publication of WO2018036194A1 publication Critical patent/WO2018036194A1/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S5/00Pseudo-stereo systems, e.g. in which additional channel signals are derived from monophonic signals by means of phase shifting, time delay or reverberation 
    • H04S5/02Pseudo-stereo systems, e.g. in which additional channel signals are derived from monophonic signals by means of phase shifting, time delay or reverberation  of the pseudo four-channel type, e.g. in which rear channel signals are derived from two-channel stereo signals
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S7/00Indicating arrangements; Control arrangements, e.g. balance control
    • H04S7/30Control circuits for electronic adaptation of the sound field
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B99/00Subject matter not provided for in other groups of this subclass
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2420/00Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/01Enhancing the perception of the sound image or of the spatial distribution using head related transfer functions [HRTF's] or equivalents thereof, e.g. interaural time difference [ITD] or interaural level difference [ILD]

Definitions

  • the present disclosure relates to audio processing technologies, and in particular, to a method and a terminal for processing a sound signal, and a computer storage medium.
  • virtual reality technology is more and more mature, but the core of virtual reality technology is now focused on the visual aspect.
  • human beings perceive the objective world, they can obtain about 60% of information through vision, about 30% of information through hearing, and the remaining 10% of information through senses such as touch and smell, so in virtual reality technology, virtual Hearing technology is also an indispensable technology. People's perception of the position and distance of hearing is equally important for people's real virtual reality experience.
  • Virtual hearing is based on audio signal processing technology, based on some special acoustic effects of the human ear, through the acoustic correlation algorithm to calculate the simulation, so that the sound source is reconstructed in any position in the three-dimensional space to achieve the reproduction of the sound source orientation.
  • the virtual hearing is an auditory rendering of the virtual environment composed of the sound source signal, the listener and the scene, and the virtual sound at a specific position in the specific scene is obtained by numerical simulation instead of the real auditory feeling. For the current wearing VR device, plus three-dimensional virtual hearing, it can bring real immersion.
  • Virtual hearing technology is the physical and geometric conditions of a given sound source and environment, simulating the sound source The sound wave and its transmission process, so as to obtain the temporal and spatial information of the sound, and finally use the head correlation transfer function (HRTF) signal to process the simulated human ear to synthesize the sound wave, and convert the time and space information of the sound into a binaural sound signal. Then replay it through the headphones to both ears.
  • the HRTF is the basis of virtual hearing technology, and the HRTF can be reconstructed from the amplitude characteristics of the left and right ears and the time difference between the ears. Most of the methods of obtaining HRTF are obtained by laboratory measurement. At present, some laboratories at home and abroad have obtained the HRTF database through testing, and measured the HRTF of the partial orientation of a specific population.
  • the HRTF can be obtained by measurement
  • the measurement method is cumbersome, expensive and time consuming, and the database of measurement is only the HRTF of a specific population.
  • the HRTF database cannot represent the structural distribution characteristics of all people.
  • the experimentally obtained HRTF data is used for virtual auditory rendering, it is likely that an auditory difference will occur when auditory playback is performed, and even the auditory orientation information cannot be discerned.
  • the database of HRTF measurements is only spatially discrete part of the azimuth data, and can not represent the position information of all orientations, so there will be deviations in virtual hearing.
  • the present disclosure provides a method for processing a sound signal, a terminal, and a computer storage medium.
  • the present disclosure provides a method for processing a sound signal, including:
  • the sound signal emitted by the sound source is processed according to the first HRTF and the second HRTF.
  • the step of acquiring the three-dimensional human head model of the terminal user comprises: acquiring the three-dimensional human head model of the terminal user by using the terminal scanning.
  • the step of calculating the first head related transfer function HRTF of the sound wave emitted by the sound source to the left ear of the three-dimensional human head model and the second HRTF transmitted to the right ear of the three-dimensional human head model includes: calculating Determining a first sound pressure transmitted by the sound wave to the left ear, a second sound pressure transmitted by the sound wave to the right ear, and after the three-dimensional human head model is removed, the sound wave is in the original of the three-dimensional human head model a third sound pressure generated at the position; calculating the first HRTF according to the first sound pressure and the third sound pressure; calculating the second HRTF according to the second sound pressure and the third sound pressure .
  • the step of calculating the first sound pressure transmitted by the sound wave to the left ear comprises: calculating a first sound transmitted by the sound wave to the left ear when the sound wave is directly transmitted to the left ear
  • the sound pressures after the sound waves are reflected are sequentially calculated according to the order in which the reflection occurs, wherein the sound pressures calculated in sequence are greater than one
  • the intensity threshold is set, the first sound pressure transmitted by the sound wave to the left ear is calculated.
  • the step of calculating the first HRTF according to the first sound pressure and the third sound pressure comprises: according to a formula Calculating a first HRTF, where H L represents the first HRTF, Representing a first sound pressure, P 0 (r 0 , f) represents a third sound pressure, r L represents a distance between the sound source and the left ear, and ⁇ L represents the sound source relative to the left ear Horizontal angle, Representing the elevation angle of the sound source with respect to the left ear, f representing the acoustic wave frequency, a representing the physiological parameter of the three-dimensional human head model, and r 0 indicating the distance between the sound source and the center position of the three-dimensional human head model .
  • the step of calculating the second sound pressure of the sound wave transmitted to the right ear comprises: calculating a second sound of the sound wave transmitted to the right ear when the sound wave is directly transmitted to the right ear
  • the sound pressures after the sound waves are reflected are sequentially calculated according to the order in which the reflection occurs, wherein the sound pressures calculated in sequence are greater than one
  • the intensity threshold is set, the second sound pressure transmitted by the sound wave to the right ear is calculated.
  • the step of calculating the second HRTF according to the second sound pressure and the third sound pressure comprises: according to a formula Calculating a second HRTF, where H R represents a second HRTF, Representing a second sound pressure, P 0 (r 0 , f) represents a third sound pressure, r R represents a distance between the sound source and the right ear, and ⁇ R represents the sound source relative to the right ear Horizontal angle, Representing the elevation angle of the sound source with respect to the right ear, f representing the acoustic wave frequency, a representing the physiological parameter of the three-dimensional human head model, and r 0 indicating the distance between the sound source and the center position of the three-dimensional human head model .
  • the present disclosure also provides a terminal, including:
  • Obtaining a module configured to acquire a three-dimensional human head model of the end user
  • a calculation module configured to calculate a first head association transfer function HRTF transmitted by a sound source to a left ear of the three-dimensional human head model and a second HRTF transmitted to a right ear of the three-dimensional human head model;
  • the processing module is configured to process the sound signal emitted by the sound source according to the first HRTF and the second HRTF.
  • the acquiring module is configured to acquire a three-dimensional human head model of the terminal user by using the terminal scanning.
  • the calculation module includes: a first calculating unit configured to calculate a first sound pressure transmitted by the sound wave to the left ear, a second sound pressure transmitted by the sound wave to the right ear, and The third sound pressure generated by the sound wave at the original position of the three-dimensional human head model after the three-dimensional human head model is removed; the second calculating unit is configured to calculate according to the first sound pressure and the third sound pressure The first HRTF; the third calculating unit is configured to calculate the second HRTF according to the second sound pressure and the third sound pressure.
  • the first calculating unit is configured to calculate, when the sound wave is directly transmitted to the left ear, a first sound pressure transmitted by the sound wave to the left ear; when the sound wave is transmitted to the sound In the left ear, the sound pressure after the sound wave is reflected is sequentially calculated according to the sequence in which the reflection occurs, wherein when the sound pressures sequentially calculated are greater than a preset intensity threshold, Calculating a first sound pressure transmitted by the sound wave to the left ear.
  • the second calculating unit is configured according to a formula Calculating a first HRTF, where H L represents the first HRTF, Representing a first sound pressure, P 0 (r 0 , f) represents a third sound pressure, r L represents a distance between the sound source and the left ear, and ⁇ L represents the sound source relative to the left ear Horizontal angle, Representing the elevation angle of the sound source with respect to the left ear, f representing the acoustic wave frequency, a representing the physiological parameter of the three-dimensional human head model, and r 0 indicating the distance between the sound source and the center position of the three-dimensional human head model .
  • the first calculating unit is configured to calculate a second sound pressure that is transmitted by the sound wave to the right ear when the sound wave is directly transmitted to the right ear; and when the sound wave is transmitted to the sound wave through reflection In the right ear, the sound pressure after the sound wave is reflected is sequentially calculated according to the sequence in which the reflection occurs, wherein the sound wave transmission is calculated when the sound pressures sequentially calculated are greater than a preset intensity threshold. a second sound pressure to the right ear.
  • the third calculating unit is configured according to a formula Calculating a second HRTF, where H R represents a second HRTF, Representing a second sound pressure, P 0 (r 0 , f) represents a third sound pressure, r R represents a distance between the sound source and the right ear, and ⁇ R represents the sound source relative to the right ear Horizontal angle, Representing the elevation angle of the sound source with respect to the right ear, f representing the acoustic wave frequency, a representing the physiological parameter of the three-dimensional human head model, and r 0 indicating the distance between the sound source and the center position of the three-dimensional human head model .
  • the acquisition module, the calculation module, the processing module, the first calculation unit, the second calculation unit, and the third calculation unit may use a central processing unit (CPU, Central Processing) when performing processing. Unit), Digital Signal Processor (DSP, Digital Singnal Processor) or Field-Programmable Gate Array (FPGA) achieve.
  • CPU Central Processing
  • DSP Digital Signal Processor
  • FPGA Field-Programmable Gate Array
  • the present disclosure also provides a computer storage medium having stored therein computer executable instructions configured to perform a method of processing the sound signal.
  • the present disclosure first obtains a three-dimensional human head model of the end user, and then calculates a first HRTF transmitted by the sound source to the left ear of the three-dimensional human head model and a second HRTF transmitted to the right ear of the three-dimensional human head model, and finally calculated according to the calculation.
  • the first HRTF and the second HRTF process the signal sent by the sound source, so that the terminal can obtain the first HRTF and the second HRTF personalized by the terminal user by calculation, and pass the personalized first HRTF and the second HRTF.
  • the sound signal is processed, so that the end user can obtain a more realistic sound experience, and solves the problem that the virtual hearing technology has the virtual auditory space synthesis distortion caused by the non-personalized HRTF in the prior art, and improves the user's sound effect.
  • Experience is described by the sound source, so that the terminal can obtain the first HRTF and the second HRTF personalized by the terminal user by calculation, and pass the personalized first HRTF and the second HRTF.
  • the sound signal is processed, so that the end
  • FIG. 1 is a flow chart showing the steps of a method for processing a sound signal in an embodiment of the present disclosure
  • FIG. 2 is a flow chart showing the steps of a method for processing another sound signal in the embodiment of the present disclosure
  • Figure 3 is a schematic diagram showing a scene in which sound waves are transmitted indoors
  • FIG. 4 is a block diagram showing the structure of a terminal of a sound signal in an embodiment of the present disclosure.
  • FIG. 1 is a flow chart showing the steps of a method for processing a sound signal according to an embodiment of the present disclosure, the processing method includes:
  • Step 101 Acquire a three-dimensional human head model of the terminal user.
  • the three-dimensional human head model of the terminal user may be acquired by means of terminal scanning. Additionally, optionally, the human head three-dimensional model may include one-third of the torso portion of the end user. In addition, after the terminal scans the user's three-dimensional human head model, the three-dimensional human head model can be modeled to obtain the physiological parameters and the like of the three-dimensional human head model.
  • Step 102 Calculate a first head related transfer function HRTF of the left ear of the three-dimensional human head model and a second HRTF transmitted to the right ear of the three-dimensional human head model.
  • the first HRTF of the left ear of the three-dimensional human head model can be calculated and calculated by calculation, and the sound source is emitted.
  • the sound waves are transmitted to the second HRTF of the right ear of the three-dimensional human head model. In this way, a personalized HRTF unique to each end user can be obtained by calculation.
  • Step 103 Process the sound signal emitted by the sound source according to the first HRTF and the second HRTF.
  • the sound signal emitted by the sound source may be processed according to the calculated first HRTF and the second HRTF.
  • the sound signal emitted by the sound source is processed, so that in the virtual reality technology, the HRTF in the HRTF database measured by the laboratory does not need to be received by the terminal user.
  • the incoming sound signal is processed to improve the auditory perception of the end user.
  • the first embodiment obtains the three-dimensional human head model of the end user, and then calculates the first HRTF transmitted by the sound source to the left ear of the three-dimensional human head model and the second HRTF transmitted to the right ear of the three-dimensional human head model, and finally according to Calculating the first HRTF and the second HRTF, processing the signal sent by the sound source, so that the terminal can obtain the first HRTF and the second HRTF personalized by the terminal user by calculation, and pass the personalized first HRTF and Second HRTF
  • the sound signal is processed, so that the end user can obtain a more realistic sound experience, and solves the problem that the virtual hearing technology has the virtual auditory space synthesis distortion caused by the non-personalized HRTF in the prior art, and improves the user's sound effect.
  • Experience is described in this way, the first embodiment obtains the three-dimensional human head model of the end user, and then calculates the first HRTF transmitted by the sound source to the left ear of the three-dimensional human head model and the second HRTF transmitted to the right ear of the three-
  • FIG. 2 it is a flow chart of steps of a method for processing a sound signal according to an embodiment of the present disclosure, the processing method includes:
  • Step 201 Acquire a three-dimensional human head model of the terminal user.
  • the three-dimensional human head model of the terminal user may be acquired by means of terminal scanning. Additionally, optionally, the human head three-dimensional model may include one-third of the torso portion of the end user. In addition, after the terminal scans the user's three-dimensional human head model, the three-dimensional human head model can be modeled to obtain the physiological parameters and the like of the three-dimensional human head model.
  • Step 202 Calculate a first sound pressure transmitted by the sound wave to the left ear, a second sound pressure transmitted by the sound wave to the right ear, and a third sound pressure generated by the sound wave at the original position of the three-dimensional human head model after the three-dimensional human head model is removed.
  • the first sound of the sound wave transmitted to the left ear can be directly calculated. If the sound wave is transmitted to the left ear of the three-dimensional human head model through reflection, the sound pressure after the sound wave is reflected may be sequentially calculated according to the order in which the reflection occurs, wherein the sound pressure calculated in sequence is greater than a preset intensity threshold. At the time, the first sound pressure transmitted by the sound wave to the left ear is calculated.
  • the first sound intensity transmitted to the left ear of the three-dimensional human head model may be calculated first.
  • the second sound pressure transmitted by the sound wave to the right ear of the three-dimensional human head model if the sound wave is directly transmitted to the right ear, the second sound pressure transmitted by the sound wave to the right ear can be calculated; if the sound wave is transmitted to the right ear through the reflection, The sound pressure after the sound wave is reflected may be sequentially calculated according to the order in which the reflection occurs, wherein when the sound pressure calculated in sequence is greater than a preset intensity threshold, the sound is calculated.
  • the second sound pressure transmitted by the wave to the right ear may be calculated first.
  • FIG. 3 it is a schematic diagram of a scene in which sound waves are transmitted indoors.
  • A is a three-dimensional human head model
  • B is a sound source
  • C is an indoor wall surface
  • a solid line is a direct wave transmission path
  • a broken line is a sound wave reflection transmission path.
  • the sound intensity generated by the sound source B in the three-dimensional human head model can be calculated according to the calculation formula of the sound intensity directly according to the parameters r and e.
  • r is the distance between the sound source B and the three-dimensional human head model
  • e is the energy emitted by the sound source B per second
  • the energy and parameter ⁇ of the parameter e transmitted in each direction Correlation, where ⁇ represents the horizontal angle of sound source B relative to the recipient of the sound source, Indicates the elevation angle of source B relative to the recipient of the source.
  • the sound pressure received by the three-dimensional human head model is also related to the acoustic frequency f and the physiological parameters of the three-dimensional human head model, it can be based on the parameters.
  • a calculate the first sound pressure generated by the sound source in the left ear of the three-dimensional human head model, according to the parameters r R , ⁇ R , f, a calculates the second sound pressure generated by the sound source in the right ear of the three-dimensional human head model, where r L represents the distance between the sound source and the left ear, and ⁇ L represents the horizontal angle of the sound source relative to the left ear, Indicates the elevation angle of the sound source relative to the left ear, f denotes the acoustic wave frequency, a denotes the physiological parameter of the three-dimensional human head model, r R denotes the distance between the sound source and the right ear, and ⁇ R denotes the horizontal angle of the sound source with respect to the right ear, Indicates the elevation angle of the
  • the sound wave reflection from the sound source B is transmitted to the three-dimensional human head model A
  • the collision point of the wall surface C is then obtained according to the acquired collision point, and the reflection of the sound wave is reversed, and the sound wave intensity after the sound wave reflection is sequentially calculated according to the order in which the sound wave reflection occurs, and the sound wave intensity is converted into the sound pressure.
  • the sound pressure is less than a preset intensity threshold, that is, when the sound wave is not transmitted to the three-dimensional human head model, the sound wave transmission may not be processed, but if the sound pressure is calculated in turn, Above the preset intensity threshold, it is also necessary to calculate the sound pressure received by the sound wave transmitted to the three-dimensional human head model.
  • a preset intensity threshold that is, when the sound wave is not transmitted to the three-dimensional human head model, the sound wave transmission may not be processed, but if the sound pressure is calculated in turn, Above the preset intensity threshold, it is also necessary to calculate the sound pressure received by the sound wave transmitted to the three-dimensional human head model.
  • the reflection coefficient and the scattering coefficient are also different.
  • the first sound pressure transmitted by the sound wave to the left ear of the three-dimensional human head model by direct or reflective means, the second sound pressure transmitted to the right ear of the three-dimensional human head model, and the sound wave at the original position of the three-dimensional human head model can be calculated.
  • Step 203 Calculate the first HRTF according to the first sound pressure and the third sound pressure.
  • this step optionally, when acquiring the third sound pressure generated by the sound wave transmitted to the left ear of the three-dimensional human head model and the sound pressure generated at the original position of the three-dimensional human head model, according to the formula Calculating a first HRTF, where H L represents the first HRTF, Representing the first sound pressure, P 0 (r 0 , f) represents the third sound pressure, r L represents the distance between the sound source and the left ear, and ⁇ L represents the horizontal angle of the sound source with respect to the left ear.
  • f represents the acoustic wave frequency
  • a represents the physiological parameter of the three-dimensional human head model
  • r 0 represents the distance between the sound source and the center position of the three-dimensional human head model.
  • Step 204 Calculate a second HRTF according to the second sound pressure and the third sound pressure.
  • this step optionally, when acquiring the third sound pressure generated by the sound wave transmitted to the right ear of the three-dimensional human head model and the third sound pressure generated at the original position of the three-dimensional human head model, according to the formula Calculating a second HRTF, where H R represents a second HRTF, Representing the second sound pressure, P 0 (r 0 , f) represents the third sound pressure, r R represents the distance between the sound source and the right ear, and ⁇ R represents the horizontal angle of the sound source with respect to the right ear.
  • f represents the acoustic wave frequency
  • a represents the physiological parameter of the three-dimensional human head model
  • r 0 represents the distance between the sound source and the center position of the three-dimensional human head model.
  • Step 205 Process the sound signal emitted by the sound source according to the first HRTF and the second HRTF.
  • the sound signal emitted by the sound source may be processed according to the calculated first HRTF and the second HRTF.
  • the sound signal emitted by the sound source is processed, so that in the virtual reality technology, the HRTF in the HRTF database measured by the laboratory does not need to be received by the terminal user.
  • the incoming sound signal is processed to improve the auditory perception of the end user.
  • the embodiment of the present disclosure generates a first sound pressure transmitted by the sound source to the left ear of the three-dimensional human head model, and a second sound pressure that is transmitted to the right ear of the three-dimensional human head model, and the sound wave is generated at the original position of the three-dimensional human head model.
  • the third sound pressure is used to calculate the first HRTF of the sound wave transmitted to the left ear of the three-dimensional human head model and the second HRTF of the sound wave transmitted to the right ear of the three-dimensional human head model, and finally the first HRTF and the second HRTF obtained by the calculation, the sound source
  • the emitted sound signal is processed, thus simplifying the calculation process of the HRTF, so that the end user can obtain the personalized HRTF parameters and process the sound signal according to the personalized HRTF without referring to the experimentally measured HRTF database.
  • the processing of the sound signal solves the problem of the virtual auditory space synthesis distortion caused by the non-personalized HRTF in the prior art virtual hearing technology, and improves the user's sound experience.
  • FIG. 4 it is a structural block diagram of a terminal in an embodiment of the present disclosure, where the terminal includes:
  • the obtaining module 401 is configured to acquire a three-dimensional human head model of the terminal user
  • the calculation module 402 is configured to calculate a first head association transfer function HRTF of the sound wave emitted by one sound source to the left ear of the three-dimensional human head model and a second HRTF transmitted to the right ear of the three-dimensional human head model;
  • the processing module 403 is configured to emit sound to the sound source according to the first HRTF and the second HRTF The tone signal is processed.
  • the obtaining module 401 is configured to obtain a three-dimensional human head model of the terminal user by using the terminal scanning.
  • the calculation module 402 includes: a first calculating unit configured to calculate a first sound pressure transmitted by the sound wave to the left ear, a second sound pressure transmitted by the sound wave to the right ear, and after the three-dimensional human head model is removed, the sound wave is in three dimensions a third sound pressure generated at a home position of the head model; a second calculating unit configured to calculate a first HRTF according to the first sound pressure and the third sound pressure; and a third calculating unit configured to be based on the second sound pressure sum The third sound pressure is calculated to obtain a second HRTF.
  • the first calculating unit is configured to calculate a first sound pressure transmitted by the sound wave to the left ear when the sound wave is directly transmitted to the left ear; and when the sound wave is transmitted to the left ear through the reflection, according to the order in which the reflection occurs, The sound pressure after the sound wave is reflected is calculated, wherein when the sound pressure calculated in sequence is greater than a preset intensity threshold, the first sound pressure transmitted by the sound wave to the left ear is calculated.
  • the second computing unit is configured to be according to a formula Calculating a first HRTF, where H L represents the first HRTF, Representing the first sound pressure, P 0 (r 0 , f) represents the third sound pressure, r L represents the distance between the sound source and the left ear, and ⁇ L represents the horizontal angle of the sound source with respect to the left ear.
  • H L represents the first HRTF
  • P 0 (r 0 , f) represents the third sound pressure
  • r L represents the distance between the sound source and the left ear
  • ⁇ L represents the horizontal angle of the sound source with respect to the left ear.
  • f represents the acoustic wave frequency
  • a represents the physiological parameter of the three-dimensional human head model
  • r 0 represents the distance between the sound source and the center position of the three-dimensional human head model.
  • the first calculating unit is configured to calculate a second sound pressure transmitted by the sound wave to the right ear when the sound wave is directly transmitted to the right ear; and when the sound wave is transmitted to the right ear through the reflection, according to the order in which the reflection occurs, The sound pressure after the sound wave is reflected is calculated, wherein when the sound pressure calculated in sequence is greater than a preset intensity threshold, the second sound pressure transmitted by the sound wave to the right ear is calculated.
  • the third computing unit is configured to be according to a formula Calculating a second HRTF, where H R represents a second HRTF, Representing the second sound pressure, P 0 (r 0 , f) represents the third sound pressure, r R represents the distance between the sound source and the right ear, and ⁇ R represents the horizontal angle of the sound source with respect to the right ear, Indicates the elevation angle of the sound source relative to the right ear, f represents the acoustic wave frequency, a represents the physiological parameter of the three-dimensional human head model, and r 0 represents the distance between the sound source and the center position of the three-dimensional human head model.
  • the present disclosure also provides a computer storage medium having stored therein computer executable instructions configured to perform a method of processing the sound signal described above.
  • the present disclosure also provides a terminal, where the terminal includes a memory, a processor, where
  • the memory for storing a computer executable program for executing a processing method of the sound signal described above;
  • the processor is configured to read the computer executable program from the memory, and execute the processing method of the sound signal according to the computer executable program.
  • the processing method of the sound signal provided by the present disclosure, acquiring the three-dimensional human head model of the end user, calculating the first HRTF of the sound wave emitted by the sound source to the left ear of the three-dimensional human head model and the second HRTF of the right ear transmitted to the three-dimensional human head model And processing, according to the calculated first HRTF and the second HRTF, the signal sent by the sound source, so that the terminal can obtain the first HRTF and the second HRTF personalized by the terminal user by calculation, and pass the personalized first
  • the HRTF and the second HRTF process the sound signal, so that the end user can obtain a more realistic sound experience, and solve the problem of the virtual auditory space synthesis distortion caused by the non-personalized HRTF in the prior art virtual hearing technology. Improve the user's sound experience.

Abstract

The present disclosure provides a sound signal processing method, a terminal, and a computer storage medium. The processing method comprises: obtaining a three-dimensional head model of a terminal user; computing a first head-related transfer function (HRTF) indicating that a sound wave sent by a sound source is transmitted to the left ear of the three-dimensional head model and a second HRTF indicating that the sound wave sent by the sound source is transmitted to the right ear of the three-dimensional head model; and processing a sound signal sent by the sound source according to the first HRTF and the second HRTF.

Description

一种声音信号的处理方法及终端、计算机存储介质Method for processing sound signal and terminal, computer storage medium
相关申请的交叉引用Cross-reference to related applications
本申请基于申请号为201610724585.0、申请日为2016年08月25日的中国专利申请提出,并要求该中国专利申请的优先权,该中国专利申请的全部内容在此引入本申请作为参考。The present application is based on a Chinese patent application filed on Jan. 25, 2016, the disclosure of which is hereby incorporated by reference.
技术领域Technical field
本公开涉及音频处理技术,尤其涉及一种声音信号的处理方法及终端、计算机存储介质。The present disclosure relates to audio processing technologies, and in particular, to a method and a terminal for processing a sound signal, and a computer storage medium.
背景技术Background technique
虚拟现实技术的发展越来越成熟,但目前虚拟现实技术的核心都聚焦于视觉方面。但是人类在感知客观世界时,通过视觉可以获取大约60%的信息,通过听觉可以获取大约30%的信息,通过触觉、嗅觉等感官可以获取其余10%的信息,因此在虚拟现实技术中,虚拟听觉技术同样是不可或缺的技术,人们感知判断听觉的方位和距离等信息,对于人们真实的虚拟现实体验也同等重要。The development of virtual reality technology is more and more mature, but the core of virtual reality technology is now focused on the visual aspect. However, when human beings perceive the objective world, they can obtain about 60% of information through vision, about 30% of information through hearing, and the remaining 10% of information through senses such as touch and smell, so in virtual reality technology, virtual Hearing technology is also an indispensable technology. People's perception of the position and distance of hearing is equally important for people's real virtual reality experience.
虚拟听觉是通过音频信号处理技术,基于人耳的一些特殊声学效应,通过声学相关算法计算模拟,从而将声源重建在三维空间任一位置,实现声源方位的重现。此外,虚拟听觉是对由声源信号、听者与场景所组成的虚拟环境进行听觉的渲染,并通过数值仿真的方式,获得特定场景中特定位置处的虚拟声,来代替真实的听觉感受。对于目前的头戴VR设备来说,加上三维虚拟听觉,更能带来真实的沉浸感。Virtual hearing is based on audio signal processing technology, based on some special acoustic effects of the human ear, through the acoustic correlation algorithm to calculate the simulation, so that the sound source is reconstructed in any position in the three-dimensional space to achieve the reproduction of the sound source orientation. In addition, the virtual hearing is an auditory rendering of the virtual environment composed of the sound source signal, the listener and the scene, and the virtual sound at a specific position in the specific scene is obtained by numerical simulation instead of the real auditory feeling. For the current wearing VR device, plus three-dimensional virtual hearing, it can bring real immersion.
虚拟听觉技术就是给定声源、环境的物理和几何条件,模拟出声源辐 射的声波及其传输过程,从而得到声音的时间和空间信息,最后用头部关联传递函数(HRTF)信号处理模拟人耳对声波综合作用,将声音的时间和空间信息转换为双耳声信号,再通过耳机重放给双耳。HRTF是虚拟听觉技术的基础,HRTF可以由左右耳的幅度特性和耳间时间差重建。HRTF的获取方法多为实验室测量得到,目前国内外部分实验室通过测试的方式获得了HRTF数据库,并且测量的是特定人群的部分方位的HRTF。Virtual hearing technology is the physical and geometric conditions of a given sound source and environment, simulating the sound source The sound wave and its transmission process, so as to obtain the temporal and spatial information of the sound, and finally use the head correlation transfer function (HRTF) signal to process the simulated human ear to synthesize the sound wave, and convert the time and space information of the sound into a binaural sound signal. Then replay it through the headphones to both ears. The HRTF is the basis of virtual hearing technology, and the HRTF can be reconstructed from the amplitude characteristics of the left and right ears and the time difference between the ears. Most of the methods of obtaining HRTF are obtained by laboratory measurement. At present, some laboratories at home and abroad have obtained the HRTF database through testing, and measured the HRTF of the partial orientation of a specific population.
但是,虽然HRTF可以通过测量的方式得到,但是测量方法繁琐、昂贵耗时,并且测量的数据库也只是特定人群的HRTF。在现实生活中,由于人们每个人的生理结构不同,头部形状和外耳尺寸也不同,因此每个人都有其独特的HRTF,即测量的HRTF数据库不能代表所有人的结构分布特征。这样,如果拿实验得到的HRTF数据来进行虚拟听觉渲染,在进行听觉重放的时候人们很可能会产生听觉差异,甚至无法辨别听觉的方位信息。另外,HRTF测量的数据库也只是空间上离散的部分方位的数据,并不能代表所有方位的位置信息,这样虚拟听觉也会存在偏差。However, although the HRTF can be obtained by measurement, the measurement method is cumbersome, expensive and time consuming, and the database of measurement is only the HRTF of a specific population. In real life, because each person's physiological structure is different, the head shape and the outer ear size are different, so each person has its own unique HRTF, that is, the measured HRTF database cannot represent the structural distribution characteristics of all people. Thus, if the experimentally obtained HRTF data is used for virtual auditory rendering, it is likely that an auditory difference will occur when auditory playback is performed, and even the auditory orientation information cannot be discerned. In addition, the database of HRTF measurements is only spatially discrete part of the azimuth data, and can not represent the position information of all orientations, so there will be deviations in virtual hearing.
发明内容Summary of the invention
为了解决虚拟听觉空间合成失真导致听觉存在偏差的问题,本公开提供了一种声音信号的处理方法及终端、计算机存储介质。In order to solve the problem that the virtual auditory space synthesis distortion causes the hearing to be deviated, the present disclosure provides a method for processing a sound signal, a terminal, and a computer storage medium.
为了解决上述技术问题,第一方面,本公开提供了一种声音信号的处理方法,包括:In order to solve the above technical problem, in a first aspect, the present disclosure provides a method for processing a sound signal, including:
获取终端用户的三维人头模型;Obtaining a three-dimensional human head model of the end user;
计算一声源发出的声波传输至所述三维人头模型的左耳的第一头部关联传递函数HRTF以及传输至所述三维人头模型的右耳的第二HRTF;Calculating a first head correlation transfer function HRTF transmitted by a sound source to a left ear of the three-dimensional human head model and a second HRTF transmitted to a right ear of the three-dimensional human head model;
根据第一HRTF和第二HRTF,对所述声源发出的声音信号进行处理。The sound signal emitted by the sound source is processed according to the first HRTF and the second HRTF.
可选地,所述获取终端用户的三维人头模型的步骤包括:通过终端扫描获取终端用户的三维人头模型。 Optionally, the step of acquiring the three-dimensional human head model of the terminal user comprises: acquiring the three-dimensional human head model of the terminal user by using the terminal scanning.
可选地,计算一声源发出的声波传输至所述三维人头模型的左耳的第一头部关联传递函数HRTF以及传输至所述三维人头模型的右耳的第二HRTF的步骤包括:计算所述声波传输至所述左耳的第一声压、所述声波传输至所述右耳的第二声压以及在所述三维人头模型移开后,所述声波在所述三维人头模型的原位置处产生的第三声压;根据所述第一声压和第三声压,计算得到所述第一HRTF;根据所述第二声压和第三声压,计算得到所述第二HRTF。Optionally, the step of calculating the first head related transfer function HRTF of the sound wave emitted by the sound source to the left ear of the three-dimensional human head model and the second HRTF transmitted to the right ear of the three-dimensional human head model includes: calculating Determining a first sound pressure transmitted by the sound wave to the left ear, a second sound pressure transmitted by the sound wave to the right ear, and after the three-dimensional human head model is removed, the sound wave is in the original of the three-dimensional human head model a third sound pressure generated at the position; calculating the first HRTF according to the first sound pressure and the third sound pressure; calculating the second HRTF according to the second sound pressure and the third sound pressure .
可选地,计算所述声波传输至所述左耳的第一声压的步骤包括:当所述声波直射传输至所述左耳时,计算所述声波传输至所述左耳的第一声压;当所述声波经过反射传输至所述左耳时,按照反射发生的先后顺序,依次计算所述声波发生反射后的声压,其中,当依次计算得到的所述声压均大于一预设强度阈值时,计算所述声波传输至所述左耳的第一声压。Optionally, the step of calculating the first sound pressure transmitted by the sound wave to the left ear comprises: calculating a first sound transmitted by the sound wave to the left ear when the sound wave is directly transmitted to the left ear When the sound waves are transmitted to the left ear through reflection, the sound pressures after the sound waves are reflected are sequentially calculated according to the order in which the reflection occurs, wherein the sound pressures calculated in sequence are greater than one When the intensity threshold is set, the first sound pressure transmitted by the sound wave to the left ear is calculated.
可选地,所述根据所述第一声压和第三声压,计算得到所述第一HRTF的步骤包括:根据公式
Figure PCTCN2017082940-appb-000001
计算得到第一HRTF,其中,HL表示第一HRTF,
Figure PCTCN2017082940-appb-000002
表示第一声压,P0(r0,f)表示第三声压,rL表示所述声源与所述左耳之间的距离,θL表示所述声源相对于所述左耳的水平角,
Figure PCTCN2017082940-appb-000003
表示所述声源相对于所述左耳的仰角,f表示声波频率,a表示所述三维人头模型的生理参数,r0表示所述声源与所述三维人头模型的中心位置之间的距离。
Optionally, the step of calculating the first HRTF according to the first sound pressure and the third sound pressure comprises: according to a formula
Figure PCTCN2017082940-appb-000001
Calculating a first HRTF, where H L represents the first HRTF,
Figure PCTCN2017082940-appb-000002
Representing a first sound pressure, P 0 (r 0 , f) represents a third sound pressure, r L represents a distance between the sound source and the left ear, and θ L represents the sound source relative to the left ear Horizontal angle,
Figure PCTCN2017082940-appb-000003
Representing the elevation angle of the sound source with respect to the left ear, f representing the acoustic wave frequency, a representing the physiological parameter of the three-dimensional human head model, and r 0 indicating the distance between the sound source and the center position of the three-dimensional human head model .
可选地,计算所述声波传输至所述右耳的第二声压的步骤包括:当所述声波直射传输至所述右耳时,计算所述声波传输至所述右耳的第二声压;当所述声波经过反射传输至所述右耳时,按照反射发生的先后顺序,依次计算所述声波发生反射后的声压,其中,当依次计算得到的所述声压均大于一预设强度阈值时,计算所述声波传输至所述右耳的第二声压。Optionally, the step of calculating the second sound pressure of the sound wave transmitted to the right ear comprises: calculating a second sound of the sound wave transmitted to the right ear when the sound wave is directly transmitted to the right ear When the sound waves are transmitted to the right ear through reflection, the sound pressures after the sound waves are reflected are sequentially calculated according to the order in which the reflection occurs, wherein the sound pressures calculated in sequence are greater than one When the intensity threshold is set, the second sound pressure transmitted by the sound wave to the right ear is calculated.
可选地,根据所述第二声压和第三声压,计算得到所述第二HRTF的 步骤包括:根据公式
Figure PCTCN2017082940-appb-000004
计算得到第二HRTF,其中,HR表示第二HRTF,
Figure PCTCN2017082940-appb-000005
表示第二声压,P0(r0,f)表示第三声压,rR表示所述声源与所述右耳之间的距离,θR表示所述声源相对于所述右耳的水平角,
Figure PCTCN2017082940-appb-000006
表示所述声源相对于所述右耳的仰角,f表示声波频率,a表示所述三维人头模型的生理参数,r0表示所述声源与所述三维人头模型的中心位置之间的距离。
Optionally, the step of calculating the second HRTF according to the second sound pressure and the third sound pressure comprises: according to a formula
Figure PCTCN2017082940-appb-000004
Calculating a second HRTF, where H R represents a second HRTF,
Figure PCTCN2017082940-appb-000005
Representing a second sound pressure, P 0 (r 0 , f) represents a third sound pressure, r R represents a distance between the sound source and the right ear, and θ R represents the sound source relative to the right ear Horizontal angle,
Figure PCTCN2017082940-appb-000006
Representing the elevation angle of the sound source with respect to the right ear, f representing the acoustic wave frequency, a representing the physiological parameter of the three-dimensional human head model, and r 0 indicating the distance between the sound source and the center position of the three-dimensional human head model .
第二方面,本公开还提供了一种终端,包括:In a second aspect, the present disclosure also provides a terminal, including:
获取模块,配置为获取终端用户的三维人头模型;Obtaining a module configured to acquire a three-dimensional human head model of the end user;
计算模块,配置为计算一声源发出的声波传输至所述三维人头模型的左耳的第一头部关联传递函数HRTF以及传输至所述三维人头模型的右耳的第二HRTF;a calculation module configured to calculate a first head association transfer function HRTF transmitted by a sound source to a left ear of the three-dimensional human head model and a second HRTF transmitted to a right ear of the three-dimensional human head model;
处理模块,配置为根据第一HRTF和第二HRTF,对所述声源发出的声音信号进行处理。The processing module is configured to process the sound signal emitted by the sound source according to the first HRTF and the second HRTF.
可选地,所述获取模块,配置为通过终端扫描获取终端用户的三维人头模型。Optionally, the acquiring module is configured to acquire a three-dimensional human head model of the terminal user by using the terminal scanning.
可选地,所述计算模块包括:第一计算单元,配置为计算所述声波传输至所述左耳的第一声压、所述声波传输至所述右耳的第二声压以及在所述三维人头模型移开后,所述声波在所述三维人头模型的原位置处产生的第三声压;第二计算单元,配置为根据所述第一声压和第三声压,计算得到所述第一HRTF;第三计算单元,配置为根据所述第二声压和第三声压,计算得到所述第二HRTF。Optionally, the calculation module includes: a first calculating unit configured to calculate a first sound pressure transmitted by the sound wave to the left ear, a second sound pressure transmitted by the sound wave to the right ear, and The third sound pressure generated by the sound wave at the original position of the three-dimensional human head model after the three-dimensional human head model is removed; the second calculating unit is configured to calculate according to the first sound pressure and the third sound pressure The first HRTF; the third calculating unit is configured to calculate the second HRTF according to the second sound pressure and the third sound pressure.
可选地,所述第一计算单元,配置为当所述声波直射传输至所述左耳时,计算所述声波传输至所述左耳的第一声压;当所述声波经过反射传输至所述左耳时,按照反射发生的先后顺序,依次计算所述声波发生反射后的声压,其中,当依次计算得到的所述声压均大于一预设强度阈值时,计 算所述声波传输至所述左耳的第一声压。Optionally, the first calculating unit is configured to calculate, when the sound wave is directly transmitted to the left ear, a first sound pressure transmitted by the sound wave to the left ear; when the sound wave is transmitted to the sound In the left ear, the sound pressure after the sound wave is reflected is sequentially calculated according to the sequence in which the reflection occurs, wherein when the sound pressures sequentially calculated are greater than a preset intensity threshold, Calculating a first sound pressure transmitted by the sound wave to the left ear.
可选地,所述第二计算单元,配置为根据公式
Figure PCTCN2017082940-appb-000007
计算得到第一HRTF,其中,HL表示第一HRTF,
Figure PCTCN2017082940-appb-000008
表示第一声压,P0(r0,f)表示第三声压,rL表示所述声源与所述左耳之间的距离,θL表示所述声源相对于所述左耳的水平角,
Figure PCTCN2017082940-appb-000009
表示所述声源相对于所述左耳的仰角,f表示声波频率,a表示所述三维人头模型的生理参数,r0表示所述声源与所述三维人头模型的中心位置之间的距离。
Optionally, the second calculating unit is configured according to a formula
Figure PCTCN2017082940-appb-000007
Calculating a first HRTF, where H L represents the first HRTF,
Figure PCTCN2017082940-appb-000008
Representing a first sound pressure, P 0 (r 0 , f) represents a third sound pressure, r L represents a distance between the sound source and the left ear, and θ L represents the sound source relative to the left ear Horizontal angle,
Figure PCTCN2017082940-appb-000009
Representing the elevation angle of the sound source with respect to the left ear, f representing the acoustic wave frequency, a representing the physiological parameter of the three-dimensional human head model, and r 0 indicating the distance between the sound source and the center position of the three-dimensional human head model .
可选地,所述第一计算单元,配置为当所述声波直射传输至所述右耳时,计算所述声波传输至所述右耳的第二声压;当所述声波经过反射传输至所述右耳时,按照反射发生的先后顺序,依次计算所述声波发生反射后的声压,其中,当依次计算得到的所述声压均大于一预设强度阈值时,计算所述声波传输至所述右耳的第二声压。Optionally, the first calculating unit is configured to calculate a second sound pressure that is transmitted by the sound wave to the right ear when the sound wave is directly transmitted to the right ear; and when the sound wave is transmitted to the sound wave through reflection In the right ear, the sound pressure after the sound wave is reflected is sequentially calculated according to the sequence in which the reflection occurs, wherein the sound wave transmission is calculated when the sound pressures sequentially calculated are greater than a preset intensity threshold. a second sound pressure to the right ear.
可选地,所述第三计算单元,配置为根据公式
Figure PCTCN2017082940-appb-000010
计算得到第二HRTF,其中,HR表示第二HRTF,
Figure PCTCN2017082940-appb-000011
表示第二声压,P0(r0,f)表示第三声压,rR表示所述声源与所述右耳之间的距离,θR表示所述声源相对于所述右耳的水平角,
Figure PCTCN2017082940-appb-000012
表示所述声源相对于所述右耳的仰角,f表示声波频率,a表示所述三维人头模型的生理参数,r0表示所述声源与所述三维人头模型的中心位置之间的距离。
Optionally, the third calculating unit is configured according to a formula
Figure PCTCN2017082940-appb-000010
Calculating a second HRTF, where H R represents a second HRTF,
Figure PCTCN2017082940-appb-000011
Representing a second sound pressure, P 0 (r 0 , f) represents a third sound pressure, r R represents a distance between the sound source and the right ear, and θ R represents the sound source relative to the right ear Horizontal angle,
Figure PCTCN2017082940-appb-000012
Representing the elevation angle of the sound source with respect to the right ear, f representing the acoustic wave frequency, a representing the physiological parameter of the three-dimensional human head model, and r 0 indicating the distance between the sound source and the center position of the three-dimensional human head model .
所述获取模块、所述计算模块、所述处理模块、所述第一计算单元、所述第二计算单元、所述第三计算单元在执行处理时,可以采用中央处理器(CPU,Central Processing Unit)、数字信号处理器(DSP,Digital Singnal Processor)或可编程逻辑阵列(FPGA,Field-Programmable Gate Array) 实现。The acquisition module, the calculation module, the processing module, the first calculation unit, the second calculation unit, and the third calculation unit may use a central processing unit (CPU, Central Processing) when performing processing. Unit), Digital Signal Processor (DSP, Digital Singnal Processor) or Field-Programmable Gate Array (FPGA) achieve.
第三方面,本公开还提供了一种计算机存储介质,其中存储有计算机可执行指令,该计算机可执行指令配置执行上述声音信号的处理方法。In a third aspect, the present disclosure also provides a computer storage medium having stored therein computer executable instructions configured to perform a method of processing the sound signal.
采用本公开能达到的有益效果是:The beneficial effects that can be achieved with the present disclosure are:
本公开通过首先获取终端用户的三维人头模型,然后计算声源发出的声波传输至三维人头模型的左耳的第一HRTF以及传输至三维人头模型的右耳的第二HRTF,最后根据计算得到的第一HRTF和第二HRTF,对声源发出的信号进行处理,使得终端可以通过计算的方式获取终端用户个性化的第一HRTF和第二HRTF,并通过个性化的第一HRTF和第二HRTF对声音信号进行处理,从而使得终端用户能够获得更加真实的音效体验,解决了现有技术中虚拟听觉技术存在的由于非个性化的HRTF导致的虚拟听觉空间合成失真的问题,提高了用户的音效体验。The present disclosure first obtains a three-dimensional human head model of the end user, and then calculates a first HRTF transmitted by the sound source to the left ear of the three-dimensional human head model and a second HRTF transmitted to the right ear of the three-dimensional human head model, and finally calculated according to the calculation. The first HRTF and the second HRTF process the signal sent by the sound source, so that the terminal can obtain the first HRTF and the second HRTF personalized by the terminal user by calculation, and pass the personalized first HRTF and the second HRTF. The sound signal is processed, so that the end user can obtain a more realistic sound experience, and solves the problem that the virtual hearing technology has the virtual auditory space synthesis distortion caused by the non-personalized HRTF in the prior art, and improves the user's sound effect. Experience.
附图说明DRAWINGS
图1表示本公开实施例中一声音信号的处理方法的步骤流程图;1 is a flow chart showing the steps of a method for processing a sound signal in an embodiment of the present disclosure;
图2表示本公开实施例中又一声音信号的处理方法的步骤流程图;2 is a flow chart showing the steps of a method for processing another sound signal in the embodiment of the present disclosure;
图3表示声波在室内传输的场景示意图;Figure 3 is a schematic diagram showing a scene in which sound waves are transmitted indoors;
图4表示本公开实施例中一声音信号的终端的结构框图。4 is a block diagram showing the structure of a terminal of a sound signal in an embodiment of the present disclosure.
具体实施方式detailed description
下面将参照附图更详细地描述本公开的示例性实施例。虽然附图中显示了本公开的示例性实施例,然而应当理解,可以以各种形式实现本公开而不应被这里阐述的实施例所限制。相反,提供这些实施例是为了能够更透彻地理解本公开,并且能够将本公开的范围完整的传达给本领域的技术人员。Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. While the embodiments of the present invention have been shown in the drawings, the embodiments Rather, these embodiments are provided so that this disclosure will be more fully understood and the scope of the disclosure will be fully disclosed.
如图1所示,为本公开实施例中一声音信号的处理方法的步骤流程图,该处理方法包括: FIG. 1 is a flow chart showing the steps of a method for processing a sound signal according to an embodiment of the present disclosure, the processing method includes:
步骤101,获取终端用户的三维人头模型。Step 101: Acquire a three-dimensional human head model of the terminal user.
在本步骤中,可选地,在获取终端用户的三维人头模型时,可以通过终端扫描的方式获取终端用户的三维人头模型。此外,可选地,人头三维模型可以包括终端用户的三分之一躯干部分。此外,终端扫描获取到用户的三维人头模型之后,可以对三维人头模型进行建模处理,从而获取到三维人头模型的生理参数等数据。In this step, optionally, when acquiring the three-dimensional human head model of the terminal user, the three-dimensional human head model of the terminal user may be acquired by means of terminal scanning. Additionally, optionally, the human head three-dimensional model may include one-third of the torso portion of the end user. In addition, after the terminal scans the user's three-dimensional human head model, the three-dimensional human head model can be modeled to obtain the physiological parameters and the like of the three-dimensional human head model.
步骤102,计算一声源发出的声波传输至三维人头模型的左耳的第一头部关联传递函数HRTF以及传输至三维人头模型的右耳的第二HRTF。Step 102: Calculate a first head related transfer function HRTF of the left ear of the three-dimensional human head model and a second HRTF transmitted to the right ear of the three-dimensional human head model.
在本步骤中,可选地,在获取到终端用户的三维人头模型之后,可以通过计算的方式,计算得到一声源发出的声波传输至三维人头模型的左耳的第一HRTF,以及声源发出的声波传输至三维人头模型的右耳的第二HRTF。这样,可以通过计算的方式获取每个终端用户独有的个性化的HRTF。In this step, optionally, after acquiring the three-dimensional human head model of the terminal user, the first HRTF of the left ear of the three-dimensional human head model can be calculated and calculated by calculation, and the sound source is emitted. The sound waves are transmitted to the second HRTF of the right ear of the three-dimensional human head model. In this way, a personalized HRTF unique to each end user can be obtained by calculation.
步骤103,根据第一HRTF和第二HRTF,对声源发出的声音信号进行处理。Step 103: Process the sound signal emitted by the sound source according to the first HRTF and the second HRTF.
在本步骤中,可选地,可以根据计算得到的第一HRTF和第二HRTF,对声源发出的声音信号进行处理。这样,根据每个终端用户独有的个性化的HRTF,对声源发出的声音信号进行处理,使得在虚拟现实技术中,不需要再根据实验室测量得到的HRTF数据库中的HRTF对终端用户接收到的声音信号进行处理,提高了终端用户的听觉感知度。In this step, optionally, the sound signal emitted by the sound source may be processed according to the calculated first HRTF and the second HRTF. In this way, according to the personalized HRTF unique to each end user, the sound signal emitted by the sound source is processed, so that in the virtual reality technology, the HRTF in the HRTF database measured by the laboratory does not need to be received by the terminal user. The incoming sound signal is processed to improve the auditory perception of the end user.
这样,本实施例通过首先获取终端用户的三维人头模型,然后计算声源发出的声波传输至三维人头模型的左耳的第一HRTF以及传输至三维人头模型的右耳的第二HRTF,最后根据计算得到的第一HRTF和第二HRTF,对声源发出的信号进行处理,使得终端可以通过计算的方式获取终端用户个性化的第一HRTF和第二HRTF,并通过个性化的第一HRTF和第二HRTF 对声音信号进行处理,从而使得终端用户能够获得更加真实的音效体验,解决了现有技术中虚拟听觉技术存在的由于非个性化的HRTF导致的虚拟听觉空间合成失真的问题,提高了用户的音效体验。In this way, the first embodiment obtains the three-dimensional human head model of the end user, and then calculates the first HRTF transmitted by the sound source to the left ear of the three-dimensional human head model and the second HRTF transmitted to the right ear of the three-dimensional human head model, and finally according to Calculating the first HRTF and the second HRTF, processing the signal sent by the sound source, so that the terminal can obtain the first HRTF and the second HRTF personalized by the terminal user by calculation, and pass the personalized first HRTF and Second HRTF The sound signal is processed, so that the end user can obtain a more realistic sound experience, and solves the problem that the virtual hearing technology has the virtual auditory space synthesis distortion caused by the non-personalized HRTF in the prior art, and improves the user's sound effect. Experience.
如图2所示,为本公开实施例中又一声音信号的处理方法的步骤流程图,该处理方法包括:As shown in FIG. 2, it is a flow chart of steps of a method for processing a sound signal according to an embodiment of the present disclosure, the processing method includes:
步骤201,获取终端用户的三维人头模型。Step 201: Acquire a three-dimensional human head model of the terminal user.
在本步骤中,可选地,在获取终端用户的三维人头模型时,可以通过终端扫描的方式获取终端用户的三维人头模型。此外,可选地,人头三维模型可以包括终端用户的三分之一躯干部分。此外,终端扫描获取到用户的三维人头模型之后,可以对三维人头模型进行建模处理,从而获取到三维人头模型的生理参数等数据。In this step, optionally, when acquiring the three-dimensional human head model of the terminal user, the three-dimensional human head model of the terminal user may be acquired by means of terminal scanning. Additionally, optionally, the human head three-dimensional model may include one-third of the torso portion of the end user. In addition, after the terminal scans the user's three-dimensional human head model, the three-dimensional human head model can be modeled to obtain the physiological parameters and the like of the three-dimensional human head model.
步骤202,计算声波传输至左耳的第一声压、声波传输至右耳的第二声压以及在三维人头模型移开后,声波在三维人头模型的原位置处产生的第三声压。Step 202: Calculate a first sound pressure transmitted by the sound wave to the left ear, a second sound pressure transmitted by the sound wave to the right ear, and a third sound pressure generated by the sound wave at the original position of the three-dimensional human head model after the three-dimensional human head model is removed.
在本步骤中,可选地,在计算声波传输至三维人头模型左耳的第一声压时,若声波直射传输至三维人头模型左耳,则可以直接计算声波传输至左耳的第一声压;若声波为经过反射传输至三维人头模型左耳时,可以按照反射发生的先后顺序,依次计算声波发生反射后的声压,其中,当依次计算得到的声压均大于一预设强度阈值时,计算该声波传输至左耳的第一声压。可选地,在计算第一声压时,可以先计算得到传输至三维人头模型左耳的第一声强。In this step, optionally, when calculating the first sound pressure transmitted by the sound wave to the left ear of the three-dimensional human head model, if the sound wave is directly transmitted to the left ear of the three-dimensional human head model, the first sound of the sound wave transmitted to the left ear can be directly calculated. If the sound wave is transmitted to the left ear of the three-dimensional human head model through reflection, the sound pressure after the sound wave is reflected may be sequentially calculated according to the order in which the reflection occurs, wherein the sound pressure calculated in sequence is greater than a preset intensity threshold. At the time, the first sound pressure transmitted by the sound wave to the left ear is calculated. Optionally, when calculating the first sound pressure, the first sound intensity transmitted to the left ear of the three-dimensional human head model may be calculated first.
当然,在计算声波传输至三维人头模型右耳的第二声压时,若声波直射传输至右耳,则可以计算声波传输至右耳的第二声压;若声波经过反射传输至右耳,则可以按照反射发生的先后顺序,依次计算声波发生反射后的声压,其中,当依次计算得到的声压均大于一预设强度阈值时,计算声 波传输至右耳的第二声压。可选地,在计算第二声压时,可以先计算得到传输至三维人头模型右耳的第二声强。Of course, when calculating the second sound pressure transmitted by the sound wave to the right ear of the three-dimensional human head model, if the sound wave is directly transmitted to the right ear, the second sound pressure transmitted by the sound wave to the right ear can be calculated; if the sound wave is transmitted to the right ear through the reflection, The sound pressure after the sound wave is reflected may be sequentially calculated according to the order in which the reflection occurs, wherein when the sound pressure calculated in sequence is greater than a preset intensity threshold, the sound is calculated. The second sound pressure transmitted by the wave to the right ear. Optionally, when calculating the second sound pressure, the second sound intensity transmitted to the right ear of the three-dimensional human head model may be calculated first.
下面对此进行具体说明。This will be specifically described below.
在声波传输的过程中,可以将声波的传输过程分为直射和反射两种,且声波在室外传输时多为直射传输,而在室内传输时多为反射传输。如图3所示,为声波在室内传输的场景示意图。在图3中,A为三维人头模型,B为声源,C为室内壁面,实线为声波直射传输路径,虚线为声波反射传输路径。In the process of sound wave transmission, the sound wave transmission process can be divided into direct light and reflection, and the sound wave is mostly direct transmission when transmitting outdoors, and mostly reflective transmission when transmitting indoors. As shown in FIG. 3, it is a schematic diagram of a scene in which sound waves are transmitted indoors. In Fig. 3, A is a three-dimensional human head model, B is a sound source, C is an indoor wall surface, a solid line is a direct wave transmission path, and a broken line is a sound wave reflection transmission path.
当声源B发出的声波直射传输至三维人头模型A时,可以根据声强的计算公式,直接根据参数r和e,计算得到声源B在三维人头模型产生的声强。其中,r表示声源B与三维人头模型之间的距离,e表示声源B每秒发出的能量,且参数e在每个方向上传输的分能量与参数θ和
Figure PCTCN2017082940-appb-000013
相关,其中,θ表示声源B相对于声源接收者的水平角,
Figure PCTCN2017082940-appb-000014
表示声源B相对于声源接收者的仰角。此外,由于三维人头模型接收到的声压还与声波频率f以及三维人头模型的生理参数有关,因此可以根据参数
Figure PCTCN2017082940-appb-000015
和a,计算得到声源在三维人头模型左耳产生的第一声压,根据参数rR,θR
Figure PCTCN2017082940-appb-000016
f,a计算得到声源在三维人头模型右耳产生的第二声压,其中,rL表示声源与左耳之间的距离,θL表示声源相对于左耳的水平角,
Figure PCTCN2017082940-appb-000017
表示声源相对于左耳的仰角,f表示声波频率,a表示三维人头模型的生理参数,rR表示声源与右耳之间的距离,θR表示声源相对于右耳的水平角,
Figure PCTCN2017082940-appb-000018
表示声源相对于右耳的仰角。
When the sound wave emitted by the sound source B is directly transmitted to the three-dimensional human head model A, the sound intensity generated by the sound source B in the three-dimensional human head model can be calculated according to the calculation formula of the sound intensity directly according to the parameters r and e. Where r is the distance between the sound source B and the three-dimensional human head model, e is the energy emitted by the sound source B per second, and the energy and parameter θ of the parameter e transmitted in each direction
Figure PCTCN2017082940-appb-000013
Correlation, where θ represents the horizontal angle of sound source B relative to the recipient of the sound source,
Figure PCTCN2017082940-appb-000014
Indicates the elevation angle of source B relative to the recipient of the source. In addition, since the sound pressure received by the three-dimensional human head model is also related to the acoustic frequency f and the physiological parameters of the three-dimensional human head model, it can be based on the parameters.
Figure PCTCN2017082940-appb-000015
And a, calculate the first sound pressure generated by the sound source in the left ear of the three-dimensional human head model, according to the parameters r R , θ R ,
Figure PCTCN2017082940-appb-000016
f, a calculates the second sound pressure generated by the sound source in the right ear of the three-dimensional human head model, where r L represents the distance between the sound source and the left ear, and θ L represents the horizontal angle of the sound source relative to the left ear,
Figure PCTCN2017082940-appb-000017
Indicates the elevation angle of the sound source relative to the left ear, f denotes the acoustic wave frequency, a denotes the physiological parameter of the three-dimensional human head model, r R denotes the distance between the sound source and the right ear, and θ R denotes the horizontal angle of the sound source with respect to the right ear,
Figure PCTCN2017082940-appb-000018
Indicates the elevation angle of the sound source relative to the right ear.
此外,当声源B发出的声波反射传输至三维人头模型A时,由于声波在反射的过程中会根据反射的次数以及路径强度进行衰减,因此在声波反射传输过程中,还需要获取声波与室内壁面C的碰撞点,然后根据获取到的碰撞点,获取到声波的反射反向,并按照声波反射发生的先后顺序,依 次计算得到声波反射后的声波强度,并将声波强度转换为声压。其中,若声波在反射的过程中,声压小于一预设强度阈值,即声波不会再传输至三维人头模型时,可以不再对声波的传输进行处理,但是若依次计算得到的声压均大于该预设强度阈值,则还需要计算声波传输至三维人头模型接收到的声压。可选地,在计算声波反射传输至三维人头模型A的声压时,在考虑参数r,θ,
Figure PCTCN2017082940-appb-000019
f和a的同时,还需要考虑到反射系数和散射系数,当然,根据反射介质的不同,反射系数和散射系数也不相同。
In addition, when the sound wave reflection from the sound source B is transmitted to the three-dimensional human head model A, since the sound wave is attenuated according to the number of reflections and the path intensity during the reflection process, it is necessary to acquire the sound wave and the indoor during the sound wave reflection transmission process. The collision point of the wall surface C is then obtained according to the acquired collision point, and the reflection of the sound wave is reversed, and the sound wave intensity after the sound wave reflection is sequentially calculated according to the order in which the sound wave reflection occurs, and the sound wave intensity is converted into the sound pressure. Wherein, if the sound wave is in the process of reflection, the sound pressure is less than a preset intensity threshold, that is, when the sound wave is not transmitted to the three-dimensional human head model, the sound wave transmission may not be processed, but if the sound pressure is calculated in turn, Above the preset intensity threshold, it is also necessary to calculate the sound pressure received by the sound wave transmitted to the three-dimensional human head model. Optionally, when calculating the sound pressure transmitted by the acoustic reflection to the three-dimensional human head model A, considering the parameters r, θ,
Figure PCTCN2017082940-appb-000019
At the same time as f and a, it is also necessary to consider the reflection coefficient and the scattering coefficient. Of course, depending on the reflection medium, the reflection coefficient and the scattering coefficient are also different.
这样,参照上述方式,可以计算得到声波通过直射或反射方式传输至三维人头模型左耳的第一声压、传输至三维人头模型右耳的第二声压以及声波在三维人头模型的原位置处产生的第三声压。Thus, referring to the above manner, the first sound pressure transmitted by the sound wave to the left ear of the three-dimensional human head model by direct or reflective means, the second sound pressure transmitted to the right ear of the three-dimensional human head model, and the sound wave at the original position of the three-dimensional human head model can be calculated. The third sound pressure produced.
步骤203,根据第一声压和第三声压,计算得到第一HRTF。Step 203: Calculate the first HRTF according to the first sound pressure and the third sound pressure.
在本步骤中,可选地,在获取到声波传输至三维人头模型左耳的第一声压和声波在三维人头模型的原位置处产生的第三声压时,可以根据公式
Figure PCTCN2017082940-appb-000020
计算得到第一HRTF,其中,HL表示第一HRTF,
Figure PCTCN2017082940-appb-000021
表示第一声压,P0(r0,f)表示第三声压,rL表示声源与左耳之间的距离,θL表示声源相对于左耳的水平角,
Figure PCTCN2017082940-appb-000022
表示声源相对于左耳的仰角,f表示声波频率,a表示三维人头模型的生理参数,r0表示声源与三维人头模型的中心位置之间的距离。
In this step, optionally, when acquiring the third sound pressure generated by the sound wave transmitted to the left ear of the three-dimensional human head model and the sound pressure generated at the original position of the three-dimensional human head model, according to the formula
Figure PCTCN2017082940-appb-000020
Calculating a first HRTF, where H L represents the first HRTF,
Figure PCTCN2017082940-appb-000021
Representing the first sound pressure, P 0 (r 0 , f) represents the third sound pressure, r L represents the distance between the sound source and the left ear, and θ L represents the horizontal angle of the sound source with respect to the left ear.
Figure PCTCN2017082940-appb-000022
Indicates the elevation angle of the sound source relative to the left ear, f represents the acoustic wave frequency, a represents the physiological parameter of the three-dimensional human head model, and r 0 represents the distance between the sound source and the center position of the three-dimensional human head model.
步骤204,根据第二声压和第三声压,计算得到第二HRTF。Step 204: Calculate a second HRTF according to the second sound pressure and the third sound pressure.
在本步骤中,可选地,在获取到声波传输至三维人头模型右耳的第二声压和声波在三维人头模型的原位置处产生的第三声压时,可以根据公式
Figure PCTCN2017082940-appb-000023
计算得到第二HRTF,其中,HR表示第二HRTF,
Figure PCTCN2017082940-appb-000024
表示第二声压,P0(r0,f)表示第三声压,rR表示声源与右耳之间的距离,θR表示声源相对于右耳的水平角,
Figure PCTCN2017082940-appb-000025
表示声源 相对于右耳的仰角,f表示声波频率,a表示三维人头模型的生理参数,r0表示声源与三维人头模型的中心位置之间的距离。
In this step, optionally, when acquiring the third sound pressure generated by the sound wave transmitted to the right ear of the three-dimensional human head model and the third sound pressure generated at the original position of the three-dimensional human head model, according to the formula
Figure PCTCN2017082940-appb-000023
Calculating a second HRTF, where H R represents a second HRTF,
Figure PCTCN2017082940-appb-000024
Representing the second sound pressure, P 0 (r 0 , f) represents the third sound pressure, r R represents the distance between the sound source and the right ear, and θ R represents the horizontal angle of the sound source with respect to the right ear.
Figure PCTCN2017082940-appb-000025
Representing the elevation angle of the sound source relative to the right ear, f represents the acoustic wave frequency, a represents the physiological parameter of the three-dimensional human head model, and r 0 represents the distance between the sound source and the center position of the three-dimensional human head model.
步骤205,根据第一HRTF和第二HRTF,对声源发出的声音信号进行处理。Step 205: Process the sound signal emitted by the sound source according to the first HRTF and the second HRTF.
在本步骤中,可选地,可以根据计算得到的第一HRTF和第二HRTF,对声源发出的声音信号进行处理。这样,根据每个终端用户独有的个性化的HRTF,对声源发出的声音信号进行处理,使得在虚拟现实技术中,不需要再根据实验室测量得到的HRTF数据库中的HRTF对终端用户接收到的声音信号进行处理,提高了终端用户的听觉感知度。In this step, optionally, the sound signal emitted by the sound source may be processed according to the calculated first HRTF and the second HRTF. In this way, according to the personalized HRTF unique to each end user, the sound signal emitted by the sound source is processed, so that in the virtual reality technology, the HRTF in the HRTF database measured by the laboratory does not need to be received by the terminal user. The incoming sound signal is processed to improve the auditory perception of the end user.
这样,本公开实施例通过计算声源发出的声波传输至三维人头模型左耳的第一声压、声波传输至三维人头模型右耳的第二声压,声波在三维人头模型的原位置处产生的第三声压,来计算声波传输至三维人头模型左耳的第一HRTF和声波传输至三维人头模型右耳的第二HRTF,最后通过计算得到的第一HRTF和第二HRTF,对声源发出的声音信号进行处理,这样,简化了HRTF的计算过程,使得终端用户可以获取个性化的HRTF参数,并根据个性化的HRTF进行声音信号的处理,而不用再参照实验测得的HRTF数据库来进行声音信号的处理,解决了现有技术中虚拟听觉技术存在的由于非个性化的HRTF导致的虚拟听觉空间合成失真的问题,提高了用户的音效体验。In this way, the embodiment of the present disclosure generates a first sound pressure transmitted by the sound source to the left ear of the three-dimensional human head model, and a second sound pressure that is transmitted to the right ear of the three-dimensional human head model, and the sound wave is generated at the original position of the three-dimensional human head model. The third sound pressure is used to calculate the first HRTF of the sound wave transmitted to the left ear of the three-dimensional human head model and the second HRTF of the sound wave transmitted to the right ear of the three-dimensional human head model, and finally the first HRTF and the second HRTF obtained by the calculation, the sound source The emitted sound signal is processed, thus simplifying the calculation process of the HRTF, so that the end user can obtain the personalized HRTF parameters and process the sound signal according to the personalized HRTF without referring to the experimentally measured HRTF database. The processing of the sound signal solves the problem of the virtual auditory space synthesis distortion caused by the non-personalized HRTF in the prior art virtual hearing technology, and improves the user's sound experience.
如图4所示,为本公开实施例中一终端的结构框图,该终端包括:As shown in FIG. 4, it is a structural block diagram of a terminal in an embodiment of the present disclosure, where the terminal includes:
获取模块401,配置为获取终端用户的三维人头模型;The obtaining module 401 is configured to acquire a three-dimensional human head model of the terminal user;
计算模块402,配置为计算一声源发出的声波传输至三维人头模型的左耳的第一头部关联传递函数HRTF以及传输至三维人头模型的右耳的第二HRTF;The calculation module 402 is configured to calculate a first head association transfer function HRTF of the sound wave emitted by one sound source to the left ear of the three-dimensional human head model and a second HRTF transmitted to the right ear of the three-dimensional human head model;
处理模块403,配置为根据第一HRTF和第二HRTF,对声源发出的声 音信号进行处理。The processing module 403 is configured to emit sound to the sound source according to the first HRTF and the second HRTF The tone signal is processed.
可选地,获取模块401,配置为通过终端扫描获取终端用户的三维人头模型。Optionally, the obtaining module 401 is configured to obtain a three-dimensional human head model of the terminal user by using the terminal scanning.
可选地,计算模块402包括:第一计算单元,配置为计算声波传输至左耳的第一声压、声波传输至右耳的第二声压以及在三维人头模型移开后,声波在三维人头模型的原位置处产生的第三声压;第二计算单元,配置为根据第一声压和第三声压,计算得到第一HRTF;第三计算单元,配置为根据第二声压和第三声压,计算得到第二HRTF。Optionally, the calculation module 402 includes: a first calculating unit configured to calculate a first sound pressure transmitted by the sound wave to the left ear, a second sound pressure transmitted by the sound wave to the right ear, and after the three-dimensional human head model is removed, the sound wave is in three dimensions a third sound pressure generated at a home position of the head model; a second calculating unit configured to calculate a first HRTF according to the first sound pressure and the third sound pressure; and a third calculating unit configured to be based on the second sound pressure sum The third sound pressure is calculated to obtain a second HRTF.
可选地,第一计算单元,配置为当声波直射传输至左耳时,计算声波传输至左耳的第一声压;当声波经过反射传输至左耳时,按照反射发生的先后顺序,依次计算声波发生反射后的声压,其中,当依次计算得到的声压均大于一预设强度阈值时,计算声波传输至左耳的第一声压。Optionally, the first calculating unit is configured to calculate a first sound pressure transmitted by the sound wave to the left ear when the sound wave is directly transmitted to the left ear; and when the sound wave is transmitted to the left ear through the reflection, according to the order in which the reflection occurs, The sound pressure after the sound wave is reflected is calculated, wherein when the sound pressure calculated in sequence is greater than a preset intensity threshold, the first sound pressure transmitted by the sound wave to the left ear is calculated.
可选地,第二计算单元,配置为根据公式
Figure PCTCN2017082940-appb-000026
计算得到第一HRTF,其中,HL表示第一HRTF,
Figure PCTCN2017082940-appb-000027
表示第一声压,P0(r0,f)表示第三声压,rL表示声源与左耳之间的距离,θL表示声源相对于左耳的水平角,表示声源相对于左耳的仰角,f表示声波频率,a表示三维人头模型的生理参数,r0表示声源与三维人头模型的中心位置之间的距离。
Optionally, the second computing unit is configured to be according to a formula
Figure PCTCN2017082940-appb-000026
Calculating a first HRTF, where H L represents the first HRTF,
Figure PCTCN2017082940-appb-000027
Representing the first sound pressure, P 0 (r 0 , f) represents the third sound pressure, r L represents the distance between the sound source and the left ear, and θ L represents the horizontal angle of the sound source with respect to the left ear. Indicates the elevation angle of the sound source relative to the left ear, f represents the acoustic wave frequency, a represents the physiological parameter of the three-dimensional human head model, and r 0 represents the distance between the sound source and the center position of the three-dimensional human head model.
可选地,第一计算单元,配置为当声波直射传输至右耳时,计算声波传输至右耳的第二声压;当声波经过反射传输至右耳时,按照反射发生的先后顺序,依次计算声波发生反射后的声压,其中,当依次计算得到的声压均大于一预设强度阈值时,计算声波传输至右耳的第二声压。Optionally, the first calculating unit is configured to calculate a second sound pressure transmitted by the sound wave to the right ear when the sound wave is directly transmitted to the right ear; and when the sound wave is transmitted to the right ear through the reflection, according to the order in which the reflection occurs, The sound pressure after the sound wave is reflected is calculated, wherein when the sound pressure calculated in sequence is greater than a preset intensity threshold, the second sound pressure transmitted by the sound wave to the right ear is calculated.
可选地,第三计算单元,配置为根据公式
Figure PCTCN2017082940-appb-000029
计算得到第二HRTF,其中,HR表示第二HRTF,
Figure PCTCN2017082940-appb-000030
表示第二声压,P0(r0,f)表示第三声压,rR表示声源与右耳之间的距离,θR表示声源相对于右耳的水平角,
Figure PCTCN2017082940-appb-000031
表示声源相对于右耳的仰角,f表示声波频率,a表示三维人头模型的生理参数,r0表示声源与三维人头模型的中心位置之间的距离。
Optionally, the third computing unit is configured to be according to a formula
Figure PCTCN2017082940-appb-000029
Calculating a second HRTF, where H R represents a second HRTF,
Figure PCTCN2017082940-appb-000030
Representing the second sound pressure, P 0 (r 0 , f) represents the third sound pressure, r R represents the distance between the sound source and the right ear, and θ R represents the horizontal angle of the sound source with respect to the right ear,
Figure PCTCN2017082940-appb-000031
Indicates the elevation angle of the sound source relative to the right ear, f represents the acoustic wave frequency, a represents the physiological parameter of the three-dimensional human head model, and r 0 represents the distance between the sound source and the center position of the three-dimensional human head model.
本公开还提供了一种计算机存储介质,其中存储有计算机可执行指令,该计算机可执行指令配置执行上述声音信号的处理方法。The present disclosure also provides a computer storage medium having stored therein computer executable instructions configured to perform a method of processing the sound signal described above.
本公开还提供一种终端,所述终端包括存储器,处理器;其中,The present disclosure also provides a terminal, where the terminal includes a memory, a processor, where
所述存储器,用于存储用于执行上述声音信号的处理方法的计算机可执行程序;The memory for storing a computer executable program for executing a processing method of the sound signal described above;
所述处理器,用于从所述存储器中读取所述计算机可执行程序,根据所述计算机可执行程序执行上述声音信号的处理方法。The processor is configured to read the computer executable program from the memory, and execute the processing method of the sound signal according to the computer executable program.
以上所述的是本公开的优选实施方式,应当指出对于本技术领域的普通人员来说,在不脱离本公开所述的原理前提下还可以作出若干改进和润饰,这些改进和润饰也在本公开的保护范围内。The above is a preferred embodiment of the present disclosure, and it should be noted that those skilled in the art can also make several improvements and refinements without departing from the principles of the present disclosure. Within the scope of public protection.
工业实用性Industrial applicability
采用本公开提供的声音信号的处理方法,获取终端用户的三维人头模型,计算声源发出的声波传输至三维人头模型的左耳的第一HRTF以及传输至三维人头模型的右耳的第二HRTF,根据计算得到的第一HRTF和第二HRTF,对声源发出的信号进行处理,使得终端可以通过计算的方式获取终端用户个性化的第一HRTF和第二HRTF,并通过个性化的第一HRTF和第二HRTF对声音信号进行处理,从而使得终端用户能够获得更加真实的音效体验,解决了现有技术中虚拟听觉技术存在的由于非个性化的HRTF导致的虚拟听觉空间合成失真的问题,提高了用户的音效体验。 Using the processing method of the sound signal provided by the present disclosure, acquiring the three-dimensional human head model of the end user, calculating the first HRTF of the sound wave emitted by the sound source to the left ear of the three-dimensional human head model and the second HRTF of the right ear transmitted to the three-dimensional human head model And processing, according to the calculated first HRTF and the second HRTF, the signal sent by the sound source, so that the terminal can obtain the first HRTF and the second HRTF personalized by the terminal user by calculation, and pass the personalized first The HRTF and the second HRTF process the sound signal, so that the end user can obtain a more realistic sound experience, and solve the problem of the virtual auditory space synthesis distortion caused by the non-personalized HRTF in the prior art virtual hearing technology. Improve the user's sound experience.

Claims (15)

  1. 一种声音信号的处理方法,包括:A method for processing a sound signal, comprising:
    获取终端用户的三维人头模型;Obtaining a three-dimensional human head model of the end user;
    计算一声源发出的声波传输至所述三维人头模型的左耳的第一头部关联传递函数HRTF以及传输至所述三维人头模型的右耳的第二HRTF;Calculating a first head correlation transfer function HRTF transmitted by a sound source to a left ear of the three-dimensional human head model and a second HRTF transmitted to a right ear of the three-dimensional human head model;
    根据第一HRTF和第二HRTF,对所述声源发出的声音信号进行处理。The sound signal emitted by the sound source is processed according to the first HRTF and the second HRTF.
  2. 根据权利要求1所述的处理方法,其中,所述获取终端用户的三维人头模型,包括:The processing method according to claim 1, wherein the acquiring a three-dimensional human head model of the end user comprises:
    通过终端扫描获取终端用户的三维人头模型。The terminal user's three-dimensional human head model is obtained through terminal scanning.
  3. 根据权利要求1所述的处理方法,其中,计算一声源发出的声波传输至所述三维人头模型的左耳的第一头部关联传递函数HRTF以及传输至所述三维人头模型的右耳的第二HRTF,包括:The processing method according to claim 1, wherein a first head related transfer function HRTF for transmitting a sound wave emitted from a sound source to a left ear of the three-dimensional human head model and a right ear transmitted to the three-dimensional human head model are calculated Two HRTFs, including:
    计算所述声波传输至所述左耳的第一声压、所述声波传输至所述右耳的第二声压以及在所述三维人头模型移开后,所述声波在所述三维人头模型的原位置处产生的第三声压;Calculating a first sound pressure transmitted by the sound wave to the left ear, a second sound pressure transmitted by the sound wave to the right ear, and after the three-dimensional human head model is removed, the sound wave is in the three-dimensional human head model The third sound pressure generated at the original position;
    根据所述第一声压和第三声压,计算得到所述第一HRTF;Calculating the first HRTF according to the first sound pressure and the third sound pressure;
    根据所述第二声压和第三声压,计算得到所述第二HRTF。The second HRTF is calculated according to the second sound pressure and the third sound pressure.
  4. 根据权利要求3所述的处理方法,其中,计算所述声波传输至所述左耳的第一声压,包括:The processing method according to claim 3, wherein calculating the first sound pressure transmitted by the sound wave to the left ear comprises:
    当所述声波直射传输至所述左耳时,计算所述声波传输至所述左耳的第一声压;Calculating a first sound pressure transmitted by the sound wave to the left ear when the sound wave is directly transmitted to the left ear;
    当所述声波经过反射传输至所述左耳时,按照反射发生的先后顺序,依次计算所述声波发生反射后的声压,其中,当依次计算得到的所述声压均大于一预设强度阈值时,计算所述声波传输至所述左耳的第一声压。When the sound waves are transmitted to the left ear through reflection, the sound pressures after the sound waves are reflected are sequentially calculated according to the order in which the reflection occurs, wherein the sound pressures sequentially calculated are greater than a predetermined intensity. At the threshold, the first sound pressure transmitted by the sound wave to the left ear is calculated.
  5. 根据权利要求4所述的处理方法,其中,所述根据所述第一声压和 第三声压,计算得到所述第一HRTF,包括:The processing method according to claim 4, wherein said according to said first sound pressure sum The third sound pressure is calculated to obtain the first HRTF, including:
    根据公式
    Figure PCTCN2017082940-appb-100001
    计算得到第一HRTF,其中,HL表示第一HRTF,
    Figure PCTCN2017082940-appb-100002
    表示第一声压,P0(r0,f)表示第三声压,rL表示所述声源与所述左耳之间的距离,θL表示所述声源相对于所述左耳的水平角,
    Figure PCTCN2017082940-appb-100003
    表示所述声源相对于所述左耳的仰角,f表示声波频率,a表示所述三维人头模型的生理参数,r0表示所述声源与所述三维人头模型的中心位置之间的距离。
    According to the formula
    Figure PCTCN2017082940-appb-100001
    Calculating a first HRTF, where H L represents the first HRTF,
    Figure PCTCN2017082940-appb-100002
    Representing a first sound pressure, P 0 (r 0 , f) represents a third sound pressure, r L represents a distance between the sound source and the left ear, and θ L represents the sound source relative to the left ear Horizontal angle,
    Figure PCTCN2017082940-appb-100003
    Representing the elevation angle of the sound source with respect to the left ear, f representing the acoustic wave frequency, a representing the physiological parameter of the three-dimensional human head model, and r 0 indicating the distance between the sound source and the center position of the three-dimensional human head model .
  6. 根据权利要求3所述的处理方法,其中,计算所述声波传输至所述右耳的第二声压,包括:The processing method according to claim 3, wherein calculating the second sound pressure transmitted by the sound wave to the right ear comprises:
    当所述声波直射传输至所述右耳时,计算所述声波传输至所述右耳的第二声压;Calculating a second sound pressure transmitted by the sound wave to the right ear when the sound wave is directly transmitted to the right ear;
    当所述声波经过反射传输至所述右耳时,按照反射发生的先后顺序,依次计算所述声波发生反射后的声压,其中,当依次计算得到的所述声压均大于一预设强度阈值时,计算所述声波传输至所述右耳的第二声压。When the sound waves are transmitted to the right ear through reflection, the sound pressures after the sound waves are reflected are sequentially calculated according to the order in which the reflection occurs, wherein the sound pressures sequentially calculated are greater than a predetermined intensity. At the threshold, a second sound pressure transmitted by the sound wave to the right ear is calculated.
  7. 根据权利要求6所述的处理方法,其中,根据所述第二声压和第三声压,计算得到所述第二HRTF,包括:The processing method according to claim 6, wherein the calculating the second HRTF according to the second sound pressure and the third sound pressure comprises:
    根据公式
    Figure PCTCN2017082940-appb-100004
    计算得到第二HRTF,其中,HR表示第二HRTF,
    Figure PCTCN2017082940-appb-100005
    表示第二声压,P0(r0,f)表示第三声压,rR表示所述声源与所述右耳之间的距离,θR表示所述声源相对于所述右耳的水平角,
    Figure PCTCN2017082940-appb-100006
    表示所述声源相对于所述右耳的仰角,f表示声波频率,a表示所述三维人头模型的生理参数,r0表示所述声源与所述三维人头模型的中心位置之间的距离。
    According to the formula
    Figure PCTCN2017082940-appb-100004
    Calculating a second HRTF, where H R represents a second HRTF,
    Figure PCTCN2017082940-appb-100005
    Representing a second sound pressure, P 0 (r 0 , f) represents a third sound pressure, r R represents a distance between the sound source and the right ear, and θ R represents the sound source relative to the right ear Horizontal angle,
    Figure PCTCN2017082940-appb-100006
    Representing the elevation angle of the sound source with respect to the right ear, f representing the acoustic wave frequency, a representing the physiological parameter of the three-dimensional human head model, and r 0 indicating the distance between the sound source and the center position of the three-dimensional human head model .
  8. 一种终端,包括:A terminal comprising:
    获取模块,配置为获取终端用户的三维人头模型; Obtaining a module configured to acquire a three-dimensional human head model of the end user;
    计算模块,配置为计算一声源发出的声波传输至所述三维人头模型的左耳的第一头部关联传递函数HRTF以及传输至所述三维人头模型的右耳的第二HRTF;a calculation module configured to calculate a first head association transfer function HRTF transmitted by a sound source to a left ear of the three-dimensional human head model and a second HRTF transmitted to a right ear of the three-dimensional human head model;
    处理模块,配置为根据第一HRTF和第二HRTF,对所述声源发出的声音信号进行处理。The processing module is configured to process the sound signal emitted by the sound source according to the first HRTF and the second HRTF.
  9. 根据权利要求8所述的终端,其中,所述获取模块,配置为通过终端扫描获取终端用户的三维人头模型。The terminal according to claim 8, wherein the obtaining module is configured to acquire a three-dimensional human head model of the terminal user by scanning the terminal.
  10. 根据权利要求8所述的终端,其中,所述计算模块包括:The terminal of claim 8, wherein the calculation module comprises:
    第一计算单元,配置为计算所述声波传输至所述左耳的第一声压、所述声波传输至所述右耳的第二声压以及在所述三维人头模型移开后,所述声波在所述三维人头模型的原位置处产生的第三声压;a first calculating unit configured to calculate a first sound pressure transmitted by the sound wave to the left ear, a second sound pressure transmitted by the sound wave to the right ear, and after the three-dimensional human head model is removed, the a third sound pressure generated by the sound wave at the original position of the three-dimensional human head model;
    第二计算单元,配置为根据所述第一声压和第三声压,计算得到所述第一HRTF;a second calculating unit, configured to calculate the first HRTF according to the first sound pressure and the third sound pressure;
    第三计算单元,配置为根据所述第二声压和第三声压,计算得到所述第二HRTF。And a third calculating unit configured to calculate the second HRTF according to the second sound pressure and the third sound pressure.
  11. 根据权利要求10所述的终端,其中,所述第一计算单元,配置为当所述声波直射传输至所述左耳时,计算所述声波传输至所述左耳的第一声压;当所述声波经过反射传输至所述左耳时,按照反射发生的先后顺序,依次计算所述声波发生反射后的声压,其中,当依次计算得到的所述声压均大于一预设强度阈值时,计算所述声波传输至所述左耳的第一声压。The terminal according to claim 10, wherein the first calculating unit is configured to calculate a first sound pressure transmitted by the sound wave to the left ear when the sound wave is directly transmitted to the left ear; When the sound waves are transmitted to the left ear through reflection, the sound pressures after the sound waves are reflected are sequentially calculated according to the order in which the reflection occurs, wherein the sound pressures sequentially calculated are greater than a preset intensity threshold. At the time, the first sound pressure transmitted by the sound wave to the left ear is calculated.
  12. 根据权利要求11所述的终端,其中,所述第二计算单元,配置为根据公式
    Figure PCTCN2017082940-appb-100007
    计算得到第一HRTF,其中,HL表示第一HRTF,
    Figure PCTCN2017082940-appb-100008
    表示第一声压,P0(r0,f)表示第三声压,rL表示所述声源与所述左耳之间的距离,θL表示所述声源相对于所述左耳的水平角,
    Figure PCTCN2017082940-appb-100009
    表示所述声源相对于所述左耳的仰角,f表示声波频率, a表示所述三维人头模型的生理参数,r0表示所述声源与所述三维人头模型的中心位置之间的距离。
    The terminal according to claim 11, wherein the second calculating unit is configured according to a formula
    Figure PCTCN2017082940-appb-100007
    Calculating a first HRTF, where H L represents the first HRTF,
    Figure PCTCN2017082940-appb-100008
    Represents a sound pressure, P 0 (r 0, f ) represents a third sound pressure, r L denotes a distance between the sound source to the left ear, θ L represents the sound source with respect to the left ear Horizontal angle,
    Figure PCTCN2017082940-appb-100009
    Representing the elevation angle of the sound source with respect to the left ear, f representing the acoustic wave frequency, a representing the physiological parameter of the three-dimensional human head model, and r 0 indicating the distance between the sound source and the central position of the three-dimensional human head model .
  13. 根据权利要求10所述的终端,其中,所述第一计算单元,配置为当所述声波直射传输至所述右耳时,计算所述声波传输至所述右耳的第二声压;当所述声波经过反射传输至所述右耳时,按照反射发生的先后顺序,依次计算所述声波发生反射后的声压,其中,当依次计算得到的所述声压均大于一预设强度阈值时,计算所述声波传输至所述右耳的第二声压。The terminal according to claim 10, wherein the first calculating unit is configured to calculate a second sound pressure transmitted by the sound wave to the right ear when the sound wave is directly transmitted to the right ear; When the sound waves are transmitted to the right ear by reflection, the sound pressures after the sound waves are reflected are sequentially calculated according to the sequence in which the reflection occurs, wherein the sound pressures sequentially calculated are greater than a preset intensity threshold. At the time, the second sound pressure transmitted by the sound wave to the right ear is calculated.
  14. 根据权利要求13所述的终端,其中,所述第三计算单元,配置为根据公式
    Figure PCTCN2017082940-appb-100010
    计算得到第二HRTF,其中,HR表示第二HRTF,
    Figure PCTCN2017082940-appb-100011
    表示第二声压,P0(r0,f)表示第三声压,rR表示所述声源与所述右耳之间的距离,θR表示所述声源相对于所述右耳的水平角,
    Figure PCTCN2017082940-appb-100012
    表示所述声源相对于所述右耳的仰角,f表示声波频率,a表示所述三维人头模型的生理参数,r0表示所述声源与所述三维人头模型的中心位置之间的距离。
    The terminal according to claim 13, wherein the third calculating unit is configured according to a formula
    Figure PCTCN2017082940-appb-100010
    Calculating a second HRTF, where H R represents a second HRTF,
    Figure PCTCN2017082940-appb-100011
    Representing a second sound pressure, P 0 (r 0 , f) represents a third sound pressure, r R represents a distance between the sound source and the right ear, and θ R represents the sound source relative to the right ear Horizontal angle,
    Figure PCTCN2017082940-appb-100012
    Representing the elevation angle of the sound source with respect to the right ear, f representing the acoustic wave frequency, a representing the physiological parameter of the three-dimensional human head model, and r 0 indicating the distance between the sound source and the center position of the three-dimensional human head model .
  15. 一种计算机存储介质,存储有计算机可执行指令,该计算机可执行指令配置执行上述权利要求1-7任一项所述的声音信号的处理方法。 A computer storage medium storing computer executable instructions configured to perform a method of processing a sound signal according to any of the preceding claims 1-7.
PCT/CN2017/082940 2016-08-25 2017-05-03 Sound signal processing method, terminal, and computer storage medium WO2018036194A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201610724585.0 2016-08-25
CN201610724585.0A CN107786936A (en) 2016-08-25 2016-08-25 The processing method and terminal of a kind of voice signal

Publications (1)

Publication Number Publication Date
WO2018036194A1 true WO2018036194A1 (en) 2018-03-01

Family

ID=61245440

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2017/082940 WO2018036194A1 (en) 2016-08-25 2017-05-03 Sound signal processing method, terminal, and computer storage medium

Country Status (2)

Country Link
CN (1) CN107786936A (en)
WO (1) WO2018036194A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2020037984A1 (en) * 2018-08-20 2020-02-27 华为技术有限公司 Audio processing method and apparatus

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111886882A (en) * 2018-03-19 2020-11-03 OeAW奥地利科学院 Method for determining a listener specific head related transfer function
EP3827603A1 (en) 2018-07-25 2021-06-02 Dolby Laboratories Licensing Corporation Personalized hrtfs via optical capture
CN114205730A (en) 2018-08-20 2022-03-18 华为技术有限公司 Audio processing method and device

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1778143A (en) * 2003-09-08 2006-05-24 松下电器产业株式会社 Audio image control device design tool and audio image control device
CN102413414A (en) * 2010-10-13 2012-04-11 微软公司 System and method for high-precision 3-dimensional audio for augmented reality
CN105163242A (en) * 2015-09-01 2015-12-16 深圳东方酷音信息技术有限公司 Multi-angle 3D sound playback method and device
CN105611481A (en) * 2015-12-30 2016-05-25 北京时代拓灵科技有限公司 Man-machine interaction method and system based on space voices

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1778143A (en) * 2003-09-08 2006-05-24 松下电器产业株式会社 Audio image control device design tool and audio image control device
CN102413414A (en) * 2010-10-13 2012-04-11 微软公司 System and method for high-precision 3-dimensional audio for augmented reality
CN105163242A (en) * 2015-09-01 2015-12-16 深圳东方酷音信息技术有限公司 Multi-angle 3D sound playback method and device
CN105611481A (en) * 2015-12-30 2016-05-25 北京时代拓灵科技有限公司 Man-machine interaction method and system based on space voices

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2020037984A1 (en) * 2018-08-20 2020-02-27 华为技术有限公司 Audio processing method and apparatus
US11611841B2 (en) 2018-08-20 2023-03-21 Huawei Technologies Co., Ltd. Audio processing method and apparatus
US11910180B2 (en) 2018-08-20 2024-02-20 Huawei Technologies Co., Ltd. Audio processing method and apparatus

Also Published As

Publication number Publication date
CN107786936A (en) 2018-03-09

Similar Documents

Publication Publication Date Title
US11082791B2 (en) Head-related impulse responses for area sound sources located in the near field
US10939225B2 (en) Calibrating listening devices
CN109076305B (en) Augmented reality headset environment rendering
Bronkhorst Localization of real and virtual sound sources
US10602298B2 (en) Directional propagation
Spagnol et al. On the relation between pinna reflection patterns and head-related transfer function features
WO2018036194A1 (en) Sound signal processing method, terminal, and computer storage medium
Keyrouz Advanced binaural sound localization in 3-D for humanoid robots
US9609436B2 (en) Systems and methods for audio creation and delivery
Kahana et al. Boundary element simulations of the transfer function of human heads and baffled pinnae using accurate geometric models
Spagnol et al. Current use and future perspectives of spatial audio technologies in electronic travel aids
JP2014505420A (en) Audio system and operation method thereof
Schönstein et al. HRTF selection for binaural synthesis from a database using morphological parameters
CN111696513A (en) Audio signal processing method and device, electronic equipment and storage medium
Geronazzo et al. A minimal personalization of dynamic binaural synthesis with mixed structural modeling and scattering delay networks
Geronazzo et al. A head-related transfer function model for real-time customized 3-D sound rendering
Mokhtari et al. Computer simulation of KEMAR's head-related transfer functions: Verification with measurements and acoustic effects of modifying head shape and pinna concavity
US20200137507A1 (en) Sharing Locations where Binaural Sound Externally Localizes
KR100818660B1 (en) 3d sound generation system for near-field
US11315277B1 (en) Device to determine user-specific HRTF based on combined geometric data
Geronazzo et al. Acoustic selfies for extraction of external ear features in mobile audio augmented reality
JP2018152834A (en) Method and apparatus for controlling audio signal output in virtual auditory environment
Oldfield The analysis and improvement of focused source reproduction with wave field synthesis
WO2019174442A1 (en) Adapterization equipment, voice output method, device, storage medium and electronic device
Duraiswami et al. Capturing and recreating auditory virtual reality

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17842619

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 17842619

Country of ref document: EP

Kind code of ref document: A1