WO2018036194A1

WO2018036194A1 - Sound signal processing method, terminal, and computer storage medium

Info

Publication number: WO2018036194A1
Application number: PCT/CN2017/082940
Authority: WO
Inventors: 卢好峰
Original assignee: 中兴通讯股份有限公司
Priority date: 2016-08-25
Filing date: 2017-05-03
Publication date: 2018-03-01
Also published as: CN107786936A

Abstract

The present disclosure provides a sound signal processing method, a terminal, and a computer storage medium. The processing method comprises: obtaining a three-dimensional head model of a terminal user; computing a first head-related transfer function (HRTF) indicating that a sound wave sent by a sound source is transmitted to the left ear of the three-dimensional head model and a second HRTF indicating that the sound wave sent by the sound source is transmitted to the right ear of the three-dimensional head model; and processing a sound signal sent by the sound source according to the first HRTF and the second HRTF.

Description

Sound signal processing method, terminal, and computer storage medium

Cross-reference to related applications

The present application is based on a Chinese patent application filed on Aug. 25, 2016, the disclosure of which is hereby incorporated by reference.

Technical field

The present disclosure relates to audio processing technologies, and in particular, to a method and a terminal for processing a sound signal, and a computer storage medium.

Background

Virtual reality technology is becoming increasingly mature, but its core is currently focused on the visual aspect. When human beings perceive the objective world, however, they obtain about 60% of information through vision, about 30% through hearing, and the remaining 10% through senses such as touch and smell. Virtual hearing is therefore also an indispensable part of virtual reality technology, and a listener's perception of the position and distance of sounds is equally important for a realistic virtual reality experience.

Virtual hearing is based on audio signal processing technology and on certain acoustic properties of the human ear. Through simulation with acoustic algorithms, a sound source is reconstructed at an arbitrary position in three-dimensional space so that the direction of the sound source is reproduced. Virtual hearing is also an auditory rendering of a virtual environment composed of the sound source signal, the listener and the scene: the virtual sound at a specific position in a specific scene is obtained by numerical simulation and takes the place of the real auditory sensation. For VR devices currently worn by users, adding three-dimensional virtual hearing can provide a true sense of immersion.

Virtual hearing technology takes the physical and geometric conditions of a given sound source and environment and simulates the sound wave emitted by the sound source and its transmission process, so as to obtain the temporal and spatial information of the sound. It then uses head-related transfer function (HRTF) signal processing, which simulates the human ear, to synthesize the sound wave and convert the temporal and spatial information of the sound into a binaural sound signal, which is replayed to both ears through headphones. The HRTF is the basis of virtual hearing technology, and it can be reconstructed from the amplitude characteristics at the left and right ears and the time difference between the ears. Most HRTFs are obtained by laboratory measurement. Some laboratories in China and abroad have built HRTF databases through testing, measuring the HRTFs of specific populations for a subset of directions.

However, although the HRTF can be obtained by measurement, the measurement method is cumbersome, expensive and time-consuming, and a measured database contains only the HRTFs of a specific population. In real life, each person's physiological structure is different: head shape and outer ear size vary, so each person has a unique HRTF. A measured HRTF database therefore cannot represent the structural characteristics of all people. If experimentally obtained HRTF data is used for virtual auditory rendering, auditory differences are likely to occur during playback, and the listener may even be unable to discern the direction of the sound. In addition, a database of HRTF measurements contains data only for spatially discrete directions and cannot represent the position information of all directions, which leads to deviations in virtual hearing.

Summary of the invention

In order to solve the problem that distortion in virtual auditory space synthesis causes deviations in what the user hears, the present disclosure provides a method for processing a sound signal, a terminal, and a computer storage medium.

In order to solve the above technical problem, in a first aspect, the present disclosure provides a method for processing a sound signal, including:

acquiring a three-dimensional human head model of a terminal user;

calculating a first head-related transfer function (HRTF) with which a sound wave emitted by a sound source is transmitted to the left ear of the three-dimensional human head model, and a second HRTF with which the sound wave is transmitted to the right ear of the three-dimensional human head model; and

processing a sound signal emitted by the sound source according to the first HRTF and the second HRTF.

Optionally, the step of acquiring the three-dimensional human head model of the terminal user comprises: acquiring the three-dimensional human head model of the terminal user by scanning with the terminal.

Optionally, the step of calculating the first HRTF with which the sound wave emitted by the sound source is transmitted to the left ear of the three-dimensional human head model and the second HRTF with which it is transmitted to the right ear of the three-dimensional human head model comprises: calculating a first sound pressure transmitted by the sound wave to the left ear, a second sound pressure transmitted by the sound wave to the right ear, and a third sound pressure generated by the sound wave at the original position of the three-dimensional human head model after the three-dimensional human head model is removed; calculating the first HRTF according to the first sound pressure and the third sound pressure; and calculating the second HRTF according to the second sound pressure and the third sound pressure.

Optionally, the step of calculating the first sound pressure transmitted by the sound wave to the left ear comprises: when the sound wave is transmitted directly to the left ear, calculating the first sound pressure transmitted by the sound wave to the left ear; and when the sound wave is transmitted to the left ear by reflection, calculating the sound pressure after each reflection of the sound wave in the order in which the reflections occur, wherein the first sound pressure transmitted by the sound wave to the left ear is calculated when the sound pressures calculated in sequence are all greater than a preset intensity threshold.

Optionally, the step of calculating the first HRTF according to the first sound pressure and the third sound pressure comprises: calculating the first HRTF according to the formula

$$H_L = \frac{P_L(r_L, \theta_L, \varphi_L, f, a)}{P_0(r_0, f)}$$

where $H_L$ represents the first HRTF, $P_L(r_L, \theta_L, \varphi_L, f, a)$ represents the first sound pressure, $P_0(r_0, f)$ represents the third sound pressure, $r_L$ represents the distance between the sound source and the left ear, $\theta_L$ represents the horizontal angle of the sound source relative to the left ear, $\varphi_L$ represents the elevation angle of the sound source relative to the left ear, $f$ represents the sound wave frequency, $a$ represents the physiological parameter of the three-dimensional human head model, and $r_0$ represents the distance between the sound source and the center position of the three-dimensional human head model.

Optionally, the step of calculating the second sound pressure transmitted by the sound wave to the right ear comprises: when the sound wave is transmitted directly to the right ear, calculating the second sound pressure transmitted by the sound wave to the right ear; and when the sound wave is transmitted to the right ear by reflection, calculating the sound pressure after each reflection of the sound wave in the order in which the reflections occur, wherein the second sound pressure transmitted by the sound wave to the right ear is calculated when the sound pressures calculated in sequence are all greater than a preset intensity threshold.

Optionally, the step of calculating the second HRTF according to the second sound pressure and the third sound pressure comprises: calculating the second HRTF according to the formula

$$H_R = \frac{P_R(r_R, \theta_R, \varphi_R, f, a)}{P_0(r_0, f)}$$

where $H_R$ represents the second HRTF, $P_R(r_R, \theta_R, \varphi_R, f, a)$ represents the second sound pressure, $P_0(r_0, f)$ represents the third sound pressure, $r_R$ represents the distance between the sound source and the right ear, $\theta_R$ represents the horizontal angle of the sound source relative to the right ear, $\varphi_R$ represents the elevation angle of the sound source relative to the right ear, $f$ represents the sound wave frequency, $a$ represents the physiological parameter of the three-dimensional human head model, and $r_0$ represents the distance between the sound source and the center position of the three-dimensional human head model.

In a second aspect, the present disclosure also provides a terminal, including:

an acquiring module configured to acquire a three-dimensional human head model of a terminal user;

a calculation module configured to calculate a first head-related transfer function (HRTF) with which a sound wave emitted by a sound source is transmitted to the left ear of the three-dimensional human head model, and a second HRTF with which the sound wave is transmitted to the right ear of the three-dimensional human head model; and

a processing module configured to process a sound signal emitted by the sound source according to the first HRTF and the second HRTF.

Optionally, the acquiring module is configured to acquire the three-dimensional human head model of the terminal user by scanning with the terminal.

Optionally, the calculation module includes: a first calculating unit configured to calculate a first sound pressure transmitted by the sound wave to the left ear, a second sound pressure transmitted by the sound wave to the right ear, and a third sound pressure generated by the sound wave at the original position of the three-dimensional human head model after the three-dimensional human head model is removed; a second calculating unit configured to calculate the first HRTF according to the first sound pressure and the third sound pressure; and a third calculating unit configured to calculate the second HRTF according to the second sound pressure and the third sound pressure.

Optionally, the first calculating unit is configured to: when the sound wave is transmitted directly to the left ear, calculate the first sound pressure transmitted by the sound wave to the left ear; and when the sound wave is transmitted to the left ear by reflection, calculate the sound pressure after each reflection of the sound wave in the order in which the reflections occur, wherein the first sound pressure transmitted by the sound wave to the left ear is calculated when the sound pressures calculated in sequence are all greater than a preset intensity threshold.

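Optionally, the second calculating unit is configured to calculate the first HRTF according to the formula

$$H_L = \frac{P_L(r_L, \theta_L, \varphi_L, f, a)}{P_0(r_0, f)}$$

where $H_L$ represents the first HRTF and the remaining symbols have the meanings given for the method of the first aspect.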

Optionally, the first calculating unit is configured to: when the sound wave is transmitted directly to the right ear, calculate the second sound pressure transmitted by the sound wave to the right ear; and when the sound wave is transmitted to the right ear by reflection, calculate the sound pressure after each reflection of the sound wave in the order in which the reflections occur, wherein the second sound pressure transmitted by the sound wave to the right ear is calculated when the sound pressures calculated in sequence are all greater than a preset intensity threshold.

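Optionally, the third calculating unit is configured to calculate the second HRTF according to the formula

$$H_R = \frac{P_R(r_R, \theta_R, \varphi_R, f, a)}{P_0(r_0, f)}$$

where $H_R$ represents the second HRTF and the remaining symbols have the meanings given for the method of the first aspect.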

When performing processing, the acquiring module, the calculation module, the processing module, the first calculating unit, the second calculating unit, and the third calculating unit may be implemented by a central processing unit (CPU), a digital signal processor (DSP), or a field-programmable gate array (FPGA).

In a third aspect, the present disclosure also provides a computer storage medium storing computer executable instructions configured to perform the above method for processing a sound signal.

The beneficial effects that can be achieved with the present disclosure are:

The present disclosure first acquires a three-dimensional human head model of the terminal user, then calculates a first HRTF with which the sound wave emitted by the sound source is transmitted to the left ear of the three-dimensional human head model and a second HRTF with which it is transmitted to the right ear, and finally processes the sound signal emitted by the sound source according to the calculated first HRTF and second HRTF. The terminal can thus obtain, by calculation, a first HRTF and a second HRTF personalized for the terminal user and process the sound signal with them, so that the terminal user obtains a more realistic sound experience. This solves the problem in prior-art virtual hearing technology that non-personalized HRTFs distort virtual auditory space synthesis, and it improves the user's sound experience.

Brief description of the drawings

FIG. 1 is a flow chart showing the steps of a method for processing a sound signal in an embodiment of the present disclosure;

FIG. 2 is a flow chart showing the steps of another method for processing a sound signal in an embodiment of the present disclosure;

FIG. 3 is a schematic diagram showing a scene in which sound waves are transmitted indoors;

FIG. 4 is a block diagram showing the structure of a terminal in an embodiment of the present disclosure.

Detailed description

Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. Although exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be implemented in various forms and should not be limited by the embodiments set forth herein. Rather, these embodiments are provided so that the present disclosure will be more thoroughly understood and so that its scope can be fully conveyed to those skilled in the art.

FIG. 1 is a flow chart showing the steps of a method for processing a sound signal according to an embodiment of the present disclosure. The processing method includes:

Step 101: Acquire a three-dimensional human head model of the terminal user.

In this step, optionally, the three-dimensional human head model of the terminal user may be acquired by scanning with the terminal. Optionally, the three-dimensional human head model may also include one-third of the terminal user's torso. After the terminal has scanned the user's three-dimensional human head model, modeling can be performed on it to obtain the physiological parameters and the like of the three-dimensional human head model.
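As a rough illustration of how physiological parameters might be derived from a scanned model, the sketch below estimates head width, depth and height from the bounding box of the mesh vertices. The vertex format, the axis convention (x from ear to ear, y front to back, z bottom to top) and the names `HeadParameters` and `estimate_head_parameters` are assumptions made for this example only; the disclosure does not specify which parameters are extracted or how. The sketch also assumes that only head vertices are passed in, with any scanned torso portion already separated.

```python
from dataclasses import dataclass
from typing import Iterable, Tuple

Vertex = Tuple[float, float, float]  # (x, y, z) in metres; hypothetical scan output


@dataclass
class HeadParameters:
    """Hypothetical physiological parameters (the symbol `a` in the text)."""
    width: float   # extent along x (ear to ear)
    depth: float   # extent along y (front to back)
    height: float  # extent along z (bottom to top)

    @property
    def radius(self) -> float:
        # Equivalent-sphere radius: half the mean of the three extents.
        return (self.width + self.depth + self.height) / 6.0


def estimate_head_parameters(vertices: Iterable[Vertex]) -> HeadParameters:
    """Estimate head extents from the axis-aligned bounding box of the scan."""
    pts = list(vertices)
    if not pts:
        raise ValueError("the scanned model contains no vertices")
    xs, ys, zs = zip(*pts)
    return HeadParameters(
        width=max(xs) - min(xs),
        depth=max(ys) - min(ys),
        height=max(zs) - min(zs),
    )
```

A bounding box is the simplest possible choice; a practical system would more likely fit landmarks such as the ear canal positions and pinna dimensions, which this sketch does not attempt.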

Step 102: Calculate a first head-related transfer function (HRTF) with which a sound wave emitted by a sound source is transmitted to the left ear of the three-dimensional human head model, and a second HRTF with which the sound wave is transmitted to the right ear of the three-dimensional human head model.

In this step, optionally, after the three-dimensional human head model of the terminal user has been acquired, the first HRTF with which the sound wave emitted by the sound source is transmitted to the left ear of the three-dimensional human head model and the second HRTF with which it is transmitted to the right ear can be obtained by calculation. In this way, a personalized HRTF unique to each terminal user can be obtained by calculation.

Step 103: Process the sound signal emitted by the sound source according to the first HRTF and the second HRTF.

In this step, optionally, the sound signal emitted by the sound source may be processed according to the calculated first HRTF and second HRTF. Because the sound signal is processed with the personalized HRTFs unique to each terminal user, virtual reality technology no longer needs to process the sound signal received by the terminal user with HRTFs taken from a laboratory-measured HRTF database, which improves the terminal user's auditory perception.

In this way, this embodiment acquires the three-dimensional human head model of the terminal user, then calculates the first HRTF with which the sound wave emitted by the sound source is transmitted to the left ear of the three-dimensional human head model and the second HRTF with which it is transmitted to the right ear, and finally processes the sound signal emitted by the sound source according to the calculated first HRTF and second HRTF. The terminal can thus obtain, by calculation, a first HRTF and a second HRTF personalized for the terminal user and process the sound signal with them, so that the terminal user obtains a more realistic sound experience. This solves the problem in prior-art virtual hearing technology that non-personalized HRTFs distort virtual auditory space synthesis, and it improves the user's sound experience.

FIG. 2 is a flow chart showing the steps of another method for processing a sound signal according to an embodiment of the present disclosure. The processing method includes:

Step 201: Acquire a three-dimensional human head model of the terminal user.

Step 202: Calculate a first sound pressure transmitted by the sound wave to the left ear, a second sound pressure transmitted by the sound wave to the right ear, and a third sound pressure generated by the sound wave at the original position of the three-dimensional human head model after the three-dimensional human head model is removed.

In this step, optionally, when calculating the first sound pressure transmitted by the sound wave to the left ear of the three-dimensional human head model, if the sound wave is transmitted directly to the left ear, the first sound pressure transmitted by the sound wave to the left ear can be calculated directly. If the sound wave is transmitted to the left ear by reflection, the sound pressure after each reflection of the sound wave may be calculated in the order in which the reflections occur, and the first sound pressure transmitted by the sound wave to the left ear is calculated when the sound pressures calculated in sequence are all greater than a preset intensity threshold. Optionally, when calculating the first sound pressure, the first sound intensity transmitted to the left ear of the three-dimensional human head model may be calculated first.

Likewise, when calculating the second sound pressure transmitted by the sound wave to the right ear of the three-dimensional human head model, if the sound wave is transmitted directly to the right ear, the second sound pressure transmitted by the sound wave to the right ear can be calculated directly. If the sound wave is transmitted to the right ear by reflection, the sound pressure after each reflection of the sound wave may be calculated in the order in which the reflections occur, and the second sound pressure transmitted by the sound wave to the right ear is calculated when the sound pressures calculated in sequence are all greater than a preset intensity threshold. Optionally, when calculating the second sound pressure, the second sound intensity transmitted to the right ear of the three-dimensional human head model may be calculated first.

This will be specifically described below.

Sound wave transmission can be divided into direct transmission and reflected transmission. Sound waves are mostly transmitted directly outdoors and mostly by reflection indoors. FIG. 3 is a schematic diagram of a scene in which sound waves are transmitted indoors. In FIG. 3, A is the three-dimensional human head model, B is the sound source, C is an indoor wall surface, the solid line is the direct transmission path of the sound wave, and the broken line is the reflected transmission path of the sound wave.

When the sound wave emitted by sound source B is transmitted directly to the three-dimensional human head model A, the sound intensity that sound source B produces at the three-dimensional human head model can be calculated from the parameters $r$ and $e$ using the sound intensity formula, where $r$ is the distance between sound source B and the three-dimensional human head model and $e$ is the energy emitted by sound source B per second. The share of $e$ radiated in each direction depends on the parameters $\theta$ and $\varphi$, where $\theta$ represents the horizontal angle of sound source B relative to the receiver and $\varphi$ represents the elevation angle of sound source B relative to the receiver. In addition, since the sound pressure received by the three-dimensional human head model also depends on the sound wave frequency $f$ and on the physiological parameter of the three-dimensional human head model, the first sound pressure generated by the sound source at the left ear of the three-dimensional human head model can be calculated from the parameters $r_L$, $\theta_L$, $\varphi_L$, $f$ and $a$, and the second sound pressure generated by the sound source at the right ear can be calculated from the parameters $r_R$, $\theta_R$, $\varphi_R$, $f$ and $a$. Here $r_L$ represents the distance between the sound source and the left ear, $\theta_L$ represents the horizontal angle of the sound source relative to the left ear, $\varphi_L$ represents the elevation angle of the sound source relative to the left ear, $f$ represents the sound wave frequency, $a$ represents the physiological parameter of the three-dimensional human head model, $r_R$ represents the distance between the sound source and the right ear, $\theta_R$ represents the horizontal angle of the sound source relative to the right ear, and $\varphi_R$ represents the elevation angle of the sound source relative to the right ear.
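The direct-path calculation can be sketched as follows. The text does not give the sound intensity formula itself, so this example assumes the standard free-field point-source relation, intensity equal to emitted power divided by $4\pi r^2$, with an optional directivity factor standing in for the dependence on $\theta$ and $\varphi$, and the plane-wave relation $I = p^2 / (\rho c)$ to convert intensity into sound pressure. The function names and the air constants are illustrative, and the dependence on $f$ and $a$ (scattering by the head) is deliberately not modeled.

```python
import math
from typing import Callable

RHO_AIR = 1.21   # air density in kg/m^3 (assumed value, roughly 20 degrees C)
C_AIR = 343.0    # speed of sound in m/s (assumed value, roughly 20 degrees C)

Directivity = Callable[[float, float], float]


def omnidirectional(theta: float, phi: float) -> float:
    """Directivity factor of a source that radiates equally in all directions."""
    return 1.0


def direct_intensity(e: float, r: float, theta: float = 0.0, phi: float = 0.0,
                     directivity: Directivity = omnidirectional) -> float:
    """Free-field intensity (W/m^2) at distance r (m) from a source of power e (W)."""
    if r <= 0:
        raise ValueError("distance r must be positive")
    return e * directivity(theta, phi) / (4.0 * math.pi * r * r)


def intensity_to_pressure(intensity: float) -> float:
    """RMS sound pressure (Pa) from intensity, using I = p^2 / (rho * c)."""
    if intensity < 0:
        raise ValueError("intensity must be non-negative")
    return math.sqrt(intensity * RHO_AIR * C_AIR)
```

Under these assumptions, doubling the distance quarters the intensity and halves the sound pressure, which is why the distance terms $r_L$, $r_R$ and $r_0$ appear as separate parameters in the pressure calculations.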

In addition, when the sound wave emitted by sound source B reaches the three-dimensional human head model A by reflection, the sound wave is attenuated according to the number of reflections and the path travelled. It is therefore necessary to obtain the collision points between the sound wave and the indoor wall surface C along the reflected transmission path, to trace the reflections of the sound wave from the obtained collision points, to calculate the sound intensity after each reflection in the order in which the reflections occur, and to convert the sound intensity into sound pressure. If the sound pressure falls below a preset intensity threshold during reflection, that is, the sound wave effectively does not reach the three-dimensional human head model, the sound wave need not be processed further. If the sound pressures calculated in sequence are all greater than the preset intensity threshold, the sound pressure produced by the sound wave on arrival at the three-dimensional human head model must also be calculated. Optionally, when calculating the sound pressure transmitted to the three-dimensional human head model A by reflection, the reflection coefficient and the scattering coefficient must be considered in addition to the parameters $r$, $\theta$, $\varphi$, $f$ and $a$. The reflection coefficient and the scattering coefficient differ depending on the reflecting medium.
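The thresholded, reflection-by-reflection calculation can be sketched as below. This is a minimal illustration under stated assumptions, not the method of the disclosure: pressure is assumed to fall off as the inverse of the total path length, each wall multiplies the pressure by its reflection coefficient, and only the non-scattered fraction of energy is assumed to continue along the specular path. The names `Bounce` and `reflected_pressure` and the reference pressure `p_ref` (source pressure at 1 m) are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass
class Bounce:
    """One wall collision on a reflected path (all fields are illustrative)."""
    distance: float     # path length travelled since the previous point, in metres
    reflection: float   # pressure reflection coefficient of the wall, 0..1
    scattering: float   # fraction of energy scattered off the specular path, 0..1


def reflected_pressure(p_ref: float, bounces: Sequence[Bounce],
                       final_distance: float, threshold: float) -> Optional[float]:
    """Pressure arriving at the head via a reflected path, or None if the path is dropped.

    p_ref is the source pressure at a reference distance of 1 m.
    """
    travelled = 0.0
    gain = 1.0
    for bounce in bounces:  # reflections are handled in the order they occur
        if bounce.distance <= 0:
            raise ValueError("each path segment must have positive length")
        travelled += bounce.distance
        # Energy fraction (1 - s) corresponds to a pressure factor sqrt(1 - s).
        gain *= bounce.reflection * (1.0 - bounce.scattering) ** 0.5
        if p_ref * gain / travelled < threshold:
            return None  # below the preset threshold: stop processing this path
    travelled += final_distance
    if travelled <= 0:
        raise ValueError("total path length must be positive")
    return p_ref * gain / travelled
```

Dropping a path as soon as one intermediate pressure falls below the threshold keeps the cost bounded, since later reflections can only weaken the wave further under these assumptions.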

Thus, referring to the above manner, the first sound pressure transmitted by the sound wave to the left ear of the three-dimensional human head model by direct or reflective means, the second sound pressure transmitted to the right ear of the three-dimensional human head model, and the sound wave at the original position of the three-dimensional human head model can be calculated. The third sound pressure produced.

Step 203: Calculate the first HRTF according to the first sound pressure and the third sound pressure.

In this step, optionally, once the first sound pressure generated by the sound wave at the left ear of the three-dimensional human head model and the third sound pressure generated at the original position of the three-dimensional human head model have been obtained, the first HRTF can be calculated according to the formula

$$H_L = \frac{P_L(r_L, \theta_L, \varphi_L, f, a)}{P_0(r_0, f)}$$

where $H_L$ represents the first HRTF, $P_L(r_L, \theta_L, \varphi_L, f, a)$ represents the first sound pressure, $P_0(r_0, f)$ represents the third sound pressure, $r_L$ represents the distance between the sound source and the left ear, $\theta_L$ represents the horizontal angle of the sound source relative to the left ear, $\varphi_L$ represents the elevation angle of the sound source relative to the left ear, $f$ represents the sound wave frequency, $a$ represents the physiological parameter of the three-dimensional human head model, and $r_0$ represents the distance between the sound source and the center position of the three-dimensional human head model.

Step 204: Calculate a second HRTF according to the second sound pressure and the third sound pressure.

In this step, optionally, once the second sound pressure generated by the sound wave at the right ear of the three-dimensional human head model and the third sound pressure generated at the original position of the three-dimensional human head model have been obtained, the second HRTF can be calculated according to the formula

$$H_R = \frac{P_R(r_R, \theta_R, \varphi_R, f, a)}{P_0(r_0, f)}$$

where $H_R$ represents the second HRTF, $P_R(r_R, \theta_R, \varphi_R, f, a)$ represents the second sound pressure, $P_0(r_0, f)$ represents the third sound pressure, $r_R$ represents the distance between the sound source and the right ear, $\theta_R$ represents the horizontal angle of the sound source relative to the right ear, $\varphi_R$ represents the elevation angle of the sound source relative to the right ear, $f$ represents the sound wave frequency, $a$ represents the physiological parameter of the three-dimensional human head model, and $r_0$ represents the distance between the sound source and the center position of the three-dimensional human head model.
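Steps 203 and 204 can be illustrated with a short sketch that forms each HRTF as the per-frequency ratio of an ear pressure to the pressure at the head position with the head removed, which is the standard definition of the HRTF and is assumed here to be what the formulas of these steps express. The representation of each pressure as a list of complex values, one per frequency bin, and the names `hrtf` and `hrtf_pair` are assumptions made for the example.

```python
from typing import List, Sequence, Tuple


def hrtf(p_ear: Sequence[complex], p_free: Sequence[complex]) -> List[complex]:
    """Per-frequency ratio H(f) = P_ear(f) / P_0(f)."""
    if len(p_ear) != len(p_free):
        raise ValueError("pressure spectra must have the same number of bins")
    if any(p0 == 0 for p0 in p_free):
        raise ValueError("free-field pressure must be non-zero in every bin")
    return [pe / p0 for pe, p0 in zip(p_ear, p_free)]


def hrtf_pair(p_left: Sequence[complex], p_right: Sequence[complex],
              p_free: Sequence[complex]) -> Tuple[List[complex], List[complex]]:
    """Return (H_L, H_R) computed against the same third sound pressure P_0."""
    return hrtf(p_left, p_free), hrtf(p_right, p_free)
```

Both HRTFs share the same denominator, so the third sound pressure only needs to be computed once per source position.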

Step 205: Process the sound signal emitted by the sound source according to the first HRTF and the second HRTF.
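The disclosure does not spell out how the sound signal is processed with the two HRTFs. One common approach, assumed here purely for illustration, is to multiply the spectrum of the mono source signal by each HRTF and transform back to obtain the left and right headphone signals. The sketch uses a naive discrete Fourier transform so that it needs only the Python standard library, and it assumes each HRTF is sampled at exactly the DFT bins of the frame; the function names are hypothetical.

```python
import cmath
from typing import List, Sequence, Tuple


def dft(x: Sequence[complex]) -> List[complex]:
    """Naive O(N^2) discrete Fourier transform (kept simple for illustration)."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
            for k in range(n)]


def idft(spectrum: Sequence[complex]) -> List[complex]:
    """Inverse of dft()."""
    n = len(spectrum)
    return [sum(spectrum[k] * cmath.exp(2j * cmath.pi * k * t / n) for k in range(n)) / n
            for t in range(n)]


def apply_hrtf(frame: Sequence[float], h: Sequence[complex]) -> List[float]:
    """Filter one frame with an HRTF given at the frame's DFT bins."""
    if len(frame) != len(h):
        raise ValueError("HRTF must have one value per DFT bin of the frame")
    spectrum = dft(frame)
    filtered = idft([s * hk for s, hk in zip(spectrum, h)])
    return [y.real for y in filtered]  # keep the real part of the result


def render_binaural(frame: Sequence[float], h_left: Sequence[complex],
                    h_right: Sequence[complex]) -> Tuple[List[float], List[float]]:
    """Return the (left, right) headphone signals for a mono frame."""
    return apply_hrtf(frame, h_left), apply_hrtf(frame, h_right)
```

Multiplying DFT bins in this way performs circular convolution within the frame. A practical implementation would more likely use a fast Fourier transform with overlap-add or overlap-save processing across frames, which this sketch omits.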

In this way, this embodiment of the present disclosure calculates the first sound pressure transmitted by the sound wave from the sound source to the left ear of the three-dimensional human head model, the second sound pressure transmitted to the right ear of the three-dimensional human head model, and the third sound pressure generated by the sound wave at the original position of the three-dimensional human head model. From these it calculates the first HRTF for the left ear and the second HRTF for the right ear, and it finally processes the sound signal emitted by the sound source with the calculated first HRTF and second HRTF. This simplifies the process of obtaining the HRTF: the terminal user obtains personalized HRTF parameters, and the sound signal is processed according to the personalized HRTF without reference to an experimentally measured HRTF database. This solves the problem in prior-art virtual hearing technology that non-personalized HRTFs distort virtual auditory space synthesis, and it improves the user's sound experience.

FIG. 4 is a structural block diagram of a terminal in an embodiment of the present disclosure. The terminal includes:

The acquiring module 401 is configured to acquire a three-dimensional human head model of the terminal user;

The calculation module 402 is configured to calculate a first head-related transfer function (HRTF) with which a sound wave emitted by a sound source is transmitted to the left ear of the three-dimensional human head model, and a second HRTF with which the sound wave is transmitted to the right ear of the three-dimensional human head model;

The processing module 403 is configured to process the sound signal emitted by the sound source according to the first HRTF and the second HRTF.

Optionally, the acquiring module 401 is configured to acquire the three-dimensional human head model of the terminal user by scanning with the terminal.

Optionally, the calculation module 402 includes: a first calculating unit configured to calculate a first sound pressure transmitted by the sound wave to the left ear, a second sound pressure transmitted by the sound wave to the right ear, and a third sound pressure generated by the sound wave at the original position of the three-dimensional human head model after the three-dimensional human head model is removed; a second calculating unit configured to calculate the first HRTF according to the first sound pressure and the third sound pressure; and a third calculating unit configured to calculate the second HRTF according to the second sound pressure and the third sound pressure.

Optionally, the first calculating unit is configured to: when the sound wave is transmitted directly to the left ear, calculate the first sound pressure transmitted by the sound wave to the left ear; and when the sound wave is transmitted to the left ear by reflection, calculate the sound pressure after each reflection of the sound wave in the order in which the reflections occur, wherein the first sound pressure transmitted by the sound wave to the left ear is calculated when the sound pressures calculated in sequence are all greater than a preset intensity threshold.

Optionally, the second calculating unit is configured to calculate the first HRTF according to the formula

$$H_L = \frac{P_L(r_L, \theta_L, \varphi_L, f, a)}{P_0(r_0, f)}$$

where $H_L$ represents the first HRTF, $P_L(r_L, \theta_L, \varphi_L, f, a)$ represents the first sound pressure, $P_0(r_0, f)$ represents the third sound pressure, $r_L$ represents the distance between the sound source and the left ear, $\theta_L$ represents the horizontal angle of the sound source relative to the left ear, $\varphi_L$ represents the elevation angle of the sound source relative to the left ear, $f$ represents the sound wave frequency, $a$ represents the physiological parameter of the three-dimensional human head model, and $r_0$ represents the distance between the sound source and the center position of the three-dimensional human head model.

Optionally, the first calculating unit is configured to: when the sound wave is transmitted directly to the right ear, calculate the second sound pressure transmitted by the sound wave to the right ear; and when the sound wave is transmitted to the right ear by reflection, calculate the sound pressure after each reflection of the sound wave in the order in which the reflections occur, wherein the second sound pressure transmitted by the sound wave to the right ear is calculated when the sound pressures calculated in sequence are all greater than a preset intensity threshold.

Optionally, the third calculating unit is configured to calculate the second HRTF according to the formula

$$H_R = \frac{P_R(r_R, \theta_R, \varphi_R, f, a)}{P_0(r_0, f)}$$

where $H_R$ represents the second HRTF, $P_R(r_R, \theta_R, \varphi_R, f, a)$ represents the second sound pressure, $P_0(r_0, f)$ represents the third sound pressure, $r_R$ represents the distance between the sound source and the right ear, $\theta_R$ represents the horizontal angle of the sound source relative to the right ear, $\varphi_R$ represents the elevation angle of the sound source relative to the right ear, $f$ represents the sound wave frequency, $a$ represents the physiological parameter of the three-dimensional human head model, and $r_0$ represents the distance between the sound source and the center position of the three-dimensional human head model.
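The geometric parameters used above (distance, horizontal angle, and elevation angle of the sound source relative to an ear) can be derived from Cartesian positions, as in the sketch below. The coordinate convention (x forward, y to the left, z up, angles in radians) and the function name are assumptions chosen for illustration; the disclosure does not fix a coordinate system.

```python
import math


def source_relative_to_ear(source, ear):
    """Return (r, theta, phi) of a source as seen from an ear position.

    source, ear: (x, y, z) tuples in meters, in an assumed frame with
    x forward, y to the left, and z up.
    r     : distance between the source and the ear
    theta : horizontal angle, measured from the x axis toward the y axis
    phi   : elevation angle above the horizontal plane
    """
    dx, dy, dz = (s - e for s, e in zip(source, ear))
    r = math.sqrt(dx * dx + dy * dy + dz * dz)
    if r == 0.0:
        raise ValueError("source and ear positions coincide")
    theta = math.atan2(dy, dx)
    phi = math.asin(dz / r)
    return r, theta, phi
```

The same function can be called once with the left-ear position and once with the right-ear position taken from the three-dimensional human head model, and once with the head-center position to obtain the reference distance.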

The present disclosure also provides a computer storage medium storing computer-executable instructions configured to perform the sound signal processing method described above.

The present disclosure also provides a terminal including a memory and a processor, wherein:

the memory is configured to store a computer-executable program for executing the sound signal processing method described above; and

the processor is configured to read the computer-executable program from the memory and to execute the sound signal processing method according to the computer-executable program.

The above are preferred embodiments of the present disclosure. It should be noted that those skilled in the art can make several improvements and refinements without departing from the principles of the present disclosure, and such improvements and refinements shall also fall within the protection scope of the present disclosure.

Industrial applicability

With the sound signal processing method provided by the present disclosure, a three-dimensional human head model of the terminal user is acquired; a first HRTF for the sound wave emitted by the sound source and transmitted to the left ear of the three-dimensional human head model, and a second HRTF for the sound wave transmitted to the right ear of the three-dimensional human head model, are calculated; and the sound signal emitted by the sound source is processed according to the calculated first HRTF and second HRTF. The terminal can thus obtain, by calculation, a first HRTF and a second HRTF personalized to the terminal user and process the sound signal with them, so that the terminal user obtains a more realistic sound experience. This solves the problem in prior-art virtual hearing technology of distorted virtual auditory space synthesis caused by non-personalized HRTFs, and improves the user's sound experience.
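The final step, processing the sound signal according to the first HRTF and the second HRTF, can be illustrated with the sketch below. It assumes that each HRTF is supplied as one complex value per frequency bin of a discrete Fourier transform of the signal block, and that processing means multiplying the signal spectrum by each HRTF and transforming back to the time domain. The naive transform and all function names are illustrative choices, not the method of the disclosure; a practical implementation would use a fast Fourier transform and block-wise processing.

```python
import cmath
import math


def dft(samples):
    """Naive discrete Fourier transform (O(n^2)); for illustration only."""
    n = len(samples)
    return [
        sum(samples[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


def idft(spectrum):
    """Naive inverse transform; keeps only the real part of each sample."""
    n = len(spectrum)
    return [
        sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)).real / n
        for t in range(n)
    ]


def render_binaural(signal, hrtf_left, hrtf_right):
    """Filter a mono signal block with per-bin left and right HRTF values.

    Bin-wise multiplication corresponds to circular convolution, and the
    HRTF values should be conjugate-symmetric for the output to be real;
    both points are simplifications of this sketch.
    """
    if not (len(signal) == len(hrtf_left) == len(hrtf_right)):
        raise ValueError("signal and HRTFs must have the same length")
    spectrum = dft(signal)
    left = idft([s * h for s, h in zip(spectrum, hrtf_left)])
    right = idft([s * h for s, h in zip(spectrum, hrtf_right)])
    return left, right
```

Calling `render_binaural` with the personalized first and second HRTFs would yield the left and right channels to be played back to the terminal user.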

Claims

1. A method for processing a sound signal, comprising:

obtaining a three-dimensional human head model of a terminal user;

calculating a first head-related transfer function (HRTF) for a sound wave emitted by a sound source and transmitted to a left ear of the three-dimensional human head model, and a second HRTF for the sound wave transmitted to a right ear of the three-dimensional human head model; and

processing the sound signal emitted by the sound source according to the first HRTF and the second HRTF.
2. The processing method according to claim 1, wherein obtaining the three-dimensional human head model of the terminal user comprises:

obtaining the three-dimensional human head model of the terminal user through scanning by a terminal.
3. The processing method according to claim 1, wherein calculating the first head-related transfer function (HRTF) for the sound wave emitted by the sound source and transmitted to the left ear of the three-dimensional human head model, and the second HRTF for the sound wave transmitted to the right ear of the three-dimensional human head model, comprises:

calculating a first sound pressure transmitted by the sound wave to the left ear, a second sound pressure transmitted by the sound wave to the right ear, and a third sound pressure generated by the sound wave at the original position of the three-dimensional human head model after the three-dimensional human head model is removed;

calculating the first HRTF according to the first sound pressure and the third sound pressure; and

calculating the second HRTF according to the second sound pressure and the third sound pressure.
4. The processing method according to claim 3, wherein calculating the first sound pressure transmitted by the sound wave to the left ear comprises:

when the sound wave is transmitted directly to the left ear, calculating the first sound pressure transmitted by the sound wave to the left ear; and

when the sound wave is transmitted to the left ear through reflection, sequentially calculating, in the order in which the reflections occur, the sound pressure after each reflection of the sound wave, wherein, when the sequentially calculated sound pressure is greater than a preset intensity threshold, the first sound pressure transmitted by the sound wave to the left ear is calculated.
5. The processing method according to claim 4, wherein calculating the first HRTF according to the first sound pressure and the third sound pressure comprises:

calculating the first HRTF according to the formula

$$H_L = \frac{P_L(r_L, \theta_L, \varphi_L, f, a)}{P_0(r_0, f)}$$

where $H_L$ represents the first HRTF, $P_L(r_L, \theta_L, \varphi_L, f, a)$ represents the first sound pressure, $P_0(r_0, f)$ represents the third sound pressure, $r_L$ represents the distance between the sound source and the left ear, $\theta_L$ represents the horizontal angle of the sound source relative to the left ear, $\varphi_L$ represents the elevation angle of the sound source relative to the left ear, $f$ represents the sound wave frequency, $a$ represents the physiological parameter of the three-dimensional human head model, and $r_0$ represents the distance between the sound source and the center position of the three-dimensional human head model.
6. The processing method according to claim 3, wherein calculating the second sound pressure transmitted by the sound wave to the right ear comprises:

when the sound wave is transmitted directly to the right ear, calculating the second sound pressure transmitted by the sound wave to the right ear; and

when the sound wave is transmitted to the right ear through reflection, sequentially calculating, in the order in which the reflections occur, the sound pressure after each reflection of the sound wave, wherein, when the sequentially calculated sound pressure is greater than a preset intensity threshold, the second sound pressure transmitted by the sound wave to the right ear is calculated.
7. The processing method according to claim 6, wherein calculating the second HRTF according to the second sound pressure and the third sound pressure comprises:

calculating the second HRTF according to the formula

$$H_R = \frac{P_R(r_R, \theta_R, \varphi_R, f, a)}{P_0(r_0, f)}$$

where $H_R$ represents the second HRTF, $P_R(r_R, \theta_R, \varphi_R, f, a)$ represents the second sound pressure, $P_0(r_0, f)$ represents the third sound pressure, $r_R$ represents the distance between the sound source and the right ear, $\theta_R$ represents the horizontal angle of the sound source relative to the right ear, $\varphi_R$ represents the elevation angle of the sound source relative to the right ear, $f$ represents the sound wave frequency, $a$ represents the physiological parameter of the three-dimensional human head model, and $r_0$ represents the distance between the sound source and the center position of the three-dimensional human head model.
8. A terminal, comprising:

an obtaining module configured to obtain a three-dimensional human head model of a terminal user;

a calculation module configured to calculate a first head-related transfer function (HRTF) for a sound wave emitted by a sound source and transmitted to a left ear of the three-dimensional human head model, and a second HRTF for the sound wave transmitted to a right ear of the three-dimensional human head model; and

a processing module configured to process the sound signal emitted by the sound source according to the first HRTF and the second HRTF.
9. The terminal according to claim 8, wherein the obtaining module is configured to obtain the three-dimensional human head model of the terminal user through scanning by the terminal.
10. The terminal according to claim 8, wherein the calculation module comprises:

a first calculating unit configured to calculate a first sound pressure transmitted by the sound wave to the left ear, a second sound pressure transmitted by the sound wave to the right ear, and a third sound pressure generated by the sound wave at the original position of the three-dimensional human head model after the three-dimensional human head model is removed;

a second calculating unit configured to calculate the first HRTF according to the first sound pressure and the third sound pressure; and

a third calculating unit configured to calculate the second HRTF according to the second sound pressure and the third sound pressure.
11. The terminal according to claim 10, wherein the first calculating unit is configured to: when the sound wave is transmitted directly to the left ear, calculate the first sound pressure transmitted by the sound wave to the left ear; and when the sound wave is transmitted to the left ear through reflection, sequentially calculate, in the order in which the reflections occur, the sound pressure after each reflection of the sound wave, wherein, when the sequentially calculated sound pressure is greater than a preset intensity threshold, the first sound pressure transmitted by the sound wave to the left ear is calculated.
12. The terminal according to claim 11, wherein the second calculating unit is configured to calculate the first HRTF according to the formula

$$H_L = \frac{P_L(r_L, \theta_L, \varphi_L, f, a)}{P_0(r_0, f)}$$

where $H_L$ represents the first HRTF, $P_L(r_L, \theta_L, \varphi_L, f, a)$ represents the first sound pressure, $P_0(r_0, f)$ represents the third sound pressure, $r_L$ represents the distance between the sound source and the left ear, $\theta_L$ represents the horizontal angle of the sound source relative to the left ear, $\varphi_L$ represents the elevation angle of the sound source relative to the left ear, $f$ represents the sound wave frequency, $a$ represents the physiological parameter of the three-dimensional human head model, and $r_0$ represents the distance between the sound source and the center position of the three-dimensional human head model.
13. The terminal according to claim 10, wherein the first calculating unit is configured to: when the sound wave is transmitted directly to the right ear, calculate the second sound pressure transmitted by the sound wave to the right ear; and when the sound wave is transmitted to the right ear through reflection, sequentially calculate, in the order in which the reflections occur, the sound pressure after each reflection of the sound wave, wherein, when the sequentially calculated sound pressure is greater than a preset intensity threshold, the second sound pressure transmitted by the sound wave to the right ear is calculated.
14. The terminal according to claim 13, wherein the third calculating unit is configured to calculate the second HRTF according to the formula

$$H_R = \frac{P_R(r_R, \theta_R, \varphi_R, f, a)}{P_0(r_0, f)}$$

where $H_R$ represents the second HRTF, $P_R(r_R, \theta_R, \varphi_R, f, a)$ represents the second sound pressure, $P_0(r_0, f)$ represents the third sound pressure, $r_R$ represents the distance between the sound source and the right ear, $\theta_R$ represents the horizontal angle of the sound source relative to the right ear, $\varphi_R$ represents the elevation angle of the sound source relative to the right ear, $f$ represents the sound wave frequency, $a$ represents the physiological parameter of the three-dimensional human head model, and $r_0$ represents the distance between the sound source and the center position of the three-dimensional human head model.
15. A computer storage medium storing computer-executable instructions configured to perform the method for processing a sound signal according to any one of claims 1 to 7.