RU2020144204A

RU2020144204A - Method and system for generating a personalized transfer function of a free field audio signal based on transfer function data of a near field audio signal

Info

Publication number: RU2020144204A
Application number: RU2020144204A
Authority: RU
Inventors: Андрей Викторович Филимонов; Андрей Игоревич ЕПИШИН; Михаил Сергеевич Клешнин; Джой ЛАЙОНЗ
Original assignee: Харман Интернэшнл Индастриз, Инкорпорейтид
Filing date: 2020-12-31
Publication date: 2022-07-04

Claims

1. A computer-implemented method for generating a personalized audio signal transfer function, the method comprising the steps of:

receiving, by the sound receiving means, an audio signal at or in the user's ear;

determining, based on the received audio signal, first data, wherein the first data represents a first audio signal transfer function associated with the user's ear;

determining, based on the first data, second data, wherein the second data represents a second audio signal transfer function associated with the user's ear.

2. The computer-implemented method according to claim 1, wherein the first audio signal transfer function is a near-field audio signal transfer function and/or the method further comprises receiving an audio signal from the audio transmission means, in particular from headphones worn by the user, within the near field relative to the user's ear.

3. A computer-implemented method according to any one of the preceding claims, wherein the second audio transfer function is a far-field or free-field audio transfer function.

4. The computer-implemented method according to any one of the preceding claims, wherein the second transfer function of the audio signal is associated with the direction of the audio signal; wherein the method further comprises determining the third data, wherein the third data indicates the direction of the audio signal, and wherein the determination of the second data is further based on the third data.

5. A computer-implemented method according to any one of the preceding claims, further comprising the steps of:

before receiving the audio signal, the audio signal is transmitted by the audio transmission means; and/or

determining, based on the second data, a filter function for modifying the audio signal and/or the subsequent audio signal; and/or

transmitting, by means of sound transmission, a modified sound signal and/or a modified subsequent sound signal.

6. A computer-implemented method according to any one of the preceding claims, wherein the second data is determined using an artificial intelligence-based or machine learning-based regression algorithm, preferably a neural network model, wherein in particular the first data and/or third data are used as input to the regression algorithm.

7. A computer-implemented method according to claim 6, further comprising, in the learning process, a computer-implemented method for initiating and/or learning a regression algorithm according to any one of claims 8-13.

8. A computer-implemented method for initiating and/or training an AI-based or machine learning-based regression algorithm, preferably a neural network model, the method comprising:

determining a training data set, wherein the training data set comprises a plurality of first training data and a plurality of second training data; and

initiating and/or training a regression algorithm based on the training dataset to derive a second audio transfer function associated with the user's ear based on the input first audio transfer function associated with the user's ear;

wherein each of the plurality of first training data represents a respective first audio signal training transfer function associated with the ear of the training subject or the respective ear of the training subject;

wherein each of the plurality of second training data represents a respective second audio signal training transfer function associated with the ear of the learning subject or the corresponding ear of the learning subject.

9. The computer-implemented method of claim 8, wherein each of the respective first training audio signal transfer functions is a respective near field audio signal transfer function, wherein in particular the input first audio signal transfer function is an audio signal transfer function near field.

10. The computer-implemented method of claim 8 or 9, wherein each of the respective second training audio signal transfer functions is a respective far-field or free-field audio signal transfer function, wherein, in frequency, the output second audio signal transfer function is is the transfer function of the far-field or free-field audio signal.

11. A computer-implemented method according to any one of claims 8-10, wherein

each of the respective second learning transfer functions of the audio signal is associated with a teaching direction of the audio signal relative to the ear of the teaching subject or a corresponding teaching direction of the audio signal relative to the ear of the teaching subject; and/or

the training dataset further comprises third training data, wherein the third training data indicates a training direction of the audio signal or a corresponding training direction of the audio signal; and/or

the output second audio transfer function is associated with the input direction of the audio signal relative to the user's ear, wherein, in particular, the initiation and/or training of the regression algorithm to output the second audio signal transfer function is additionally based on the input direction of the audio signal.

12. A computer-implemented method according to any one of claims 8 to 11, further comprising the steps of:

receiving a plurality of first training audio signals in or at the ear of the training subject from the first sound transmission means, in particular from headphones worn by the training subject, within a near field relative to the ear of the training subject; and

determining, based on each of the received plurality of first training audio signals, corresponding first training audio signal transfer functions; and/or

receiving a plurality of second training audio signals in or at the ear of the training user from or a corresponding second audio transmission medium, within a far field or free field relative to the ear of the training subject; and

determining, based on each of the received plurality of second training audio signals, corresponding second training audio signal transfer functions;

wherein, in particular, the teaching direction of the audio signal or the corresponding teaching direction of the audio signal is the direction in which the corresponding second teaching audio signal is received at or in the ear of the teaching subject relative to the ear of the teaching user, and/or the direction in which the corresponding second transmission means sound is located relative to the ear of the teaching subject.

13. The computer-implemented method of claim 12, wherein the third training data comprises vector data indicating the training direction of the audio signal; wherein the third training data comprises the second vector data, the second vector data being dependent on, in particular derived from, the first vector data.

14. A data processing system comprising means for carrying out the method according to any one of the preceding claims.

15. A computer-readable storage medium containing instructions which, when executed by the data processing means according to claim 14, cause the data processing means according to claim 14 to perform the method according to any one of claims 1 to 13.