RU2021139507A

RU2021139507A - PACKET LOSS CONCEALMENT FOR DIRAC-BASED SPATIAL AUDIO DATA CODING

Info

Publication number: RU2021139507A
Application number: RU2021139507A
Authority: RU
Inventors: Guillaume FUCHS; Markus MULTRUS; Stefan DÖHLA; Andrea EICHENSEER
Original assignee: Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.
Priority date: 2019-06-12
Filing date: 2020-06-05
Publication date: 2023-07-12

Claims

1. A method (100) for concealing loss of spatial audio parameters, wherein the spatial audio parameters contain at least arrival direction information, the method comprising the steps of:

- receiving (110) a first set of spatial audio parameters containing at least first arrival direction information (azi1, ele1);

- receiving (120) a second set of spatial audio parameters containing at least second arrival direction information (azi2, ele2); and

- replacing the second arrival direction information (azi2, ele2) of the second set with replacement arrival direction information derived from the first arrival direction information (azi1, ele1) if at least the second arrival direction information (azi2, ele2) or part of the second arrival direction information (azi2, ele2) is lost or damaged.
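The three steps of claim 1 can be illustrated with a minimal sketch. It assumes the simplest possible derivation, in which the replacement direction is a copy of the first direction; the `SpatialParams` container, the use of `None` to mark a lost or damaged value, and the function name are hypothetical and do not come from the claims.

```python
from dataclasses import dataclass, replace
from typing import Optional


@dataclass(frozen=True)
class SpatialParams:
    """One set of spatial audio parameters (hypothetical container)."""
    azi: Optional[float]  # azimuth in degrees, None if lost or damaged
    ele: Optional[float]  # elevation in degrees, None if lost or damaged


def conceal_direction(first: SpatialParams, second: SpatialParams) -> SpatialParams:
    """Return the second set, replacing its direction if any part is missing."""
    if second.azi is None or second.ele is None:
        # Assumed derivation: reuse the first direction unchanged.
        return replace(second, azi=first.azi, ele=first.ele)
    return second
```

In this sketch a partially lost direction (for example, only the elevation missing) is replaced as a whole, which is one possible reading of "part of the second arrival direction information".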

2. The method (100) of claim 1, wherein the first set (1st set) and the second set (2nd set) of spatial audio parameters contain first and second diffuseness information (Ψ1, Ψ2), respectively.

3. The method (100) of claim 2, wherein the first or second diffuseness information (Ψ1, Ψ2) is derived from at least one energy ratio associated with at least one arrival direction information.

4. The method (100) according to claim 2 or 3, the method further comprising replacing the second diffuseness information (Ψ2) of the second set (2nd set) with replacement diffuseness information derived from the first diffuseness information (Ψ1).
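As a rough illustration of claims 3 and 4, the sketch below derives a diffuseness value from energy ratios and substitutes a lost diffuseness value. The relation Ψ = 1 − Σ ratios is an assumption (a common convention for direct-to-total energy ratios), not something stated in the claims, and both function names are hypothetical.

```python
from typing import Optional, Sequence


def diffuseness_from_ratios(direct_to_total_ratios: Sequence[float]) -> float:
    """Assumed mapping: diffuseness is the energy share not assigned to any direction."""
    psi = 1.0 - sum(direct_to_total_ratios)
    # Keep the result in the valid range [0, 1].
    return min(max(psi, 0.0), 1.0)


def conceal_diffuseness(psi1: float, psi2: Optional[float]) -> float:
    """Replace a lost second diffuseness (None) with the first one."""
    return psi1 if psi2 is None else psi2
```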

5. The method (100) according to one of the preceding claims, wherein the replacement arrival direction information corresponds to the first arrival direction information (azi1, ele1).

6. The method (100) according to one of the preceding claims, wherein the replacing step comprises a step of dithering the replacement arrival direction information; and/or

- wherein the replacing step comprises a step of introducing random noise into the first arrival direction information (azi1, ele1) to obtain the replacement arrival direction information.

7. The method (100) according to claim 6, wherein the step of introducing random noise is performed if the first or second diffuseness information (Ψ1, Ψ2) indicates high diffuseness; and/or if the first or second diffuseness information (Ψ1, Ψ2) is above a predetermined threshold value for the diffuseness information.

8. The method (100) of claim 7, wherein the diffuseness information comprises or is based on a ratio between directional and non-directional components of an audio scene described by the first set (1st set) and/or the second set (2nd set) of spatial audio parameters.

9. The method (100) according to one of claims 6 to 8, wherein the random noise to be introduced depends on the first and/or second diffuseness information (Ψ1, Ψ2); and/or

- wherein the random noise to be introduced is scaled by a factor depending on the first and/or second diffuseness information (Ψ1, Ψ2).
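Claims 6 to 9 can be sketched as follows: random noise is added to the first direction only when the diffuseness exceeds a threshold, and the noise amplitude grows with the diffuseness. The threshold of 0.5, the maximum deviation of 30 degrees, the uniform noise distribution, and the linear scaling are all illustrative assumptions; the claims do not fix any of them.

```python
import random
from typing import Optional, Tuple


def dither_direction(
    azi1: float,
    ele1: float,
    diffuseness: float,
    threshold: float = 0.5,      # assumed threshold for "high diffuseness"
    max_dev_deg: float = 30.0,   # assumed maximum deviation at diffuseness 1.0
    rng: Optional[random.Random] = None,
) -> Tuple[float, float]:
    """Derive a replacement direction by adding diffuseness-scaled noise."""
    if rng is None:
        rng = random.Random()
    if diffuseness <= threshold:
        # Low diffuseness: keep the first direction unchanged.
        return azi1, ele1
    # Noise amplitude scaled by a factor that depends on the diffuseness.
    scale = diffuseness * max_dev_deg
    azi = azi1 + rng.uniform(-1.0, 1.0) * scale
    ele = ele1 + rng.uniform(-1.0, 1.0) * scale
    # Wrap azimuth to [-180, 180) and clamp elevation to [-90, 90].
    azi = (azi + 180.0) % 360.0 - 180.0
    ele = max(-90.0, min(90.0, ele))
    return azi, ele
```

Passing a seeded `random.Random` instance makes the dithering reproducible, which is convenient for testing.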

10. The method (100) according to one of claims 6 to 9, further comprising analyzing the tonality of the audio scene described by the first set (1st set) and/or the second set (2nd set) of spatial audio parameters, or analyzing the tonality of a transmitted downmix belonging to the first set (1st set) and/or the second set (2nd set) of spatial audio parameters, to obtain a tonality value describing the tonality; and

- wherein the random noise to be introduced depends on the tonality value.

11. The method (100) of claim 10, wherein the random noise is scaled down by a factor that decreases together with the inverse of the tonality value, or if the tonality increases.
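One way to picture the dependence on the tone (tonality) value is a combined scale factor that shrinks as the signal becomes more tonal. The sketch below assumes both the diffuseness and the tonality value are normalized to [0, 1] and uses the factor (1 − tonality); this particular mapping and the function name are hypothetical, and the claims only require that the noise is reduced as tonality increases.

```python
def noise_scale(diffuseness: float, tonality: float) -> float:
    """Scale factor for the injected noise: larger when diffuse, smaller when tonal."""
    d = min(max(diffuseness, 0.0), 1.0)  # clamp to [0, 1]
    t = min(max(tonality, 0.0), 1.0)     # clamp to [0, 1]
    # Assumed mapping: a fully tonal signal (t == 1) receives no noise at all.
    return d * (1.0 - t)
```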

12. The method (100) according to one of the preceding claims, wherein the method (100) comprises extrapolating the first arrival direction information (azi1, ele1) to obtain the replacement arrival direction information.

13. The method (100) of claim 12, wherein the extrapolation is based on one or more additional items of arrival direction information belonging to one or more sets of spatial audio parameters.

14. The method (100) according to claim 12 or 13, wherein the extrapolation is performed if the first and/or second diffuseness information (Ψ1, Ψ2) indicates low diffuseness; or if the first and/or second diffuseness information (Ψ1, Ψ2) is below a predetermined threshold value for the diffuseness information.
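The extrapolation of claims 12 to 14 can be sketched with a linear continuation of the trajectory through the two most recent correctly received directions. Linear extrapolation, the threshold of 0.5, and the function name are assumptions made for illustration; the claims do not prescribe a particular extrapolation rule.

```python
from typing import Tuple

Direction = Tuple[float, float]  # (azimuth, elevation) in degrees


def extrapolate_direction(
    prev: Direction,
    last: Direction,
    diffuseness: float,
    threshold: float = 0.5,  # assumed threshold for "low diffuseness"
) -> Direction:
    """Continue the motion from prev to last by one more step if the scene is directional."""
    azi_last, ele_last = last
    if diffuseness >= threshold:
        # Not clearly directional: fall back to holding the last direction.
        return last
    azi_prev, ele_prev = prev
    # Shortest signed azimuth step, in [-180, 180), so wrap-around is handled.
    d_azi = (azi_last - azi_prev + 180.0) % 360.0 - 180.0
    d_ele = ele_last - ele_prev
    azi = (azi_last + d_azi + 180.0) % 360.0 - 180.0
    ele = max(-90.0, min(90.0, ele_last + d_ele))
    return azi, ele
```

Here `prev` plays the role of the additional arrival direction information from an earlier parameter set, and `last` is the first arrival direction information (azi1, ele1).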

15. The method (100) according to one of the preceding claims, wherein the first set (1st set) of spatial audio parameters belongs to a first point in time and/or a first frame, and the second set (2nd set) of spatial audio parameters belongs to a second point in time and/or a second frame; or

- wherein the first set (1st set) of spatial audio parameters belongs to a first point in time, and the second point in time follows the first point in time, or wherein the second frame follows the first frame.

16. The method (100) according to one of the preceding claims, wherein the first set (1st set) of spatial audio parameters comprises a first subset of spatial audio parameters for a first frequency band and a second subset of spatial audio parameters for a second frequency band; and/or

- wherein the second set (2nd set) of spatial audio parameters comprises another first subset of spatial audio parameters for the first frequency band and another second subset of spatial audio parameters for the second frequency band.
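When each set carries one subset per frequency band, concealment can be applied band by band. The sketch below assumes each frame is a list with one `(azi, ele)` tuple per band, that a lost band is marked with `None`, and that the replacement is simply the same band of the previous frame; these representation choices and the function name are hypothetical.

```python
from typing import List, Optional, Tuple

Direction = Tuple[float, float]  # (azimuth, elevation) in degrees


def conceal_bands(
    first_bands: List[Direction],
    second_bands: List[Optional[Direction]],
) -> List[Direction]:
    """Replace each lost band of the second frame with the same band of the first frame."""
    if len(first_bands) != len(second_bands):
        raise ValueError("both frames must have the same number of bands")
    return [
        prev if cur is None else cur
        for prev, cur in zip(first_bands, second_bands)
    ]
```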

17. A method (200) for decoding a DirAC-encoded audio scene, comprising the steps of:

- decoding a DirAC-encoded audio scene comprising a downmix, a first set of spatial audio parameters, and a second set of spatial audio parameters; and

- performing the method according to one of the preceding claims.

18. A computer-readable digital storage medium on which a computer program is stored, the computer program having a program code for performing the method (100, 200) according to one of the preceding claims when executed on a computer.

19. A loss concealment device (50) for concealing loss of spatial audio parameters, wherein the spatial audio parameters contain at least arrival direction information, the device comprising:

- a receiver (52) for receiving (110) a first set of spatial audio parameters containing first arrival direction information (azi1, ele1) and for receiving (120) a second set of spatial audio parameters containing second arrival direction information (azi2, ele2); and

- a processor (54) for replacing the second arrival direction information (azi2, ele2) of the second set with replacement arrival direction information derived from the first arrival direction information (azi1, ele1) if at least the second arrival direction information (azi2, ele2) or part of the second arrival direction information (azi2, ele2) is lost or damaged.

20. A decoder (70) for a DirAC-encoded audio scene, comprising the loss concealment device according to claim 19.