RU2016121163A

RU2016121163A - EXTENDING THE AUDIO BANDBAND BY INSERTING A NOISE WITH A PRELIMINARY TIME FORM IN THE FREQUENCY AREA

Info

Publication number: RU2016121163A
Application number: RU2016121163A
Authority: RU
Inventors: Саша ДИШ; Маркус МУЛЬТРУС; Беньямин ШУБЕРТ; Маркус ШНЕЛЛЬ
Original assignee: Фраунхофер-Гезелльшафт Цур Фердерунг Дер Ангевандтен Форшунг Е.Ф.
Priority date: 2013-10-31
Filing date: 2014-10-30
Publication date: 2017-12-05
Also published as: ES2657337T3; MX355452B; JP6396459B2; KR20160075768A; RU2666468C2; CA2927990C; KR101852749B1; CN105706166B; US9805731B2; CN105706166A; WO2015063227A1; EP3063761B1; TR201802303T4; EP3063761A1; JP2016541012A; CA2927990A1; US20160240200A1; MX2016005167A

Claims

1. An audio decoder device for decoding a bitstream (BS), wherein the audio decoder device (1), comprising:

a bitstream receiver (2) configured to receive a bitstream (BS) and derive the encoded audio signal (EAS) from the bitstream (BS);

a base decoder module (3) configured to derive a decoded audio signal (DAS) in the time domain from an encoded audio signal (EAS);

a time envelope generator (4) configured to determine a time envelope (TED) of a decoded audio signal (DAS);

a bandwidth expansion module (5) configured to generate a frequency domain bandwidth extension (BEF) signal, wherein the bandwidth expansion module (5) comprises a noise generator (6) configured to generate a noise signal (NOS) in the time domain wherein the bandwidth extension module (5) comprises a preforming module (7) configured to temporally shape the noise signal (NOS) depending on the time envelope (TED) of the decoded audio signal (DAS) for the tog in order to create a shaped noise signal (SNS), while the bandwidth extension module (5) comprises a time-frequency converter (8) adapted to transform the shaped noise signal (SNS) into a frequency domain noise signal (FNS) ; wherein the frequency domain bandwidth extension (BEF) signal is dependent on the frequency domain noise signal (FNS);

a time-frequency converter (9), configured to transform a decoded audio signal (DAS) into a frequency-domain decoded audio signal (FDS);

a combiner (10) configured to combine the decoded frequency domain audio signal (FDS) and the frequency domain bandwidth extension (BEF) signal to create an extended frequency bandwidth domain (BFS) audio signal; and

a frequency-time converter (11), configured to transform the audio signal of the frequency domain with extended bandwidth (BFS) into the audio signal of the time domain with extended-bandwidth (BAS).

2. The audio decoder apparatus of claim 1, wherein a frequency domain bandwidth extension (BEF) signal is generated without spectral band replication.

3. The audio decoder device according to one of the preceding paragraphs, in which the bandwidth extension module (5) is configured such that the time-shaping of the noise signal (NOS) is performed in an overly pronounced manner.

4. The audio decoder device according to one of the preceding paragraphs, in which the bandwidth extension module (5) is configured such that time shaping of the noise signal (NOS) is performed by subbands by splitting the noise signal (NOS) into several noise signals of the subband by a set of band-pass filters and perform special shaping in time over each of the noise signals of the subband.

5. The audio decoder device according to one of the preceding claims, in which the bandwidth extension module (5) comprises a frequency band selector (12) configured to set a frequency band of a frequency domain bandwidth extension (BEF) signal.

6. The audio decoder device according to one of the preceding paragraphs, in which the bandwidth extension module (5) comprises a post-shaping module configured to shape in time and / or spectrum in a private domain in a frequency domain bandwidth extension signal (BEF )

7. The audio decoder device according to one of the preceding paragraphs, in which the bitstream receiver (2) is configured to derive a side information signal (SIS) from the bitstream (BS), while the bandwidth extension module (5) is configured to generate a signal frequency domain bandwidth (BEF) extensions depending on the side information signal (SIS).

8. The audio decoder device according to the preceding paragraph, in which the noise generator (6) is configured to generate a noise signal (NOS) depending on the side information signal (SIS).

9. An audio decoder device according to one of claims 7 or 8, wherein the preforming module (7) is configured to temporally shape the noise signal (NOS) depending on the side information signal (SIS).

10. An audio decoder device according to one of claims 7 through 9, in which the post-shaping module (13) is configured to shape in time and / or spectrum in a frequency domain bandwidth extension (BEF) signal depending on the side information signal (SIS).

11. The audio decoder device according to one of the preceding claims, in which the bandwidth extension module (5) comprises an additional noise generator (14) configured to create an additional noise signal (NOSF) in the time domain, an additional preforming module (15) configured to temporally shape an additional noise signal (NOSF) depending on the time envelope (TED) of the decoded audio signal (DAS) in order to create an additional noise signal with the given shape th (SNSF), and an additional time-frequency converter (16), configured to transform an additional shaped noise signal (SNSF) into an additional frequency domain noise signal (FNSF), wherein the frequency domain bandwidth extension signal (BEF) depends from an additional frequency domain noise signal (FNSF).

12. The audio decoder device according to the preceding paragraph, in which the bandwidth extension module (5) is configured such that time shaping of the additional noise signal (NOSF) is performed in an overly pronounced manner.

13. The device audio decoder p. 11 or 12, wherein the bandwidth extension module (5) is configured such that time shaping of the additional noise signal (NOSF) is performed by subbands by splitting the additional noise signal (NOSF) into several additional noise signals of the subband by a set of bandpass filters and performing a particular shaping in time over each of the additional subband noise signals.

14. The audio decoder device according to one of the preceding claims, in which the bandwidth extension module (5) comprises a tone generator (17) configured to generate a tone signal (TOS) in the time domain, a preliminary tone-shaping module (18) made with the possibility of shaping in time the tone signal (TOS) depending on the time envelope (TED) of the decoded audio signal (DAS) in order to create a tone signal with a given shape (STS) and a time-frequency converter (19) configured to transform shape tone signal (STS) into the frequency domain tone signal (FTS), wherein the frequency domain bandwidth extension signal (BEF) depends on the frequency domain tone signal (FTS).

15. The audio decoder device according to one of the preceding claims, in which the base decoder module (5) comprises a base time domain decoder (21) and a frequency domain base decoder (22), wherein either the base time domain decoder (21) or the base decoder (22) the frequency domain is used to derive the decoded audio signal (DAS) from the encoded audio signal (EAS).

16. The audio decoder device according to the preceding paragraph, in which the control parameter extractor (23) is configured to extract control parameters (CP) used by the base decoder module (3) from the decoded audio signal (DAS) and the band extension module (5) bandwidth is configured to create a frequency domain bandwidth extension (BEF) signal depending on control parameters (CP).

17. The audio decoder device according to one of the preceding claims, in which the bandwidth extension module (5) comprises a shaping gain calculator (24) configured to set the shaping gain (SG) for the preliminary shaping module (7) in depending on the time envelope (TED) of the decoded audio signal (DAS) and the pre-shaping module (7) is configured to time-shape the noise signal (NOS) depending on the gain shaping (SG) for the module (7) pre-shaping.

18. The device audio decoder in p. 16 and 17, wherein the shaping gain calculator (24) for determining shaping gain (SG) for the pre-shaping module (7) is configured to set shaping gain (SG) for the pre-shaping module (7) depending on control parameters (CP).

19. An audio decoder device according to one of claims 11 to 18, wherein the bandwidth extension module (5) comprises a shaping gain calculator configured to set the shaping gain for the pre-shaping add-on module (15) depending on the time envelope (TED) of the decoded audio signal (DAS ) and at the same time, the additional module (14) for preliminary shaping is configured to give the form in time an additional noise signal (NOSF) depending on the gain of giving the ph frames for the additional module (14) pre-shaping.

20. The device audio decoder p. 16 and 19, wherein the shaping gain factor calculator for establishing the shaping gain for the pre-shaping additional module (15) is configured to set the shaping gain for the pre-shaping additional module (15) depending on the control parameters (CP )

21. An audio decoder device according to one of claims 14 to 20, wherein the bandwidth extension module (5) comprises a shaping gain calculator configured to set the shaping gain for the pre-shaping module (18) depending on the time envelope (TED) of the decoded audio signal (DAS ) and at the same time, the pre-shaping module (18) is configured to temporally shape the tone signal (TOS) depending on the shaping gain for the pre-shaping module (18) Nia forms tone.

22. The device audio decoder in p. 16 and 21, wherein the shaping gain factor calculator for establishing shaping gain factors for the pre-shaping module (18) is configured to set shaping gain factors for the supplementary shaping module (18) depending on control parameters (CP )

23. A method for decoding a bitstream (BS), the method comprising the steps of:

receiving a bitstream (BS) and outputting the encoded audio signal (EAS) from the bitstream (BS) using the bitstream receiver (2);

outputting the decoded audio signal (DAS) in the time domain from the encoded audio signal (EAS) using the base decoder module (3);

determining a temporal envelope (TED) of the decoded audio signal (DAS) using the temporal envelope generator (4);

creating a frequency domain bandwidth extension (BEF) signal using the bandwidth extension module (5), performing the steps of:

creating a noise signal (NOS) in the time domain using the noise generator (6) of the bandwidth extension module (5),

shape the noise signal (NOS) in time as a function of the time envelope (TED) of the decoded audio signal (DAS) in order to create a shaped noise signal (SNS) using the pre-shaped module (7) of the band extension module (5) transmittance

transforming the shaped noise signal (SNS) into a frequency domain noise signal (FNS); wherein the frequency domain bandwidth extension (BEF) signal is dependent on the frequency domain noise signal (FNS) using the time-frequency converter (8) of the bandwidth extension module (5);

transforming the decoded audio signal (DAS) into a frequency domain decoded audio signal (FDS) using an additional time-frequency converter (9);

combining the decoded frequency domain audio signal (FDS) and the frequency domain bandwidth extension (BEF) signal to create an extended frequency bandwidth domain (BFS) audio signal using a combiner (10); and

transforming the audio signal of the frequency domain with extended bandwidth (BFS) into the audio signal of the time domain with extended bandwidth (BAS) using the time-frequency converter (11).

24. A computer program, when executed on a processor, executing the method of the preceding paragraph.