WO2020038938A1 - Methods, apparatus and systems for generation, transportation and processing of immediate playout frames (ipfs) - Google Patents
Methods, apparatus and systems for generation, transportation and processing of immediate playout frames (ipfs) Download PDFInfo
- Publication number
- WO2020038938A1 WO2020038938A1 PCT/EP2019/072258 EP2019072258W WO2020038938A1 WO 2020038938 A1 WO2020038938 A1 WO 2020038938A1 EP 2019072258 W EP2019072258 W EP 2019072258W WO 2020038938 A1 WO2020038938 A1 WO 2020038938A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- frame
- frames
- audio
- sample values
- encoded
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/162—Interface to dedicated audio devices, e.g. audio drivers, interface to CODECs
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/167—Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/165—Management of the audio stream, e.g. setting of volume, audio stream path
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/173—Transcoding, i.e. converting between two coded representations avoiding cascaded coding-decoding
Definitions
- the present disclosure relates generally to audio encoders, encoding methods, audio decoders and decoding methods, including a method for decoding a bitstream of encoded audio data, wherein the bitstream of encoded audio data represents a sequence of audio sample values and comprises a plurality of frames, wherein each frame comprises associated encoded audio sample values, and a method for generating a bitstream of encoded audio data with immediate playout frames.
- the present disclosure relates further to an apparatus for generating immediate playout frames in a bitstream of encoded audio data or for removing immediate playout frames from a bitstream of encoded audio data.
- An IPF provides information to a special frame that permits
- an IPF is a frame where a decoder upon its reception can immediately produce correct samples from the first sample which is encoded into this IPF, as it contains all information to do so.
- An IPF thus denotes an independently decodable frame which can be decoded using information only from within itself.
- Encoded audio usually comes in data frames or chunks. In the context of audio as
- the frames/chunks may be known as granules, the encoded chunks/ frames are called access units (AU) and the decoded chunks are called composition units (CU).
- AU access units
- CU composition units
- the audio signal may only be accessible and addressable in the granularity of these coded chunks (access units).
- a decoder needs to be supplied with an AU n representing the corresponding time-segment of an audio program, and with additional AU n-i , AU n- 2, ... AUs and configuration data preceding AU n . Otherwise, due to different coding configurations (e.g., Windowing data, SBR-related data, PS related data), it cannot be guaranteed that a decoder produces correct output when decoding only AU n .
- coding configurations e.g., Windowing data, SBR-related data, PS related data
- the first AU n to be decoded with a new configuration has to carry the new configuration data and all the pre-roll data (in form of AU n x , representing time-segments before AU n ) that is needed to initialize the decoder with the new configuration.
- This can be done by means of an Immediate Playout Frame (IPF) as defined in the MPEG-H 3D Audio standard or in the MPEG-D US AC standard.
- IPF Immediate Playout Frame
- an audio decoder for decoding a bitstream of encoded audio data, wherein the bitstream of encoded audio data represents a sequence of audio sample values and comprises a plurality of frames, wherein each frame comprises associated encoded audio sample values.
- the audio decoder may comprise a determiner configured to determine whether a frame of the bitstream of encoded audio data is an immediate playout frame comprising encoded audio sample values associated with a current frame and additional information, wherein the additional information may comprise encoded audio sample values of a number of frames preceding the immediate playout frame, wherein the encoded audio sample values of the preceding frames may be encoded using the same codec configuration as the current frame, wherein the number of preceding frames, corresponding to pre-roll frames, may correspond to the number of frames needed by the decoder to build up the full signal so as to be in a position to output valid audio sample values associated with the current frame whenever an immediate playout frame is decoded.
- the decoder may comprise an initializer configured to initialize the decoder if the determiner determines that the frame is an immediate playout frame, wherein initializing the decoder may comprise decoding the encoded audio sample values comprised by the additional information before decoding the encoded audio sample values associated with the current frame, wherein the initializer may be configured to switch the audio decoder from a current codec configuration to a different codec configuration if the determiner determines that the frame is an immediate playout frame and if the audio sample values of the current frame have been encoded using the different codec configuration, and wherein the decoder may be configured to decode the current frame using the current codec configuration and to discard the additional information if the determiner determines that the frame is an immediate playout frame and if the audio sample values of the current frame have been encoded using the current codec configuration.
- the additional information may further comprise information on the codec configuration used for encoding the audio sample values associated with the current frame, and the determiner may further be configured to determine whether the codec configuration of the additional information is different from the current codec configuration.
- the immediate playout frame may comprise the additional information as an extension payload and the determiner may be configured to evaluate the extension payload of the immediate playout frame.
- the bitstream of encoded audio data may be an MPEG-4 Audio bitstream.
- the additional information may be transported via an MPEG-4 Audio bitstream extension mechanism that is either a Data Stream Element (DSE) or an
- DSE Data Stream Element
- either the Data Stream Element (DSE) or the extension_payload element may be located at a predefined position in the MPEG-4 Audio bitstream and/or may have a specific instance tag signaling that a payload of the Data Stream Element (DSE) or the extension_payload element is the additional information.
- the extension_payload element may, for example, be contained at different places of the MPEG-4 Audio bitstream syntax. Accordingly, this allows to use immediate playout frame functionality also in MPEG-4 Audio.
- the extension_payload element may be contained inside a fill element (ID FIL).
- the additional information may further comprise a unique identifier, and optionally the unique identifier may be used to detect the different codec configuration.
- the decoder may further comprise a crossfader configured to perform crossfading of output sample values acquired by flushing the decoder in the previous codec configuration and output sample values acquired by decoding the encoded audio sample values associated with the current frame.
- an earliest frame of the number of frames comprised in the additional information may not be time-differentially encoded or entropy encoded relative to any frame previous to the earliest frame and the immediate playout frame may not be time-differentially encoded or entropy encoded relative to any frame previous to the earliest frame of the number of frames preceding the immediate playout frame or relative to any frame previous to the immediate playout frame.
- bitstream of encoded audio data represents a sequence of audio sample values and comprises a plurality of frames, wherein each frame comprises associated encoded audio sample values.
- the method may comprise determining whether a frame of the bitstream of encoded audio data is an immediate playout frame comprising encoded audio sample values associated with a current frame and additional information, wherein the additional information may comprise encoded audio sample values of a number of frames preceding the immediate playout frame, wherein the encoded audio sample values of the preceding frames may be encoded using the same codec configuration as the immediate playout frame, wherein the number of preceding frames, corresponding to pre-roll frames, may correspond to the number of frames needed by a decoder to build up the full signal so as to be in a position to output valid audio sample values associated with the current frame whenever an immediate playout frame is decoded.
- the method may further comprise initializing the decoder if it is determined that the frame is an immediate playout frame, wherein the initializing may comprise decoding the encoded audio sample values comprised by the additional information before decoding the encoded audio sample values associated with the current frame.
- the method may further comprise switching the audio decoder from a current codec configuration to a different codec configuration if it is determined that the frame is an immediate playout frame and if the audio sample values of the immediate playout frame have been encoded using the different codec configuration. And the method may comprise decoding the immediate playout frame using the current codec configuration and discarding the additional information if it is determined that the frame is an immediate playout frame and if the audio sample values of the immediate playout frame have been encoded using the current codec configuration.
- the method allows, for example, switching of AudioObjectTypes (AOT) as defined in ISO/IEC 14496-3 in combination with continuously producing correct output samples and without introducing gaps of silence in the audio output.
- AOT AudioObjectTypes
- the additional information may further comprise information on the codec configuration used for encoding the audio sample values associated with the current frame, wherein the method may further comprise determining whether the codec configuration of the additional information is different from the current codec configuration used to encode audio sample values associated with frames in the bitstream preceding the immediate playout frame.
- the bitstream of encoded audio data may be an MPEG-4 Audio bitstream.
- the additional information may be transported via an MPEG-4 Audio bitstream extension mechanism that is either a Data Stream Element (ID DSE) or an extension_payload element.
- ID DSE Data Stream Element
- extension_payload element an MPEG-4 Audio bitstream extension mechanism that is either a Data Stream Element (ID DSE) or an extension_payload element.
- either the Data Stream Element (ID DSE) or the extension_payload element may be located at a predefined position in the MPEG-4 Audio bitstream and/or may have a specific instance tag signaling that a payload of the Data Stream Element (ID DSE) or the extension_payload element is the additional information.
- the extension_payload element may be contained inside a fill element (ID FIL).
- the additional information may further comprise a unique identifier, and optionally the unique identifier may be used to detect the different codec configuration.
- the bitstream of encoded audio data may comprise a first number of frames encoded using a first codec configuration and a second number of frames following the first number of frames and encoded using a second codec configuration, wherein the first frame of the second number of frames may be the immediate playout frame.
- an audio encoder for generating a bitstream of encoded audio data with immediate playout frames, wherein the bitstream of encoded audio data represents a sequence of audio sample values and comprises a plurality of frames, wherein each frame comprises associated encoded audio sample values.
- the audio encoder may comprise a core encoder configured to encode uncompressed audio sample values associated with the plurality of frames using a predefined codec configuration.
- the audio encoder may further comprise a buffer configured to store encoded audio sample values of a number of preceding frames of a current frame of the plurality of frames encoded using the predefined codec configuration.
- the audio encoder may comprise an embedder configured to write an immediate playout frame in the current frame of the plurality of frames, wherein the immediate playout frame may comprise encoded audio sample values associated with said current frame and additional information corresponding to the encoded audio sample values of the number of preceding frames of said current frame.
- the embedder may further be configured to include information on the predefined codec configuration in the additional information.
- the embedder may further be configured to include in the immediate playout frame the additional information.
- the generated bitstream of encoded audio data may be an MPEG-4 Audio bitstream.
- the embedder may further be configured to embed the additional information in the bitstream via an MPEG-4 Audio bitstream extension mechanism that is either a Data Stream Element (ID DSE) or an extension_payload element.
- ID DSE Data Stream Element
- extension_payload element an extension_payload element
- the embedder may further be configured to locate either the Data Stream Element (ID DSE) or the extension_payload element at a predefined position in the MPEG-4 Audio bitstream and/or to assign a specific instance tag signaling that a payload of the Data Stream Element (ID DSE) or the extension_payload element is the additional information.
- ID DSE Data Stream Element
- ID DSE extension_payload element
- the embedder may further be configured to embed the
- the embedder may further be configured to include a unique identifier into the additional information, and optionally the unique identifier may signal the predefined codec configuration.
- the audio encoder may further be configured to not time-differentially encode or entropy encode an earliest frame of the number of frames comprised in the additional information relative to any frame previous to the earliest frame and the audio encoder may further be configured to not time-differentially encode or entropy encode the immediate playout frame relative to any frame previous to the earliest frame of the number of frames preceding the immediate playout frame or relative to any frame previous to the immediate playout frame.
- a system comprising two or more audio encoders for generating a plurality of bitstreams of encoded audio data each having immediate playout frames, wherein each bitstream of encoded audio data represents a sequence of audio sample values and comprises a plurality of frames, and wherein each frame comprises associated encoded audio sample values.
- a predetermined sampling rate may be the same for each of the core encoders of the two or more audio encoders. Accordingly, resampling and additional delay handling at the decoder can be avoided.
- the system may further comprise a delay alignment unit for delay aligning the plurality of bitstreams. Accordingly, this allows for seamless switching at the decoder by compensating for different encoder delays.
- the method may comprise the step of encoding, by a core encoder, uncompressed audio sample values associated with the plurality of frames using a predefined codec configuration.
- the method may further comprise the step of storing, by a buffer, encoded audio sample values of a number of preceding frames of a current frame of the plurality of frames encoded using the predefined codec configuration. And the method may comprise the step of writing, by an embedder, an immediate playout frame in the current frame of the plurality of frames, wherein the immediate playout frame may comprise encoded audio sample values associated with said current frame and additional information corresponding to the encoded audio sample values of the number of preceding frames of said current frame.
- the additional information may further comprise information on the predefined codec configuration.
- the immediate playout frame may further comprise the additional information.
- the generated bitstream of encoded audio data may be an MPEG-4 Audio bitstream.
- the additional information may be embedded in the bitstream, by the embedder, via an MPEG-4 Audio bitstream extension mechanism that may be either a Data Stream Element (ID DSE) or an extension_payload element.
- ID DSE Data Stream Element
- extension_payload element may be located, by the embedder, at a predefined position in the MPEG-4 Audio bitstream and/or may be assigned a specific instance tag signaling that a payload of the Data Stream Element (ID DSE) or the extension_payload element is the additional information.
- the extension_payload element may be embedded, by the embedder, inside a fill element (ID FIL).
- the additional information may further comprise a unique identifier, and optionally the unique identifier may signal the predefined codec configuration.
- an earliest frame of the number of frames comprised in the additional information may not be time-differentially encoded or entropy encoded relative to any frame previous to the earliest frame and, by the audio encoder, the immediate playout frame may not be time-differentially encoded or entropy encoded relative to any frame previous to the earliest frame of the number of frames preceding the immediate playout frame or relative to any frame previous to the immediate playout frame.
- an apparatus for generating immediate playout frames in a bitstream of encoded audio data or for removing immediate playout frames from a bitstream of encoded audio data, wherein the bitstream of encoded audio data represents a sequence of audio sample values and comprises a plurality of frames, wherein each frame comprises associated encoded audio sample values.
- the apparatus may comprise a receiver configured to receive a bitstream of encoded audio data, wherein the bitstream of encoded audio data represents a sequence of audio sample values and comprises a plurality of frames, wherein each frame comprises associated encoded audio sample values. And the apparatus may comprise an embedder configured to write an immediate playout frame in a current frame of the plurality of frames, wherein the immediate playout frame may comprise encoded audio sample values associated with said current frame and additional information corresponding to encoded audio sample values of a number of preceding frames of said current frame. Configured as proposed, the apparatus allows to individually generate immediate playout frames in any already existing bitstreams of encoded audio data, i.e. prior to distribution if needed.
- the apparatus may further comprise a buffer configured to store encoded audio sample values of the number of preceding frames of the current frame of the plurality of frames.
- the embedder may further be configured to remove, from the immediate playout frame, the additional information corresponding to the encoded audio sample values of the number of preceding frames of said current frame.
- this allows to individually remove immediate playout frames from a bitstream of encoded audio data, for example, in a case where an audio encoder generates immediate playout frames only.
- a non-transitory digital storage medium having a computer program stored thereon to perform a method for decoding a bitstream of encoded audio data, wherein the bitstream of encoded audio data represents a sequence of audio sample values and comprises a plurality of frames, wherein each frame comprises associated encoded audio sample values, when said computer program is run by computer or a processor.
- a non-transitory digital storage medium having a computer program stored thereon to perform a method of generating, by an audio encoder, a bitstream of encoded audio data with immediate playout frames, wherein the bitstream of encoded audio data represents a sequence of audio sample values and comprises a plurality of frames, wherein each frame comprises associated encoded audio sample values, when said computer program is run by a computer or a processor.
- FIG. 1 illustrates an example of an immediate playout frame in an MPEG-4 Audio bitstream of encoded audio data.
- FIG. 2 illustrates an example of a method for decoding a bitstream of encoded audio data, wherein the bitstream of encoded audio data represents a sequence of audio sample values and comprises a plurality of frames, wherein each frame comprises associated encoded audio sample values.
- FIG. 3 illustrates a further example of a method for decoding a bitstream of encoded audio data, wherein the bitstream of encoded audio data represents a sequence of audio sample values and comprises a plurality of frames, wherein each frame comprises associated encoded audio sample values.
- FIG. 4 illustrates an example of an audio decoder for decoding a bitstream of encoded audio data, wherein the bitstream of encoded audio data represents a sequence of audio sample values and comprises a plurality of frames, wherein each frame comprises associated encoded audio sample values.
- FIG. 5 illustrates an example of an audio encoder for generating a bitstream of encoded audio data with random access points (immediate playout frames, IPFs).
- FIG. 6 illustrates an example of a method for generating a bitstream of encoded audio data with random access points (immediate playout frames, IPFs).
- FIG. 7 illustrates an example of a system of audio encoders for generating a plurality of bitstreams of encoded audio data each having time-aligned random access points (immediate playout frames, IPFs).
- FIG. 8 illustrates an example of an apparatus for generating random access points (immediate playout frames, IPFs) in a bitstream of encoded audio data or for removing random access points (immediate playout frames, IPFs) from a bitstream of encoded audio data.
- FIG. 9 illustrates an example of a device having a processor for running a computer program stored on a non-transitory digital storage medium.
- the present invention relates to the creation (encoding), transportation (bitstream) and processing (decoding) of IPFs in MPEG-4 Audio, such as in the context of a data stream including audio standardized in accordance with another standard, such as the MPEG-H 3D audio standard.
- MPEG-4 Audio bitstreams may refer to bitstreams compatible with the standard set out in ISO/IEC 14496-3, Coding of audio-visual objects - Part 3: Audio, and all future editions, revisions and amendments thereto (“hereinafter MPEG-4 Audio").
- MPEG-4 Audio Coding of audio-visual objects - Part 3: Audio
- MPEG-4 Audio Coding of audio-visual objects - Part 3: Audio
- a bitstream of encoded audio data may contain a sequence of audio sample values, e.g.
- the bitstream of encoded audio data may further comprise a plurality of frames. Each frame may comprise associated encoded audio sample values. In other words, each payload packet may belong to a respective frame or AU.
- the above audio payload packet may be compatible with the present and future versions of the MPEG standard such as the MPEG-4 Audio standard.
- the bitstream of encoded audio data may be an MPEG-4 Audio bitstream (i.e. MPEG-4 Audio standard compliant bitstream).
- a frame of the bitstream of encoded audio data may be an immediate playout frame (random access point, special frame) comprising encoded audio sample values associated with a current frame and additional information.
- the additional information may comprise encoded audio sample values of a number of frames preceding the immediate playout frame, wherein the encoded audio sample values of the preceding frames may be encoded using the same codec configuration as the current frame.
- the number of preceding frames, corresponding to pre-roll frames may correspond to the number of frames needed by the decoder to build up the full signal so as to be in a position to output valid audio sample values associated with the current frame whenever an immediate playout frame is decoded.
- the full signal may, for example, be built up during start-up or restart of the decoder.
- the immediate playout frame may be, for example, the first frame upon start-up of the decoder.
- the additional information may be transported via an MPEG-4 Audio bitstream extension mechanism that may be either a Data Stream Element (ID DSE) or an extension_payload element.
- ID DSE Data Stream Element
- the extension_payload element may, for example, be contained at different places of the MPEG-4 Audio bitstream syntax, e.g. at different levels.
- the extension_payload element may be contained inside a fill element (ID FIL).
- the additional information may thus be transported via an MPEG-4 Audio bitstream extension mechanism, for example, based on one of the following options:
- a Data Stream Element (signaled via id syn ele equal to ID DSE) or a fill element (signaled via id syn ele equal to ID FIL), or equivalents thereof, as defined in ISO/IEC 14496-3 and/or future standards, may be used to carry extension payloads, which may be used to further extend the information which is transported in such a payload packet without breaking compatibility to legacy decoders.
- either a Data Stream Element (ID DSE) or an extension_payload element which may be contained inside a fill element (ID FIL) may be used in order to transport AUs and configuration information which are representing time- segments before AU n (i.e. additional information) within the same payload packet as AU n (i.e. current frame, immediate playout frame).
- ID DSE Data Stream Element
- ID FIL fill element
- an extension element (e.g. extension_payload element) may be inserted into an AU, where the element type is signaled in the bitstream for each element and AU.
- the extension element may be the first element in a payload packet, preceding the first audio element.
- Each audio pre-roll element defined below may be identified by a Universally Unique Identifier (UUID).
- UUID Universally Unique Identifier
- the UUID field may be used to signal by an audio encoder and detect by an audio decoder a switch of stream configurations. If the UUID field has changed relative to the previous frame or initial state (e.g. at first decoder startup) the stream configuration might have changed and the pre-roll payload must be evaluated to ensure correct decoding. If the UUID does not change relative to the previous frame, the decoder may skip the
- the decoder may compare the AudioSpecificConfig of the audio_preroll_element() with the current decoder configuration to detect a stream configuration switch.
- the field 'flags' is used as an 8-bit bitset that may be used to signal additional information to the decoder. This may be information on whether a crossfade shall be applied or the type of crossfade (e.g., linear, logarithmic). In the example below one bit is used to signal whether either the pre-roll payload or UUID are present or are not present in the bitstream.
- the stream configuration must be known to the decoder before processing any AU.
- the decoder configuration is conveyed in the AudioSpecificConfig-element. Decoder configuration and UUID are part of the pre-roll payload. Further, the pre-roll payload comprises a configuration dependent number of MPEG-4 Audio AUs (raw data block).
- An Immediate Playout Frame (IPF) in MPEG-4 may comprise exactly one
- Audio_preroll_element() payload as described below and one or several audio element streams (e.g. single_channel_element()) defined in ISO/IEC 14496-3.
- the audio elementary element streams are associated with the current timestamp.
- Pre-roll payload may be carried in one of the extension payload mechanisms of MPEG-4 Audio.
- Figure 1 shows an Immediate Playout Frame (AU n ) 1 comprising two pre-roll frames (AU n-i , AU n-2 ) 2, 3 as well as corresponding stream configuration 4 (AudioSpecificConfig) and Stream Identifier 5 (UUID).
- An Audio Pre-roll element may be defined based on the following:
- asc_size bs_asc_size; 8 uimbsf
- au_size bs_au_size; 8 uimbsf
- au_size + esc; 8 uimbsf raw_data_block(); au size * 8
- the audio pre-roll element (e.g., audio_preroll_element()) is byte aligned and can therefore be transmitted without further byte-alignment by both extension_payload element (e.g. inside a fill element) and Data Stream Element.
- the IPF relates to the pre-roll element as follows: IPFs include both the current AU, and the additional AUs (i.e. the number of preceding frames) which are needed for decoding correctly. The additional AUs are packaged as part of the pre-roll element, which in turn is packaged into the raw_data_block() (via ID DSE or ID FIF). Such a
- the audio_preroll_element() may be encapsulated in a Data Stream Element.
- ID DSE Data Stream Element
- the ID DSE may be located at a pre-defined position in the stream and/or may have a specific instance tag, signaling that the payload is an audio_preroll_element() .
- the audio_preroll_element() may be encapsulated in an
- extension_payload element inside a fill element.
- a convention may be used to identify the type of an
- extension_payload element such as that if the EXT DATA ELEMENT is conveyed at a certain pre-defined position in a payload packet, then the payload is an
- the extension_payload element may be located at a pre-defined position in the stream and/or may have a specific instance tag, signaling that the payload is an audio_preroll_element().
- EXT AUDIO PRE ROLL 101 Ob .
- the Data Stream Element (ID DSE) or the extension_payload element (e.g. inside the fill element (ID FIL)) carrying the audio_preroll_element() may be embedded into the bitstream before any audio element in the same payload packet.
- the decoding process may include the step S101 of receiving a bitstream of encoded audio data.
- the bitstream of encoded audio data may represent a sequence of audio sample values and may comprise a plurality of frames, wherein each frame may comprise associated encoded audio sample values.
- the method may further include the step S 102 of determining whether a frame of the bitstream of encoded audio data is an immediate playout frame.
- the immediate playout frame may comprise encoded audio sample values associated with a current frame and additional information.
- the additional information may comprise encoded audio sample values of a number of frames preceding the immediate playout frame, the encoded audio sample values of the preceding frames being encoded using the same codec configuration as the immediate playout frame.
- the number of preceding frames corresponding to pre-roll frames, may correspond to the number of frames needed by a decoder to build up the full signal so as to be in a position to output valid audio sample values associated with the current frame whenever an immediate playout frame is decoded.
- the full signal may, for example, be built up during start-up or restart of the decoder.
- the immediate playout frame may be, for example, the first frame upon start-up of the decoder.
- the method may include the step S 103 of initializing the decoder if it is determined that the frame is an immediate playout frame.
- the initializing may comprise decoding the encoded audio sample values comprised by the additional information before decoding the encoded audio sample values associated with the current frame.
- the audio decoder may be switched from a current codec configuration to a different codec configuration if it is determined that the frame is an immediate playout frame and if the audio sample values of the immediate playout frame have been encoded using the different codec configuration.
- the immediate playout frame may be decoded using the current codec configuration and the additional information may be discarded if it is determined that the frame is an immediate playout frame and if the audio sample values of the immediate playout frame have been encoded using the current codec configuration.
- the decoding process may utilize elements from IPF decoding in MPEG-D USAC. Referring again to the example of Figure 3, the decoding process may proceed in detail as follows.
- the decoding process may further include: If payload is not present, decode the respective frame and continue with the next frame
- this processing may allow switching of AudioObjectTypes (AOT) as defined in ISO/IEC 14496-3 in combination with continuously producing correct output samples and without introducing gaps of silence in the audio output.
- AOTs AudioObjectTypes
- the AOTs which are switched in between may include AOT 2 (AAC), AOT 5 (SBR), AOT 29 (PS) and other compatible ones.
- the audio decoder 100 may comprise a determiner 101.
- the determiner 101 may be configured to determine whether a frame of the bitstream of encoded audio data is an immediate playout frame comprising encoded audio sample values associated with a current frame and additional information.
- the additional information may comprise encoded audio sample values of a number of frames preceding the immediate playout frame.
- the encoded audio sample values of the preceding frames may be encoded using the same codec configuration as the current frame.
- the number of preceding frames may correspond to the number of frames needed by the decoder 100 to build up the full signal so as to be in a position to output valid audio sample values associated with the current frame whenever an immediate playout frame is decoded.
- the full signal may, for example, be built up during start-up or restart of the decoder 100.
- the immediate playout frame may be, for example, the first frame upon start-up of the decoder 100.
- the audio decoder 100 may comprise an initializer 102.
- the initializer 102 may be configured to initialize the decoder 100 if the determiner 101 determines that the frame is an immediate playout frame. Initializing the decoder 100 may comprise decoding the encoded audio sample values comprised by the additional information before decoding the encoded audio sample values associated with the current frame.
- the initializer 102 may further be configured to switch the audio decoder 100 from a current codec configuration to a different codec configuration if the determiner 101 determines that the frame is an immediate playout frame and if the audio sample values of the current frame have been encoded using the different codec configuration.
- the decoder 100 may be configured to decode the current frame using the current codec configuration and to discard the additional information if the determiner 101 determines that the frame is an immediate playout frame and if the audio sample values of the current frame have been encoded using the current codec configuration.
- an aspect of the present invention relates to an audio encoder for generating a bitstream of encoded audio data with immediate playout frames (random access points), wherein the bitstream of encoded audio data represents a sequence of audio sample values and comprises a plurality of frames, wherein each frame comprises associated encoded audio sample values.
- the audio encoder 200 may comprise a core encoder 202 configured to encode uncompressed audio sample values associated with the plurality of frames using a predefined codec configuration. Using a predefined codec configuration may, for example, include using a predetermined sampling rate. In one example, the core encoder 202 may encode the uncompressed audio samples to be compliant with the decoding of the MPEG-4 Audio standard.
- the audio encoder 200 may further comprise a buffer 203 configured to store encoded audio sample values of a number of preceding frames of a current frame of the plurality of frames encoded using the predefined codec configuration (as indicated by the dashed lines).
- the relevant previous frames N-l, N-2, ... may always be buffered/stored.
- the stored relevant previous frames N-l, N-2, ... may then be taken and packaged into the current frame N.
- the audio encoder 200 may comprise an embedder 204 configured to write an immediate playout frame in the current frame of the plurality of frames, wherein the immediate playout frame may comprise encoded audio sample values associated with said current frame and additional information corresponding to the encoded audio sample values of the number of preceding frames of said current frame.
- the embedder 204 is defined as being part of the audio encoder 200, it is noted that alternatively, or additionally, the embedder 204 may also be implemented individually to write immediate playout frames in any current frame of a bitstream of encoded audio data or to convert immediate playout frames in a bitstream of encoded audio data into "normal" frames by removing the additional information from the immediate playout frames. In this, the embedder 204 may be part of the encoder chain but does not necessarily have to.
- the embedder 204 may further be configured to include information on the predefined codec configuration in the additional information.
- the additional information may provide information on the predefined codec configuration to a decoder.
- the embedder 204 may further be configured to include in the immediate playout frame the additional information.
- the additional information may be transported in the bitstream to the decoder.
- the generated bitstream of encoded audio data may be an MPEG-4 Audio bitstream.
- the embedder 204 may further be configured to embed the additional information in the bitstream (e.g. for transport) via an MPEG-4 Audio bitstream extension mechanism that may be either a Data Stream Element (ID DSE) or an extension_payload element.
- an MPEG-4 Audio bitstream extension mechanism may be either a Data Stream Element (ID DSE) or an extension_payload element.
- the embedder 204 may further be configured to locate either the Data Stream Element (ID DSE) or the extension_payload element at a predefined position in the MPEG-4 Audio bitstream and/or to assign a specific instance tag signaling that a payload of the Data Stream Element (ID DSE) or the extension_payload element is the additional information.
- the predefined position may correspond to a first position in the MPEG-4 Audio bitstream, i.e. always the first in the frames as it may carry the decoder configuration that may be needed to decode a frame n ( Figure 1, raw_data_block()[n]).
- the decoder may thus assume that if the first element in the frame is an ID DSE or an extension_payload element (may be contained inside an ID FIL element) that this element carries pre-roll data (preceding frames, pre-roll frames).
- the embedder 204 may further be configured to include a unique identifier into the additional information.
- the unique identifier may signal the predefined codec configuration.
- the predefined codec configuration may then be used by the decoder to decode a frame n as stated above. Based on the unique identifier, the decoder may be able to identify the additional information in the bitstream and parse the bitstream accordingly.
- the audio encoder 200 may further be configured to not time-differentially encode or entropy encode an earliest frame of the number of frames comprised in the additional information relative to any frame previous to the earliest frame and the audio encoder 200 may further be configured to not time-differentially encode or entropy encode the immediate playout frame relative to any frame previous to the earliest frame of the number of frames preceding the immediate playout frame or relative to any frame previous to the immediate playout frame.
- an aspect of the present invention relates to a method of generating, by an audio encoder, a bitstream of encoded audio data with immediate playout frames (random access points), wherein the bitstream of encoded audio data represents a sequence of audio sample values and comprises a plurality of frames, wherein each frame comprises associated encoded audio sample values.
- the method may comprise the step S201 of encoding, by a core encoder, uncompressed audio sample values associated with the plurality of frames using a predefined codec configuration.
- a predefined codec configuration may, for example, include using a predetermined sampling rate.
- the method may further comprise the step S202 of storing, by a buffer, encoded audio sample values of a number of preceding frames of a current frame of the plurality of frames encoded using the predefined codec configuration.
- the method may comprise the step S203 of writing, by an embedder, an immediate playout frame in the current frame of the plurality of frames, wherein the immediate playout frame comprises encoded audio sample values associated with said current frame and additional information corresponding to the encoded audio sample values of the number of preceding frames of said current frame.
- the additional information may further comprise information on the predefined codec configuration.
- the predefined codec configuration may be used by a decoder in the decoding process as detailed above.
- the immediate playout frame may further comprise the additional information.
- the additional information may be transported in the bitstream.
- the generated bitstream of encoded audio data may be an MPEG-4 Audio bitstream.
- the additional information may be embedded in the bitstream (e.g. for transport), by the embedder, via an MPEG-4 Audio bitstream extension mechanism that is either a Data Stream Element (ID DSE) or an extension_payload element.
- ID DSE Data Stream Element
- extension_payload element an MPEG-4 Audio bitstream extension mechanism that is either a Data Stream Element (ID DSE) or an extension_payload element.
- extension_payload element may, for example, be contained at different places of the MPEG-4 Audio bitstream syntax.
- the extension_payload element may be embedded (e.g. for transport), by the embedder, inside a fill element (ID FIL).
- either the Data Stream Element (ID DSE) or the extension_payload element may be located, by the embedder, at a predefined position in the MPEG-4 Audio bitstream and/or may be assigned a specific instance tag signaling that a payload of the Data Stream Element (ID DSE) or the extension_payload element is the additional information.
- the predefined position may always be the first position in the frames as it may carry the decoder configuration that may be needed to decode the current frame.
- the additional information may further comprise a unique identifier.
- the unique identifier may signal the predefined codec configuration.
- an earliest frame of the number of frames comprised in the additional information may not be time-differentially encoded or entropy encoded, by the audio encoder, relative to any frame previous to the earliest frame and the immediate playout frame may not be time-differentially encoded or entropy encoded, by the audio encoder, relative to any frame previous to the earliest frame of the number of frames preceding the immediate playout frame or relative to any frame previous to the immediate playout frame.
- an aspect of the present invention relates to a system comprising two or more audio encoders for generating a plurality of bitstreams of encoded audio data each having immediate playout frames (time-aligned random access points), wherein each bitstream of encoded audio data represents a sequence of audio sample values and comprises a plurality of frames, and wherein each frame comprises associated encoded audio sample values.
- each bitstream of encoded audio data represents a sequence of audio sample values and comprises a plurality of frames, and wherein each frame comprises associated encoded audio sample values.
- Each encoding chain may output its own bitstream. Both bitstreams created that way may be stored, for example, on a web-server.
- a client may start playing back a stream A (e.g. at high bitrate). At some point in time, the client may decide to switch to a lower bitrate stream B, and therefore may request the same content, but in a different bitrate.
- the first segment of stream B arrives at a decoder, such a segment may always start with an IPF (this may be signaled e.g. via a manifest file according to MPEG-DASH), which enables the decoder to output correct audio from the very beginning.
- each of the audio encoders comprises a core encoder 202 (encoder core #1, encoder core #2), a buffer (not shown) and an embedder 204 (IPF insertion).
- a predetermined sampling rate may be the same for each of the core encoders 202. Otherwise, resampling and additional delay handling may be needed at the decoder side.
- the core encoders 202 may be configured to run at different frame rates (e.g. AAC-FC 1024; HE-AAC 2048). Further, core encoder configurations may require different number of pre-roll frames p. It may be required that both frame n-p and n are independently decodable, i.e. may not have to rely on information from previous frames (for HE-AAC they may contain the SBR header). After core encoding, the decoding times of IPFs may be aligned across the different streams.
- the system may further comprise a delay alignment unit 201 (delay# 1, delay#2) for delay aligning the plurality of bitstreams.
- delay alignment unit 201 delay# 1, delay#2
- the decoding times of immediate playout frames (IPFs) may have to be aligned.
- the delay alignment stage may delay the input PCM samples (uncompressed audio sample values, input audio sample values) to compensate for different encoder/decoder delays.
- an aspect of the present invention relates to an apparatus for generating immediate playout frames (random access points) in a bitstream of encoded audio data or for removing immediate playout frames (random access points) from a bitstream of encoded audio data, wherein the bitstream of encoded audio data represents a sequence of audio sample values and comprises a plurality of frames, wherein each frame comprises associated encoded audio sample values.
- the apparatus 300 may comprise a receiver 301 configured to receive a bitstream of encoded audio data, wherein the bitstream of encoded audio data represents a sequence of audio sample values and comprises a plurality of frames, wherein each frame comprises associated encoded audio sample values.
- the apparatus 300 may comprise an embedder 302 configured to write an immediate playout frame in a current frame of the plurality of frames, wherein the immediate playout frame comprises encoded audio sample values associated with said current frame and additional information corresponding to encoded audio sample values of a number of preceding frames of said current frame.
- the embedder 302 may operate in compliance with the principles described above in connection with Figures 5 to 7. This operation may be said to correspond to converting a“normal” (non-IPF) frame into an IPF.
- IPF insertion may take care of the actual copying and packaging. By splitting up core encoding and IPF insertion, as with the above apparatus, it may be possible to store the bitstreams without IPFs and only embed IPFs prior to distribution if needed. If configured for seamless switching, each IPF may have to contain the stream configuration ASC.
- the apparatus 300 may further comprise a buffer configured to store encoded audio sample values of the number of preceding frames of the current frame of the plurality of frames.
- the embedder 302 may further be configured to remove, from the immediate playout frame, the additional information corresponding to the encoded audio sample values of the number of preceding frames of said current frame. This may be said to correspond to converting an IPF into a "normal" frame. For example, an audio encoder may generate IPFs only which may then be removed later by the embedder depending on the constraints that a respective channel/service may have.
- aspects of the invention may include a non- transitory digital storage medium having a computer program stored thereon to perform methods described herein when said computer program is run by a computer or a processor.
- Figure 9 illustrates exemp lardy a device 400 having a processor 401 that may run said computer program.
- the device 400 may represent a respective computer.
- the methods and systems described herein may be implemented as software, firmware and/or hardware. Certain components may e.g. be implemented as software running on a digital signal processor or microprocessor. Other components may e.g. be implemented as hardware and/or as application specific integrated circuits.
- the signals encountered in the described methods and systems may be stored on media such as random access memory or optical storage media. They may be transferred via networks, such as radio networks, satellite networks, wireless networks or wireline networks, e.g. the Internet.
- networks such as radio networks, satellite networks, wireless networks or wireline networks, e.g. the Internet.
- Typical devices making use of the methods, apparatus and systems described herein are portable electronic devices or other consumer equipment which are used to store and/or render audio signals.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Computational Linguistics (AREA)
- Acoustics & Sound (AREA)
- Signal Processing (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Signal Processing For Digital Recording And Reproducing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Priority Applications (17)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| KR1020217008181A KR102852659B1 (ko) | 2018-08-21 | 2019-08-20 | 즉시 재생 프레임(ipf)의 생성, 전송 및 처리를 위한 방법, 장치 및 시스템 |
| JP2021509899A JP7455812B2 (ja) | 2018-08-21 | 2019-08-20 | 即時再生フレーム(ipf)の生成、輸送及び処理のための方法、装置及びシステム |
| CA3110137A CA3110137A1 (en) | 2018-08-21 | 2019-08-20 | Methods, apparatus and systems for generation, transportation and processing of immediate playout frames (ipfs) |
| EP19765660.6A EP3841571B1 (en) | 2018-08-21 | 2019-08-20 | Methods, apparatuses and systems for generation and processing of immediate playout frames (ipfs) |
| EP23163137.5A EP4224473A1 (en) | 2018-08-21 | 2019-08-20 | Methods, apparatus and systems for generation, transportation and processing of immediate playout frames (ipfs) |
| CN201980061575.5A CN112740325B (zh) | 2018-08-21 | 2019-08-20 | 即时播放帧(ipf)的生成、传输及处理的方法、设备及系统 |
| CN202410401067.XA CN118192926A (zh) | 2018-08-21 | 2019-08-20 | 即时播放帧(ipf)的生成、传输及处理的方法、设备及系统 |
| MX2021001970A MX2021001970A (es) | 2018-08-21 | 2019-08-20 | Métodos, aparatos y sistemas para generación, transporte y procesamiento de tramas de ejecución inmediata (ipfs). |
| AU2019323625A AU2019323625B2 (en) | 2018-08-21 | 2019-08-20 | Methods, apparatus and systems for generation, transportation and processing of immediate playout frames (IPFs) |
| BR112021003104-9A BR112021003104A2 (pt) | 2018-08-21 | 2019-08-20 | métodos, aparelho e sistemas para geração, transporte e processamento de quadros de reprodução imediata (ipfs) |
| KR1020257028463A KR20250133478A (ko) | 2018-08-21 | 2019-08-20 | 즉시 재생 프레임(ipf)의 생성, 전송 및 처리를 위한 방법, 장치 및 시스템 |
| US17/270,036 US11972769B2 (en) | 2018-08-21 | 2019-08-20 | Methods, apparatus and systems for generation, transportation and processing of immediate playout frames (IPFs) |
| CN202410401063.1A CN118192925A (zh) | 2018-08-21 | 2019-08-20 | 即时播放帧(ipf)的生成、传输及处理的方法、设备及系统 |
| JP2024038519A JP7831908B2 (ja) | 2018-08-21 | 2024-03-13 | 即時再生フレーム(ipf)の生成、輸送及び処理のための方法、装置及びシステム |
| US18/608,664 US12315524B2 (en) | 2018-08-21 | 2024-03-18 | Methods, apparatus and systems for generation, transportation and processing of immediate playout frames (IPFs) |
| AU2024227722A AU2024227722A1 (en) | 2018-08-21 | 2024-10-29 | Methods, apparatus and systems for generation, transportation and processing of immediate playout frames (IPFs) |
| US19/216,341 US20250336404A1 (en) | 2018-08-21 | 2025-05-22 | Methods, apparatus and systems for generation, transportation and processing of immediate playout frames (ipfs) |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US201862720680P | 2018-08-21 | 2018-08-21 | |
| US62/720,680 | 2018-08-21 |
Related Child Applications (2)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US17/270,036 A-371-Of-International US11972769B2 (en) | 2018-08-21 | 2019-08-20 | Methods, apparatus and systems for generation, transportation and processing of immediate playout frames (IPFs) |
| US18/608,664 Continuation US12315524B2 (en) | 2018-08-21 | 2024-03-18 | Methods, apparatus and systems for generation, transportation and processing of immediate playout frames (IPFs) |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2020038938A1 true WO2020038938A1 (en) | 2020-02-27 |
Family
ID=67902488
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/EP2019/072258 Ceased WO2020038938A1 (en) | 2018-08-21 | 2019-08-20 | Methods, apparatus and systems for generation, transportation and processing of immediate playout frames (ipfs) |
Country Status (10)
| Country | Link |
|---|---|
| US (3) | US11972769B2 (https=) |
| EP (2) | EP3841571B1 (https=) |
| JP (2) | JP7455812B2 (https=) |
| KR (2) | KR20250133478A (https=) |
| CN (3) | CN112740325B (https=) |
| AU (2) | AU2019323625B2 (https=) |
| BR (1) | BR112021003104A2 (https=) |
| CA (1) | CA3110137A1 (https=) |
| MX (2) | MX2021001970A (https=) |
| WO (1) | WO2020038938A1 (https=) |
Cited By (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN114979771A (zh) * | 2022-07-26 | 2022-08-30 | 北京智象信息技术有限公司 | 一种多媒体播放器视频预滚水位控制的方法、装置及介质 |
| WO2023021137A1 (en) * | 2021-08-19 | 2023-02-23 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio encoder, method for providing an encoded representation of an audio information, computer program and encoded audio representation using immediate playout frames |
Families Citing this family (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| BR112021003104A2 (pt) * | 2018-08-21 | 2021-05-11 | Dolby International Ab | métodos, aparelho e sistemas para geração, transporte e processamento de quadros de reprodução imediata (ipfs) |
| EP4014506B1 (en) * | 2019-08-15 | 2023-01-11 | Dolby International AB | Methods and devices for generation and processing of modified audio bitstreams |
| GB2614482A (en) * | 2020-09-25 | 2023-07-05 | Apple Inc | Seamless scalable decoding of channels, objects, and hoa audio content |
| CN115223579B (zh) * | 2021-04-20 | 2025-09-12 | 华为技术有限公司 | 一种编解码器协商与切换方法 |
| GB2636866A (en) * | 2023-12-28 | 2025-07-02 | Nokia Technologies Oy | An apparatus and method for immersive audio rendering |
Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20180197556A1 (en) * | 2013-10-18 | 2018-07-12 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Audio decoder, apparatus for generating encoded audio output data and methods permitting initializing a decoder |
| WO2018130577A1 (en) * | 2017-01-10 | 2018-07-19 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio decoder, audio encoder, method for providing a decoded audio signal, method for providing an encoded audio signal, audio stream, audio stream provider and computer program using a stream identifier |
Family Cites Families (20)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5963203A (en) * | 1997-07-03 | 1999-10-05 | Obvious Technology, Inc. | Interactive video icon with designated viewing position |
| PL1839297T3 (pl) * | 2005-01-11 | 2019-05-31 | Koninklijke Philips Nv | Skalowalne kodowanie/dekodowanie sygnałów audio |
| US8532984B2 (en) | 2006-07-31 | 2013-09-10 | Qualcomm Incorporated | Systems, methods, and apparatus for wideband encoding and decoding of active frames |
| US8190441B2 (en) | 2006-09-11 | 2012-05-29 | Apple Inc. | Playback of compressed media files without quantization gaps |
| CN101802907B (zh) | 2007-09-19 | 2013-11-13 | 爱立信电话股份有限公司 | 多信道音频的联合增强 |
| WO2011104463A1 (fr) | 2010-02-26 | 2011-09-01 | France Telecom | Compression de flux audio multicanal |
| EP3499503B1 (en) | 2010-04-13 | 2024-07-03 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Decoding sample-accurate representation of an audio signal |
| WO2014124377A2 (en) | 2013-02-11 | 2014-08-14 | Dolby Laboratories Licensing Corporation | Audio bitstreams with supplementary data and encoding and decoding of such bitstreams |
| WO2013163224A1 (en) | 2012-04-24 | 2013-10-31 | Vid Scale, Inc. | Method and apparatus for smooth stream switching in mpeg/3gpp-dash |
| US9830918B2 (en) | 2013-07-05 | 2017-11-28 | Dolby International Ab | Enhanced soundfield coding using parametric component generation |
| US9502045B2 (en) | 2014-01-30 | 2016-11-22 | Qualcomm Incorporated | Coding independent frames of ambient higher-order ambisonic coefficients |
| KR20240162584A (ko) | 2014-03-21 | 2024-11-15 | 돌비 인터네셔널 에이비 | 고차 앰비소닉스(hoa) 신호를 압축하는 방법, 압축된 hoa 신호를 압축 해제하는 방법, hoa 신호를 압축하기 위한 장치, 및 압축된 hoa 신호를 압축 해제하기 위한 장치 |
| PL3522554T3 (pl) | 2014-05-28 | 2021-06-14 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Procesor danych i transport danych kontrolnych użytkownika do dekoderów audio i modułów renderowania |
| EP2996269A1 (en) | 2014-09-09 | 2016-03-16 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio splicing concept |
| MX363719B (es) | 2015-03-09 | 2019-03-29 | Fraunhofer Ges Forschung | Codificación de audio alineada por fragmentos. |
| TWI693594B (zh) * | 2015-03-13 | 2020-05-11 | 瑞典商杜比國際公司 | 解碼具有增強頻譜帶複製元資料在至少一填充元素中的音訊位元流 |
| EP3332557B1 (en) | 2015-08-07 | 2019-06-19 | Dolby Laboratories Licensing Corporation | Processing object-based audio signals |
| RU2651902C1 (ru) | 2017-04-26 | 2018-04-24 | Федеральное государственное бюджетное образовательное учреждение высшего образования "Кубанский государственный аграрный университет имени И.Т. Трубилина" | Способ иппотерапии больных детским церебральным параличом в условиях черноморского побережья |
| US20190361917A1 (en) * | 2018-05-25 | 2019-11-28 | Bao Tran | Smart device |
| BR112021003104A2 (pt) * | 2018-08-21 | 2021-05-11 | Dolby International Ab | métodos, aparelho e sistemas para geração, transporte e processamento de quadros de reprodução imediata (ipfs) |
-
2019
- 2019-08-20 BR BR112021003104-9A patent/BR112021003104A2/pt unknown
- 2019-08-20 EP EP19765660.6A patent/EP3841571B1/en active Active
- 2019-08-20 CA CA3110137A patent/CA3110137A1/en active Pending
- 2019-08-20 KR KR1020257028463A patent/KR20250133478A/ko active Pending
- 2019-08-20 CN CN201980061575.5A patent/CN112740325B/zh active Active
- 2019-08-20 KR KR1020217008181A patent/KR102852659B1/ko active Active
- 2019-08-20 CN CN202410401067.XA patent/CN118192926A/zh active Pending
- 2019-08-20 MX MX2021001970A patent/MX2021001970A/es unknown
- 2019-08-20 JP JP2021509899A patent/JP7455812B2/ja active Active
- 2019-08-20 EP EP23163137.5A patent/EP4224473A1/en active Pending
- 2019-08-20 WO PCT/EP2019/072258 patent/WO2020038938A1/en not_active Ceased
- 2019-08-20 US US17/270,036 patent/US11972769B2/en active Active
- 2019-08-20 AU AU2019323625A patent/AU2019323625B2/en active Active
- 2019-08-20 CN CN202410401063.1A patent/CN118192925A/zh active Pending
-
2021
- 2021-02-18 MX MX2024007726A patent/MX2024007726A/es unknown
-
2024
- 2024-03-13 JP JP2024038519A patent/JP7831908B2/ja active Active
- 2024-03-18 US US18/608,664 patent/US12315524B2/en active Active
- 2024-10-29 AU AU2024227722A patent/AU2024227722A1/en active Pending
-
2025
- 2025-05-22 US US19/216,341 patent/US20250336404A1/en active Pending
Patent Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20180197556A1 (en) * | 2013-10-18 | 2018-07-12 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Audio decoder, apparatus for generating encoded audio output data and methods permitting initializing a decoder |
| WO2018130577A1 (en) * | 2017-01-10 | 2018-07-19 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio decoder, audio encoder, method for providing a decoded audio signal, method for providing an encoded audio signal, audio stream, audio stream provider and computer program using a stream identifier |
Non-Patent Citations (1)
| Title |
|---|
| MAX NEUENDORF ET AL: "Contribution to MPEG-H 3D Audio Version 1", 106. MPEG MEETING; 28-10-2013 - 1-11-2013; GENEVA; (MOTION PICTURE EXPERT GROUP OR ISO/IEC JTC1/SC29/WG11),, no. m31360, 23 October 2013 (2013-10-23), XP030059813 * |
Cited By (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2023021137A1 (en) * | 2021-08-19 | 2023-02-23 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio encoder, method for providing an encoded representation of an audio information, computer program and encoded audio representation using immediate playout frames |
| US12573409B2 (en) | 2021-08-19 | 2026-03-10 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio encoder, method for providing an encoded representation of an audio information, computer program and encoded audio representation using immediate playout frames |
| CN114979771A (zh) * | 2022-07-26 | 2022-08-30 | 北京智象信息技术有限公司 | 一种多媒体播放器视频预滚水位控制的方法、装置及介质 |
Also Published As
| Publication number | Publication date |
|---|---|
| AU2024227722A1 (en) | 2024-11-21 |
| CN112740325B (zh) | 2024-04-16 |
| JP2021535427A (ja) | 2021-12-16 |
| CA3110137A1 (en) | 2020-02-27 |
| JP2024083353A (ja) | 2024-06-21 |
| KR20250133478A (ko) | 2025-09-05 |
| JP7831908B2 (ja) | 2026-03-17 |
| KR20210043679A (ko) | 2021-04-21 |
| AU2019323625B2 (en) | 2024-08-08 |
| CN118192925A (zh) | 2024-06-14 |
| BR112021003104A2 (pt) | 2021-05-11 |
| EP4224473A1 (en) | 2023-08-09 |
| CN112740325A (zh) | 2021-04-30 |
| US11972769B2 (en) | 2024-04-30 |
| MX2021001970A (es) | 2021-05-31 |
| KR102852659B1 (ko) | 2025-08-29 |
| US20240347068A1 (en) | 2024-10-17 |
| JP7455812B2 (ja) | 2024-03-26 |
| AU2019323625A1 (en) | 2021-04-15 |
| US20210335376A1 (en) | 2021-10-28 |
| US12315524B2 (en) | 2025-05-27 |
| CN118192926A (zh) | 2024-06-14 |
| US20250336404A1 (en) | 2025-10-30 |
| EP3841571A1 (en) | 2021-06-30 |
| MX2024007726A (es) | 2024-07-01 |
| EP3841571B1 (en) | 2023-03-22 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US12315524B2 (en) | Methods, apparatus and systems for generation, transportation and processing of immediate playout frames (IPFs) | |
| US11770582B2 (en) | Frame conversion for adaptive streaming alignment | |
| US12495170B2 (en) | Audio splicing concept | |
| US9883213B2 (en) | Embedding encoded audio into transport stream for perfect splicing | |
| RU2792944C2 (ru) | Способы, устройство и системы для генерирования, передачи и обработки кадров немедленного воспроизведения (ipf) | |
| HK40051518A (en) | Methods, apparatus and systems for generation, transportation and processing of immediate playout frames (ipfs) | |
| HK40051518B (zh) | 即时播放帧(ipf)的生成、传输及处理的方法、设备及系统 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 19765660 Country of ref document: EP Kind code of ref document: A1 |
|
| DPE1 | Request for preliminary examination filed after expiration of 19th month from priority date (pct application filed from 20040101) | ||
| ENP | Entry into the national phase |
Ref document number: 3110137 Country of ref document: CA Ref document number: 2021509899 Country of ref document: JP Kind code of ref document: A |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| REG | Reference to national code |
Ref country code: BR Ref legal event code: B01A Ref document number: 112021003104 Country of ref document: BR |
|
| ENP | Entry into the national phase |
Ref document number: 20217008181 Country of ref document: KR Kind code of ref document: A |
|
| ENP | Entry into the national phase |
Ref document number: 2019765660 Country of ref document: EP Effective date: 20210322 |
|
| ENP | Entry into the national phase |
Ref document number: 2019323625 Country of ref document: AU Date of ref document: 20190820 Kind code of ref document: A |
|
| ENP | Entry into the national phase |
Ref document number: 112021003104 Country of ref document: BR Kind code of ref document: A2 Effective date: 20210219 |
|
| WWD | Wipo information: divisional of initial pct application |
Ref document number: 1020257028463 Country of ref document: KR |