WO2011012072A1 - 转码方法、装置、设备以及系统 - Google Patents
转码方法、装置、设备以及系统 Download PDFInfo
- Publication number
- WO2011012072A1 WO2011012072A1 PCT/CN2010/075497 CN2010075497W WO2011012072A1 WO 2011012072 A1 WO2011012072 A1 WO 2011012072A1 CN 2010075497 W CN2010075497 W CN 2010075497W WO 2011012072 A1 WO2011012072 A1 WO 2011012072A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- codec
- delay
- code stream
- coded
- stream
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 106
- 239000010410 layer Substances 0.000 claims description 132
- 230000003111 delayed effect Effects 0.000 claims description 54
- 239000012792 core layer Substances 0.000 claims description 26
- 230000001934 delay Effects 0.000 claims description 20
- 238000004364 calculation method Methods 0.000 claims description 2
- 238000010586 diagram Methods 0.000 description 11
- 230000006870 function Effects 0.000 description 6
- 230000005540 biological transmission Effects 0.000 description 4
- 238000004891 communication Methods 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000005236 sound signal Effects 0.000 description 1
- 239000002699 waste material Substances 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/173—Transcoding, i.e. converting between two coded representations avoiding cascaded coding-decoding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/24—Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding
Definitions
- the embodiments of the present invention relate to a transcoding technology of a signal code stream, and in particular, to a transcoding method, device, device, and system.
- the embedded voice audio codec is composed of a core layer and several extension layers.
- the core layer is usually the existing traditional narrowband or wideband codec, and the extension layer can be wideband, ultra-wideband or even full-band extension. It can also be stereo, multi-channel expansion.
- the embedded voice audio codec can be compatible with existing traditional codecs, and at the same time can provide more extended functions as needed.
- transcoding To communicate between different embedded codecs, transcoding must be performed, that is, decoding and re-encoding.
- the existing transcoding method usually decodes the coded stream passed by the first encoder, and then uses the second encoder to be transcoded for encoding. As shown in FIG. 1, when transcoding, the first encoding After the core layer code stream and all received extension layer code streams are decoded by the encoder, the second encoder to be transcoded is used for re-encoding.
- the purpose of the embodiments of the present invention is to provide a method, device, device, and system for transcoding, so as to reduce the complexity of encoding and decoding during the transcoding process and improve signal quality.
- an embodiment of the present invention provides an encoding method, including:
- the first code stream includes at least one extended layer code stream in the input code stream obtained after the input signal is coded by the first codec
- the first decoded signal obtained by using the first codec to decode the remaining coded streams in the input code stream is adjusted to obtain an adjustment signal, and the remaining coded streams include the input signal using the first decoded signal.
- the remaining layer code streams including the core layer code stream except for the first coded code stream.
- the first codec is used to decode the remaining coded streams in the input code stream to obtain a first decoded signal, and the remaining coded streams include the input code stream obtained after the input signal is coded by the first codec In addition to the first code stream, the remaining layer code streams including the core layer code stream;
- Another embodiment of the present invention also provides a transcoding device, including:
- the delayed integer frame module is used to delay the first code stream in the input code stream by integer frames, where the first code stream includes the input code stream obtained after the input signal is coded by the first codec middle At least one extended layer code stream;
- the delay alignment module is configured to perform delay alignment adjustment on the first decoded signal obtained by using the first codec to decode the remaining coded streams in the input code stream to obtain an adjusted signal, and the remaining coded streams include In the input code stream obtained after the input signal is encoded by the first codec, the remaining layer code streams including the core layer code stream except for the first coded code stream.
- a transcoding device including:
- the receiving module is configured to receive the first coded code stream in the input code stream and decode the first coded code stream obtained by using the first codec to decode the coded code streams other than the first coded code stream in the input code stream.
- a decoded signal where the first coded code stream includes at least one extended layer code stream in the input code stream obtained after the input signal is coded by the first codec;
- a delay integer frame module configured to delay the first code stream by integer frames
- the delay alignment module is configured to perform delay alignment adjustment on the first decoded signal to obtain an adjustment signal, and the remaining coded stream includes the input signal obtained after the first codec is coded by dividing the input code stream The remaining layer code streams including the core layer code stream outside the first coded code stream.
- the output device is configured to output the adjustment signal and the first code stream delayed by the integer frame to the second codec.
- a transcoding system including:
- the first codec is configured to decode the remaining coded code streams to obtain a first decoded signal, where the remaining coded code streams include the input code stream obtained after the input signal is coded by the first codec divided by the first codec The code stream of the other layers including the core layer code stream;
- a transcoding device configured to delay the first encoded code stream by an integer number of frames, and perform a delay alignment adjustment on the first decoded signal to obtain an adjusted signal
- a second codec configured to encode the adjustment signal to obtain a second coded stream, and multiplex the second An encoded bitstream and the first encoded bitstream delayed by an integer frame.
- Another embodiment of the present invention also provides a mobile station of the above transcoding device.
- Another embodiment of the present invention also provides a network element of the above transcoding device. Therefore, by introducing the transcoding method, device, device, and system of the embodiment of the present invention, only part of the code stream needs to be re-encoded and decoded, which greatly reduces the transcoding complexity of the embedded codec. The stream is not re-encoded and decoded so that the signal quality in transcoding is effectively improved.
- FIG. 1 is a schematic diagram of prior art transcoding
- Figure 2 is a schematic diagram of an embodiment of a transcoding method of the present invention.
- Fig. 3 is a schematic diagram of another embodiment of a transcoding method according to the present invention
- Fig. 4 is a schematic diagram of another embodiment of a transcoding method according to the present invention
- FIG. 5 is a schematic diagram of another embodiment of a transcoding method according to the present invention.
- Fig. 6 is a schematic diagram of another embodiment of a transcoding method according to the present invention.
- Fig. 7 is a schematic diagram of an embodiment of a transcoding device of the present invention.
- Figure 8 is a schematic diagram of another embodiment of a transcoding device according to the present invention.
- Figure 9 is a schematic diagram of an embodiment of a transcoding device of the present invention.
- Figure 10 is a schematic diagram of an embodiment of a transcoding device of the present invention.
- Figure 11 is a schematic diagram of an embodiment of a transcoding system of the present invention.
- An embodiment of the present invention provides a method for transcoding between two embedded codecs. As shown in FIG. 2, the method includes the following steps:
- the encoding method and bandwidth of the at least one extension layer by the first codec are consistent with the encoding method and bandwidth of the at least one extension layer by the second codec.
- the remaining layer code streams including the core layer code stream except for the first coded code stream.
- the relationship between the delay of the second coded stream obtained by encoding the adjustment signal by the second codec and the delay of the first coded stream by an integer frame corresponds to the second codec
- the first codec and the second codec may be a whole codec that is physically synthesized together, or may be separate physical entities, that is, the first codec includes the first encoder and the The first decoder is respectively used for encoding and decoding of the code stream, and the same description is applicable to the following embodiments.
- the transcoding method of the embodiment of the present invention only needs to re-encode and decode part of the code stream, and adjust the delay alignment of the code stream that is no longer decoded and re-encoded and perform the delay integer frame operation on the re-encoded and decoded part of the code stream.
- the transcoding complexity of the embedded codec is greatly reduced, and the signal quality in the transcoding is effectively improved because part of the code stream is not re-encoded and decoded.
- Another embodiment of the present invention provides a method for transcoding between two embedded codecs. As shown in FIG. 3, the method includes the following steps:
- the information of the integer frames with a preset configuration delay may include the number of integer frames with a preset configuration delay or the length of the integer frames with a delay, and the delay integer frame information is set by means of module loading or system presets, where The data of the integer frame or the length of the delayed integer frame is determined according to the frame length of the signal frame, combined with the delay of the encoding and decoding of the first codec and the second codec, and can be embodied as the same as the aforementioned signal frame. Length, the first codec and the second codec encoding and decoding the delay-related function of the code stream.
- S202 Delay the first encoded code stream by a preset integer frame according to the information of the delayed integer frame.
- the first coded code stream is obtained by encoding at least one extension layer using a first codec, and the at least one extension layer uses the method and bandwidth of the first codec encoding and uses the first codec required for transcoding.
- the two codecs have one less extended layer coding method and the same bandwidth. Since the encoding methods of the at least one extension layer before and after transcoding are the same, unnecessary operations will be caused by re-decoding and re-encoding, which will seriously increase the complexity of transcoding. This implementation is aimed at the at least one extension layer.
- the first coded stream encoded by the first codec is delayed by the preset integer frame according to the information of the preset delayed integer frame. At this time, the bit information in the code stream is not processed. It takes the code stream as a whole to carry out the delay operation of the transmission frame length.
- the preset adjustment delay information may include the preset adjustment delay time or the adjustment delay time length.
- the adjustment delay information is set by means of module loading or system presets, wherein the preset adjustment delay time length is based on the signal
- the frame length of the frame is determined in combination with the delay of the first codec and the second codec encoding and decoding the code stream and the integer frame delay of the first code stream.
- this step can also be executed immediately after step S201.
- S204 Perform time delay alignment adjustment on the first decoded signal to obtain an adjusted signal.
- the first decoded signal is obtained by decoding the remaining coded code streams including the core layer coded code stream except the first coded code stream by the first codec, and the remaining coded code streams are obtained by comparing the at least one codec code stream to the at least one codec.
- the remaining layers outside the extension layer are coded by the first codec.
- the relationship between the delay of the second coded stream obtained by encoding the adjustment signal by the second codec and the delay of the first coded stream by an integer frame corresponds to the second codec
- the adjustment of the first and second time points in S202 and this step can ensure Multiplexing and outputting the time delay relationship between the code streams used for the second decoding, and the code stream after only the at least one extension layer is encoded by the second codec and the other layers except the at least one extension layer pass through the first The time delay relationship between the code streams encoded by the two codecs corresponds.
- the transcoding method of the embodiment of the present invention only needs to re-encode and decode part of the code stream, and adjust the delay alignment of the code stream that is no longer decoded and re-encoded and perform the delay integer frame operation on the re-encoded and decoded part of the code stream.
- the transcoding complexity of the embedded codec is greatly reduced, and the signal quality in the transcoding is effectively improved because part of the code stream is not re-encoded and decoded.
- the transcoding between most embedded codecs can also reduce the extra delay caused by transcoding, for example, it is possible to avoid any extra delay during transcoding.
- Example three A more specific embodiment of the present invention provides a method for transcoding between two embedded codecs, as shown in Fig. 4:
- the G.722 ultra-wideband extended codec is used as the first codec
- the G.711.1 ultra-wideband codec is used as the second codec
- at least one extended layer that does not need to be decoded and re-encoded is marked as Mlayer, which is the ultra-wideband coding layer shown in the figure, is directly multiplexed with the code stream encoded by the first encoder.
- Mlayer is the ultra-wideband coding layer shown in the figure
- Nlayer needs to pass through the first encoder. After encoding, it is decoded, and then encoded by the second encoder.
- the embodiment of the present invention needs to align the time point of the re-encoded Nlayer with the time point of the directly multiplexed Mlayer. Therefore, the Mlayer needs to be delayed in the transcoding process. Alignment operation and delay integer frame adjustment for Nlayer.
- the delay of the first encoder to Nlayer is ell, the delay to Mlayer is el2, the delay of the first decoder to Nlayer is dll, the delay to Mlayer is dl2, and the delay of the second encoder to Nlayer and Mlayer
- the time is e21, e22, the delay of the second decoder to Mayer and Mlayer is d21, d22, and the frame length of the signal frame is T-frame.
- L and D can be expressed as follows:
- the delayed integer frame and the length of the delay alignment adjustment, the delay of the first encoder to Nlayer and Mlayer, the delay of the first decoder to Nlayer, the delay of the second encoder to Nlayer, and the second decoding The delay of the device to Nlayer and Mlayer is related.
- the above delays can be zero, that is, there is no delay or other values.
- the decoding signal of Nlayer is delayed by 3.125ms
- the code stream of Mlayer is delayed by 2 integer frames.
- the first codec is the G.711.1 ultra-wideband extension codec
- the second codec is the G.722 ultra-wideband extension codec.
- Another embodiment of the present invention provides a method for transcoding between two embedded codecs. As shown in FIG. 5, the method includes the following steps:
- S301 Delay an integer frame for a first code stream in the input code stream, where the first code stream includes at least one extended layer code stream of the input code stream obtained after the input signal is coded by the first codec;
- S303 Perform time delay alignment adjustment on the first decoded signal to obtain an adjusted signal.
- the transcoding method of the embodiment of the present invention only needs to re-encode and decode part of the code stream, and adjust the delay alignment of the code stream that is no longer decoded and re-encoded and perform the delay integer frame operation on the re-encoded and decoded part of the code stream. It greatly reduces the transcoding complexity of the embedded codec and further improves the reliability. Number encoding and decoding effect, because part of the code stream has not been re-encoded and decoded, the signal quality during transcoding and the encoding and decoding process is effectively improved.
- the transcoding between most embedded codecs can also reduce the extra delay caused by transcoding, for example, it is possible to avoid any extra delay during transcoding. Example five
- Another embodiment of the present invention provides a method for transcoding between two embedded codecs, as shown in FIG. 6: Before the method is implemented, the method includes step S401.
- the first codec encodes an input signal to obtain an encoded bitstream.
- the first codec is an encoder used for the first encoding before transcoding
- the first codec is an embedded codec
- the encoded input signal includes an extension layer signal and a core layer signal
- the first encoder encodes The formation of the input signal corresponds to the code stream.
- the code stream formed by encoding at least one extension layer is called the first code stream.
- the encoding method and bandwidth of the encoder before and after the at least one extension layer transcoding is the same, that is, The encoding method and bandwidth of the at least one extension layer by the first codec are the same as the encoding method and bandwidth of the at least one extension layer by using the second codec.
- the code stream formed by encoding the remaining layers except at least one extension layer is called the remaining code stream, which includes the core layer code stream.
- S402 Delay the first encoded code stream by integer frames.
- the encoding methods of at least one extension layer before and after transcoding are the same. Re-decoding and re-encoding will cause unnecessary operations and severely increase the complexity of transcoding. However, if appropriate operations are not performed, it will The time delay caused by the multiplexing of the first code stream and the time delay relationship when the code stream formed by using the second encoder to encode at least one extension layer is multiplexed does not match, for the at least one extension layer.
- the first coded code stream encoded by the first codec is delayed by the preset integer frame according to the preset delayed integer frame information. At this time, the bit information in the code stream is not processed. The processing is to take the code stream as a whole to carry out the delay operation of the transmission frame length.
- the information of the integer frames with a preset configuration delay may include the number of integer frames with a preset configuration delay or the length of the integer frames with a delay.
- the delay integer frame information is loaded by the module or preset by the system. Setting, wherein the data of the integer frame or the length of the delayed integer frame is determined according to the frame length of the signal frame, combined with the delay of the encoding and decoding of the first codec and the second codec, which can be embodied as the same as the above
- the signal frame length, the first codec and the second codec encoding and decoding a function related to the delay of the code stream.
- the first coded stream is obtained by encoding at least one extension layer using a first codec, and the at least one extension layer uses the method and bandwidth of the first codec encoding and uses the first codec required for transcoding.
- the two codecs have one less extended layer coding method and the same bandwidth. Since the encoding methods of the at least one extension layer before and after transcoding are the same, unnecessary operations will be caused when re-decoding and re-encoding, which seriously increases the complexity of transcoding.
- the first coded stream encoded by the first codec is delayed by the preset integer frame according to the preset delayed integer frame information. At this time, the bit information in the code stream is not processed. It takes the code stream as a whole to carry out the delay operation of the transmission frame length.
- the method of presetting the information of the delayed integer frame is not limited to being executed after step S401. Due to the mutual correlation between the two steps, the method can be implemented at other times to ensure that the information of the delayed integer frame is matched to the first A code stream delays the execution of a preset integer frame step.
- the first codec decodes the remaining code streams. For the remaining layers except at least one extension layer, because the encoding method and encoding bandwidth of the two encodings before and after the transcoding process are different, it needs to be encoded and then decoded.
- the first codec decodes the remaining coded code streams to obtain a first decoded signal, and the first decoded signal is used in the transcoding method to be provided to the second codec for an encoding operation.
- S404 Perform time delay alignment adjustment on the first decoded signal to obtain an adjusted signal.
- the first decoded signal is decoded by the first codec except for the first coded stream
- the remaining coded code streams including the core layer coded code stream are obtained, and the remaining coded code streams are obtained by encoding the remaining layers except for the at least one extension layer using the first codec.
- the relationship between the time delay of the second coded stream and the time delay of the first coded stream after being delayed by an integer number of frames corresponds to the coding time delay of the at least one extended layer by the second codec and the codec
- the remaining coded streams need to be
- the first time point is aligned with the second time point of the at least one extension layer, or forms the correspondence between the delay relationships as described above.
- the delay relationship between the multiplexed output code streams for the second decoding can be ensured, and after only at least one extension layer is encoded by the second codec
- the code stream corresponds to the time delay relationship between code streams encoded by the second codec in the remaining layers except for the at least one extension layer.
- the multiplexed code stream is provided to the second codec for decoding operation to recover the original signal.
- the steps of adjusting the delay information required by the preset delay integer frame information and the delay alignment adjustment in the foregoing embodiments may also use other methods.
- the information of the integer frames of the delay delay and the adjustment delay information required for the delay alignment adjustment are determined in a preset manner, that is, it may have been completed during the completion of the system construction or the process of modularization. It is preset that, in the implementation process of transcoding, only the preset information is read to delay the first code stream by integer frames And the adjustment signal may be obtained by adjusting the delay alignment of the first decoded signal.
- the first codec and the first codec may be combined according to the frame length of the signal frame in the process of transcoding.
- the delay of the second codec encoding and decoding the code stream is implemented in a real-time or in advance manner of determining the information of the delayed integer frame and the adjustment delay information required for the delay alignment adjustment.
- the step of adjusting the delay information required by the preset delay integer frame information and the delay alignment adjustment in the foregoing embodiments may also be preset or in the process of transcoding According to the frame length of the signal frame, the information of one of them is determined according to the delay of encoding and decoding of the first codec and the second codec, and then the other is further configured according to the information of the determined one.
- Information, the integer frame and the adjustment delay satisfy the following relationship: the adjustment delay is based on the frame length of the signal frame, combined with the delay of the first codec and the second codec to encode and decode the code stream Related to the integer frame.
- a universal transcoding method is provided.
- the delay is determined according to different codec parameters
- the information of the integer frame and the adjustment delay information required for the delay alignment adjustment provide the operation implementation of the delay integer frame and the delay alignment adjustment.
- the first codec and the second codec can be not only two simple and independent codecs, but also relatively complex structures, such as in Partial Mixing applications. middle.
- Partial Mixing is an efficient mixing method in a conference system. It decodes and mixes the traditional narrowband or wideband code stream in the multi-channel coded signal, but only selects the extended layer code stream of a specific coded signal as the mixing method. The extended layer code stream after the audio. When the receiving end still needs to transcode, the traditional transcoding method will waste the advantages of Partial Mixing.
- the transcoding method provided by the embodiment of the present invention, only the narrowband or wideband code stream after mixing is used as ( The NM) layer performs delay alignment adjustment, and the extended layer code stream acts as a Mlayer for delay integer frame adjustment.
- This kind of transcoding method still does not need to decode the extension layer code stream, which greatly improves the efficiency of transcoding and coding and decoding.
- the transcoding method of the embodiment of the present invention only needs to re-encode and decode part of the code stream,
- the time delay alignment adjustment of the re-encoded code stream and the delayed integer frame operation of the re-encoded and decoded part of the code stream can greatly reduce the transcoding complexity of the embedded codec, because part of the code stream is not re-encoded.
- Decoding effectively improves the signal quality in transcoding. It provides a universal transcoding method for transcoding between different codecs. For most embedded codecs, transcoding can also reduce the extra delay caused by transcoding. For example, no extra delay can be generated during transcoding. The delay.
- transcoding between two embedded codecs includes:
- the delayed integer frame module 501 is configured to delay the first code stream in the input code stream by integer frames, where the first code stream includes the input code stream obtained after the input signal is coded by the first codec At least one extended layer code stream;
- the delay alignment module 502 is configured to perform delay alignment adjustment on the first decoded signal obtained by using the first codec to decode the remaining coded stream in the input code stream to obtain an adjustment signal, and the remaining coded stream includes the input In the input code stream obtained after the signal is coded by the first codec, the remaining layer code streams including the core layer code stream except for the first coded code stream.
- the transcoding device of the embodiment of the present invention only needs to re-encode and decode part of the code stream, and adjust the delay alignment by using the delay alignment module for the code streams that are no longer decoded and re-encoded, and use the integer frame adjustment module to re-encode and decode the code stream. Delayed integer frame operations on part of the code stream can greatly reduce the transcoding complexity of the embedded codec. Because part of the code stream is not re-encoded and decoded, the signal quality in the transcoding is effectively improved. The transcoding between most embedded codecs can also reduce the extra delay caused by the transcoding, for example, no extra delay can be generated during the transcoding. Example Seven
- transcoding between two embedded codecs includes:
- the delay integer frame module 501 is configured to delay the first code stream by integer frames.
- the first coded code stream is obtained by encoding at least one extension layer using a first codec, and the delayed integer frame module 501 delays the first coded code stream by a preset integer frame according to the information of the delayed integer frame Perform a delay integer frame operation on the first code stream.
- the delay alignment module 502 is configured to perform delay alignment adjustment on the first decoded signal to obtain an adjustment signal.
- the remaining coded code streams of the layer coded code stream are obtained by encoding the remaining layers except the at least one extension layer using the first codec, and the delay alignment module 502 aligns according to the delay
- the adjustment delay information required by the adjustment performs a delay alignment adjustment on the first decoded signal.
- the device also includes a first delay module 503, which is used to determine information about delayed integer frames.
- the first delay module 503 determines the integer frame delaying the first coded stream according to the frame length of the signal frame in combination with the delays of the encoding and decoding of the first codec and the second codec.
- the second delay module 504 is configured to determine an adjustment delay for performing a delay alignment adjustment on the decoded signal.
- the frame length of the signal frame combined with the delay of the first codec and the second codec to encode and decode the code stream, and the integer frame delay of the first code stream to determine the decoded signal
- the adjustment delay of the delay alignment adjustment is
- the above-mentioned first delay module and the second delay module can be embedded in the system as a preset parameter module when the system is set up.
- the signal is adjusted for delay alignment to obtain the adjusted signal and call directly.
- the above-mentioned first delay module and second delay module may also be implemented when the delay integer frame module 501 implements the delay of the integer frame to the first code stream and the delay alignment module 502 performs the first decoding process during the operation of the transcoding device. Called when the signal is adjusted for delay alignment.
- the above-mentioned first delay module and the second delay module may be physically separate entities or independent entities, that is, they may be physically independent from the delay integer frame module and the delay alignment module, as shown in the figure As shown in 8, the first delay module may also be set in the delay integer frame module and the second delay module may be set in the delay alignment module.
- the encoding method and bandwidth of the at least one extension layer by the first codec are consistent with the encoding method and bandwidth of the at least one extension layer by the second codec, and the delay of the second codec stream is equal to
- the relationship between the delays after the first coded stream is delayed by an integer number of frames corresponds to the relationship between the coding delays of encoding the at least one extension layer by the second codec and the coding delays of encoding the remaining layers.
- the transcoding device of the embodiment of the present invention only needs to re-encode and decode part of the code stream, and adjust the delay alignment by using the delay alignment module for the code streams that are no longer decoded and re-encoded, and use the integer frame adjustment module to re-encode and decode the code stream. Delayed integer frame operations on part of the code stream can greatly reduce the transcoding complexity of the embedded codec. Because part of the code stream is not re-encoded and decoded, the signal quality in the transcoding is effectively improved. The transcoding between most embedded codecs can also reduce the extra delay caused by the transcoding, for example, no extra delay can be generated during the transcoding.
- transcoding device As shown in FIG. 9, transcoding is performed between two embedded codecs.
- the seventh embodiment includes:
- the delayed integer frame module 501 is configured to delay the integer frame of the first coded stream and the delay alignment module 502 is configured to perform delay alignment adjustment on the first decoded signal to obtain an adjusted signal.
- the device also includes a parameter configuration module 505, configured to configure the integer frames that delay the first coded stream and/or the adjusted delay according to the first codec and the second codec,
- the integer frame and the adjustment delay satisfy the following relationship:
- the adjustment delay is related to the integer frame according to the frame length of the signal frame, combined with the first codec and the second codec to encode and decode the code stream. That is, by configuring the integer frame delay for the first code stream (adjustment delay), the adjusted delay (the integer delay for the first code stream delay) is obtained through the above relationship frame).
- the coding method and bandwidth of the at least one extension layer are consistent with the coding method and bandwidth of the at least one extension layer by the second codec, and the delay of the second code stream is the same as that of the first code stream.
- the relationship between the delays after the time-integer frame corresponds to the relationship between the encoding delay of encoding the at least one extension layer by the second codec and the encoding delay of encoding the remaining layers.
- the parameter configuration module may also be included in the transcoding device as well as the first delay module and the second delay module, and the parameter configuration module provides the first delay module and the second delay module.
- the parameter information required by the delay module, or the parameter configuration module determines the information of the integer frame delayed by the first delay module, and provides this parameter to the second delay module, and vice versa, the parameter configuration module
- the cooperative operation with the first delay module and the second delay module may be determined according to different implementation environments, and is not limited to this.
- the transcoding device of the embodiment of the present invention only needs to re-encode and decode part of the code stream, and adjust the delay alignment by using the delay alignment module for the code streams that are no longer decoded and re-encoded, and use the integer frame adjustment module to re-encode and decode the code stream. Delayed integer frame operations on part of the code stream can greatly reduce the transcoding complexity of the embedded codec. Because part of the code stream is not re-encoded and decoded, the signal quality in the transcoding is effectively improved. Provides a universal transcoding method for transcoding between different codecs. For most embedded codecs, transcoding can also reduce the extra delay caused by transcoding. For example, no extra delay can be generated during transcoding. The delay. Example 9
- An embodiment of the present invention provides a transcoding device. As shown in FIG. 10, the transcoding device implements a transcoding operation between two embedded codecs, and its module is used to receive the first codec. The output coded and decoded signals to be processed are transcoded on the received signal, and the processed signal is provided to the second codec for decoding operation.
- the transcoding device includes
- the receiving device 506 is configured to receive the first coded code stream in the input code stream and the first coded stream obtained by using the first codec to decode other coded code streams in the input code stream except for the first coded code stream.
- one A decoded signal where the first coded code stream includes at least one extended layer code stream in the input code stream obtained after the input signal is coded by the first codec;
- a delay integer frame module 501 configured to delay the first code stream by integer frames
- the delay alignment module 502 is configured to perform delay alignment adjustment on the first decoded signal to obtain an adjustment signal, and the remaining coded stream includes the input code stream obtained after the input signal is encoded by the first codec divided by The remaining layer code streams including the core layer code stream outside the first coded code stream.
- the output device 507 is configured to output the adjustment signal and the first coded stream delayed by an integer frame to the second codec.
- the delayed integer frame module and the delay alignment module are the same as the delayed integer frame module and the delay alignment module in each embodiment of the transcoding device.
- the encoding method and bandwidth of the at least one extension layer by the first codec is consistent with the encoding method and bandwidth of the at least one extension layer by the second codec, and the encoding method and bandwidth of the second codec are used.
- the relationship between the time delay of the second coded stream obtained by adjusting the signal and the time delay after the first coded stream is delayed by an integer number of frames corresponds to the time when the second codec encodes the at least one extension layer
- the relationship between the delay and the encoding delay of encoding the remaining layers is consistent with the encoding method and bandwidth of the at least one extension layer by the second codec, and the encoding method and bandwidth of the second codec are used.
- the device further includes other modules as described in the foregoing transcoding device embodiment, as shown in FIG. 10.
- the transcoding device of the embodiment of the present invention only needs to re-encode and decode part of the code stream, and adjust the delay alignment by using the delay alignment module for the code stream that is no longer decoded and re-encoded, and adjust the integer frame module to re-encode and decode the code stream. Delayed integer frame operations on part of the code stream can greatly reduce the transcoding complexity of the embedded codec. Because part of the code stream is not re-encoded and decoded, the signal quality in the transcoding is effectively improved. Provides a universal transcoding method for transcoding between different codecs. For most embedded codecs, transcoding can also reduce the extra delay caused by transcoding. For example, no extra delay can be generated during transcoding. The delay.
- Example ten An embodiment of the present invention provides a transcoding system. As shown in FIG. 11, the transcoding system includes a first codec 508, a transcoding device 50, and a second codec 509.
- the first codec implements the decoding function and is used to decode the remaining coded streams to obtain the first decoded signal.
- the remaining coded streams include the input signal obtained after the input signal is encoded by the first codec.
- the remaining layer code streams including the core layer code stream other than the first coded code stream; a transcoding device, configured to delay the first coded code stream by an integer number of frames, and perform time delay alignment on the first decoded signal.
- the adjustment signal is obtained by adjustment; a second codec, which implements an encoding function, is used to encode the adjustment signal to obtain a second coded code stream, and multiplex the second coded code stream and the first code after the delayed integer frame Code stream.
- the first codec may also encode the input signal to obtain a first coded code stream corresponding to at least one extension layer and the remaining coded code streams including the core layer coded stream.
- the encoding method and bandwidth of the at least one extension layer are consistent with the encoding method and bandwidth of the at least one extension layer by the second codec.
- the second codec may also decode the second coded stream and the multiplexed code stream of the first coded stream delayed by an integer frame.
- the first codec encodes the input signal to obtain a first code stream corresponding to at least one extension layer and the remaining code streams including the core layer code stream.
- the first codec decodes the remaining coded stream to obtain a first decoded signal, and the remaining coded stream is obtained by encoding the remaining layers except for the at least one extension layer using the first codec.
- the transcoding device receives the first coded stream, obtains delayed integer frame information, and delays the first coded stream by integer frames according to the delayed integer frame information, and the transcoding device receives the The first decoded signal obtains the adjustment delay information required for the delay alignment adjustment, the delay alignment adjustment is performed on the first decoded signal according to the adjustment delay information to obtain an adjustment signal, and the transcoding device delays an integer
- the first coded stream after the frame and the adjustment signal output as shown in each embodiment of the transcoding device or transcoding device, the delay of the second coded stream and the delay of the first coded stream
- the relationship between the delays after the integer frame corresponds to the relationship between the encoding delay of encoding the at least one extension layer by the second codec and the encoding delay of encoding the remaining layers
- the second codec encodes the adjustment Signal to obtain the second coded stream and decode the second coded stream and the The code stream after the multiplexing of the first code stream delayed by an integer frame.
- the transcoding system of the embodiment of the present invention only needs to re-encode and decode part of the code stream, and adjust the delay alignment by using the transcoding device for the code stream that is no longer decoded and re-encoded, and delay the re-encoded and decoded part of the code stream.
- Time-integer frame operation can greatly reduce the transcoding complexity of the embedded codec. Because part of the code stream is not re-encoded and decoded, the signal quality in the transcoding is effectively improved. It provides a universal transcoding method for transcoding between different codecs. For most embedded codecs, transcoding can also reduce the extra delay caused by transcoding. For example, no extra delay can be generated during transcoding. Delay.
- a mobile station in another embodiment, includes the transcoding device of the foregoing embodiment, and a network element is provided, which also includes the transcoding device of the foregoing embodiment.
- the transcoding device is in the mobile station and Network elements provide transcoding operations, and transcoding devices are not limited to the above-mentioned mobile stations and network elements. All communication equipment and systems that require transcoding operations can use the above transcoding devices and transcoding methods, and are not limited to this.
- the above embodiments of the present invention are applicable to speech and audio signals and even other data signals with embedded codec processing signal characteristics.
- Those of ordinary skill in the art can understand that all or part of the processes in the methods of the above embodiments can be implemented through computer programs. Relevant hardware is instructed to complete, the program can be stored in a computer readable storage medium, and when the program is executed, it can include the processes of the above-mentioned method embodiments.
- the storage medium may be a magnetic disk, an optical disk, a read-only memory (Read-Only Memory, ROM), or a random access memory (Random Access Memory, RAM).
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Quality & Reliability (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Description
Claims
Priority Applications (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2012521949A JP2013501246A (ja) | 2009-07-31 | 2010-07-28 | トランスコーディングの方法、装置、機器、およびシステム |
KR1020127003447A KR101348969B1 (ko) | 2009-07-31 | 2010-07-28 | 트랜스코딩 방법, 디바이스, 장치 및 시스템 |
EP10803905A EP2461322A4 (en) | 2009-07-31 | 2010-07-28 | TRANSCODING PROCESS, DEVICE, DEVICE AND SYSTEM |
US13/359,339 US8326608B2 (en) | 2009-07-31 | 2012-01-26 | Transcoding method, apparatus, device and system |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2009101095101A CN101989429B (zh) | 2009-07-31 | 2009-07-31 | 转码方法、装置、设备以及系统 |
CN200910109510.1 | 2009-07-31 |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US13/359,339 Continuation US8326608B2 (en) | 2009-07-31 | 2012-01-26 | Transcoding method, apparatus, device and system |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2011012072A1 true WO2011012072A1 (zh) | 2011-02-03 |
Family
ID=43528775
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2010/075497 WO2011012072A1 (zh) | 2009-07-31 | 2010-07-28 | 转码方法、装置、设备以及系统 |
Country Status (6)
Country | Link |
---|---|
US (1) | US8326608B2 (zh) |
EP (1) | EP2461322A4 (zh) |
JP (1) | JP2013501246A (zh) |
KR (1) | KR101348969B1 (zh) |
CN (1) | CN101989429B (zh) |
WO (1) | WO2011012072A1 (zh) |
Families Citing this family (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP2395504B1 (en) * | 2009-02-13 | 2013-09-18 | Huawei Technologies Co., Ltd. | Stereo encoding method and apparatus |
RU2665281C2 (ru) | 2013-09-12 | 2018-08-28 | Долби Интернэшнл Аб | Временное согласование данных обработки на основе квадратурного зеркального фильтра |
CN111614976B (zh) * | 2014-09-12 | 2022-05-24 | 松下知识产权经营株式会社 | 发送装置、接收装置、发送方法以及接收方法 |
CN104244004B (zh) * | 2014-09-30 | 2017-10-10 | 华为技术有限公司 | 低功耗编码方法及装置 |
EP3284087B1 (en) | 2016-01-22 | 2019-03-06 | Fraunhofer Gesellschaft zur Förderung der Angewand | Apparatuses and methods for encoding or decoding an audio multi-channel signal using spectral-domain resampling |
CN109217979B (zh) * | 2017-06-30 | 2021-06-15 | 华为技术有限公司 | 一种通信方法、装置及存储介质 |
CN112188286B (zh) * | 2020-10-09 | 2022-06-10 | 上海网达软件股份有限公司 | 一种用于直播流的时间戳标定方法及系统 |
CN114760226B (zh) * | 2020-12-29 | 2023-07-18 | 海能达通信股份有限公司 | 一种提高往返时延估计精度的方法、装置和存储介质 |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2000341691A (ja) * | 1999-05-31 | 2000-12-08 | Toshiba Corp | トランスコーダ装置 |
US20040153316A1 (en) * | 2003-01-30 | 2004-08-05 | Hardwick John C. | Voice transcoder |
US20050015243A1 (en) * | 2003-07-15 | 2005-01-20 | Lee Eung Don | Apparatus and method for converting pitch delay using linear prediction in speech transcoding |
CN1585970A (zh) * | 2001-11-13 | 2005-02-23 | 日本电气株式会社 | 代码转换方法、设备、程序和记录介质 |
CN1784716A (zh) * | 2003-04-08 | 2006-06-07 | 日本电气株式会社 | 代码转换方法和设备 |
CN1918634A (zh) * | 2004-02-16 | 2007-02-21 | 皇家飞利浦电子股份有限公司 | 代码转换器以及用于其的代码转换方法 |
WO2008034723A1 (en) * | 2006-09-20 | 2008-03-27 | Thomson Licensing | Method and device for transcoding audio signals |
WO2009008947A1 (en) * | 2007-07-06 | 2009-01-15 | Mindspeed Technologies, Inc. | Speech transcoding in gsm networks |
Family Cites Families (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP3032242B2 (ja) | 1990-05-25 | 2000-04-10 | 日産自動車株式会社 | インストパッドとフロントピラーガーニッシュの合わせ構造 |
JPH10327074A (ja) * | 1997-05-22 | 1998-12-08 | Matsushita Electric Ind Co Ltd | 信号処理装置 |
DE10102155C2 (de) * | 2001-01-18 | 2003-01-09 | Fraunhofer Ges Forschung | Verfahren und Vorrichtung zum Erzeugen eines skalierbaren Datenstroms und Verfahren und Vorrichtung zum Decodieren eines skalierbaren Datenstroms |
US20030028386A1 (en) | 2001-04-02 | 2003-02-06 | Zinser Richard L. | Compressed domain universal transcoder |
JP2003280694A (ja) * | 2002-03-26 | 2003-10-02 | Nec Corp | 階層ロスレス符号化復号方法、階層ロスレス符号化方法、階層ロスレス復号方法及びその装置並びにプログラム |
KR100933159B1 (ko) | 2003-07-11 | 2009-12-21 | 삼성전자주식회사 | 이동통신시스템에서 음성 데이터 전송을 위한 동기화 방법 및 시스템 |
JP4958780B2 (ja) * | 2005-05-11 | 2012-06-20 | パナソニック株式会社 | 符号化装置、復号化装置及びこれらの方法 |
FR2888699A1 (fr) * | 2005-07-13 | 2007-01-19 | France Telecom | Dispositif de codage/decodage hierachique |
WO2007043642A1 (ja) * | 2005-10-14 | 2007-04-19 | Matsushita Electric Industrial Co., Ltd. | スケーラブル符号化装置、スケーラブル復号装置、およびこれらの方法 |
JPWO2007116809A1 (ja) * | 2006-03-31 | 2009-08-20 | パナソニック株式会社 | ステレオ音声符号化装置、ステレオ音声復号装置、およびこれらの方法 |
JP4708446B2 (ja) * | 2007-03-02 | 2011-06-22 | パナソニック株式会社 | 符号化装置、復号装置およびそれらの方法 |
-
2009
- 2009-07-31 CN CN2009101095101A patent/CN101989429B/zh active Active
-
2010
- 2010-07-28 JP JP2012521949A patent/JP2013501246A/ja active Pending
- 2010-07-28 WO PCT/CN2010/075497 patent/WO2011012072A1/zh active Application Filing
- 2010-07-28 EP EP10803905A patent/EP2461322A4/en not_active Withdrawn
- 2010-07-28 KR KR1020127003447A patent/KR101348969B1/ko active IP Right Grant
-
2012
- 2012-01-26 US US13/359,339 patent/US8326608B2/en active Active
Patent Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2000341691A (ja) * | 1999-05-31 | 2000-12-08 | Toshiba Corp | トランスコーダ装置 |
CN1585970A (zh) * | 2001-11-13 | 2005-02-23 | 日本电气株式会社 | 代码转换方法、设备、程序和记录介质 |
US20040153316A1 (en) * | 2003-01-30 | 2004-08-05 | Hardwick John C. | Voice transcoder |
CN1784716A (zh) * | 2003-04-08 | 2006-06-07 | 日本电气株式会社 | 代码转换方法和设备 |
US20050015243A1 (en) * | 2003-07-15 | 2005-01-20 | Lee Eung Don | Apparatus and method for converting pitch delay using linear prediction in speech transcoding |
CN1918634A (zh) * | 2004-02-16 | 2007-02-21 | 皇家飞利浦电子股份有限公司 | 代码转换器以及用于其的代码转换方法 |
WO2008034723A1 (en) * | 2006-09-20 | 2008-03-27 | Thomson Licensing | Method and device for transcoding audio signals |
WO2009008947A1 (en) * | 2007-07-06 | 2009-01-15 | Mindspeed Technologies, Inc. | Speech transcoding in gsm networks |
Also Published As
Publication number | Publication date |
---|---|
US8326608B2 (en) | 2012-12-04 |
JP2013501246A (ja) | 2013-01-10 |
KR101348969B1 (ko) | 2014-01-09 |
CN101989429B (zh) | 2012-02-01 |
EP2461322A1 (en) | 2012-06-06 |
US20120136669A1 (en) | 2012-05-31 |
CN101989429A (zh) | 2011-03-23 |
EP2461322A4 (en) | 2013-01-09 |
KR20120061828A (ko) | 2012-06-13 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2011012072A1 (zh) | 转码方法、装置、设备以及系统 | |
JP4160278B2 (ja) | 媒体ストリームのスケーラブル符号化方法、スケーラブルエンコーダおよびマルチメディア端末 | |
US10885921B2 (en) | Multi-stream audio coding | |
CN111108556B (zh) | 多流音频译码 | |
JP5363488B2 (ja) | マルチチャネル・オーディオのジョイント強化 | |
US8069035B2 (en) | Scalable encoding apparatus, scalable decoding apparatus, and methods of them | |
EP1590800B1 (en) | Continuous backup audio | |
JP5418930B2 (ja) | 音声復号化方法および音声復号化器 | |
US20080004883A1 (en) | Scalable audio coding | |
JP4969454B2 (ja) | スケーラブル符号化装置およびスケーラブル符号化方法 | |
CN101051465B (zh) | 用于解码被编码的有用数据的方法和解码装置 | |
JP2001202097A (ja) | 符号化二進オーディオ処理方法 | |
TW201306025A (zh) | 用以產生可由實施不同解碼協定之解碼器所解碼的統一位元流之音頻編碼方法及系統 | |
JPWO2005119950A1 (ja) | 音声データ受信装置および音声データ受信方法 | |
CN101366082B (zh) | 可变帧偏移编解码方法、编解码器及无线电通信设备 | |
CN102915736B (zh) | 混音处理方法和混音处理系统 | |
WO2011012029A1 (zh) | 多描述音频编解码的方法、装置及系统 | |
US10375131B2 (en) | Selectively transforming audio streams based on audio energy estimate | |
KR101256316B1 (ko) | 보코더들 간의 스위칭 방법 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 10803905 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2012521949 Country of ref document: JP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2010803905 Country of ref document: EP |
|
ENP | Entry into the national phase |
Ref document number: 20127003447 Country of ref document: KR Kind code of ref document: A |