WO2023097686A1 - 一种立体声音频信号处理方法及设备/存储介质/装置 - Google Patents

一种立体声音频信号处理方法及设备/存储介质/装置 Download PDF

Info

Publication number
WO2023097686A1
WO2023097686A1 PCT/CN2021/135514 CN2021135514W WO2023097686A1 WO 2023097686 A1 WO2023097686 A1 WO 2023097686A1 CN 2021135514 W CN2021135514 W CN 2021135514W WO 2023097686 A1 WO2023097686 A1 WO 2023097686A1
Authority
WO
WIPO (PCT)
Prior art keywords
threshold
thresh0
frame
current frame
initial
Prior art date
Application number
PCT/CN2021/135514
Other languages
English (en)
French (fr)
Other versions
WO2023097686A9 (zh
Inventor
高硕�
Original Assignee
北京小米移动软件有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京小米移动软件有限公司 filed Critical 北京小米移动软件有限公司
Priority to CN202180004514.2A priority Critical patent/CN114365509B/zh
Priority to PCT/CN2021/135514 priority patent/WO2023097686A1/zh
Publication of WO2023097686A1 publication Critical patent/WO2023097686A1/zh
Publication of WO2023097686A9 publication Critical patent/WO2023097686A9/zh

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0264Noise filtering characterised by the type of parameter measurement, e.g. correlation techniques, zero crossing techniques or predictive techniques
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S1/00Two-channel systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S3/00Systems employing more than two channels, e.g. quadraphonic
    • H04S3/02Systems employing more than two channels, e.g. quadraphonic of the matrix type, i.e. in which input signals are combined algebraically, e.g. after having been phase shifted with respect to each other
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S7/00Indicating arrangements; Control arrangements, e.g. balance control

Definitions

  • the present disclosure relates to the field of communication technologies, and in particular to a stereo audio signal processing method and equipment/storage medium/device.
  • lossless coding can meet the demands of high-quality audio playback and lossless storage, it is widely used.
  • lossless encoding is performed on stereo audio signals, it is necessary to perform decorrelation processing on the stereo audio signals first, so as to improve the encoding compression rate.
  • the main method of de-correlation processing is: setting a threshold, and calculating the correlation coefficient of the left channel signal and the right channel signal of the current frame of the stereo audio signal, and determining the left channel signal of the current frame based on the relationship between the correlation coefficient and the threshold value.
  • the correlation between the channel signal and the right channel signal is determined, and based on the determined correlation, an optimal decorrelation processing method is adopted to perform decorrelation processing on the current frame.
  • the threshold corresponding to each frame of the stereo audio signal is fixed and cannot be updated adaptively, which will affect the accuracy of determining the correlation of different frames, and then the optimal threshold cannot be accurately selected for each frame.
  • the de-correlation processing method makes it impossible to improve the encoding compression rate.
  • the present disclosure proposes a stereo audio signal processing method and equipment/storage medium/device to solve the technical problem of low coding compression rate in the decorrelation processing method in the related art.
  • the stereo audio signal processing method proposed in an embodiment of the present disclosure is applied to a coding device, including:
  • Thresh0 1 and the initial second threshold Thresh0 2 of the current frame of the stereo audio signal, where Thresh0 1 ⁇ (-1, 0), Thresh0 2 ⁇ (0, 1);
  • the offset value Delta Based on the decorrelation processing method of the previous frame of the stereo audio signal, the offset value Delta, the initial first threshold Thresh0 1 of the current frame, and the initial second threshold Thresh0 2 of the current frame, determine the stereo audio signal The first threshold Thresh1 and the second threshold Thresh2 corresponding to the current frame of the audio signal;
  • the stereo audio signal processing device proposed by the embodiment includes:
  • a determining module configured to determine an initial first threshold Thresh0 1 and an initial second threshold Thresh0 2 of the current frame of the stereo audio signal, wherein Thresh0 1 ⁇ (-1, 0), Thresh0 2 ⁇ (0, 1);
  • a determining module configured to be based on the decorrelation processing method of the previous frame of the stereo audio signal, the offset value Delta, the initial first threshold Thresh0 1 of the current frame, and the initial second threshold Thresh0 2 of the current frame , determining a first threshold Thresh1 and a second threshold Thresh2 corresponding to the current frame of the stereo audio signal;
  • a processing module configured to perform decorrelation processing on the current frame based on the first threshold Thresh1 and the second threshold Thresh2 corresponding to the current frame.
  • an embodiment provides a communication device, the device includes a processor and a memory, a computer program is stored in the memory, and the processor executes the computer program stored in the memory, so that the The device executes the method provided in the above embodiment of another aspect.
  • a communication device provided by an embodiment of another aspect of the present disclosure includes: a processor and an interface circuit;
  • the interface circuit is used to receive code instructions and transmit them to the processor
  • the processor is configured to run the code instructions to execute the method provided in another embodiment.
  • the computer-readable storage medium provided by another embodiment of the present disclosure is used to store instructions, and when the instructions are executed, the method provided by another embodiment is implemented.
  • the initial first threshold Thresh0 1 and the initial second threshold Thresh0 2 of the current frame of the stereo audio signal will be determined first, Among them, Thresh0 1 ⁇ (-1, 0), Thresh0 2 ⁇ (0, 1); after that, the offset value Delta will be determined; and, based on the decorrelation processing method of the previous frame of the stereo audio signal, the offset value Delta,
  • the initial first threshold Thresh0 1 of the current frame and the initial second threshold Thresh0 2 of the current frame determine the first threshold Thresh1 and the second threshold Thresh2 corresponding to the current frame of the stereo audio signal; so that the follow-up can be based on the first threshold Thresh1 corresponding to the current frame and the second threshold Thresh2 to perform decorrelation processing on the current frame.
  • the first threshold Thresh1 and the second threshold Thresh2 corresponding to the current frame will be adaptively updated in real time based on the de-correlation processing method of the previous frame, so as to ensure the correlation of each frame The accuracy of the determination, and then the optimal decorrelation processing method can be accurately selected based on the correlation of each frame, which improves the encoding compression rate.
  • Fig. 1a is a schematic flowchart of a stereo audio signal processing method provided by an embodiment of the present disclosure
  • FIG. 1b is a flow diagram of obtaining an encoded code stream based on a signal after decorrelation processing provided by an embodiment of the present disclosure
  • FIG. 2 is a schematic flowchart of a stereo audio signal processing method provided by an embodiment of the present disclosure
  • FIG. 3 is a schematic flowchart of a stereo audio signal processing method provided by an embodiment of the present disclosure
  • FIG. 4 is a schematic flowchart of a stereo audio signal processing method provided by an embodiment of the present disclosure
  • FIG. 5 is a schematic flowchart of a stereo audio signal processing method provided by an embodiment of the present disclosure
  • FIG. 6 is a schematic flowchart of a stereo audio signal processing method provided by an embodiment of the present disclosure
  • FIG. 7 is a schematic flowchart of a stereo audio signal processing method provided by an embodiment of the present disclosure.
  • FIG. 8 is a schematic structural diagram of a stereo audio signal processing device provided by an embodiment of the present disclosure.
  • Fig. 9 is a block diagram of a user equipment provided by an embodiment of the present disclosure.
  • Fig. 10 is a block diagram of a network side device provided by an embodiment of the present disclosure.
  • first, second, third, etc. may use the terms first, second, third, etc. to describe various information, the information should not be limited to these terms. These terms are only used to distinguish information of the same type from one another. For example, without departing from the scope of the embodiments of the present disclosure, first information may also be called second information, and similarly, second information may also be called first information.
  • first information may also be called second information
  • second information may also be called first information.
  • the words "if” and "if” as used herein may be interpreted as “at” or "when” or "in response to a determination.”
  • Fig. 1a is a schematic flowchart of a method for processing a stereo audio signal provided by an embodiment of the present disclosure. The method is executed by an encoding device. As shown in Fig. 1a, the method for processing a stereo audio signal may include the following steps:
  • Step 101 Determine an initial first threshold Thresh0 1 and an initial second threshold Thresh0 2 of a current frame of a stereo audio signal.
  • the current frame may be any frame in the stereo audio signal except the first frame.
  • the initial first threshold Thresh0 1 and the initial second threshold Thresh0 2 may be preset, wherein, the initial first threshold Thresh0 1 ⁇ (-1, 0), the initial The second threshold Thresh0 2 ⁇ (0, 1).
  • the initial first threshold Thresh0 to 1 corresponding to each frame of the stereo audio signal is the same, and the initial second threshold Thresh0 corresponding to each frame of the stereo audio signal 2 are the same.
  • Step 102 Determine the offset value Delta.
  • the determined offset value Delta has a specific function of: using the offset value Delta to update the initial first threshold Thresh0 1 and the initial second threshold Thresh0 2 of the current frame to The first threshold Thresh1 and the second threshold Thresh2 corresponding to the current frame are obtained. Therefore, in an embodiment of the present disclosure, the offset value Delta may include an offset value Delta1 and an offset value Delta2, wherein the offset value Delta1 may be used to adjust the initial first threshold Thresh0 of the current frame 1 is updated, and the offset value Delta2 can be used to update the initial second threshold Thresh02 of the current frame.
  • the method for determining the offset value Delta1 may include: making Delta1 ⁇ (0,
  • ), the method for determining the offset value Delta2 may include: making Delta2 ⁇ (0,
  • the offset value Delta1 and the offset value Delta2 may be the same.
  • the offset value Delta1 and the offset value Delta2 may be different.
  • the offset values Delta1 and Delta2 may be 0.05. It can be understood that the above numerical value can be applied to any embodiment of the present disclosure, and the numerical value is only shown as an example, which is not limited by the present disclosure.
  • Step 103 based on the decorrelation processing method of the previous frame of the stereo audio signal, the offset value Delta, the initial first threshold Thresh0 1 of the current frame, and the initial second threshold Thresh0 2 of the current frame, determine the first threshold corresponding to the current frame of the stereo audio signal A threshold Thresh1 and a second threshold Thresh2.
  • the ways of determining the first threshold Thresh1 and the second threshold Thresh2 corresponding to the current frame of the stereo audio signal will also be different.
  • the detailed introduction of this part will be described in the following embodiments.
  • the above-mentioned de-correlation processing method of the previous frame may be determined based on the flag bits corresponding to the previous frame, wherein the flag bits of each frame are used to indicate the de-correlation of each frame related processing methods.
  • the decorrelation processing method of the previous frame in response to the flag position 0 of the previous frame, is determined to be: the first decorrelation processing method; in response to the flag position of the previous frame 1, it is determined that the decorrelation processing method of the previous frame is: the second decorrelation processing method; in response to the flag position 2 of the previous frame, it is determined that the decorrelation processing method of the previous frame is: no decorrelation processing is performed.
  • the first decorrelation processing manner, the second decorrelation processing manner, and no decorrelation processing will be described in subsequent embodiments.
  • Step 104 Perform decorrelation processing on the current frame based on the first threshold Thresh1 and the second threshold Thresh2 corresponding to the current frame.
  • the first threshold Thresh1 corresponding to the current frame can be specifically used to determine that the current frame is a biased anti-phase signal or an uncorrelated signal
  • the second threshold Thresh2 can be specifically used to determine that the current frame is a biased signal. Positive phase signal or uncorrelated signal.
  • the method for decorrelating the current frame based on the first threshold Thresh1 and the second threshold Thresh2 corresponding to the current frame may include:
  • Step 1 Determine the correlation of the current frame based on the first threshold Thresh1 and the second threshold Thresh2 corresponding to the current frame, wherein the correlation includes partial anti-phase signals, partial positive phase signals and uncorrelated signals.
  • the current frame in response to the fact that the cross-correlation coefficient between the left channel signal and the right channel signal of the current frame is smaller than the first threshold Thresh1 corresponding to the current frame, it is determined that the current frame is a partial anti-phase signal, and the response Because the cross-correlation coefficient between the left channel signal and the right channel signal of the current frame is greater than the second threshold Thresh2 corresponding to the current frame, it is determined that the current frame is a partial positive phase signal, and in response to the mutual correlation between the left channel signal and the right channel signal of the current frame If the relation number is greater than or equal to the first threshold Thresh1 corresponding to the current frame and less than or equal to the second threshold Thresh2 corresponding to the current frame, it is determined that the current frame is an uncorrelated signal.
  • Step 2 Select an optimal decorrelation processing method based on the correlation of the current frame to perform decorrelation processing on the current frame to obtain a decorrelation processed signal.
  • FIG. 1b is a flow chart of obtaining a coded code stream based on the signal after decorrelation processing provided by the embodiment of the present disclosure. As shown in FIG. 1b , based on the de-correlation processing The method of obtaining the coded code stream of the signal can be:
  • the signal after decorrelation processing is divided into bands by integer lifting wavelet decomposition to obtain the sub-band signals, and the signal after decorrelation processing is calculated and quantized by LPC (Linear Prediction Coefficient, linear prediction coefficient) parameters to obtain quantized LPC parameters. Then use the linear predictor to predict each sub-band signal based on the quantized LPC parameters, generate a prediction residual signal, use the preprocessor to normalize the prediction residual signal, and generate a normalized output signal, LSB (Least Significant Bit , least significant bit) signal and signal sign bit.
  • LPC Linear Prediction Coefficient, linear prediction coefficient
  • entropy encoder uses the entropy encoder to perform entropy encoding on the normalized output signals corresponding to each sub-band signal to generate an encoded bit stream, and then perform code stream multiplexing on the encoded bit stream, LSB signal, signal symbol bit, quantized LPC parameters, and wavelet edge information Get the encoded code stream.
  • the initial first threshold Thresh0 1 and the initial second threshold Thresh0 2 of the current frame of the stereo audio signal will be determined first, where Thresh0 1 ⁇ (-1 , 0), Thresh0 2 ⁇ (0, 1); after that, the offset value Delta will be determined; and, based on the decorrelation processing method of the previous frame of the stereo audio signal, the offset value Delta, and the initial first threshold of the current frame Thresh0 1 , the initial second threshold Thresh0 2 of the current frame, determine the first threshold Thresh1 and the second threshold Thresh2 corresponding to the current frame of the stereo audio signal; Frames are de-correlated.
  • the first threshold Thresh1 and the second threshold Thresh2 corresponding to the current frame will be adaptively updated in real time based on the de-correlation processing method of the previous frame, so as to ensure the correlation of each frame The accuracy of the determination, and then the optimal decorrelation processing method can be accurately selected based on the correlation of each frame, which improves the encoding compression rate.
  • Fig. 2 is a schematic flowchart of a method for processing a stereo audio signal provided by an embodiment of the present disclosure. The method is executed by an encoding device. As shown in Fig. 2 , the method for processing a stereo audio signal may include the following steps:
  • Step 201 Determine an initial first threshold Thresh0 1 and an initial second threshold Thresh0 2 of a current frame of a stereo audio signal.
  • Step 202 Determine the offset value Delta.
  • Step 203 responding to the decorrelation processing method of the previous frame of the stereo audio signal: adopting the first decorrelation processing method to perform decorrelation processing, and determining the first threshold Thresh1 and the second threshold Thresh2 corresponding to the current frame based on Formula 1.
  • formula one is:
  • Thresh1 and Thresh2 are respectively the first threshold and the second threshold of the current frame
  • Thresh0 1 and Thresh0 2 are respectively the first initial threshold of the current frame and the second initial threshold of the current frame
  • Delta is the offset value
  • ) that is, the offset value in this embodiment is specifically the offset value Delta1 used to update the initial first threshold Thresh0 1 of the current frame in the above embodiment).
  • the first decorrelation processing manner may specifically be a manner for performing decorrelation processing on partial anti-phase signals.
  • the process of determining whether to use the first decorrelation processing method to perform decorrelation processing on the previous frame is mainly: first judge whether the previous frame is a partial anti-phase signal, and the current frame is When the signal is partial anti-phase, the first decorrelation processing method is used to perform decorrelation processing on the previous frame; otherwise, the first decorrelation processing method is not used to perform decorrelation processing on the previous frame.
  • the above-mentioned process of judging whether the previous frame is a partial anti-phase signal mainly includes: first calculating the first cross-correlation coefficient of the left channel signal and the right channel signal of the previous frame, When the first cross-correlation coefficient is smaller than the first threshold Thresh2 1 corresponding to the previous frame, it is judged that the previous frame is a partial anti-phase signal, and the first decorrelation processing needs to be performed on the signal.
  • the phenomenon of "judgment inaccuracy" may occur due to the inaccurate setting of the first threshold Thresh2 1 corresponding to the previous frame, so that the correlation of the signal after the first decorrelation process is higher than that before the first decorrelation process The correlation of the signal is stronger, causing the signal to fail to achieve the purpose of decorrelation.
  • the second cross-correlation coefficient is the cross-correlation coefficient of the signal after decorrelation processing obtained by performing the first decorrelation processing on the signal of the previous frame by using the first decorrelation processing manner.
  • the first cross-correlation coefficient when the first cross-correlation coefficient is smaller than the second cross-correlation coefficient, it means that "based on the first threshold Thresh2 1 corresponding to the previous frame, it is judged whether the previous frame is to perform the first decorrelation
  • the judging result of the processing is accurate", in other words, it shows that the first threshold Thresh2 1 corresponding to the previous frame is set accurately, and the partial anti-phase signal identified based on the first threshold Thresh2 1 can achieve decorrelation after the first decorrelation processing
  • the first threshold Thresh2 1 may still not reach the threshold critical point for decorrelation processing.
  • the first cross-correlation coefficient is still smaller than the second cross-correlation coefficient, that is, the decorrelation process can still achieve the purpose.
  • the decorrelation processing method of the previous frame is: adopting the first decorrelation processing method to perform decorrelation processing, it means that the previous frame is partial Inverted signal, and the first threshold Thresh2 1 of the previous frame still has room to increase, and since the first threshold Thresh2 1 corresponding to the previous frame is determined based on the initial first threshold Thresh0 1 , it can be concluded that, The initial first threshold Thresh0 1 also has room to increase.
  • the de-correlation processing makes the de-correlation processing result better.
  • first decorrelation processing manner may include first sum and difference downmix processing.
  • the first sum and difference downmix processing may include:
  • Mid(n) is the main channel signal of the previous frame
  • Sid(n) is the secondary channel signal of the previous frame
  • L(n) is the left channel signal of the previous frame
  • R(n) is the right channel signal of the previous frame. channel signal.
  • the method for determining the above-mentioned first cross-correlation coefficient may include:
  • ⁇ (LR) is the cross-correlation coefficient of the left channel signal and the right channel signal of the previous frame
  • L (n) is the nth sampling point of the left channel signal of the previous frame, is the average value of all samples of the left channel signal of the previous frame
  • R(n) is the nth sample point of the right channel signal of the previous frame, is the average value of all samples of the right channel signal of the previous frame
  • N is the total number of samples of the left channel signal or the right channel signal of the previous frame, that is, the frame length of the previous frame.
  • the method for determining the above-mentioned second correlation coefficient may include:
  • ⁇ (MS ) is the second cross-correlation coefficient or the third cross-correlation coefficient
  • Mid(n) is the nth sample point of the main channel signal in the signal after decorrelation processing, is the average value of all sample points of the main channel signal in the signal after decorrelation processing
  • Sid(n) is the nth sample point of the secondary channel signal in the signal after decorrelation processing
  • N is the total number of samples of the left channel signal or the right channel signal of the previous frame, that is, the frame length of the previous frame.
  • Step 204 Perform decorrelation processing on the current frame based on the first threshold Thresh1 and the second threshold Thresh2 corresponding to the current frame.
  • step 204 For the relevant introduction about step 204, reference may be made to the description of the foregoing embodiments, and the embodiments of the present disclosure are not described in detail here.
  • the initial first threshold Thresh0 1 and the initial second threshold Thresh0 2 of the current frame of the stereo audio signal will be determined first, where Thresh0 1 ⁇ (-1 , 0), Thresh0 2 ⁇ (0, 1); after that, the offset value Delta will be determined; and, based on the decorrelation processing method of the previous frame of the stereo audio signal, the offset value Delta, and the initial first threshold of the current frame Thresh0 1 , the initial second threshold Thresh0 2 of the current frame, determine the first threshold Thresh1 and the second threshold Thresh2 corresponding to the current frame of the stereo audio signal; Frames are de-correlated.
  • the first threshold Thresh1 and the second threshold Thresh2 corresponding to the current frame will be adaptively updated in real time based on the de-correlation processing method of the previous frame, so as to ensure the correlation of each frame The accuracy of the determination, and then the optimal decorrelation processing method can be accurately selected based on the correlation of each frame, which improves the encoding compression rate.
  • Fig. 3 is a schematic flowchart of a stereo audio signal processing method provided by an embodiment of the present disclosure, the method is executed by an encoding device, as shown in Fig. 3, the stereo audio signal processing method may include the following steps:
  • Step 301 Determine an initial first threshold Thresh0 1 and an initial second threshold Thresh0 2 of a current frame of a stereo audio signal.
  • Step 302. Determine the offset value Delta.
  • Step 303 responding to the decorrelation processing method of the previous frame of the stereo audio signal is: adopt the second decorrelation processing method to perform decorrelation processing, and determine the first threshold Thresh1 and the second threshold Thresh2 corresponding to the current frame based on Formula 2.
  • Formula 2 is:
  • Thresh1 and Thresh2 are respectively the first threshold and the second threshold of the current frame
  • Thresh0 1 and Thresh0 2 are respectively the first initial threshold of the current frame and the second initial threshold of the current frame
  • Delta is the offset value
  • ) that is, the offset value in this embodiment is specifically the offset value Delta2 used to update the initial second threshold Thresh0 2 of the current frame in the above embodiment).
  • the second decorrelation processing manner may specifically be a manner for performing decorrelation processing on the partial positive-phase signal.
  • the process of determining whether to use the second decorrelation processing method to perform decorrelation processing on the previous frame is mainly: first judge whether the previous frame is a positive-phase signal, and the current frame is When the signal is partial to positive phase, the previous frame is de-correlated using the second de-correlation processing method; otherwise, the previous frame is not de-correlated using the second de-correlation processing method.
  • the above-mentioned process of judging whether the previous frame is a positive-phase signal mainly includes: first calculating the first cross-correlation coefficient between the left channel signal and the right channel signal of the previous frame, When the first cross-correlation coefficient is greater than the second threshold Thresh222 corresponding to the previous frame, it is judged that the previous frame is a partial positive phase signal, and a second decorrelation process needs to be performed on the signal.
  • the third cross-correlation coefficient is the cross-correlation coefficient of the signal after the decorrelation processing obtained by performing the second decorrelation processing on the signal of the previous frame by using the second decorrelation processing manner.
  • the first cross-correlation coefficient when the first cross-correlation coefficient is greater than the third cross-correlation coefficient, it indicates that "based on the second threshold Thresh2 corresponding to the previous frame, it is judged whether the previous frame needs to perform the second decorrelation process
  • the judgment result is accurate", in other words, it shows that the second threshold Thresh2 2 corresponding to the previous frame is set accurately, and the partial positive phase signal identified based on the second threshold Thresh2 2 can achieve decorrelation after the second decorrelation process.
  • the second threshold Thresh2 2 may still not reach the critical point of whether decorrelation processing is required, that is to say, there is still room for reduction in the second threshold Thresh2 2 , so that the reduced threshold recognizes
  • the first cross-correlation coefficient is still greater than the third cross-correlation coefficient, that is, the de-correlation processing can still achieve the purpose.
  • the decorrelation processing method of the previous frame is: adopt the second decorrelation processing method to perform decorrelation processing, it means that the previous frame is partial positive phase signal, and the second threshold Thresh2 2 of the previous frame still has room for reduction, and since the second threshold Thresh2 2 corresponding to the previous frame is determined based on the initial second threshold Thresh0 2 , it can be concluded that There is also room for reduction of the initial second threshold Thresh0 2 .
  • the de-correlation processing makes the de-correlation processing result better.
  • the above-mentioned second decorrelation processing manner may include the second sum and difference downmix processing.
  • the second sum and differential downmix processing may include:
  • Mid(n) is the main channel signal of the previous frame
  • Sid(n) is the secondary channel signal of the previous frame
  • L(n) is the left channel signal of the previous frame
  • R(n) is the right channel signal of the previous frame. channel signal.
  • the method for determining the above-mentioned second correlation coefficient may include:
  • formula nine is:
  • ⁇ (MS) is the second cross-correlation coefficient or the third cross-correlation coefficient
  • Mid(n) is the nth sampling point of the main channel signal in the signal after decorrelation processing, is the average value of all sample points of the main channel signal in the signal after decorrelation processing
  • Sid(n) is the nth sample point of the secondary channel signal in the signal after decorrelation processing
  • N is the total number of samples of the left channel signal or the right channel signal of the previous frame, that is, the frame length of the previous frame.
  • Step 304 Perform decorrelation processing on the current frame based on the first threshold Thresh1 and the second threshold Thresh2 corresponding to the current frame.
  • step 304 For the relevant introduction about step 304, reference may be made to the description of the foregoing embodiments, and the embodiments of the present disclosure are not described in detail here.
  • the initial first threshold Thresh0 1 and the initial second threshold Thresh0 2 of the current frame of the stereo audio signal will be determined first, where Thresh0 1 ⁇ (-1 , 0), Thresh0 2 ⁇ (0, 1); after that, the offset value Delta will be determined; and, based on the decorrelation processing method of the previous frame of the stereo audio signal, the offset value Delta, and the initial first threshold of the current frame Thresh0 1 , the initial second threshold Thresh0 2 of the current frame, determine the first threshold Thresh1 and the second threshold Thresh2 corresponding to the current frame of the stereo audio signal; Frames are de-correlated.
  • the first threshold Thresh1 and the second threshold Thresh2 corresponding to the current frame will be adaptively updated in real time based on the de-correlation processing method of the previous frame, so as to ensure the correlation of each frame The accuracy of the determination, and then the optimal decorrelation processing method can be accurately selected based on the correlation of each frame, which improves the encoding compression rate.
  • Fig. 4 is a schematic flowchart of a stereo audio signal processing method provided by an embodiment of the present disclosure, the method is executed by an encoding device, as shown in Fig. 4, the stereo audio signal processing method may include the following steps:
  • Step 401 Determine an initial first threshold Thresh0 1 and an initial second threshold Thresh0 2 of a current frame of a stereo audio signal.
  • Step 402. Determine the offset value Delta.
  • Step 403 in response to the de-correlation processing method of the previous frame of the stereo audio signal: no de-correlation processing is performed, and the reason why the de-correlation processing is not performed at the same time is: the first correlation between the left channel signal and the right channel signal of the previous frame
  • the relation number is greater than or equal to the first threshold Thresh2 1 corresponding to the previous frame and less than or equal to the second threshold Thresh2 2 corresponding to the previous frame, and the first threshold Thresh1 and the second threshold Thresh2 corresponding to the current frame are determined based on Formula 3.
  • formula three is:
  • Thresh1 and Thresh2 are the first threshold and the second threshold of the current frame respectively, and Thresh0 1 and Thresh0 2 are the first initial threshold of the current frame and the second initial threshold of the current frame respectively.
  • the first threshold Thresh2 1 corresponding to the previous frame and less than or equal to the previous frame is greater than or equal to the first threshold Thresh2 1 corresponding to the previous frame and less than or equal to the previous frame
  • the corresponding second threshold Thresh2 2 indicates that the previous frame is an irrelevant signal.
  • the first initial threshold Thresh0 1 of the current frame and the second initial threshold Thresh0 2 of the current frame do not need to be updated.
  • Step 404 Perform decorrelation processing on the current frame based on the first threshold Thresh1 and the second threshold Thresh2 corresponding to the current frame.
  • step 404 for the related introduction of step 404, reference may be made to the description of the above-mentioned embodiments, and the embodiments of the present disclosure are not repeated here.
  • the initial first threshold Thresh0 1 and the initial second threshold Thresh0 2 of the current frame of the stereo audio signal will be determined first, where Thresh0 1 ⁇ (-1 , 0), Thresh0 2 ⁇ (0, 1); after that, the offset value Delta will be determined; and, based on the decorrelation processing method of the previous frame of the stereo audio signal, the offset value Delta, and the initial first threshold of the current frame Thresh0 1 , the initial second threshold Thresh0 2 of the current frame, determine the first threshold Thresh1 and the second threshold Thresh2 corresponding to the current frame of the stereo audio signal; Frames are de-correlated.
  • the first threshold Thresh1 and the second threshold Thresh2 corresponding to the current frame will be adaptively updated in real time based on the de-correlation processing method of the previous frame, so as to ensure the correlation of each frame The accuracy of the determination, and then the optimal decorrelation processing method can be accurately selected based on the correlation of each frame, which improves the encoding compression rate.
  • FIG. 5 is a schematic flowchart of a stereo audio signal processing method provided by an embodiment of the present disclosure, the method is executed by an encoding device. As shown in FIG. 5, the stereo audio signal processing method may include the following steps:
  • Step 501 Determine an initial first threshold Thresh0 1 and an initial second threshold Thresh0 2 of a current frame of a stereo audio signal.
  • Step 502. Determine the offset value Delta.
  • Step 503 in response to the de-correlation processing method of the previous frame of the stereo audio signal: no de-correlation processing is performed, and the reason for not performing de-correlation processing at the same time is: the first correlation between the left channel signal and the right channel signal of the previous frame
  • the correlation coefficient is smaller than the first threshold Thresh2 1 corresponding to the previous frame
  • the first cross-correlation coefficient is greater than or equal to the second cross-correlation coefficient
  • the first threshold Thresh1 and the second threshold Thresh2 corresponding to the current frame are determined based on Formula 4.
  • Formula 4 is:
  • Thresh1 and Thresh2 are respectively the first threshold and the second threshold of the current frame
  • Thresh0 1 and Thresh0 2 are respectively the first initial threshold of the current frame and the second initial threshold of the current frame
  • Delta is the offset value
  • ) that is, the offset value in this embodiment is specifically the offset value Delta1 used to update the initial first threshold Thresh0 1 of the current frame in the above embodiment).
  • the second cross-correlation coefficient is the cross-correlation coefficient of the signal after decorrelation processing obtained by performing the first decorrelation processing on the previous frame signal by using the first decorrelation processing manner.
  • the principle of determining the first threshold Thresh1 and the second threshold Thresh2 corresponding to the current frame by using the above formula 4 will be explained in detail: when the first correlation coefficient is greater than or equal to the second correlation coefficient, it means that the previous frame has not been removed. Correlation processing, that is, it shows that "the judgment result of the first decorrelation processing is inaccurate based on the first threshold value Thresh2 1 corresponding to the previous frame to judge that the previous frame is a partial anti-phase signal", in other words, it shows that the previous frame The value of the first threshold Thresh2 1 corresponding to the frame is inaccurate.
  • the signal identified based on the first threshold Thresh2 1 cannot achieve the purpose of decorrelation after the first decorrelation process.
  • the first threshold Thresh2 1 is greater than whether de-correlation is required.
  • the threshold critical point of correlation processing that is to say, the first threshold Thresh2 1 needs to be reduced, so that after the partial anti-phase signal identified by the reduced threshold undergoes the first decorrelation processing, the first cross-correlation coefficient is smaller than the second
  • the cross-correlation coefficient means that the de-correlation process can achieve the goal.
  • the first cross-correlation coefficient of the left channel signal and the right channel signal of the previous frame is smaller than the first threshold Thresh2 1 corresponding to the previous frame, and the first If a cross-correlation coefficient is greater than or equal to the second cross-correlation coefficient, it means that the first threshold Thresh2 1 is considered to be greater than the threshold critical point for whether decorrelation processing is required, and since the first threshold Thresh2 1 corresponding to the previous frame is based on the initial first threshold Thresh2 1 If a threshold Thresh0 1 is determined, it can be concluded that the initial first threshold Thresh0 1 may also be greater than the critical threshold for whether decorrelation processing is required.
  • the first cross-correlation coefficient of the left channel signal and the right channel signal of the previous frame is smaller than the first threshold Thresh2 1 corresponding to the previous frame, and the first cross-correlation The number is greater than or equal to the second cross-correlation coefficient" means "it is inaccurate to judge that the previous frame is a partial anti-phase signal based on the first threshold Thresh2 1 corresponding to the previous frame".
  • Step 504 Perform decorrelation processing on the current frame based on the first threshold Thresh1 and the second threshold Thresh2 corresponding to the current frame.
  • step 504 reference may be made to the description of the foregoing embodiments, and the embodiments of the present disclosure are not described in detail here.
  • the initial first threshold Thresh0 1 and the initial second threshold Thresh0 2 of the current frame of the stereo audio signal will be determined first, where Thresh0 1 ⁇ (-1 , 0), Thresh0 2 ⁇ (0, 1); after that, the offset value Delta will be determined; and, based on the decorrelation processing method of the previous frame of the stereo audio signal, the offset value Delta, and the initial first threshold of the current frame Thresh0 1 , the initial second threshold Thresh0 2 of the current frame, determine the first threshold Thresh1 and the second threshold Thresh2 corresponding to the current frame of the stereo audio signal; Frames are de-correlated.
  • the first threshold Thresh1 and the second threshold Thresh2 corresponding to the current frame will be adaptively updated in real time based on the de-correlation processing method of the previous frame, so as to ensure the correlation of each frame The accuracy of the determination, and then the optimal decorrelation processing method can be accurately selected based on the correlation of each frame, which improves the encoding compression rate.
  • FIG. 6 is a schematic flowchart of a method for processing a stereo audio signal provided by an embodiment of the present disclosure. The method is executed by an encoding device. As shown in FIG. 6 , the method for processing a stereo audio signal may include the following steps:
  • Step 601. Determine an initial first threshold Thresh0 1 and an initial second threshold Thresh0 2 of a current frame of a stereo audio signal.
  • Step 602. Determine the offset value Delta.
  • Step 603 in response to the de-correlation processing method of the previous frame of the stereo audio signal: no de-correlation processing is performed, and the reason why the de-correlation processing is not performed at the same time is: the first correlation between the left channel signal and the right channel signal of the previous frame
  • the correlation coefficient is greater than the second threshold Thresh2 2 corresponding to the previous frame, and the first correlation coefficient is less than or equal to the third correlation coefficient, and the first threshold Thresh1 and the second threshold Thresh2 corresponding to the current frame are determined based on Formula 5.
  • Formula 5 is:
  • Thresh1 and Thresh2 are respectively the first threshold and the second threshold of the current frame
  • Thresh0 1 and Thresh0 2 are respectively the first initial threshold of the current frame and the second initial threshold of the current frame
  • Delta is the offset value
  • ) that is, the offset value in this embodiment is specifically the offset value Delta2 used to update the initial second threshold Thresh0 2 of the current frame in the above embodiment).
  • the third cross-correlation coefficient is the cross-correlation coefficient of the signal after decorrelation processing obtained by performing the second decorrelation processing on the previous frame signal by using the second decorrelation processing manner.
  • the principle of determining the first threshold Thresh1 and the second threshold Thresh2 corresponding to the current frame using the above formula 5 will be explained in detail: if the first correlation coefficient is less than or equal to the third correlation coefficient, it means that the previous frame has not Decorrelation processing, that is, it shows that "the judgment result of the second decorrelation processing is inaccurate based on the second threshold Thresh2 2 corresponding to the previous frame to judge that the previous frame is a partial positive phase signal", in other words, the previous The value of the second threshold Thresh2 2 corresponding to one frame is inaccurate, and the signal identified based on the second threshold Thresh2 2 cannot achieve the purpose of decorrelation after the second decorrelation process, and it is considered that the second threshold Thresh2 1 is less than whether it is necessary to perform
  • the threshold critical point of decorrelation processing that is to say, the second threshold Thresh2 needs to be increased, so that after the partial positive phase signal identified by the increased threshold undergoes second decorrelation processing,
  • the first cross-correlation coefficient of the left channel signal and the right channel signal of the previous frame is greater than the second threshold Thresh2 2 corresponding to the previous frame, and the first If a cross-correlation coefficient is less than or equal to the third cross-correlation coefficient, it means that the second threshold Thresh2 2 is considered to be less than the threshold critical point for decorrelation processing, and since the second threshold Thresh2 2 corresponding to the previous frame is based on the initial first If the second threshold Thresh0 2 is determined, it can be concluded that the initial second threshold Thresh0 2 may also be smaller than the threshold critical point for whether decorrelation processing is required.
  • the first cross-correlation coefficient of the left channel signal and the right channel signal of the previous frame is greater than the second threshold Thresh2 2 corresponding to the previous frame, and the first cross-correlation The number is less than or equal to the third cross-correlation coefficient" indicates that "the judgment result of judging that the previous frame is a positive phase signal based on the second threshold Thresh2 2 corresponding to the previous frame is inaccurate".
  • Step 604 Perform decorrelation processing on the current frame based on the first threshold Thresh1 and the second threshold Thresh2 corresponding to the current frame.
  • step 604 reference may be made to the description of the foregoing embodiments, and the embodiments of the present disclosure are not described in detail here.
  • the initial first threshold Thresh0 1 and the initial second threshold Thresh0 2 of the current frame of the stereo audio signal will be determined first, where Thresh0 1 ⁇ (-1 , 0), Thresh0 2 ⁇ (0, 1); after that, the offset value Delta will be determined; and, based on the decorrelation processing method of the previous frame of the stereo audio signal, the offset value Delta, and the initial first threshold of the current frame Thresh0 1 , the initial second threshold Thresh0 2 of the current frame, determine the first threshold Thresh1 and the second threshold Thresh2 corresponding to the current frame of the stereo audio signal; Frames are de-correlated.
  • the first threshold Thresh1 and the second threshold Thresh2 corresponding to the current frame will be adaptively updated in real time based on the de-correlation processing method of the previous frame, so as to ensure the correlation of each frame The accuracy of the determination, and then the optimal decorrelation processing method can be accurately selected based on the correlation of each frame, which improves the encoding compression rate.
  • FIG. 7 is a schematic flowchart of a method for processing a stereo audio signal provided by an embodiment of the present disclosure. The method is executed by an encoding device. As shown in FIG. 7 , the method for processing a stereo audio signal may include the following steps:
  • Step 701. Determine an initial first threshold Thresh0 1 and an initial second threshold Thresh0 2 of a first frame of a stereo audio signal.
  • Step 702 Determine the first threshold Thresh3 1 and the second threshold Thresh3 2 corresponding to the first frame based on formula ten.
  • formula ten is:
  • Thresh3 1 and Thresh3 2 are respectively the first threshold of the first frame and the second threshold of the first frame, and Thresh0 1 and Thresh0 2 are respectively the first initial threshold of the first frame and the first threshold of the first frame The second initial threshold of .
  • Step 703 Determine the initial first threshold Thresh0 1 and the initial second threshold Thresh0 2 of the current frame of the stereo audio signal, where Thresh0 1 ⁇ (-1, 0), Thresh0 2 ⁇ (0, 1).
  • Step 704 determine the offset value Delta.
  • Step 705 based on the decorrelation processing method of the previous frame of the stereo audio signal, the offset value Delta, the initial first threshold Thresh0 1 of the current frame, and the initial second threshold Thresh0 2 of the current frame, determine the first threshold corresponding to the current frame of the stereo audio signal A threshold Thresh1 and a second threshold Thresh2.
  • Step 706 Perform decorrelation processing on the current frame based on the first threshold Thresh1 and the second threshold Thresh2 corresponding to the current frame.
  • the initial first threshold Thresh0 1 and the initial second threshold Thresh0 2 of the current frame of the stereo audio signal will be determined first, where Thresh0 1 ⁇ (-1 , 0), Thresh0 2 ⁇ (0, 1); after that, the offset value Delta will be determined; and, based on the decorrelation processing method of the previous frame of the stereo audio signal, the offset value Delta, and the initial first threshold of the current frame Thresh0 1 , the initial second threshold Thresh0 2 of the current frame, determine the first threshold Thresh1 and the second threshold Thresh2 corresponding to the current frame of the stereo audio signal; Frames are de-correlated.
  • the first threshold Thresh1 and the second threshold Thresh2 corresponding to the current frame will be adaptively updated in real time based on the de-correlation processing method of the previous frame, so as to ensure the correlation of each frame The accuracy of the determination, and then the optimal decorrelation processing method can be accurately selected based on the correlation of each frame, which improves the encoding compression rate.
  • FIG. 8 is a schematic structural diagram of a stereo audio signal processing device provided by an embodiment of the present disclosure. As shown in FIG. 8, the device 800 may include:
  • a determining module 801 configured to determine an initial first threshold Thresh0 1 and an initial second threshold Thresh0 2 of the current frame of the stereo audio signal, wherein Thresh0 1 ⁇ (-1, 0), Thresh0 2 ⁇ (0, 1);
  • a determining module 803, configured to be based on the decorrelation processing method of the previous frame of the stereo audio signal, the offset value Delta, the initial first threshold Thresh0 1 of the current frame, and the initial second threshold Thresh0 of the current frame 2. Determine the first threshold Thresh1 and the second threshold Thresh2 corresponding to the current frame of the stereo audio signal;
  • the processing module 804 is configured to perform decorrelation processing on the current frame based on the first threshold Thresh1 and the second threshold Thresh2 corresponding to the current frame.
  • the initial first threshold Thresh0 1 and the initial second threshold Thresh0 2 of the current frame of the stereo audio signal will be determined first, where Thresh0 1 ⁇ (-1 , 0), Thresh0 2 ⁇ (0, 1); after that, the offset value Delta will be determined; and, based on the decorrelation processing method of the previous frame of the stereo audio signal, the offset value Delta, and the initial first threshold of the current frame Thresh0 1 , the initial second threshold Thresh0 2 of the current frame, determine the first threshold Thresh1 and the second threshold Thresh2 corresponding to the current frame of the stereo audio signal; Frames are de-correlated.
  • the first threshold Thresh1 and the second threshold Thresh2 corresponding to the current frame will be adaptively updated in real time based on the de-correlation processing method of the previous frame, so as to ensure the correlation of each frame The accuracy of the determination, and then the optimal decorrelation processing method can be accurately selected based on the correlation of each frame, which improves the encoding compression rate.
  • the offset value Delta the initial first threshold Thresh0 1 of the current frame
  • the The initial second threshold Thresh0 2 of the current frame determines the first threshold Thresh1 and the second threshold Thresh2 corresponding to the current frame of the stereo audio signal, including:
  • the first de-correlation processing method is used to perform the de-correlation processing, and the first threshold Thresh1 and the second threshold Thresh2 corresponding to the current frame are determined based on formula 1, so
  • the first formula is:
  • Thresh1 and Thresh2 are respectively the first threshold value and the second threshold value of the current frame
  • Thresh0 1 and Thresh0 2 are respectively the first initial threshold value of the current frame and the second initial threshold value of the current frame
  • Delta is the first threshold value of the current frame The above offset value, and Delta ⁇ (0,
  • the determination module 803 is further configured to:
  • the de-correlation processing is performed using the second de-correlation processing method, and the first threshold Thresh1 and the second threshold Thresh2 corresponding to the current frame are determined based on formula 2, so
  • the second formula is:
  • Thresh1 and Thresh2 are respectively the first threshold value and the second threshold value of the current frame
  • Thresh0 1 and Thresh0 2 are respectively the first initial threshold value of the current frame and the second initial threshold value of the current frame
  • Delta is the first threshold value of the current frame The above offset value, and Delta ⁇ (0,
  • the determination module 803 is further configured to:
  • Thresh1 and Thresh2 are the first threshold and the second threshold of the current frame respectively
  • Thresh0 1 and Thresh0 2 are the first initial threshold of the current frame and the second initial threshold of the current frame respectively.
  • the determination module 803 is further configured to:
  • the reason why the de-correlation processing is not performed at the same time is: the first frame of the left channel signal and the right channel signal of the previous frame
  • the cross-correlation coefficient is less than the first threshold Thresh2 1 corresponding to the previous frame, and the first cross-correlation coefficient is greater than or equal to the second cross-correlation coefficient, wherein the second cross-correlation coefficient adopts the first decorrelation processing method
  • the cross-correlation coefficient of the signal after the decorrelation process obtained by performing the first decorrelation process on the previous frame signal is determined based on Formula 4.
  • the first threshold Thresh1 and the second threshold Thresh2 corresponding to the current frame are determined, and the formula 4 is:
  • Thresh1 and Thresh2 are respectively the first threshold value and the second threshold value of the current frame
  • Thresh0 1 and Thresh0 2 are respectively the first initial threshold value of the current frame and the second initial threshold value of the current frame
  • Delta is the first threshold value of the current frame The above offset value, and Delta ⁇ (0,
  • the determination module 803 is further configured to:
  • the reason why the de-correlation processing is not performed at the same time is: the first frame of the left channel signal and the right channel signal of the previous frame
  • the cross-correlation coefficient is greater than the second threshold Thresh2 2 corresponding to the previous frame, and the first cross-correlation coefficient is less than or equal to the third cross-correlation coefficient, wherein the third cross-correlation coefficient adopts the second decorrelation processing method
  • the cross-correlation coefficient of the signal after the de-correlation processing obtained by performing the second de-correlation processing on the previous frame signal is determined based on formula 5.
  • the first threshold Thresh1 and the second threshold Thresh2 corresponding to the current frame are determined, and the formula 5 is:
  • Thresh1 and Thresh2 are respectively the first threshold value and the second threshold value of the current frame
  • Thresh0 1 and Thresh0 2 are respectively the first initial threshold value of the current frame and the second initial threshold value of the current frame
  • Delta is the first threshold value of the current frame The above offset value, and Delta ⁇ (0,
  • the first decorrelation processing manner includes first sum difference downmix processing.
  • the first sum and difference downmix processing includes:
  • the left channel signal and the right channel signal of the previous frame are processed based on formula six to obtain the main channel signal and the secondary channel signal; the formula six is:
  • Mid(n) is the main channel signal of the previous frame
  • Sid(n) is the secondary channel signal of the previous frame
  • L(n) is the left channel signal of the previous frame
  • R(n) is the right channel signal of the previous frame. channel signal.
  • the second decorrelation processing manner includes second sum and difference downmix processing.
  • the second sum and difference downmix processing includes:
  • the left channel signal and the right channel signal of the previous frame are processed based on Formula 7 to obtain the main channel signal and the secondary channel signal;
  • the Formula 7 is:
  • Mid(n) is the main channel signal of the previous frame
  • Sid(n) is the secondary channel signal of the previous frame
  • L(n) is the left channel signal of the previous frame
  • R(n) is the right channel signal of the previous frame. channel signal.
  • the device is also used for:
  • ⁇ (LR) is the cross-correlation coefficient of the left channel signal and the right channel signal of the previous frame
  • L (n) is the nth sampling point of the left channel signal of the previous frame, is the average value of all samples of the left channel signal of the previous frame
  • R(n) is the nth sample point of the right channel signal of the previous frame, is the average value of all samples of the right channel signal of the previous frame
  • N is the total number of samples of the left channel signal or the right channel signal of the previous frame, that is, the frame length of the previous frame.
  • the signal after decorrelation processing includes a main channel signal and a secondary channel signal
  • Said device is also used for:
  • ⁇ (MS) is the second cross-correlation coefficient or the third cross-correlation coefficient
  • Mid(n) is the nth sampling point of the main channel signal in the signal after decorrelation processing, is the average value of all sample points of the main channel signal in the signal after decorrelation processing
  • Sid(n) is the nth sample point of the secondary channel signal in the signal after decorrelation processing
  • N is the total number of samples of the left channel signal or the right channel signal of the previous frame, that is, the frame length of the previous frame.
  • the device is also used for:
  • the first threshold Thresh3 1 and the second threshold Thresh3 2 corresponding to the first frame are determined based on formula ten, and the formula ten is:
  • Thresh3 1 and Thresh3 2 are respectively the first threshold of the first frame and the second threshold of the first frame
  • Thresh0 1 and Thresh0 2 are respectively the first initial threshold of the first frame and the first threshold of the first frame.
  • the second initial threshold for a frame.
  • Fig. 9 is a block diagram of a user equipment UE900 provided by an embodiment of the present disclosure.
  • the UE 900 may be a mobile phone, a computer, a digital broadcast terminal device, a messaging device, a game console, a tablet device, a medical device, a fitness device, a personal digital assistant, and the like.
  • UE900 may include at least one of the following components: a processing component 902, a memory 904, a power supply component 906, a multimedia component 908, an audio component 910, an input/output (I/O) interface 912, a sensor component 913, and a communication component 916.
  • a processing component 902 a memory 904
  • a power supply component 906 a multimedia component 908, an audio component 910, an input/output (I/O) interface 912, a sensor component 913, and a communication component 916.
  • I/O input/output
  • the processing component 902 generally controls the overall operations of the UE 900, such as those associated with display, phone calls, data communications, camera operations, and recording operations.
  • the processing component 902 may include at least one processor 920 to execute instructions to complete all or part of the steps of the above-mentioned method.
  • processing component 902 can include at least one module to facilitate interaction between processing component 902 and other components.
  • processing component 902 may include a multimedia module to facilitate interaction between multimedia component 908 and processing component 902 .
  • the memory 904 is configured to store various types of data to support operations at the UE 900 . Examples of such data include instructions for any application or method operating on UE900, contact data, phonebook data, messages, pictures, videos, etc.
  • the memory 904 can be implemented by any type of volatile or non-volatile storage device or their combination, such as static random access memory (SRAM), electrically erasable programmable read-only memory (EEPROM), erasable Programmable Read Only Memory (EPROM), Programmable Read Only Memory (PROM), Read Only Memory (ROM), Magnetic Memory, Flash Memory, Magnetic or Optical Disk.
  • SRAM static random access memory
  • EEPROM electrically erasable programmable read-only memory
  • EPROM erasable Programmable Read Only Memory
  • PROM Programmable Read Only Memory
  • ROM Read Only Memory
  • Magnetic Memory Flash Memory
  • Magnetic or Optical Disk Magnetic Disk
  • the power supply component 906 provides power to various components of the UE 900 .
  • Power component 906 may include a power management system, at least one power supply, and other components associated with generating, managing, and distributing power for UE 900 .
  • the multimedia component 908 includes a screen providing an output interface between the UE 900 and the user.
  • the screen may include a liquid crystal display (LCD) and a touch panel (TP). If the screen includes a touch panel, the screen may be implemented as a touch screen to receive input signals from a user.
  • the touch panel includes at least one touch sensor to sense touch, slide, and gestures on the touch panel. The touch sensor may not only sense a boundary of a touch or slide action, but also detect a wake-up time and pressure related to the touch or slide operation.
  • the multimedia component 908 includes a front camera and/or a rear camera. When the UE900 is in an operation mode, such as a shooting mode or a video mode, the front camera and/or the rear camera can receive external multimedia data. Each front camera and rear camera can be a fixed optical lens system or have focal length and optical zoom capability.
  • the audio component 910 is configured to output and/or input audio signals.
  • the audio component 910 includes a microphone (MIC), which is configured to receive an external audio signal when the UE 900 is in an operation mode, such as a call mode, a recording mode, and a voice recognition mode. Received audio signals may be further stored in memory 904 or sent via communication component 916 .
  • the audio component 910 also includes a speaker for outputting audio signals.
  • the I/O interface 912 provides an interface between the processing component 902 and a peripheral interface module.
  • the peripheral interface module may be a keyboard, a click wheel, a button, and the like. These buttons may include, but are not limited to: a home button, volume buttons, start button, and lock button.
  • the sensor component 913 includes at least one sensor for providing various aspects of state assessment for the UE 900 .
  • the sensor component 913 can detect the open/closed state of the device 900, the relative positioning of components, such as the display and the keypad of the UE900, the sensor component 913 can also detect the position change of the UE900 or a component of the UE900, and the user and Presence or absence of UE900 contact, UE900 orientation or acceleration/deceleration and temperature change of UE900.
  • the sensor assembly 913 may include a proximity sensor configured to detect the presence of nearby objects in the absence of any physical contact.
  • the sensor assembly 913 may also include an optical sensor, such as a CMOS or CCD image sensor, for use in imaging applications.
  • the sensor component 913 may also include an acceleration sensor, a gyroscope sensor, a magnetic sensor, a pressure sensor or a temperature sensor.
  • Communication component 916 is configured to facilitate wired or wireless communications between UE 900 and other devices.
  • UE900 can access wireless networks based on communication standards, such as WiFi, 2G or 3G, or a combination thereof.
  • the communication component 916 receives broadcast signals or broadcast related information from an external broadcast management system via a broadcast channel.
  • the communication component 916 also includes a near field communication (NFC) module to facilitate short-range communication.
  • NFC near field communication
  • the NFC module may be implemented based on Radio Frequency Identification (RFID) technology, Infrared Data Association (IrDA) technology, Ultra Wideband (UWB) technology, Bluetooth (BT) technology and other technologies.
  • RFID Radio Frequency Identification
  • IrDA Infrared Data Association
  • UWB Ultra Wideband
  • Bluetooth Bluetooth
  • UE 900 may be powered by at least one Application Specific Integrated Circuit (ASIC), Digital Signal Processor (DSP), Digital Signal Processing Device (DSPD), Programmable Logic Device (PLD), Field Programmable Gate Array ( FPGA), controller, microcontroller, microprocessor or other electronic components for implementing the above method.
  • ASIC Application Specific Integrated Circuit
  • DSP Digital Signal Processor
  • DSPD Digital Signal Processing Device
  • PLD Programmable Logic Device
  • FPGA Field Programmable Gate Array
  • controller microcontroller, microprocessor or other electronic components for implementing the above method.
  • Fig. 10 is a block diagram of a network side device 1000 provided by an embodiment of the present disclosure.
  • the network side device 1000 may be provided as a network side device.
  • the network side device 1000 includes a processing component 1011, which further includes at least one processor, and a memory resource represented by a memory 1032 for storing instructions executable by the processing component 1022, such as an application program.
  • the application program stored in memory 1032 may include one or more modules each corresponding to a set of instructions.
  • the processing component 1010 is configured to execute instructions, so as to execute any of the aforementioned methods applied to the network side device, for example, the method shown in FIG. 1 .
  • the network side device 1000 may also include a power supply component 1026 configured to perform power management of the network side device 1000, a wired or wireless network interface 1050 configured to connect the network side device 1000 to the network, and an input and output (I/O ) interface 1058.
  • the network side device 1000 can operate based on the operating system stored in the memory 1032, such as Windows ServerTM, Mac OS XTM, UnixTM, LinuxTM, Free BSDTM or similar.
  • the methods provided in the embodiments of the present disclosure are introduced from the perspectives of the network side device and the UE respectively.
  • the network side device and the UE may include a hardware structure and a software module, and implement the above functions in the form of a hardware structure, a software module, or a hardware structure plus a software module.
  • a certain function among the above-mentioned functions may be implemented in the form of a hardware structure, a software module, or a hardware structure plus a software module.
  • the methods provided in the embodiments of the present disclosure are introduced from the perspectives of the network side device and the UE respectively.
  • the network side device and the UE may include a hardware structure and a software module, and implement the above functions in the form of a hardware structure, a software module, or a hardware structure plus a software module.
  • a certain function among the above-mentioned functions may be implemented in the form of a hardware structure, a software module, or a hardware structure plus a software module.
  • the communication device may include a transceiver module and a processing module.
  • the transceiver module may include a sending module and/or a receiving module, the sending module is used to realize the sending function, the receiving module is used to realize the receiving function, and the sending and receiving module can realize the sending function and/or the receiving function.
  • the communication device may be a terminal device (such as the terminal device in the foregoing method embodiments), may also be a device in the terminal device, and may also be a device that can be matched and used with the terminal device.
  • the communication device may be a network device, or a device in the network device, or a device that can be matched with the network device.
  • the communication device may be a network device, or a terminal device (such as the terminal device in the foregoing method embodiments), or a chip, a chip system, or a processor that supports the network device to implement the above method, or it may be a terminal device that supports A chip, a chip system, or a processor for realizing the above method.
  • the device can be used to implement the methods described in the above method embodiments, and for details, refer to the descriptions in the above method embodiments.
  • a communications device may include one or more processors.
  • the processor may be a general purpose processor or a special purpose processor or the like.
  • it can be a baseband processor or a central processing unit.
  • the baseband processor can be used to process communication protocols and communication data
  • the central processor can be used to control communication devices (such as network side equipment, baseband chips, terminal equipment, terminal equipment chips, DU or CU, etc.)
  • a computer program that processes data for a computer program.
  • the communication device may further include one or more memories, on which computer programs may be stored, and the processor executes the computer programs, so that the communication device executes the methods described in the foregoing method embodiments.
  • data may also be stored in the memory.
  • the communication device and the memory can be set separately or integrated together.
  • the communication device may further include a transceiver and an antenna.
  • the transceiver may be referred to as a transceiver unit, a transceiver, or a transceiver circuit, etc., and is used to implement a transceiver function.
  • the transceiver may include a receiver and a transmitter, and the receiver may be called a receiver or a receiving circuit for realizing a receiving function; the transmitter may be called a transmitter or a sending circuit for realizing a sending function.
  • the communication device may further include one or more interface circuits.
  • the interface circuit is used to receive code instructions and transmit them to the processor.
  • the processor executes the code instructions to enable the communication device to execute the methods described in the foregoing method embodiments.
  • the communication device is a terminal device (such as the terminal device in the foregoing method embodiments): the processor is configured to execute any of the methods shown in FIG. 1-FIG. 4a.
  • the communication device is a network device: the transceiver is used to execute the method shown in any one of Fig. 5-Fig. 7 .
  • the processor may include a transceiver for implementing receiving and transmitting functions.
  • the transceiver may be a transceiver circuit, or an interface, or an interface circuit.
  • the transceiver circuits, interfaces or interface circuits for realizing the functions of receiving and sending can be separated or integrated together.
  • the above-mentioned transceiver circuit, interface or interface circuit may be used for reading and writing code/data, or the above-mentioned transceiver circuit, interface or interface circuit may be used for signal transmission or transfer.
  • the processor may store a computer program, and the computer program runs on the processor to enable the communication device to execute the methods described in the foregoing method embodiments.
  • a computer program may be embedded in a processor, in which case the processor may be implemented by hardware.
  • the communication device may include a circuit, and the circuit may implement the function of sending or receiving or communicating in the foregoing method embodiments.
  • the processors and transceivers described in this disclosure can be implemented on integrated circuits (integrated circuits, ICs), analog ICs, radio frequency integrated circuits (RFICs), mixed signal ICs, application specific integrated circuits (ASICs), printed circuit boards ( printed circuit board, PCB), electronic equipment, etc.
  • the processor and transceiver can also be fabricated using various IC process technologies such as complementary metal oxide semiconductor (CMOS), nMetal-oxide-semiconductor (NMOS), P-type Metal oxide semiconductor (positive channel metal oxide semiconductor, PMOS), bipolar junction transistor (bipolar junction transistor, BJT), bipolar CMOS (BiCMOS), silicon germanium (SiGe), gallium arsenide (Gas), etc.
  • CMOS complementary metal oxide semiconductor
  • NMOS nMetal-oxide-semiconductor
  • PMOS bipolar junction transistor
  • BJT bipolar CMOS
  • SiGe silicon germanium
  • Gas gallium arsenide
  • the communication device described in the above embodiments may be a network device or a terminal device (such as the terminal device in the foregoing method embodiments), but the scope of the communication device described in this disclosure is not limited thereto, and the structure of the communication device may not be limited limits.
  • a communication device may be a stand-alone device or may be part of a larger device.
  • the communication device may be:
  • a set of one or more ICs may also include storage components for storing data and computer programs;
  • ASIC such as modem (Modem);
  • the communications device may be a chip or system-on-a-chip
  • the chip includes a processor and an interface.
  • the number of processors may be one or more, and the number of interfaces may be more than one.
  • the chip also includes a memory, which is used to store necessary computer programs and data.
  • An embodiment of the present disclosure also provides a system for determining the duration of a side link, the system includes a communication device as a terminal device (such as the first terminal device in the method embodiment above) in the foregoing embodiments and a communication device as a network device, Alternatively, the system includes the communication device as the terminal device in the foregoing embodiments (such as the first terminal device in the foregoing method embodiment) and the communication device as a network device.
  • the present disclosure also provides a readable storage medium on which instructions are stored, and when the instructions are executed by a computer, the functions of any one of the above method embodiments are realized.
  • the present disclosure also provides a computer program product, which implements the functions of any one of the above method embodiments when executed by a computer.
  • all or part of them may be implemented by software, hardware, firmware or any combination thereof.
  • software When implemented using software, it may be implemented in whole or in part in the form of a computer program product.
  • the computer program product comprises one or more computer programs. When the computer program is loaded and executed on the computer, all or part of the processes or functions according to the embodiments of the present disclosure will be generated.
  • the computer can be a general purpose computer, a special purpose computer, a computer network, or other programmable devices.
  • the computer program can be stored in or transmitted from one computer-readable storage medium to another computer-readable storage medium, for example, the computer program can be downloaded from a website, computer, server or data center Transmission to another website site, computer, server or data center by wired (such as coaxial cable, optical fiber, digital subscriber line (DSL)) or wireless (such as infrared, wireless, microwave, etc.).
  • the computer-readable storage medium may be any available medium that can be accessed by a computer, or a data storage device such as a server or a data center integrated with one or more available media.
  • the available medium may be a magnetic medium (for example, a floppy disk, a hard disk, a magnetic tape), an optical medium (for example, a high-density digital video disc (digital video disc, DVD)), or a semiconductor medium (for example, a solid state disk (solid state disk, SSD)) etc.
  • a magnetic medium for example, a floppy disk, a hard disk, a magnetic tape
  • an optical medium for example, a high-density digital video disc (digital video disc, DVD)
  • a semiconductor medium for example, a solid state disk (solid state disk, SSD)
  • At least one in the present disclosure can also be described as one or more, and a plurality can be two, three, four or more, and the present disclosure is not limited.
  • the technical feature is distinguished by "first”, “second”, “third”, “A”, “B”, “C” and “D”, etc.
  • the technical features described in the “first”, “second”, “third”, “A”, “B”, “C” and “D” have no sequence or order of magnitude among the technical features described.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Multimedia (AREA)
  • Mathematical Physics (AREA)
  • Algebra (AREA)
  • Quality & Reliability (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Analysis (AREA)
  • Mathematical Optimization (AREA)
  • Pure & Applied Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Stereophonic System (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

本公开提出一种立体声音频信号处理方法及设备/存储介质/装置,属于通信技术领域。该方法包括:确定所述立体声音频信号当前帧的初始第一阈值Thresh0 1和初始第二阈值Thresh0 2,其中Thresh0 1∈(-1,0),Thresh0 2∈(0,1);确定偏移值Delta;基于所述立体声音频信号前一帧的去相关处理方式、所述偏移值Delta、所述当前帧的初始第一阈值Thresh0 1、所述当前帧的初始第二阈值Thresh0 2,确定所述立体声音频信号当前帧对应的第一阈值Thresh1和第二阈值Thresh2;基于所述当前帧对应的第一阈值Thresh1和第二阈值Thresh2对所述当前帧进行去相关处理。本公开的方法可以提高编码压缩率。

Description

一种立体声音频信号处理方法及设备/存储介质/装置 技术领域
本公开涉及通信技术领域,尤其涉及一种立体声音频信号处理方法及设备/存储介质/装置。
背景技术
由于无损编码可以满足高质量音频回放和无损存储的需求,因此得到广泛应用。通常在对立体声音频信号进行无损编码时,需要先对立体声音频信号进行去相关处理,以提高编码压缩率。
相关技术中,去相关处理的主要方式为:设置阈值,并计算立体声音频信号当前帧左声道信号和右声道信号的相关性系数,基于该相关性系数与阈值的大小关系确定当前帧左声道信号和右声道信号的相关性,并基于所确定出的相关性采取最优的去相关处理方式对当前帧进行去相关处理。
但是,相关技术中,立体声音频信号的每一帧对应的阈值是固定的,无法自适应更新,则会影响不同帧的相关性确定的准确性,进而针对每一帧无法准确选择出最优的去相关处理方式,导致无法提升编码压缩率。
发明内容
本公开提出的一种立体声音频信号处理方法及设备/存储介质/装置,以解决相关技术中的去相关处理方法编码压缩率较低的技术问题。
本公开一方面实施例提出的立体声音频信号处理方法,应用于编码设备,包括:
确定所述立体声音频信号当前帧的初始第一阈值Thresh0 1和初始第二阈值Thresh0 2,其中Thresh0 1∈(-1,0),Thresh0 2∈(0,1);
确定偏移值Delta;
基于所述立体声音频信号前一帧的去相关处理方式、所述偏移值Delta、所述当前帧的初始第一阈值Thresh0 1、所述当前帧的初始第二阈值Thresh0 2,确定所述立体声音频信号当前帧对应的第一阈值Thresh1和第二阈值Thresh2;
基于所述当前帧对应的第一阈值Thresh1和第二阈值Thresh2对所述当前帧进行去相关处理。
本公开又一方面实施例提出的立体声音频信号处理装置,包括:
确定模块,用于确定所述立体声音频信号当前帧的初始第一阈值Thresh0 1和初始第二阈值Thresh0 2,其中Thresh0 1∈(-1,0),Thresh0 2∈(0,1);
确定模块,用于确定偏移值Delta;
确定模块,用于基于所述立体声音频信号前一帧的去相关处理方式、所述偏移值Delta、所述当前帧的初始第一阈值Thresh0 1、所述当前帧的初始第二阈值Thresh0 2,确定所述立体声音频信号当前帧对应的第一阈值Thresh1和第二阈值Thresh2;
处理模块,用于基于所述当前帧对应的第一阈值Thresh1和第二阈值Thresh2对所述当前帧进行去相关处理。
本公开又一方面实施例提出的一种通信装置,所述装置包括处理器和存储器,所述存储器中存储有计算机程序,所述处理器执行所述存储器中存储的计算机程序,以使所述装置执行如上另一方面实施例提出的方法。
本公开又一方面实施例提出的通信装置,包括:处理器和接口电路;
所述接口电路,用于接收代码指令并传输至所述处理器;
所述处理器,用于运行所述代码指令以执行如另一方面实施例提出的方法。
本公开又一方面实施例提出的计算机可读存储介质,用于存储有指令,当所述指令被执行时,使如另一方面实施例提出的方法被实现。
综上所述,在本公开实施例提供的立体声音频信号处理方法及设备/存储介质/装置之中,会先确定立体声音频信号当前帧的初始第一阈值Thresh0 1和初始第二阈值Thresh0 2,其中Thresh0 1∈(-1,0),Thresh0 2∈(0,1);之后,会确定偏移值Delta;以及,会基于立体声音频信号前一帧的去相关处理方式、偏移值Delta、当前帧的初始第一阈值Thresh0 1、当前帧的初始第二阈值Thresh0 2,确定立体声音频信号当前帧对应的第一阈值Thresh1和第二阈值Thresh2;以便后续可以基于当前帧对应的第一阈值Thresh1和第二阈值Thresh2对当前帧进行去相关处理。由此可知,本公开实施例中,会基于前一帧的去相关处理方式来对当前帧对应的第一阈值Thresh1和第二阈值Thresh2进行实时自适应更新,从而可以确保每一帧的相关性确定的准确性,进而可以基于每一帧的相关性准确选择出最优的去相关处理方式,提升了编码压缩率。
附图说明
本公开上述的和/或附加的方面和优点从下面结合附图对实施例的描述中将变得明显和容易理解,其中:
图1a为本公开实施例所提供的一种立体声音频信号处理方法的流程示意图;
图1b为本公开实施例所提供的一种基于去相关处理后的信号得到编码码流的流程框图;
图2为本公开实施例所提供的一种立体声音频信号处理方法的流程示意图;
图3为本公开实施例所提供的一种立体声音频信号处理方法的流程示意图;
图4为本公开实施例所提供的一种立体声音频信号处理方法的流程示意图;
图5为本公开实施例所提供的一种立体声音频信号处理方法的流程示意图;
图6为本公开实施例所提供的一种立体声音频信号处理方法的流程示意图;
图7为本公开实施例所提供的一种立体声音频信号处理方法的流程示意图;
图8为本公开实施例所提供的一种立体声音频信号处理装置的结构示意图;
图9是本公开一个实施例所提供的一种用户设备的框图;
图10为本公开一个实施例所提供的一种网络侧设备的框图。
具体实施方式
这里将详细地对示例性实施例进行说明,其示例表示在附图中。下面的描述涉及附图时,除非另有表示,不同附图中的相同数字表示相同或相似的要素。以下示例性实施例中所描述的实施方式并不代表与本公开实施例相一致的所有实施方式。相反,它们仅是与如所附权利要求书中所详述的、本公开实施例的一些方面相一致的装置和方法的例子。
在本公开实施例使用的术语是仅仅出于描述特定实施例的目的,而非旨在限制本公开实施例。在本公开实施例和所附权利要求书中所使用的单数形式的“一种”和“该”也旨在包括多数形式,除非上下文清楚地表示其他含义。还应当理解,本文中使用的术语“和/或”是指并包含一个或多个相关联的列出项目的任何或所有可能组合。
应当理解,尽管在本公开实施例可能采用术语第一、第二、第三等来描述各种信息,但这些信息不应限于这些术语。这些术语仅用来将同一类型的信息彼此区分开。例如,在不脱离本公开实施例范围的情况下,第一信息也可以被称为第二信息,类似地,第二信息也可以被称为第一信息。取决于语境,如在此所使用的词语“如果”及“若”可以被解释成为“在……时”或“当……时”或“响应于确定”。
下面参考附图对本公开实施例所提供的立体声音频信号处理方法、装置、编码设备、解码设备及存储介质进行详细描述。
图1a为本公开实施例所提供的一种立体声音频信号处理方法的流程示意图,该方法由编码设备执行,如图1a所示,该立体声音频信号处理方法可以包括以下步骤:
步骤101、确定立体声音频信号当前帧的初始第一阈值Thresh0 1和初始第二阈值Thresh0 2
在本公开的一个实施例之中,该当前帧可以为立体声音频信号中除第一帧之外的任一帧。
以及,在本公开的一个实施例之中,上述初始第一阈值Thresh0 1和初始第二阈值Thresh0 2可以是预先设定的,其中,初始第一阈值Thresh0 1∈(-1,0),初始第二阈值Thresh0 2∈(0,1)。
进一步地,在本公开的一个实施例之中,该初始第一阈值Thresh0 1的绝对值和初始第二阈值Thresh0 2的绝对值可以相同。在本公开的另一个实施例之中,该初始第一阈值Thresh0 1的绝对值和初始第二阈值Thresh0 2的绝对值可以不同。示例的,在本公开的一个实施例之中,初始第一阈值Thresh0 1的绝对值和初始第二阈值Thresh0 2的绝对值可以均为0.47,即:初始第一阈值Thresh0 1=-0.47、初始第二阈值Thresh0 2=0.47。可以理解的是,上述数值可以应用到本公开任意的实施例中,并且该数值仅仅作为示例示出,本公开对此不作限定。
此外,需要说明的是,在本公开的一个实施例之中,立体声音频信号的每一帧对应的初始第一阈值Thresh0 1是相同的,立体声音频信号的每一帧对应的初始第二阈值Thresh0 2是相同的。
步骤102、确定偏移值Delta。
其中,在本公开的一个实施例之中,所确定出的偏移值Delta具体作用为:利用该偏移值Delta对当前帧的初始第一阈值Thresh0 1和初始第二阈值Thresh0 2进行更新以得到当前帧对应的第一阈值Thresh1和第二阈值Thresh2。由此,在本公开的一个实施例之中,该偏移值Delta可以包括有偏移值Delta1和偏移值Delta2,其中,该偏移值Delta1可以用于对当前帧的初始第一阈值Thresh0 1进行更新,该偏移值Delta2可以用于对当前帧的初始第二阈值Thresh0 2进行更新。
以及,在本公开的一个实施例之中,确定偏移值Delta1的方法可以包括:使得Delta1∈(0,|Thresh0 1|),确定偏移值Delta2的方法可以包括:使得Delta2∈(0,|Thresh0 2|)。并且,在本公开的一个实施例之中,偏移值Delta1与偏移值Delta2可以相同。在本公开的另一个实施例之中,偏移值Delta1与偏移值Delta2可以不同。示例的,在本公开的一个实施例中,偏移值Delta1和Delta2可以为0.05。可以理解的是,上述数值可以应用到本公开任意的实施例中,并且该数值仅仅作为示例示出,本公开对此不作限定。
步骤103、基于立体声音频信号前一帧的去相关处理方式、偏移值Delta、当前帧的初始第一阈值Thresh0 1、当前帧的初始第二阈值Thresh0 2,确定立体声音频信号当前帧对应的第一阈值Thresh1和第二阈值Thresh2。
其中,在本公开的一个实施例之中,前一帧的去处理方式不同时,确定立体声音频信号当前帧对应的第一阈值Thresh1和第二阈值Thresh2的方式也会有所不同。关于该部分的详细介绍具体会在后续实施例描述。
以及,在本公开的一个实施例之中,上述的前一帧的去相关处理方式可以基于前一帧对应的标志位来确定,其中,每一帧的标志位用于指示每一帧的去相关处理方式。示例的,在本公开的一个实施例之中,响应于前一帧的标志位置0,则确定前一帧的去相关处理方式为:第一去相关处理方式;响应于前一帧的标志位置1,则确定前一帧的去相关处理方式为:第二去相关处理方式;响应于前一帧的标志位置2,则确定前一帧的去相关处理方式为:未进行去相关处理。其中,关于第一去相关处理方式、第二去相关处理方式、未进行去相关处理的详细介绍会在后续实施例进行描述。
步骤104、基于当前帧对应的第一阈值Thresh1和第二阈值Thresh2对当前帧进行去相关处理。
在本公开的一个实施例之中,该当前帧对应的第一阈值Thresh1具体可以用于确定当前帧为偏反相信号或不相关信号,该第二阈值Thresh2具体可以用于确定当前帧为偏正相信号或不相关信号。
以及,在本公开的一个实施例之中,基于当前帧对应的第一阈值Thresh1和第二阈值Thresh2对当前帧进行去相关处理的方法可以包括:
步骤1、基于当前帧对应的第一阈值Thresh1和第二阈值Thresh2确定当前帧的相关性,其中,该相关性包括偏反相信号、偏正相信号以及不相关信号。
具体的,在本公开的一个实施例之中,响应于当前帧左声道信号和右声道信号的互相关系数小于当前帧对应的第一阈值Thresh1,确定当前帧为偏反相信号,响应于当前帧左声道信号和右声道信号的互相关系数大于当前帧对应的第二阈值Thresh2,确定当前帧为偏正相信号,响应于当前帧左声道信号和右声道信号的互相关系数大于等于当前帧对应的第一阈值Thresh1且小于等于当前帧对应的第二阈值Thresh2,确定当前帧为不相关信号。
步骤2、基于当前帧的相关性选择最优的去相关处理方式对当前帧进行去相关处理以得到去相关处理后的信号。
进一步地,在本公开的一个实施例之中,对当前帧进行去相关处理得到去相关处理后的信号后,可以基于去相关处理后的信号得到编码码流。其中,在本公开的一个实施例之中,图1b为本公开实施例所提供的一种基于去相关处理后的信号得到编码码流的流程框图,如图1b所示,基于去相关处理后的信号得到编码码流的方法可以为:
对去相关处理后的信号采用整型提升小波分解进行分带得到各子带信号,对去相关处理后的信号进行LPC(Linear Prediction Coefficient,线性预测系数)参数计算和量化以得到量化LPC参数,再利用线性预测器基于量化LPC参数对各子带信号进行预测,生成预测残差信号,利用预处理器对预测残差信号进行归一化处理,产生归一化输出信号、LSB(Least Significant Bit,最低有效位)信号以及信号符号位。利用熵编码器对各子带信号对应的归一化输出信号进行熵编码,生成编码位流,再对编码位流、LSB信号、信号符号位,量化LPC参数以及小波边信息进行码流复用得到编码码流。
综上所述,在本公开实施例提供的立体声音频信号处理方法之中,会先确定立体声音频信号当前帧的初始第一阈值Thresh0 1和初始第二阈值Thresh0 2,其中Thresh0 1∈(-1,0),Thresh0 2∈(0,1);之后,会确定偏移值Delta;以及,会基于立体声音频信号前一帧的去相关处理方式、偏移值Delta、当前帧的初始第一阈值Thresh0 1、当前帧的初始第二阈值Thresh0 2,确定立体声音频信号当前帧对应的第一阈值Thresh1和第二阈值Thresh2;以便后续可以基于当前帧对应的第一阈值Thresh1和第二阈值Thresh2对当前帧进行去相关处理。由此可知,本公开实施例中,会基于前一帧的去相关处理方式来对当前帧对应的第一阈值Thresh1和第二阈值Thresh2进行实时自适应更新,从而可以确保每一帧的相关性确定的准确性,进而可以基于每一帧的相关性准确选择出最优的去相关处理方式,提升了编码压缩率。
图2为本公开实施例所提供的一种立体声音频信号处理方法的流程示意图,该方法由编码设备执行,如图2所示,该立体声音频信号处理方法可以包括以下步骤:
步骤201、确定立体声音频信号当前帧的初始第一阈值Thresh0 1和初始第二阈值Thresh0 2
步骤202、确定偏移值Delta。
其中,关于步骤201-202的相关介绍可以参考上述实施例描述,本公开实施例在此不做赘述。
步骤203、响应于立体声音频信号前一帧的去相关处理方式为:采用第一去相关处理方式进行去相关处理,基于公式一确定当前帧对应的第一阈值Thresh1和第二阈值Thresh2。
在本公开的一个实施例之中,公式一为:
Figure PCTCN2021135514-appb-000001
Thresh1和Thresh2分别为当前帧的第一阈值、第二阈值,Thresh0 1和Thresh0 2分别为当前帧的第一初始阈值、当前帧的第二初始阈值,Delta为偏移值,且Delta∈(0,|Thresh0 1|)(也即是,本实施例中偏移值具体为上述实施例中的用于对当前帧的初始第一阈值Thresh0 1进行更新的偏移值Delta1)。
以下,对采用上述公式一确定当前帧对应的第一阈值Thresh1和第二阈值Thresh2的原理进行详细解释:
其中,在本公开的一个实施例之中,该第一去相关处理方式具体可以是用于对偏反相信号进行去相关处理的方式。以及,在本公开的一个实施例之中,确定是否利用第一去相关处理方式对前一帧进行去 相关处理的流程主要为:先判断前一帧是否为偏反相信号,当前一帧为偏反相信号时,则利用第一去相关处理方式对前一帧进行去相关处理,否则,不利用第一去相关处理方式对前一帧进行去相关处理。
进一步地,在本公开的一个实施例之中,上述判断前一帧是否为偏反相信号的流程主要为:先计算前一帧左声道信号和右声道信号的第一互相关系数,第一互相关系数小于前一帧对应的第一阈值Thresh2 1时,则判断该前一帧为偏反相信号,需要对信号进行第一去相关处理。
但是,需要说明的是,在本公开的一个实施例之中,仅基于前一帧对应的第一阈值Thresh2 1判断前一帧是否为偏反相信号进而判断是否需要进行第一去相关处理时,可能会由于前一帧对应的第一阈值Thresh2 1的设置不准而出现“判断不准确”的现象,使得经过第一去相关处理后的信号的相关性反而比第一去相关处理前的信号的相关性更强,导致信号没有达到去相关的目的。因此,在确定出第一互相关系数小于前一帧对应的第一阈值Thresh2 1的基础上,还需要进一步判断第一互相关系数是否小于第二互相关系数,其中,该第二互相关系数为采用第一去相关处理方式对前一帧信号进行第一去相关处理获得的去相关处理后的信号的互相关系数。
以及,在本公开的一个实施例之中,第一互相关系数小于第二互相关系数时,说明“基于前一帧对应的第一阈值Thresh2 1判断前一帧为是否要进行第一去相关处理的判断结果是准确的”,换言之,说明前一帧对应的第一阈值Thresh2 1设置准确,基于该第一阈值Thresh2 1识别出的偏反相信号经过第一去相关处理后能够达到去相关的目的,但是该第一阈值Thresh2 1有可能仍然没有达到是否需要进行去相关处理的阈值临界点,也就是说,该第一阈值Thresh2 1仍然存在增大的空间,使得增大后的阈值识别出的偏反相信号经过第一去相关处理后,第一互相关系数仍然小于第二互相关系数,也即去相关处理仍然能达到目的。
在此基础上,还需要说明的是,在本公开的一个实施例之中,若前一帧的去相关处理方式为:采用第一去相关处理方式进行去相关处理,说明前一帧为偏反相信号,并且前一帧的第一阈值Thresh2 1仍然存在增大的空间,并且由于该前一帧对应的第一阈值Thresh2 1是基于初始第一阈值Thresh0 1确定的,则可以得出,该初始第一阈值Thresh0 1也存在增大的空间。此时当前帧可以基于偏移值Delta对该初始第一阈值Thresh0 1进行更新得到当前帧对应的第一阈值Thresh1,即:使得Thresh1=Thresh0 1+Delta,通过第一阈值Thresh1对当前帧信号进行去相关处理,使得去相关处理结果更优。
进一步地,在本公开的一个实施例之中,前一帧的去相关处理方式为:采用第一去相关处理方式进行去相关处理时,说明该前一帧为偏反相信号。基于此,由于前一帧对应的第二阈值Thresh2 2并非用于判定前一帧是否为偏反相信号的,而是用于判断前一帧是否为不相关信号或偏正相信号的,因此,无需对初始第二阈值Thresh0 2进行更新,直接将初始第二阈值Thresh0 2确定为当前帧对应的第二阈值Thresh2即可,即:使得Thresh2=Thresh0 2
此外,需要说明的是,上述的第一去相关处理方式可以包括第一和差下混处理。
具体的,在本公开的一个实施例之中,该第一和差下混处理可以包括:
基于公式六对前一帧左声道信号和右声道信号进行处理以得到主声道信号和次声道信号;公式六为:
Figure PCTCN2021135514-appb-000002
其中,Mid(n)为前一帧主声道信号,Sid(n)为前一帧次声道信号,L(n)为前一帧左声道信号,R(n)为前一帧右声道信号。
以及,在本公开的一个实施例之中,上述的第一互相关系数的确定方法可以包括:
基于公式八确定前一帧左声道信号和右声道信号的第一互相关系数;公式八为:
Figure PCTCN2021135514-appb-000003
η (LR)为前一帧左声道信号和右声道信号的互相关系数,L(n)为前一帧左声道信号第n个样点,
Figure PCTCN2021135514-appb-000004
为前一帧左声道信号所有样点的平均值,R(n)为前一帧右声道信号第n个样点,
Figure PCTCN2021135514-appb-000005
为前一帧右声道信号所有样点的平均值,N为前一帧左声道信号或者右声道信号样点总数,即为前一帧帧长。
以及,在本公开的一个实施例之中,上述的第二互相关系数的确定方法可以包括:
基于公式九确定第二互相关系数;公式九为:
Figure PCTCN2021135514-appb-000006
η (MS)为第二互相关系数或第三互相关系数,Mid(n)为去相关处理后的信号中主声道信号第n个样点,
Figure PCTCN2021135514-appb-000007
为去相关处理后的信号中主声道信号所有样点的平均值,Sid(n)为去相关处理后的信号中次声道信号第n个样点,
Figure PCTCN2021135514-appb-000008
为去相关处理后的信号中次声道信号所有样点的平均值,N为前一帧左声道信号或者右声道信号样点总数,即为前一帧帧长。
步骤204、基于当前帧对应的第一阈值Thresh1和第二阈值Thresh2对当前帧进行去相关处理。
其中,关于步骤204的相关介绍可以参考上述实施例描述,本公开实施例在此不做赘述。
综上所述,在本公开实施例提供的立体声音频信号处理方法之中,会先确定立体声音频信号当前帧的初始第一阈值Thresh0 1和初始第二阈值Thresh0 2,其中Thresh0 1∈(-1,0),Thresh0 2∈(0,1);之后,会确定偏移值Delta;以及,会基于立体声音频信号前一帧的去相关处理方式、偏移值Delta、当前帧的初始第一阈值Thresh0 1、当前帧的初始第二阈值Thresh0 2,确定立体声音频信号当前帧对应的第一阈值Thresh1和第二阈值Thresh2;以便后续可以基于当前帧对应的第一阈值Thresh1和第二阈值Thresh2对当前帧进行去相关处理。由此可知,本公开实施例中,会基于前一帧的去相关处理方式来对当前帧对应的第一阈值Thresh1和第二阈值Thresh2进行实时自适应更新,从而可以确保每一帧的相关性确定的准确性,进而可以基于每一帧的相关性准确选择出最优的去相关处理方式,提升了编码压缩率。
图3为本公开实施例所提供的一种立体声音频信号处理方法的流程示意图,该方法由编码设备执行,如图3所示,该立体声音频信号处理方法可以包括以下步骤:
步骤301、确定立体声音频信号当前帧的初始第一阈值Thresh0 1和初始第二阈值Thresh0 2
步骤302、确定偏移值Delta。
其中,关于步骤301-302的相关介绍可以参考上述实施例描述,本公开实施例在此不做赘述。
步骤303、响应于立体声音频信号前一帧的去相关处理方式为:采用第二去相关处理方式进行去相关处理,基于公式二确定当前帧对应的第一阈值Thresh1和第二阈值Thresh2。
在本公开的一个实施例之中,公式二为:
Figure PCTCN2021135514-appb-000009
Thresh1和Thresh2分别为当前帧的第一阈值、第二阈值,Thresh0 1和Thresh0 2分别为当前帧的第一初始阈值、当前帧的第二初始阈值,Delta为偏移值,且Delta∈(0,|Thresh0 2|)(也即是,本实施例中偏移值具体为上述实施例中的用于对当前帧的初始第二阈值Thresh0 2进行更新的偏移值Delta2)。
以下,对采用上述公式二确定当前帧对应的第一阈值Thresh1和第二阈值Thresh2的原理进行详细解释:
其中,在本公开的一个实施例之中,该第二去相关处理方式具体可以是用于对偏正相信号进行去相关处理的方式。以及,在本公开的一个实施例之中,确定是否利用第二去相关处理方式对前一帧进行去相关处理的流程主要为:先判断前一帧是否为偏正相信号,当前一帧为偏正相信号时,则利用第二去相关处理方式对前一帧进行去相关处理,否则,不利用第二去相关处理方式对前一帧进行去相关处理。
进一步地,在本公开的一个实施例之中,上述判断前一帧是否为偏正相信号的流程主要为:先计算前一帧左声道信号和右声道信号的第一互相关系数,第一互相关系数大于前一帧对应的第二阈值Thresh2 2时,则判断该前一帧为偏正相信号,需要对信号进行第二去相关处理。
但是,需要说明的是,在本公开的一个实施例之中,仅基于前一帧对应的第二阈值Thresh2 2判断前一帧是否为偏正相信号进而判断是否需要进行第二去相关处理时,会由于前一帧对应的第二阈值Thresh2 2的设置不准而出现“判断不准确”的现象,使得经过第二去相关处理后的信号的相关性反而比第二去相关处理前的信号的相关性更强,导致信号没有达到去相关的目的。因此,在确定出第一互相关系数大于前一帧对应的第二阈值Thresh2 2的基础上,还需要进一步判断第一互相关系数是否大于第三互相关系数,其中,该第三互相关系数为采用第二去相关处理方式对前一帧信号进行第二去相关处理获得的去相关处理后的信号的互相关系数。
以及,在本公开的一个实施例之中,第一互相关系数大于第三互相关系数时,说明“基于前一帧对应的第二阈值Thresh2 2判断前一帧是否要进行第二去相关处理的判断结果是准确的”,换言之,说明前一帧对应的第二阈值Thresh2 2设置准确,基于该第二阈值Thresh2 2识别出的偏正相信号经过第二去相关处理后能够达到去相关的目的,但是该第二阈值Thresh2 2有可能仍然没有达到是否需要进行去相关处理的阈值临界点,也就是说,该第二阈值Thresh2 2仍然存在减小的空间,使得减小后的阈值识别出的偏正相信号经过第二去相关处理后,第一互相关系数仍然大于第三互相关系数,也即去相关处理仍然能达到目的。
在此基础上,还需要说明的是,在本公开的一个实施例之中,若前一帧的去相关处理方式为:采用第二去相关处理方式进行去相关处理,说明前一帧为偏正相信号,并且前一帧的第二阈值Thresh2 2仍然存在减小的空间,并且由于该前一帧对应的第二阈值Thresh2 2是基于初始第二阈值Thresh0 2确定的,则可以得出,该初始第二阈值Thresh0 2也存在减小的空间。此时当前帧可以基于偏移值Delta对该初始第二阈值Thresh0 2进行更新得到当前帧对应的第二阈值Thresh2,即:使得Thresh2=Thresh0 2-Delta,通过第二阈值Thresh2对当前帧信号进行去相关处理,使得去相关处理结果更优。
进一步地,在本公开的一个实施例之中,前一帧的去相关处理方式为:采用第二去相关处理方式进行去相关处理时,说明该前一帧为偏正相信号。基于此,由于前一帧对应的第一阈值Thresh2 1并非用于判定前一帧是否为偏正相信号的,而是用于判断前一帧是否为不相关信号或偏反相信号的,因此,无需对初始第一阈值Thresh0 1进行更新,直接将初始第一阈值Thresh0 1确定为当前帧对应的第一阈值Thresh1即可,即:使得Thresh1=Thresh0 1
此外,需要说明的是,上述的第二去相关处理方式可以包括第二和差下混处理。
具体的,在本公开的一个实施例之中,该第二和差下混处理可以包括:
基于公式七对前一帧左声道信号和右声道信号进行处理以得到主声道信号和次声道信号;公式七为:
Figure PCTCN2021135514-appb-000010
其中,Mid(n)为前一帧主声道信号,Sid(n)为前一帧次声道信号,L(n)为前一帧左声道信号,R(n)为前一帧右声道信号。
以及,关于上述的第一互相关系数的确定方法可以参考上述实施例描述,本公开实施例在此不做赘述。
以及,在本公开的一个实施例之中,上述的第二互相关系数的确定方法可以包括:
基于公式九确定第三互相关系数;公式九为:
Figure PCTCN2021135514-appb-000011
η (MS)为第二互相关系数或第三互相关系数,Mid(n)为去相关处理后的信号中主声道信号第n个样点,
Figure PCTCN2021135514-appb-000012
为去相关处理后的信号中主声道信号所有样点的平均值,Sid(n)为去相关处理后的信号中次声道信号第n个样点,
Figure PCTCN2021135514-appb-000013
为去相关处理后的信号中次声道信号所有样点的平均值,N为前一帧左声道信号或者右声道信号样点总数,即为前一帧帧长。
步骤304、基于当前帧对应的第一阈值Thresh1和第二阈值Thresh2对当前帧进行去相关处理。
其中,关于步骤304的相关介绍可以参考上述实施例描述,本公开实施例在此不做赘述。
综上所述,在本公开实施例提供的立体声音频信号处理方法之中,会先确定立体声音频信号当前帧的初始第一阈值Thresh0 1和初始第二阈值Thresh0 2,其中Thresh0 1∈(-1,0),Thresh0 2∈(0,1);之后,会确定偏移值Delta;以及,会基于立体声音频信号前一帧的去相关处理方式、偏移值Delta、当前帧的初始第一阈值Thresh0 1、当前帧的初始第二阈值Thresh0 2,确定立体声音频信号当前帧对应的第一阈值Thresh1和第二阈值Thresh2;以便后续可以基于当前帧对应的第一阈值Thresh1和第二阈值Thresh2对当前帧进行去相关处理。由此可知,本公开实施例中,会基于前一帧的去相关处理方式来对当前帧对应的第一阈值Thresh1和第二阈值Thresh2进行实时自适应更新,从而可以确保每一帧的相关性确定的准确性,进而可以基于每一帧的相关性准确选择出最优的去相关处理方式,提升了编码压缩率。
图4为本公开实施例所提供的一种立体声音频信号处理方法的流程示意图,该方法由编码设备执行,如图4所示,该立体声音频信号处理方法可以包括以下步骤:
步骤401、确定立体声音频信号当前帧的初始第一阈值Thresh0 1和初始第二阈值Thresh0 2
步骤402、确定偏移值Delta。
其中,关于步骤401-402的相关介绍可以参考上述实施例描述,本公开实施例在此不做赘述。
步骤403、响应于立体声音频信号前一帧的去相关处理方式为:未进行去相关处理,同时未进行去相关处理的缘由为:前一帧左声道信号和右声道信号的第一互相关系数大于等于前一帧对应的第一阈值Thresh2 1且小于等于前一帧对应的第二阈值Thresh2 2,基于公式三确定当前帧对应的第一阈值Thresh1和第二阈值Thresh2。
在本公开的一个实施例之中,公式三为:
Figure PCTCN2021135514-appb-000014
Thresh1和Thresh2分别为当前帧的第一阈值、第二阈值,Thresh0 1和Thresh0 2分别为所述当前帧的第一初始阈值、所述当前帧的第二初始阈值。
其中,在本公开的一个实施例之中,响应于前一帧左声道信号和右声道信号的第一互相关系数大于等于前一帧对应的第一阈值Thresh2 1且小于等于前一帧对应的第二阈值Thresh2 2,则说明前一帧为不相关信号,此时可以无需对当前帧的第一初始阈值Thresh0 1和当前帧的第二初始阈值Thresh0 2进行更新。
步骤404、基于当前帧对应的第一阈值Thresh1和第二阈值Thresh2对当前帧进行去相关处理。
其中,关于步骤404的相关介绍可以参考上述实施例描述,本公开实施例在此不做赘述。
综上所述,在本公开实施例提供的立体声音频信号处理方法之中,会先确定立体声音频信号当前帧的初始第一阈值Thresh0 1和初始第二阈值Thresh0 2,其中Thresh0 1∈(-1,0),Thresh0 2∈(0,1);之后,会确定偏移值Delta;以及,会基于立体声音频信号前一帧的去相关处理方式、偏移值Delta、当前帧的初始第一阈值Thresh0 1、当前帧的初始第二阈值Thresh0 2,确定立体声音频信号当前帧对应的第一阈值Thresh1和第二阈值Thresh2;以便后续可以基于当前帧对应的第一阈值Thresh1和第二阈值Thresh2对当前帧进行去相关处理。由此可知,本公开实施例中,会基于前一帧的去相关处理方式来对当前帧对应的第一阈值Thresh1和第二阈值Thresh2进行实时自适应更新,从而可以确保每一帧的相关性确定的准确性,进而可以基于每一帧的相关性准确选择出最优的去相关处理方式,提升了编码压缩率。
图5为本公开实施例所提供的一种立体声音频信号处理方法的流程示意图,该方法由编码设备执行,如图5所示,该立体声音频信号处理方法可以包括以下步骤:
步骤501、确定立体声音频信号当前帧的初始第一阈值Thresh0 1和初始第二阈值Thresh0 2
步骤502、确定偏移值Delta。
其中,关于步骤501-502的相关介绍可以参考上述实施例描述,本公开实施例在此不做赘述。
步骤503、响应于立体声音频信号前一帧的去相关处理方式为:未进行去相关处理,同时未进行去相关处理的缘由为:前一帧左声道信号和右声道信号的第一互相关系数小于前一帧对应的第一阈值Thresh2 1,且第一互相关系数大于等于第二互相关系数,基于公式四确定当前帧对应的第一阈值Thresh1和第二阈值Thresh2。
其中,在本公开的一个实施例之中,公式四为:
Figure PCTCN2021135514-appb-000015
Thresh1和Thresh2分别为当前帧的第一阈值、第二阈值,Thresh0 1和Thresh0 2分别为当前帧的第一初始阈值、当前帧的第二初始阈值,Delta为偏移值,且Delta∈(0,|Thresh0 1|)(也即是,本实施例中偏移值具体为上述实施例中的用于对当前帧的初始第一阈值Thresh0 1进行更新的偏移值Delta1)。
以及,在本公开的一个实施例之中,第二互相关系数为采用第一去相关处理方式对前一帧信号进行第一去相关处理获得的去相关处理后的信号的互相关系数。
以下,对采用上述公式四确定当前帧对应的第一阈值Thresh1和第二阈值Thresh2的原理进行详细解释:第一互相关系数大于等于第二互相关系数时,则说明该前一帧没有进行去相关处理,也即是,说明“基于前一帧对应的第一阈值Thresh2 1判断前一帧是偏反相信号从而进行第一去相关处理的判断结果是不准确的”,换言之,说明前一帧对应的第一阈值Thresh2 1取值不准确,基于该第一阈值Thresh2 1识别出的信号经过第一去相关处理后不能达到去相关的目的,认为该第一阈值Thresh2 1大于是否需要进行去相关处理的阈值临界点,也就是说,该第一阈值Thresh2 1需要减小,使得减小后的阈值识别出的偏反相信号经过第一去相关处理后,第一互相关系数小于第二互相关系数,也即使得去相关处理能达到目 的。
其中,在本公开的一个实施例之中,基于前述描述可知,若前一帧左声道信号和右声道信号的第一互相关系数小于前一帧对应的第一阈值Thresh2 1,且第一互相关系数大于等于第二互相关系数,则说明认为该第一阈值Thresh2 1大于是否需要进行去相关处理的阈值临界点,并且由于该前一帧对应的第一阈值Thresh2 1是基于初始第一阈值Thresh0 1确定的,则可以得出,该初始第一阈值Thresh0 1可能也大于是否需要进行去相关处理的阈值临界点。此时可以基于偏移值Delta对该初始第一阈值Thresh0 1进行更新得到当前帧对应的第一阈值Thresh1,即:使得Thresh1=Thresh0 1-Delta,通过第一阈值Thresh1对当前帧信号进行去相关处理,使得去相关处理结果更优。
进一步地,在本公开的一个实施例之中,由于“前一帧左声道信号和右声道信号的第一互相关系数小于前一帧对应的第一阈值Thresh2 1,且第一互相关系数大于等于第二互相关系数”说明的是“基于前一帧对应的第一阈值Thresh2 1判断前一帧为偏反相信号的判断结果是不准确的”。基于此,由于前一帧对应的第二阈值Thresh2 2并非用于判定前一帧是否为偏反相信号的,而是用于判断前一帧是否为不相关信号或偏正相信号的,因此,无需对初始第二阈值Thresh0 2进行更新,直接将初始第二阈值Thresh0 2确定为当前帧对应的第二阈值Thresh2即可,即:使得Thresh2=Thresh0 2
此外,关于上述的第一去相关处理方式、第一互相关系数、第二互相关系数的相关介绍可以参考上述实施例描述,本公开实施例在此不做赘述
步骤504、基于当前帧对应的第一阈值Thresh1和第二阈值Thresh2对当前帧进行去相关处理。
其中,关于步骤504的相关介绍可以参考上述实施例描述,本公开实施例在此不做赘述。
综上所述,在本公开实施例提供的立体声音频信号处理方法之中,会先确定立体声音频信号当前帧的初始第一阈值Thresh0 1和初始第二阈值Thresh0 2,其中Thresh0 1∈(-1,0),Thresh0 2∈(0,1);之后,会确定偏移值Delta;以及,会基于立体声音频信号前一帧的去相关处理方式、偏移值Delta、当前帧的初始第一阈值Thresh0 1、当前帧的初始第二阈值Thresh0 2,确定立体声音频信号当前帧对应的第一阈值Thresh1和第二阈值Thresh2;以便后续可以基于当前帧对应的第一阈值Thresh1和第二阈值Thresh2对当前帧进行去相关处理。由此可知,本公开实施例中,会基于前一帧的去相关处理方式来对当前帧对应的第一阈值Thresh1和第二阈值Thresh2进行实时自适应更新,从而可以确保每一帧的相关性确定的准确性,进而可以基于每一帧的相关性准确选择出最优的去相关处理方式,提升了编码压缩率。
图6为本公开实施例所提供的一种立体声音频信号处理方法的流程示意图,该方法由编码设备执行,如图6所示,该立体声音频信号处理方法可以包括以下步骤:
步骤601、确定立体声音频信号当前帧的初始第一阈值Thresh0 1和初始第二阈值Thresh0 2
步骤602、确定偏移值Delta。
其中,关于步骤601-602的相关介绍可以参考上述实施例描述,本公开实施例在此不做赘述。
步骤603、响应于立体声音频信号前一帧的去相关处理方式为:未进行去相关处理,同时未进行去相关处理的缘由为:前一帧左声道信号和右声道信号的第一互相关系数大于前一帧对应的第二阈值Thresh2 2,且第一互相关系数小于等于第三互相关系数,基于公式五确定当前帧对应的第一阈值Thresh1和第二阈值Thresh2。
其中,在本公开的一个实施例之中,公式五为:
Figure PCTCN2021135514-appb-000016
Thresh1和Thresh2分别为当前帧的第一阈值、第二阈值,Thresh0 1和Thresh0 2分别为当前帧的第一初始阈值、当前帧的第二初始阈值,Delta为偏移值,且Delta∈(0,|Thresh0 2|)(也即是,本实施例中偏移值具体为上述实施例中的用于对当前帧的初始第二阈值Thresh0 2进行更新的偏移值Delta2)。
以及,在本公开的一个实施例之中,第三互相关系数为采用第二去相关处理方式对前一帧信号进行 第二去相关处理获得的去相关处理后的信号的互相关系数。
以下,对采用上述公式五确定当前帧对应的第一阈值Thresh1和第二阈值Thresh2的原理进行详细解释:若第一互相关系数小于等于第三互相关系数时,则说明该前一帧没有进行去相关处理,也即是,说明“基于前一帧对应的第二阈值Thresh2 2判断前一帧是偏正相信号从而进行第二去相关处理的判断结果是不准确的”,换言之,说明前一帧对应的第二阈值Thresh2 2取值不准确,基于该第二阈值Thresh2 2识别出的信号经过第二去相关处理后不能达到去相关的目的,认为该第二阈值Thresh2 1小于是否需要进行去相关处理的阈值临界点,也就是说,该第二阈值Thresh2 2需要增大,使得增大后的阈值识别出的偏正相信号经过第二去相关处理后,第一互相关系数大于第三互相关系数,也即使得去相关处理能达到目的。
其中,在本公开的一个实施例之中,基于前述描述可知,若前一帧左声道信号和右声道信号的第一互相关系数大于前一帧对应的第二阈值Thresh2 2,且第一互相关系数小于等于第三互相关系数,则说明认为该第二阈值Thresh2 2小于是否需要进行去相关处理的阈值临界点,并且由于该前一帧对应的第二阈值Thresh2 2是基于初始第二阈值Thresh0 2确定的,则可以得出,该初始第二阈值Thresh0 2可能也小于是否需要进行去相关处理的阈值临界点。此时可以基于偏移值Delta对该初始第二阈值Thresh0 2进行更新得到当前帧对应的第二阈值Thresh2,即:使得Thresh2=Thresh0 2+Delta,通过第二阈值Thresh2对当前帧信号进行去相关处理,使得去相关处理结果更优。
进一步地,在本公开的一个实施例之中,由于“前一帧左声道信号和右声道信号的第一互相关系数大于前一帧对应的第二阈值Thresh2 2,且第一互相关系数小于等于第三互相关系数”说明的是“基于前一帧对应的第二阈值Thresh2 2判断前一帧为偏正相信号的判断结果是不准确的”。基于此,由于前一帧对应的第一阈值Thresh2 1并非用于判定前一帧是否为偏正相信号的,而是用于判断前一帧是否为不相关信号或偏反相信号的,因此,无需对初始第一阈值Thresh2 1进行更新,直接将初始第一阈值Thresh2 1确定为当前帧对应的第一阈值Thresh1即可,即:使得Thresh1=Thresh0 1
此外,关于上述的第二去相关处理方式、第一互相关系数、第三互相关系数的相关介绍可以参考上述实施例描述,本公开实施例在此不做赘述
步骤604、基于当前帧对应的第一阈值Thresh1和第二阈值Thresh2对当前帧进行去相关处理。
其中,关于步骤604的相关介绍可以参考上述实施例描述,本公开实施例在此不做赘述。
综上所述,在本公开实施例提供的立体声音频信号处理方法之中,会先确定立体声音频信号当前帧的初始第一阈值Thresh0 1和初始第二阈值Thresh0 2,其中Thresh0 1∈(-1,0),Thresh0 2∈(0,1);之后,会确定偏移值Delta;以及,会基于立体声音频信号前一帧的去相关处理方式、偏移值Delta、当前帧的初始第一阈值Thresh0 1、当前帧的初始第二阈值Thresh0 2,确定立体声音频信号当前帧对应的第一阈值Thresh1和第二阈值Thresh2;以便后续可以基于当前帧对应的第一阈值Thresh1和第二阈值Thresh2对当前帧进行去相关处理。由此可知,本公开实施例中,会基于前一帧的去相关处理方式来对当前帧对应的第一阈值Thresh1和第二阈值Thresh2进行实时自适应更新,从而可以确保每一帧的相关性确定的准确性,进而可以基于每一帧的相关性准确选择出最优的去相关处理方式,提升了编码压缩率。
图7为本公开实施例所提供的一种立体声音频信号处理方法的流程示意图,该方法由编码设备执行,如图7所示,该立体声音频信号处理方法可以包括以下步骤:
步骤701、确定立体声音频信号第一帧的初始第一阈值Thresh0 1和初始第二阈值Thresh0 2
步骤702、基于公式十确定第一帧对应的第一阈值Thresh3 1和第二阈值Thresh3 2
其中,在本公开的一个实施例之中,公式十为:
Figure PCTCN2021135514-appb-000017
Thresh3 1和Thresh3 2分别为所述第一帧的第一阈值、所述第一帧的第二阈值,Thresh0 1和Thresh0 2分别为所述第一帧的第一初始阈值、所述第一帧的第二初始阈值。
步骤703、确定立体声音频信号当前帧的初始第一阈值Thresh0 1和初始第二阈值Thresh0 2,其中 Thresh0 1∈(-1,0),Thresh0 2∈(0,1)。
步骤704、确定偏移值Delta。
步骤705、基于立体声音频信号前一帧的去相关处理方式、偏移值Delta、当前帧的初始第一阈值Thresh0 1、当前帧的初始第二阈值Thresh0 2,确定立体声音频信号当前帧对应的第一阈值Thresh1和第二阈值Thresh2。
步骤706、基于当前帧对应的第一阈值Thresh1和第二阈值Thresh2对当前帧进行去相关处理。
综上所述,在本公开实施例提供的立体声音频信号处理方法之中,会先确定立体声音频信号当前帧的初始第一阈值Thresh0 1和初始第二阈值Thresh0 2,其中Thresh0 1∈(-1,0),Thresh0 2∈(0,1);之后,会确定偏移值Delta;以及,会基于立体声音频信号前一帧的去相关处理方式、偏移值Delta、当前帧的初始第一阈值Thresh0 1、当前帧的初始第二阈值Thresh0 2,确定立体声音频信号当前帧对应的第一阈值Thresh1和第二阈值Thresh2;以便后续可以基于当前帧对应的第一阈值Thresh1和第二阈值Thresh2对当前帧进行去相关处理。由此可知,本公开实施例中,会基于前一帧的去相关处理方式来对当前帧对应的第一阈值Thresh1和第二阈值Thresh2进行实时自适应更新,从而可以确保每一帧的相关性确定的准确性,进而可以基于每一帧的相关性准确选择出最优的去相关处理方式,提升了编码压缩率。
图8为本公开实施例所提供的一种立体声音频信号处理装置的结构示意图,如图8所示,装置800可以包括:
确定模块801,用于确定所述立体声音频信号当前帧的初始第一阈值Thresh0 1和初始第二阈值Thresh0 2,其中Thresh0 1∈(-1,0),Thresh0 2∈(0,1);
确定模块802,用于确定偏移值Delta;
确定模块803,用于基于所述立体声音频信号前一帧的去相关处理方式、所述偏移值Delta、所述当前帧的初始第一阈值Thresh0 1、所述当前帧的初始第二阈值Thresh0 2,确定所述立体声音频信号当前帧对应的第一阈值Thresh1和第二阈值Thresh2;
处理模块804,用于基于所述当前帧对应的第一阈值Thresh1和第二阈值Thresh2对所述当前帧进行去相关处理。
综上所述,在本公开实施例提供的立体声音频信号处理装置之中,会先确定立体声音频信号当前帧的初始第一阈值Thresh0 1和初始第二阈值Thresh0 2,其中Thresh0 1∈(-1,0),Thresh0 2∈(0,1);之后,会确定偏移值Delta;以及,会基于立体声音频信号前一帧的去相关处理方式、偏移值Delta、当前帧的初始第一阈值Thresh0 1、当前帧的初始第二阈值Thresh0 2,确定立体声音频信号当前帧对应的第一阈值Thresh1和第二阈值Thresh2;以便后续可以基于当前帧对应的第一阈值Thresh1和第二阈值Thresh2对当前帧进行去相关处理。由此可知,本公开实施例中,会基于前一帧的去相关处理方式来对当前帧对应的第一阈值Thresh1和第二阈值Thresh2进行实时自适应更新,从而可以确保每一帧的相关性确定的准确性,进而可以基于每一帧的相关性准确选择出最优的去相关处理方式,提升了编码压缩率。
可选的,在本公开的一个实施例之中,基于所述立体声音频信号前一帧的去相关处理方式、所述偏移值Delta、所述当前帧的初始第一阈值Thresh0 1、所述当前帧的初始第二阈值Thresh0 2,确定所述立体声音频信号当前帧对应的第一阈值Thresh1和第二阈值Thresh2,包括:
响应于所述立体声音频信号前一帧的去相关处理方式为:采用第一去相关处理方式进行去相关处理,基于公式一确定所述当前帧对应的第一阈值Thresh1和第二阈值Thresh2,所述公式一为:
Figure PCTCN2021135514-appb-000018
其中,Thresh1和Thresh2分别为所述当前帧的第一阈值、第二阈值,Thresh0 1和Thresh0 2分别为 所述当前帧的第一初始阈值、所述当前帧的第二初始阈值,Delta为所述偏移值,且Delta∈(0,|Thresh0 1|)。
可选的,在本公开的一个实施例之中,所述确定模块803,还用于:
响应于所述立体声音频信号前一帧的去相关处理方式为:采用第二去相关处理方式进行去相关处理,基于公式二确定所述当前帧对应的第一阈值Thresh1和第二阈值Thresh2,所述公式二为:
Figure PCTCN2021135514-appb-000019
其中,Thresh1和Thresh2分别为所述当前帧的第一阈值、第二阈值,Thresh0 1和Thresh0 2分别为所述当前帧的第一初始阈值、所述当前帧的第二初始阈值,Delta为所述偏移值,且Delta∈(0,|Thresh0 2|)。
可选的,在本公开的一个实施例之中,所述确定模块803,还用于:
响应于所述立体声音频信号前一帧的去相关处理方式为:未进行去相关处理,同时未进行去相关处理的缘由为:所述前一帧左声道信号和右声道信号的第一互相关系数大于等于所述前一帧对应的第一阈值Thresh2 1且小于等于所述前一帧对应的第二阈值Thresh2 2,基于公式三确定所述当前帧对应的第一阈值Thresh1和第二阈值Thresh2,所述公式三为:
Figure PCTCN2021135514-appb-000020
其中,Thresh1和Thresh2分别为所述当前帧的第一阈值、第二阈值,Thresh0 1和Thresh0 2分别为所述当前帧的第一初始阈值、所述当前帧的第二初始阈值。
可选的,在本公开的一个实施例之中,所述确定模块803,还用于:
响应于所述立体声音频信号前一帧的去相关处理方式为:未进行去相关处理,同时未进行去相关处理的缘由为:所述前一帧左声道信号和右声道信号的第一互相关系数小于所述前一帧对应的第一阈值Thresh2 1,且所述第一互相关系数大于等于第二互相关系数,其中,所述第二互相关系数为采用第一去相关处理方式对前一帧信号进行第一去相关处理获得的去相关处理后的信号的互相关系数,基于公式四确定所述当前帧对应的第一阈值Thresh1和第二阈值Thresh2,所述公式四为:
Figure PCTCN2021135514-appb-000021
其中,Thresh1和Thresh2分别为所述当前帧的第一阈值、第二阈值,Thresh0 1和Thresh0 2分别为所述当前帧的第一初始阈值、所述当前帧的第二初始阈值,Delta为所述偏移值,且Delta∈(0,|Thresh0 1|)。
可选的,在本公开的一个实施例之中,所述确定模块803,还用于:
响应于所述立体声音频信号前一帧的去相关处理方式为:未进行去相关处理,同时未进行去相关处理的缘由为:所述前一帧左声道信号和右声道信号的第一互相关系数大于所述前一帧对应的第二阈值Thresh2 2,且所述第一互相关系数小于等于第三互相关系数,其中,所述第三互相关系数为采用第二去相关处理方式对前一帧信号进行第二去相关处理获得的去相关处理后的信号的互相关系数,基于公式五确定所述当前帧对应的第一阈值Thresh1和第二阈值Thresh2,所述公式五为:
Figure PCTCN2021135514-appb-000022
其中,Thresh1和Thresh2分别为所述当前帧的第一阈值、第二阈值,Thresh0 1和Thresh0 2分别为 所述当前帧的第一初始阈值、所述当前帧的第二初始阈值,Delta为所述偏移值,且Delta∈(0,|Thresh0 2|)。
可选的,在本公开的一个实施例之中,所述第一去相关处理方式包括第一和差下混处理。
可选的,在本公开的一个实施例之中,所述第一和差下混处理包括:
基于公式六对所述前一帧左声道信号和右声道信号进行处理以得到主声道信号和次声道信号;所述公式六为:
Figure PCTCN2021135514-appb-000023
其中,Mid(n)为前一帧主声道信号,Sid(n)为前一帧次声道信号,L(n)为前一帧左声道信号,R(n)为前一帧右声道信号。
可选的,在本公开的一个实施例之中,所述第二去相关处理方式包括第二和差下混处理。
可选的,在本公开的一个实施例之中,所述第二和差下混处理包括:
基于公式七对所述前一帧左声道信号和右声道信号进行处理以得到主声道信号和次声道信号;所述公式七为:
Figure PCTCN2021135514-appb-000024
其中,Mid(n)为前一帧主声道信号,Sid(n)为前一帧次声道信号,L(n)为前一帧左声道信号,R(n)为前一帧右声道信号。
可选的,在本公开的一个实施例之中,所述装置还用于:
基于公式八确定所述前一帧左声道信号和右声道信号的第一互相关系数;所述公式八为:
Figure PCTCN2021135514-appb-000025
η (LR)为前一帧左声道信号和右声道信号的互相关系数,L(n)为前一帧左声道信号第n个样点,
Figure PCTCN2021135514-appb-000026
为前一帧左声道信号所有样点的平均值,R(n)为前一帧右声道信号第n个样点,
Figure PCTCN2021135514-appb-000027
为前一帧右声道信号所有样点的平均值,N为前一帧左声道信号或者右声道信号样点总数,即为前一帧帧长。
可选的,在本公开的一个实施例之中,所述去相关处理后的信号包括主声道信号和次声道信号;
所述装置,还用于:
基于公式九确定第二互相关系数和第三互相关系数;所述公式九为:
Figure PCTCN2021135514-appb-000028
η (MS)为第二互相关系数或第三互相关系数,Mid(n)为去相关处理后的信号中主声道信号第n个样点,
Figure PCTCN2021135514-appb-000029
为去相关处理后的信号中主声道信号所有样点的平均值,Sid(n)为去相关处理后的信号中次声道信号第n个样点,
Figure PCTCN2021135514-appb-000030
为去相关处理后的信号中次声道信号所有样点的平均值,N为前一帧左声道信号或者右声道信号样点总数,即为前一帧帧长。
可选的,在本公开的一个实施例之中,所述装置还用于:
确定所述立体声音频信号第一帧的初始第一阈值Thresh0 1和初始第二阈值Thresh0 2
基于公式十确定所述第一帧对应的第一阈值Thresh3 1和第二阈值Thresh3 2,所述公式十为:
Figure PCTCN2021135514-appb-000031
其中,Thresh3 1和Thresh3 2分别为所述第一帧的第一阈值、所述第一帧的第二阈值,Thresh0 1和Thresh0 2分别为所述第一帧的第一初始阈值、所述第一帧的第二初始阈值。
图9是本公开一个实施例所提供的一种用户设备UE900的框图。例如,UE900可以是移动电话,计算机,数字广播终端设备,消息收发设备,游戏控制台,平板设备,医疗设备,健身设备,个人数字助理等。
参照图9,UE900可以包括以下至少一个组件:处理组件902,存储器904,电源组件906,多媒体组件908,音频组件910,输入/输出(I/O)的接口912,传感器组件913,以及通信组件916。
处理组件902通常控制UE900的整体操作,诸如与显示,电话呼叫,数据通信,相机操作和记录操作相关联的操作。处理组件902可以包括至少一个处理器920来执行指令,以完成上述的方法的全部或部分步骤。此外,处理组件902可以包括至少一个模块,便于处理组件902和其他组件之间的交互。例如,处理组件902可以包括多媒体模块,以方便多媒体组件908和处理组件902之间的交互。
存储器904被配置为存储各种类型的数据以支持在UE900的操作。这些数据的示例包括用于在UE900上操作的任何应用程序或方法的指令,联系人数据,电话簿数据,消息,图片,视频等。存储器904可以由任何类型的易失性或非易失性存储设备或者它们的组合实现,如静态随机存取存储器(SRAM),电可擦除可编程只读存储器(EEPROM),可擦除可编程只读存储器(EPROM),可编程只读存储器(PROM),只读存储器(ROM),磁存储器,快闪存储器,磁盘或光盘。
电源组件906为UE900的各种组件提供电力。电源组件906可以包括电源管理系统,至少一个电源,及其他与为UE900生成、管理和分配电力相关联的组件。
多媒体组件908包括在所述UE900和用户之间的提供一个输出接口的屏幕。在一些实施例中,屏幕可以包括液晶显示器(LCD)和触摸面板(TP)。如果屏幕包括触摸面板,屏幕可以被实现为触摸屏,以接收来自用户的输入信号。触摸面板包括至少一个触摸传感器以感测触摸、滑动和触摸面板上的手势。所述触摸传感器可以不仅感测触摸或滑动动作的边界,而且还检测与所述触摸或滑动操作相关的唤醒时间和压力。在一些实施例中,多媒体组件908包括一个前置摄像头和/或后置摄像头。当UE900处于操作模式,如拍摄模式或视频模式时,前置摄像头和/或后置摄像头可以接收外部的多媒体数据。每个前置摄像头和后置摄像头可以是一个固定的光学透镜系统或具有焦距和光学变焦能力。
音频组件910被配置为输出和/或输入音频信号。例如,音频组件910包括一个麦克风(MIC),当UE900处于操作模式,如呼叫模式、记录模式和语音识别模式时,麦克风被配置为接收外部音频信号。 所接收的音频信号可以被进一步存储在存储器904或经由通信组件916发送。在一些实施例中,音频组件910还包括一个扬声器,用于输出音频信号。
I/O接口912为处理组件902和外围接口模块之间提供接口,上述外围接口模块可以是键盘,点击轮,按钮等。这些按钮可包括但不限于:主页按钮、音量按钮、启动按钮和锁定按钮。
传感器组件913包括至少一个传感器,用于为UE900提供各个方面的状态评估。例如,传感器组件913可以检测到设备900的打开/关闭状态,组件的相对定位,例如所述组件为UE900的显示器和小键盘,传感器组件913还可以检测UE900或UE900一个组件的位置改变,用户与UE900接触的存在或不存在,UE900方位或加速/减速和UE900的温度变化。传感器组件913可以包括接近传感器,被配置用来在没有任何的物理接触时检测附近物体的存在。传感器组件913还可以包括光传感器,如CMOS或CCD图像传感器,用于在成像应用中使用。在一些实施例中,该传感器组件913还可以包括加速度传感器,陀螺仪传感器,磁传感器,压力传感器或温度传感器。
通信组件916被配置为便于UE900和其他设备之间有线或无线方式的通信。UE900可以接入基于通信标准的无线网络,如WiFi,2G或3G,或它们的组合。在一个示例性实施例中,通信组件916经由广播信道接收来自外部广播管理系统的广播信号或广播相关信息。在一个示例性实施例中,所述通信组件916还包括近场通信(NFC)模块,以促进短程通信。例如,在NFC模块可基于射频识别(RFID)技术,红外数据协会(IrDA)技术,超宽带(UWB)技术,蓝牙(BT)技术和其他技术来实现。
在示例性实施例中,UE900可以被至少一个应用专用集成电路(ASIC)、数字信号处理器(DSP)、数字信号处理设备(DSPD)、可编程逻辑器件(PLD)、现场可编程门阵列(FPGA)、控制器、微控制器、微处理器或其他电子元件实现,用于执行上述方法。
图10是本公开实施例所提供的一种网络侧设备1000的框图。例如,网络侧设备1000可以被提供为一网络侧设备。参照图10,网络侧设备1000包括处理组件1011,其进一步包括至少一个处理器,以及由存储器1032所代表的存储器资源,用于存储可由处理组件1022的执行的指令,例如应用程序。存储器1032中存储的应用程序可以包括一个或一个以上的每一个对应于一组指令的模块。此外,处理组件1010被配置为执行指令,以执行上述方法前述应用在所述网络侧设备的任意方法,例如,如图1所示方法。
网络侧设备1000还可以包括一个电源组件1026被配置为执行网络侧设备1000的电源管理,一个有线或无线网络接口1050被配置为将网络侧设备1000连接到网络,和一个输入输出(I/O)接口1058。网络侧设备1000可以操作基于存储在存储器1032的操作系统,例如Windows Server TM,Mac OS XTM,Unix TM,Linux TM,Free BSDTM或类似。
上述本公开提供的实施例中,分别从网络侧设备、UE的角度对本公开实施例提供的方法进行了介绍。为了实现上述本公开实施例提供的方法中的各功能,网络侧设备和UE可以包括硬件结构、软件模块,以硬件结构、软件模块、或硬件结构加软件模块的形式来实现上述各功能。上述各功能中的某个功能可以以硬件结构、软件模块、或者硬件结构加软件模块的方式来执行。
上述本公开提供的实施例中,分别从网络侧设备、UE的角度对本公开实施例提供的方法进行了介绍。为了实现上述本公开实施例提供的方法中的各功能,网络侧设备和UE可以包括硬件结构、软件模块,以硬件结构、软件模块、或硬件结构加软件模块的形式来实现上述各功能。上述各功能中的某个功能可以以硬件结构、软件模块、或者硬件结构加软件模块的方式来执行。
本公开实施例提供的一种通信装置。通信装置可包括收发模块和处理模块。收发模块可包括发送模块和/或接收模块,发送模块用于实现发送功能,接收模块用于实现接收功能,收发模块可以实现发送功能和/或接收功能。
通信装置可以是终端设备(如前述方法实施例中的终端设备),也可以是终端设备中的装置,还可以是能够与终端设备匹配使用的装置。或者,通信装置可以是网络设备,也可以是网络设备中的装置,还可以是能够与网络设备匹配使用的装置。
本公开实施例提供的另一种通信装置。通信装置可以是网络设备,也可以是终端设备(如前述方法实施例中的终端设备),也可以是支持网络设备实现上述方法的芯片、芯片系统、或处理器等,还可以 是支持终端设备实现上述方法的芯片、芯片系统、或处理器等。该装置可用于实现上述方法实施例中描述的方法,具体可以参见上述方法实施例中的说明。
通信装置可以包括一个或多个处理器。处理器可以是通用处理器或者专用处理器等。例如可以是基带处理器或中央处理器。基带处理器可以用于对通信协议以及通信数据进行处理,中央处理器可以用于对通信装置(如,网络侧设备、基带芯片,终端设备、终端设备芯片,DU或CU等)进行控制,执行计算机程序,处理计算机程序的数据。
可选的,通信装置中还可以包括一个或多个存储器,其上可以存有计算机程序,处理器执行所述计算机程序,以使得通信装置执行上述方法实施例中描述的方法。可选的,所述存储器中还可以存储有数据。通信装置和存储器可以单独设置,也可以集成在一起。
可选的,通信装置还可以包括收发器、天线。收发器可以称为收发单元、收发机、或收发电路等,用于实现收发功能。收发器可以包括接收器和发送器,接收器可以称为接收机或接收电路等,用于实现接收功能;发送器可以称为发送机或发送电路等,用于实现发送功能。
可选的,通信装置中还可以包括一个或多个接口电路。接口电路用于接收代码指令并传输至处理器。处理器运行所述代码指令以使通信装置执行上述方法实施例中描述的方法。
通信装置为终端设备(如前述方法实施例中的终端设备):处理器用于执行图1-图4a任一所示的方法。
通信装置为网络设备:收发器用于执行图5-图7任一所示的方法。
在一种实现方式中,处理器中可以包括用于实现接收和发送功能的收发器。例如该收发器可以是收发电路,或者是接口,或者是接口电路。用于实现接收和发送功能的收发电路、接口或接口电路可以是分开的,也可以集成在一起。上述收发电路、接口或接口电路可以用于代码/数据的读写,或者,上述收发电路、接口或接口电路可以用于信号的传输或传递。
在一种实现方式中,处理器可以存有计算机程序,计算机程序在处理器上运行,可使得通信装置执行上述方法实施例中描述的方法。计算机程序可能固化在处理器中,该种情况下,处理器可能由硬件实现。
在一种实现方式中,通信装置可以包括电路,所述电路可以实现前述方法实施例中发送或接收或者通信的功能。本公开中描述的处理器和收发器可实现在集成电路(integrated circuit,IC)、模拟IC、射频集成电路RFIC、混合信号IC、专用集成电路(application specific integrated circuit,ASIC)、印刷电路板(printed circuit board,PCB)、电子设备等上。该处理器和收发器也可以用各种IC工艺技术来制造,例如互补金属氧化物半导体(complementary metal oxide semiconductor,CMOS)、N型金属氧化物半导体(nMetal-oxide-semiconductor,NMOS)、P型金属氧化物半导体(positive channel metal oxide semiconductor,PMOS)、双极结型晶体管(bipolar junction transistor,BJT)、双极CMOS(BiCMOS)、硅锗(SiGe)、砷化镓(Gas)等。
以上实施例描述中的通信装置可以是网络设备或者终端设备(如前述方法实施例中的终端设备),但本公开中描述的通信装置的范围并不限于此,而且通信装置的结构可以不受的限制。通信装置可以是独立的设备或者可以是较大设备的一部分。例如所述通信装置可以是:
(1)独立的集成电路IC,或芯片,或,芯片系统或子系统;
(2)具有一个或多个IC的集合,可选的,该IC集合也可以包括用于存储数据,计算机程序的存储部件;
(3)ASIC,例如调制解调器(Modem);
(4)可嵌入在其他设备内的模块;
(5)接收机、终端设备、智能终端设备、蜂窝电话、无线设备、手持机、移动单元、车载设备、网络设备、云设备、人工智能设备等等;
(6)其他等等。
对于通信装置可以是芯片或芯片系统的情况,芯片包括处理器和接口。其中,处理器的数量可以是一个或多个,接口的数量可以是多个。
可选的,芯片还包括存储器,存储器用于存储必要的计算机程序和数据。
本领域技术人员还可以了解到本公开实施例列出的各种说明性逻辑块(illustrative logical block)和步骤(step)可以通过电子硬件、电脑软件,或两者的结合进行实现。这样的功能是通过硬件还是软件来实现取决于特定的应用和整个系统的设计要求。本领域技术人员可以对于每种特定的应用,可以使用各种方法实现所述的功能,但这种实现不应被理解为超出本公开实施例保护的范围。
本公开实施例还提供一种确定侧链路时长的系统,该系统包括前述实施例中作为终端设备(如前述方法实施例中的第一终端设备)的通信装置和作为网络设备的通信装置,或者,该系统包括前述实施例中作为终端设备(如前述方法实施例中的第一终端设备)的通信装置和作为网络设备的通信装置。
本公开还提供一种可读存储介质,其上存储有指令,该指令被计算机执行时实现上述任一方法实施例的功能。
本公开还提供一种计算机程序产品,该计算机程序产品被计算机执行时实现上述任一方法实施例的功能。
在上述实施例中,可以全部或部分地通过软件、硬件、固件或者其任意组合来实现。当使用软件实现时,可以全部或部分地以计算机程序产品的形式实现。所述计算机程序产品包括一个或多个计算机程序。在计算机上加载和执行所述计算机程序时,全部或部分地产生按照本公开实施例所述的流程或功能。所述计算机可以是通用计算机、专用计算机、计算机网络、或者其他可编程装置。所述计算机程序可以存储在计算机可读存储介质中,或者从一个计算机可读存储介质向另一个计算机可读存储介质传输,例如,所述计算机程序可以从一个网站站点、计算机、服务器或数据中心通过有线(例如同轴电缆、光纤、数字用户线(digital subscriber line,DSL))或无线(例如红外、无线、微波等)方式向另一个网站站点、计算机、服务器或数据中心进行传输。所述计算机可读存储介质可以是计算机能够存取的任何可用介质或者是包含一个或多个可用介质集成的服务器、数据中心等数据存储设备。所述可用介质可以是磁性介质(例如,软盘、硬盘、磁带)、光介质(例如,高密度数字视频光盘(digital video disc,DVD))、或者半导体介质(例如,固态硬盘(solid state disk,SSD))等。
本领域普通技术人员可以理解:本公开中涉及的第一、第二等各种数字编号仅为描述方便进行的区分,并不用来限制本公开实施例的范围,也表示先后顺序。
本公开中的至少一个还可以描述为一个或多个,多个可以是两个、三个、四个或者更多个,本公开不做限制。在本公开实施例中,对于一种技术特征,通过“第一”、“第二”、“第三”、“A”、“B”、“C”和“D”等区分该种技术特征中的技术特征,该“第一”、“第二”、“第三”、“A”、“B”、“C”和“D”描述的技术特征间无先后顺序或者大小顺序。
本领域技术人员在考虑说明书及实践这里公开的发明后,将容易想到本发明的其它实施方案。本公开旨在涵盖本发明的任何变型、用途或者适应性变化,这些变型、用途或者适应性变化遵循本发明的一般性原理并包括本公开未公开的本技术领域中的公知常识或惯用技术手段。说明书和实施例仅被视为示例性的,本公开的真正范围和精神由下面的权利要求指出。
应当理解的是,本公开并不局限于上面已经描述并在附图中示出的精确结构,并且可以在不脱离其范围进行各种修改和改变。本公开的范围仅由所附的权利要求来限制。

Claims (17)

  1. 一种立体声音频信号处理方法,其特征在于,应用于编码设备,包括:
    确定所述立体声音频信号当前帧的初始第一阈值Thresh0 1和初始第二阈值Thresh0 2,其中Thresh0 1∈(-1,0),Thresh0 2∈(0,1);
    确定偏移值Delta;
    基于所述立体声音频信号前一帧的去相关处理方式、所述偏移值Delta、所述当前帧的初始第一阈值Thresh0 1、所述当前帧的初始第二阈值Thresh0 2,确定所述立体声音频信号当前帧对应的第一阈值Thresh1和第二阈值Thresh2;
    基于所述当前帧对应的第一阈值Thresh1和第二阈值Thresh2对所述当前帧进行去相关处理。
  2. 如权利要求1所述的方法,其特征在于,基于所述立体声音频信号前一帧的去相关处理方式、所述偏移值Delta、所述当前帧的初始第一阈值Thresh0 1、所述当前帧的初始第二阈值Thresh0 2,确定所述立体声音频信号当前帧对应的第一阈值Thresh1和第二阈值Thresh2,包括:
    响应于所述立体声音频信号前一帧的去相关处理方式为:采用第一去相关处理方式进行去相关处理,基于公式一确定所述当前帧对应的第一阈值Thresh1和第二阈值Thresh2,所述公式一为:
    Figure PCTCN2021135514-appb-100001
    其中,Thresh1和Thresh2分别为所述当前帧的第一阈值、第二阈值,Thresh0 1和Thresh0 2分别为所述当前帧的第一初始阈值、所述当前帧的第二初始阈值,Delta为所述偏移值,且Delta∈(0,|Thresh0 1|)。
  3. 如权利要求1所述的方法,其特征在于,基于所述立体声音频信号前一帧的去相关处理方式、所述偏移值Delta、所述当前帧的初始第一阈值Thresh0 1、所述当前帧的初始第二阈值Thresh0 2,确定所述立体声音频信号当前帧对应的第一阈值Thresh1和第二阈值Thresh2,包括:
    响应于所述立体声音频信号前一帧的去相关处理方式为:采用第二去相关处理方式进行去相关处理,基于公式二确定所述当前帧对应的第一阈值Thresh1和第二阈值Thresh2,所述公式二为:
    Figure PCTCN2021135514-appb-100002
    其中,Thresh1和Thresh2分别为所述当前帧的第一阈值、第二阈值,Thresh0 1和Thresh0 2分别为所述当前帧的第一初始阈值、所述当前帧的第二初始阈值,Delta为所述偏移值,且Delta∈(0,|Thresh0 2|)。
  4. 如权利要求1所述的方法,其特征在于,基于所述立体声音频信号前一帧的去相关处理方式、所述偏移值Delta、所述当前帧的初始第一阈值Thresh0 1、所述当前帧的初始第二阈值Thresh0 2,确定所述立体声音频信号当前帧对应的第一阈值Thresh1和第二阈值Thresh2,包括:
    响应于所述立体声音频信号前一帧的去相关处理方式为:未进行去相关处理,同时未进行去相关处理的缘由为:所述前一帧左声道信号和右声道信号的第一互相关系数大于等于所述前一帧对应的第一阈值Thresh2 1且小于等于所述前一帧对应的第二阈值Thresh2 2,基于公式三确定所述当前帧对应的第一阈值Thresh1和第二阈值Thresh2,所述公式三为:
    Figure PCTCN2021135514-appb-100003
    其中,Thresh1和Thresh2分别为所述当前帧的第一阈值、第二阈值,Thresh0 1和Thresh0 2分别为 所述当前帧的第一初始阈值、所述当前帧的第二初始阈值。
  5. 如权利要求1所述的方法,其特征在于,基于所述立体声音频信号前一帧的去相关处理方式、所述偏移值Delta、所述当前帧的初始第一阈值Thresh0 1、所述当前帧的初始第二阈值Thresh0 2,确定所述立体声音频信号当前帧对应的第一阈值Thresh1和第二阈值Thresh2,包括:
    响应于所述立体声音频信号前一帧的去相关处理方式为:未进行去相关处理,同时未进行去相关处理的缘由为:所述前一帧左声道信号和右声道信号的第一互相关系数小于所述前一帧对应的第一阈值Thresh2 1,且所述第一互相关系数大于等于第二互相关系数,其中,所述第二互相关系数为采用第一去相关处理方式对前一帧信号进行第一去相关处理获得的去相关处理后的信号的互相关系数,基于公式四确定所述当前帧对应的第一阈值Thresh1和第二阈值Thresh2,所述公式四为:
    Figure PCTCN2021135514-appb-100004
    其中,Thresh1和Thresh2分别为所述当前帧的第一阈值、第二阈值,Thresh0 1和Thresh0 2分别为所述当前帧的第一初始阈值、所述当前帧的第二初始阈值,Delta为所述偏移值,且Delta∈(0,|Thresh0 1|)。
  6. 如权利要求1所述的方法,其特征在于,基于所述立体声音频信号前一帧的去相关处理方式、所述偏移值Delta、所述当前帧的初始第一阈值Thresh0 1、所述当前帧的初始第二阈值Thresh0 2,确定所述立体声音频信号当前帧对应的第一阈值Thresh1和第二阈值Thresh2,包括:
    响应于所述立体声音频信号前一帧的去相关处理方式为:未进行去相关处理,同时未进行去相关处理的缘由为:所述前一帧左声道信号和右声道信号的第一互相关系数大于所述前一帧对应的第二阈值Thresh2 2,且所述第一互相关系数小于等于第三互相关系数,其中,所述第三互相关系数为采用第二去相关处理方式对前一帧信号进行第二去相关处理获得的去相关处理后的信号的互相关系数,基于公式五确定所述当前帧对应的第一阈值Thresh1和第二阈值Thresh2,所述公式五为:
    Figure PCTCN2021135514-appb-100005
    其中,Thresh1和Thresh2分别为所述当前帧的第一阈值、第二阈值,Thresh0 1和Thresh0 2分别为所述当前帧的第一初始阈值、所述当前帧的第二初始阈值,Delta为所述偏移值,且Delta∈(0,|Thresh0 2|)。
  7. 如权利要求2或5所述的方法,其特征在于,所述第一去相关处理方式包括第一和差下混处理。
  8. 如权利要求7所述的方法,其特征在于,所述第一和差下混处理包括:
    基于公式六对所述前一帧左声道信号和右声道信号进行处理以得到主声道信号和次声道信号;所述公式六为:
    Figure PCTCN2021135514-appb-100006
    其中,Mid(n)为前一帧主声道信号,Sid(n)为前一帧次声道信号,L(n)为前一帧左声道信号,R(n)为前一帧右声道信号。
  9. 如权利要求3或6所述的方法,其特征在于,所述第二去相关处理方式包括第二和差下混处理。
  10. 如权利要求9所述的方法,其特征在于,所述第二和差下混处理包括:
    基于公式七对所述前一帧左声道信号和右声道信号进行处理以得到主声道信号和次声道信号;所述公式七为:
    Figure PCTCN2021135514-appb-100007
    其中,Mid(n)为前一帧主声道信号,Sid(n)为前一帧次声道信号,L(n)为前一帧左声道信号,R(n)为前一帧右声道信号。
  11. 如权利要求4-6任一所述的方法,其特征在于,所述第一互相关系数的确定方法,包括:
    基于公式八确定所述前一帧左声道信号和右声道信号的第一互相关系数;所述公式八为:
    Figure PCTCN2021135514-appb-100008
    η (LR)为前一帧左声道信号和右声道信号的互相关系数,L(n)为前一帧左声道信号第n个样点,
    Figure PCTCN2021135514-appb-100009
    为前一帧左声道信号所有样点的平均值,R(n)为前一帧右声道信号第n个样点,
    Figure PCTCN2021135514-appb-100010
    为前一帧右声道信号所有样点的平均值,N为前一帧左声道信号或者右声道信号样点总数,即为前一帧帧长。
  12. 如权利要求5或6任一所述的方法,其特征在于,所述去相关处理后的信号包括主声道信号和次声道信号;
    计算所述去相关处理后的信号的第二互相关系数和第三互相关系数,包括:
    基于公式九确定第二互相关系数和第三互相关系数;所述公式九为:
    Figure PCTCN2021135514-appb-100011
    η (MS)为第二互相关系数或第三互相关系数,Mid(n)为去相关处理后的信号中主声道信号第n个样点,
    Figure PCTCN2021135514-appb-100012
    为去相关处理后的信号中主声道信号所有样点的平均值,Sid(n)为去相关处理后的信号中次声道信号第n个样点,
    Figure PCTCN2021135514-appb-100013
    为去相关处理后的信号中次声道信号所有样点的平均值,N为前一帧左声道信号或者右声道信号样点总数,即为前一帧帧长。
  13. 如权利要求1所述的方法,其特征在于,所述方法还包括:
    确定所述立体声音频信号第一帧的初始第一阈值Thresh0 1和初始第二阈值Thresh0 2
    基于公式十确定所述第一帧对应的第一阈值Thresh3 1和第二阈值Thresh3 2,所述公式十为:
    Figure PCTCN2021135514-appb-100014
    其中,Thresh3 1和Thresh3 2分别为所述第一帧的第一阈值、所述第一帧的第二阈值,Thresh0 1和Thresh0 2分别为所述第一帧的第一初始阈值、所述第一帧的第二初始阈值。
  14. 一种立体声音频信号处理装置,其特征在于,包括:
    确定模块,用于确定所述立体声音频信号当前帧的初始第一阈值Thresh0 1和初始第二阈值Thresh0 2, 其中Thresh0 1∈(-1,0),Thresh0 2∈(0,1);
    确定模块,用于确定偏移值Delta;
    确定模块,用于基于所述立体声音频信号前一帧的去相关处理方式、所述偏移值Delta、所述当前帧的初始第一阈值Thresh0 1、所述当前帧的初始第二阈值Thresh0 2,确定所述立体声音频信号当前帧对应的第一阈值Thresh1和第二阈值Thresh2;
    处理模块,用于基于所述当前帧对应的第一阈值Thresh1和第二阈值Thresh2对所述当前帧进行去相关处理。
  15. 一种通信装置,其特征在于,所述装置包括处理器和存储器,所述存储器中存储有计算机程序,所述处理器执行所述存储器中存储的计算机程序,以使所述装置执行如权利要求1至13中任一项所述的方法。
  16. 一种通信装置,其特征在于,包括:处理器和接口电路;
    所述接口电路,用于接收代码指令并传输至所述处理器;
    所述处理器,用于运行所述代码指令以执行如权利要求1至13中任一项所述的方法。
  17. 一种计算机可读存储介质,用于存储有指令,当所述指令被执行时,使如权利要求1至13中任一项所述的方法被实现。
PCT/CN2021/135514 2021-12-03 2021-12-03 一种立体声音频信号处理方法及设备/存储介质/装置 WO2023097686A1 (zh)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN202180004514.2A CN114365509B (zh) 2021-12-03 2021-12-03 一种立体声音频信号处理方法及设备/存储介质/装置
PCT/CN2021/135514 WO2023097686A1 (zh) 2021-12-03 2021-12-03 一种立体声音频信号处理方法及设备/存储介质/装置

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2021/135514 WO2023097686A1 (zh) 2021-12-03 2021-12-03 一种立体声音频信号处理方法及设备/存储介质/装置

Publications (2)

Publication Number Publication Date
WO2023097686A1 true WO2023097686A1 (zh) 2023-06-08
WO2023097686A9 WO2023097686A9 (zh) 2024-06-13

Family

ID=81105445

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/135514 WO2023097686A1 (zh) 2021-12-03 2021-12-03 一种立体声音频信号处理方法及设备/存储介质/装置

Country Status (2)

Country Link
CN (1) CN114365509B (zh)
WO (1) WO2023097686A1 (zh)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104364842A (zh) * 2012-04-18 2015-02-18 诺基亚公司 立体声音频信号编码器
CN109215668A (zh) * 2017-06-30 2019-01-15 华为技术有限公司 一种声道间相位差参数的编码方法及装置
CN109389984A (zh) * 2017-08-10 2019-02-26 华为技术有限公司 时域立体声编解码方法和相关产品
CN111128210A (zh) * 2018-10-30 2020-05-08 哈曼贝克自动系统股份有限公司 具有声学回声消除的音频信号处理
US20200357417A1 (en) * 2017-09-25 2020-11-12 Panasonic Intellectual Property Corporation Of America Encoder and encoding method

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DK2736271T3 (da) * 2012-11-27 2019-09-16 Oticon As Fremgangsmåde til styring af en opdateringsalgoritme for et adaptivt feedbackestimeringssystem og en de-korrelationsenhed
US9143862B2 (en) * 2012-12-17 2015-09-22 Microsoft Corporation Correlation based filter adaptation
TWI618050B (zh) * 2013-02-14 2018-03-11 杜比實驗室特許公司 用於音訊處理系統中之訊號去相關的方法及設備
CN104217726A (zh) * 2014-09-01 2014-12-17 东莞中山大学研究院 一种无损音频压缩编码方法及其解码方法
CN107731238B (zh) * 2016-08-10 2021-07-16 华为技术有限公司 多声道信号的编码方法和编码器
CN110556118B (zh) * 2018-05-31 2022-05-10 华为技术有限公司 立体声信号的编码方法和装置
CN112053669B (zh) * 2020-08-27 2023-10-27 海信视像科技股份有限公司 一种人声消除方法、装置、设备及介质

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104364842A (zh) * 2012-04-18 2015-02-18 诺基亚公司 立体声音频信号编码器
CN109215668A (zh) * 2017-06-30 2019-01-15 华为技术有限公司 一种声道间相位差参数的编码方法及装置
CN109389984A (zh) * 2017-08-10 2019-02-26 华为技术有限公司 时域立体声编解码方法和相关产品
US20200357417A1 (en) * 2017-09-25 2020-11-12 Panasonic Intellectual Property Corporation Of America Encoder and encoding method
CN111128210A (zh) * 2018-10-30 2020-05-08 哈曼贝克自动系统股份有限公司 具有声学回声消除的音频信号处理

Also Published As

Publication number Publication date
CN114365509A (zh) 2022-04-15
CN114365509B (zh) 2024-03-01

Similar Documents

Publication Publication Date Title
CN113597779B (zh) 信息指示方法、装置、用户设备、基站及存储介质
WO2022257127A1 (zh) 时域窗口确定方法、装置、用户设备、基站及存储介质
WO2023097686A1 (zh) 一种立体声音频信号处理方法及设备/存储介质/装置
CN116368460A (zh) 音频处理方法、装置
WO2023097686A9 (zh) 一种立体声音频信号处理方法及设备/存储介质/装置
WO2023092505A1 (zh) 一种立体声音频信号处理方法、装置、编码设备、解码设备及存储介质
WO2023028849A1 (zh) 参考信号测量方法、装置、用户设备、网络侧设备及存储介质
WO2023130283A1 (zh) 一种映射方式确定方法/装置/设备及存储介质
WO2023193148A1 (zh) 音频回放方法/装置/设备及存储介质
WO2023092602A1 (zh) 一种预编码方法、装置、用户设备、ris阵列、基站及存储介质
WO2023108435A1 (zh) 一种预编码方法及设备/存储介质/装置
US20110136481A1 (en) Apparatus and method for automatically changing telephony mode in portable terminal
WO2023184260A1 (zh) 一种信号传输方法/装置/设备及存储介质
WO2023077472A1 (zh) 一种信息更新方法、装置、用户设备、基站及存储介质
WO2023070407A1 (zh) 一种预编码方法、装置、用户设备、可重构智能表面ris阵列及存储介质
WO2023193276A1 (zh) 一种上报方法/装置/设备及存储介质
WO2023155053A1 (zh) 一种辅助通信设备的发送方法及设备/存储介质/装置
WO2023050153A1 (zh) 一种上报方法、装置、用户设备、网路侧设备及存储介质
WO2023240653A1 (zh) 音频信号格式确定方法、装置
WO2023065254A1 (zh) 一种信号编解码方法、装置、编码设备、解码设备及存储介质
WO2023122987A1 (zh) 一种重复传输方法及设备/存储介质/装置
WO2023130397A1 (zh) 一种预编码方法/装置/设备及存储介质
WO2023155200A1 (zh) 一种测量放松方法及设备、存储介质、装置
WO2023206177A1 (zh) Ai波束模型确定方法、装置
WO2023082194A1 (zh) 一种波束处理方法、装置、用户设备、ris阵列、基站及存储介质

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21966112

Country of ref document: EP

Kind code of ref document: A1