JP6410777B2

JP6410777B2 - Audio signal switching method and device

Info

Publication number: JP6410777B2
Application number: JP2016201593A
Authority: JP
Inventors: ▲ゼ▼新 ▲劉▼; 磊苗; 晨胡; 文海 ▲ウ▼; ▲ユエ▼ 郎; 清 ▲張▼
Original assignee: Huawei Technologies Co Ltd
Current assignee: Huawei Technologies Co Ltd
Priority date: 2010-04-28
Filing date: 2016-10-13
Publication date: 2018-10-24
Anticipated expiration: 2031-04-28
Also published as: EP2485029B1; CN101964189B; JP6027081B2; KR20120074303A; ES2718947T3; EP3249648B1; BR112012013306B8; AU2011247719A1; BR112012013306A2; ES2635212T3; JP5667202B2; EP2485029A1; KR101377547B1; CN101964189A; JP2015045888A; EP2485029A4; WO2011134415A1; JP2017033015A; AU2011247719B2; EP3249648A1

Description

本出願は、その全体が参照により本明細書に組み込まれる、2010年4月28日に出願された「METHOD AND APPARATUS FOR SWITCHING SPEECH OR AUDIO SIGNALS」と題する中国特許出願第201010163406.3号の優先権を主張する。 This application claims priority of Chinese Patent Application No. 201010163406.3 entitled “METHOD AND APPARATUS FOR SWITCHING SPEECH OR AUDIO SIGNALS” filed on April 28, 2010, which is incorporated herein by reference in its entirety. To do.

本発明は、通信技術に関し、特に音声信号を切り替えるための方法および装置に関する。 The present invention relates to communication technology, and more particularly to a method and apparatus for switching audio signals.

現在、ネットワーク上における音声信号の送信の処理中に、ネットワーク状態が異なりうることから、ネットワークは様々なビットレートでエンコーダからネットワークに送信される音声信号のビットストリームをインターセプトすることがあり、その結果デコーダは、インターセプトされたビットストリームから様々な帯域幅を有する音声信号を復号することがある。 Currently, during the process of transmitting audio signals over the network, the network conditions may be different, so the network may intercept the bit stream of the audio signal transmitted from the encoder to the network at various bit rates, and as a result The decoder may decode audio signals having different bandwidths from the intercepted bitstream.

従来技術では、ネットワーク上で送信される音声信号は様々な帯域幅を有するため、狭周波数帯域音声信号と広周波数帯域音声信号との間の双方向の切り替えが音声信号の送信処理中に生じることがある。本発明の実施形態では、狭周波数帯域信号は、アップサンプリングおよびローパスフィルタ処理により低周波数帯域要素のみを有する広周波数帯域信号に切り替えられる。広周波数帯域音声信号は、低周波数帯域信号要素および高周波数帯域信号要素の両方を含む。 In the prior art, since an audio signal transmitted over a network has various bandwidths, bidirectional switching between a narrow frequency band audio signal and a wide frequency band audio signal occurs during the audio signal transmission process. There is. In an embodiment of the present invention, a narrow frequency band signal is switched to a wide frequency band signal having only low frequency band elements by upsampling and low pass filter processing. The wide frequency band audio signal includes both low frequency band signal elements and high frequency band signal elements.

本発明の実施中に、本発明者は、従来技術における少なくとも以下の問題を発見している。高周波数帯域信号情報は、広周波数帯域音声信号で入手できるが、狭周波数帯域音声信号では存在しないため、様々な帯域幅を有する音声信号が切り替えられたとき、音声信号においてエネルギーの急変が生じて聞き心地が悪くなり、結果的にユーザが受信する音声信号の品質が低下することがある。 During the practice of the present invention, the inventor has discovered at least the following problems in the prior art. High frequency band signal information can be obtained as a wide frequency band audio signal, but it does not exist in a narrow frequency band audio signal. Therefore, when an audio signal having various bandwidths is switched, a sudden change in energy occurs in the audio signal. The listening comfort may deteriorate, and as a result, the quality of the audio signal received by the user may deteriorate.

本発明の実施形態は、様々な帯域幅の間で音声信号を円滑に切り替え、それによりユーザが受信する音声信号の品質を改善する、音声信号を切り替えるための方法および装置を提示する。 Embodiments of the present invention present a method and apparatus for switching audio signals that smoothly switches audio signals between various bandwidths, thereby improving the quality of the audio signals received by the user.

音声信号を切り替えるための方法は以下を含む。
音声信号の切り替えが生じたとき、現在の音声信号のフレームの第1高周波数帯域信号および以前のM(Mは1以上)個の音声信号のフレームの第2高周波数帯域信号をウェイト付けして、処理済み第1高周波数帯域信号を取得するステップ、および
処理済み第1高周波数帯域信号および現在の音声信号のフレームの第1低周波数帯域信号を広周波数帯域信号に合成するステップ。 Methods for switching audio signals include:
When audio signal switching occurs, the first high frequency band signal of the current audio signal frame and the second high frequency band signal of the previous M (M is 1 or more) audio signal frames are weighted. Obtaining a processed first high frequency band signal; and synthesizing the processed first high frequency band signal and the first low frequency band signal of the frame of the current audio signal into a wide frequency band signal.

音声信号を切り替えるための装置は以下を含む。
音声信号の切り替えが生じたとき、現在の音声信号のフレームの第1高周波数帯域信号および以前のM(Mは1以上)個の音声信号のフレームの第2高周波数帯域信号をウェイト付けして、処理済み第1高周波数帯域信号を取得するように構成された処理モジュール、および
処理済み第1高周波数帯域信号および現在の音声信号のフレームの第1低周波数帯域信号を広周波数帯域信号に合成するように構成された第1合成モジュール。 An apparatus for switching audio signals includes:
When audio signal switching occurs, the first high frequency band signal of the current audio signal frame and the second high frequency band signal of the previous M (M is 1 or more) audio signal frames are weighted. A processing module configured to acquire a processed first high frequency band signal, and a first low frequency band signal of the frame of the processed first high frequency band signal and the current audio signal into a wide frequency band signal. A first synthesis module configured to:

本発明の実施形態における音声信号を切り替えるための方法および装置を使用することによって、現在の音声信号のフレームの第1高周波数帯域信号は、以前のM個の音声信号のフレームの第2高周波数帯域信号に従って処理され、その結果、以前のM個の音声信号のフレームの第2高周波数帯域信号を処理済み第1高周波数帯域信号に円滑に切り替えることができ、処理済み第1高周波数帯域信号および第1低周波数帯域信号は広周波数帯域信号に合成される。このようにして、様々な帯域幅を有する音声信号間の切り替えの処理中に、こうした音声信号を円滑に切り替えることができ、それによりエネルギーの急変が音声信号の主観的音声品質に及ぼす悪影響を減らし、ユーザが受信する音声信号の品質を改善することができる。 By using the method and apparatus for switching audio signals in an embodiment of the present invention, the first high frequency band signal of the frame of the current audio signal is the second high frequency of the frame of the previous M audio signals. Processed according to the band signal, so that the second high frequency band signal of the frame of the previous M audio signals can be smoothly switched to the processed first high frequency band signal, the processed first high frequency band signal The first low frequency band signal is combined with the wide frequency band signal. In this way, during the process of switching between audio signals with different bandwidths, such audio signals can be switched smoothly, thereby reducing the negative effects of sudden changes in energy on the subjective audio quality of the audio signal. The quality of the audio signal received by the user can be improved.

本発明の技術的解決策を明確化するため、本発明の実施形態を示す添付の図面について以下に概説する。見かけ上、添付の図面は例にすぎず、当業者は創造的努力なしにかかる添付の図面から他の図面を導出できよう。 BRIEF DESCRIPTION OF THE DRAWINGS To clarify the technical solutions of the present invention, the accompanying drawings illustrating the embodiments of the present invention will be outlined below. Apparently, the accompanying drawings are merely examples, and those skilled in the art will be able to derive other drawings from such accompanying drawings without creative efforts.

音声信号を切り替えるための方法の第1実施形態の流れ図である。2 is a flowchart of a first embodiment of a method for switching an audio signal. 音声信号を切り替えるための方法の第2実施形態の流れ図である。6 is a flowchart of a second embodiment of a method for switching an audio signal. 図2に示すステップ201の実施形態の流れ図である。3 is a flowchart of the embodiment of step 201 shown in FIG. 図3に示すステップ302の実施形態の流れ図である。4 is a flowchart of the embodiment of step 302 shown in FIG. 図3に示すステップ302の別の実施形態の第2流れ図である。FIG. 4 is a second flowchart of another embodiment of step 302 shown in FIG. 図2に示すステップ202の実施形態の流れ図である。3 is a flowchart of the embodiment of step 202 shown in FIG. 図2に示すステップ201の別の実施形態の第2流れ図である。3 is a second flowchart of another embodiment of step 201 shown in FIG. 図2に示すステップ201の別の実施形態の第3流れ図である。4 is a third flowchart of another embodiment of step 201 shown in FIG. 音声信号を切り替えるための装置の第1実施形態の構造を示す図である。1 is a diagram illustrating a structure of a first embodiment of an apparatus for switching an audio signal. FIG. 音声信号を切り替えるための装置の第2実施形態の構造を示す図である。FIG. 6 is a diagram illustrating a structure of a second embodiment of an apparatus for switching audio signals. 音声信号を切り替えるための装置の第2実施形態における処理モジュールの構造を示す第1概略図である。FIG. 6 is a first schematic diagram showing the structure of a processing module in a second embodiment of an apparatus for switching audio signals. 音声信号を切り替えるための装置の第2実施形態における第1モジュールの構造を示す概略図である。FIG. 5 is a schematic diagram showing a structure of a first module in a second embodiment of the device for switching an audio signal. 音声信号を切り替えるための装置の第2実施形態における処理モジュールの構造を示す第2概略図である。FIG. 10 is a second schematic diagram showing the structure of a processing module in the second embodiment of the apparatus for switching audio signals. 音声信号を切り替えるための装置の第2実施形態における処理モジュールの構造を示す第3概略図である。FIG. 10 is a third schematic diagram showing the structure of a processing module in the second embodiment of the apparatus for switching audio signals.

本発明の目的、技術的解決策および利点を理解しやすくするため、以下では、実施形態および添付の図面を参照して本発明について詳述する。見かけ上、実施形態は例にすぎず、本発明はかかる実施形態に限定されない。当業者は著しい創造的努力を払うことなく本明細書で与えられる実施形態から他の実施形態を導出することができ、かかる実施形態はすべて本明細書の範囲内に入る。 In order to make the objects, technical solutions and advantages of the present invention easier to understand, the present invention will be described in detail below with reference to embodiments and the accompanying drawings. Apparently, the embodiments are merely examples, and the present invention is not limited to such embodiments. One skilled in the art can derive other embodiments from the embodiments given herein without making significant creative efforts, all such embodiments falling within the scope of the present specification.

図1は、音声信号を切り替えるための方法の第1実施形態の流れ図である。図1に示すように、音声信号を切り替えるための方法を使用することにより、音声の切り替えが生じたときに、切り替えフレームより後の各フレームが以下のステップに従い処理される。 FIG. 1 is a flowchart of a first embodiment of a method for switching audio signals. As shown in FIG. 1, by using a method for switching audio signals, each frame after the switching frame is processed according to the following steps when the audio switching occurs.

ステップ101:音声の切り替えが生じたとき、現在の音声信号のフレームの第1高周波数帯域信号および以前のM(Mは1以上)個の音声信号のフレームの第2高周波数帯域信号をウェイト付けして、処理済み第1高周波数帯域信号を取得する。 Step 101: When audio switching occurs, weight the first high frequency band signal of the current audio signal frame and the second high frequency band signal of the previous M (M is 1 or more) audio signal frames Then, the processed first high frequency band signal is acquired.

ステップ102:処理済み第1高周波数帯域信号および現在の音声信号のフレームの第1低周波数帯域信号を広周波数帯域信号に合成する。 Step 102: Synthesize the processed first high frequency band signal and the first low frequency band signal of the frame of the current audio signal into a wide frequency band signal.

この実施形態では、以前のM個の音声信号のフレームは、現在のフレームより前のM個の音声信号のフレームを指す。切り替え前のL個の音声信号のフレームは、音声の切り替えが生じたときの切り替えフレームより前のL個の音声信号のフレームを指す。現在の音声フレームは広周波数帯域信号であるが、以前の音声フレームは狭周波数帯域信号である場合、または現在の音声フレームは狭周波数帯域信号であるが、以前の音声フレームは広周波数帯域信号である場合、音声信号は切り替えされ、現在の音声フレームは切り替えフレームである。 In this embodiment, the previous M audio signal frames refer to the M audio signal frames prior to the current frame. The frame of L audio signals before switching indicates the frame of L audio signals before the switching frame when the audio is switched. The current voice frame is a wide frequency band signal, but the previous voice frame is a narrow frequency band signal, or the current voice frame is a narrow frequency band signal, but the previous voice frame is a wide frequency band signal. In some cases, the audio signal is switched and the current audio frame is a switching frame.

この実施形態における音声信号を切り替えるための方法を使用することによって、現在の音声信号のフレームの第1高周波数帯域信号が、以前のM個の音声信号のフレームの第2高周波数帯域信号に従って処理され、その結果、以前のM個の音声信号のフレームの第2高周波数帯域信号を処理済み第1高周波数帯域信号に円滑に切り替えることができる。このようにして、様々な帯域幅を有する音声信号間の切り替えの処理中に、こうした音声信号の高周波数帯域信号を円滑に切り替えることができる。最後に、処理済み第1高周波数帯域信号および第1低周波数帯域信号は広周波数帯域信号に合成され、広周波数帯域信号はユーザ端末に送信され、その結果、ユーザは高品質の音声信号を享受することができる。この実施形態における音声信号を切り替えるための方法を使用することによって、様々な帯域幅を有する音声信号を円滑に切り替えることができ、それによりエネルギーの急変が音声信号の主観的音声品質に及ぼす影響を減らし、ユーザが受信する音声信号の品質を改善することができる。 By using the method for switching audio signals in this embodiment, the first high frequency band signal of the frame of the current audio signal is processed according to the second high frequency band signal of the frame of the previous M audio signals. As a result, the second high frequency band signal of the frame of the previous M audio signals can be smoothly switched to the processed first high frequency band signal. In this way, during the process of switching between audio signals having various bandwidths, the high frequency band signals of such audio signals can be switched smoothly. Finally, the processed first high frequency band signal and first low frequency band signal are combined into a wide frequency band signal, and the wide frequency band signal is transmitted to the user terminal, so that the user enjoys a high quality audio signal. can do. By using the method for switching audio signals in this embodiment, it is possible to smoothly switch audio signals having various bandwidths, and thereby influence the sudden change of energy on the subjective audio quality of the audio signal. The quality of the audio signal received by the user can be improved.

図2は、音声信号を切り替えるための方法の第2実施形態の流れ図である。図2に示すように、本方法は以下のステップを含む。 FIG. 2 is a flowchart of a second embodiment of a method for switching audio signals. As shown in FIG. 2, the method includes the following steps.

ステップ200:音声信号の切り替えが生じていないとき、現在の音声信号のフレームの第1高周波数帯域信号および第1低周波数帯域信号を広周波数帯域信号に合成する。 Step 200: When no audio signal switching occurs, the first high frequency band signal and the first low frequency band signal of the frame of the current audio signal are combined into a wide frequency band signal.

具体的には、この実施形態における第1周波数帯域音声信号は、広周波数帯域音声信号または狭周波数帯域音声信号でありうる。第1周波数帯域音声信号が音声信号の送信中に切り替えられていないとき、以下の2つのケースに従ってオペレーションを実行できる。1.第1周波数帯域音声信号が広周波数帯域音声信号である場合、広周波数帯域音声信号の低周波数帯域信号および高周波数帯域信号が広周波数帯域信号に合成される。2.第1周波数帯域音声信号が狭周波数帯域音声信号である場合、狭周波数帯域音声信号の低周波数帯域信号および高周波数帯域信号が広周波数帯域信号に合成される。この場合、信号は広周波数帯域信号であるが、高周波数帯域はヌルである。 Specifically, the first frequency band audio signal in this embodiment may be a wide frequency band audio signal or a narrow frequency band audio signal. When the first frequency band audio signal is not switched during transmission of the audio signal, the operation can be performed according to the following two cases. 1. When the first frequency band audio signal is a wide frequency band audio signal, the low frequency band signal and the high frequency band signal of the wide frequency band audio signal are combined with the wide frequency band signal. 2. When the first frequency band audio signal is a narrow frequency band audio signal, the low frequency band signal and the high frequency band signal of the narrow frequency band audio signal are combined into a wide frequency band signal. In this case, the signal is a wide frequency band signal, but the high frequency band is null.

ステップ201:音声信号が切り替えられたとき、現在の音声信号のフレームの第1高周波数帯域信号および以前のM個の音声信号のフレームの第2高周波数帯域信号をウェイト付けして、処理済み第1高周波数帯域信号を取得する。Mは1以上である。 Step 201: When the audio signal is switched, the first high frequency band signal of the current audio signal frame and the second high frequency band signal of the previous M audio signal frames are weighted and processed first. 1 Acquire a high frequency band signal. M is 1 or more.

具体的には、様々な帯域幅を有する音声信号間で切り替えが生じたとき、現在の音声信号のフレームの第1高周波数帯域信号が、以前のM個の音声信号のフレームの第2高周波数帯域信号に従って処理され、その結果、以前のM個の音声信号のフレームの第2高周波数帯域信号を処理済み第1高周波数帯域信号に円滑に切り替えることができる。例えば、広周波数帯域音声信号が狭周波数帯域音声信号に切り替えられたとき、狭周波数帯域音声信号に対応する高周波数帯域信号情報がヌルであるため、狭周波数帯域音声信号に対応する高周波数帯域信号の要素を復元して、広周波数帯域音声信号を狭周波数帯域音声信号に円滑に切り替えられるようにする必要がある。一方、狭周波数帯域音声信号が広周波数帯域音声信号に切り替えられたとき、広周波数帯域音声信号の高周波数帯域信号はヌルでないため、切り替え後の連続する複数のフレームの広周波数帯域音声信号の高周波数帯域信号のエネルギーを弱めて、狭周波数帯域音声信号を広周波数帯域音声信号に円滑に切り替えることを可能にし、その結果、広周波数帯域音声信号の高周波数帯域信号が真の高周波数帯域信号に徐々に切り替えられるようにしなければならない。ステップ201で現在の音声信号のフレームを処理することによって、様々な帯域幅を有する音声信号の高周波数帯域信号を円滑に切り替えることができ、広周波数帯域音声信号と狭周波数帯域音声信号との間の切り替えの処理中におけるエネルギーの急変に起因するユーザの聞き心地の悪さが回避され、ユーザは高品質の音声信号を受信することができる。処理済み第1高周波数帯域信号を取得する処理を単純化するため、第1高周波数帯域信号および以前のM個の音声信号のフレームの第2高周波数帯域信号は直接的にウェイト付けできる。ウェイト付け結果は、処理済み第1高周波数帯域信号である。 Specifically, when a switch occurs between audio signals having various bandwidths, the first high frequency band signal of the current audio signal frame is changed to the second high frequency of the previous M audio signal frames. As a result, the second high frequency band signal of the previous M audio signal frames can be smoothly switched to the processed first high frequency band signal. For example, when the wide frequency band audio signal is switched to the narrow frequency band audio signal, the high frequency band signal information corresponding to the narrow frequency band audio signal is null, so the high frequency band signal corresponding to the narrow frequency band audio signal is It is necessary to restore the above-mentioned elements so that the wide frequency band audio signal can be smoothly switched to the narrow frequency band audio signal. On the other hand, when the narrow frequency band audio signal is switched to the wide frequency band audio signal, the high frequency band signal of the wide frequency band audio signal is not null. The energy of the frequency band signal is weakened, and the narrow frequency band audio signal can be smoothly switched to the wide frequency band audio signal. As a result, the high frequency band signal of the wide frequency band audio signal becomes a true high frequency band signal. It must be possible to switch gradually. By processing the frame of the current audio signal in step 201, it is possible to smoothly switch the high frequency band signal of the audio signal having various bandwidths between the wide frequency band audio signal and the narrow frequency band audio signal. The user's uncomfortable listening caused by a sudden change in energy during the switching process is avoided, and the user can receive a high-quality audio signal. To simplify the process of obtaining the processed first high frequency band signal, the first high frequency band signal and the second high frequency band signal of the frame of the previous M audio signals can be weighted directly. The weighting result is the processed first high frequency band signal.

ステップ202:処理済み第1高周波数帯域信号および現在の音声信号のフレームの第1低周波数帯域信号を広周波数帯域信号に合成する。 Step 202: Synthesize the processed first high frequency band signal and the first low frequency band signal of the frame of the current audio signal into a wide frequency band signal.

具体的には、ステップ201で現在の音声信号のフレームが処理された後、以前のM個の音声信号のフレームの第2高周波数帯域信号を、現在のフレームの処理済み第1高周波数帯域信号に円滑に切り替えることができ、次いでステップ202で、処理済み第1高周波数帯域信号および現在の音声信号のフレームの第1低周波数帯域信号が広周波数帯域信号に合成されて、その結果、ユーザが受信する音声信号は常に広周波数帯域音声信号となる。このようにして、様々な帯域幅を有する音声信号は円滑に切り替えられ、これはユーザが受信する音声信号の品質改善に寄与する。 Specifically, after the current audio signal frame is processed in step 201, the second high frequency band signal of the previous M audio signal frames is changed to the processed first high frequency band signal of the current frame. Then, in step 202, the processed first high frequency band signal and the first low frequency band signal of the frame of the current audio signal are combined into a wide frequency band signal so that the user can The received audio signal is always a wide frequency band audio signal. In this way, audio signals with various bandwidths are switched smoothly, which contributes to improving the quality of the audio signal received by the user.

この実施形態における音声信号を切り替えるための方法を使用することによって、現在の音声信号のフレームの第1高周波数帯域信号が、以前のM個の音声信号のフレームの第2高周波数帯域信号に従って処理され、その結果、以前のM個の音声信号のフレームの第2高周波数帯域信号を処理済み第1高周波数帯域信号に円滑に切り替えることができる。このようにして、様々な帯域幅を有する音声信号間の切り替えの処理中に、こうした音声信号の高周波数帯域信号を円滑に切り替えることができる。最後に、処理済み第1高周波数帯域信号および第1低周波数帯域信号は広周波数帯域信号に合成され、広周波数帯域信号はユーザ端末に送信され、その結果、ユーザは高品質の音声信号を享受することができる。この実施形態における音声信号を切り替えるための方法を使用することによって、様々な帯域幅を有する音声信号を円滑に切り替えることができ、それによりエネルギーの急変が音声信号の主観的音声品質に及ぼす影響を減らし、ユーザが受信する音声信号の品質を改善することができる。加えて、様々な帯域幅を有する音声信号が切り替えられていないとき、現在の音声信号のフレームの第1高周波数帯域信号および第1低周波数帯域信号が広周波数帯域信号に合成され、その結果、ユーザは高品質の音声信号を取得することができる。 By using the method for switching audio signals in this embodiment, the first high frequency band signal of the frame of the current audio signal is processed according to the second high frequency band signal of the frame of the previous M audio signals. As a result, the second high frequency band signal of the frame of the previous M audio signals can be smoothly switched to the processed first high frequency band signal. In this way, during the process of switching between audio signals having various bandwidths, the high frequency band signals of such audio signals can be switched smoothly. Finally, the processed first high frequency band signal and first low frequency band signal are combined into a wide frequency band signal, and the wide frequency band signal is transmitted to the user terminal, so that the user enjoys a high quality audio signal. can do. By using the method for switching audio signals in this embodiment, it is possible to smoothly switch audio signals having various bandwidths, and thereby influence the sudden change of energy on the subjective audio quality of the audio signal. The quality of the audio signal received by the user can be improved. In addition, when audio signals having various bandwidths are not switched, the first high frequency band signal and the first low frequency band signal of the frame of the current audio signal are combined into a wide frequency band signal, The user can obtain a high quality audio signal.

上述の技術的解決策によれば、随意に、図3に示すように、広周波数帯域音声信号から狭周波数帯域音声信号への切り替えが生じたとき、ステップ201は以下のステップを含む。 According to the above technical solution, optionally, as shown in FIG. 3, when switching from a wide frequency band audio signal to a narrow frequency band audio signal occurs, step 201 includes the following steps.

ステップ301:第1高周波数帯域信号に対応する微細構造情報およびエンベロープ情報を予測する。 Step 301: Predict fine structure information and envelope information corresponding to the first high frequency band signal.

具体的には、音声信号を微細構造情報およびエンベロープ情報に分割することができ、その結果、微細構造情報およびエンベロープ情報に従って音声信号を復元することができる。広周波数帯域音声信号から狭周波数帯域音声信号への切り替えの処理において、低周波数帯域信号のみが狭周波数帯域音声信号で入手でき、高周波数帯域信号はヌルであるため、広周波数帯域音声信号を狭周波数帯域音声信号に円滑に切り替えることを可能にするには、現在の狭周波数帯域音声信号が必要とする高周波数帯域信号を復元することで、音声信号間の円滑な切り替えを実施できるようにする必要がある。ステップ301において、狭周波数帯域音声信号の第1高周波数帯域信号に対応する予測される微細構造情報およびエンベロープ情報が予測される。 Specifically, the audio signal can be divided into fine structure information and envelope information, and as a result, the audio signal can be restored according to the fine structure information and envelope information. In the process of switching from a wide frequency band audio signal to a narrow frequency band audio signal, only the low frequency band signal is available as the narrow frequency band audio signal, and the high frequency band signal is null, so the wide frequency band audio signal is narrowed. To enable smooth switching to a frequency band audio signal, the high frequency band signal required by the current narrow frequency band audio signal can be restored to enable smooth switching between the audio signals. There is a need. In step 301, predicted fine structure information and envelope information corresponding to the first high frequency band signal of the narrow frequency band audio signal are predicted.

現在の音声信号のフレームに対応する微細構造情報およびエンベロープ情報をより正確に予測するために、ステップ301で現在の音声信号のフレームの第1低周波数帯域信号を分類することができ、次いで第1高周波数帯域信号に対応する予測される微細構造情報およびエンベロープ情報が、第1低周波数帯域信号の信号タイプに従って予測される。例えば、現在のフレームの狭周波数帯域音声信号は、調波信号、または非調波信号、または過渡信号でありうる。この場合、狭周波数帯域音声信号のタイプに対応する微細構造情報およびエンベロープ情報を取得することができ、その結果、高周波数帯域信号に対応する微細構造情報およびエンベロープ情報をより正確に予測することができる。この実施形態における音声信号を切り替えるための方法は、狭周波数帯域音声信号の信号タイプを限定しない。 In order to more accurately predict the fine structure information and envelope information corresponding to the frame of the current audio signal, the first low frequency band signal of the frame of the current audio signal can be classified in step 301, and then the first Predicted fine structure information and envelope information corresponding to the high frequency band signal are predicted according to the signal type of the first low frequency band signal. For example, the narrow-band audio signal of the current frame can be a harmonic signal, a non-harmonic signal, or a transient signal. In this case, the fine structure information and envelope information corresponding to the type of the narrow frequency band audio signal can be acquired, and as a result, the fine structure information and envelope information corresponding to the high frequency band signal can be predicted more accurately. it can. The method for switching audio signals in this embodiment does not limit the signal type of the narrow frequency band audio signal.

ステップ302:予測済みエンベロープ情報および以前のM個の音声信号のフレームの第2高周波数帯域信号に対応する以前のM個のフレームのエンベロープ情報をウェイト付けして、第1高周波数帯域信号に対応する第1エンベロープ情報を取得する。 Step 302: Weighting the envelope information of the previous M frames corresponding to the predicted envelope information and the second high frequency band signal of the frame of the previous M audio signals, to support the first high frequency band signal Get first envelope information.

具体的には、ステップ301で現在のフレームの第1高周波数帯域信号に対応する予測される微細構造情報およびエンベロープ情報が予測された後、第1高周波数帯域信号に対応する第1エンベロープ情報を、予測済みエンベロープ情報および以前のM個の音声信号のフレームの第2高周波数帯域信号に対応する以前のM個のフレームのエンベロープ情報に従い生成できる。 Specifically, after the predicted fine structure information and envelope information corresponding to the first high frequency band signal of the current frame are predicted in step 301, the first envelope information corresponding to the first high frequency band signal is obtained. , According to the predicted envelope information and the envelope information of the previous M frames corresponding to the second high frequency band signal of the frame of the previous M speech signals.

具体的には、ステップ302における第1高周波数帯域信号に対応する第1エンベロープ情報を生成するプロセスは、以下の2つの方式を使用して実施できる。 Specifically, the process of generating the first envelope information corresponding to the first high frequency band signal in step 302 can be performed using the following two methods.

1.図4に示すように、ステップ302による第1エンベロープ情報を取得する実施形態は以下のステップを含むことができる。 1. As shown in FIG. 4, the embodiment for obtaining the first envelope information in step 302 may include the following steps.

ステップ401:第1低周波数帯域信号と以前のN個の音声信号のフレームの低周波数帯域信号との間の相関係数を、第1低周波数帯域信号および以前のN個の音声信号のフレームの低周波数帯域信号に従って計算する(Nは1以上)。 Step 401: calculating a correlation coefficient between the first low frequency band signal and the low frequency band signal of the previous N audio signal frames, and calculating the correlation coefficient between the first low frequency band signal and the previous N audio signal frames; Calculate according to the low frequency band signal (N is 1 or more).

具体的には、現在の音声信号のフレームの第1低周波数帯域信号を以前のN個の音声信号のフレームの低周波数帯域信号と比較して、現在の音声信号のフレームの第1低周波数帯域信号と以前のN個の音声信号のフレームの低周波数帯域信号との間の相関係数を取得する。例えば、現在の音声信号のフレームの第1低周波数帯域信号と以前のN個の音声信号のフレームの低周波数帯域信号との間の相関は、エネルギー規模または情報タイプの点で、現在の音声信号のフレームの第1低周波数帯域信号の周波数帯域と以前のN個の音声信号のフレームの低周波数帯域信号の同じ周波数帯域との差を判断することによって割り出すことができ、その結果、所望の相関係数を計算することができる。以前のN個の音声信号のフレームは狭周波数帯域音声信号、広周波数帯域音声信号、または狭周波数帯域音声信号および広周波数帯域音声信号の混成信号でありうる。 Specifically, the first low frequency band signal of the current audio signal frame is compared with the low frequency band signal of the previous N audio signal frames, and the first low frequency band of the current audio signal frame is compared. A correlation coefficient between the signal and the low frequency band signal of the frame of the previous N speech signals is obtained. For example, the correlation between the first low frequency band signal of the frame of the current audio signal and the low frequency band signal of the frame of the previous N audio signals may be expressed in terms of energy scale or information type. Can be determined by determining the difference between the frequency band of the first low frequency band signal of the frame of the frame and the same frequency band of the low frequency band signal of the frame of the previous N speech signals, and thus the desired phase The number of relationships can be calculated. The previous frame of N audio signals may be a narrow frequency band audio signal, a wide frequency band audio signal, or a hybrid signal of a narrow frequency band audio signal and a wide frequency band audio signal.

ステップ402:相関係数が所与の第1しきい値範囲内にあるか否かを判断する。 Step 402: Determine whether the correlation coefficient is within a given first threshold range.

具体的には、ステップ401で相関係数が計算された後に、相関係数が所与のしきい値範囲内にあるか否かが判断される。相関係数を計算する目的は、現在の音声信号のフレームが、以前のN個の音声信号のフレームから徐々に切り替えられているか、それとも以前のN個の音声信号のフレームから突然切り替えられているかを判断することである。すなわち、目的は、それらの特性が同じであるか否かを判断し、次いで、現在の音声信号の高周波数帯域信号を予測する処理における以前のフレームの高周波数帯域信号のウェイトを決定することである。例えば、現在の音声信号のフレームの第1低周波数帯域信号が、以前の音声信号のフレームの低周波数帯域信号と同じエネルギーを有し、これらの信号タイプが同じである場合、それは、以前の音声信号のフレームは現在の音声信号のフレームとの相関が高いことを示している。したがって、現在の音声信号のフレームに対応する第1エンベロープ情報を正確に復元するにあたり、以前の音声信号のフレームに対応する高周波数帯域のエンベロープ情報または過渡エンベロープ情報はより大きなウェイトを占める。一方、エネルギーおよび信号タイプの点で、現在の音声信号のフレームの第1低周波数帯域信号と以前の音声信号のフレームの低周波数帯域信号との間に大きな差がある場合、それは、以前の音声信号と現在の音声信号のフレームとの相関が低いことを示している。したがって、現在の音声信号のフレームに対応する第1エンベロープ情報を正確に復元するにあたり、以前の音声信号のフレームに対応する高周波数帯域のエンベロープ情報または過渡エンベロープ情報は、より小さなウェイトを占める。 Specifically, after the correlation coefficient is calculated in step 401, it is determined whether or not the correlation coefficient is within a given threshold range. The purpose of calculating the correlation coefficient is whether the frame of the current speech signal is gradually switched from the previous N speech signal frames or suddenly switched from the previous N speech signal frames Is to judge. That is, the purpose is to determine whether their characteristics are the same and then to determine the weight of the high frequency band signal of the previous frame in the process of predicting the high frequency band signal of the current speech signal. is there. For example, if the first low frequency band signal of the frame of the current audio signal has the same energy as the low frequency band signal of the frame of the previous audio signal and these signal types are the same, The signal frame has a high correlation with the current audio signal frame. Therefore, in accurately restoring the first envelope information corresponding to the frame of the current audio signal, the envelope information or transient envelope information in the high frequency band corresponding to the previous frame of the audio signal occupies a greater weight. On the other hand, if there is a large difference in energy and signal type between the first low frequency band signal of the frame of the current voice signal and the low frequency band signal of the frame of the previous voice signal, It shows that the correlation between the signal and the frame of the current audio signal is low. Therefore, in accurately restoring the first envelope information corresponding to the frame of the current audio signal, the envelope information or transient envelope information of the high frequency band corresponding to the previous frame of the audio signal occupies a smaller weight.

ステップ403:相関係数が所与の第1しきい値範囲内にない場合、指定の第1ウェイト1および指定の第1ウェイト2に従いウェイト付けして、第1エンベロープ情報を計算する。第1ウェイト1は、以前の音声信号のフレームの高周波数帯域信号に対応する以前のフレームのエンベロープ情報のウェイト値を指し、第1ウェイト2は、エンベロープ情報のウェイト値を指す。 Step 403: If the correlation coefficient is not within the given first threshold range, the first envelope information is calculated by weighting according to the designated first weight 1 and the designated first weight 2. The first weight 1 indicates the weight value of the envelope information of the previous frame corresponding to the high frequency band signal of the frame of the previous audio signal, and the first weight 2 indicates the weight value of the envelope information.

具体的には、ステップ402において、相関係数が所与の第1しきい値範囲内にないと判断された場合、それは、現在の音声信号のフレームと以前のN個の音声信号のフレームとの相関がわずかであることを示している。したがって、以前のM個のフレームの第1周波数帯域音声信号に対応する以前のM個のフレームのエンベロープ情報または過渡エンベロープ情報、または以前の音声信号のフレームに対応する高周波数帯域のエンベロープ情報は、第1エンベロープ情報にわずかな影響を及ぼす。現在の音声信号のフレームに対応する第1エンベロープ情報が復元されるとき、以前のM個のフレームの第1周波数帯域音声信号に対応する以前のM個のフレームのエンベロープ情報または過渡エンベロープ情報、または以前の音声信号のフレームに対応する高周波数帯域のエンベロープ情報は、より小さなウェイトを占める。したがって、現在のフレームの第1エンベロープ情報は、指定の第1ウェイト1および第1ウェイト2に従って計算できる。第1ウェイト1は、以前の音声信号のフレームの高周波数帯域信号に対応するエンベロープ情報のウェイト値を指す。以前の音声信号のフレームは、広周波数帯域音声信号または処理済み狭周波数帯域音声信号でありうる。第1切り替えの場合、以前の音声信号のフレームは広周波数帯域音声信号であり、一方で第1ウェイト2は予測済みエンベロープ情報のウェイト値を指す。予測済みエンベロープ情報と第1ウェイト2との積は、以前のフレームのエンベロープ情報と第1ウェイト1との積に加えられ、ウェイト付け和は、現在のフレームの第1エンベロープ情報である。また、その後に送信される音声信号は、この方法およびウェイトに従って処理される。音声信号に対応する第1エンベロープ情報は、音声信号が再び切り替えられるまで復元される。 Specifically, if it is determined in step 402 that the correlation coefficient is not within a given first threshold range, it is determined that the current audio signal frame and the previous N audio signal frames It is shown that there is a slight correlation. Therefore, the envelope information or transient envelope information of the previous M frames corresponding to the first frequency band audio signal of the previous M frames, or the envelope information of the high frequency band corresponding to the frames of the previous audio signal, Slightly affects the first envelope information. When the first envelope information corresponding to the frame of the current audio signal is restored, the envelope information or transient envelope information of the previous M frames corresponding to the first frequency band audio signal of the previous M frames, or The envelope information in the high frequency band corresponding to the previous audio signal frame occupies a smaller weight. Therefore, the first envelope information of the current frame can be calculated according to the designated first weight 1 and first weight 2. The first weight 1 indicates the weight value of the envelope information corresponding to the high frequency band signal of the previous audio signal frame. The previous audio signal frame may be a wide frequency band audio signal or a processed narrow frequency band audio signal. In the case of the first switching, the frame of the previous audio signal is a wide frequency band audio signal, while the first weight 2 indicates the weight value of the predicted envelope information. The product of the predicted envelope information and the first weight 2 is added to the product of the envelope information of the previous frame and the first weight 1, and the weighted sum is the first envelope information of the current frame. The audio signal transmitted thereafter is processed according to this method and weight. The first envelope information corresponding to the audio signal is restored until the audio signal is switched again.

ステップ404:相関係数が所与の第1しきい値範囲内にある場合、指定の第2ウェイト1および指定の第2ウェイト2に従いウェイト付けして、過渡エンベロープ情報を計算する。第2ウェイト1は、切り替え前のエンベロープ情報のウェイト値を指し、第2ウェイト2は、以前のM(Mは1以上)個のフレームのエンベロープ情報のウェイト値を指す。 Step 404: If the correlation coefficient is within the given first threshold range, the transient envelope information is calculated by weighting according to the designated second weight 1 and the designated second weight 2. The second weight 1 indicates the weight value of the envelope information before switching, and the second weight 2 indicates the weight value of the envelope information of the previous M frames (M is 1 or more).

具体的には、ステップ402で相関係数が所与のしきい値範囲内にあると判断された場合、現在の音声信号のフレームは、以前の連続したN個の音声信号のフレームと似た特性を有し、現在の音声信号のフレームに対応する第1エンベロープ情報は、以前の連続したN個の音声信号のフレームのエンベロープ情報によりかなり影響される。以前のM個のフレームのエンベロープに対する信頼性の観点から、現在の音声信号のフレームに対応する過渡エンベロープ情報を、以前のM個のフレームのエンベロープ情報および切り替え前のエンベロープ情報に従って計算する必要がある。現在の音声信号のフレームの第1エンベロープ情報が復元されるとき、以前のM個のフレームのエンベロープ情報および切り替え前の以前のL個のフレームのエンベロープ情報はより大きなウェイトを占めることになる。次いで、第1エンベロープ情報が過渡エンベロープ情報に従って計算される。第2ウェイト1は、切り替え前のエンベロープ情報のウェイト値を指し、第2ウェイト2は、以前のM個のフレームのエンベロープ情報のウェイト値を指す。この場合、切り替え前のエンベロープ情報と第2ウェイト1との積が、以前のM個のフレームのエンベロープ情報と第2ウェイト2との積に加えられ、ウェイト付け値は過渡エンベロープ情報となる。 Specifically, if it is determined in step 402 that the correlation coefficient is within a given threshold range, the frame of the current audio signal is similar to the previous frame of N audio signals. The first envelope information having the characteristics and corresponding to the frame of the current audio signal is considerably influenced by the envelope information of the frame of the previous consecutive N audio signals. From the viewpoint of reliability with respect to the envelope of the previous M frames, it is necessary to calculate the transient envelope information corresponding to the frame of the current audio signal according to the envelope information of the previous M frames and the envelope information before switching. . When the first envelope information of the frame of the current audio signal is restored, the envelope information of the previous M frames and the envelope information of the previous L frames before switching occupy a larger weight. Then, first envelope information is calculated according to the transient envelope information. The second weight 1 indicates the weight value of the envelope information before switching, and the second weight 2 indicates the weight value of the envelope information of the previous M frames. In this case, the product of the envelope information before switching and the second weight 1 is added to the previous product of the envelope information of M frames and the second weight 2, and the weighted value becomes transient envelope information.

ステップ405:第1ウェイトステップにより第2ウェイト1を縮小し、第1ウェイトステップにより第2ウェイト2を拡大する。 Step 405: The second weight 1 is reduced by the first weight step, and the second weight 2 is enlarged by the first weight step.

具体的には、音声信号が送信されると、切り替え前の広周波数帯域音声信号がその後の狭周波数帯域音声信号に及ぼす影響は、徐々に減少する。第1エンベロープ情報をより正確に計算するため、第2ウェイト1および第2ウェイト2に対して適応的調整を実行する必要がある。切り替え前のL個のフレームの広周波数帯域音声信号がその後の音声信号に及ぼす影響は徐々に減少するため、第2ウェイト1の値は徐々に小さくなり、一方で第2ウェイト2の値が徐々に大きくなることで、切り替え前のエンベロープ情報が第1エンベロープ情報に及ぼす影響が弱まる。ステップ405では、第2ウェイト1および第2ウェイト2を以下の式により修正できる。新第2ウェイト1=旧第2ウェイト1-第1ウェイトステップ;新第2ウェイト2=旧第2ウェイト2+第1ウェイトステップ、ここで第1ウェイトステップは指定の値である。 Specifically, when an audio signal is transmitted, the influence of the wide frequency band audio signal before switching on the subsequent narrow frequency band audio signal gradually decreases. In order to calculate the first envelope information more accurately, it is necessary to perform adaptive adjustment on the second weight 1 and the second weight 2. Since the influence of the wide frequency band audio signal of L frames before switching on the subsequent audio signal gradually decreases, the value of the second weight 1 gradually decreases, while the value of the second weight 2 gradually increases By increasing the value, the influence of the envelope information before switching on the first envelope information is weakened. In step 405, the second weight 1 and the second weight 2 can be corrected by the following equations. New second weight 1 = old second weight 1-first weight step; new second weight 2 = old second weight 2 + first weight step, where the first weight step is a specified value.

ステップ406:指定の第3ウェイト1が第1ウェイト1より大きいか否かを判断する。 Step 406: It is determined whether or not the designated third weight 1 is greater than the first weight 1.

具体的には、第3ウェイト1は過渡エンベロープ情報のウェイト値を指す。過渡エンベロープ情報が現在のフレームの第1エンベロープ情報に及ぼす影響は、第3ウェイト1を第2ウェイト1と比較することによって判断できる。過渡エンベロープ情報は以前のM個のフレームのエンベロープ情報および切り替え前のエンベロープ情報に従って計算される。したがって、第3ウェイト1は実際に、第1エンベロープ情報が切り替え前のエンベロープ情報から受ける影響の度合いを表す。 Specifically, the third weight 1 indicates the weight value of the transient envelope information. The effect of the transient envelope information on the first envelope information of the current frame can be determined by comparing the third weight 1 with the second weight 1. The transient envelope information is calculated according to the envelope information of the previous M frames and the envelope information before switching. Therefore, the third weight 1 actually represents the degree of influence that the first envelope information receives from the envelope information before switching.

ステップ407:第3ウェイト1が第1ウェイト1より大きくない場合、指定の第1ウェイト1および第1ウェイト2に従いウェイト付けして、第1エンベロープ情報を計算する。 Step 407: If the third weight 1 is not larger than the first weight 1, weighting is performed according to the designated first weight 1 and first weight 2, and the first envelope information is calculated.

具体的には、ステップ406で第3ウェイト1が第1ウェイト1以下であると判断された場合、それは、現在の音声信号のフレームが切り替え前のL個の音声信号のフレームから少し遠いこと、および第1エンベロープ情報が主に以前のM個のフレームのエンベロープ情報により影響されていることを示している。したがって、現在のフレームの第1エンベロープ情報は、指定の第1ウェイト1および第1ウェイト2に従って計算できる。 Specifically, if it is determined in step 406 that the third weight 1 is less than or equal to the first weight 1, it means that the current audio signal frame is slightly far from the L audio signal frames before switching, And the first envelope information is mainly influenced by the envelope information of the previous M frames. Therefore, the first envelope information of the current frame can be calculated according to the designated first weight 1 and first weight 2.

ステップ408:第3ウェイト1が第1ウェイト1より大きい場合、指定の第3ウェイト1および第3ウェイト2に従いウェイト付けして、第1エンベロープ情報を計算する。第3ウェイト1は、過渡エンベロープ情報のウェイト値を指し、第3ウェイト2は、予測済みエンベロープ情報のウェイト値を指す。 Step 408: When the third weight 1 is larger than the first weight 1, weighting is performed according to the designated third weight 1 and third weight 2, and the first envelope information is calculated. The third weight 1 indicates the weight value of the transient envelope information, and the third weight 2 indicates the weight value of the predicted envelope information.

具体的には、ステップ406で第3ウェイト1が第1ウェイト1より大きいと判断された場合、それは、現在の音声信号のフレームが切り替え前のL個の音声信号のフレームにより近いこと、および第1エンベロープ情報が切り替え前のエンベロープ情報によって大きく影響されていることを示している。したがって、現在のフレームの第1エンベロープ情報を、過渡エンベロープ情報に従って計算する必要がある。第3ウェイト1は、過渡エンベロープ情報のウェイト値を指し、第3ウェイト2は、予測済みエンベロープ情報のウェイト値を指す。この場合、過渡エンベロープ情報と第3ウェイト1との積が、予測済みエンベロープ情報と第3ウェイト2との積に加えられ、ウェイト付け値は第1エンベロープ情報となる。 Specifically, if it is determined in step 406 that the third weight 1 is greater than the first weight 1, it means that the current audio signal frame is closer to the L audio signal frames before switching, and 1 Envelope information is greatly affected by the envelope information before switching. Therefore, it is necessary to calculate the first envelope information of the current frame according to the transient envelope information. The third weight 1 indicates the weight value of the transient envelope information, and the third weight 2 indicates the weight value of the predicted envelope information. In this case, the product of the transient envelope information and the third weight 1 is added to the product of the predicted envelope information and the third weight 2, and the weighted value becomes the first envelope information.

ステップ409:第2ウェイトステップにより第3ウェイト1を縮小し、第3ウェイト1が0に等しくなるまで第2ウェイトステップにより第3ウェイト2を拡大する。 Step 409: The third weight 1 is reduced by the second weight step, and the third weight 2 is enlarged by the second weight step until the third weight 1 becomes equal to 0.

具体的には、ステップ409で第3ウェイト1および第3ウェイト2を修正する目的は、ステップ405で第2ウェイト1および第2ウェイト2を修正する目的と同じである。すなわち、その目的は、切り替え前のL個の音声信号のフレームがその後に送信される音声信号に及ぼす影響が徐々に減少するときに、第3ウェイト1および第3ウェイト2に対して適応的調整を実行して、第1エンベロープ情報をより正確に計算することである。切り替え前のL個の音声信号のフレームがその後の音声信号に及ぼす影響は徐々に減少するため、第3ウェイト1の値は徐々に小さくなり、一方で第3ウェイト2の値が徐々に大きくなることで、切り替え前のエンベロープ情報が第1エンベロープ情報に及ぼす影響は弱まる。ステップ409では、第3ウェイト1および第3ウェイト2を以下の式により修正できる。新第3ウェイト1=旧第3ウェイト1-第2ウェイトステップ;新第3ウェイト2=旧第3ウェイト2+第2ウェイトステップ、ここで第2ウェイトステップは指定の値である。 Specifically, the purpose of correcting the third weight 1 and the third weight 2 in step 409 is the same as the purpose of correcting the second weight 1 and the second weight 2 in step 405. That is, the purpose is to adaptively adjust the third weight 1 and the third weight 2 when the influence of the L audio signal frames before switching gradually decreases on the audio signal transmitted thereafter. To calculate the first envelope information more accurately. Since the influence of the L audio signal frames before switching on the subsequent audio signal gradually decreases, the value of the third weight 1 gradually decreases, while the value of the third weight 2 gradually increases As a result, the influence of the envelope information before switching on the first envelope information is weakened. In step 409, the third weight 1 and the third weight 2 can be corrected by the following equations. New third weight 1 = old third weight 1-second weight step; new third weight 2 = old third weight 2 + second weight step, where the second weight step is a designated value.

第1ウェイト1と第1ウェイト2との和は1に等しく、第2ウェイト1と第2ウェイト2との和は1に等しく、第3ウェイト1と第3ウェイト2との和は1に等しく、第3ウェイト1の初期値は第1ウェイト1の初期値より大きく、第1ウェイト1および第1ウェイト2は定数である。具体的には、この実施形態におけるウェイト1およびウェイト2は実際に、現在のフレームの第1エンベロープ情報における、切り替え前のエンベロープ情報および以前のM個のフレームのエンベロープ情報のパーセンテージを表す。現在の音声信号のフレームが切り替え前のL個の音声信号のフレームに近く、両者の相関が高い場合、切り替え前のエンベロープ情報のパーセンテージは高く、一方で以前のM個のフレームのエンベロープ情報のパーセンテージは低い。現在の音声信号のフレームが切り替え前のL個の音声信号のフレームから少し遠い場合、それは、音声信号がネットワーク上で安定的に送信されていることを示しており、あるいは、現在の音声信号のフレームが切り替え前のL個の音声信号のフレームとわずかに相関する場合、それは、現在の音声信号のフレームの特性が既に変更されていることを示している。したがって、現在の音声信号のフレームが、切り替え前のL個の音声信号のフレームによりわずかに影響されている場合、切り替え前のエンベロープ情報のパーセンテージは低い。 The sum of the first weight 1 and the first weight 2 is equal to 1, the sum of the second weight 1 and the second weight 2 is equal to 1, and the sum of the third weight 1 and the third weight 2 is equal to 1. The initial value of the third weight 1 is larger than the initial value of the first weight 1, and the first weight 1 and the first weight 2 are constants. Specifically, weight 1 and weight 2 in this embodiment actually represent the percentage of envelope information before switching and envelope information of previous M frames in the first envelope information of the current frame. If the frame of the current audio signal is close to the L audio signal frames before switching and the correlation between them is high, the percentage of envelope information before switching is high, while the percentage of envelope information of the previous M frames Is low. If the frame of the current audio signal is a bit far from the L audio signal frames before switching, it indicates that the audio signal is being transmitted stably over the network, or If the frame is slightly correlated with the frame of the L audio signals before switching, it indicates that the characteristics of the frame of the current audio signal have already been changed. Thus, if the current audio signal frame is slightly affected by the L audio signal frames before switching, the percentage of envelope information before switching is low.

さらに、ステップ404をステップ405の後に実行することもできる。すなわち、最初に第2ウェイト1および第2ウェイト2を修正することができ、次に過渡エンベロープ情報が第2ウェイト1および第2ウェイト2に従って計算される。同様に、ステップ408をステップ409の後に実行することもできる。すなわち、最初に第3ウェイト1および第3ウェイト2を修正することができ、次に第1エンベロープ情報が第3ウェイト1および第3ウェイト2に従って計算される。 Further, step 404 can be executed after step 405. That is, the second weight 1 and the second weight 2 can be corrected first, and then the transient envelope information is calculated according to the second weight 1 and the second weight 2. Similarly, step 408 can be performed after step 409. That is, the third weight 1 and the third weight 2 can be corrected first, and then the first envelope information is calculated according to the third weight 1 and the third weight 2.

2.図5に示すように、ステップ302による第1エンベロープ情報を取得する別の実施形態は、以下のステップをさらに含むことができる。 2. As shown in FIG. 5, another embodiment of obtaining the first envelope information according to step 302 may further include the following steps.

ステップ501:第1低周波数帯域信号と以前の音声信号のフレームの低周波数帯域信号との間の相関を、現在の音声信号のフレームの第1低周波数帯域信号および以前の音声信号のフレームの低周波数帯域信号に従って計算する。 Step 501: The correlation between the first low frequency band signal and the low frequency band signal of the frame of the previous audio signal is changed to the low frequency band signal of the first audio signal frame and the frame of the previous audio signal. Calculate according to the frequency band signal.

具体的には、より正確な第1エンベロープ情報を取得するために、現在の音声信号のフレームの第1低周波数帯域信号の周波数帯域と以前の音声信号のフレームの低周波数帯域信号の同じ周波数帯域との間の関係が計算される。この実施形態において、「corr」を使用して相関係数を示すことがある。この相関係数は、現在の音声信号のフレームの第1低周波数帯域信号と以前の音声信号のフレームの低周波数帯域信号との間のエネルギー関係により取得される。エネルギー差が小さい場合は「corr」が大きく、逆の場合は「corr」が小さい。具体的なプロセスについては、ステップ401における以前のN個の音声信号のフレームの係数に関する計算を参照されたい。 Specifically, in order to obtain more accurate first envelope information, the frequency band of the first low frequency band signal of the current audio signal frame and the same frequency band of the low frequency band signal of the previous audio signal frame The relationship between is calculated. In this embodiment, “corr” may be used to indicate the correlation coefficient. This correlation coefficient is obtained from the energy relationship between the first low frequency band signal of the frame of the current audio signal and the low frequency band signal of the frame of the previous audio signal. “Corr” is large when the energy difference is small, and “corr” is small in the opposite case. For the specific process, refer to the calculation regarding the coefficient of the frame of the previous N audio signals in step 401.

ステップ502:相関係数が所与の第2しきい値範囲内にあるか否かを判断する。 Step 502: Determine whether the correlation coefficient is within a given second threshold range.

具体的には、ステップ501で「corr」の値が計算された後、計算された「corr」値が所与の第2しきい値内にあるか否かが判断される。例えば、第2しきい値範囲はこの実施形態でc1〜c2によって表されうる。 Specifically, after the value of “corr” is calculated in step 501, it is determined whether the calculated “corr” value is within a given second threshold. For example, the second threshold range may be represented by c1-c2 in this embodiment.

ステップ503:相関係数が所与の第2しきい値範囲内にない場合、指定の第1ウェイト1および第1ウェイト2に従いウェイト付けして、第1エンベロープ情報を計算する。第1ウェイト1は、以前の音声信号のフレームの高周波数帯域信号に対応する以前のフレームのエンベロープ情報のウェイト値を指し、第1ウェイト2は、予測済みエンベロープ情報のウェイト値を指す。第1ウェイト1および第2ウェイト2は定数である。 Step 503: If the correlation coefficient is not within the given second threshold range, the first envelope information is calculated by weighting according to the designated first weight 1 and first weight 2. The first weight 1 indicates the weight value of the envelope information of the previous frame corresponding to the high frequency band signal of the frame of the previous audio signal, and the first weight 2 indicates the weight value of the predicted envelope information. The first weight 1 and the second weight 2 are constants.

具体的には、「corr」値がc1より小さいか、c2より大きいと判断されたとき、現在の音声信号のフレームに対応する第1エンベロープ情報が、切り替え前の以前の音声信号のフレームのエンベロープ情報によってわずかに影響されていると判断される。したがって、現在のフレームの第1エンベロープ情報は指定の第1ウェイト1および第1ウェイト2に従って計算される。予測済みエンベロープ情報と第1ウェイト2との積が、以前のフレームのエンベロープ情報と第1ウェイト1との積に加えられ、ウェイト付け和は、現在のフレームの第1エンベロープ情報である。また、その後に送信される狭帯域音声信号は、この方法およびウェイトに従って処理される。狭帯域音声信号に対応する第1エンベロープ情報は、様々な帯域幅を有する音声信号が再び切り替えられるまで復元される。例えば、この実施形態における第1ウェイト1はa1によって表すことができ、第1ウェイト2はb1によって表すことができ、以前のフレームのエンベロープ情報はpre_fenvによって表すことができ、予測済みエンベロープ情報はfenvによって表すことができ、第1エンベロープ情報はcur_fenvによって表すことができる。この場合、ステップ503は以下の式によって表すことができる。cur_fenv = pre_fenv×a1 + fenv×b1 Specifically, when it is determined that the “corr” value is smaller than c1 or larger than c2, the first envelope information corresponding to the frame of the current audio signal is the envelope of the frame of the previous audio signal before switching. Judged to be slightly affected by information. Therefore, the first envelope information of the current frame is calculated according to the designated first weight 1 and first weight 2. The product of the predicted envelope information and the first weight 2 is added to the product of the envelope information of the previous frame and the first weight 1, and the weighted sum is the first envelope information of the current frame. The narrowband audio signal transmitted thereafter is processed according to this method and weight. The first envelope information corresponding to the narrowband audio signal is restored until the audio signal having various bandwidths is switched again. For example, the first weight 1 in this embodiment can be represented by a1, the first weight 2 can be represented by b1, the envelope information of the previous frame can be represented by pre_fenv, and the predicted envelope information is fenv. The first envelope information can be represented by cur_fenv. In this case, step 503 can be expressed by the following equation. cur_fenv = pre_fenv × a1 + fenv × b1

ステップ504:相関係数が第2しきい値範囲内にある場合、指定の第2ウェイト1が第1ウェイト1より大きいか否かを判断する。第2ウェイト1は、切り替え前の以前の音声信号のフレームの高周波数帯域信号に対応する切り替え前のエンベロープ情報のウェイト値を指す。 Step 504: If the correlation coefficient is within the second threshold range, it is determined whether or not the designated second weight 1 is greater than the first weight 1. The second weight 1 indicates the weight value of the envelope information before switching corresponding to the high frequency band signal of the frame of the previous audio signal before switching.

具体的には、c1<corr<c2の場合、切り替え前のエンベロープ情報および以前のフレームのエンベロープ情報が現在のフレームの第1エンベロープ情報に及ぼす影響の度合いは、第2ウェイト1を第1ウェイト1と比較することによって取得できる。 Specifically, in the case of c1 <corr <c2, the degree of influence of the envelope information before switching and the envelope information of the previous frame on the first envelope information of the current frame is determined by changing the second weight 1 to the first weight 1. Can be obtained by comparing with.

ステップ505:第2ウェイト1が第1ウェイト1より大きくない場合、指定の第1ウェイト1および第1ウェイト2に従いウェイト付けして、第1エンベロープ情報を計算する。 Step 505: When the second weight 1 is not larger than the first weight 1, weighting is performed according to the designated first weight 1 and first weight 2, and the first envelope information is calculated.

具体的には、ステップ504で第2ウェイト1が第1ウェイト1より小さいと判断された場合、それは、現在の音声信号のフレームが切り替え前の以前の音声信号のフレームから少し遠いこと、および第1エンベロープ情報が切り替え前の以前のフレームのエンベロープ情報によりわずかに影響されていることを示している。したがって、現在のフレームの第1エンベロープ情報は、指定の第1ウェイト1および第1ウェイト2に従って計算できる。この場合、ステップ505は以下の式によって表すことができる。cur_fenv = pre_fenv×a1 + fenv×b1 Specifically, if it is determined in step 504 that the second weight 1 is smaller than the first weight 1, it means that the current audio signal frame is a little far from the previous audio signal frame before switching, and 1 indicates that the envelope information is slightly affected by the envelope information of the previous frame before switching. Therefore, the first envelope information of the current frame can be calculated according to the designated first weight 1 and first weight 2. In this case, step 505 can be expressed by the following equation. cur_fenv = pre_fenv × a1 + fenv × b1

ステップ506:第2ウェイト1が第1ウェイト1より大きい場合、第2ウェイト1および指定の第2ウェイト2に従いウェイト付けして、第1エンベロープ情報を計算する。第2ウェイト2は、予測済みエンベロープ情報のウェイト値を指す。例えば、第2ウェイト1はa2によって表すことができ、第2ウェイト2はb2によって表すことができる。 Step 506: When the second weight 1 is larger than the first weight 1, weighting is performed according to the second weight 1 and the designated second weight 2, and the first envelope information is calculated. The second weight 2 indicates the weight value of the predicted envelope information. For example, the second weight 1 can be represented by a2, and the second weight 2 can be represented by b2.

具体的には、ステップ504で第2ウェイト1が第1ウェイト1より大きいと判断された場合、それは、現在の音声信号のフレームが切り替え前の以前のフレームの第1周波数帯域音声信号により近いこと、および第1エンベロープ情報が切り替え前の以前の音声信号のフレームに対応する切り替え前のエンベロープ情報によって大きく影響されていることを示している。したがって、現在のフレームの第1エンベロープ情報は指定の第2ウェイト1および第2ウェイト2に従って計算される。この場合、予測済みエンベロープ情報と第2ウェイト2との積が、切り替え前のエンベロープ情報と第2ウェイト1との積に加えられ、ウェイト付け和は、現在のフレームの第1エンベロープ情報となる。切り替え前のエンベロープ情報は、con_fenvによって表すことができる。この場合、ステップ506は以下の式によって表すことができる。cur_fenv = con_fenv×a2 + fenv×b2 Specifically, if it is determined in step 504 that the second weight 1 is greater than the first weight 1, it means that the frame of the current audio signal is closer to the first frequency band audio signal of the previous frame before switching. , And that the first envelope information is greatly influenced by the envelope information before switching corresponding to the frame of the previous audio signal before switching. Accordingly, the first envelope information of the current frame is calculated according to the designated second weight 1 and second weight 2. In this case, the product of the predicted envelope information and the second weight 2 is added to the product of the envelope information before switching and the second weight 1, and the weighted sum becomes the first envelope information of the current frame. The envelope information before switching can be represented by con_fenv. In this case, step 506 can be represented by the following equation: cur_fenv = con_fenv × a2 + fenv × b2

ステップ507:第2ウェイトステップにより第2ウェイト1を縮小し、第2ウェイトステップにより第2ウェイト2を拡大する。 Step 507: The second weight 1 is reduced by the second weight step, and the second weight 2 is enlarged by the second weight step.

具体的には、音声信号が送信されると、切り替え前の音声信号がその後の音声信号のフレームに及ぼす影響は、徐々に減少する。第1エンベロープ情報をより正確に計算するため、第2ウェイト1および第2ウェイト2に対して適応的調整を実行する必要がある。切り替え前の音声信号がその後の音声信号のフレームに及ぼす影響は徐々に減少し、一方で現在の音声信号のフレームに近い以前の音声信号のフレームが及ぼす影響は徐々に大きくなる。したがって、第2ウェイト1の値は徐々に小さくなり、一方で第2ウェイト2の値は徐々に大きくなる。このようにして、切り替え前のエンベロープ情報が第1エンベロープ情報に及ぼす影響は弱まり、一方で予測済みエンベロープ情報が第1エンベロープ情報に及ぼす影響は強まる。ステップ507では、第2ウェイト1および第2ウェイト2を以下の式により修正できる。新第2ウェイト1=旧第2ウェイト1-第1ウェイトステップ;新第2ウェイト2=旧第2ウェイト2+第1ウェイトステップ、ここで第1ウェイトステップは指定の値である。 Specifically, when an audio signal is transmitted, the influence of the audio signal before switching on the frame of the subsequent audio signal gradually decreases. In order to calculate the first envelope information more accurately, it is necessary to perform adaptive adjustment on the second weight 1 and the second weight 2. The influence of the audio signal before switching on the frame of the subsequent audio signal gradually decreases, while the influence of the frame of the previous audio signal close to the frame of the current audio signal gradually increases. Therefore, the value of the second weight 1 is gradually decreased, while the value of the second weight 2 is gradually increased. In this way, the influence of the envelope information before switching on the first envelope information is weakened, while the influence of the predicted envelope information on the first envelope information is strengthened. In step 507, the second weight 1 and the second weight 2 can be corrected by the following equations. New second weight 1 = old second weight 1-first weight step; new second weight 2 = old second weight 2 + first weight step, where the first weight step is a specified value.

第1ウェイト1と第1ウェイト2との和は1に等しく、第2ウェイト1と第2ウェイト2との和は1に等しく、第2ウェイト1の初期値は第1ウェイト1の初期値より大きい。 The sum of the first weight 1 and the first weight 2 is equal to 1, the sum of the second weight 1 and the second weight 2 is equal to 1, and the initial value of the second weight 1 is greater than the initial value of the first weight 1. large.

ステップ303:第1エンベロープ情報および予測済み微細構造情報に従い、処理済み第1高周波数帯域信号を生成する。 Step 303: Generate a processed first high frequency band signal according to the first envelope information and the predicted fine structure information.

具体的には、ステップ302で現在のフレームの第1エンベロープ情報が取得された後、処理済み第1高周波数帯域信号を第1エンベロープ情報および予測済み微細構造情報に従って生成することができ、その結果、第2高周波数帯域信号を処理済み第1高周波数帯域信号に円滑に切り替えることができる。 Specifically, after the first envelope information of the current frame is acquired in step 302, the processed first high frequency band signal can be generated according to the first envelope information and the predicted fine structure information, and as a result The second high frequency band signal can be smoothly switched to the processed first high frequency band signal.

この実施形態における音声信号を切り替えるための方法を使用することによって、音声信号を広周波数帯域音声信号から狭周波数帯域音声信号に切り替える処理において、現在のフレームの処理済み第1高周波数帯域信号が、予測済み微細構造情報および第1エンベロープ情報に従って取得される。このようにして、切り替え前の広周波数帯域音声信号の第2高周波数帯域信号を、狭周波数帯域音声信号に対応する処理済み第1高周波数帯域信号に円滑に切り替えることができ、その結果、ユーザが受信する音声信号の品質を改善することができる。 In the process of switching the audio signal from the wide frequency band audio signal to the narrow frequency band audio signal by using the method for switching the audio signal in this embodiment, the processed first high frequency band signal of the current frame is: Obtained according to the predicted microstructure information and the first envelope information. In this way, the second high frequency band signal of the wide frequency band audio signal before switching can be smoothly switched to the processed first high frequency band signal corresponding to the narrow frequency band audio signal. Can improve the quality of the audio signal received.

前述の技術的解決策に基づき、図6に示すステップ202は以下のステップを含む。 Based on the foregoing technical solution, step 202 shown in FIG. 6 includes the following steps.

ステップ601:現在の音声信号のフレームおよび切り替え前の以前の音声信号のフレームに従い、処理済み第1高周波数帯域信号が減衰する必要があるか否かを判断する。 Step 601: It is determined whether the processed first high frequency band signal needs to be attenuated according to the frame of the current audio signal and the frame of the previous audio signal before switching.

具体的には、狭帯域音声信号の第1高周波数帯域信号はヌルである。広周波数帯域音声信号を狭周波数帯域音声信号に切り替える処理において、復元済み狭周波数帯域音声信号に対応する処理済み第1高周波数帯域信号による悪影響を防ぐため、処理済み第1高周波数帯域信号のエネルギーは、狭周波数帯域音声信号から拡大した広周波数帯域信号のフレーム数が所与のフレーム数に達した後に減衰係数が所与のしきい値に達するまで、フレーム単位で減衰させられる。現在の音声信号のフレームと切り替え前のフレームの音声信号との間隔は、現在の音声信号のフレームおよび切り替え前のフレームの音声信号に従って取得できる。例えば、狭周波数帯域音声信号のフレーム数は、カウンタを使用して記録でき、この場合のフレーム数は0以上の所定の値でありうる。 Specifically, the first high frequency band signal of the narrowband audio signal is null. In the process of switching a wide frequency band audio signal to a narrow frequency band audio signal, the energy of the processed first high frequency band signal to prevent the negative effects of the processed first high frequency band signal corresponding to the restored narrow frequency band audio signal Is attenuated frame by frame until the attenuation coefficient reaches a given threshold after the number of wide frequency band signals expanded from the narrow frequency band audio signal reaches a given number of frames. The interval between the frame of the current audio signal and the audio signal of the frame before switching can be acquired according to the frame of the current audio signal and the audio signal of the frame before switching. For example, the number of frames of the narrow frequency band audio signal can be recorded using a counter, and the number of frames in this case can be a predetermined value of 0 or more.

ステップ602:処理済み第1高周波数帯域信号が減衰する必要がない場合、処理済み第1高周波数帯域信号および第1低周波数帯域信号を広周波数帯域信号に合成する。 Step 602: If the processed first high frequency band signal does not need to be attenuated, combine the processed first high frequency band signal and the first low frequency band signal into a wide frequency band signal.

具体的には、ステップ601で処理済み第1高周波数帯域信号が減衰する必要がないと判断された場合、処理済み第1高周波数帯域信号および第1低周波数帯域信号は広周波数帯域信号に直接合成される。 Specifically, if it is determined in step 601 that the processed first high frequency band signal does not need to be attenuated, the processed first high frequency band signal and the first low frequency band signal are directly converted to the wide frequency band signal. Synthesized.

ステップ603:処理済み第1高周波数帯域信号が減衰する必要がある場合、処理済み第1高周波数帯域信号に対応する減衰係数がしきい値より大きいか否かを判断する。 Step 603: If the processed first high frequency band signal needs to be attenuated, it is determined whether or not the attenuation coefficient corresponding to the processed first high frequency band signal is greater than a threshold value.

具体的には、減衰係数の初期値が1であり、しきい値が0以上1未満である。ステップ601で処理済み第1高周波数帯域信号が減衰する必要があると判断された場合、ステップ603において、処理済み第1高周波数帯域信号に対応する減衰係数が所与のしきい値より大きいか否かが判断される。 Specifically, the initial value of the attenuation coefficient is 1, and the threshold value is 0 or more and less than 1. If it is determined in step 601 that the processed first high frequency band signal needs to be attenuated, in step 603, is the attenuation coefficient corresponding to the processed first high frequency band signal greater than a given threshold? It is determined whether or not.

ステップ604:減衰係数が所与のしきい値より大きくない場合、処理済み第1高周波数帯域信号としきい値とを掛け、積および第1低周波数帯域信号を広周波数帯域信号に合成する。 Step 604: If the attenuation factor is not greater than the given threshold, multiply the processed first high frequency band signal by the threshold to synthesize the product and the first low frequency band signal into a wide frequency band signal.

具体的には、ステップ603で減衰係数が所与のしきい値より大きくないと判断された場合、それは、処理済み第1高周波数帯域信号のエネルギーが既にある程度減衰していること、および処理済み第1高周波数帯域信号が悪影響をもたらさないかもしれないことを示している。この場合、この減衰率が維持されうる。次いで、処理済み第1高周波数帯域信号にしきい値を掛け、次に、積および第1低周波数帯域信号が広周波数帯域信号に合成される。 Specifically, if step 603 determines that the attenuation factor is not greater than a given threshold, it means that the energy of the processed first high frequency band signal has already been attenuated to some extent and has been processed It shows that the first high frequency band signal may not have an adverse effect. In this case, this attenuation rate can be maintained. The processed first high frequency band signal is then multiplied by a threshold, and then the product and the first low frequency band signal are combined into a wide frequency band signal.

ステップ605:減衰係数が所与のしきい値より大きい場合、処理済み第1高周波数帯域信号と減衰係数とを掛け、積および第1低周波数帯域信号を広周波数帯域信号に合成する。 Step 605: If the attenuation factor is greater than a given threshold, multiply the processed first high frequency band signal by the attenuation factor to synthesize the product and the first low frequency band signal into a wide frequency band signal.

具体的には、ステップ603で減衰係数が所与のしきい値より大きい場合、それは、処理済み第1高周波数帯域信号が減衰係数において聞き心地の悪さをもたらす可能性があり、所与のしきい値に達するまでさらに減衰する必要があることを示している。次いで、処理済み第1高周波数帯域信号に減衰係数を掛け、次に、積および第1低周波数帯域信号が広周波数帯域信号に合成される。 Specifically, if the attenuation factor is greater than a given threshold in step 603, it is possible that the processed first high frequency band signal may result in uncomfortable listening at the attenuation factor. This indicates that further attenuation is required until the threshold is reached. The processed first high frequency band signal is then multiplied by an attenuation factor, and then the product and the first low frequency band signal are combined into a wide frequency band signal.

ステップ606:減衰係数を修正して減衰係数を縮小する。 Step 606: Modify the attenuation coefficient to reduce the attenuation coefficient.

具体的には、音声信号が送信されると、切り替え前の音声信号がその後の狭帯域音声信号に及ぼす影響は徐々に小さくなり、減衰係数も徐々に小さくなる。 Specifically, when an audio signal is transmitted, the influence of the audio signal before switching on the subsequent narrowband audio signal is gradually reduced, and the attenuation coefficient is also gradually reduced.

随意に、前述の技術的解決策に基づき、狭周波数帯域音声信号から広周波数帯域音声信号への切り替えが生じたとき、ステップ201により処理済み第1高周波数帯域信号を取得する実施形態は、図7に示すように以下のステップを含む。 Optionally, based on the above technical solution, the embodiment of obtaining the processed first high frequency band signal by step 201 when switching from the narrow frequency band audio signal to the wide frequency band audio signal is illustrated in FIG. The following steps are included as shown in FIG.

ステップ701:指定の第4ウェイト1および第4ウェイト2に従いウェイト付けして、処理済み第1高周波数帯域信号を計算する。第4ウェイト1は、第2高周波数帯域信号のウェイト値を指し、第4ウェイト2は、現在の音声信号のフレームの第1高周波数帯域信号のウェイト値を指す。 Step 701: Weighting is performed according to designated fourth weight 1 and fourth weight 2, and a processed first high frequency band signal is calculated. The fourth weight 1 indicates the weight value of the second high frequency band signal, and the fourth weight 2 indicates the weight value of the first high frequency band signal of the frame of the current audio signal.

具体的には、狭周波数帯域音声信号を広周波数帯域音声信号に切り替える処理において、広周波数帯域音声信号の高周波数帯域信号はヌルではなく、狭周波数帯域音声信号に対応する高周波数帯域信号がヌルであるため、広周波数帯域音声信号の高周波数帯域信号のエネルギーを減衰させることで、狭周波数帯域音声信号を広周波数帯域音声信号に円滑に切り替えることができるようにする必要がある。第2高周波数帯域信号と第4ウェイト1との積が、第1高周波数帯域信号と第4ウェイト2との積に加えられ、ウェイト付け値は処理済み第1高周波数帯域信号となる。 Specifically, in the process of switching the narrow frequency band audio signal to the wide frequency band audio signal, the high frequency band signal of the wide frequency band audio signal is not null, and the high frequency band signal corresponding to the narrow frequency band audio signal is null. Therefore, it is necessary to make it possible to smoothly switch the narrow frequency band audio signal to the wide frequency band audio signal by attenuating the energy of the high frequency band signal of the wide frequency band audio signal. The product of the second high frequency band signal and the fourth weight 1 is added to the product of the first high frequency band signal and the fourth weight 2, and the weighted value becomes the processed first high frequency band signal.

ステップ702:第3ウェイトステップにより第4ウェイト1を縮小し、第4ウェイト1が0に等しくなるまで第3ウェイトステップにより第4ウェイト2を拡大する。第4ウェイト1と第4ウェイト2との和は1に等しくなる。 Step 702: The fourth weight 1 is reduced by the third weight step, and the fourth weight 2 is enlarged by the third weight step until the fourth weight 1 becomes equal to 0. The sum of the fourth weight 1 and the fourth weight 2 is equal to 1.

具体的には、音声信号が送信されると、切り替え前の狭周波数帯域音声信号がその後の広周波数帯域音声信号に及ぼす影響は、徐々に小さくなる。したがって、第4ウェイト1は徐々に小さくなり、一方で第4ウェイト1が0に等しくなって第4ウェイト2が1に等しくなるまで、第4ウェイト2は徐々に大きくなる。すなわち、送信される音声信号は常に広周波数帯域音声信号となる。 Specifically, when an audio signal is transmitted, the influence of the narrow frequency band audio signal before switching on the subsequent wide frequency band audio signal is gradually reduced. Accordingly, the fourth weight 1 gradually decreases, while the fourth weight 2 gradually increases until the fourth weight 1 is equal to 0 and the fourth weight 2 is equal to 1. That is, the transmitted audio signal is always a wide frequency band audio signal.

同様に、図8に示すように、ステップ201による処理済み第1高周波数帯域信号を取得する別の実施形態は、以下のステップをさらに含むことができる。 Similarly, as shown in FIG. 8, another embodiment for obtaining the processed first high frequency band signal according to step 201 may further include the following steps.

ステップ801:指定の第5ウェイト1および第5ウェイト2に従いウェイト付けして、処理済み第1高周波数帯域信号を計算する。第5ウェイト1は指定の固定パラメータのウェイト値であり、第5ウェイト2は現在の音声信号のフレームの第1高周波数帯域信号のウェイト値である。 Step 801: Weighting is performed according to designated fifth weight 1 and fifth weight 2, and a processed first high frequency band signal is calculated. The fifth weight 1 is the weight value of the designated fixed parameter, and the fifth weight 2 is the weight value of the first high frequency band signal of the current audio signal frame.

具体的には、狭周波数帯域音声信号の第1高周波数帯域信号はヌルであるため、固定パラメータは狭周波数帯域音声信号の高周波数帯域信号を交換するように設定でき、ここで固定パラメータは0以上で、第1高周波数帯域信号のエネルギーより小さい定数である。固定パラメータと第5ウェイト1との積が、第1高周波数帯域信号と第5ウェイト2との積に加えられ、ウェイト付け値は、処理済み第1高周波数帯域信号となる。 Specifically, since the first high frequency band signal of the narrow frequency band audio signal is null, the fixed parameter can be set to exchange the high frequency band signal of the narrow frequency band audio signal, where the fixed parameter is 0 Thus, the constant is smaller than the energy of the first high frequency band signal. The product of the fixed parameter and the fifth weight 1 is added to the product of the first high frequency band signal and the fifth weight 2, and the weighted value becomes the processed first high frequency band signal.

ステップ802:第4ウェイトステップにより第5ウェイト1を縮小し、第5ウェイト1が0に等しくなるまで第4ウェイトステップにより第5ウェイト2を拡大する。第5ウェイト1と第5ウェイト2との和は1に等しくなる。 Step 802: The fifth weight 1 is reduced by the fourth weight step, and the fifth weight 2 is enlarged by the fourth weight step until the fifth weight 1 becomes equal to 0. The sum of the fifth weight 1 and the fifth weight 2 is equal to 1.

具体的には、音声信号が送信されると、切り替え前の狭周波数帯域音声信号がその後の広周波数帯域音声信号に及ぼす影響は、徐々に小さくなる。したがって、第5ウェイト1は徐々に小さくなり、一方で第5ウェイト1が0に等しくなって第5ウェイト2が1に等しくなるまで、第5ウェイト2は徐々に大きくなる。すなわち、送信される音声信号は常に真の広周波数帯域音声信号となる。 Specifically, when an audio signal is transmitted, the influence of the narrow frequency band audio signal before switching on the subsequent wide frequency band audio signal is gradually reduced. Accordingly, the fifth weight 1 gradually decreases, while the fifth weight 2 gradually increases until the fifth weight 1 is equal to 0 and the fifth weight 2 is equal to 1. That is, the transmitted audio signal is always a true wide frequency band audio signal.

この実施形態における音声信号を切り替えるための方法を使用することによって、音声信号を狭周波数帯域音声信号から広周波数帯域音声信号に切り替える処理において、広周波数帯域音声信号の高周波数帯域信号を減衰させて、処理済み高周波数帯域信号を取得する。このようにして、切り替え前の狭周波数帯域音声信号に対応する高周波数帯域信号を、広周波数帯域音声信号に対応する処理済み高周波数帯域信号に円滑に切り替えることができ、ひいてはユーザが受信する音声信号の品質改善に寄与することができる。 By using the method for switching audio signals in this embodiment, in the process of switching the audio signal from the narrow frequency band audio signal to the wide frequency band audio signal, the high frequency band signal of the wide frequency band audio signal is attenuated. Obtain a processed high frequency band signal. In this way, the high frequency band signal corresponding to the narrow frequency band audio signal before switching can be smoothly switched to the processed high frequency band signal corresponding to the wide frequency band audio signal, and thus the audio received by the user. This can contribute to signal quality improvement.

この実施形態において、エンベロープ情報を、高周波数帯域信号を表すことができる他のパラメータ、例えば線形予測符号化(LPC)パラメータまたは振幅パラメータによって交換することもできる。 In this embodiment, the envelope information can also be exchanged by other parameters that can represent high frequency band signals, such as linear predictive coding (LPC) parameters or amplitude parameters.

当業者は、本発明の実施形態による方法のステップの全部または一部を、関連ハードウェアに命令するプログラムによって実施できることを理解しうる。プログラムはコンピュータ可読記憶媒体に記憶できる。プログラムが作動するとき、本発明の実施形態による方法のステップが実行される。記憶媒体は読み取り専用メモリ(ROM)、ランダムアクセスメモリ(RAM)、磁気ディスクまたはコンパクトディスク読み取り専用メモリ(CD-ROM)でありうる。 One skilled in the art can appreciate that all or part of the steps of the method according to embodiments of the present invention can be implemented by a program that instructs the associated hardware. The program can be stored in a computer readable storage medium. When the program runs, the steps of the method according to embodiments of the invention are performed. The storage medium may be read only memory (ROM), random access memory (RAM), magnetic disk or compact disk read only memory (CD-ROM).

図9は、音声信号を切り替えるための装置の第1実施形態の構造を示している。図9に示すように、音声信号を切り替えるための装置は、処理モジュール91および第1合成モジュール92を含む。 FIG. 9 shows the structure of the first embodiment of an apparatus for switching audio signals. As shown in FIG. 9, the apparatus for switching audio signals includes a processing module 91 and a first synthesis module 92.

処理モジュール91は、音声の切り替えが生じたとき、現在の音声信号のフレームの第1高周波数帯域信号および以前のM個の音声信号のフレームの第2高周波数帯域信号をウェイト付けして、処理済み第1高周波数帯域信号を取得するように適合される。Mは1以上である。 The processing module 91 weights and processes the first high frequency band signal of the current audio signal frame and the second high frequency band signal of the previous M audio signal frames when the audio switch occurs. Adapted to obtain a finished first high frequency band signal. M is 1 or more.

第1合成モジュール92は、処理済み第1高周波数帯域信号および現在の音声信号のフレームの第1低周波数帯域信号を広周波数帯域信号に合成するように適合される。 The first synthesis module 92 is adapted to synthesize the processed first high frequency band signal and the first low frequency band signal of the frame of the current audio signal into a wide frequency band signal.

この実施形態における音声信号を切り替えるための装置において、処理モジュールが、現在の音声信号のフレームの第1高周波数帯域信号を、以前のM個の音声信号のフレームの第2高周波数帯域信号に従って処理し、その結果、第2高周波数帯域信号を処理済み第1高周波数帯域信号に円滑に切り替えることができる。このようにして、様々な帯域幅を有する音声信号間の切り替えの処理中に、こうした音声信号の高周波数帯域信号を円滑に切り替えることができる。最後に、第1合成モジュールは、処理済み第1高周波数帯域信号および第1低周波数帯域信号を広周波数帯域信号に合成し、広周波数帯域信号はユーザ端末に送信され、その結果、ユーザは高品質の音声信号を享受することができる。この実施形態における音声信号を切り替えるための方法を使用することによって、様々な帯域幅を有する音声信号を円滑に切り替えることができ、それによりエネルギーの急変が音声信号の主観的音声品質に及ぼす影響を減らし、ユーザが受信する音声信号の品質を改善することができる。 In the apparatus for switching audio signals in this embodiment, a processing module processes a first high frequency band signal of a frame of a current audio signal according to a second high frequency band signal of a frame of previous M audio signals. As a result, the second high frequency band signal can be smoothly switched to the processed first high frequency band signal. In this way, during the process of switching between audio signals having various bandwidths, the high frequency band signals of such audio signals can be switched smoothly. Finally, the first combining module combines the processed first high frequency band signal and the first low frequency band signal into a wide frequency band signal, and the wide frequency band signal is transmitted to the user terminal, so that the user can Quality audio signals can be enjoyed. By using the method for switching audio signals in this embodiment, it is possible to smoothly switch audio signals having various bandwidths, and thereby influence the sudden change of energy on the subjective audio quality of the audio signal. The quality of the audio signal received by the user can be improved.

図10は、音声信号を切り替えるための装置の第2実施形態の構造を示している。図10に示すように、この実施形態における音声信号を切り替えるための装置は、第1実施形態に基づき、第2合成モジュール103をさらに含む。 FIG. 10 shows the structure of a second embodiment of the device for switching audio signals. As shown in FIG. 10, the apparatus for switching audio signals in this embodiment further includes a second synthesis module 103 based on the first embodiment.

第2合成モジュール103は、音声信号の切り替えが生じていないときに第1高周波数帯域信号および第1低周波数帯域信号を広周波数帯域信号に合成するように適合される。 The second synthesis module 103 is adapted to synthesize the first high frequency band signal and the first low frequency band signal into a wide frequency band signal when no switching of the audio signal occurs.

この実施形態における音声信号を切り替えるための装置において、第2合成モジュールは、様々な帯域幅を有する音声信号間の切り替えが生じたとき、現在のフレームの第1周波数帯域音声信号の第1低周波数帯域信号および第1高周波数帯域信号を広周波数帯域信号に合成するように設定される。このようにして、ユーザが受信する音声信号の品質が改善される。 In the apparatus for switching audio signals in this embodiment, the second synthesis module is configured to switch the first low frequency of the first frequency band audio signal of the current frame when switching between audio signals having various bandwidths occurs. The band signal and the first high frequency band signal are set to be combined with the wide frequency band signal. In this way, the quality of the audio signal received by the user is improved.

上述の技術的解決策によれば、随意に、図10および図11に示すように、広周波数帯域音声信号から狭周波数帯域音声信号への切り替えが生じたとき、処理モジュール101は以下のモジュールを含む。
第1高周波数帯域信号に対応する微細構造情報およびエンベロープ情報を予測するように適合された予測モジュール1011、
予測済みエンベロープ情報および以前のM個の音声信号のフレームの第2高周波数帯域信号に対応する以前のM個のフレームのエンベロープ情報をウェイト付けして、第1高周波数帯域信号に対応する第1エンベロープ情報を取得するように適合された第1生成モジュール1012、および
第1エンベロープ情報および予測済み微細構造情報に従い、処理済み第1高周波数帯域信号を生成するように適合された第2生成モジュール1013。 According to the above technical solutions, optionally, as shown in FIGS. 10 and 11, when switching from a wide frequency band audio signal to a narrow frequency band audio signal occurs, the processing module 101 Including.
A prediction module 1011 adapted to predict fine structure information and envelope information corresponding to the first high frequency band signal;
The first corresponding to the first high frequency band signal by weighting the envelope information of the previous M frames corresponding to the predicted high envelope signal and the second high frequency band signal of the frame of the previous M audio signals. A first generation module 1012 adapted to obtain envelope information, and a second generation module 1013 adapted to generate a processed first high frequency band signal according to the first envelope information and the predicted microstructure information .

さらに、この実施形態における音声信号を切り替えるための装置は、現在の音声信号のフレームの第1低周波数帯域信号を分類するように適合された分類モジュール1010を含むことができる。予測モジュール1011は、現在の音声信号のフレームの第1低周波数帯域信号に対応する微細構造情報およびエンベロープ情報を予測するようにさらに適合される。 Furthermore, the apparatus for switching audio signals in this embodiment can include a classification module 1010 adapted to classify the first low frequency band signal of the frame of the current audio signal. The prediction module 1011 is further adapted to predict fine structure information and envelope information corresponding to the first low frequency band signal of the frame of the current speech signal.

この実施形態における音声信号を切り替えるための装置において、予測モジュールは、第1高周波数帯域信号に対応する微細構造情報およびエンベロープ情報を予測し、その結果、処理済み第1高周波数帯域信号は第1生成モジュールおよび第2生成モジュールによって正確に生成できる。このようにして、第1高周波数帯域信号を処理済み第1高周波数帯域信号に円滑に切り替えることができ、それによりユーザが受信する音声信号の品質を改善することができる。さらに、分類モジュールは現在の音声信号のフレームの第1低周波数帯域信号を分類し、予測モジュールは信号タイプに従い予測済み微細構造情報および予測済みエンベロープ情報を取得する。このようにして、予測済み微細構造情報および予測済みエンベロープ情報がより正確となることで、ユーザが受信する音声信号の品質を改善する。 In the apparatus for switching audio signals in this embodiment, the prediction module predicts the fine structure information and envelope information corresponding to the first high frequency band signal, and as a result, the processed first high frequency band signal is the first It can be generated accurately by the generation module and the second generation module. In this way, the first high frequency band signal can be smoothly switched to the processed first high frequency band signal, thereby improving the quality of the audio signal received by the user. Further, the classification module classifies the first low frequency band signal of the frame of the current speech signal, and the prediction module obtains predicted fine structure information and predicted envelope information according to the signal type. In this way, the predicted fine structure information and the predicted envelope information become more accurate, thereby improving the quality of the audio signal received by the user.

前述の技術的解決策に基づき、随意に、図10および図12に示すように、第1合成モジュール102は以下のモジュールを含む。
現在の音声信号のフレームおよび切り替え前の以前の音声信号のフレームに従い、処理済み第1高周波数帯域信号が減衰する必要があるか否かを判断するように適合された第1判断モジュール1021、
処理済み第1高周波数帯域信号が減衰する必要がないと第1判断モジュール1021が判断したときに、処理済み第1高周波数帯域信号および第1低周波数帯域信号を広周波数帯域信号に合成するように適合された第3合成モジュール1022、
処理済み第1高周波数帯域信号が減衰する必要があると第1判断モジュール1021が判断したときに、処理済み第1高周波数帯域信号に対応する減衰係数が所与のしきい値より大きいか否かを判断するように適合された第2判断モジュール1023、
減衰係数が所与のしきい値より大きくないと第2判断モジュール1023が判断した場合、処理済み第1高周波数帯域信号としきい値とを掛け、積および第1低周波数帯域信号を広周波数帯域信号に合成するように適合された第4合成モジュール1024、
減衰係数が所与のしきい値より大きいと第2判断モジュール1023が判断した場合、処理済み第1高周波数帯域信号と減衰係数とを掛け、積および第1低周波数帯域信号を広周波数帯域信号に合成するように適合された第5合成モジュール1025、および
減衰係数を修正して減衰係数を縮小するように適合された第1修正モジュール1026。 Based on the foregoing technical solutions, and optionally, as shown in FIGS. 10 and 12, the first synthesis module 102 includes the following modules:
A first determination module 1021 adapted to determine whether the processed first high frequency band signal needs to be attenuated according to the frame of the current audio signal and the frame of the previous audio signal before switching,
Synthesize the processed first high frequency band signal and the first low frequency band signal into a wide frequency band signal when the first determination module 1021 determines that the processed first high frequency band signal does not need to be attenuated 3rd synthesis module 1022, adapted to
Whether the attenuation factor corresponding to the processed first high frequency band signal is greater than a given threshold when the first determination module 1021 determines that the processed first high frequency band signal needs to be attenuated A second decision module 1023, adapted to determine whether
If the second decision module 1023 determines that the attenuation factor is not greater than a given threshold, the processed first high frequency band signal is multiplied by the threshold to multiply the product and the first low frequency band signal to the wide frequency band A fourth synthesis module 1024, adapted to synthesize to the signal
If the second decision module 1023 determines that the attenuation factor is greater than a given threshold, the processed first high frequency band signal is multiplied by the attenuation factor to multiply the product and first low frequency band signal to the wide frequency band signal. A fifth synthesis module 1025 adapted to synthesize, and a first modification module 1026 adapted to modify the attenuation coefficient to reduce the attenuation coefficient.

減衰係数の初期値は1であり、しきい値は0以上1未満である。 The initial value of the attenuation coefficient is 1, and the threshold value is 0 or more and less than 1.

音声信号を切り替えるための装置を使用することによって、処理済み第1高周波数帯域信号が減衰し、その結果、現在の音声信号のフレームを処理することによって得られる広周波数帯域信号がより正確となり、ひいてはユーザが受信する音声信号の品質を改善する。 By using a device for switching audio signals, the processed first high frequency band signal is attenuated, resulting in a more accurate wide frequency band signal obtained by processing the frame of the current audio signal, As a result, the quality of the audio signal received by the user is improved.

上述の技術的解決策によれば、随意に、図10および図13aに示すように、狭周波数帯域音声信号から広周波数帯域音声信号への切り替えが生じたとき、この実施形態における処理モジュール101は以下のモジュールを含む。
指定の第4ウェイト1および第4ウェイト2に従いウェイト付けして処理済み第1高周波数帯域信号を計算するように適合され、第4ウェイト1は第2高周波数帯域信号のウェイト値を指し、第4ウェイト2は第1高周波数帯域信号のウェイト値を指す、第1計算モジュール1011a、および
第3ウェイトステップにより第4ウェイト1を縮小し、第4ウェイト1が0に等しくなるまで第3ウェイトステップにより第4ウェイト2を拡大するように適合され、第4ウェイト1と第4ウェイト2との和は1に等しくなる、第2修正モジュール1012a。 According to the above technical solution, optionally, when switching from a narrow frequency band audio signal to a wide frequency band audio signal occurs, as shown in FIGS. 10 and 13a, the processing module 101 in this embodiment Includes the following modules:
Adapted to calculate the processed first high frequency band signal weighted according to the specified fourth weight 1 and fourth weight 2, where the fourth weight 1 refers to the weight value of the second high frequency band signal, 4 weight 2 indicates the weight value of the first high frequency band signal, the 4th weight 1 is reduced by the 1st calculation module 1011a and the 3rd weight step, and the 3rd weight step until the 4th weight 1 becomes equal to 0 The second correction module 1012a is adapted to enlarge the fourth weight 2 by the sum, and the sum of the fourth weight 1 and the fourth weight 2 is equal to 1.

同様に、図10および図13bに示すように、狭周波数帯域音声信号から広周波数帯域音声信号への切り替えが生じたとき、この実施形態における処理モジュール101は以下のモジュールをさらに含むことができる。
指定の第5ウェイト1および第5ウェイト2に従いウェイト付けして処理済み第1高周波数帯域信号を計算するように適合され、第5ウェイト1は指定の固定パラメータのウェイト値を指し、第5ウェイト2は第1高周波数帯域信号のウェイト値を指す、第2計算モジュール1011b、および
第4ウェイトステップにより第5ウェイト1を縮小し、第5ウェイト1が0に等しくなるまで第4ウェイトステップにより第5ウェイト2を拡大するように適合され、第5ウェイト1と第5ウェイト2との和は1に等しくなり、固定パラメータは0以上であり、第1高周波数帯域信号のエネルギー値より小さい定数である、第3修正モジュール1012b。 Similarly, as shown in FIGS. 10 and 13b, when switching from a narrow frequency band audio signal to a wide frequency band audio signal occurs, the processing module 101 in this embodiment may further include the following modules.
Adapted to calculate the processed first high frequency band signal weighted according to the specified fifth weight 1 and fifth weight 2, where the fifth weight 1 refers to the weight value of the specified fixed parameter and the fifth weight 2 indicates the weight value of the first high frequency band signal, the second calculation module 1011b, and the fifth weight step 1 is reduced by the fourth weight step, and the fifth weight step 1 is continued until the fifth weight 1 becomes equal to 0. It is adapted to expand 5 weight 2, the sum of 5th weight 1 and 5th weight 2 is equal to 1, the fixed parameter is 0 or more, and it is a constant smaller than the energy value of the 1st high frequency band signal There is a third modification module 1012b.

この実施形態における音声信号を切り替えるための装置を使用することによって、音声信号を狭周波数帯域音声信号から広周波数帯域音声信号に切り替える処理において、広周波数帯域音声信号の高周波数帯域信号を減衰させて、処理済み高周波数帯域信号を取得する。このようにして、切り替え前の狭周波数帯域音声信号に対応する高周波数帯域信号を、広周波数帯域音声信号に対応する処理済み高周波数帯域信号に円滑に切り替えることができ、ひいてはユーザが受信する音声信号の品質改善に寄与することができる。 By using the apparatus for switching the audio signal in this embodiment, in the process of switching the audio signal from the narrow frequency band audio signal to the wide frequency band audio signal, the high frequency band signal of the wide frequency band audio signal is attenuated. Obtain a processed high frequency band signal. In this way, the high frequency band signal corresponding to the narrow frequency band audio signal before switching can be smoothly switched to the processed high frequency band signal corresponding to the wide frequency band audio signal, and thus the audio received by the user. This can contribute to signal quality improvement.

上記の実施形態は本発明の技術的解決策を説明するために提示されているにすぎず、本発明を限定する意図はないことに留意されたい。上記の実施形態を参照して本発明について詳述してきたが、当業者は本発明の趣旨および範囲から離れることなく本発明に様々な修正および変更を加えることができることは明らかである。本発明は、以下の特許請求の範囲またはその等価物に定める保護の範囲内にある限り、修正および変更を網羅するものである。 It should be noted that the above embodiments have been presented only to illustrate the technical solutions of the present invention and are not intended to limit the present invention. While the invention has been described in detail with reference to the above embodiments, it will be apparent to those skilled in the art that various modifications and changes can be made to the invention without departing from the spirit and scope of the invention. The present invention is intended to cover modifications and variations as long as they are within the scope of protection as defined in the following claims or their equivalents.

91 処理モジュール
92 第1合成モジュール
103 第2合成モジュール
101 処理モジュール
102 第1合成モジュール
1010 分類モジュール
1011 予測モジュール
1012 第1生成モジュール
1013 第2生成モジュール
1021 第1判断モジュール
1022 第3合成モジュール
1023 第2判断モジュール
1024 第4判断モジュール
1025 第5合成モジュール
1026 第1修正モジュール
1011a 第1計算モジュール
1012a 第2修正モジュール
1011b 第2計算モジュール
1012b 第3修正モジュール 91 Processing module
92 1st synthesis module
103 2nd synthesis module
101 processing module
102 1st synthesis module
1010 Classification module
1011 prediction module
1012 First generation module
1013 Second generation module
1021 First decision module
1022 3rd synthesis module
1023 Second decision module
1024 Fourth judgment module
1025 5th synthesis module
1026 First modification module
1011a First calculation module
1012a Second modification module
1011b Second calculation module
1012b Third modification module

Claims

A method for switching audio signals,
When the switching from the wide frequency band audio signal to the narrow frequency band audio signal occurs, the first high frequency band signal of the frame of the audio signal immediately after the switching of the audio signal and the second of the frame of the previous audio signal Weighting the high frequency band signal to obtain a processed first high frequency band signal, wherein the second high frequency band signal of the previous audio signal frame is included in envelope information of the previous frame; Corresponding steps,
Synthesizing a first low frequency band signal of a frame of an audio signal immediately after the switching between the processed first high frequency band signal and the audio signal into a wide frequency band signal, and outputting the wide frequency band signal Comprising the steps of:

Weighting the first high frequency band signal of the frame of the audio signal immediately after the switching of the audio signal and the second high frequency band signal of the frame of the previous audio signal to weight the processed first high The step of obtaining a frequency band signal comprises:
Predicting fine structure information and envelope information corresponding to the first high frequency band signal of the frame of the audio signal immediately after the switching of the audio signal occurs ;
Weighting the predicted envelope information and the envelope information of the previous frame corresponding to the second high frequency band signal of the frame of the previous audio signal to weight the first envelope corresponding to the first high frequency band signal Obtaining information,
And generating the processed first high frequency band signal according to the first envelope information and the predicted microstructure information.

Predicting the fine structure information and envelope information corresponding to the first high frequency band signal of a frame of the audio signal immediately after the switching of the audio signal occurs ,
Classifying the first low frequency band signal of a frame of the audio signal immediately after the switching of the audio signal occurs ;
And predicting the fine structure information and envelope information according to a signal type of the first low frequency band signal.

Weighting the predicted envelope information and the envelope information of the previous frame corresponding to the second high frequency band signal of the frame of the previous audio signal, and weighting the first information corresponding to the first high frequency band signal The step of obtaining envelope information comprises:
Correlation coefficient between the first low frequency band signal of the frame immediately after the switching of the audio signal and the low frequency band signal of the frame of the previous audio signal, the switching of the audio signal has occurred Calculating according to the same frequency band of the first low frequency band signal of the immediately following frame and the low frequency band signal of the frame of the previous speech signal;
Determining whether the correlation coefficient is within a given second threshold range;
When the correlation coefficient is not within the second threshold range, the predicted envelope information and the previous frame envelope information are weighted according to the designated first weight 1 and designated first weight 2. Calculating the first envelope information, wherein the first weight 1 refers to the weight value of the envelope information of the previous frame, the first weight 2 refers to the weight value of the predicted envelope information, The first weight 1 and the first weight 2 are constants, and the sum of the first weight 1 and the first weight 2 is equal to 1, a step;
When the correlation coefficient is within the second threshold range, the previous frame envelope information and the predicted envelope information are weighted according to the comparison result between the second weight 2 and the designated first weight 2. In the step of calculating the first envelope information, the second weight 2 indicates a weight value of the predicted envelope information, and an initial value of the second weight 2 is a value of the designated first weight 2 A step not greater than said value;
4. The method according to claim 2, further comprising a step of enlarging the second weight 2 by a second weight step, wherein the second weight step has a specified value.

According to the comparison result between the second weight 2 and the designated first weight 2, the step of calculating the first envelope information by weighting the envelope information of the previous frame and the predicted envelope information,
When the second weight 2 is larger than the first weight 2, the first frame 1 is weighted with the envelope information of the previous frame and the predicted envelope information according to the specified first weight 1 and the first weight 2. Calculating envelope information;
If the second weight 2 is not larger than the first weight 2 , the first envelope is weighted according to the second weight 1 and the second weight 2, and the previous envelope information and the predicted envelope information are weighted. Calculating the information, wherein the second weight 1 indicates the weight value of the envelope information of the previous frame, and the sum of the second weight 1 and the second weight 2 is 1. The method of claim 4, which is equal.

The step of synthesizing the first low frequency band signal of the frame of the audio signal immediately after the switching of the processed first high frequency band signal and the audio signal into the wide frequency band signal;
Determining whether the processed first high frequency band signal needs to be attenuated according to a frame of the audio signal immediately after the switching of the audio signal and a frame of the previous audio signal before the switching. When,
If attenuation is not required, combining the processed first high frequency band signal and the first low frequency band signal into the wide frequency band signal;
If attenuation is required, determining whether an attenuation coefficient corresponding to the first high frequency band signal is greater than a given threshold;
If the attenuation factor is not greater than the given threshold, the processed first high frequency band signal is multiplied by the threshold, and the processed first high frequency band signal and the threshold Combining the product and the first low frequency band signal into the wide frequency band signal;
When the attenuation factor is greater than the given threshold, the processed first high frequency band signal is multiplied by the attenuation factor, the product of the processed first high frequency band signal and the attenuation factor, and the Synthesizing a first low frequency band signal into the wide frequency band signal;
Modifying the attenuation coefficient to reduce the attenuation coefficient;
6. The method according to claim 1, wherein an initial value of the attenuation coefficient is 1, and the threshold value is 0 or more and less than 1.

7. The method according to claim 1, further comprising a step of synthesizing the first high frequency band signal and the first low frequency band signal into the wide frequency band signal when switching of the audio signal has not occurred. The method according to item.

An apparatus for switching audio signals,
When the switching from the wide frequency band audio signal to the narrow frequency band audio signal occurs, the first high frequency band signal of the frame of the audio signal immediately after the switching of the audio signal and the second of the frame of the previous audio signal A processing module adapted to weight a high frequency band signal to obtain a processed first high frequency band signal, wherein the second high frequency band signal of the frame of the previous audio signal is the previous A processing module corresponding to the envelope information of the frame,
Combining the processed first high frequency band signal and the first low frequency band signal of the frame of the audio signal immediately after the switching of the audio signal into a wide frequency band signal, and outputting the wide frequency band signal And a first synthesis module adapted to the above.

A prediction module adapted to predict fine structure information and envelope information corresponding to the first high frequency band signal of a frame of the audio signal immediately after the switching of the audio signal occurs, the processing module;
Weighting the predicted envelope information and the envelope information of the previous frame corresponding to the second high frequency band signal of the frame of the previous audio signal to weight the first envelope corresponding to the first high frequency band signal A first generation module adapted to obtain information;
9. The apparatus of claim 8, comprising: a second generation module adapted to generate the processed first high frequency band signal according to the first envelope information and the predicted microstructure information.

A classification module adapted to classify the first low frequency band signal of a frame of the audio signal immediately after the switching of the audio signal occurs ;
The prediction module is further adapted to predict the microstructure information and the envelope information according to a signal type of the first low frequency band signal;
The apparatus according to claim 9.

The first synthesis module is
It is determined whether or not the processed first high frequency band signal needs to be attenuated according to the frame of the audio signal immediately after the switching of the audio signal and the frame of the previous audio signal before the switching. A first determination module adapted to
When the first determination module determines that the processed first high frequency band signal does not need to be attenuated, the processed first high frequency band signal and the first low frequency band signal are converted to the wide frequency band signal. A third synthesis module adapted to synthesize,
When the first determination module determines that the processed first high frequency band signal needs to be attenuated, an attenuation coefficient corresponding to the processed first high frequency band signal is greater than a given threshold value A second decision module adapted to determine whether or not,
If the second determination module determines that the attenuation coefficient is not greater than the given threshold, the processed first high frequency band signal is multiplied by the threshold to produce a product and the first low frequency A fourth synthesis module adapted to synthesize a band signal into the wide frequency band signal;
If the second determination module determines that the attenuation coefficient is greater than the given threshold, the processed first high frequency band signal is multiplied by the attenuation coefficient to yield a product and the first low frequency band signal A fifth synthesis module adapted to synthesize the signal into the wide frequency band signal;
A first modification module adapted to modify the damping factor to reduce the damping factor;
The apparatus according to any one of claims 8 to 10, wherein an initial value of the attenuation coefficient is 1, and the threshold value is 0 or more and less than 1.

Further comprising a second synthesis module adapted to synthesize the first high frequency band signal and the first low frequency band signal into the wide frequency band signal when switching of the audio signal has not occurred. Item 12. The device according to any one of Items 8 to 11.

A computer-readable storage medium having a recorded program, wherein the program causes a computer to execute the method according to any one of claims 1 to 7.