JP3325277B2

JP3325277B2 - Audio coding device

Info

Publication number: JP3325277B2
Application number: JP17144591A
Authority: JP
Inventors: 山元昭児
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 1991-07-11
Filing date: 1991-07-11
Publication date: 2002-09-17
Anticipated expiration: 2017-09-17
Also published as: JPH0519789A

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】本発明は、音声符号化装置に関
し、特に、無音部分の符号化に着目した音声符号化装置
に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a speech coding apparatus and, more particularly, to a speech coding apparatus which focuses on coding of a silent part.

【０００２】[0002]

【従来の技術】従来、音声信号を符号化する場合、高効
率で符号化するために、無音部分の処理について特別な
処理を行なってきた。2. Description of the Related Art Conventionally, when encoding an audio signal, a special process has been performed for processing of a silent portion in order to encode with high efficiency.

【０００３】例えば、音声のない無音区間を符号化する
に当って、単位時間当りの符号化ビット数の少ない低ビ
ットレートで符号化する、いわゆる、アダプティブな方
式はその一つである。これは、例えば以下のようなもの
である。即ち、図５のタイミングチャートに示すよう
に、時間ｔの経過と共に順次入力される音声符号のｎフ
レームからｎ＋７フレームまでを低メモリアドレスから
高メモリアドレスに向かって順次符号化するとする。ｎ
＋３フレームからｎ＋５フレームまでの間は無音区間で
ある。このため、これらの区間は低ビットレートで符号
化することにより、通常の音声部分の符号化に対して、
圧縮率を１／ｋにしている。[0003] For example, a so-called adaptive system in which a silent section without sound is encoded at a low bit rate with a small number of encoding bits per unit time is one of them. This is, for example, as follows. That is, as shown in the timing chart of FIG. 5, it is assumed that the speech codes sequentially inputted from the nth frame to the n + 7th frame as the time t elapses are sequentially encoded from the low memory address to the high memory address. n
Between the +3 frame and the n + 5 frame is a silent section. For this reason, these sections are encoded at a low bit rate, so that the
The compression ratio is set to 1 / k.

【０００４】一方、音声のない無音区間を全く符号化し
ないという、ランレングス的な符号化も知られている。
これは、以下のようなものである。即ち、図６のタイム
チャートに示すように、時間ｔの経過と共に入力される
音声波形Ｗのなかで、同図（ａ）に示すように、ある閾
値ＴＨ以上のｎ＋１区間、ｎ＋３区間のみを符号化の対
象とし、閾値ＴＨ以下のｎ区間、ｎ＋２区間は全く符号
化の対象としない。そして、同図（ｂ）に示すように、
音声のあるｎ＋１区間、ｎ＋３区間のみをつなげて符号
化するという方式である。On the other hand, there is also known a run-length encoding in which a silent section having no sound is not encoded at all.
This is as follows. That is, as shown in the time chart of FIG. 6, only the (n + 1) and (n + 3) sections that are equal to or greater than a certain threshold TH in the audio waveform W input as time t elapses, as shown in FIG. N and n + 2 sections that are equal to or less than the threshold value TH are not subjected to encoding at all. Then, as shown in FIG.
This is a method in which only n + 1 and n + 3 sections with speech are connected and encoded.

【０００５】従来の音声符号化装置は以上のいずれかの
方法により、符号化効率を高めてきた。[0005] The conventional speech coding apparatus has improved the coding efficiency by any of the above methods.

【０００６】[0006]

【発明が解決しようとする課題】無音区間を音声区間に
比べて低ビットレートで符号化する第１の方式の場合、
音声区間と無音区間の単位時間当りのメモリ量が異な
る。そのため、復号化時にスキップ等の特殊再生を行な
うと、符号化されたデータを音声か否かで単位時間当り
のアドレススキップ量が異なってくる。そのため、スキ
ップ時のアドレッシングの計算を行なう場合、メモリ内
容を一旦読み出す必要がある。これにより、回路規模が
大きくなり、演算時間も増大するという問題点がある。In the case of the first method for encoding a silent section at a lower bit rate than a speech section,
The memory amount per unit time differs between the voice section and the silent section. Therefore, if special reproduction such as skipping is performed during decoding, the amount of address skipping per unit time differs depending on whether encoded data is audio. Therefore, when calculating the addressing at the time of skipping, it is necessary to read out the memory contents once. As a result, there is a problem that the circuit scale increases and the operation time also increases.

【０００７】一方、無音区間を全く符号化しないという
第２の方式の場合、無音区間のデータが全く欠落してい
る。このため、復号化した音声を再生すると、無音区間
が欠落した再生音となる。このため、録音時の音声の忠
実な再生ができず、不自然な再生音になってしまうとい
う問題がある。On the other hand, in the case of the second method in which a silent section is not encoded at all, data of a silent section is completely missing. For this reason, when the decoded sound is reproduced, the reproduced sound has a silent section missing. For this reason, there is a problem that the faithful reproduction of the sound at the time of recording cannot be performed, resulting in an unnatural reproduction sound.

【０００８】以上のように、従来の音声符号化では、音
声の復号化時に処理が複雑になったり、再生音が不自然
になったりするという問題がある。As described above, the conventional speech coding has a problem that the processing becomes complicated when the speech is decoded and the reproduced sound becomes unnatural.

【０００９】本発明は、上記に鑑みてなされたもので、
その目的は、音声再生時のアドレッシングを容易にする
と共に自然な音声再生を可能とすることにある。[0009] The present invention has been made in view of the above,
An object of the present invention is to facilitate addressing at the time of sound reproduction and to enable natural sound reproduction.

【００１０】[0010]

【課題を解決するための手段】本発明の第１の音声符号
化装置は、入力音声信号の無音部分を検出して無音検出
信号を出力する無音検出部と、前記無音検出信号出力中
のクロック数を計数してカウンタ値を出力する無音カウ
ンタと、前記無音部分が検出されずに前記カウンタ値が
出力されないときは前記入力音声信号の有音部分を符号
化音声信号とすると共に、前記無音部分が検出されて、
前記カウンタ値が予め定めた基準値を超えたときに無音
符号を発生する音声符号化部と、前記無音部分の検出時
に前記無音カウンタからの前記カウンタ値に応じた信号
が与えられ、前記有音部分の入力時にはアドレスを歩進
するアドレス発生部と、前記音声符号化部から出力され
た前記符号化音声信号を格納すると共に前記無音符号が
与えられるメモリと、を備え、前記アドレス発生部は、
前記カウンタ値が前記基準値を超えないときは前記アド
レスを歩進させ、前記カウンタ値が前記基準値を超えた
ときは前記メモリのアドレスの歩進を停止させ、前記カ
ウンタ値が前記基準値を超えたときから次の有音部分ま
での無音部分をその期間に拘らず１フレーム長と同じ長
さの１つのフレームに格納するようにアドレスを発生し
て前記メモリに出力するように構成されている、ことを
特徴として構成される。According to a first aspect of the present invention, there is provided a speech encoding apparatus for detecting a silence portion of an input speech signal and outputting a silence detection signal, and a clock for outputting the silence detection signal. A silence counter that counts a number and outputs a counter value, and when the silence portion is not detected and the counter value is not output, a sound portion of the input audio signal is used as an encoded audio signal, and the silence portion is output. Is detected,
A speech encoding unit that generates a silence code when the counter value exceeds a predetermined reference value, and a signal corresponding to the counter value from the silence counter when the silence portion is detected is provided, An address generation unit that increments an address at the time of inputting a part, and a memory that stores the encoded audio signal output from the audio encoding unit and is provided with the silence code, the address generation unit includes:
When the counter value does not exceed the reference value, the address is incremented.When the counter value exceeds the reference value, the increment of the address of the memory is stopped. An address is generated so as to store a silence portion from the time of exceeding to the next sound portion in one frame having the same length as one frame regardless of the period, and the address is outputted to the memory. It is characterized by having.

【００１１】さらに、本発明の第２の音声符号化装置は
次のように構成される。Further, the second speech coding apparatus of the present invention is configured as follows.

【００１２】即ち、入力音声信号の無音部分を検出して
無音検出信号を出力する無音検出部と、前記無音検出信
号出力中のクロック数を計数してカウンタ値を出力する
無音カウンタと、前記無音部分が検出されずに前記カウ
ンタ値が出力されないときは前記入力音声信号の有音部
分を符号化音声信号とすると共に、前記無音部分が検出
されて、前記カウンタ値が予め定めた基準値を超えたと
きに無音符号を発生する音声符号化部と、前記無音部分
の検出時に前記無音カウンタからの前記カウンタ値に応
じた信号が与えられ、前記有音部分の入力時にはアドレ
スを歩進するアドレス発生部と、前記音声符号化部から
出力された前記符号化音声信号を格納すると共に前記無
音符号が与えられるメモリと、を備え、前記アドレス発
生部は、前記カウンタ値が前記基準値を超えないときは
前記アドレスを歩進させ、前記カウンタ値が前記基準値
を超えたときは、前記メモリのアドレスの歩進を停止さ
せるだけでなく、前記アドレスを前記無音検出信号の出
力開始時点のアドレスまで戻し、前記無音検出信号が出
力されてから次の有音部分までの無音部分をその期間に
拘らず１フレーム長と同じ長さの１つのフレームに格納
するようにアドレスを発生して前記メモリに出力するよ
うに構成されている、ことを特徴として構成される。A silence detecting section for detecting a silence portion of the input audio signal and outputting a silence detection signal; a silence counter for counting the number of clocks in outputting the silence detection signal and outputting a counter value; When the counter value is not output because no part is detected, the sound part of the input audio signal is used as the encoded audio signal, and the silent part is detected, and the counter value exceeds a predetermined reference value. A speech encoding unit that generates a silence code when a signal is generated, and a signal corresponding to the counter value from the silence counter is provided when the silence portion is detected, and an address generation step that increments an address when the speech portion is input. And a memory for storing the encoded audio signal output from the audio encoding unit and to which the silence code is given. When the counter value does not exceed the reference value, the address is incremented. When the counter value exceeds the reference value, not only the increment of the address of the memory is stopped, but also the address is changed to the silence. Return to the address at the start of the output of the detection signal, and store the silent part from the output of the silent detection signal to the next sound part in one frame of the same length as one frame regardless of the period. And outputs the address to the memory.

【００１３】[0013]

【作用】入力音声信号の無音部分が無音検出手段で検出
される。無音部分が所定の長さよりも長いと判断された
ときには、符号化手段からの無音符号がメモリに格納さ
れる。ただし、このときからアドレス発生手段はアドレ
スの歩進を停止する。この後、入力音声手段の音声部分
が入力すると、アドレスの歩進が再開する。そして、そ
の音声部分が符号化音声信号としてメモリに格納され
る。The silent part of the input audio signal is detected by the silent detection means. When it is determined that the silent part is longer than the predetermined length, the silent code from the encoding means is stored in the memory. However, from this time, the address generation means stops the address increment. Thereafter, when the voice portion of the input voice means is input, the address increment is resumed. Then, the audio part is stored in the memory as an encoded audio signal.

【００１４】[0014]

【実施例】以下、図面を参照しながら本発明の実施例を
説明する。Embodiments of the present invention will be described below with reference to the drawings.

【００１５】図１は、本発明の一実施例に係る音声符号
化装置のブロック図である。図１に示すように、無音検
出部１は、音声信号Ｓ１から無音部を検出する。この検
出フレーム数は、無音カウンタ２によってカウントされ
る。音声符号化部４は、音声信号Ｓ１の中の符号化対象
となる音声を符号化する。つまり、音声符号化部４は、
音声部はそのまま符号化し、無音部は圧縮して無音カウ
ンタ２からのカウント値と共にデータ出力する。音声符
号化部４からのデータは、符号蓄積用メモリ５に記録さ
れる。この記録に使われるメモリアドレスＡｍは、アド
レス発生器３が無音カウンタ２からのカウント信号に基
づいて発生する。FIG. 1 is a block diagram of a speech coding apparatus according to one embodiment of the present invention. As shown in FIG. 1, the silence detecting section 1 detects a silence section from the audio signal S1. The number of detected frames is counted by the silence counter 2. The audio encoding unit 4 encodes the audio to be encoded in the audio signal S1. That is, the audio encoding unit 4
The audio part is encoded as it is, the silent part is compressed, and the data is output together with the count value from the silent counter 2. Data from the audio encoding unit 4 is recorded in the code storage memory 5. The memory address Am used for this recording is generated by the address generator 3 based on a count signal from the silence counter 2.

【００１６】図２は、図１の無音検出部１の一例を示す
ブロック図である。図２に示すように、検波器１１は入
力された音声信号Ｓ１を検波する。この場合の検波方式
としては、絶対値検波や２乗検波等の方式がとられる。
検波器１１で検波された信号は、ローパスフィルタ１２
で平滑された後、比較器１３で固定閾値θ１と比較され
る。そして、固定閾値θ１よりも小さい信号の区間につ
いては、無音検出信号Ｓ２が出力される。FIG. 2 is a block diagram showing an example of the silence detecting section 1 of FIG. As shown in FIG. 2, the detector 11 detects the input audio signal S1. As a detection method in this case, a method such as an absolute value detection or a square detection is used.
The signal detected by the detector 11 is applied to a low-pass filter 12.
, And is compared by the comparator 13 with the fixed threshold value θ1. Then, for a section of the signal smaller than the fixed threshold value θ1, the silence detection signal S2 is output.

【００１７】図３は、図１の無音カウンタ２の一例を示
すブロック図である。図３に示すように、無音カウンタ
２はクロックＣＬＫ１によりカウント動作し、無音検出
信号Ｓ２により同期リセットされる。カウンタ値θｎ
が、無音カウンタ２から出力される。FIG. 3 is a block diagram showing an example of the silence counter 2 of FIG. As shown in FIG. 3, the silence counter 2 performs a counting operation by the clock CLK1, and is synchronously reset by the silence detection signal S2. Counter value θn
Is output from the silence counter 2.

【００１８】図４は、図１のアドレス発生器３の一例を
示すブロック図である。図４に示すように、比較器３１
は、無音カウンタ２からのカウンタ値θｎと固定閾値θ
２を比較し、その結果を出力する。論理ゲート３６は、
この比較結果とクロックＣＬＫ２とのナンド論理をと
り、アドレスレジスタ３２にストローブ信号を与える。
インクリメント部３３は、アドレスレジスタ３２の出力
データを入力し、そのデータに＋１したデータをメモリ
アドレスＡｍとして出力する。減算器３４は、アドレス
レジスタ３２の出力データを入力し、そのデータからデ
ータＤを減算した値をメモリアドレスＡｍとして出力す
る。切替器３５は、比較器３１からの出力に基づいて、
インクリメント部３３の出力と減算器３４の出力とのう
ちのいずれかのデータを切り替え出力し、アドレスレジ
スタ３２に与える。FIG. 4 is a block diagram showing an example of the address generator 3 of FIG. As shown in FIG.
Is the counter value θn from the silence counter 2 and the fixed threshold value θ
2 and outputs the result. The logic gate 36
The NAND logic of the comparison result and the clock CLK2 is calculated, and a strobe signal is supplied to the address register 32.
The increment unit 33 receives the output data of the address register 32, and outputs data obtained by adding +1 to the data as the memory address Am. The subtractor 34 receives the output data of the address register 32, and outputs a value obtained by subtracting the data D from the data as a memory address Am. The switch 35 is based on the output from the comparator 31.
Any one of the output of the increment unit 33 and the output of the subtractor 34 is switched and output, and is provided to the address register 32.

【００１９】比較器３１の出力は、符号蓄積用メモリ５
に対するライト信号ＷＲとしても用いられる。The output of the comparator 31 is supplied to the code storage memory 5.
Is also used as a write signal WR for

【００２０】以上述べたような構成において、次にその
動作を図７のタイミングチャートにしたがって説明す
る。ちなみに、図７（ａ）は入力される音声波形Ｗ、
（ｂ）は無音検出部１による無音検出信号、（ｃ）は無
音カウンタ２のカウンタ値、（ｄ）はアドレス発生器３
における比較器３１の出力であるライト信号ＷＲ、
（ｅ）はアドレス発生器３のアドレスレジスタ３２のカ
ウント値、（ｆ）はアドレスレジスタ３２のストローブ
信号をそれぞれ示すものである。Next, the operation of the above-described configuration will be described with reference to the timing chart of FIG. By the way, FIG. 7A shows the input speech waveform W,
(B) is a silence detection signal by the silence detection unit 1, (c) is a counter value of the silence counter 2, and (d) is an address generator 3.
WR, which is the output of the comparator 31 at
(E) shows the count value of the address register 32 of the address generator 3, and (f) shows the strobe signal of the address register 32.

【００２１】先ず、音声信号Ｓ１が入力され、音声波形
Ｗが固定閾値θ１を上回っているとする。この場合は、
無音検出部１が無音を検出しない。このため、音声符号
化部４は、音声信号Ｓ１を符号化し、符号蓄積用メモリ
５に符号化データとして書き込む。この時、アドレスレ
ジスタ３２には、切替器３５を通じて、インクリメント
部３３の出力データが与えられる。このため、クロック
ＣＬＫ２に基づいてアドレスは順次インクリメントさ
れ、符号蓄積用メモリ５には順次符号化データが書き込
まれる。このとき、無音カウンタ２のカウント値は
“０”のままで歩進しない。First, it is assumed that the audio signal S1 is inputted and the audio waveform W exceeds the fixed threshold value θ1. in this case,
The silence detector 1 does not detect silence. Therefore, the audio encoding unit 4 encodes the audio signal S1 and writes the encoded audio signal S1 in the code storage memory 5 as encoded data. At this time, the output data of the increment unit 33 is supplied to the address register 32 through the switch 35. Therefore, the address is sequentially incremented based on the clock CLK2, and the encoded data is sequentially written in the code storage memory 5. At this time, the count value of the silence counter 2 remains "0" and does not advance.

【００２２】一方、音声信号Ｓ１が入力され、音声波形
Ｗが固定閾値θ１以下とする。この場合、無音検出部１
からは、図７（ｂ）に示すように無音検出信号Ｓ２が出
力される。これにより、無音カウンタ２は、同図（ｃ）
に示すように、クロックＣＬＫ１のカウントを開始し、
カウンタ値θｎを出力する。このカウンタ値θｎが固定
閾値θ２に達すると、符号化圧縮すべきということが判
定される。この判定は、比較器３１で行われる。その結
果、同図７（ｄ）に示すように、比較器３１から、ライ
ト信号ＷＲと切替信号ＣＨが送出され、切替器３５が切
り替わる。切替器３５が切り替わると、図７（ｅ）に示
すように、アドレスレジスタ３２には、減算器３４によ
って、現在のアドレスｋよりデータＤだけ減算したアド
レス（ｋ−Ｄ）がセットされる。一方、図７（ｆ）に示
すように、論理ゲート３６に与えられていたクロックＣ
ＬＫ２がマスクされる。このため、アドレスレジスタ３
２はアドレス（ｋ−Ｄ）をキープすることになる。ちな
みに、ここでの減算データＤは、比較器３１に設定され
る固定閾値θ２に相当するデータに設定される。一方、
アドレス発生器３から、カウンタ値θｎが固定閾値θ２
を越えたことを示すライト信号ＷＲが出力されたとす
る。このときには、音声符号化部４は、音声信号Ｓ１の
符号化データの代わりに、無音マークを符号蓄積用メモ
リ５に書き込む。無音区間が続く限りアドレスレジスタ
３２はアドレス（ｋ−Ｄ）を保持する。無音カウンタ２
は無音区間のカウントを継続する。次に、音声信号Ｓ１
の音声波形Ｗが固定閾値θ１を越えると、無音検出部１
からの無音検出信号Ｓ２がなくなる。その結果、無音カ
ウンタ２はその時点のカウント値ｎのままでカウント動
作を停止し、次に同期リセットされてカウンタ値ｎを
“０”に戻す。これに先立って、音声符号化部４は、無
音カウンタ２のカウント値ｎを符号蓄積用メモリ５に記
録する。その結果、符号蓄積用メモリ５は、図９に示す
ように、（ａ）の状態までは音声符号化部４の符号化デ
ータを記録するが、無音カウンタ２のカウント値θｎが
固定閾値θ２に達すると無音と判定し、この時点で
（ｂ）に示すように、アドレスをデータＤだけ戻したメ
モリアドレスに無音マークを記録し、その後はアドレス
を進めずに、次に音声区間が始まった時点で無音カウン
タ２のカウンタ値θｎであるｎを記録する。On the other hand, the audio signal S1 is input, and the audio waveform W is set to a fixed threshold value θ1 or less. In this case, the silence detector 1
Outputs a silence detection signal S2 as shown in FIG. 7 (b). Thereby, the silence counter 2 becomes as shown in FIG.
As shown in the figure, the counting of the clock CLK1 is started,
The counter value θn is output. When the counter value θn reaches the fixed threshold value θ2, it is determined that encoding compression should be performed. This determination is performed by the comparator 31. As a result, as shown in FIG. 7D, the comparator 31 sends the write signal WR and the switching signal CH, and the switching unit 35 switches. When the switch 35 is switched, as shown in FIG. 7E, an address (kD) obtained by subtracting the data D from the current address k by the subtractor 34 is set in the address register 32. On the other hand, as shown in FIG.
LK2 is masked. Therefore, the address register 3
2 will keep the address (kD). Incidentally, the subtraction data D here is set to data corresponding to the fixed threshold value θ2 set in the comparator 31. on the other hand,
From the address generator 3, the counter value θn is set to a fixed threshold value θ2
It is assumed that a write signal WR indicating that the signal has exceeded the limit is output. At this time, the voice coding unit 4 writes a silent mark into the code storage memory 5 instead of the coded data of the voice signal S1. As long as the silent section continues, the address register 32 holds the address (kD). Silence counter 2
Keeps counting the silent section. Next, the audio signal S1
Is greater than the fixed threshold θ1, the silence detector 1
Is lost. As a result, the silence counter 2 stops the counting operation with the count value n at that time, and is reset synchronously to return the counter value n to “0”. Prior to this, the speech encoding unit 4 records the count value n of the silence counter 2 in the code storage memory 5. As a result, as shown in FIG. 9, the code storage memory 5 records the coded data of the voice coding unit 4 until the state shown in FIG. At this point, it is determined that there is no sound. At this point, as shown in (b), a silent mark is recorded at the memory address whose address has been returned by the data D, and thereafter the address is not advanced and the next time the voice section starts Record n, which is the counter value θn of the silence counter 2.

【００２３】さて、音声区間に戻り、カウンタ値θｎが
“０”に戻ることに伴い、カウンタ値θｎは固定閾値θ
２より小さくなる。このため、ライト信号ＷＲがなくな
り、切替器３５も減算器３４からインクリメント部３３
側に切り替わる。その結果、アドレスレジスタ３２は、
再び、クロックＣＬＫ２に同期してアドレス（ｋ−Ｄ）
から（ｋ−Ｄ＋１）とインクリメント動作を始める。こ
れにより符号蓄積用メモリ５には、音声符号化部４で符
号化された音声信号Ｓ１が順次書き込まれる。Now, returning to the voice section, when the counter value θn returns to “0”, the counter value θn becomes a fixed threshold value θ.
It becomes smaller than 2. For this reason, the write signal WR disappears, and the switch 35 is also switched from the subtractor 34 to the incrementer 33.
Switch to the side. As a result, the address register 32
Again, the address (kD) is synchronized with the clock CLK2.
From (k-D + 1) to start the increment operation. Thereby, the audio signal S1 encoded by the audio encoding unit 4 is sequentially written into the code storage memory 5.

【００２４】以上のようにして符号蓄積用メモリ５に書
き込まれた音声符号化データは、図１０に示すように、
無音区間で無音マークと無音区間フレーム数が無音デー
タフォーマットで記録され、音声区間では符号化音声デ
ータが音声データフォーマットで記録される。The audio coded data written in the code storage memory 5 as described above is, as shown in FIG.
In a silent section, a silent mark and the number of silent section frames are recorded in a silent data format, and in a voice section, encoded audio data is recorded in an audio data format.

【００２５】さて、以上のようなフォーマットで記録さ
れた符号化音声データを再生するには、音声データフォ
ーマット区間では符号蓄積用メモリ５のアドレスを進め
ながら符号化音声データをそのまま再生すればよい。一
方、無音データフォーマット区間では、無音マークによ
って無音フレームであることを検出し、その区間では符
号蓄積用メモリ５のアドレスを停止して、無音データを
再生し続ければよい。同時に、無音区間フレーム数をカ
ウントし、カウント値が一杯になった時点で符号蓄積用
メモリ５のアドレスを止めずに進めればよい。なお、無
音区間をスキップさせたければ符号蓄積用メモリ５のア
ドレスを止めずに進めればよい。なお、図１０に示すよ
うに、無音データフォーマットと音声データフォーマッ
トでデータサイズを同じにしておけば無音部と音声部で
アドレス制御を切り替える必要はない。In order to reproduce the encoded audio data recorded in the above format, the encoded audio data may be reproduced as it is while the address of the code storage memory 5 is advanced in the audio data format section. On the other hand, in the silence data format section, the fact that the frame is a silence frame is detected by the silence mark, and in that section, the address of the code storage memory 5 is stopped, and the silence data may be continuously reproduced. At the same time, the number of silent section frames is counted, and when the count value is full, the address of the code storage memory 5 may be advanced without stopping. If a silent section is to be skipped, it is sufficient to proceed without stopping the address of the code storage memory 5. As shown in FIG. 10, if the data size is the same between the silent data format and the audio data format, there is no need to switch the address control between the silent portion and the audio portion.

【００２６】なお、無音データフォーマットと音声デー
タフォーマットでデータサイズを同じにした場合、アド
レッシングが容易になり、高速サーチ、早送り、巻き戻
しなどの特殊再生を、データの内容を読み出す必要なく
実現することができる。When the data size is the same in the silence data format and the audio data format, addressing becomes easy, and special reproduction such as high-speed search, fast forward, and rewind can be realized without having to read data contents. Can be.

【００２７】以上のような構成によれば、連続無音区間
の長さに拘らずに無音区間は１フレームに圧縮されるの
で、無音区間が長ければ長い程データの圧縮率が高くな
る。今、音声区間と無音区間の比が１：ｕであるとす
る。図５のような圧縮方式をとって音声部に対して無音
部を１／ｋに圧縮した場合と比較してみる。図５のよう
な方式で実現できる圧縮率α１が、 α１＝（１＋ｕ）／｛１＋ｕ（１／ｋ）｝＝（１＋ｕ）・ｋ／（ｋ＋ｕ）となり、本実施例の方式では圧縮率α２が α２＝（１＋ｕ）／２となる。ｕ＞ｋであるとすると、 α１＜α２となり、本実施例の場合、無音区間が長ければ長い程、
データの圧縮率が高くなる。また、無音区間をスキップ
する場合も無音区間の展開をしなくて済むという利点が
ある。According to the above configuration, the silent section is compressed into one frame regardless of the length of the continuous silent section, so that the longer the silent section, the higher the data compression rate. Now, it is assumed that the ratio between the voice section and the silent section is 1: u. A comparison will be made with a case where the silent part is compressed to 1 / k with respect to the audio part by using the compression method as shown in FIG. The compression rate α1 that can be realized by the method shown in FIG. 5 is as follows: α1 = (1 + u) / {1 + u (1 / k)} = (1 + u) · k / (k + u), and the compression rate α2 is α2 = (1 + u) / 2. If u> k, then α1 <α2, and in this embodiment, the longer the silent section, the longer the
The data compression ratio increases. In addition, there is an advantage that a silent section does not need to be expanded even when a silent section is skipped.

【００２８】なお、図８は、アドレス発生器３の他の構
成例を示す。同図８の構成と図４の構成の違いは、減算
器３４と切替器３５を省略した点にあり、構成が簡単に
なっている。無音区間を判別したときに、既に無音カウ
ンタ２は固定閾値θ２までカウントを進めている。しか
し、この例では、アドレスレジスタ３２のアドレスを元
に戻さずに単にストップさせるだけに留めている。その
結果、符号蓄積用メモリ５には、固定閾値θ２に対応す
るフレームだけ無音データが記録されてしまい、圧縮率
の点でも不利となる。しかし、回路構成が簡単でコスト
が安くなるという利点を有する。FIG. 8 shows another example of the configuration of the address generator 3. The difference between the configuration in FIG. 8 and the configuration in FIG. 4 is that the subtractor 34 and the switch 35 are omitted, and the configuration is simplified. When the silent section is determined, the silent counter 2 has already counted up to the fixed threshold value θ2. However, in this example, the address of the address register 32 is merely restored without being restored. As a result, silence data is recorded in the code storage memory 5 only for the frame corresponding to the fixed threshold value θ2, which is disadvantageous in terms of the compression ratio. However, there is an advantage that the circuit configuration is simple and the cost is low.

【００２９】なお、上記実施例では、それぞれの構成要
素を個別の回路構成として示した。しかし、ソフトウェ
アにより制御されるマイクロコンピュータやディジタル
シグナルプロセッサを用いても同様の機能を実現するこ
とが可能である。また、カウンタ等もディジタル的なカ
ウントに代えて、積分器等のアナログ的な回路でも実現
可能である。また、上記実施例では、アドレスレジスタ
３２にアドレス値を与えるために、減算器３４とインク
リメント部３３の２つの回路の出力値を切替器３５で切
り替える構成を例示した。しかし、インクリメント部３
３と減算器３４を共通の演算器で構成し、切替器３５に
対する切り替え信号により演算値を切り替える構成とし
てもよい。In the above embodiment, each component is shown as an individual circuit configuration. However, the same function can be realized by using a microcomputer or a digital signal processor controlled by software. Also, the counter and the like can be realized by an analog circuit such as an integrator instead of the digital count. Further, in the above-described embodiment, the configuration in which the output values of the two circuits of the subtractor 34 and the increment unit 33 are switched by the switch 35 in order to give the address value to the address register 32 has been exemplified. However, the increment unit 3
3 and the subtractor 34 may be configured by a common computing unit, and the computation value may be switched by a switching signal to the switching unit 35.

【００３０】なお、音声符号化装置４としては、ＰＣＭ
（パルスコードモジュレーション）、ＡＤＭ（アダプテ
ィブデルタモジュレーション）、ＡＤＰＣＭ（アダプテ
ィブヂッファレンシャルパルスコードモジュレーショ
ン）、ＡＰＣ−ＡＢ（アダプティブプレヂクティブコー
ディング−アダプティブビットアロケーション）等の各
種の符号化方式が適用可能である。また、音声符号化部
４に入力される音声信号Ｓ１は、アナログ信号でも、デ
ィジタル信号でもよい。It should be noted that the speech encoding device 4 is a PCM
Various coding methods such as (pulse code modulation), ADM (adaptive delta modulation), ADPCM (adaptive differential pulse code modulation), and APC-AB (adaptive adaptive coding-adaptive bit allocation) are applicable. . The audio signal S1 input to the audio encoding unit 4 may be an analog signal or a digital signal.

【００３１】また、符号蓄積用メモリ５としては、半導
体メモリはもちろん、磁気ディスクや磁気テープでもよ
く、データを記録できる媒体ならあらゆるものが適用可
能である。また、リアルタイムな記録を要求されなけれ
ば、もう少し低速のメモリが適用可能である。The code storage memory 5 may be not only a semiconductor memory but also a magnetic disk or a magnetic tape, and any medium that can record data can be applied. If real-time recording is not required, a slightly slower memory is applicable.

【００３２】さらに、無音カウンタ２は、音声信号Ｓ１
の大きさが固定閾値θ１を越えた時点でリセットされる
が、固定閾値θ１を越えた時間が一定の時間を越えた場
合場合に無音カウンタ２をリセットするような構成とし
てもよい。また、無音カウンタ２は、アップカウンタに
限らず、ダウンカウンタとしてもよい。Further, the silence counter 2 outputs the audio signal S1
Is reset when the value exceeds the fixed threshold θ1, but the silence counter 2 may be reset when the time exceeding the fixed threshold θ1 exceeds a certain time. The silence counter 2 is not limited to the up counter, but may be a down counter.

【００３３】[0033]

【発明の効果】以上述べたように、本発明によれば、音
声信号の無音区間をその長さに拘わらず有音区間の１フ
レームと同じ長さの１フレームに無音区間フレーム数デ
ータと共に圧縮するようにしたので、無音区間が長く続
く場合の圧縮率を高めることが可能であり、さらにこの
ように有音区間と無音区間におけるフレームサイズを同
じとしたので、即ち、メモリにおいて、符号化音声信号
を格納する１フレームと、無音符号とこのカウンタ値を
格納する１フレームとを同じデータサイズとしておき、
入力音声信号における無音部分をその長さに拘わらず１
フレームにアドレッシングするようにしたので、２つの
区間におけるアドレス制御を切り替える必要はなく、ア
ドレシングが容易となり、格納、再生を高速で行うこと
ができ、装置全体としての回路構成を簡単で低コストの
ものとして得ることができる。また再生を行なう場合の
データの取り扱いが容易で回路構成が簡単になり、更に
無音カウンターで一定時間以上無音区間が続かないと無
音と判定しないようにしたので再生音声の品質を高める
ことができる。As described above, according to the present invention, a silent section of an audio signal is compressed into one frame of the same length as one frame of a sound section, together with data of the number of silent section frames, regardless of its length. Therefore, it is possible to increase the compression ratio in the case where the silent section continues for a long time. Further, since the frame size in the sound section and the frame section in the silent section are the same, that is, in the memory, One frame for storing the signal and one frame for storing the silence code and this counter value have the same data size,
The silent part in the input audio signal is 1 regardless of its length.
Since addressing is performed in frames, there is no need to switch address control in two sections, addressing becomes easy, storage and reproduction can be performed at high speed, and the circuit configuration of the entire apparatus is simple and low-cost. Can be obtained as In addition, data can be easily handled during reproduction and the circuit configuration can be simplified. Further, since a silent section is not determined unless a silent section continues for a certain period of time by a silent counter, the quality of reproduced sound can be improved.

[Brief description of the drawings]

【図１】本発明の一実施例の音声符号化装置のブロック
図。FIG. 1 is a block diagram of a speech encoding device according to an embodiment of the present invention.

【図２】図１の無音検出部の一例を示すブロック図。FIG. 2 is a block diagram showing an example of a silence detecting section in FIG. 1;

【図３】図１の無音カウンタの一例を示すブロック図。FIG. 3 is a block diagram showing an example of a silence counter of FIG. 1;

【図４】図１のアドレス発生器３の一例を示すブロック
図。FIG. 4 is a block diagram showing an example of an address generator 3 of FIG. 1;

【図５】従来の音声の符号化方式の一例を示すタイミン
グチャート。FIG. 5 is a timing chart showing an example of a conventional audio encoding method.

【図６】従来の音声の符号化方式の他の例を示すタイミ
ングチャート。FIG. 6 is a timing chart showing another example of a conventional audio encoding method.

【図７】図１〜図４の回路の動作を説明するためのタイ
ミングチャート。FIG. 7 is a timing chart for explaining the operation of the circuits of FIGS. 1 to 4;

【図８】本発明の他の実施例の音声符号化装置の部分ブ
ロック図。FIG. 8 is a partial block diagram of a speech encoding device according to another embodiment of the present invention.

【図９】符号蓄積用メモリでの無音区間の記録状態の説
明図。FIG. 9 is an explanatory diagram of a recording state of a silent section in a code storage memory.

【図１０】符号蓄積用メモリへの記録フォーマットの説
明図。FIG. 10 is an explanatory diagram of a recording format in a code storage memory.

[Explanation of symbols]

１無音検出部２無音カウンタ３アドレス発生器４音声符号化部５符号蓄積用メモリ１１検波器１２ローパスフィルタ１３比較器３１比較器３２アドレスレジスタ３３インクリメント部３４減算器３５切替器３６論理ゲート DESCRIPTION OF SYMBOLS 1 Silence detection part 2 Silence counter 3 Address generator 4 Voice encoding part 5 Code storage memory 11 Detector 12 Low-pass filter 13 Comparator 31 Comparator 32 Address register 33 Increment part 34 Subtractor 35 Switch 36 Logic gate

Claims

(57) [Claims]

1. A silence detection unit that detects a silence portion of an input audio signal and outputs a silence detection signal, a silence counter that counts the number of clocks in outputting the silence detection signal and outputs a counter value, and a silence counter. When the counter value is not output because no part is detected, the sound part of the input audio signal is used as the encoded audio signal, and the silent part is detected, and the counter value exceeds a predetermined reference value. A speech encoding unit that generates a silence code when a signal corresponding to the counter value is given from the silence counter when the silence part is detected, and an address generation step that increments an address when the speech part is input. And a memory for storing the encoded audio signal output from the audio encoding unit and to which the silence code is given. When the value does not exceed the reference value, the address is incremented, and when the counter value exceeds the reference value, the increment of the address of the memory is stopped, and the counter value exceeds the reference value. It is configured to generate an address so as to store a silent portion from a time to a next sound portion in one frame having the same length as one frame length regardless of the period, and output the address to the memory. A speech coding apparatus characterized by the above-mentioned.

2. A silence detecting section for detecting a silence portion of an input audio signal and outputting a silence detection signal, a silence counter for counting the number of clocks in outputting the silence detection signal and outputting a counter value; When the counter value is not output because no part is detected, the sound part of the input audio signal is used as the encoded audio signal, and the silent part is detected, and the counter value exceeds a predetermined reference value. A speech encoding unit that generates a silence code when a signal corresponding to the counter value is given from the silence counter when the silence part is detected, and an address generation step that increments an address when the speech part is input. And a memory for storing the encoded audio signal output from the audio encoding unit and to which the silence code is given. When the value does not exceed the reference value, the address is incremented. When the counter value exceeds the reference value, not only the increment of the address of the memory is stopped but also the silence detection of the address is performed. Return to the address at the start of signal output, and store the silence part from the output of the silence detection signal to the next sound part in one frame of the same length as one frame regardless of the period. A speech encoding device configured to generate an address and output the address to the memory.