JP2001184100A

JP2001184100A - Speaking speed converting device

Info

Publication number: JP2001184100A
Application number: JP36768299A
Authority: JP
Inventors: Kotaro Machidera; 侯大郎待寺; Chikako Ohara; 千賀子大原; Yoichi Katsuki; 陽一勝木
Original assignee: Anritsu Corp
Current assignee: Anritsu Corp
Priority date: 1999-12-24
Filing date: 1999-12-24
Publication date: 2001-07-06

Abstract

PROBLEM TO BE SOLVED: To reproduce an inputted voice signal which has arbitrary signal time length (source sound recording time) in a desirable reproduction time. SOLUTION: This device is equipped with a voice analysis part 10 which digitally analyzes a continuous voice signal, a sound-recording buffer 11 which stores the digital signal analyzed by the voice analysis part, a source sound recording time detecting means 16 which detects the source sound recording time of the continuous voice signal, a reference speaking speed multiple calculating means 18 which calculates a reference speaking speed multification M represented as the ratio of the detected source sound recording time R0 and a desirable reproduction time R1, and a signal composition part 12 which reproduces a new digital signal with a speaking speed multification Y corresponding to the previously calculated reference speaking speed multification M by compositing the digital signals outputted from the sound-recording buffer 11 responding to reproduction instructions.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は音声の話速を変更す
る話速変換装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a speech speed conversion device for changing the speech speed of voice.

【０００２】[0002]

【従来の技術】例えば外国語を学習する方法として、実
際にその外国語を耳で聞いて学習する学習法が効果的で
ある。この場合、同じ会話を繰り返し聴くことが重要で
ある。したがって、外国語を聴く能力を向上させるため
に、簡便な方法として、例えば、数分〜十数分の会話又
はナレーションを録音したテープを再生して学習する。2. Description of the Related Art For example, as a method of learning a foreign language, a learning method of actually learning by listening to the foreign language by ear is effective. In this case, it is important to listen to the same conversation repeatedly. Therefore, in order to improve the ability to listen to a foreign language, as a simple method, for example, learning is performed by playing a tape on which a conversation or a narration of several minutes to several tens of minutes has been recorded.

【０００３】この場合、語学学習専用のテープレコーダ
においては、音声の再生速度を一定の範囲で可変できる
ように構成されている。[0003] In this case, a tape recorder dedicated to language learning is configured so that the sound reproduction speed can be varied within a certain range.

【０００４】また、語学学習以外にも、演説やナレーシ
ョンやニュース原稿を一旦テープレコーダに録音して、
後で再生する場合においても、再生速度を一定の範囲で
可変できることが望ましい。[0004] In addition to language learning, speeches, narrations, and news manuscripts are temporarily recorded on a tape recorder,
Even in the case of reproducing later, it is desirable that the reproducing speed can be varied within a certain range.

【０００５】しかし、ただ単に音声の再生速度を変更さ
せたのみでは、再生される音声の周波数が変化してしま
い、音声が不自然に聞こえる。[0005] However, simply changing the reproduction speed of the sound changes the frequency of the reproduced sound, and the sound sounds unnatural.

【０００６】このような不都合を解消するために、話す
速度、すなわち話速を変化させたとしても、音声の周波
数は変化せず、ただ話し方がゆっくりになったり、早口
になるのみで自然に聞こえる話速変換手法が提唱されて
いる。[0006] Even if the speaking speed, that is, the speaking speed is changed in order to solve such inconvenience, the frequency of the voice does not change, and the sound can be heard naturally simply by slowing down the speech or making the speaker speak faster. A speech speed conversion method has been proposed.

【０００７】次に、この話速変換手法を図１１及び図１
２を用いて説明する。図１１は、例えば「 It's diffi
cult for me to finish… 」と話した場合の音声信号
１の波形図である。図１２はこの音声信号１の拡大図で
ある。周知のように、音声には子音と母音とがあり、音
声信号１にもそれに対応した子音と母音とがある。図示
するように子音は１個の無声音２で構成され、母音は複
数の有声音３で構成されている。また、音声信号１には
音声の途切れたときの無音４が存在する。Next, FIG. 11 and FIG.
2 will be described. FIG. 11 shows, for example, “It's diffi
cult for me to finish ... ". FIG. FIG. 12 is an enlarged view of the audio signal 1. As is well known, voice has consonants and vowels, and voice signal 1 also has corresponding consonants and vowels. As shown, the consonant is composed of one unvoiced sound 2 and the vowel is composed of a plurality of voiced sounds 3. The audio signal 1 includes silence 4 when the audio is interrupted.

【０００８】ここで、無声音２又は有声音３の継続期間
を有音期間５と称し、無音４の継続期間を無音期間６と
称する。Here, the duration of the unvoiced sound 2 or the voiced sound 3 is referred to as a voiced period 5, and the duration of the silence 4 is referred to as a silent period 6.

【０００９】子音を構成する無声音２は比較的高い周波
数成分を有し、母音を構成する複数の有声音３はほぼ同
一波形を有する。したがって、話速を速くするために
は、母音を構成する複数の有声音３のうちの１個又は複
数の有声音３を間引いて、間引いた有声音３の前後の有
声音３どうし、又は無声音２と有声音３、又は有声音３
と無音４とを接続する。よって、母音の継続時間を短縮
でき、結果として音声信号１の全体の時間を短くでき、
音声の周波数や音質を変更することなく話速を速くでき
る。また、無声期間６の時間を短縮することによって話
速を速くすることもできる。The unvoiced sound 2 forming a consonant has a relatively high frequency component, and the plurality of voiced sounds 3 forming a vowel have substantially the same waveform. Therefore, in order to increase the speech speed, one or a plurality of voiced sounds 3 among the plurality of voiced sounds 3 constituting the vowel are thinned out, and the voiced sounds 3 before and after the thinned voiced sound 3 or unvoiced sounds 3 are mixed. 2 and voiced sound 3 or voiced sound 3
And silence 4 are connected. Therefore, the duration of the vowel can be shortened, and as a result, the entire time of the audio signal 1 can be shortened,
Speaking speed can be increased without changing the frequency or sound quality of the voice. In addition, by shortening the time of the silent period 6, the speech speed can be increased.

【００１０】逆に、話速を遅くする場合は、母音を構成
する複数の有声音３に対して同一の有声音３を挿入して
母音の継続時間を長くすればよい。また、無声期間６の
時間を伸張することによって話速を遅くすることもでき
る。Conversely, when the speech speed is reduced, the same voiced sound 3 may be inserted into a plurality of voiced sounds 3 constituting a vowel to extend the duration of the vowel. Further, the speech speed can be reduced by extending the time of the silent period 6.

【００１１】有音期間５を短縮・伸張することによっ
て、話速変換を自動的に行うためには、音声信号１に含
まれる無声音２と有声音３と無音４とを区分けする必要
がある。この区分手法として、母音は複数の有声音３が
継続する性質を利用して、音声信号１に対して自己相関
関数を算出することにより、無声音２と有声音３との区
分け、及び各有声音３の継続時間（ピッチ）が検出す
る。In order to automatically perform the speech speed conversion by shortening / extending the voiced period 5, it is necessary to distinguish the unvoiced sound 2, the voiced sound 3 and the voiceless sound 4 included in the voice signal 1. As a classification method, a vowel uses the property that a plurality of voiced sounds 3 continue, and calculates an autocorrelation function for the audio signal 1, thereby classifying the unvoiced sound 2 and the voiced sound 3 and each voiced sound. The duration (pitch) of 3 is detected.

【００１２】そして、母音を構成する複数の有声音３の
うち何個の有声音３を間引くか、又は何個の有声音３を
挿入するかで、音声信号１の話速が定まる。The speech speed of the voice signal 1 is determined by the number of voiced sounds 3 to be thinned out or the number of voiced sounds 3 to be inserted among a plurality of voiced sounds 3 constituting a vowel.

【００１３】したがって、このような話速変換機能が組
込まれた音声再生装置を用いることにより、語学学習者
は、違和感なく、速い速度又は遅い速度で会話やナレー
ションを聴くことができる。[0013] Therefore, by using a voice reproducing apparatus incorporating such a speech speed conversion function, a language learner can listen to conversation or narration at a high speed or a low speed without a sense of incongruity.

【００１４】また、必要に応じて、違和感なく、演説や
ナレーションやニュース原稿の話速を変更できる。Further, if necessary, the speech speed of a speech, a narration, or a news manuscript can be changed without a sense of discomfort.

【００１５】[0015]

【発明が解決しようとする課題】しかしながら、上述し
た話速変換機能が組込まれた音声再生装置においても、
まだ改良すべき次のような課題があった。However, even in a sound reproducing apparatus incorporating the above-mentioned speech speed conversion function,
There were the following issues that still need to be improved.

【００１６】すなわち、講演会や放送局においては、予
め与えられた時間に合うように、演説やナレーションや
ニュース原稿が録音されている場合が多い。しかし、与
えられた時間が急に変更になる場合がしばしばある。That is, in lectures and broadcast stations, speeches, narrations, and news manuscripts are often recorded in time with a predetermined time. However, given times often change suddenly.

【００１７】しかし、上述した話速変換機能が組込まれ
た音声再生装置においては、再生される音声の話速を通
常話速に対して何％上昇させるか、又は、通常話速に対
して何％低下させるかを指示できたが、再生時間そのも
のを指定できなかった。そのために、適当に、話速倍率
を指定して、再生させていた。その結果、再生時間が与
えられた時間内に正確に収まらない問題が生じる。However, in a voice reproducing apparatus incorporating the above-mentioned voice speed conversion function, what percentage of the voice speed of the voice to be reproduced is raised relative to the normal voice speed, Although it was possible to specify whether to reduce the playback time, the playback time itself could not be specified. For this purpose, the playback speed has been appropriately specified and reproduced. As a result, there arises a problem that the reproduction time does not exactly fit within the given time.

【００１８】特に、放送局における放送時間は秒単位ま
で厳格に定められている。演説やナレーションは、たと
え圧縮・伸張しない状態においても、与えられた時間に
秒単位で収まることはない。したがって、録音時間が不
確かであるのに、適当な話速倍率を設定して、再生され
た演説やナレーションを放送時間内に秒単位まで厳格に
収めることは至難の業である。In particular, the broadcasting time in a broadcasting station is strictly set to the second. Speech and narration, even in the uncompressed and uncompressed state, do not fit in seconds at a given time. Therefore, it is extremely difficult to set an appropriate speech rate magnification and strictly store reproduced speeches and narrations to the order of seconds within the broadcast time even though the recording time is uncertain.

【００１９】本発明はこのような事情に鑑みてなされた
ものであり、希望再生時間を与えるのみで任意時間長を
有した音声信号を、違和感なく与えられた希望再生時間
で正確に再生でき、使い勝手を大幅に向上できる話速変
換装置を提供することを目的とする。The present invention has been made in view of such circumstances, and it is possible to accurately reproduce an audio signal having an arbitrary time length at a given desired reproduction time without giving a sense of incongruity only by giving a desired reproduction time. It is an object of the present invention to provide a speech speed conversion device capable of greatly improving usability.

【００２０】[0020]

【課題を解決するための手段】上記課題を解消するため
に、本発明の話速変換装置においては、連続した音声信
号をデジタル的に解析する音声解析部と、この音声解析
部で解析されたデジタル信号を記憶する録音バッファ
と、連続した音声信号の原音録音時間を検出する原音録
音時間検出手段と、この検出された原音録音時間と希望
再生時間との比で示される基準話速倍率を算出する基準
話速倍率算出手段と、再生指示に応動して、録音バッフ
ァから出力されるデジタル信号を合成して算出された基
準話速倍率に対応した話速倍率で新たなデジタル信号を
再生する信号合成部とを備えている。SUMMARY OF THE INVENTION In order to solve the above-mentioned problems, in a speech speed conversion device according to the present invention, a voice analyzing section for digitally analyzing a continuous voice signal, and a voice analyzing section for analyzing the continuous voice signal. A recording buffer for storing a digital signal, an original sound recording time detecting means for detecting an original sound recording time of a continuous audio signal, and a reference speech speed magnification indicated by a ratio of the detected original sound recording time to a desired reproduction time. And a signal for reproducing a new digital signal at a speech rate corresponding to the reference speech rate calculated by synthesizing the digital signal output from the recording buffer in response to the playback instruction. A synthesizing unit.

【００２１】また、発明の話速変換装置においては、連
続した音声信号をデジタル的に解析する音声解析部と、
この音声解析部で解析されたデジタル信号を記憶する録
音バッファと、連続した音声信号の原音録音時間を検出
する原音録音時間検出手段と、この検出された原音録音
時間と希望再生時間との比で示される基準話速倍率を算
出する基準話速倍率算出手段と、再生指示に応動して、
録音バッファから出力されるデジタル信号を合成して指
定された話速倍率で新たなデジタル信号を再生する信号
合成部と、算出された基準話速倍率に対応した話速倍率
を算出して前記信号合成部へ送出する話速算出部とを備
えている。Further, in the speech speed conversion device of the present invention, a voice analysis unit for digitally analyzing a continuous voice signal,
A recording buffer for storing the digital signal analyzed by the audio analyzing unit, an original sound recording time detecting means for detecting an original sound recording time of a continuous audio signal, and a ratio of the detected original sound recording time to a desired reproduction time. In response to a reproduction instruction, a reference speech speed magnification calculating means for calculating the indicated reference speech speed magnification,
A signal synthesizing unit for synthesizing a digital signal output from the recording buffer to reproduce a new digital signal at a specified speech speed magnification, and calculating a speech speed ratio corresponding to the calculated reference speech speed ratio; A speech speed calculation unit for sending to the synthesis unit.

【００２２】このように構成された話速変換装置におい
ては、入力された連続する音声信号は音声解析部で、例
えば有音期間、無音期間、無声音、有声音、無音等に区
分（解析）される。この音声解析部でデジタル的に解析
されたデジタル信号（デジタルの音声信号）は録音バッ
ファに記憶保持される。In the thus constructed speech speed conversion device, the input continuous speech signal is classified (analyzed) by a speech analysis unit into, for example, a sound period, a silence period, an unvoiced sound, a voiced sound, a silence, and the like. You. A digital signal (digital audio signal) digitally analyzed by the audio analysis unit is stored and held in a recording buffer.

【００２３】また、入力された連続する音声信号の信号
時間長で示される原音録音時間が検出され、例えば操作
入力された希望再生時間との比で示される基準話速倍率
が算出される。そして、この基準話速倍率から話速倍率
が算出される。ここで、話速倍率とは、速度変換を実施
していない状態の音声の話速を１（基準）とした場合の
話速の倍率である。Also, the original sound recording time indicated by the signal time length of the input continuous audio signal is detected, and a reference speech speed magnification indicated by, for example, the ratio with the desired input reproduction time is calculated. Then, the speech speed magnification is calculated from the reference speech speed magnification. Here, the speech speed magnification is a magnification of the speech speed when the speech speed of the voice in a state where the speed conversion is not performed is set to 1 (reference).

【００２４】外部から再生指示が出されると、録音バッ
ファに記憶保持されている解析されたデジタル信号（デ
ジタルの音声信号）は信号合成部にて話速算出部で指定
され話速倍率を有する新たなデジタル信号（デジタルの
音声信号）として再生される。When a reproduction instruction is issued from the outside, the analyzed digital signal (digital audio signal) stored and held in the recording buffer is converted into a new signal having a speech speed magnification designated by the speech speed calculator in the signal synthesizer. Reproduced as a digital signal (digital audio signal).

【００２５】したがって、操作者は、たとえ、入力され
た音声信号の信号時間長（原音録音時間）が不明であっ
たとしても、希望再生時間のみを指定するのみで、再生
された音声信号が希望再生時間に正確に収まる。Therefore, even if the signal time length (original sound recording time) of the input audio signal is unknown, the operator can specify only the desired reproduction time, and the reproduced audio signal can be changed. Fits exactly in playback time.

【００２６】また、別の発明の話速変換装置において
は、上述した発明の話速変換装置に対して、さらに、連
続する音声信号の累積有音期間を算出する累積有音期間
算出手段と、連続する音声信号の累積無音期間を算出す
る累積無音期間算出手段とを備えている。そして、話速
算出部は、算出された累積有音期間と累積無音期間とか
ら、有音期間の話速倍率を変更することによって基準話
速倍率が得られる有音目標話速倍率を算出して、デジタ
ル信号の有音期間に有音目標話速倍率の話速倍率を信号
合成部へ送出し、デジタルの音声信号の無音期間に期間
変更なしを信号合成部へ送出する。According to another aspect of the present invention, there is provided a speech speed conversion device, wherein the speech speed conversion device according to the above-described invention further includes a cumulative sound period calculating means for calculating a cumulative sound period of a continuous audio signal. And a cumulative silent period calculating means for calculating a cumulative silent period of the continuous audio signal. Then, the speech speed calculation unit calculates a sound target speech speed ratio from which the reference speech speed ratio can be obtained by changing the speech speed ratio in the sound period from the calculated accumulated speech period and the accumulated silence period. Then, during the sound period of the digital signal, the speech speed magnification of the sound target speech speed ratio is sent to the signal synthesizing unit, and during the silence period of the digital audio signal, no change in the period is sent to the signal synthesizing unit.

【００２７】このように構成された話速変換装置におい
ては、連続した音声信号の話速倍率を変更する手段とし
て、連続した音声信号における無音期間はそのまま変更
せずに、有音期間の時間を例えば、有声音を間引いた
り、同一有声音を付加することによって変更している。[0027] In the speech speed conversion device thus constructed, as means for changing the speech speed magnification of the continuous voice signal, the silent period of the continuous voice signal is not changed but the time of the voice period is changed. For example, the voiced sound is changed by thinning out or adding the same voiced sound.

【００２８】さらに、別の発明の話速変換装置において
は、上述した発明の話速変換装置に対して、さらに、連
続した音声信号の累積有音期間を算出する累積有音期間
算出手段と、連続した音声信号の累積無音期間を算出す
る累積無音期間算出手段とを備えている。そして、話速
算出部は、算出された累積有音期間と累積無音期間とか
ら、無音期間を変更することによって基準話速倍率が得
られる無音目標期間を算出して、デジタル信号の有音期
間に無音目標期間を信号合成部へ送出し、デジタル信号
の有音期間に１の話速倍率を信号合成部へ送出する。Further, in the speech speed conversion device of another invention, the speech speed conversion device of the invention described above further comprises a cumulative sound period calculating means for calculating the cumulative sound period of the continuous audio signal, And a cumulative silence period calculating means for calculating a cumulative silence period of the continuous audio signal. Then, the speech speed calculation unit calculates a silence target period in which the reference speech speed magnification is obtained by changing the silence period from the calculated accumulated speech period and the accumulated silence period, and calculates the speech period of the digital signal. Then, a silent target period is sent to the signal synthesizing unit, and a speech speed magnification of 1 is sent to the signal synthesizing unit during the sound period of the digital signal.

【００２９】このように構成された話速変換装置におい
ては、入力された音声信号の話速倍率を変更する手段と
して、入力された音声信号における有音期間はそのまま
変更せずに、無音期間の継続時間を圧縮又は伸張してい
る。In the speech speed conversion device having the above-described structure, as a means for changing the speech speed magnification of the input voice signal, the voice period of the input voice signal is not changed without changing the voice period. Compressing or expanding duration.

【００３０】さらに、別の発明の話速変換装置において
は、上述した発明の話速変換装置の話速算出部は、連続
した音声信号の開始時刻から規定時間経過するまでの期
間内に、時間経過に伴って予め設定された初期話速倍率
から基準話速倍率近傍の目標話速倍率まで変化させ、か
つ規定時間経過後に、目標話速倍率を維持する話速倍率
を順次算出して信号合成部へ送出する。Further, in the speech speed conversion device according to another invention, the speech speed calculation unit of the speech speed conversion device according to the invention described above is configured such that the speech speed calculation unit sets the time within a period from the start time of the continuous audio signal until a predetermined time elapses. As the time elapses, the signal speed is changed from a preset initial speed factor to a target speed factor in the vicinity of the reference speed factor, and after a lapse of a predetermined time, the speed factor for maintaining the target speed factor is sequentially calculated to synthesize a signal. To the department.

【００３１】このように構成された話速変換装置におい
ては、再生された演説やナレーションの冒頭部分のみ通
常話速に近い話速倍率で、規定時間経過後に基準話速倍
率近傍の目標話速倍率で再生される。よって、再生され
た演説やナレーションの冒頭部分を聞き逃すことはな
い。In the thus constructed speech speed conversion device, only the beginning portion of the reproduced speech or narration has a speech speed magnification close to the normal speech speed, and after a lapse of a predetermined time, a target speech speed magnification near the reference speech speed magnification. Will be played back. Thus, the beginning of the replayed speech or narration will not be missed.

【００３２】[0032]

【発明の実施の形態】以下、本発明の各実施形態を図面
を用いて説明する。（第１実施形態）図１は本発明の第１実施形態に係る話
速変換装置の概略構成を示すブロック図である。入力端
子７に対して図１１に示した音声信号１と同一構成の一
連の連続した音声信号ａが入力される。したがって、こ
の音声信号ａは、図１２に示すように、子音に対応する
無声音２と、母音に対応する有声音３と、無音４とで構
成されている。そして、図１２に示すように、無声音２
又は有声音３からなる有音期間５の継続期間をＴ₁と
し、無音４からなる無音期間６の継続期間をＴ₀とす
る。Embodiments of the present invention will be described below with reference to the drawings. (First Embodiment) FIG. 1 is a block diagram showing a schematic configuration of a speech speed conversion device according to a first embodiment of the present invention. A series of continuous audio signals a having the same configuration as the audio signal 1 shown in FIG. Therefore, as shown in FIG. 12, the audio signal a is composed of an unvoiced sound 2 corresponding to a consonant, a voiced sound 3 corresponding to a vowel, and a silent sound 4. Then, as shown in FIG.
Or the duration of the sound period 5 consisting voiced 3 and T _1, the duration of the silent period 6 made silent 4 and T _0.

【００３３】入力端子７から入力されたアナログの音声
信号ａは、Ａ／Ｄ変換器８でデジタルの音声信号に変換
された後、音声信号メモリ９に蓄積される。音声解析部
１０は、この音声信号メモリ９に書込まれた一連のデジ
タルの音声信号ａ₁を無声音２と、有声音３と、無音４
とに区分けする。具体的には、音声信号ａ₁の信号レベ
ルを調べて、有音期間５と無音期間６とを区分けする。
その後、各有音期間５の信号に対して自己相関解析を実
施して、この有音期間５を無声音２と有声音３とに区分
けする。音声解析部１０で、無声音２と有声音３と無音
４とに区分けされた音声信号ａ₂は一旦録音バッファ１
１へ書込まれて記憶保持される。The analog audio signal a input from the input terminal 7 is converted into a digital audio signal by the A / D converter 8 and then stored in the audio signal memory 9. The voice analysis unit 10 converts the series of digital voice signals a ₁ written in the voice signal memory 9 into the unvoiced sound 2, the voiced sound 3, and the
And is divided into Specifically, the signal level of the audio signal a ₁ is checked, and the sound period 5 and the silent period 6 are classified.
After that, the autocorrelation analysis is performed on the signal of each voiced period 5 to divide the voiced period 5 into the unvoiced sound 2 and the voiced sound 3. The audio signal a ₂ divided into the unvoiced sound 2, the voiced sound 3 and the silent sound 4 by the voice analysis unit 10 is temporarily stored in the recording buffer 1.
1 is written and stored.

【００３４】信号合成部１２は、外部から再生開始指令
が入力されると、この録音バッファ１１に書込まれてい
る音声解析されたデジタルの音声信号ａ₂を取込んで、
この取込んだ音声信号ａ₂における有音期間５における
各母音を構成する複数の有声音３のうち、話速算出部１
９にで指定された話速倍率Ｙに対応した数だけ間引くか
又は追加する。また、この取込んだ音声信号ａ₂におけ
る無音期間６の継続期間を話速算出部１９にて指定され
た話速倍率Ｙに応じて短縮又は伸張する。そして、信号
合成部１２は、入力された音声信号ａ₂における無声音
２と、間引き又は追加後の有声音３と、短縮又は伸張さ
れた無音４とを接続して新たな音声信号ａ₃を合成して
出力する。When a reproduction start command is input from the outside, the signal synthesizing section 12 takes in the digitally analyzed digital audio signal a ₂ written in the recording buffer 11, and
Of the plurality of voiced sounds 3 constituting each vowel in the voiced period 5 of the captured audio signal a _2, the speech speed calculation unit 1
9 is thinned out or added by the number corresponding to the speech speed magnification Y specified in 9. In addition, the duration of the silence period 6 in the captured audio signal a ₂ is shortened or extended according to the speech speed magnification Y specified by the speech speed calculation unit 19. Then, the signal synthesis unit 12 connects the unvoiced sound ₂ in the input audio signal a _2, the voiced sound 3 after thinning or addition, and the shortened or expanded silence 4 to synthesize a new audio signal a ₃ . And output.

【００３５】信号合成部１２から出力された新たな音声
信号ａ₃は出力バッファ１３に一旦格納した後、Ｄ／Ａ
変換１４でアナログの音声信号ａ₄に変換されて、出力
端子１５から出力される。The new audio signal a ₃ output from the signal synthesizing unit 12 is temporarily stored in the output buffer 13 and then stored in the D / A
The signal is converted into an analog audio signal a ₄ by the converter 14 and output from the output terminal 15.

【００３６】したがって、出力端子１５から出力された
新たなアナログの音声信号ａ₄は、入力端子７に入力さ
れたアナログの音声信号ａに対して、指定された話速倍
率Ｙに対する分だけ短縮又は伸張され、その分、再生さ
れた演説又はナレーションの再生時間が短縮又は伸張さ
れる。Therefore, the new analog audio signal a ₄ output from the output terminal 15 is shortened or reduced by the amount corresponding to the specified speech speed magnification Y with respect to the analog audio signal a input to the input terminal 7. The length of the expanded speech or narration is shortened or lengthened accordingly.

【００３７】Ａ／Ｄ変換器８から出力されたデジタルの
音声信号ａ₁は音声信号メモリ９へ書込まれると共に、
録音時間検出部１６へ入力される。録音時間検出部１６
は、図２に示すように、入力された音声信号ａ（ａ₁）
の信号時間長で示される原音録音時間Ｒ₀を検出して、
次の基準話速倍率算出部１８へ送出する。The digital audio signal a ₁ output from the A / D converter 8 is written into the audio signal memory 9 and
It is input to the recording time detection unit 16. Recording time detector 16
Is the input audio signal a (a ₁ ), as shown in FIG.
The original sound recording time R ₀ indicated by the signal time length of
It is sent to the next reference speech speed magnification calculator 18.

【００３８】希望再生時間入力部１７は操作者が操作入
力した、図２に示す、出力端子１５から出力されるアナ
ログの音声信号ａ₄の信号時間長である再生時間Ｒ_Iを次
の基準話速倍率算出部１８へ送出する。The desired reproduction time input unit 17 the operator has operated the input, shown in FIG. 2, the following criteria story playback time R _I is the signal duration of the audio signal a ₄ analog output from the output terminal 15 It is sent to the speed magnification calculator 18.

【００３９】基準話速倍率算出部１８は、入力された音
声信号ａ（ａ₁）の原音録音時間Ｒ₀を出力される音声信
号ａ₄の希望再生時間Ｒ_Iで除算した基準話速倍率Ｍを算
出して次の話速算出部１９へ送出する。The reference speech speed magnification calculator 18 calculates a reference speech speed ratio M by dividing the original sound recording time R ₀ of the input audio signal a (a ₁ ) by the desired reproduction time R _I of the output audio signal a _4. Is calculated and sent to the next speech speed calculation unit 19.

【００４０】Ｍ＝Ｒ₀／Ｒ_I この第１実施形態の話速変換装置の話速算出部１９は、
入力された基準話速倍率Ｍをそのまま話速倍率Ｙとし
て、信号合成部１２へ送出する。M = R ₀ / R _I The speech speed calculator 19 of the speech speed converter of the first embodiment is
The input reference speech speed magnification M is directly transmitted to the signal synthesizing unit 12 as the speech speed magnification Y.

【００４１】Ｙ＝Ｍ前述したように、信号合成部１２は、取込んだデジタル
の音声信号ａ₂を圧縮・伸張して話速倍率Ｙを有するデ
ジタルの音声信号ａ₃として出力する。Y = M As described above, the signal synthesizing unit 12 compresses and expands the captured digital audio signal a ₂ and outputs it as a digital audio signal a ₃ having a speech speed magnification Y.

【００４２】このように構成された第１実施形態の話速
変換装置においては、入力端子７から入力された音声信
号ａの信号時間長を示す原音録音時間Ｒ₀が自動的に測
定される。そして、操作者が希望再生時間Ｒ₁を操作入
力すると、基準話速倍率Ｍが自動的に算出されて、信号
合成部１２へ話速倍率Ｙとして印加される。In the speech speed conversion device of the first embodiment thus configured, the original sound recording time R ₀ indicating the signal time length of the audio signal a inputted from the input terminal 7 is automatically measured. Then, when the operator inputs the desired reproduction time R ₁ , the reference speech speed magnification M is automatically calculated and applied to the signal synthesizing unit 12 as the speech speed magnification Y.

【００４３】したがって、操作者としては、入力された
音声信号ａの原音録音時間Ｒ₀に係わらず、希望再生時
間Ｒ₁を操作入力のみで、高い精度の希望再生時間Ｒ₁を
有する音声信号ａ₄を再生できる。[0043] Thus, as the operator, regardless of the original sound recording time R ₀ of the input speech signal a, only the operation input the desired playback time R _1, audio signal a having a desired playback time R ₁ of high precision ₄ can be played.

【００４４】よって、放送局のように放送時間を秒単位
で制御する環境下でこの話速変換装置を使用する場合
に、高い精度の希望再生時間Ｒ₁が確保されるので、こ
の話速変換装置の使い勝手を大幅に向上できる。[0044] Therefore, when using this speech speed converting device in an environment controlled in seconds airtime as broadcasters, since high accuracy desired playback time R ₁ is ensured, the speech speed conversion The usability of the device can be greatly improved.

【００４５】（第２実施形態）図３は本発明の第２実施
形態に係わる話速変換装置の概略構成を示すブロック図
である。図１に示す第１実施形態の話速変換装置と同一
部分には同一符号を付して、重複する部分の詳細説明を
省略する。(Second Embodiment) FIG. 3 is a block diagram showing a schematic configuration of a speech speed conversion device according to a second embodiment of the present invention. The same parts as those of the speech speed conversion device of the first embodiment shown in FIG. 1 are denoted by the same reference numerals, and detailed description of the overlapping parts will be omitted.

【００４６】この第２実施形態の話速変換装置において
は、音声解析部１０から出力された、有音期間５、無音
期間６、無声音２、有声音３、無音４に区分（解析）さ
れたデジタルの音声信号ａ₂は順次録音バッファ１１に
書込まれると共に、累積有音期間算出部２０及び累積無
音期間算出部２１へ入力される。In the speech speed conversion device of the second embodiment, the speech output from the speech analyzer 10 is divided (analyzed) into a voiced period 5, a silent period 6, an unvoiced sound 2, a voiced sound 3, and a silence 4. The digital audio signal a ₂ is sequentially written into the recording buffer 11 and is input to the cumulative sound period calculating unit 20 and the cumulative silent period calculating unit 21.

【００４７】累積有音期間算出部２０は、図２に示すよ
うに、入力された１原音録音時間Ｒ ₀分の音声信号ａ
₂（ａ）に含まれる全ての有音期間Ｔ₁を累積した累積有
音期間Ｔ_S1を算出して話速算出部１９ａへ送出する。同
様に、累積無音期間算出部２１は、図２に示すように、
入力された１原音録音時間Ｒ₀分の音声信号ａ₂（ａ）に
含まれる全ての無音期間Ｔ₀を累積した累積無音期間Ｔ
_S0を算出して話速算出部１９ａへ送出する。The cumulative sound period calculation unit 20 is configured as shown in FIG.
Thus, the input one original sound recording time R ₀Minute audio signal a
_TwoAll sound periods T included in (a)₁Has accumulated
Sound period T_S1Is calculated and sent to the speech speed calculation unit 19a. same
As shown in FIG. 2, the cumulative silence period calculation unit 21
Input original sound recording time R₀Minute audio signal a_Two(A)
All included silent periods T₀Cumulative silence period T
_S0Is calculated and sent to the speech speed calculation unit 19a.

【００４８】さらに、再生指示が入力され、信号合成部
１２が録音バッファ１１に記憶されたデジタルの音声信
号ａ₂の読出しを開始すると、録音バッファ１１から音
声信号ａ₂が信号合成部１２へ入力されると共に、話速
算出部１９ａへ入力される。したがって、話速算出部１
９ａには、デジタルの音声信号ａ₂が入力開始前に、基
準話速倍率Ｍ、累積有音期間Ｔ_S1、累積無音期間Ｔ_S0
が入力されている。Further, when a reproduction instruction is input and the signal synthesizing unit 12 starts reading out the digital audio signal a ₂ stored in the recording buffer 11, the audio signal a ₂ is input from the recording buffer 11 to the signal synthesizing unit 12. At the same time, it is input to the speech speed calculation unit 19a. Therefore, the speech speed calculation unit 1
The 9a, the audio signal a ₂ is input before the start of the digital, the reference speech rate magnification M, the cumulative voiced period T _S1, accumulated silence period T _S0
Is entered.

【００４９】そして、この話速算出部１９ａは図４に示
す流れ図に従って、話速倍率Ｙを算出して信号合成部１
２へ送出する処理を実施する。The speech speed calculator 19a calculates the speech speed magnification Y in accordance with the flowchart shown in FIG.
2 is executed.

【００５０】前述したように、基準話速倍率Ｍ、累積有
音期間Ｔ_S1、累積無音期間Ｔ_S0 を取込む（Ｐ１）。次
に、有音目標話速倍率Ｎ₁を算出する（Ｐ２）。この有
音目標話速倍率Ｎ₁は、各無音期間Ｔ_S0はそのままで、
各有音期間Ｔ_S1を圧縮・伸張して希望再生時Ｒ₁を得る
ために、各有音期間Ｔ_S1に作用させるため話速倍率Ｙで
ある。この有音目標話速倍率Ｎ₁は下式から導かれる。As described above, the reference speech speed magnification M, the accumulated sound period T _S1 , and the accumulated silence period T _S0 are taken (P1). Then, to calculate the sound target speech speed ratio N ₁ (P2). This voiced target speech speed magnification N ₁ is the same as each silent period T _S0 ,
The speech speed magnification Y is applied to each sound period T _S1 in order to compress / expand each sound period T _S1 to obtain the desired reproduction time R ₁ . The voiced target speech speed ratio N ₁ is derived from the following formula.

【００５１】（Ｔ_S1／Ｎ₁）＋Ｔ_S0＝（Ｔ_S1＋Ｔ_S0）／ＭＮ₁＝（Ｍ・Ｔ_S1）／（Ｔ_S1＋Ｔ_S0―Ｍ・Ｔ_S0）そして、録音バッファ１１から出力されたデジタルの音
声信号ａ₂が入力開始されると（Ｐ３）、例えば、０．
０１秒等の微小時間Δｔの経過を待って（Ｐ４）、この
デジタルの音声信号ａ₂が終了していないことを確認し
（Ｐ５）、現在時点におけるデジタルの音声信号ａ₂の
信号状態が有音期間５（Ｔ₁）の場合は（Ｐ６）、先に
求めた有音目標話速倍率Ｎ₁を話速倍率Ｙとして信号合
成部１２へ送出する（Ｐ７）。Ｙ＝Ｎ₁現在時点におけ
るデジタルの音声信号ａ₂の信号状態が無音期間６
（Ｔ₀）の場合は（Ｐ６）、無音期間６（Ｔ₀）変更なし
指示となる、話速倍率１を話速倍率Ｙとして信号合成部
１２へ送出する（Ｐ８）。Ｙ＝１そして、Ｐ５にて、デジタルの音声信号ａ₂が終了する
と、この話速倍率算出処理を終了する。(T _S1 / N ₁ ) + T _S0 = (T _S1 + T _S0 ) / M N ₁ = (M · T _S1 ) / (T _S1 + T _S0 -M · T _S0 ) and the digital audio signal a ₂ is initiated input (P3), for example, 0.
After waiting for the minute time Δt of 01 seconds, etc. (P4), to confirm that the audio signal a ₂ of the digital has not been completed (P5), the digital signal state of the audio signal a ₂ of the present time Yes In the case of the sound period 5 (T ₁ ) (P 6), the sound target speech speed magnification N ₁ obtained above is transmitted to the signal synthesizing unit 12 as the speech speed magnification Y (P 7). Y = N ₁ The signal state of the digital audio signal a _{2 at} the current time is a silent period 6
In the case of (T ₀ ), (P 6), the speech speed magnification 1 is transmitted to the signal synthesizing unit 12 as the speech speed magnification Y, which is an instruction for no change in the silence period 6 (T ₀ ) (P 8). Y = 1 Then, at P5, the audio signal a ₂ digital is completed, ends the speech speed ratio calculation process.

【００５２】このように、構成された第２実施形態の話
速変換装置においては、先に説明した第１実施形態の話
速変換装置と同様に、希望再生時間Ｒ₁を指定すると、
この希望再生時間Ｒ₁を有する音声信号ａ₄を再生するこ
ことができる。In the speech speed converter of the second embodiment configured as described above, similarly to the speech speed converter of the first embodiment described above, when the desired reproduction time R ₁ is designated,
An audio signal a ₄ having the desired playback time R ₁ can child play.

【００５３】さらに、この第２実施形態の話速変換装置
においては、話速算出部１９ａから、信号合成部１２へ
入力される話速倍率Ｙは、無音期間６（Ｔ₀）で１に設
定され、すなわち無音期間６（Ｔ₀）変更なしに設定さ
れ、有音期間５（Ｔ₁）で有音目標話速倍率Ｎ₁に設定さ
れる。したがって、たとえ入力された音声信号ａが大幅
に短縮又は伸張されたとしても、言葉と言葉との間の無
音期間６は変化されずに確保されるので、より自然に聞
こえる。Further, in the speech speed conversion device of the second embodiment, the speech speed magnification Y input from the speech speed calculation unit 19a to the signal synthesis unit 12 is set to 1 during the silent period 6 (T ₀ ). That is, the soundless period 6 (T ₀ ) is set without change, and the sound target speech speed magnification N ₁ is set in the sound period 5 (T ₁ ). Therefore, even if the input audio signal a is greatly shortened or expanded, the silence period 6 between words is secured without being changed, so that it sounds more natural.

【００５４】（第３実施形態）図５は本発明の第３実施
形態の話速変換装置に組込まれた話速算出部１９ａの話
速倍率Ｙの算出処理を示す流れ図である。なお、この第
３実施形態の話速変換装置のブロック構成図は、図３に
示した第２実施形態の話速変換装置のブロック構成図と
同じであるので説明を省略する。異なるところは、話速
算出部１９ａの話速倍率Ｙの算出処理内容のみである。(Third Embodiment) FIG. 5 is a flowchart showing a process of calculating a speech speed magnification Y of a speech speed calculating unit 19a incorporated in a speech speed conversion device according to a third embodiment of the present invention. The block diagram of the speech speed converter of the third embodiment is the same as the block diagram of the speech speed converter of the second embodiment shown in FIG. The only difference is the content of the processing for calculating the speech speed magnification Y by the speech speed calculation unit 19a.

【００５５】そして、第３実施形態の話速算出部１９ａ
は図５に示す流れ図に従って、話速倍率Ｙを算出して信
号合成部１２へ送出する処理を実施する。Then, the speech speed calculator 19a of the third embodiment
Performs a process of calculating the speech speed magnification Y and sending it to the signal synthesizing unit 12 according to the flowchart shown in FIG.

【００５６】図４と同様に、基準話速倍率Ｍ、累積有音
期間Ｔ_S1、累積無音期間Ｔ_S0 を取込んで（Ｑ１）、無
音目標期間を得るための無音目標話速倍率Ｎ₀を算出す
る（Ｑ２）。この無音目標期間を得るための無音目標話
速倍率Ｎ₀は、各有音期間Ｔ_S ₁はそのままで、各無音期
間Ｔ_S0を圧縮・伸張して希望再生時Ｒ₁を得るために、
各無音期間Ｔ_S0に作用させるため話速倍率Ｙである。こ
の無音目標話速倍率Ｎ₀は下式から導かれる。As in FIG. 4, the reference speech speed magnification M, the accumulated speech period T _S1 , and the accumulated silence period T _S0 are taken in (Q1), and the target silence target speech speed N ₀ for obtaining the target silence period is calculated. It is calculated (Q2). Silence target speech speed magnification N ₀ for obtaining the silence target period, each voice period T _S ₁ is intact, in order to obtain a desired playback R ₁ and compression and expansion of each silent period T _S0,
This is the speech speed magnification Y to act on each silent period T _S0 . This silent target speech speed magnification N ₀ is derived from the following equation.

【００５７】Ｔ_S1＋（Ｔ_S0／Ｎ₀）＝（Ｔ_S1＋Ｔ_S0）／ＭＮ₀＝（Ｍ・Ｔ_S0）／（Ｔ_S1＋Ｔ_S0―Ｍ・Ｔ_S1）そして、録音バッファ１１から出力されたデジタルの音
声信号ａ₂が入力開始されると（Ｑ３）、例えば、０．
０１秒等の微小時間Δｔの経過を待って（Ｑ４）、この
デジタルの音声信号ａ₂が終了していないことを確認し
（Ｑ５）、現在時点におけるデジタルの音声信号ａ₂の
信号状態が無音期間６（Ｔ₀）の場合は（Ｑ６）、先に
求めた無音目標話速倍率Ｎ₀を話速倍率Ｙとして信号合
成部１２へ送出する（Ｑ７）。Ｙ＝Ｎ₀ 現在時点におけるデジタルの音声信号ａ₂の信号状態が
有音期間５（Ｔ₁）の場合は（Ｑ６）、話速倍率１を話
速倍率Ｙとして信号合成部１２へ送出する（Ｑ８）。Ｙ＝１そして、Ｑ５にて、デジタルの音声信号ａ₂が終了する
と、この話速倍率算出処理を終了する。T _S1 + (T _S0 / N ₀ ) = (T _S1 + T _S0 ) / M N ₀ = (M · T _S0 ) / (T _S1 + T _S0 −M · T _S1 ) and output from the recording buffer 11. It has been the digital audio signal a ₂ is initiated input (Q3), for example, 0.
After waiting for the minute time Δt of 01 seconds, etc. (Q4), and confirm that the audio signal a ₂ of the digital has not been completed (Q5), the digital signal state of the audio signal a ₂ of the present time silence In the case of the period 6 (T ₀ ) (Q 6), the silence target speech speed magnification N ₀ obtained above is transmitted to the signal synthesizing unit 12 as the speech speed magnification Y (Q 7). Y = N ₀ digital signal state of the audio signal a ₂ of the present time is in the case of voiced period _{5 (T 1) (Q6)} , and sends to the signal combining unit 12 speech speed ratio 1 as speech speed ratio Y ( Q8). Y = 1 Then, at Q5, when the audio signal a ₂ digital is completed, ends the speech speed ratio calculation process.

【００５８】このように構成された第３話速変換装置に
おいては、先に説明した第１実施形態の話速変換装置と
同様に、希望再生時間Ｒ₁を指定すると、この希望再生
時間Ｒ₁を有する音声信号ａ₄を再生するこことができ
る。In the third speech speed converter constructed as described above, when the desired playback time R ₁ is designated, similarly to the speech speed converter of the first embodiment described above, the desired playback time R _{1 is set.} can child playing audio signal a ₄ with.

【００５９】さらに、この第３実施形態の話速変換装置
においては、話速算出部１９ａから信号合成部１２へ入
力される話速倍率Ｙは、有音期間５（Ｔ₁）で１に設定
され、無音期間６（Ｔ₀）で無音目標話速倍率Ｎ₀に設定
される。Further, in the speech speed conversion device according to the third embodiment, the speech speed magnification Y input from the speech speed calculation unit 19a to the signal synthesis unit 12 is set to 1 in the sound period 5 (T ₁ ). Then, in the silence period 6 (T ₀ ), the silence target speech speed magnification N ₀ is set.

【００６０】したがって、例えば、無音期間６（Ｔ₀）
が多い、間延びした演説やナレーションを希望再生時間
Ｒ₁に短縮する場合に、この手法を採用することによっ
て、引き締まった聞き易い再生音声とすることができ
る。Therefore, for example, the silent period 6 (T ₀ )
There are many, in the case of shortening the speech and narration that was slow to the desired playback time R _1, by adopting this approach, it is possible to be that tight to hear easy playback voice.

【００６１】（第４実施形態）図６は本発明の第４実施
形態の話速変換装置に組込まれた話速算出部１９ａから
信号合成部１２へ印加される話速倍率Ｙの時間特性を示
す図である。なお、この第４実施形態の話速変換装置の
ブロック構成図は、図３に示した第２実施形態の話速変
換装置のブロック構成図と同じであるので説明を省略す
る。異なるところは、話速算出部１９ａの話速倍率Ｙの
算出処理内容のみである。(Fourth Embodiment) FIG. 6 shows the time characteristic of the speech speed magnification Y applied from the speech speed calculator 19a incorporated in the speech speed converter of the fourth embodiment of the present invention to the signal synthesizer 12. FIG. The block diagram of the speech speed converter of the fourth embodiment is the same as the block diagram of the speech speed converter of the second embodiment shown in FIG. The only difference is the content of the processing for calculating the speech speed magnification Y by the speech speed calculation unit 19a.

【００６２】この話速算出部１９ａは、図６に示すよう
に、この話速算出部１９ａに入力されたデジタルの音声
信号ａ₃の開始時刻（ｔ＝０）から規定時間Ｔ_B経過する
までの期間内に、時間経過に伴って予め設定された初期
話速倍率Ｂ（＝１）から基準話速倍率Ｍ近傍の目標話速
倍率Ｎまで変化させ、かつ規定時間Ｔ経過後に、目標話
速倍率Ｎを維持する話速倍率Ｙを順次算出して信号合成
部１２へ送出する。[0062] The speech speed calculation unit 19a, as shown in FIG. 6, until the specified time T _B has elapsed from the start time of the digital audio signal a ₃ which is input to the speech rate calculating section 19a (t = 0) Is changed from an initial speech speed magnification B (= 1) set in advance with time to a target speech speed N near the reference speech speed M, and after a lapse of a specified time T, the target speech speed is increased. The speech speed magnification Y that maintains the magnification N is sequentially calculated and sent to the signal synthesis unit 12.

【００６３】具体的には、この話速算出部１９ａは、図
７に示す流れ図に従って、話速倍率Ｙを算出して信号合
成部１２へ送出する処理を実施する。More specifically, the speech speed calculation unit 19a performs a process of calculating the speech speed magnification Y and sending it to the signal synthesis unit 12 in accordance with the flowchart shown in FIG.

【００６４】先ず、基準話速倍率Ｍ、規定時間Ｔ_B、原
音録音時間Ｒ₀を取込んで（Ｓ１）、目標話速倍率Ｎを
算出する（Ｓ２）。具体的には、図６に示すように、話
速倍率Ｙが基準話速倍率Ｍを下回る面積Ｓ₁と、話速倍
率Ｙが基準話速倍率Ｍを上回る面積Ｓ₂とが等しくなる
ように目標話速倍率Ｎを算出する。First, the reference speech speed magnification M, the specified time T _B , and the original sound recording time R ₀ are taken in (S1), and the target speech speed magnification N is calculated (S2). Specifically, as shown in FIG. 6, the area S ₁ where the voice speed magnification Y is lower than the reference voice speed M is equal to the area S ₂ where the voice speed Y exceeds the reference voice speed M. The target speech speed magnification N is calculated.

【００６５】Ｓ₁＝（Ｍ―１）Ｔ_B／２Ｓ₂＝（Ｎ―Ｍ）（Ｒ₀―Ｔ_B）Ｎ＝［（Ｍ―１）Ｔ_B／２（Ｒ_0―Ｔ_B）］＋Ｍそして、録音バッファ１１から出力されたデジタルの音
声信号ａ₂が入力開始されると、Ｓ３にて、経過時間ｔ
を初期化する（ｔ＝０）。例えば、０．０１秒等の微小
時間Δｔの経過を待って（Ｓ４）、Ｓ５にて経過時間ｔ
を更新する（ｔ＝ｔ＋Δｔ）。そして、更新後の経過時
間ｔが規定時間Ｔ_B未満の場合（Ｓ６）、下式に示す話
速倍率Ｙの算出を行う（Ｓ７）。Ｙ＝［（Ｎ―１）／Ｔ_B］ｔ＋１算出した話速倍率Ｙを信号合成部１２へ送出する（Ｓ
８）。そして、Ｓ４へ戻り、次の微小時間Δｔの経過を
待つ。S ₁ = (M−1) T _B / 2 S ₂ = (N−M) (R ₀ −T _B ) N = [(M−1) T _{B /} 2 (R ₀ −T _B )] + M When the input of the digital audio signal a ₂ output from the recording buffer 11 is started, the elapsed time t
Is initialized (t = 0). For example, after elapse of a minute time Δt such as 0.01 second (S4), the elapsed time t is determined in S5.
Is updated (t = t + Δt). When the elapsed time t after the update is less than the predetermined time T _B (S6), calculates the speech speed ratio Y shown in the following equation (S7). Y = [(N−1) / T _B ] t + 1 The calculated speech speed magnification Y is sent to the signal synthesis unit 12 (S
8). Then, the process returns to S4 and waits for the elapse of the next minute time Δt.

【００６６】Ｓ６にて、更新後の経過時間ｔが規定時間
Ｔ_Bに達すると、目標話速倍率Ｙ＝Ｎを信号合成部９へ
送出する。At S 6, when the elapsed time t after the update reaches the specified time T _B , the target speech speed magnification Y = N is sent to the signal synthesizing section 9.

【００６７】このように構成された第４実施形態の話速
変換装置においては、図６に示すように、時刻（経過時
間）ｔ＝０で録音バッファ１１から解析済みのデジタル
の音声信号ａ₂が信号合成部１２及び話速算出部１９ａ
へ入力開始されると、出力端子１５から出力される音声
信号ａ₄の話速は通常話速（Ｙ＝１）である。そして、
経過時間ｔが増加すると、話速倍率Ｙも増加する。In the speech speed converter according to the fourth embodiment thus constructed, as shown in FIG. 6, a digital audio signal a ₂ analyzed from the recording buffer 11 at time (elapsed time) t = 0. Is the signal synthesis unit 12 and the speech speed calculation unit 19a
If the input starts to speech speed of the speech signal a ₄ output from the output terminal 15 is typically speech speed (Y = 1). And
As the elapsed time t increases, the speech speed magnification Y also increases.

【００６８】そして、経過時間ｔが規定時間Ｔ_Bに達す
ると、話速倍率Ｙが基準話速倍率Ｍを若干上回る目標話
速倍率Ｎに達する（Ｙ＝Ｎ）。規定時間Ｔ_Bを経過した
後は、話速倍率Ｙは目標話速倍率Ｎを維持する。[0068] When the elapsed time t reaches the predetermined time T _B, it reaches the target speech speed ratio N over speech speed ratio Y slightly reference speech speed magnification M is (Y = N). After a lapse of specified time T _B is speaking rate ratio Y maintains the target speech speed magnification N.

【００６９】したがって、この第４実施形態の話速変換
装置を採用することによって、先に説明した第１実施形
態の話速変換装置と同様に、希望再生時間Ｒ₁を指定す
ると、この希望再生時間Ｒ₁を有する音声信号ａ₄を再生
するこことができる。さらに、演説やナレーションの冒
頭部分のみ通常に近い話速でその後は目標話速Ｎの話速
なる。よって、演説やナレーションの冒頭部分を聞き逃
すことはない。Therefore, by adopting the speech speed conversion device of the fourth embodiment, when the desired playback time R ₁ is designated, as in the speech speed conversion device of the first embodiment described above, the desired playback time is designated. can child playing audio signal a ₄ having a time R _1. Further, only the beginning of the speech or the narration has a speech speed close to normal, and thereafter the speech speed becomes the target speech speed N. Therefore, you will not miss the beginning of your speech or narration.

【００７０】（第５実施形態）図８は本発明の第５実施
形態の話速変換装置に組込まれた話速算出部１９ａから
信号合成部１２へ印加される話速倍率Ｙの時間特性を示
す図である。なお、この第５実施形態の話速変換装置の
ブロック構成図は、図３に示した第２実施形態の話速変
換装置のブロック構成図と同じであるので説明を省略す
る。異なるところは、話速算出部１９ａの話速倍率Ｙの
算出処理内容のみである。(Fifth Embodiment) FIG. 8 shows the time characteristic of the speech speed magnification Y applied from the speech speed calculator 19a incorporated in the speech speed converter of the fifth embodiment of the present invention to the signal synthesizer 12. FIG. The block diagram of the speech speed conversion device of the fifth embodiment is the same as the block diagram of the speech speed conversion device of the second embodiment shown in FIG. The only difference is the content of the processing for calculating the speech speed magnification Y by the speech speed calculation unit 19a.

【００７１】この話速算出部１９ａは、図８に示すよう
に、この話速算出部１９ａに入力されたデジタルの音声
信号ａ₃の有音期間５（Ｔ₁）の開始時刻（ｔ_S＝０）か
らの経過期間ｔ_Sが規定時間Ｔ_B経過するまでの期間内
に、時間経過に伴って予め設定された初期話速倍率Ｂ
（＝１）から基準話速倍率Ｍ近傍の目標話速倍率Ｎまで
変化させ、かつ規定時間Ｔ_B経過後に、目標話速倍率Ｎ
を維持し、さらに、デジタルの音声信号ａ₃が経過期間
ｔ_S＝ｔ_Eで無音期間６（Ｔ₀）に変化すると、１とな
り、次の有音期間５（Ｔ₁）まで１を維持する話速倍率
Ｙを順次算出して信号合成部１２へ送出する。但し、無
音期間６の継続期間ｔ_Qがしきい値時間Ｔ_SHより短い場
合は、話速倍率Ｙは１に戻らない。As shown in FIG. 8, the speech speed calculating unit 19a starts the sounding period 5 (T ₁ ) of the digital voice signal a ₃ input to the speech speed calculating unit 19a (t _S = T _S ). in age period t to _S defines time T _B has elapsed from 0), is set in advance with time the initial speech speed ratio B
(= 1) is changed to the target speech speed magnification N of the reference speech rate ratio M vicinity of and after the predetermined time T _B has elapsed, the target speech speed magnification N
Further, when the digital audio signal a ₃ changes to the silent period 6 (T ₀ ) during the elapsed time t _S = t _E, it becomes 1, and maintains ₁ until the next voiced period 5 (T ₁ ). The speech speed magnification Y is sequentially calculated and sent to the signal synthesizing unit 12. However, when the duration t _Q of the silent period 6 is shorter than the threshold time T _SH , the speech speed magnification Y does not return to 1.

【００７２】具体的には、この話速算出部１９ａは、図
１０に示す流れ図に従って、話速倍率Ｙを算出して信号
合成部１２へ送出する処理を実施する。More specifically, the speech speed calculation unit 19a performs a process of calculating the speech speed magnification Y and sending it to the signal synthesis unit 12 in accordance with the flowchart shown in FIG.

【００７３】先ず、基準話速倍率Ｍ、規定時間Ｔ_B、話
し始め話速倍率Ｂを取込んで（A１）、目標話速倍率Ｎ
を算出する（Ａ２）。具体的には、図６に示すように、
話速倍率Ｙが基準話速倍率Ｍを下回る面積Ｓ₁と、話速
倍率Ｙが基準話速倍率Ｍを上回る面積Ｓ₂とが等しくな
るように目標話速倍率Ｎを算出する。なお、有音期間５
の時間長Ｔ₁（ｔ_S＝０〜ｔ_S＝ｔ_E）は単語や文節によっ
てまちまちであるので、平均的な値を用いて目標話速倍
率Ｎを算出する。First, the reference speech speed M, the specified time T _B , and the speech speed B at the start of speech are taken in (A1), and the target speech speed N is obtained.
Is calculated (A2). Specifically, as shown in FIG.
The target speech speed magnification N is calculated so that an area S ₁ where the speech speed magnification Y is lower than the reference speech speed magnification M is equal to an area S ₂ where the speech speed magnification Y is higher than the reference speech speed magnification M. Note that the sound period 5
Since the time length T ₁ (t _S = 0 to t _S = t _E ) varies depending on the word or phrase, the target speech speed magnification N is calculated using an average value.

【００７４】そして、録音バッファ１１から出力された
デジタルの音声信号ａ₂が入力開始されると、例えば、
０．０１秒等の微小時間Δｔの経過を待って（Ａ３）、
A４にてデジタルの音声信号ａ₂のその時点での解析結果
を取込む（Ａ４）。そして、デジタルの音声信号ａ₂が
有音期間５であれば（Ａ５）、有音期間フラグの状態を
調べる（Ａ６）。０に解除されたままであると、今回初
めて有音期間５に入ったと判断して、Ａ７にて、有音期
間フラグを１に設定するとともに、有音経過時間ｔ_Sを
初期化する（ｔ_S＝０）。さらに、無音股間フラグを０
に解除する（Ａ８）。When the input of the digital audio signal a ₂ output from the recording buffer 11 is started, for example,
After a lapse of a minute time Δt such as 0.01 second (A3),
Taking the analysis results at the time of the digital audio signal a ₂ by A4 (A4). Then, if the digital audio signal a ₂ has the sound period 5 (A5), the state of the sound period flag is checked (A6). If it remains released to 0, it is determined that the sound period 5 has entered for the first time this time, and the sound period flag is set to 1 at A7, and the sound elapsed time t _S is initialized (t _S = 0). Further, the silent crotch flag is set to 0.
(A8).

【００７５】なお、Ａ６にて、既に有音期間フラグが１
に設定されたままであると、前回以前に有音期間５に入
ったと判断して、Ａ９にて、有音経過時間ｔ_Sを更新す
る（ｔ_S＝ｔ_S＋Δｔ）。At A6, the sound period flag is already 1
If it is still set to, it is determined that the sound period 5 has entered before the previous time, and the sound elapsed time t _S is updated at A9 (t _S = t _S + Δt).

【００７６】そして、下式で示す話速倍率Ｙを算出する
（Ａ１０）。Then, the speech speed magnification Y represented by the following equation is calculated (A10).

【００７７】Ｙ＝Ｎ―（Ｎ―Ｂ）exp［―ｔ_S／（Ｔ_B／５）］この算出した話速倍率Ｙを信号合成部１２へ送出する
（Ａ１１）。そして、Ａ３へ戻り、次の微小時間Δｔが
経過するのを待つ。Y = N− (NB) exp [−t _S / (T _B / 5)] The calculated speech speed magnification Y is sent to the signal synthesis unit 12 (A 11). Then, the process returns to A3 and waits for the next minute time Δt to elapse.

【００７８】また、Ａ５にて、現在時点でデジタルの音
声信号ａ₂が有音期間５でなくて無音期間６の場合であ
れば、無音期間フラグの状態を調べる（Ａ１２）。０に
解除されたままであると、今回初めて無音期間６に入っ
たと判断して、Ａ１３にて、無音期間フラグを１に設定
するとともに、無音経過時間ｔ_Qを初期化する（ｔ_Q＝
０）。そして、Ａ９へ進み、有音経過時間ｔ_Sを更新す
る（ｔ_S＝ｔ_S＋Δｔ）。さらに、前述した話速倍率Ｙを
算出して（Ａ１０）、信号合成部１２へ送出する（Ａ１
１）。そして、Ａ３へ戻り、次の微小時間Δｔが経過す
るのを待つ。[0078] Also, at A5, in the case the digital audio signal a ₂ at the current point of silent period 6 not voiced period 5, determine the status of the silent period flag (A12). If it is kept at 0, it is determined that the silence period 6 has entered for the first time this time. At A13, the silence period flag is set to 1 and the silence elapsed time t _Q is initialized (t _Q =
0). Then, the process proceeds to A9, and the sound elapsed time t _S is updated (t _S = t _S + Δt). Further, the above-mentioned speech speed magnification Y is calculated (A10) and transmitted to the signal synthesizing unit 12 (A1).
1). Then, the process returns to A3 and waits for the next minute time Δt to elapse.

【００７９】Ａ１２にて、既に無音期間フラグが１に設
定されたままであると、前回以前に無音期間６に入った
と判断して、Ａ１４にて、無音経過時間ｔ_Qを更新する
（ｔ_Q＝ｔ_Q＋Δｔ）。そして、更新後の無音経過時間ｔ
_Qが予め設定されているしきい値Ｔ_SHを超えたか否かを
調べる（Ａ１５）。At A12, if the silent period flag is already set to 1, it is determined that the silent period 6 has been entered before the previous time, and the silent elapsed time t _Q is updated at A14 (t _Q = t _Q + Δt). Then, the silence elapsed time t after the update
It is checked whether or not _Q has exceeded a preset threshold value T _SH (A15).

【００８０】無音経過時間ｔ_Qがしきい値時間Ｔ_SHを超
えていなければ、Ａ９へ進み、有音経過時間ｔ_Sを更新
する（ｔ_S＝ｔ_S＋Δｔ）。さらに、話速倍率Ｙを算出し
て（Ａ１０）、信号合成部１２へ送出する（Ａ１１）。
そして、Ａ３へ戻り、次の微小時間Δｔが経過するのを
待つ。If the silence elapsed time t _Q does not exceed the threshold time T _SH , the process proceeds to A 9, where the speech elapsed time t _S is updated (t _S = t _S + Δt). Further, the speech speed magnification Y is calculated (A10) and transmitted to the signal synthesizing unit 12 (A11).
Then, the process returns to A3 and waits for the next minute time Δt to elapse.

【００８１】Ａ１５にて、無音経過時間ｔ_Qがしきい値
時間Ｔ_SHを超えると、初めて有音期間フラグを０に解除
する（Ａ１６）。そして、Ａ３へ戻り、次の微小時間Δ
ｔが経過するのを待つ。At A15, when the silence elapsed time t _Q exceeds the threshold time T _SH , the sound period flag is reset to 0 for the first time (A16). Then, returning to A3, the next minute time Δ
Wait for t to elapse.

【００８２】このように構成された第５実施形態の話速
変換装置においては、図９に示すように、録音バッファ
１１から解析済みのデジタルの音声信号ａ₂が信号合成
部１２及び話速算出部１９ａへ入力開始されると、出力
端子１５から出力される音声信号ａ₄の話速倍率Ｙは、
有音期間５（Ｔ₁）が開始される毎に、通常話速に近い
話し開始話速倍率Ｂから、規定時間Ｔ_B内に、基準話速
倍率Ｍ近傍の目標話速倍率Ｎまで増加し、該当有音期間
５（Ｔ₁）が継続する限りは、目標話速倍率Ｎを維持す
る。In the speech speed converter according to the fifth embodiment thus configured, as shown in FIG. 9, the analyzed digital audio signal a ₂ is recorded from the recording buffer 11 by the signal synthesizer 12 and the speech speed calculator. If the input starts to section 19a, speech speed ratio Y of the audio signal a ₄ output from the output terminal 15,
Each time the sound period 5 (T ₁₎ is started, an increase from the start speech speed ratio B talk close to normal speech speed, a specified time T in _B, until the target speech speed magnification N of the reference speech rate ratio M vicinity As long as the corresponding sound period 5 (T ₁ ) continues, the target speech speed magnification N is maintained.

【００８３】したがって、この第５実施形態の話速変換
装置を採用することによって、先に説明した第１実施形
態の話速変換装置と同様に、希望再生時間Ｒ₁を指定す
ると、この希望再生時間Ｒ₁を有する音声信号ａ₄を再生
するこことができる。Therefore, when the desired reproduction time R ₁ is designated by adopting the speech speed conversion device of the fifth embodiment, similarly to the speech speed conversion device of the first embodiment described above, the desired reproduction time is designated. can child playing audio signal a ₄ having a time R _1.

【００８４】さらに、演説やナレーションにおける音声
が一定のしきい値時間Ｔ_SHを超えると、話速倍率Ｙがほ
ぼ通常の値Ｂにも戻るので、次の話し始めは通常の話速
となるので、非常に聞き易くなる。Further, when the speech in the speech or narration exceeds a certain threshold time T _SH , the speech speed magnification Y returns to almost the normal value B, so that the next speech starts at the normal speech speed. , Very easy to hear.

【００８５】[0085]

【発明の効果】以上説明したように、本発明の話速変換
装置においては、希望再生時間を与えるのみで任意時間
長を有した音声信号を、違和感なく与えられた希望再生
時間で正確に再生できる。したがって、使い勝手を大幅
に向上できる。As described above, in the speech speed converter according to the present invention, an audio signal having an arbitrary time length can be accurately reproduced at a given desired reproduction time without giving a sense of incongruity only by giving a desired reproduction time. it can. Therefore, usability can be greatly improved.

[Brief description of the drawings]

【図１】本発明の第１実施形態に係わる話速変換装置の
概略構成を示すブロック図FIG. 1 is a block diagram showing a schematic configuration of a speech speed conversion device according to a first embodiment of the present invention;

【図２】録音及び再生する音声信号の信号構成を示す模
式図FIG. 2 is a schematic diagram showing a signal configuration of an audio signal to be recorded and reproduced;

【図３】本発明の第２実施形態に係わる話速変換装置の
概略構成を示すブロック図FIG. 3 is a block diagram showing a schematic configuration of a speech speed conversion device according to a second embodiment of the present invention.

【図４】同第２実施形態に係わる話速変換装置に組込ま
れた話速算出部の話速倍率の算出処理を示す流れ図FIG. 4 is a flowchart showing a speech speed multiplication calculation process of a speech speed calculation unit incorporated in the speech speed conversion device according to the second embodiment;

【図５】本発明の第３実施形態に係わる話速変換装置に
組込まれた話速算出部の話速倍率の算出処理を示す流れ
図FIG. 5 is a flowchart showing a process of calculating a speech speed magnification of a speech speed calculation unit incorporated in a speech speed conversion device according to a third embodiment of the present invention.

【図６】本発明の第４実施形態に係わる話速変換装置に
組込まれた話速算出部から出力された話速倍率の変化を
示す図FIG. 6 is a diagram showing a change in a speech speed magnification output from a speech speed calculation unit incorporated in a speech speed conversion device according to a fourth embodiment of the present invention.

【図７】同第４実施形態に係わる話速変換装置に組込ま
れた話速算出部の話速倍率の算出処理を示す流れ図FIG. 7 is a flowchart showing a speech speed multiplication calculation process of a speech speed calculation unit incorporated in the speech speed conversion device according to the fourth embodiment;

【図８】本発明の第５実施形態に係わる話速変換装置に
組込まれた話速算出部から出力された話速倍率の変化を
示す図FIG. 8 is a diagram showing a change in a speech speed magnification output from a speech speed calculation unit incorporated in a speech speed conversion device according to a fifth embodiment of the present invention.

【図９】同じく第５実施形態に係わる話速変換装置に組
込まれた話速算出部から出力された話速倍率の変化を示
す図FIG. 9 is a diagram showing a change in a speech speed magnification output from a speech speed calculator incorporated in the speech speed converter according to the fifth embodiment.

【図１０】同第５実施形態に係わる話速変換装置に組込
まれた話速算出部の話速倍率の算出処理を示す流れ図FIG. 10 is a flowchart showing a speech speed magnification calculation process of a speech speed calculation unit incorporated in the speech speed conversion device according to the fifth embodiment.

【図１１】一般的な音声信号波形を示す図FIG. 11 is a diagram showing a general audio signal waveform.

【図１２】一般的な音声信号の詳細を示す図FIG. 12 is a diagram showing details of a general audio signal.

[Explanation of symbols]

２…無声音３…有声音４…無音５…有音期間５…無音期間８…Ａ／Ｄ変換器９…音声信号メモリ１０…音声解析部１１…録音バッファ１２…信号合成部１３…出力バッファ１４…Ｄ／Ａ変換器１６…録音時間検出部１７…希望再生時間入力部１８…基準話速倍率算出部１９，１８ａ…話速算出部２０…累積有音期間算出部２１…累積無音期間算出部 2 ... unvoiced sound 3 ... voiced sound 4 ... silence 5 ... voiced period 5 ... silence period 8 ... A / D converter 9 ... audio signal memory 10 ... audio analysis unit 11 ... recording buffer 12 ... signal synthesis unit 13 ... output buffer 14 ... D / A converter 16 ... Recording time detection unit 17 ... Requested reproduction time input unit 18 ... Reference speech speed magnification calculation unit 19,18a ... Speech speed calculation unit 20 ... Cumulative sound period calculation unit 21 ... Cumulative silence period calculation unit

【手続補正書】[Procedure amendment]

【提出日】平成１２年１月７日（２０００．１．７）[Submission date] January 7, 2000 (2000.1.7)

【手続補正１】[Procedure amendment 1]

【補正対象書類名】図面[Document name to be amended] Drawing

【補正対象項目名】全図[Correction target item name] All figures

【補正方法】変更[Correction method] Change

【補正内容】[Correction contents]

【図１】 FIG.

【図２】 FIG. 2

【図４】 FIG. 4

【図６】 FIG. 6

【図９】 FIG. 9

【図３】 FIG. 3

【図５】 FIG. 5

【図７】 FIG. 7

【図１１】 FIG. 11

【図８】 FIG. 8

【図１０】 FIG. 10

【図１２】 FIG.

───────────────────────────────────────────────────── フロントページの続き (72)発明者勝木陽一東京都港区南麻布五丁目10番27号アンリツ株式会社内Ｆターム(参考） 5D045 AA07 BA02 5D080 AA05 BA01 DA02 FA31 FA39 GA02 GA16 ────────────────────────────────────────────────── ─── Continuing on the front page (72) Inventor Yoichi Katsuki 5-10-27 Minamiazabu, Minato-ku, Tokyo Anritsu Corporation F-term (reference) 5D045 AA07 BA02 5D080 AA05 BA01 DA02 FA31 FA39 GA02 GA16

Claims

[Claims]

An audio analysis unit for digitally analyzing a continuous audio signal; a recording buffer for storing a digital signal analyzed by the audio analysis unit; an original sound of the continuous audio signal; the original sound recording time detecting means for detecting the recording time (16), and calculates the detected original sound recording time (R ₀₎ and the desired playback time reference speech speed ratio represented by the ratio of (R ₁₎ (M) A reference voice speed magnification calculating means (18); and a voice speed ratio (Y) corresponding to the calculated reference voice speed ratio (M) by synthesizing a digital signal output from the recording buffer in response to a reproduction instruction. And a signal synthesizing section (12) for reproducing a new digital signal.

2. An audio analysis unit (10) for digitally analyzing a continuous audio signal, a recording buffer (11) for storing a digital signal analyzed by the audio analysis unit, and an original sound of the continuous audio signal. the original sound recording time detecting means for detecting the recording time (16), and calculates the detected original sound recording time (R ₀₎ and the desired playback time reference speech speed ratio represented by the ratio of (R ₁₎ (M) Reference speech speed magnification calculating means (18), and signal synthesis for reproducing a new digital signal at a specified speech speed magnification (Y) by synthesizing digital signals output from the recording buffer in response to a reproduction instruction. Unit (12); and a speech speed calculation unit (19) that calculates a speech speed ratio (Y) corresponding to the calculated reference speech speed ratio (M) and sends the result to the signal synthesis unit. Characteristic speech speed converter.

3. A cumulative sound period calculating means (20) for calculating a cumulative sound period of the continuous audio signal, and a cumulative silent period calculating means (21) for calculating a cumulative silent period of the continuous audio signal. The speech speed calculation unit (19a) is configured to change the speech speed ratio of the speech period from the calculated accumulated speech period and the accumulated silence period so that the reference speech speed ratio (M) is Calculate the obtained sound target speech speed magnification (N ₁ ) and send the speech speed magnification of the sound target speech speed magnification to the signal synthesizing section during the sound period of the digital signal. 3. The speech speed conversion device according to claim 2, wherein no change in period is sent to the signal synthesis unit during a period.

4. A cumulative sound period calculating means (20) for calculating a cumulative sound period of the continuous audio signal, and a cumulative silent period calculating means (21) for calculating a cumulative silent period of the continuous audio signal. The speech speed calculation unit (19a) includes: a silence target period in which the reference speech speed magnification (M) is obtained by changing the silence period from the calculated accumulated speech period and accumulated silence period. (M ₀ ) is calculated, and the speech rate magnification of the target silence period (N ₀ ) is sent to the signal synthesizing section during the sound period of the digital signal, and the speech speed of 1 is output during the sound period of the digital signal. 3. The speech speed conversion device according to claim 2, wherein a magnification is transmitted to the signal synthesis unit.

5. The speech speed calculation unit (19a), based on an initial speech speed magnification set in advance as time elapses, within a period from the start time of the continuous audio signal until a specified time elapses, sets the reference value. The method according to claim 1, further comprising: changing a speech speed ratio to a target speech speed ratio in the vicinity of the speech speed ratio, and after the lapse of the prescribed time, sequentially calculating a speech speed ratio for maintaining the target speech speed ratio, and transmitting the calculated speech speed ratio to the signal synthesis unit. 2. The speech speed converter according to 2.