JPH07129190A

JPH07129190A - Talk speed change method and device and electronic device

Info

Publication number: JPH07129190A
Application number: JP16723294A
Authority: JP
Inventors: Yoshito Nene; 義人禰寝; Yukio Kumagai; 幸夫熊谷; Masashi Takamiya; 正志高宮; Yasunori Kawauchi; 保憲川内; Nobuo Hataoka; 信夫畑岡; Juichi Morikawa; 寿一森川
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1993-09-10
Filing date: 1994-07-19
Publication date: 1995-05-19
Also published as: EP0643380A2; DE69421774T2; EP0643380B1; EP0643380A3; DE69421774D1; CA2131730A1

Abstract

PURPOSE: To let a listener hear speech at a reduced talk speed, without altering the characteristics of the speaker's voice, by changing the talk speed without changing the pitch of the input voice, and only during the period the listener specifies. CONSTITUTION: A voice is picked up by a microphone 321 and output as a voice signal, which passes through an amplifier 10 and a low-pass filter 7, is digitized at fixed time intervals by an A/D converter 5, and is then subjected to talk speed conversion by software 11 running on a digital signal processor (DSP) 1. A PTL switch 4 is connected to an external interrupt flag terminal 13 of the DSP 1, and the software 11 on the DSP 1 decides whether to perform talk speed conversion according to the value of a flag register 14. When the listener needs talk speed conversion, it is performed, without changing the pitch of the input voice, only during the specified period. The talk speed conversion device can therefore be used even in situations such as conversation, and the listener can select the voice to which conversion is applied.

Description

Detailed Description of the Invention

[0001]

[Field of Industrial Application] The present invention relates to a speech speed conversion method, a speech speed conversion device, and an electronic device that modulate the speed of voice, and in particular to a technique that is effective when applied to control techniques for using such a device in dialogue and similar situations.

[0002]

[Prior Art] As a means of assisting the hearing of hearing-impaired people, analog hearing aids, which use analog circuits to process the amplitude and frequency characteristics of voice, have mainly been used. In recent years, however, research and development aimed at applying digital signal processing to the compensation of hearing impairment has been actively pursued. These trends are described in detail in, for example, the Journal of the Acoustical Society of Japan (1991, Vol. 47, No. 10, pp. 760-765), "Application of Digital Technology to Hearing Impairment Compensation", and J. Acoust. Soc. Am. (90(2), Pt. 1, Aug. 1991), "Speech-perception aids for hearing-impaired people: Current status and needed research".

[0003] Generally, to compensate for hearing loss, the sound pressure level is amplified and the dynamic range is compressed for each frequency according to the hearing characteristics of the user. Conventional analog hearing aids implement this processing with analog circuits. Digital hearing aids developed in recent years implement it in software, such as digital filters, so that the device can be fitted more precisely to the user's hearing characteristics.

[0004] Against this background, attempts have recently been made to provide hearing assistance for the auditory system as a whole, including the decline of higher-order language processing speed, by using digital signal processing to change only the speed of voice without changing its pitch. Such speech speed conversion techniques are described in detail in, for example, IEICE Technical Report Vol. 92, No. 207, SP92-54, "Development of a Portable DSP System for Speech Processing for the Elderly", and SP92-55 of the same report, "High-Quality Real-Time Speech Speed Conversion System".

[0005]

[Problems to Be Solved by the Invention] In the conventional techniques described above, the voice subjected to speech speed conversion was broadcast audio from television or radio, or audio recorded on a tape recorder or the like. That is, only voice delivered one-way to the listener was the target of speech speed conversion.

[0006] However, considering that conventional hearing aids can be used regardless of the type of input voice, it is desirable that a speech speed conversion device also be able to handle other kinds of voice as input. In particular, if the voice of the other party in a dialogue could be heard slowly, the device would not only assist the hearing of elderly and hearing-impaired people but could also help people with normal hearing follow conversation in a foreign language they are not accustomed to.

[0007] An object of the present invention is to provide a technique that can convert the speed of speech as needed and reproduce it.

[0008] Another object of the present invention is to provide a technique that stores the original voice data so that the speech speed can always be converted on the basis of the original voice data.

[0009] Another object of the present invention is to provide control means that allow a speech speed conversion device to be used in dialogue and similar situations.

[0010] Another object of the present invention is to provide a technique that can expand the range of applications of the speech speed conversion device.

[0011] Another object of the present invention is to provide a technique that makes effective use of the storage device (memory) of the speech speed conversion device.

[0012] Another object of the present invention is to provide a technique that allows the read pointer of the storage device (memory) of the speech speed conversion device to be moved back.

[0013] Another object of the present invention is to provide a technique that allows a speech speed conversion device to control connected AV equipment.

[0014] Another object of the present invention is to provide a technique that allows a speech speed conversion device to perform continuous speech speed conversion.

[0015] Another object of the present invention is to provide a technique that can improve the operability of the speech speed conversion device.

[0016] Another object of the present invention is to provide a technique that can reduce the power consumption of the speech speed conversion device.

[0017] The above and other objects and novel features of the present invention will become apparent from the description in this specification and the accompanying drawings.

[0018]

[Means for Solving the Problems] The following is a brief summary of representative inventions disclosed in this application.

[0019] (1) A speech speed conversion method in which voice is input and only the speed of the voice is changed, without changing the pitch of the input voice, wherein speech speed conversion of the input voice is performed only during the period that the listener designates when conversion is needed, and no speech speed conversion is performed at other times.

[0020] (2) A speech speed conversion device comprising means for inputting voice, speech speed conversion processing means for changing the speed of the input voice, and means for outputting the output of the speech speed conversion processing means as sound to the listener's ear, wherein the device is provided with a speech speed conversion processing switch and with means that change the speech speed of the input voice and output it only while the switch is on, and output the input voice without changing its speech speed while the switch is off.
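The switch-gated behaviour in (2) can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the names `gate_frames` and `placeholder_convert` are hypothetical, audio is assumed to arrive as a list of frames with one recorded switch state per frame, and the placeholder converter merely lengthens a frame so that the gating is visible. It is not a pitch-preserving converter.

```python
def gate_frames(frames, switch_on, convert):
    """Convert only the frames captured while the switch is ON.

    frames    -- list of frames (each a list of samples)
    switch_on -- list of booleans, one per frame (hypothetical switch log)
    convert   -- stand-in for a pitch-preserving speech speed converter
    """
    return [convert(f) if on else f for f, on in zip(frames, switch_on)]


def placeholder_convert(frame):
    # Placeholder only: doubles the frame length by repeating it.
    # A real converter would insert pitch periods instead.
    return frame + frame


frames = [[1, 2], [3, 4], [5, 6]]
out = gate_frames(frames, [False, True, False], placeholder_convert)
```

Frames recorded while the switch is off pass through untouched, which corresponds to outputting the input voice without changing its speech speed.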

[0021] (3) A speech speed conversion method in which original voice is encoded and stored, the stored encoded voice is read out, and only the speed of the voice is changed without changing the pitch of the original voice, wherein speech speed conversion of the input voice is performed only during the period designated when conversion is needed, and no speech speed conversion is performed at other times.

[0022] (4) A speech speed conversion device comprising means for inputting original voice, storage means for encoding and storing the input voice, speech speed conversion processing means for reading out the stored encoded voice and changing the speed of the input voice, and means for outputting the output of the speech speed conversion processing means as sound to the listener's ear, wherein the device is provided with a speech speed conversion processing switch and with means that change the speech speed of the input voice and output it only while the switch is on, and output the input voice without changing its speech speed while the switch is off.

[0023] (5) The storage means has means for storing data in units of frames.

[0024] (6) The choice between waveform expansion and waveform shortening in the speech speed conversion processing is made by means that compare the power of a frame with a threshold value, and the threshold value is variable.

[0025] (7) The speech speed conversion device is provided with a speech speed selection switch for selecting a speech speed and with means for changing the speech speed to the one selected by that switch.

[0026] (8) The speech speed conversion device is provided with means for controlling audio/video equipment (AV control).

[0027] (9) The speech speed conversion device is provided with a repeat switch and with means for repeating the reproduced voice while the repeat switch is on.

[0028] (10) The repeat means has at least one of: means for going back several seconds each time the switch is pressed; means for occasionally emitting an intermittent sound while going back; means for preventing any further going back once the end of the ring buffer has been reached; and means for selecting the speech speed used during repeat.
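Two of the repeat behaviours in (10), going back a fixed amount per press and stopping at the end of the ring buffer, can be sketched as follows. The function name `press_repeat` and its parameters are hypothetical, and the read position is modelled simply as a delay, in samples, behind the newest stored sample.

```python
def press_repeat(delay, step, max_delay):
    """Return the read delay (in samples) after one press of the repeat switch.

    delay     -- how far the read position currently lags the newest sample
    step      -- samples to go back per press (a few seconds of audio)
    max_delay -- oldest audio still held in the ring buffer
    """
    # Going back increases the lag, but never past the oldest stored sample.
    return min(delay + step, max_delay)
```

For example, with 2 seconds per press at an assumed 8 kHz sampling rate (`step=16000`) and a buffer holding 40000 samples, the third press is clamped at 40000 instead of reaching 48000.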

[0029] (11) The means for selecting the speech speed during repeat offers at least two of: repeat at the default value; slow repeat; fast-listening repeat; and repeat that gradually becomes faster.

[0030] (12) The speech speed conversion device is provided with catch-up means that, when speech speed conversion or a repeat operation has caused a delay from real time, adjust the amount of that delay while the stored information is being reproduced.

[0031] (13) The catch-up means has at least one of: means by which catch-up starts when the slow reproduction mode ends; means by which catch-up starts when, after a repeat, reproduction reaches the point at which the repeat began; means for selecting the speech speed during catch-up; means for automatically switching, once caught up, to a through mode in which the input voice is output as it is; and means for emitting a notification sound (message) once caught up.

[0032] (14) The means for selecting the speech speed during catch-up has at least one of: means for skipping to the present in a single jump; means for chasing the present by fast listening; and means for proceeding in parallel with the present while remaining delayed.
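The three catch-up options in (14) differ only in how the delay behind real time evolves. The sketch below models that with a hypothetical function `next_delay`; the mode names, the `fast_rate` parameter, and the frame-based bookkeeping are assumptions made for illustration.

```python
def next_delay(delay, frame_len, mode, fast_rate=2.0):
    """Delay (in samples) after one output frame of frame_len samples.

    mode 'skip' -- jump straight to the present
    mode 'fast' -- play stored audio fast_rate times faster than real time
    mode 'hold' -- play at normal speed, staying delayed
    """
    if mode == "skip":
        return 0
    if mode == "fast":
        # While frame_len samples are played out, frame_len new samples
        # arrive and frame_len * fast_rate stored samples are consumed,
        # so the delay shrinks by the difference.
        return max(0, delay - int(frame_len * (fast_rate - 1.0)))
    if mode == "hold":
        return delay
    raise ValueError(mode)
```

Once the returned delay reaches 0, a device following (13) would switch to the through mode.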

[0033] (15) At least one of the speech speed conversion processing switch, the speech speed selection switch, the repeat switch, and a reset switch is provided at an easily operated peripheral portion of one side face of the speech speed conversion device.

[0034] (16) The reset switch has means that, when the switch is turned on during a repeat operation or a catch-up operation, stop that operation, skip to the present, and then switch to the through mode.

[0035] (17) The speech speed conversion processing means is provided as software executed on a digital signal processor having a terminal for inputting an external interrupt request signal, and control of the speech speed conversion processing by the speech speed conversion processing switch, or switching of the conversion speed, is conveyed to the digital signal processor through that interrupt request terminal.

[0036] (18) The device has means for listening to the output voice of the speech speed conversion device through binaural headphones.

[0037] (19) A speech speed conversion device comprising: a microphone that converts an acoustic signal into an electric signal; an analog amplifier that amplifies the microphone output; a low-pass filter that removes high-frequency components from the output of the analog amplifier; an A/D converter that converts the analog output of the low-pass filter into a digital signal; a digital signal processor that changes the speed of voice by digital signal processing; storage means for saving input voice data and signal processing results; means for controlling the speed-changing processing performed by the digital signal processor; means for changing processing parameters; a D/A converter that converts digital voice data into analog values; a second low-pass filter that removes high-frequency components from the output of the D/A converter; a second analog amplifier that amplifies the output of the second low-pass filter; and headphones that convert the output of the second analog amplifier into an acoustic signal and deliver it to both ears.

[0038] (20) A speech speed conversion device comprising: a microphone that converts an acoustic signal into an electric signal; an analog amplifier that amplifies the microphone output; a low-pass filter that removes high-frequency components from the output of the analog amplifier; an A/D converter that converts the analog output of the low-pass filter into a digital signal; storage means for saving input voice data and signal processing results; a digital signal processor that reads out the stored information and changes the speed of voice by digital signal processing; means for controlling the speed-modulating processing performed by the digital signal processor; means for changing processing parameters; a D/A converter that converts digital voice data into analog values; a second low-pass filter that removes high-frequency components from the output of the D/A converter; a second analog amplifier that amplifies the output of the second low-pass filter; and headphones that convert the output of the second analog amplifier into an acoustic signal and deliver it to both ears.

[0039] (21) The speech speed conversion processing means operates as frame-by-frame pipeline processing using a plurality of input frame buffers. For the data of each frame, the following sequence is repeated over the entire frame: pitch extraction is first applied to the leading part of the frame to detect the pitch of that part; data for the detected one pitch length is transferred to the output buffer; a window function rising from 0 to 1 and a window function falling from 1 to 0 are applied to data two pitch lengths long, and the two windowed results are added to produce a synthesized waveform with a duration of two pitch periods, which is inserted after the one pitch period of data transferred earlier; pitch detection is then performed again starting from the position two pitch periods away from the position at which pitch extraction was previously applied; and data for n pitch periods (n being an integer), measured in units of the pitch length obtained by this last pitch detection, is transferred to the output buffer.
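The expansion procedure in (21) can be sketched in Python under several assumptions that are not in the source: the pitch period is taken as known and constant (pitch detection is omitted), the windows are linear ramps, and the two windows are aligned so that the signal following the copied period fades out while the same signal one pitch period earlier fades in. The text does not spell out this alignment, so the function names `expand_once` and `expand_frame` and the details below are one plausible reading, not the patented implementation.

```python
def expand_once(x, p, period):
    """Turn x[p : p + 2*period] into three pitch periods of output."""
    first = x[p:p + period]               # one pitch period, copied unchanged
    synth = []
    for i in range(2 * period):
        rise = i / (2 * period)           # window rising from 0 towards 1
        fall = 1.0 - rise                 # window falling from 1 towards 0
        # Assumed alignment: fade out the signal that follows `first`
        # while fading in the same signal one pitch period earlier.
        synth.append(fall * x[p + period + i] + rise * x[p + i])
    return first + synth


def expand_frame(x, period, n):
    """Repeat expand_once over a frame, copying n periods between insertions."""
    out, p = [], 0
    while p + 3 * period <= len(x):       # expand_once reads 3 periods ahead
        out += expand_once(x, p, period)
        p += 2 * period
        out += x[p:p + n * period]        # n pitch periods passed through
        p += n * period
    out += x[p:]                          # remainder of the frame, unchanged
    return out
```

With this reading, every 2 + n input pitch periods become 3 + n output periods, so n sets the expansion ratio. For a perfectly periodic input the output is the same waveform, only longer, which is the pitch-preserving property the method aims for.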

[0040] (22) The speech speed conversion processing means calculates the average power of the data in an input frame and performs the conversion only when the average power is greater than a preset threshold value; when it is smaller, the data contained in the frame is transferred to the output buffer as it is.
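A minimal sketch of the power gate in (22), assuming average power is the mean of the squared samples. The names `average_power` and `process_frame` are hypothetical, and `convert` stands in for the actual speech speed conversion.

```python
def average_power(frame):
    """Mean of the squared sample values in one frame."""
    return sum(s * s for s in frame) / len(frame)


def process_frame(frame, threshold, convert):
    """Convert a frame only if its average power exceeds the threshold."""
    if average_power(frame) > threshold:
        return convert(frame)
    return list(frame)                    # low-power frame: pass through as is
```

Skipping conversion on low-power frames avoids running pitch processing on segments that are mostly silence or background noise.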

[0041] (23) In the threshold processing on the average power of the data in an input frame, a second threshold value is provided; when frames whose average power is below the second threshold continue for longer than a preset time threshold, the data of those frames that continue beyond the time threshold with average power below the second threshold is prohibited from being transferred to the output buffer.
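The silence handling in (23) can be sketched as a run-length check. In this illustration the time threshold is expressed as a number of frames, which is an assumption, and `drop_long_silence` is a hypothetical name.

```python
def drop_long_silence(frames, powers, low_threshold, max_quiet_frames):
    """Drop quiet frames once a quiet run exceeds max_quiet_frames.

    frames           -- list of frames
    powers           -- average power of each frame
    low_threshold    -- the second (lower) power threshold
    max_quiet_frames -- time threshold, expressed here in frames
    """
    out, quiet_run = [], 0
    for frame, power in zip(frames, powers):
        if power < low_threshold:
            quiet_run += 1
            if quiet_run > max_quiet_frames:
                continue                  # beyond the time threshold: not output
        else:
            quiet_run = 0                 # speech resumed, reset the run
        out.append(frame)
    return out
```

Short pauses are kept intact; only the part of a long pause beyond the time threshold is removed, which recovers some of the delay introduced by slowing the speech.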

[0042] (24) Each of the switches is a soft-touch switch, so that the microphone does not pick up the click sound of the switch.

[0043] (25) The switches have surfaces that feel different to the touch, so that each switch can be identified without looking at it.

[0044] (26) Cloth-rubbing noise prevention means are provided that vary the distance between the microphone and the device body, so that the microphone does not come into direct contact with clothing when the device is used in a breast pocket.

[0045] (27) Display means on which the amount of time delay from the present can be seen are provided at a predetermined position on the speech speed conversion device.

[0046] (28) A ring buffer is used as the storage means, and means are provided for managing the delay time with a counter that represents the time delay on the ring buffer.
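A ring buffer whose only bookkeeping is a delay counter, as in (28), might look like the following sketch. The class name `DelayRingBuffer` and its methods are hypothetical; the counter holds the number of samples written but not yet read, which is the time delay expressed in samples.

```python
class DelayRingBuffer:
    """Ring buffer that tracks the read position as a delay counter."""

    def __init__(self, capacity):
        self.buf = [0] * capacity
        self.write = 0      # next position to write
        self.delay = 0      # samples written but not yet read

    def push(self, sample):
        self.buf[self.write] = sample
        self.write = (self.write + 1) % len(self.buf)
        # Once the buffer is full the oldest unread sample is overwritten,
        # so the delay saturates at the capacity.
        self.delay = min(self.delay + 1, len(self.buf))

    def pop(self):
        if self.delay == 0:
            return None     # caught up with the present
        read = (self.write - self.delay) % len(self.buf)
        self.delay -= 1
        return self.buf[read]

    def delay_seconds(self, sample_rate):
        return self.delay / sample_rate


rb = DelayRingBuffer(4)
for s in [1, 2, 3, 4, 5]:
    rb.push(s)
first, second = rb.pop(), rb.pop()
```

Because the read position is derived from the write position and the counter, the same counter can drive a delay display and detect when playback has caught up.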

[0047] (29) In addition to the through mode, a standby mode is provided that lowers the clock rate of the processor and performs the same processing as the slow mode.

[0048] (30) The power switch has three positions: on, off, and an intermediate position between on and off. When the switch is set to the intermediate position, the device operates in an analog through mode in which the analog input and output systems of the voice signal processing path are short-circuited to each other and the power supply to the digital processing system lying between them is cut off.

[0049] (31) A telephone in which the speech speed conversion means of any one of (2) and (4) to (30) above is provided between the telephone handset and the telephone body.

[0050] (32) A telephone exchange in which the speech speed conversion means of any one of claims 2 and 4 to 30 is provided within the telephone exchange.

[0051]

[Operation] According to (1) and (2) of the means described above, when voice is input and only its speed is changed without changing its pitch, speech speed conversion of the input voice is performed only during the period designated when conversion is needed, and no conversion is performed at other times. The speech speed conversion device can therefore be used not only for voice delivered one-way to the listener, such as radio audio, but also in situations such as dialogue, and the listener can select which voice is converted without the listener's own speech being disturbed.

[0052] In addition, with hearing aids, foreign language learning devices, telephones, and the like, voice can be heard at a slow speech speed without changing the characteristics of the speaker's voice.

[0053] According to (3) and (4) of the means described above, when original voice is encoded and stored, the stored encoded voice is read out, and only the speed of the voice is changed without changing the pitch of the original voice, speech speed conversion of the input voice is performed only during the period designated when conversion is needed, and no conversion is performed at other times. In addition to the effects of (1) and (2), the device can therefore provide effective use of memory, a repeat function for the original voice, a voice memory function, a speech speed conversion function for repeated voice, a fast-listening playback function, and so on.

[0054] According to (5) of the means described above, data is stored in units of frames, so the efficiency of writing and reading can be improved.

[0055] According to (6) of the means described above, the choice among waveform expansion, waveform shortening, and silent-section deletion in the speech speed conversion processing is made by comparing the power of a frame with a threshold value, and the threshold value is changed according to the loudness of the input voice, so speech speed conversion suited to the conditions of the usage environment can be performed.
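The text says only that the threshold follows the loudness of the input voice, without giving a rule. One possible rule, shown purely as an assumption, is to track a smoothed input power level and set the threshold to a fixed fraction of it. The name `update_threshold` and the parameters `ratio` and `smoothing` are hypothetical.

```python
def update_threshold(level, frame_power, ratio=0.1, smoothing=0.9):
    """Track the input level and derive a power threshold from it.

    level       -- smoothed input power from previous frames
    frame_power -- average power of the current frame
    ratio       -- threshold as a fraction of the tracked level (assumed)
    smoothing   -- weight given to the previous level (assumed)
    """
    level = smoothing * level + (1.0 - smoothing) * frame_power
    return level, ratio * level
```

In a loud environment the tracked level, and with it the threshold, rises, so background noise is less likely to be treated as speech.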

[0056] According to (7) of the means described above, the speech speed conversion device is provided with a speech speed selection switch for selecting a speech speed and with means for changing the speech speed to the one selected by that switch, so the listener can select the speech speed of the voice that he or she hears.

[0057] According to (8) of the means described above, the speech speed conversion device is provided with means for controlling audio/video equipment (AV control). Independently of the expansion/shortening ratio of the speech speed conversion, when memory capacity runs short the device issues a signal that pauses playback on the external equipment, temporarily stopping voice input to the speech speed conversion device; when free memory becomes available, it stops issuing the pause signal and voice input from the external equipment resumes. Because this operation is repeated, speech speed conversion can be used continuously over a long period.
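The pause/resume cycle described for the AV control can be sketched as a small state update. Using two separate levels (`min_free` to start pausing, `resume_free` to stop) is an assumption added here to avoid rapid toggling; the source states only that the pause is issued when memory runs short and withdrawn when free space becomes available. The name `update_pause` is hypothetical.

```python
def update_pause(paused, free_frames, min_free, resume_free):
    """Return whether the pause signal to the external AV device is asserted.

    paused      -- current state of the pause signal
    free_frames -- free space left in the buffer, in frames
    min_free    -- below this, playback on the external device is paused
    resume_free -- at or above this, the pause signal is withdrawn
    """
    if not paused and free_frames < min_free:
        return True
    if paused and free_frames >= resume_free:
        return False
    return paused
```

While the external device is paused, slowed playback keeps draining the buffer, so free space grows until the resume level is reached.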

[0058] According to (9) to (11) of the means described above, the speech speed conversion device is provided with a repeat switch and with means for repeating the reproduced voice while the repeat switch is on, so speech speed conversion can be applied to the repeated voice.

[0059] According to (12) to (14) of the means described above, catch-up means are provided that chase the information stored in the speech speed conversion device up to the point the listener wants to hear, so the range of applications of the device can be expanded, operation time shortened, and ease of use improved.

[0060] According to (15) and (16) of the means described above, at least one of the speech speed conversion processing switch, the speech speed selection switch, the repeat switch, and the reset switch is provided at an easily operated peripheral portion of one side face of the speech speed conversion device, so the range of applications of the device can be expanded, operation time shortened, and ease of use improved.

[0061] According to (17) to (23) of the means described above, the speech speed conversion processing means is provided as software executed on a digital signal processor having a terminal for inputting an external interrupt request signal, and control of the speech speed conversion processing by the speech speed conversion processing switch, or switching of the conversion speed, is conveyed to the digital signal processor through that interrupt request terminal.

[0062] The speech speed conversion processing means also operates as frame-by-frame pipeline processing using a plurality of input frame buffers. For the data of each frame, the following sequence is repeated over the entire frame: pitch extraction is first applied to the leading part of the frame to detect the pitch of that part; data for the detected one pitch length is transferred to the output buffer; a window function rising from 0 to 1 and a window function falling from 1 to 0 are applied to data two pitch lengths long, and the two windowed results are added to produce a synthesized waveform with a duration of two pitch periods, which is inserted after the one pitch period of data transferred earlier; pitch detection is then performed again starting from the position two pitch periods away from the position at which pitch extraction was previously applied; and data for n pitch periods (n being an integer), measured in units of the pitch length obtained by this last pitch detection, is transferred to the output buffer.

【００６３】Further, the speech speed conversion processing means calculates the average power of the data in the input frame and is executed only when that average power is larger than a preset threshold; when it is smaller, the data in the frame is transferred to the output buffer unchanged.

【００６４】Further, in the threshold processing on the average power of the data in the input frame, a second threshold is provided. When frames whose average power is below the second threshold continue for longer than a preset time threshold, the data of those low-power frames that continue beyond the time threshold is transferred to the output buffer.

【００６５】By configuring the speech speed conversion processing means in this way, the efficiency of the speech speed conversion processing is improved and degradation of the reproduced speech quality is prevented.

【００６６】According to item (24) of the above-mentioned means, the microphone does not pick up the click sound of the switches, so loud noise during switch operation is prevented.

【００６７】According to item (25) of the above-mentioned means, the switches have surfaces with different tactile feels so that each switch can be identified without looking, which improves operability.

【００６８】According to item (26) of the above-mentioned means, means for preventing cloth-rubbing noise at the microphone is provided, so the intrusion of noise is reduced.

【００６９】According to item (27) of the above-mentioned means, display means on which the amount of time delay from the present can be seen is provided at a predetermined position on the speech speed conversion device, which shortens operation time and improves usability.

【００７０】According to item (28) of the above-mentioned means, a ring buffer is used as the storage means and means is provided for managing the delay time with a counter that represents the time delay on the ring buffer, so repeat processing, catch-up processing, and the like are easy to perform.

【００７１】According to item (29) of the above-mentioned means, a standby mode is provided in addition to the through mode, so power consumption is reduced.

【００７２】According to item (30) of the above-mentioned means, the power switch has three positions, on (ON), off (OFF), and an intermediate position between on and off, and an analog slow mode is provided, so power consumption is reduced.

【００７３】According to item (31) of the above-mentioned means, the speech speed conversion means is provided between the telephone handset and the telephone body, so the listener can select the speech to which speech speed conversion is applied without the listener's own speech being disturbed.

【００７４】Further, on the telephone, the listener can hear the talker's speech at a slower speed without any change to the characteristics of the talker's voice.

【００７５】According to item (32) of the above-mentioned means, the speech speed conversion means is provided in the telephone exchange, so the listener can select the speech to which speech speed conversion is applied without the listener's own speech being disturbed.

【００７６】[0076]

【Embodiments】The present invention will now be described in detail through embodiments with reference to the drawings. In all drawings used to explain the embodiments, parts having the same function are given the same reference numerals, and repeated description of them is omitted.

【００７７】(Embodiment 1) FIG. 1 is a block diagram showing the schematic configuration of the internal circuit of Embodiment 1 of the present invention. Reference numeral 1 denotes a DSP (digital signal processor), 11 software that performs the speech speed conversion processing, 12 a serial port, 13 an external interrupt flag terminal, 14 a flag register, 2 an audio memory (output buffer), 3 a selector switch, 4 a PTL (Push-To-Listen) switch, 5 an A/D converter, 6 a D/A converter, 7 a low-pass filter, 8 a low-pass filter, 9 an analog amplifier, 10 an analog amplifier, 321 a microphone, and 325 binaural headphones (earphones).

【００７８】In the speech speed conversion device of Embodiment 1, as shown in FIG. 1, speech enters the microphone 321 and is output as an audio signal (electrical signal). This audio signal is fed through the amplifier 10 and the low-pass filter 7 to the A/D converter 5, which converts it from analog to digital values at a preset time interval.

【００７９】The digitized audio signal is input to the DSP 1, and the speech speed conversion of the audio signal is carried out by the software 11 running on the DSP 1. The PTL switch 4 is connected to the external interrupt flag terminal 13 of the DSP 1, and the state of the PTL switch 4 appears as the value of the flag register 14 inside the DSP 1 that corresponds to this terminal 13. The software 11 on the DSP 1 decides whether or not to perform speech speed conversion according to the value of the flag register 14.

【００８０】The digital audio data that has undergone speech speed conversion is held in the output buffer memory 2. The D/A converter 6 converts the data in the output buffer memory 2 from digital to analog values at a preset time interval. The resulting analog signal is fed through the low-pass filter 8 to the analog amplifier 9 and is output as sound from the binaural headphones 325 at the sound pressure level the listener prefers.

【００８１】In Embodiment 1, two kinds of switch are provided for the PTL switch 4. One conducts only while its push button is held down. The other stays conducting even after the finger is released from the push button. The former is used in conversation; the latter is used for the conventional application of continuously converting the speed of one-way speech such as radio speech. In Embodiment 1, the selector switch 3 is also connected, in addition to the PTL switch 4, to the external interrupt flag terminal 13 of the DSP 1. Switching the selector switch 3 changes the value of the flag register 14, and the software 11 changes the expansion rate of the speech speed conversion according to this value.

【００８２】FIG. 2 is a diagram for explaining the speech speed conversion processing executed in the DSP 1 of Embodiment 1. The processing detects the pitch (fundamental period) of the audio signal and lengthens the waveform in units of the detected pitch, and it handles a set of audio data several tens of milliseconds long (hereinafter called a frame) as the unit of one processing pass. For this reason, at least two input buffers, each one frame long, are provided inside the DSP 1, and while data from the A/D converter is being written into one buffer, the data accumulated in the other buffer is processed (pipeline processing). The processed data is stored in the output buffer 2, which has a sufficiently large capacity. The procedure applied to the data of each frame is as follows.

【００８３】(a) First, pitch extraction (not shown) is applied to the head of the frame to detect the pitch of that portion.

【００８４】(b) Next, the data for the detected one pitch length is transferred to the output buffer 2.

【００８５】(c) Next, a window function rising from 0 to 1 and a window function falling from 1 to 0 are applied to data two pitch lengths long.

【００８６】Note that the position on the data at which windowing starts is shifted by one pitch. The two windowed results are added to produce a synthesized waveform with a duration of two pitches, which is inserted after the one pitch of data transferred earlier.

【００８７】(d) Next, pitch detection (not shown) is performed again, starting at the position two pitches away from the position where pitch extraction was first applied, to detect the pitch at that position. Because the pitch of speech generally varies all the time, this second detection usually yields a pitch different from the one detected first.

【００８８】(e) Data for n pitches, in units of the pitch length obtained by this last pitch detection, is transferred to the output buffer.

【００８９】Steps (a) to (e) above are repeated over the entire frame.
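
One cycle of steps (b) to (e) can be sketched in Python as follows. This is only an illustrative reading of the text, not the implementation used in the embodiment: the function name `stretch_cycle` is hypothetical, the pitch is assumed to be already known and constant (the re-detection of step (d) is omitted), the windows are assumed to be linear, and the choice of which window is applied to which of the two segments offset by one pitch is an assumption made so that the output stays continuous.

```python
def stretch_cycle(x, start, pitch, n):
    """One pass of steps (b)-(e) with a fixed, already-detected pitch (in samples).

    Returns the output samples and the input position where the next cycle starts.
    """
    p = pitch
    out = list(x[start:start + p])               # (b) copy one pitch period as is
    fade_in = x[start:start + 2 * p]             # weighted by the 0 -> 1 window (assumed)
    fade_out = x[start + p:start + 3 * p]        # same data shifted by one pitch, 1 -> 0 window
    for i in range(2 * p):                       # (c) windowed sum, two pitch periods long
        w = i / (2 * p)
        out.append((1.0 - w) * fade_out[i] + w * fade_in[i])
    out.extend(x[start + 2 * p:start + (2 + n) * p])  # (e) copy n further periods unchanged
    return out, start + (2 + n) * p
```

With this reading, each cycle consumes 2 + n pitch periods of input and emits 3 + n periods of output; for a perfectly periodic input the output is simply a longer stretch of the same periodic signal.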

【００９０】Because the pitch length depends on the input speech, the number of repetitions within one frame is not constant. Changing the value of n in step (e) gives different expansion rates. For example, with n = 1, four pitches of data are generated from three pitches of data in the input buffer, so the expansion rate is 4/3 = 1.33. Likewise, the rate is 1.50 for n = 0 and 1.25 for n = 2.
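
The three rates quoted above are consistent with a general ratio of (n + 3)/(n + 2), that is, n + 2 pitch periods consumed and n + 3 produced per cycle. The formula is inferred from the three examples and is not stated in the text; the function name `expansion_rate` is hypothetical. A short check:

```python
from fractions import Fraction


def expansion_rate(n):
    """Output/input length ratio of one cycle that copies n extra pitch periods.

    Assumed general form (n + 3) / (n + 2), inferred from the n = 0, 1, 2 examples.
    """
    if n < 0:
        raise ValueError("n must be a non-negative integer")
    return Fraction(n + 3, n + 2)


for n in (0, 1, 2):
    print(n, expansion_rate(n), round(float(expansion_rate(n)), 2))
```

Larger n brings the rate closer to 1, so the selector switch can trade the amount of slowing against how quickly the output falls behind the input.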

【００９１】In Embodiment 1, the speech speed conversion of FIG. 2 is not applied to every frame. Instead, the average power of each frame is calculated, and the processing of FIG. 2 is applied only when the power exceeds a preset threshold Th. The data of a frame whose power does not exceed Th is transferred to the output buffer unchanged. FIG. 3 illustrates the concept of this threshold processing.
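
The gating by the threshold Th might look like the sketch below. The text does not define how the average power is computed, so the mean of squared samples is an assumption here, and `mean_power`, `process_frame`, and the `stretch` callback are hypothetical names standing in for the FIG. 2 processing.

```python
def mean_power(frame):
    """Average power of a frame, assumed here to be the mean of squared samples."""
    return sum(s * s for s in frame) / len(frame)


def process_frame(frame, th, stretch):
    """Stretch the frame only if its power exceeds Th; otherwise pass it through."""
    if mean_power(frame) > th:
        return stretch(frame)
    return list(frame)
```

Low-power frames, which typically include the onsets and decays of speech, therefore reach the output buffer untouched.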

【００９２】In FIG. 3, the portions where the frame power exceeds the threshold Th are shown as expansion sections. With this threshold processing, the rising and falling portions of the audio signal are output as the original sound without processing, which has the advantage that speech features contained in those portions, such as the features of consonants, are not destroyed.

【００９３】Furthermore, in Embodiment 1, a second threshold To is provided in the threshold processing on the average frame power shown in FIG. 3. When frames whose power is lower than To continue for one second or longer, the low-power frames that continue past one second are not output. This reduces the amount of data accumulated in the output buffer.
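
The silence deletion with the second threshold To can be sketched as a run-length check over frames. The names `drop_long_silence` and `max_silent_frames` are hypothetical; the one-second limit would be expressed as a frame count that depends on the frame duration, which the text gives only as several tens of milliseconds.

```python
def drop_long_silence(frames, powers, to, max_silent_frames):
    """Keep frames, except low-power frames beyond the allowed run of silence.

    powers[i] is the power of frames[i]; a frame counts as silent when its
    power is below the threshold To.
    """
    kept = []
    run = 0  # length of the current run of consecutive silent frames
    for frame, power in zip(frames, powers):
        run = run + 1 if power < to else 0
        if run <= max_silent_frames:
            kept.append(frame)
    return kept
```

Short pauses survive intact, so the rhythm of the speech is preserved, while long pauses are trimmed to the allowed length.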

【００９４】In FIG. 3, this portion that is not output is shown as a deletion section. In the output buffer 2, data that has undergone speech speed conversion is written one frame at a time while, in parallel, data is sent one sample at a time to the D/A converter 6 at a fixed time interval. The addresses of the output buffer 2 are arranged as a ring, so that the last address is followed by the first address.

【００９５】In this ring-shaped address space, therefore, the address pointer Po, which points to the data to be sent to the D/A converter, chases the address pointer Pi, which points to where the converted data is written. In Embodiment 1, Pi advances faster than Po, so Pi eventually overtakes Po. At that point, the information held in the output buffer 2 up to then is overwritten without ever being output.
In the first embodiment, the traveling speed of Pi is faster than the traveling speed of Po, so that Pi will overtake Po anyway. At this point, the information that has been stored in the output buffer 2 until then is rewritten without being output.

【００９６】Consequently, the time from the start of the speech speed conversion until this state is reached is the length of input speech that the speech speed conversion of Embodiment 1 can handle. The reduction in data volume achieved with the threshold To has the effect of lengthening this time.
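
As a rough illustration of this limit, the sketch below estimates how long conversion can run before the write pointer Pi catches up with the read pointer Po from behind. It assumes the worst case in which every frame is stretched at a constant expansion rate and no silence is deleted; the function name and the example figures (an 80,000-sample buffer at 8 kHz) are hypothetical and do not come from the text.

```python
def seconds_until_overrun(capacity_samples, sample_rate_hz, expansion_rate):
    """Time until the ring buffer fills, assuming constant stretching.

    Data is written at sample_rate_hz * expansion_rate samples per second and
    read at sample_rate_hz, so the backlog grows by the difference.
    """
    if expansion_rate <= 1.0:
        return float("inf")  # the writer never gets ahead of the reader
    surplus_per_second = sample_rate_hz * (expansion_rate - 1.0)
    return capacity_samples / surplus_per_second


print(seconds_until_overrun(80_000, 8_000, 1.5))
```

Deleting long silences lowers the effective write rate, which is why the threshold To extends the usable time.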

【００９７】The signal processing method for the speech speed conversion described with reference to FIGS. 2 and 3 is reported in IEICE (Institute of Electronics, Information and Communication Engineers) Technical Report SP92-150 (1993-03), "Evaluation of Speech Rate Conversion Method by Hearing-Impaired People", and in Proceedings of the Acoustical Society of Japan (March 1993), 1-7-6, "Study of Speech Rate Conversion Method Using a Portable DSP System".

【００９８】FIG. 4 shows how the speech speed conversion device of Embodiment 1 is used. In FIG. 4 the PTL switch 4 is placed on the top face of the device, but it may be placed elsewhere. Beside the PTL switch 4 is the selector switch 3 for changing the expansion rate of the speech speed conversion. Like that of the PTL switch 4, the state of the selector switch 3 can be observed by the software on the DSP 1 through the external interrupt flag terminal of the DSP 1, and the value of n used in the speech speed conversion is set according to the state of the selector switch 3 at the moment the PTL switch 4 is pressed. By operating the PTL switch 4 and the selector switch 3 alternately, the expansion rate can also be changed for each utterance.

【００９９】FIG. 5 is a flowchart of this control procedure. The speech speed conversion handles a frame with a duration of, for example, several tens of milliseconds as its processing unit, whereas A/D conversion and D/A conversion are performed at a shorter fixed interval, for example several tens of microseconds. For this reason, A/D conversion, D/A conversion, and the processing that accompanies them are implemented as interrupt processing, as shown in FIG. 5. While the speech speed conversion or the wait-for-interrupt processing is running, interrupt processing is triggered by an interrupt signal from the serial port to which the A/D converter and the D/A converter are connected.
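
The split between per-sample interrupt work and per-frame processing can be modeled, very loosely, with a pair of alternating buffers. The class below is a plain Python simulation, not DSP code: `on_sample` stands in for the serial-port interrupt handler, `take_frame` for the main loop picking up a completed frame, and all names are hypothetical.

```python
class PingPongInput:
    """Two alternating frame buffers: one being filled, one ready for processing."""

    def __init__(self, frame_len):
        self.frame_len = frame_len
        self.filling = []   # buffer written by the per-sample interrupt
        self.ready = None   # full frame waiting for the frame-level processing

    def on_sample(self, sample):
        """Simulated interrupt handler: store one A/D sample, swap when full."""
        self.filling.append(sample)
        if len(self.filling) == self.frame_len:
            self.ready, self.filling = self.filling, []

    def take_frame(self):
        """Main-loop side: fetch a completed frame, or None if none is ready."""
        frame, self.ready = self.ready, None
        return frame
```

In a real system the frame-level processing must finish within one frame period; otherwise the next completed frame would replace one that has not yet been picked up.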

【０１００】As is clear from the above description, Embodiment 1 allows the speech speed conversion device to be used not only for speech given one-way to the listener, such as radio speech, but also in situations such as conversation, and the listener can select the speech to which speech speed conversion is applied without the listener's own speech being disturbed.

【０１０１】The speech speed conversion device of Embodiment 1 can also be used to compensate for the decline in speech comprehension seen in elderly people and others. Needless to say, it can also be used when a person with normal hearing listens to an unfamiliar foreign language.

【０１０２】(Embodiment 2) FIGS. 6 to 10 show the external configuration of Embodiment 2 of the speech speed conversion device according to the present invention. FIG. 6 is a front view, FIG. 7 a rear view, FIG. 8 a top view, FIG. 9 a left side view, and FIG. 10 a right side view.

【０１０３】In FIGS. 6 to 10, reference numeral 101 denotes the main body of the speech speed conversion device, 102 a back cover, 103 a finger recess, 104 a slow switch (slow push button), 105 a repeat switch (repeat push button), 106 a reset switch (reset push button), 321 a microphone, 108 a volume control, 109 a power switch, 110 an earphone terminal, 111 an external input terminal, 112 an AV control terminal, and 113 a speech speed changeover switch (speech speed setting switch).

【０１０４】In the speech speed conversion device of Embodiment 2, as shown in FIGS. 6 to 10, the slow switch 104, the repeat switch 105, and the reset switch 106 are provided at a position on the main body 101 that is easy to operate with one hand, for example along the upper edge of the front face, and the speech speed changeover switch 113 is provided on the right side face.

【０１０５】The push button of the slow switch 104 is pressed frequently, so it is made larger than the other push buttons. Because holding the button down for a long time is tiring, the button can also be locked. For example, a slide-lock type that locks when the button is pressed and slid sideways, a double-click type that locks when the button is clicked twice, or a type that unlocks when the reset push button is pressed may be used.

【０１０６】The speech speed changeover switch (speech speed setting switch) 113 is placed close to the slow switch 104, within reach of the same finger, so that the two can be operated alternately.

【０１０７】Besides the arrangement of the above embodiment, a ring switch, a slide switch, or the like may be used to make operation even easier.

【０１０８】The volume control 108 is likewise placed within reach of the same finger, where it is easy to adjust, so that the listener can always listen at a suitable volume.

【０１０９】For frequently used switches such as the slow switch 104, the repeat switch 105, the reset switch 106, and the speech speed changeover switch 113, it is preferable to use switches with a soft touch so that the microphone 321 does not pick up their click sound. For example, switches using conductive rubber or the like are used.

【０１１０】It is also preferable to give each switch a surface with a different tactile feel so that the kind of switch can be identified without looking.

【０１１１】Opening the finger recess 103 reveals several switches, such as the switch for selecting the speech speed used during repeat.

【０１１２】The internal circuit configuration of the speech speed conversion device of Embodiment 2 is the same as the circuit configuration of Embodiment 1 shown in FIG. 1.

【０１１３】The slow switch 104, the repeat switch 105, the reset switch 106, and the like serve as the PTL switch 4 of Embodiment 1, and the speech speed changeover switch (speech speed setting switch) 113 serves as the selector switch 3 of Embodiment 1. The speech speed changeover switch 113 is connected to the external interrupt flag terminal 13 of the DSP 1. Switching the speech speed changeover switch 113 changes the value of the flag register 14, and the software 11 changes the expansion rate of the speech speed conversion according to this value.

【０１１４】FIG. 11 is a block diagram showing the functional configuration of the speech speed conversion device of Embodiment 2. Reference numeral 21 denotes an audio input unit, 22 an input buffer, 23 a central processing unit (CPU), 24 a ring buffer memory (corresponding to the audio memory 2 in FIG. 1), 25 a function selection unit, 26 an output buffer, and 27 an audio output unit.

【０１１５】Relating the components of Embodiment 2 to the configuration of Embodiment 1 with reference to FIG. 1, the audio input unit 21 consists of the microphone 321, the analog amplifier 10, the low-pass filter 7, and the A/D converter 5 shown in FIG. 1.

【０１１６】The input buffer 22 holds the speech that the audio input unit 21 has converted to a digital signal, and it is large enough to hold one frame of data, the unit on which the subsequent signal processing operates. The input buffer 22 can be implemented by allocating part of the address range of the ring buffer memory 24 (corresponding to the audio memory 2 in FIG. 1).

【０１１７】The central processing unit (CPU) 23 corresponds to the software executed on the DSP 1 shown in FIG. 1 and has an audio compression unit 23A, a silence compression unit 23B, a decompression unit 23C, a waveform processing unit (speech speed conversion processing unit) 23D, and a control unit 23E.

【０１１８】The function selection unit 25 corresponds to the switches 3 and 4 and the external interrupt flag terminal 13 shown in FIG. 1 and consists of the slow switch 104, the repeat switch 105, the reset switch 106, the speech speed changeover switch 113, and so on.

【０１１９】The output buffer 26 holds the data produced by the waveform processing unit 23D. There are in fact two output buffers, each large enough to hold one frame of data after it has been expanded by the waveform processing. Just as Embodiment 1 implements pipeline processing by using two input buffers alternately, Embodiment 2 implements pipeline processing by using the two output buffers alternately.

【０１２０】That is, while one frame of waveform processing is being executed and its result stored in one output buffer, the waveform processing result completed in the previous cycle is output from the other output buffer through the audio output unit 27. The output buffer 26 can be implemented by allocating part of the address range of the ring buffer memory 24 (corresponding to the audio memory 2 in FIG. 1).

【０１２１】As in Embodiment 1, data input to the input buffer 22 and output from the output buffer 26 take place at the sampling interval of the A/D converter 5 and the D/A converter 6. The processing executed by the DSP 1 therefore consists of frame-by-frame waveform processing and interrupt processing executed at the sampling interval.

【０１２２】That is, interrupt processing is executed many times while one frame of data is being waveform-processed, so the two kinds of processing appear to run simultaneously.

【０１２３】A known ring buffer memory is used as the ring buffer memory 24, and it is written and read in units of frames. The details are described below.

【０１２４】(Write operation) In FIG. 11, the audio data input by the audio input unit 21 is held in the input buffer 22. The input buffer 22 assigns a 16-bit code length to each data item, has enough capacity to hold one frame of data, and is implemented by allocating part of the address range of the audio memory 2 shown in FIG. 1.

【０１２５】The control unit 23E shown in FIG. 11 monitors the state of the input buffer 22, and each time one frame of audio data has accumulated, it transfers that frame of data to the audio compression unit 23A.

【０１２６】The audio compression unit 23A compresses the one frame of audio data it receives and stores the compressed data in the ring buffer memory 24. Several methods are conceivable for this compression. One example is the method of storing difference data shown in FIG. 12. FIG. 12 is a schematic diagram for explaining the compression performed by the audio compression unit 23A of Embodiment 2.

【０１２７】In this compression, the "difference from the immediately preceding data item" is calculated in order, starting from the first data item of each frame. In FIG. 12(a), these differences are written Δ1, Δ2, .... The output of the compression consists of the first data item of the frame, split into its upper 8 bits and lower 8 bits, followed by the difference data Δ1, Δ2, ..., each stored with an 8-bit code length. Each input data item has a 16-bit digital code length, but for an input signal such as speech, which changes slowly enough relative to the sampling interval, the difference from the preceding sample does not become very large, so half that length, 8 bits, is enough to represent it, as shown in FIG. 12(b). The compression therefore reduces the data volume to about half, and none of the content is lost unless a difference arising during processing becomes too large to represent with an 8-bit code length.
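
The difference coding described here can be sketched as follows. The functions `compress_frame` and `decompress_frame` are hypothetical names, samples are assumed to be signed 16-bit integers and differences signed 8-bit values, and raising an error on an oversized difference is a choice made for this sketch; the text only notes that information is lost in that case.

```python
import struct


def compress_frame(samples):
    """Encode a frame as the first 16-bit sample plus one signed byte per difference."""
    first = samples[0]
    out = bytearray(struct.pack(">h", first))  # upper 8 bits, then lower 8 bits
    prev = first
    for s in samples[1:]:
        d = s - prev
        if not -128 <= d <= 127:
            raise ValueError("difference does not fit in 8 bits")
        out += struct.pack("b", d)
        prev = s
    return bytes(out)


def decompress_frame(data):
    """Rebuild the samples by accumulating the stored differences."""
    (value,) = struct.unpack(">h", data[:2])
    samples = [value]
    for b in data[2:]:
        value += b - 256 if b >= 128 else b  # reinterpret the byte as signed
        samples.append(value)
    return samples
```

A frame of N 16-bit samples (2N bytes) becomes N + 1 bytes, which matches the "about half" figure in the text.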

【０１２８】リングバッファメモリ２４への保存は、こ
のようにフレーム単位で半分の容量に圧縮されたデータ
を、リングバッファメモリ２４上に、時間的な順序が保
たれるように並べられる。For storage in the ring buffer memory 24, the data compressed frame by frame to half its size in this way is arranged in the ring buffer memory 24 so that its temporal order is preserved.

【０１２９】なお、フレームの切れ目がわかるように、
各フレームの圧縮データの先頭には、フレームヘッダが
付加される。圧縮処理部では、前記図１２の圧縮処理と
共に、そのフレームの全データの絶対値の総和を計算
し、この結果をこのフレームのパワー値として、前記フ
レームヘッダ部に記録する作業も、同じに行う。A frame header is added to the beginning of the compressed data of each frame so that frame boundaries can be identified. Along with the compression processing of FIG. 12, the compression processing unit also calculates the sum of the absolute values of all the data in the frame and records this sum in the frame header as the power value of the frame.
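The frame power defined here, the sum of absolute sample values, is simple to state in code. In the sketch below the header is modelled as a small dictionary; its fields (`power`, `length`) and the function names are hypothetical, because the text does not give the actual header layout.

```python
def frame_power(samples):
    """Frame power as defined in the text: sum of absolute sample values."""
    return sum(abs(s) for s in samples)


def make_frame_record(samples, payload):
    """Pair a compressed payload with a header that carries the frame power.

    The header fields are illustrative assumptions, not the patent's format.
    """
    header = {"power": frame_power(samples), "length": len(payload)}
    return header, payload


header, payload = make_frame_record([3, -4, 5], [0, 3, 249, 9])
print(header)
```

Storing the power in the header means later stages can classify a frame as voiced or silent without decompressing it.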

【０１３０】波形伸長／短縮処理を施すフレームの決定
は、フレームのパワーとしきい値Ｔｈとの比較により行
っている。また、無音区間削除処理は、フレームのパワ
ーとしきい値Ｔｏとの比較により行っている。The frame to be subjected to the waveform expansion / contraction processing is determined by comparing the power of the frame with the threshold Th. Further, the silent section deletion process is performed by comparing the power of the frame with the threshold value To.

【０１３１】これらのしいき値は、固定の値を使用する
のではなく、入力される音声の大きさに応じて、変更す
ることが望ましい。例えば、静かな部屋の中で使ってい
る場合と、背景雑音が大きい状況とでは、当然これらの
しきい値を上手に調整しないと、うまく話速変換できな
い。Rather than using fixed values, it is desirable to change these thresholds according to the level of the input voice. For example, use in a quiet room and use with loud background noise differ enough that speech speed conversion does not work well unless the thresholds are adjusted appropriately.

【０１３２】具体的な実現方法は、過去数秒間のフレー
ムパワーの最大／最小値を記憶しておき、これらの値を
元に、前記しきい値を決定するようにしている。例え
ば、１フレームの時間長を５０ミリ秒（ｍsec）とし、
５秒毎にこれらのしきい値を変更しようとする場合に
は、１００フレーム処理を行う毎に１度しきい値Ｔｈの
変更処理を行えばよい。As a concrete implementation, the maximum and minimum frame power over the past several seconds are stored, and the thresholds are determined from these values. For example, if one frame is 50 milliseconds (msec) long and the thresholds are to be changed every 5 seconds, the threshold Th needs to be updated only once every 100 frames.

【０１３３】前述したように、フレーム毎のパワーは、
図１１の音声圧縮部でフレーム単位に情報圧縮を行う毎
に全入力について、必ず計算されており、その情報はフ
レームヘッダに記録されて、リングバッファ２４に保存
される。As described above, the power of each frame is always calculated, for all input, whenever the audio compression unit of FIG. 11 compresses a frame, and this value is recorded in the frame header and stored in the ring buffer 24.

【０１３４】このフレームパワーの計算の際に、最大フ
レームパワーＰmax及び最小フレームパワーＰminとの比
較を行い、必要ならば更新するようにする。この最大フ
レームパワーＰmax及びＰminは５秒毎（１００フレーム
毎）にリセットされるようにしておけば、過去５秒間の
最大最小フレームパワーがいつも残るようになる。When the frame power is calculated, it is compared with the maximum frame power Pmax and the minimum frame power Pmin, and these are updated if necessary. If Pmax and Pmin are reset every 5 seconds (every 100 frames), the maximum and minimum frame power of the past 5 seconds are always available.

【０１３５】しきい値の計算は、例えば、Ｔｈを最大フ
レームパワーＰmaxと最小フレームパワーＰminの間の１
０％に、Ｔｏを５％に設定する。式で表現すれば、次式
（数１）及び（数２）となる。The thresholds are calculated, for example, by setting Th at 10% and To at 5% of the range between the minimum frame power Pmin and the maximum frame power Pmax, measured from Pmin. Expressed as equations, these are (Equation 1) and (Equation 2) below.

【０１３６】[0136]

【数１】Ｔｈ＝｜Ｐmax−Ｐmin｜＊０.１０＋Ｐmin [Equation 1] Th = |Pmax - Pmin| * 0.10 + Pmin

【０１３７】[0137]

【数２】Ｔｏ＝｜Ｐmax−Ｐmin｜＊０.０５＋Ｐmin [Equation 2] To = |Pmax - Pmin| * 0.05 + Pmin

次に、本実施例２のリングバッファメモリ２４への原（生）データ保存方法を説明したので、無音区間圧縮の詳細についても説明する。Now that the method of storing the original (raw) data in the ring buffer memory 24 in the second embodiment has been described, silent-interval compression is described in detail.
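The threshold rule of (Equation 1) and (Equation 2), combined with the periodic reset of Pmax and Pmin described above, can be sketched as follows. The class name `ThresholdTracker` and its attributes are assumptions for illustration; the choice to keep the previous thresholds until the next update is also an assumption, since the text only says when the thresholds are recalculated.

```python
class ThresholdTracker:
    """Track Pmax/Pmin over a window of frames and derive Th and To."""

    def __init__(self, frames_per_update=100):   # 100 frames = 5 s at 50 ms/frame
        self.frames_per_update = frames_per_update
        self.count = 0
        self.p_max = None
        self.p_min = None
        self.th = None    # expansion/contraction threshold Th
        self.to = None    # voice/silence threshold To

    def add_frame_power(self, power):
        # Update the running maximum and minimum with this frame's power.
        self.p_max = power if self.p_max is None else max(self.p_max, power)
        self.p_min = power if self.p_min is None else min(self.p_min, power)
        self.count += 1
        if self.count == self.frames_per_update:
            span = abs(self.p_max - self.p_min)
            self.th = span * 0.10 + self.p_min    # (Equation 1)
            self.to = span * 0.05 + self.p_min    # (Equation 2)
            # Reset so the next window reflects only the following frames.
            self.count = 0
            self.p_max = None
            self.p_min = None


tracker = ThresholdTracker(frames_per_update=4)
for p in [100, 1100, 500, 300]:
    tracker.add_frame_power(p)
print(tracker.th, tracker.to)
```

Because both thresholds are offsets from Pmin, a rise in background noise lifts Pmin and moves both thresholds up with it.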

【０１３８】無音区間圧縮の機能は、前記実施例１の図
３で説明したように、１秒以上連続した無音区間（有音
／無音しきい値Ｔｏよりパワーが小さい期間）を削除す
るものである。As described with reference to FIG. 3 of the first embodiment, silent-interval compression deletes silent intervals (periods in which the power is smaller than the voice/silence threshold To) that continue for 1 second or longer.

【０１３９】無音圧縮処理は、図１１に示す無音圧縮部
２３Ｂが行う。この無音圧縮処理は、後述する１フレー
ムを単位として実行される処理（以下、メイン処理と称
する）とは独立した処理であり、１フレーム分のメイン
処理終了後行われる。（図１４では、「遅れ量＝０」判
定（Ｓ１４３）と「ＰｏｗｅｒＯＮ？」判定（Ｓ１４
４）の間で行われる（図示はしていない）。The silence compression processing is performed by the silence compression unit 23B shown in FIG. 11. It is independent of the processing executed in units of one frame (hereinafter called the main processing), which is described later, and is performed after the main processing for one frame has finished. (In FIG. 14, it is performed between the "delay amount = 0" determination (S143) and the "Power ON?" determination (S144), although it is not shown there.)

【０１４０】無音圧縮部２３Ｂでは、入力バッファ２２
に溜まったデータを一定の単位（例えば１／４フレー
ム）毎に加算してパワーを算出し、そのパワーが「有音
／無音しきい値を下から上に横切った」時に無音圧縮動
作を開始する。なぜならば、無音区間が終了する時は、
パワーが小から大に変化する時であり、このときでない
と、それまで続いていた無音区間が、１秒以上あったか
否かの判断ができないからである。The silence compression unit 23B sums the data accumulated in the input buffer 22 over a fixed unit (for example, 1/4 frame) to calculate the power, and starts the silence compression operation when this power crosses the voice/silence threshold from below to above. This is because a silent section ends when the power changes from low to high, and only at that moment can it be determined whether the silent section that had continued until then lasted 1 second or longer.
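The trigger condition, power crossing the voice/silence threshold from below to above, can be illustrated with a short sketch. It assumes that "adding the data" means summing absolute sample values, as for the frame power; the function name and parameters are hypothetical.

```python
def rising_crossings(samples, unit_len, threshold):
    """Return indices of units whose power rises across the threshold.

    Power is computed per unit of `unit_len` samples (for example a quarter
    frame) as the sum of absolute values, which is an assumption here.
    """
    powers = [
        sum(abs(s) for s in samples[i:i + unit_len])
        for i in range(0, len(samples) - unit_len + 1, unit_len)
    ]
    # A crossing is a unit at or above the threshold preceded by one below it.
    return [
        k for k in range(1, len(powers))
        if powers[k - 1] < threshold <= powers[k]
    ]


signal = [0] * 8 + [50] * 4 + [0] * 4 + [60] * 4
print(rising_crossings(signal, unit_len=4, threshold=100))
```

Each returned index marks the end of a silent stretch, which is the moment at which the length of that stretch can be evaluated.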

【０１４１】無音圧縮処理が開始されると、まず、リン
グバッファメモリ２４のフレームヘッダを、過去の方向
に遡りながら検索する。リングバッファメモリ２４上の
圧縮データは、フレーム単位に圧縮されており、前述の
とおりフレームヘッダにそのフレームのパワー値が記録
されている。もし、１秒以上続いてパワーがＴｏより低
いフレームが続いていたら、無音削除可能となり、リン
グバッファメモリ２４への入力ポインタを、無音区間が
１秒続いた時点まで戻す。次の圧縮データの入力は、こ
の戻された時点から上書きするように記録する。これに
より、現時点の直前の１秒以上続く無音区間が、常に削
除される。When the silence compression processing starts, the frame headers in the ring buffer memory 24 are first searched backward in time. The compressed data in the ring buffer memory 24 is compressed frame by frame, and the power value of each frame is recorded in its frame header as described above. If frames with power lower than To have continued for 1 second or longer, the silence can be deleted, and the input pointer of the ring buffer memory 24 is moved back to the point at which the silent section had lasted 1 second. The next compressed data is then written so as to overwrite the buffer from this point. As a result, a silent section lasting 1 second or longer immediately before the present time is always deleted.
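One possible reading of this pointer rewind is sketched below. It models the frame headers as a plain list of power values in time order and ignores ring wraparound, both simplifying assumptions. It also assumes that the first second of silence is kept and only the excess is overwritten, which is how "the point at which the silent section had lasted 1 second" is interpreted here.

```python
def rewind_after_silence(frame_powers, to, max_silent_frames=20):
    """Return the frame index at which the next frame should be written.

    frame_powers: power values from the frame headers, oldest first.
    to: voice/silence threshold To.
    max_silent_frames: frames in 1 second (20 at 50 ms per frame).
    """
    run = 0
    for p in reversed(frame_powers):      # search backward in time
        if p >= to:
            break
        run += 1
    if run >= max_silent_frames:
        # Keep the first second of silence, overwrite everything after it.
        return len(frame_powers) - (run - max_silent_frames)
    return len(frame_powers)              # nothing to delete


print(rewind_after_silence([500] * 3 + [10] * 25, to=100))
```

Only frame headers are inspected, so the search does not require decompressing any audio data.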

【０１４２】（読み出し動作）後述する本実施例２の装
置のメイン処理は、フレーム単位で行われる。そこで、
図１１に示す波形処理部２３Ｄが現在処理中のフレーム
データを保持し、リングバッファメモリ２４からの読み
だしは、フレーム単位でまとめて行う。すなわち、まと
めて取り出す場合には、リングバッファメモリ２４への
アドレッシングが単純にアドレスを１つずつ増やす処理
で済むため、１つずつデータを取り出すよりも効率が良
い。(Read operation) The main processing of the apparatus of the second embodiment, described later, is performed frame by frame. The waveform processing unit 23D shown in FIG. 11 therefore holds the frame data currently being processed, and data is read from the ring buffer memory 24 one whole frame at a time. When a frame is read in one batch, addressing the ring buffer memory 24 only requires incrementing the address by one each time, which is more efficient than fetching samples individually.

【０１４３】リングバッファメモリ２４に保存されてい
るデータは、前述のとおり、圧縮されたデータであるた
め、波形処理を行う前に、この圧縮を解凍して元のデー
タに戻す必要がある。図１１に示す解凍部２３Ｃは、こ
のためのものである。まず、１フレーム分の圧縮データ
を入力とし、先頭の２つの８ビットデータを１６ビット
の上位／下位に配置して先頭データを作る。次に、圧縮
データの３つ目の値を先の先頭データに加え、２番目の
データを復元する。次に、その次の圧縮データ値をこの
２番目のデータに加えて３番目のデータを復元する。以
下次々に圧縮データを１つ前の復元データに加算する作
業を繰り返すことにより、全てのフレームデータを復元
させる。Since the data stored in the ring buffer memory 24 is compressed as described above, it must be decompressed and restored to the original data before waveform processing. The decompression unit 23C shown in FIG. 11 serves this purpose. It takes one frame of compressed data as input and first places the two leading 8-bit values in the upper and lower halves of a 16-bit word to form the first sample. Next, the third value of the compressed data is added to this first sample to restore the second sample. The following compressed value is then added to the second sample to restore the third sample. By repeating this addition of each compressed value to the previously restored sample, all the data of the frame is restored.
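The restoration steps above amount to a running sum. The sketch below assumes the compressed frame is a list of byte values (0 to 255) and that both the first sample and the differences are two's-complement signed; the function name is hypothetical.

```python
def decompress_frame(data):
    """Restore signed 16-bit samples from one delta-encoded frame."""
    first = (data[0] << 8) | data[1]          # upper byte, then lower byte
    if first >= 0x8000:                       # interpret as signed (assumption)
        first -= 0x10000
    samples = [first]
    for b in data[2:]:
        delta = b - 256 if b >= 128 else b    # signed 8-bit difference
        samples.append(samples[-1] + delta)   # add to previous restored sample
    return samples


print(decompress_frame([3, 232, 5, 254, 243]))
```

Each restored sample depends on the one before it, so a frame has to be decoded from its start, which is one reason the reads are organized in whole frames.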

【０１４４】次に、本実施例２の話速変換装置の基本動
作を簡単に説明する。Next, the basic operation of the speech speed conversion apparatus according to the second embodiment will be briefly described.

【０１４５】図１１に示すように、音声入力部２１によ
ってディジタル信号変換された音声は、まず、入力バッ
ファ２２に入力される。入力バッファ２２から読み出さ
れた音声信号は、ＤＳＰ１（図１）のＣＰＵ２３内の音
声圧縮部２３Ａに送られ、音声信号にデータ圧縮処理が
施され、リングバッファメモリ２４に記憶される。ま
た、前記音声信号は無音圧縮部２３Ｂに送られ、必要な
場合には無音圧縮処理をリングバッファメモリ２４に記
憶されているデータに施す。As shown in FIG. 11, the voice converted into a digital signal by the voice input unit 21 is first input to the input buffer 22. The audio signal read from the input buffer 22 is sent to the audio compression unit 23A in the CPU 23 of the DSP 1 (FIG. 1), the audio signal is subjected to data compression processing, and stored in the ring buffer memory 24. Further, the voice signal is sent to the silence compression unit 23B, and if necessary, the silence compression processing is performed on the data stored in the ring buffer memory 24.

【０１４６】リングバッファメモリ２４に記憶されてい
る音声信号のデータは、フレーム単位で解凍部２３Ｃに
送られ、解凍部２３Ｃで圧縮処理された音声データを解
凍し、波形処理部（話速変換処理部）２３Ｄに入力す
る。波形処理部（話速変換処理部）２３Ｄでは、機能選
択部２５により設定された条件に基づいて話速変換処理
等を行う。この話速変換処理等が施されたディジタル音
声データは、出力バッファ２６に保持される。出力バッ
ファ２６のデータを読み出して、音声出力部２７から話
速変換処理等が施された音声を出力する。The audio data stored in the ring buffer memory 24 is sent frame by frame to the decompression unit 23C, which decompresses the compressed audio data and inputs it to the waveform processing unit (speech speed conversion processing unit) 23D. The waveform processing unit 23D performs speech speed conversion and related processing based on the conditions set by the function selection unit 25. The processed digital audio data is held in the output buffer 26. The data in the output buffer 26 is then read out, and the audio output unit 27 outputs the processed voice.

【０１４７】すなわち、出力バッファ２６のデータを読
み出して、図１に示すように、Ｄ／Ａ変換器６により設
定した時間間隔でディジタル値からアナログ値に変換す
る。この変換により得られたアナログ信号はローパスフ
ィルタ８を通してアナログアンプ９に入力され、聞き手
の好み音圧レベルで両耳用ヘッドホン３２５により音と
して出力される。That is, the data in the output buffer 26 is read out and, as shown in FIG. 1, converted from digital values to analog values by the D/A converter 6 at set time intervals. The resulting analog signal is input to the analog amplifier 9 through the low-pass filter 8 and is output as sound by the binaural headphones 325 at the listener's preferred sound pressure level.

【０１４８】次に、本実施例２における１フレームを単
位として実行される処理（以下、メイン処理と称する）
について図１１，図１３，図１４を用いて説明する。Next, the processing executed in units of one frame in the second embodiment (hereinafter called the main processing) is described with reference to FIGS. 11, 13 and 14.

【０１４９】図１３及び図１４は本実施例２におけるメ
イン処理の手順を示すフローチャートである。FIGS. 13 and 14 are flowcharts showing the procedure of the main processing in the second embodiment.

【０１５０】本実施例２におけるメイン処理は、図１３
に示すように、Ｐower ＯＮして、「フェードイン」処
理を行う(Ｓ１３１）。すなわち、電源投入直後では、
出力バッファ２６に入っているデータは、不定である。
このため、電源投入直後には、音声とは無関係なデータ
出力される可能性があり、これを音声出力部２７からそ
のまま出力した場合、非常に大きなレベルの雑音となる
可能性がある。このことを防止するため、本実施例２で
は、フェードイン処理が実行され、電源投入後の一定時
間、音声出力部の出力が、出力バッファ内のデータに関
係なく、徐々に大きな音になるように出力バッファ内の
データの値が調整される。具体的には、出力バッファか
らＤ／Ａ変換器に１データ転送する毎にこのデータ値に
係数を掛け算し、この係数値を時間的に変化させること
で、この機能を実現する。この動作は図１１に示す制御
部２３Ｅが実行する。In the main processing of the second embodiment, as shown in FIG. 13, a "fade-in" process is performed after power-on (S131). Immediately after power-on, the contents of the output buffer 26 are undefined. Data unrelated to the voice may therefore be output, and if it were output from the audio output unit 27 as it is, it could produce a very loud noise. To prevent this, the second embodiment executes fade-in processing: for a fixed period after power-on, the data values in the output buffer are adjusted so that the output of the audio output unit becomes gradually louder regardless of the data in the output buffer. Specifically, each time one sample is transferred from the output buffer to the D/A converter, the sample value is multiplied by a coefficient, and the coefficient is varied over time. This operation is executed by the control unit 23E shown in FIG. 11.
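The per-sample coefficient multiplication can be sketched as follows. The text only says that the coefficient changes over time; the linear ramp from 0 to 1 and the parameter `ramp_len` are assumptions chosen for illustration.

```python
def fade_in(samples, ramp_len):
    """Scale samples by a gain that rises linearly from 0 to 1.

    ramp_len is the number of samples over which the gain reaches 1
    (a hypothetical parameter; the text does not give the ramp shape).
    """
    out = []
    for n, s in enumerate(samples):
        gain = min(n / ramp_len, 1.0)     # coefficient varied over time
        out.append(int(s * gain))         # applied as each sample is transferred
    return out


print(fade_in([1000] * 6, ramp_len=4))
```

Running the same ramp in the opposite direction, from 1 down to 0, gives a gradual reduction in volume instead.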

【０１５１】その後、「スルーモード」処理に入る。ス
ルーモード処理では、まず、「読書ポインタ一致」処理
が行われる（Ｓ１３２）。この読書ポインタ一致処理
は、音声入力部２１からデータを入力する際に、入力バ
ッファ２２へ入力した直後に、出力バッファ２６にも同
じデータを入力する処理である。この動作は、メモリ上
の入力番地を指す入力ポインタの値を、入力バッファ２
２へデータを入力した直後に、出力データのメモリ上の
番地を指す出力ポインタの値に、一致させることによっ
て実現する。この動作は、図１１では制御部２３Ｅが行
う。Next, the "through mode" processing is entered. In through mode, "read/write pointer matching" is performed first (S132). This processing makes the same data available to the output buffer 26 immediately after data from the audio input unit 21 is input to the input buffer 22. It is realized by making the value of the input pointer, which points to the input address in memory, coincide with the value of the output pointer, which points to the address of the output data in memory, immediately after data is input to the input buffer 22. In FIG. 11, this operation is performed by the control unit 23E.

【０１５２】読書ポインタ一致後、スルーモードでは、
スロースイッチ１０４及びリピートスイッチ１０５が押
された状態（ＯＮ状態）をチェックし（Ｓ１３３，Ｓ１
３４）、両方とも押されていない状態（ＯＦＦ状態）の
場合には、前の読書ポインタ一致処理（Ｓ１３２）に戻
り、スルーモードを続ける。したがって、スルーモード
を続けている間に生じる割り込み処理では、必ず入力さ
れたデータがそのまま出力されるため、音声出力部２７
は、入力音声と同じになる。After the read/write pointers have been matched, through mode checks whether the slow switch 104 and the repeat switch 105 are pressed (ON state) (S133, S134). If neither is pressed (OFF state), the processing returns to the read/write pointer matching (S132) and through mode continues. In the interrupt processing that occurs while through mode continues, the input data is therefore always output as it is, so the output of the audio output unit 27 is the same as the input voice.

【０１５３】前記スロースイッチ１０４、リピートスイ
ッチ１０５、リセットスイッチ１０６の各スイッチは、
図１１では機能選択部２５の中に含まれており、その状
態のチェックは制御部２３Ｅが行う。The slow switch 104, the repeat switch 105, and the reset switch 106 are all included in the function selection unit 25 in FIG. 11, and their states are checked by the control unit 23E.

【０１５４】前記スルーモード中にリピートスイッチ１
０５が押される（ＯＮする）と、別に用意したリピート
フラグ（図示せず）を０から１にして、「読みだしポイ
ンタ戻し」ルーチンを行う（Ｓ１３５）。この読みだし
ポインタ戻しルーチンの内部の処理手順のフローチャー
トを図１６に示す。この図１６の説明は後述する。When the repeat switch 105 is pressed (turned ON) during through mode, a separately prepared repeat flag (not shown) is changed from 0 to 1, and the "read pointer return" routine is executed (S135). FIG. 16 shows a flowchart of the processing procedure inside this routine, which is described later.

【０１５５】前記スルーモード中にスロースイッチ１０
４が押されると、図１３に示すように、「伸長処理にパ
ラメータ設定」を行うルーチン（Ｓ１３６）に飛ぶ。こ
のルーチンの内部の処理手順のフローチャートを図１７
に示す。この図１７の説明は後述する。When the slow switch 104 is pressed during through mode, the processing jumps, as shown in FIG. 13, to the routine that sets the parameters for expansion processing (S136). FIG. 17 shows a flowchart of the processing procedure inside this routine, which is described later.

【０１５６】伸長処理にパラメータ設定が行われた後
は、１フレーム分の波形伸長または短縮処理が行われる
（Ｓ１３７）。この１フレーム分の波形伸長処理の内部
の処理手段のフローチャートを図１８及び図１９に示
す。この図１８及び図１９の説明は後述する。After the parameters for expansion processing have been set, waveform expansion or contraction is performed for one frame (S137). FIGS. 18 and 19 show flowcharts of the processing procedure inside this one-frame waveform expansion processing, which is described later.

【０１５７】前記１フレームの分の処理が終了した後
は、各スイッチ類の押されているか否かの状態のチェッ
クに入る。ところで、１フレームの処理は、１フレーム
の時間長以内で終了するため、数１０ミリ秒（ｍsec）
のオーダーで完了する。一方、ユーザーが各スイッチ
（押ボタン）を押した場合、どんなに短い期間押し下げ
たとしても、これ以上の時間押し下げ状態が持続するよ
うなスイッチデバイスが本装置には使用される。このた
め、１フレームの処理を行う毎にスイッチの押し下げ状
態をチェックすれば、ユーザーに反応遅れを感じさせな
い程度の時間遅れで、希望する動作に移行できる。After the processing for one frame is finished, the state of each switch is checked. Because the processing of one frame finishes within the duration of one frame, it completes in the order of several tens of milliseconds (msec). The switches (push buttons) used in this apparatus, on the other hand, are devices whose pressed state lasts longer than this no matter how briefly the user presses them. Checking the switch states after every frame therefore allows the apparatus to move to the requested operation with a delay too short for the user to notice.

【０１５８】まず、リセットスイッチ１０６が押し下げ
られているか否かをチェックする（Ｓ１３８）。もし、
リセットスイッチ１０６が押し下げられている（Ｓ１３
８のＹesの場合）ならば、この時点で強制的にスルーモ
ードに移行する。First, it is checked whether the reset switch 106 is pressed (S138). If it is pressed (Yes in S138), the apparatus is forced into through mode at this point.

【０１５９】リセットスイッチ１０６が押し下げられて
いない（Ｓ１３８のＮoの場合）ならば、図１４に示す
ように、スロースイッチ１０４が押し下げられているか
否かをチェックする（Ｓ１３９）。もし、スロースイッ
チ１０４が押し下げられている（ＯＮの場合：Ｓ１３９
のＹesの場合）ならば、波形伸長処理を続けて次のフレ
ームでも行うよう伸長処理にパラメータをセットするル
ーチンに戻る。スロースイッチ１０４が押し下げ続けて
いる場合には、このループを回り続けることになる。ま
た、リピート再生及び追いかけ再生中にスロースイッチ
１０４を押し下げ続けた場合にも、このループを回り続
けることなる。If the reset switch 106 is not pressed (No in S138), it is checked whether the slow switch 104 is pressed, as shown in FIG. 14 (S139). If the slow switch 104 is pressed (ON: Yes in S139), the processing returns to the routine that sets the parameters for expansion processing so that waveform expansion is also performed for the next frame. While the slow switch 104 is held down, the processing keeps going around this loop. The same loop is also followed when the slow switch 104 is held down during repeat playback or chasing playback.

【０１６０】スロースイッチ１０４が開放されてた場合
（ＯＦＦの場合：Ｓ１３９のＮoの場合）には、次のリ
ピートの押し下げ状態の判定に移る（Ｓ１４０）。この
時点で、リピートスイッチ１０５の押し下げが検出され
るケースは、「リピート再生中にリピートスイッチを押
した」か「追いかけ再生中にリピートスイッチを押し
た」かのいずれかである。どちらの場合にしても、その
時のリングバッファメモリ２４の出力ポインタの位置か
ら約５秒戻った付近の無音部分よりリピート再生を開始
するように、読み出しポインタ戻しルーチンへ分岐す
る。When the slow switch 104 has been released (OFF: No in S139), the processing moves on to check whether the repeat switch is pressed (S140). A press of the repeat switch 105 can be detected at this point in only two cases: the repeat switch was pressed during repeat playback, or it was pressed during chasing playback. In either case, the processing branches to the read pointer return routine so that repeat playback starts from a silent portion near the point about 5 seconds back from the current output pointer position in the ring buffer memory 24.

【０１６１】スロースイッチ１０４が開放されており、
リピートスイッチ１０５も押し下げられていない場合に
は、次のリピート終了判定に移る（Ｓ１４１）。リピー
ト動作は、出力ポインタが、スルーモードからリピート
スイッチ１０５の押し下げによりリピート動作に移った
時の出力ポインタの位置に戻るまで、連続して続けられ
る。すなわち、ここでの判定により、現在リピートモー
ドにあり、かつ、出力ポインタの位置が、リピートを開
始した時点での出力ポインタ位置まで戻っていない場合
には、前記の１フレーム分の波形伸長・短縮処理に戻る
よう処理ループが形成される。これ以降の処理は、追い
かけ再生のための処理である。If the slow switch 104 is released and the repeat switch 105 is not pressed either, the processing moves on to the repeat end determination (S141). The repeat operation continues until the output pointer returns to the position it had when the apparatus moved from through mode to the repeat operation because the repeat switch 105 was pressed. That is, if this determination finds that the apparatus is currently in repeat mode and the output pointer has not yet returned to its position at the start of the repeat, a processing loop is formed that returns to the one-frame waveform expansion/contraction processing described above. The processing from here on is for chasing playback.

【０１６２】リピート再生が終了した後及びスロー（ゆ
っくり）再生が終了した後は、追いかけ再生に移動す
る。追いかけ再生とは、１フレーム毎の波形短縮処理を
繰り返すことによって実現される早聞き再生によって、
リピートあるいはスロー（ゆっくり）再生により生じた
実時間からの遅れを取り戻す動作である。ここの部分の
処理では、追いかけ再生のための波形短縮処理用にパラ
メータ設定を行う（Ｓ１４２）。After repeat playback or slow playback has finished, the apparatus moves to chasing playback. Chasing playback recovers the delay from real time caused by repeat or slow playback by means of fast playback, which is realized by repeating the waveform shortening processing frame by frame. In this part of the processing, the parameters for the waveform shortening used in chasing playback are set (S142).

【０１６３】実時間からの遅れ量は、リピートボタンを
押し下げた時及び波形伸長処理を実行した時に増加し、
逆に波形短縮処理を実行した時には減少する。The delay from real time increases when the repeat button is pressed and when waveform expansion is executed, and decreases when waveform shortening is executed.

【０１６４】なお、後述する１フレーム分の波形伸長・
短縮処理の手順（フローチャート）を示す図１８及び図
１９には、この遅れ量の増加／減少を行う処理は図示さ
れていない。Note that FIGS. 18 and 19, which show the procedure (flowcharts) of the one-frame waveform expansion/contraction processing described later, do not show the processing that increases or decreases this delay amount.

【０１６５】この実時間からの遅れ量があるかないかを
判定し（Ｓ１４３）、まだ、遅れ量がある場合には、追
いかけ再生を連続して行うため、処理ループを形成して
いる。すなわち、追いかけ再生は、遅れ量が０になるま
で続けて行われるという動作が、この判定により実現さ
れる。It is determined whether any delay from real time remains (S143); if it does, a processing loop is formed so that chasing playback continues. This determination thus ensures that chasing playback continues until the delay amount becomes 0.

【０１６６】ところで、以上説明したメイン処理では、
話速変換あるいはリピート動作によって生じた実時間か
らの時間遅れは、カウンタを用いて「遅れ量」として管
理されている。In the main processing described above, the delay from real time caused by speech speed conversion or the repeat operation is managed as a "delay amount" using a counter.

【０１６７】実時間からの時間遅れは、現在のサンプル
データが入力されるリングバッファ２４上の位置と、出
力を行っているデータの位置入力されているリングバッ
ファ２４上の位置の差、すなわち、２つのポインタが示
すアドレスの差でも管理することが可能であるが、本発
明では前述のように遅れ量カウンタで管理する方式を用
いる。これは、リングバッファ２４上の入出力ポインタ
間のアドレスの差では、遅れ量を正しく表現できない場
合があるからである。The time delay from the real time is the difference between the position on the ring buffer 24 to which the current sample data is input and the position on the ring buffer 24 to which the position of the data being output is input, that is, Although it is possible to manage the difference between the addresses indicated by the two pointers, the present invention uses the method of managing with the delay amount counter as described above. This is because the delay amount may not be correctly expressed by the difference in address between the input / output pointers on the ring buffer 24.

【０１６８】例えば、リングバッファ２４に割り当てる
メモリアドレス空間を、０番地から１０００番地とする
と、プログラム中では「１０００番地の次は０番地に飛
ぶ」として扱うことにより、リングバッファ２４を実現
する。このため入出力ポインタがこのアドレスの切れ目
をまたいでいる場合には、単純にアドレス値の差を取っ
ただけでは、この間のデータ量を表現できない。アドレ
ス計算により、このポインタ間のデータ量を知るには、
２つのポインタがその位置に至るまでの経過を踏まえた
複雑な場合分けを含むアドレス値計算が必要になる。For example, if the memory address space allocated to the ring buffer 24 runs from address 0 to address 1000, the ring buffer 24 is realized by having the program treat address 0 as the address that follows address 1000. When the input and output pointers lie on opposite sides of this address boundary, simply subtracting the address values does not give the amount of data between them. Obtaining the amount of data between the pointers by address calculation requires an address computation with complicated case analysis that takes into account how the two pointers reached their positions.

【０１６９】本発明の話速変換装置では、リングバッフ
ァ２４へのデータの読み書きを行う毎に、遅れ量カウン
タの値を増減させることで時間遅れ量を管理し、複雑な
アドレス計算による処理量の増大を防いでいる。The speech speed conversion apparatus of the present invention manages the delay by incrementing or decrementing the delay amount counter every time data is written to or read from the ring buffer 24, which avoids the extra processing that complicated address calculation would require.
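The difference between a delay counter and a raw pointer subtraction can be seen in a small sketch. The class below is an illustrative model, not the patent's data structure: the names are hypothetical, the buffer holds plain values rather than compressed frames, and overflow (writing more than `size` unread items) is not handled.

```python
class RingBuffer:
    """Ring buffer that tracks the read/write gap with a counter."""

    def __init__(self, size):
        self.buf = [0] * size
        self.size = size
        self.w = 0        # input (write) pointer
        self.r = 0        # output (read) pointer
        self.delay = 0    # items written but not yet read

    def write(self, value):
        self.buf[self.w] = value
        self.w = (self.w + 1) % self.size   # wrap to address 0 after the last one
        self.delay += 1

    def read(self):
        if self.delay == 0:
            raise IndexError("no unread data")
        value = self.buf[self.r]
        self.r = (self.r + 1) % self.size
        self.delay -= 1
        return value


rb = RingBuffer(8)
for v in range(6):
    rb.write(v)
for _ in range(5):
    rb.read()
for v in range(4):
    rb.write(v)
print("counter:", rb.delay, "pointer difference:", rb.w - rb.r)
```

After the write pointer has wrapped past the end of the buffer, the plain pointer difference is negative while the counter still reports the true number of unread items.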

【０１７０】前記メイン処理は、電源スイッチがＯＦＦ
にされるまで、前記の処理を繰り返して行う無限ループ
になっている（Ｓ１４４）。The main processing is an infinite loop that repeats the above processing until the power switch is turned OFF (S144).

【０１７１】電源スイッチがＯＦＦになった場合には、
いきなり処理を停止するのではなく、一定時間処理を続
けてから停止する（ミュート）（Ｓ１４５）。この間、
出力される音声の大きさが徐々に小さくなるような処理
をここで行う。When the power switch is turned OFF, the processing does not stop abruptly but continues for a fixed time before stopping (mute) (S145). During this period, the output is processed so that the volume of the output voice decreases gradually.

【０１７２】具体的には、始めのフェードイン動作と同
様、割り込処理において、出力バッファ２６から図１に
示すＤ／Ａ変換器６に１データ転送する毎にこのデータ
値に係数を掛け算し、この係数値を時間的に変化させる
ことで、この機能を実現する。この動作は図１１に示す
制御部２３Ｅが実行する。Specifically, as in the initial fade-in operation, each time one sample is transferred in the interrupt processing from the output buffer 26 to the D/A converter 6 shown in FIG. 1, the sample value is multiplied by a coefficient, and the coefficient is varied over time. This operation is executed by the control unit 23E shown in FIG. 11.

【０１７３】図１５は、以上説明した本実施例２におけ
る各モード間の遷移を模式的に示す状態遷移図であり、
この図１５によれば、スイッチ操作に伴うモードの切り
替わり方が良くわかるであろう。なお、図１５中のスタ
ンバイモードについては後に詳しく説明する。FIG. 15 is a state transition diagram schematically showing the transitions between the modes of the second embodiment described above; it shows clearly how the modes change in response to switch operations. The standby mode in FIG. 15 is described in detail later.
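The mode transitions walked through in the main processing (S133 to S143) can be condensed into one decision function. This is a sketch based only on the flow described in the text, not a transcription of FIG. 15: the standby mode is omitted because it has not been described yet, the mode names are hypothetical labels, and checking the slow switch before the repeat switch in through mode is an assumption.

```python
def next_mode(mode, slow, repeat, reset, delay, repeat_done):
    """Pick the mode for the next frame from the switch states.

    mode: current mode, one of "THROUGH", "SLOW", "REPEAT", "CHASE".
    slow, repeat, reset: True while the corresponding switch is pressed.
    delay: current delay from real time (0 means caught up).
    repeat_done: True once the output pointer is back at its starting point.
    """
    if reset:                                   # S138: forced through mode
        return "THROUGH"
    if slow:                                    # S133 / S139
        return "SLOW"
    if repeat:                                  # S134 / S140
        return "REPEAT"
    if mode == "REPEAT" and not repeat_done:    # S141: repeat still running
        return "REPEAT"
    if delay > 0:                               # S143: still behind real time
        return "CHASE"
    return "THROUGH"


print(next_mode("SLOW", slow=False, repeat=False, reset=False,
                delay=50, repeat_done=False))
```

Releasing the slow switch with a nonzero delay leads into chasing playback, and the function returns to through mode only once the delay has reached 0.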

【０１７４】以下、前述した本実施例２の各ルーチンの
処理動作の詳細について説明する。The details of the processing operation of each routine of the second embodiment described above will be described below.

【０１７５】図１６は読書ポインタ戻しルーチンの処理
手順を示すフローチャートである。FIG. 16 is a flowchart showing the processing procedure of the read pointer return routine.

【０１７６】本実施例２の読書ポインタ戻しルーチン
は、リピート機能を実現するために必要なリングバァフ
ァ２４からデータを読みだす位置を指す出力ポインタの
値を変えるための具体的な方法である。The read pointer return routine of the second embodiment is a concrete method for changing the value of the output pointer, which indicates the position from which data is read out of the ring buffer 24, as required to realize the repeat function.

【０１７７】図１６に示すように、まず、現時点での出
力ポインタ位置をＰoutにセットする（Ｓ１６１）。次
に、現時点での実時間から遅れ量をＤに設定（セット）
する（Ｓ１６２）。As shown in FIG. 16, the current output pointer position is first set in Pout (S161). Next, the current delay from real time is set in D (S162).

【０１７８】もし、現時点ですでに遅れ量が大きく、さ
らに、５秒分遅れ量（Ｂ５）を増やすと、リングバッフ
ァメモリ２４のサイズを越えてしまうかを判定し（Ｓ１
６３）、越えてしまうと判定される場合（Ｓ１６３のＹ
esの場合）には、Ｐout及びＤはそのままにして（Ｓ１
６９，Ｓ１７０）、このルーチンを終了する。It is then determined whether the delay is already so large that increasing it by a further 5 seconds' worth (B5) would exceed the size of the ring buffer memory 24 (S163). If it would (Yes in S163), Pout and D are left unchanged (S169, S170) and the routine ends.

【０１７９】５秒分遅れ量（Ｂ５）を増加しても大丈夫
な場合（Ｓ１６３のＮoの場合）には、ポインタを５秒
分戻し（−Ｂ５）、遅れ量を５秒分増やす（＋Ｂ５）
（Ｓ１６４）。If the delay can safely be increased by 5 seconds' worth (B5) (No in S163), the pointer is moved back by 5 seconds' worth (-B5) and the delay is increased by 5 seconds' worth (+B5) (S164).

【０１８０】次に、リピートの開始が、言葉の切れ目に
なるように、逆方向に無音区間をサーチする処理を開始
する。まず、リングバッファメモリ２４上でＰoutが指
す位置から、逆方向にデータをアクセスし、１フレーム
分のパワーを計算する（Ｓ１６５）。Next, a process for searching a silent section in the reverse direction is started so that the start of repeat is a break between words. First, data is accessed in the reverse direction from the position indicated by Pout on the ring buffer memory 24, and the power for one frame is calculated (S165).

【０１８１】この時、１フレーム分（Ｆ）出力ポインタ
を戻す（−Ｆ）と、遅れ量もさらに１フレーム分増える
ことになる（＋Ｆ）が、遅れ量が１フレーム分増える
と、遅れ量のトータル量がリングバッファメモリサイズ
を越えてしまうかを判断し（Ｓ１６６）、越えてしまう
と判断される場合（Ｓ１６６のＹesの場合）には、この
無音部分サーチを中止し、この時のＰout及びＤを出力
ポインタ値及び遅れ量としてセットし（Ｓ１６９，Ｓ１
７０）、このルーチンを終了する。Moving the output pointer back by one frame (-F) also increases the delay by one frame (+F). It is therefore determined whether increasing the delay by one frame would make the total delay exceed the ring buffer memory size (S166). If it would (Yes in S166), the search for a silent portion is abandoned, the current Pout and D are set as the output pointer value and the delay amount (S169, S170), and the routine ends.

【０１８２】１フレーム分出力ポインタを戻しても、ト
ータルの遅れ量がリングバッファメモリサイズを越えな
い場合（Ｓ１６６のＮoの場合）には、Ｐoutを１フレー
ム分の長さ戻し、遅れ量Ｄを１フレーム分増やして（Ｓ
１６７）から、計算された１フレーム分のパワーＷと有
音／無音のしきい値との比較を行う（Ｓ１６８）。１フ
レーム分のパワーＷがこのしきい値より小さい場合に
は、このフレーム付近が言葉の切れ目であると判断して
（Ｓ１６８のＮoの場合）には、この時のＰout及びＤを
出力ポインタ値及び遅れ量としてセットし（Ｓ１６９，
Ｓ１７０）、このルーチンを終了する。If moving the output pointer back by one frame does not make the total delay exceed the ring buffer memory size (No in S166), Pout is moved back by one frame length and the delay D is increased by one frame (S167), and the calculated one-frame power W is then compared with the voice/silence threshold (S168). If W is smaller than this threshold (No in S168), the vicinity of this frame is judged to be a break between words, the current Pout and D are set as the output pointer value and the delay amount (S169, S170), and the routine ends.

【０１８３】もし、１フレーム分のパワーＷがこのしき
い値より大きい場合（Ｓ１６８のＹesの場合）には、さ
らに、１フレーム戻り、同じように無音部分サーチを続
け、無音部分が検出されるが、遅れ量がリングバッファ
メモリサイズを越えるまで続けられる。このようにし
て、リピートスイッチ１０５が押された際の出力ポイン
タ戻しの処理を完了する。If the one-frame power W is larger than this threshold (Yes in S168), the pointer moves back one more frame and the search for a silent portion continues in the same way, until a silent portion is detected or the delay would exceed the ring buffer memory size. This completes the output pointer return processing performed when the repeat switch 105 is pressed.
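Steps S161 to S170 can be put together in a single sketch. It is a simplified model under stated assumptions: the ring buffer is a Python list of raw samples rather than compressed frames with headers, positions and delays are counted in samples, and the function and parameter names (`b5` for 5 seconds' worth of samples, `frame_len` for F) are chosen for illustration.

```python
def rewind_read_pointer(buf, p_out, delay, b5, frame_len, to):
    """Move the output pointer back about 5 s, then on to a silent frame.

    Returns the new (p_out, delay). buf is treated as a ring of len(buf)
    samples; `to` is the voice/silence threshold To.
    """
    size = len(buf)
    if delay + b5 > size:                     # S163: no room to go back 5 s
        return p_out, delay                   # S169, S170: leave unchanged
    p_out = (p_out - b5) % size               # S164: pointer back by B5
    delay += b5                               #       delay up by B5
    while delay + frame_len <= size:          # S166: room for one more frame?
        # S165: power of the frame immediately before p_out
        w = sum(abs(buf[(p_out - k) % size]) for k in range(1, frame_len + 1))
        p_out = (p_out - frame_len) % size    # S167: back one frame
        delay += frame_len
        if w < to:                            # S168: silent frame, word break
            break
    return p_out, delay                       # S169, S170


ring = [100] * 40
ring[14:18] = [0] * 4                         # one silent frame
print(rewind_read_pointer(ring, p_out=30, delay=0, b5=8, frame_len=4, to=50))
```

In this sketch the pointer ends up at the start of the silent frame, so playback resumes at a pause rather than in the middle of a word.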

【０１８４】図１７及び図１８は、本実施例２における
１フレーム分の波形伸長・短縮処理手順を示すフローチ
ャートである。FIG. 17 and FIG. 18 are flowcharts showing the waveform expansion / contraction processing procedure for one frame in the second embodiment.

【０１８５】本実施例２における１フレーム分の波形伸
長・短縮処理は、図１７及び図１８に示すように、ま
ず、始めに、今回の１フレーム分のデータのパワーを計
算する（Ｓ１７１）。次に、このパワー値Ｐをしきい値
Ｔｈと比較し（Ｓ１７２）、しきい値Ｔｈより大きいパ
ワーを有するフレームについては、以下に述べる処理を
施し、しきい値Ｔｈより小さいパワーを有するフレーム
のデータは、何もせずにそのまま出力してリングバッフ
ァ２４へ転送するか（Ｓ１７３）、子音強調処理を施し
てから出力バッファ２６へ転送する。子音強調を行うか
否かは、隠しスイッチの１つであるモードスイッチの状
態によって決定する。In the waveform expansion / contraction processing for one frame in the second embodiment, as shown in FIGS. 17 and 18, first, the power of the data for one frame this time is calculated (S171). Next, this power value P is compared with a threshold value Th (S172), and for the frame having the power larger than the threshold value Th, the processing described below is performed, and the frame having the power smaller than the threshold value Th is processed. The data is output as it is without any processing and transferred to the ring buffer 24 (S173), or is subjected to the consonant emphasis processing and then transferred to the output buffer 26. Whether to enhance the consonant is determined by the state of the mode switch, which is one of the hidden switches.

【０１８６】子音強調処理の具体的な実現方法として
は、例えば、しきい値Ｔｈ以上のパワーを持つフレーム
の直前のしきい値Ｔｈ以下のパワーを持つフレームを子
音とみなして、そのフレーム中のデータの値を大きくす
る方法が考えられる。As a concrete way of realizing consonant emphasis, for example, a frame whose power is at or below the threshold Th and which immediately precedes a frame whose power is at or above Th can be regarded as a consonant, and the data values in that frame can be increased.
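The consonant rule described here, a low-power frame immediately followed by a high-power frame, can be sketched as below. The gain value, the clipping to the 16-bit range, and the use of a strict comparison for the low-power frame are assumptions, as the text only says that the data values are increased.

```python
def emphasize_consonants(frames, powers, th, gain=2):
    """Amplify frames below Th that directly precede a frame at or above Th.

    frames: list of sample lists; powers: matching list of frame powers.
    gain and the 16-bit clipping are illustrative assumptions.
    """
    out = []
    for k, frame in enumerate(frames):
        is_consonant = (
            powers[k] < th
            and k + 1 < len(frames)
            and powers[k + 1] >= th
        )
        if is_consonant:
            out.append([max(-32768, min(32767, s * gain)) for s in frame])
        else:
            out.append(list(frame))
    return out


frames = [[1, -1], [2, -2], [50, -50], [1, 1]]
print(emphasize_consonants(frames, powers=[2, 4, 100, 2], th=10))
```

The rule needs the power of the following frame, so a frame can only be classified once its successor is available.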

【０１８７】前記パワー判定でしきい値Ｔｈより大きい
パワーを有するフレーム（Ｓ１７２のＹesの場合）で
は、まず、１フレームのデータ数を未処理データ量を表
わす変数Ｚに記憶してから（Ｓ１７４）、フレームの先
頭からピッチ抽出処理を行う（Ｓ１７５）。ピッチ抽出
処理には、いくつかの方法が考えられるが、例えば、良
く知られた自己相関を用いたアルゴリズムによりフレー
ム先頭でのピッチ長を抽出する。In a frame whose power is larger than the threshold value Th in the power judgment (Yes in S172), the number of data items in one frame is first stored in the variable Z, which represents the amount of unprocessed data (S174), and pitch extraction is then performed from the beginning of the frame (S175). Several methods are conceivable for pitch extraction; for example, the pitch length at the beginning of the frame is extracted by a well-known autocorrelation-based algorithm.
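
One common form of autocorrelation pitch extraction is sketched below. The text does not say which variant is used, so the normalized autocorrelation, the search range parameters `min_lag` and `max_lag`, and the function name are all assumptions.

```python
import math
from typing import Sequence


def estimate_pitch_length(samples: Sequence[float],
                          min_lag: int, max_lag: int) -> int:
    """Return the lag in [min_lag, max_lag] with the highest normalized autocorrelation."""
    best_lag, best_score = min_lag, float("-inf")
    for lag in range(min_lag, max_lag + 1):
        n = len(samples) - lag
        if n <= 0:
            break  # not enough data to compare at this lag
        num = sum(samples[i] * samples[i + lag] for i in range(n))
        e1 = sum(samples[i] ** 2 for i in range(n))
        e2 = sum(samples[i + lag] ** 2 for i in range(n))
        den = math.sqrt(e1 * e2)
        score = num / den if den > 0 else 0.0
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag
```

Restricting the lag range to plausible pitch periods keeps the search from locking onto a multiple of the true period.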

【０１８８】次に、この抽出されたピッチ長２個分のデ
ータ量と、未処理データの量とを比較し（Ｓ１７６）、
未処理データ量Ｚが、この抽出されたピッチの２個分の
データ量より少ない場合には、この処理を中止する。Next, the amount of data corresponding to two of the extracted pitch lengths is compared with the amount of unprocessed data (S176). If the unprocessed data amount Z is smaller than the data amount of two extracted pitches, this processing is stopped.

【０１８９】未処理データ量Ｚが２ピッチ分以上ある場
合（Ｓ１７６のＹesの場合）には、まず、前転送処理を
行う（Ｓ１７８）。前転送処理とは、以下に述べる合成
波形挿入処理の前に、入力データの一部を、そのまま出
力バッファ２６に転送する処理であり、前記実施例１の
図２の（ｂ）の部分に相当する。前転送処理で転送する
データ数は、ピッチ単位で設定するが、その数は波形伸
長率／短縮率により異なり、後述する図１９で説明する
パラメータ設定ルーチンにより、この数Ｎｐｆが設定さ
れる（Ｓ１７７）。前転送処理した（Ｓ１７８）後に
は、未処理データ量Ｚを、転送したデータ数の分だけ少
なくする（Ｓ１７９）。When the unprocessed data amount Z is equal to or more than two pitches (Yes in S176), the pre-transfer processing is performed first (S178). The pre-transfer processing transfers a part of the input data, as it is, to the output buffer 26 before the composite waveform insertion processing described below, and corresponds to part (b) of FIG. 2 of the first embodiment. The number of data items transferred in the pre-transfer processing is set in pitch units; it depends on the waveform expansion/shortening rate, and this number Npf is set by the parameter setting routine described later with reference to FIG. 19 (S177). After the pre-transfer processing (S178), the unprocessed data amount Z is reduced by the number of data items transferred (S179).

【０１９０】次に、図１９に示したパラメータ設定ルー
チン中の設定される別のパラメータＰｔｒｉに応じて、
合成波形生成のための△窓関数を適用する位置を決める
（Ｓ１８０）。伸長と短縮で異なるのは、△窓関数を用
いて合成波形を生成するときの、窓関数をかける現波形
上の位置だけである。Next, the position at which the triangular (△) window function for generating the composite waveform is applied is determined (S180) according to another parameter, Ptri, which is set in the parameter setting routine shown in FIG. 19. The only difference between expansion and shortening is the position on the current waveform at which the window function is applied when the composite waveform is generated.

【０１９１】すなわち、波形伸長の場合には、図２で示
したように、１ピッチ分の波形から２ピッチ分の波形が
生成されるように、△窓関数を適用する（Ｓ１８１）。
一方、波形短縮の場合には、図２０から図２２で示した
ように、３ピッチあるいは４ピッチ分の波形から２ピッ
チ分の波形が生成されるように、△窓関数を適用する。
この合成波形の挿入により、実時間からの遅れ量が変化
する（図示せず）。That is, in the case of waveform expansion, the triangular (△) window function is applied so that a waveform of two pitches is generated from a waveform of one pitch, as shown in FIG. 2 (S181). In the case of waveform shortening, on the other hand, the triangular window function is applied so that a waveform of two pitches is generated from a waveform of three or four pitches, as shown in FIGS. 20 to 22. Inserting this composite waveform changes the amount of delay from real time (not shown).
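
The idea of building a composite pitch period with a triangular window can be illustrated with a generic cross-fade. This sketch does not reproduce the pitch counts of FIG. 2 or FIGS. 20 to 22, which are not available here; it is a simplified, commonly used scheme in which two adjacent periods are blended with complementary linear ramps, and the function names are hypothetical. It does show how expansion and shortening can differ only in which period each ramp is applied to.

```python
from typing import List, Sequence


def blend(head: Sequence[float], tail: Sequence[float]) -> List[float]:
    """Cross-fade from `head` to `tail` with complementary linear (triangular) weights."""
    if len(head) != len(tail):
        raise ValueError("both pitch periods must have the same length")
    n = len(head)
    return [head[i] * (1 - i / n) + tail[i] * (i / n) for i in range(n)]


def expand_pair(a: Sequence[float], b: Sequence[float]) -> List[float]:
    """Insert a composite period between a and b (two periods become three)."""
    # The composite starts like b (what naturally follows a) and ends like a
    # (what naturally precedes b), so both joins stay smooth.
    return list(a) + blend(b, a) + list(b)


def shorten_pair(a: Sequence[float], b: Sequence[float]) -> List[float]:
    """Replace a and b with one composite period (two periods become one)."""
    # Here the ramps are applied the other way round: start like a, end like b.
    return blend(a, b)
```

For a perfectly periodic input the composite equals the original period, so neither operation changes the pitch; only the duration changes.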

【０１９２】合成波形挿入処理の後には、未処理データ
量Ｚを、処理データ数の分だけ少なくする（Ｓ１８
２）。After the composite waveform insertion processing, the unprocessed data amount Z is reduced by the number of data items processed (S182).

【０１９３】次に、もう１度、ピッチ抽出処理を行う
（Ｓ１８３）。これは人間の音声のピッチが常に変動し
ていることに対応する処理で、ピッチの抽出をやり直す
ことによって、実際のピッチ長と、処理を行うピッチ長
との誤差を減少させ、その結果として、伸長／短縮後の
波形における歪みの増加を防いでいる。Next, pitch extraction processing is performed again (S183). This is a process corresponding to the fact that the pitch of human voice is constantly changing, and by re-extracting the pitch, the error between the actual pitch length and the pitch length to be processed is reduced, and as a result, This prevents an increase in distortion in the waveform after stretching / shortening.

【０１９４】次に、図１８に示すように、未処理データ
数と、新たに抽出したピッチ長の２個分のデータ量との
比較を行う（Ｓ１８４）。もし、２ピッチ分のデータ量
が残っていなければ（Ｓ１８４のＮoの場合）、ただち
に、この処理を中止する。Next, as shown in FIG. 18, the number of unprocessed data is compared with the data amount of two newly extracted pitch lengths (S184). If the data amount for two pitches does not remain (in the case of No in S184), this process is immediately stopped.

【０１９５】２ピッチ分以上のデータ量が残っていれば
（Ｓ１８４のＹesの場合）、後転送処理を行う。後転送
処理とは、前転送処理と同様の処理で前記実施例１にお
ける図２の（ｅ）の部分に相当する。後転送処理で転送
するデータ数は、ピッチ単位で設定するが、その数は波
形伸長率／短縮率により異なり、図１９で説明するパラ
メータ設定ルーチンによりこの数Ｎｐｒが設定される
（Ｓ１８５）。後転送処理（Ｓ１８６）の後には、未処
理データ量Ｚを、転送したデータ数の分だけ少なくする
（Ｓ１８７）。If the data amount of 2 pitches or more remains (Yes in S184), post-transfer processing is performed. The post-transfer process is the same process as the pre-transfer process and corresponds to the part (e) of FIG. 2 in the first embodiment. The number of data to be transferred in the post-transfer process is set in pitch units, but the number differs depending on the waveform expansion rate / shortening rate, and this number Npr is set by the parameter setting routine described in FIG. 19 (S185). After the post-transfer processing (S186), the unprocessed data amount Z is reduced by the number of transferred data (S187).

【０１９６】以上の処理が、途中２回行われる２ピッチ
分のデータ量と未処理量との比較で、この処理が中断さ
れるまで、連続して繰り返し行われる。The above processing is repeated continuously until it is terminated by one of the two comparisons, made partway through, between the data amount of two pitches and the amount of unprocessed data.
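
The control flow of this loop, with its two exit checks, can be sketched as below. To keep the example short, the pitch length is held constant instead of being re-extracted at S183, the composite step is assumed to consume one pitch of input, and each transfer is clamped to the data that remains; these simplifications and the name `plan_frame` are assumptions, not details taken from the flowcharts.

```python
from typing import List, Tuple


def plan_frame(frame_len: int, pitch_len: int,
               n_pre: int, n_post: int) -> Tuple[List[Tuple[str, int]], int]:
    """List the steps taken for one frame and the unprocessed amount left over."""
    if pitch_len <= 0:
        raise ValueError("pitch_len must be positive")
    z = frame_len                      # unprocessed data amount Z (S174)
    steps: List[Tuple[str, int]] = []
    while z >= 2 * pitch_len:          # first check (S176)
        take = min(n_pre * pitch_len, z)
        steps.append(("pre_transfer", take))       # S178
        z -= take                                  # S179
        take = min(pitch_len, z)
        steps.append(("insert_composite", take))   # S181
        z -= take                                  # S182
        if z < 2 * pitch_len:          # second check (S184)
            break
        take = min(n_post * pitch_len, z)
        steps.append(("post_transfer", take))      # S186
        z -= take                                  # S187
    return steps, z
```

With a 400-sample frame, a 50-sample pitch, and one pitch transferred before and after each insertion, the loop runs three times and stops at the second check of the third pass.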

【０１９７】図１９は本実施例２における伸長処理にパ
ラメータを設定するパラメータ設定処理ルーチンの処理
手順を示すフローチャートである。FIG. 19 is a flowchart showing the processing procedure of the parameter setting routine, which sets the parameters for the expansion processing in the second embodiment.

【０１９８】図１９に示すパラメータ設定ルーチンは、
実は、図１３及び図１４に示すメイン処理の中で２回使
われている。そのうちの１回は前記１フレーム分の波形
伸長・短縮処理ルーチンの直前で行われ、もう１回は、
リピート終了判定後の「短縮処理にパラメータを設定す
る処理」で使われる。The parameter setting routine shown in FIG. 19 is in fact used twice in the main processing shown in FIGS. 13 and 14. One call is made immediately before the waveform expansion/shortening routine for one frame, and the other is made in the "processing for setting parameters for the shortening processing" after the repeat end determination.

【０１９９】ところで、波形短縮処理とは、ゆっくり聞
いた後、またはリピートを行った後に続けて行う「追い
かけ処理（早聞き処理）」を実現するためのものであ
る。波形伸長処理の中で行っていた△窓関数を用いた合
成波形の生成を、窓関数をかける位置を伸長する場合と
逆方向にずらして行うと、波形短縮になる。The waveform shortening processing is used to realize the "chasing processing (fast-listening processing)" that follows slow listening or a repeat. If the composite waveform is generated with the triangular (△) window function, as in the waveform expansion processing, but with the window position shifted in the direction opposite to that used for expansion, the waveform is shortened.

【０２００】図１９において、始めに伸長か短縮かの判
定を行う（Ｓ１９１）。これは前述の２回のうちのどち
らかを判定する。In FIG. 19, it is first determined whether expansion or shortening is required (S191). This identifies which of the two calls described above is being made.

【０２０１】伸長処理にパラメータ設定を行う場合に
は、この判定の後、話速選択スイッチの位置をチェック
し（Ｓ１９２）、スイッチ位置に応じて伸長率ｅを設定
し（Ｓ１９３）、伸長率ｅに応じて波形伸長処理の中で
使われるパラメータＮｐｆとＮｐｒの位置を設定し（Ｓ
１９４，Ｓ１９５）、さらに、波形伸長処理の中で行わ
れる△窓との積和演算を始める位置を示すパラメータＰ
ｔｒｉを設定して、このルーチンを終わる。When parameters are set for the expansion processing, after this determination the position of the speech speed selection switch is checked (S192), the expansion rate e is set according to the switch position (S193), the parameters Npf and Npr used in the waveform expansion processing are set according to the expansion rate e (S194, S195), and the parameter Ptri, which indicates the position at which the product-sum operation with the triangular (△) window performed in the waveform expansion processing starts, is set. The routine then ends.

【０２０２】一方、短縮処理にパラメータ設定を行う場
合には、図１９の右側のフローを実行する。まず、追い
かけモードスイッチ（隠しスイッチの一つ）の位置をチ
ェックし（Ｓ１９６）、追いかけモード（Ｍcat）が
「飛び」「早聞き」「１倍」のどれに設定されているか
をチェックする（Ｓ１９７，Ｓ１９８）。On the other hand, when parameters are set for the shortening processing, the flow on the right side of FIG. 19 is executed. First, the position of the chase mode switch (one of the hidden switches) is checked (S196), and it is determined whether the chase mode (Mcat) is set to "jump", "fast listening", or "1x" (S197, S198).

【０２０３】「飛び」に設定されている場合、追いかけ
モード（Ｍcat）は、実際には「追いかけ」ではなく、
スロースイッチ（ゆっくり押ボタン）を離した途端に現
実に飛ぶように機能する（Ｓ１９９）。具体的には、こ
の部分で強制的にスルーモードに戻す分岐処理を行う。When "jump" is set, the chase mode (Mcat) does not actually "chase"; instead, playback jumps to the present (real time) as soon as the slow switch (slow push button) is released (S199). Specifically, a branch that forcibly returns to the through mode is taken at this point.

【０２０４】追いかけモード（Ｍcat）スイッチが「１
倍」にセットされている場合には、短縮率ｓを１倍に設
定し（Ｓ２００）、ステップＳ２０２に移る。When the chase mode (Mcat) switch is set to "1x", the shortening rate s is set to 1 (S200), and the process proceeds to step S202.

【０２０５】追いかけモード（Ｍcat）スイッチが「１
倍」にセットされていない場合には、追いかけモード時
の図１９の真中のフローを通り短縮率ｓがセットされ
（Ｓ２０１）、短縮率ｓに応じて波形短縮処理の中で使
われるパラメータＮｐｆとＮｐｒの値を設定し（Ｓ２０
２）、さらに、波形短縮処理の中で行われる△窓との積
和演算を始める位置を示すパラメータＰｔｒｉを設定し
て（Ｓ２０３）、このルーチンを終わる。When the chase mode (Mcat) switch is not set to "1x", the flow in the middle of FIG. 19 for the chase mode is followed: the shortening rate s is set (S201), the values of the parameters Npf and Npr used in the waveform shortening processing are set according to the shortening rate s (S202), and the parameter Ptri, which indicates the position at which the product-sum operation with the triangular (△) window performed in the waveform shortening processing starts, is set (S203). The routine then ends.
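
The shortening branch of the parameter setting routine can be sketched as a small lookup. The text gives no numeric values for s, Npf, Npr, or Ptri, so every number in the table below is a hypothetical placeholder, as are the mode strings and the names `ShortenParams` and `set_shortening_params`.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ShortenParams:
    rate: float   # shortening rate s
    n_pre: int    # Npf, pitches transferred before the composite waveform
    n_post: int   # Npr, pitches transferred after the composite waveform
    p_tri: int    # Ptri, start position of the triangular-window product-sum


# Hypothetical values for illustration only: rate -> (Npf, Npr, Ptri).
HYPOTHETICAL_TABLE = {1.0: (1, 1, 0), 0.5: (0, 0, 0)}
HYPOTHETICAL_FAST_RATE = 0.5


def set_shortening_params(m_cat: str) -> Optional[ShortenParams]:
    """Return shortening parameters, or None when the mode jumps back to through mode."""
    if m_cat == "jump":
        return None                        # S199: forced return to through mode
    if m_cat == "1x":
        rate = 1.0                         # S200
    elif m_cat == "fast":
        rate = HYPOTHETICAL_FAST_RATE      # S201
    else:
        raise ValueError(f"unknown chase mode: {m_cat}")
    n_pre, n_post, p_tri = HYPOTHETICAL_TABLE[rate]   # S202, S203
    return ShortenParams(rate, n_pre, n_post, p_tri)
```

Returning `None` for the "jump" setting makes the special case explicit to the caller, which then skips the chase loop entirely.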

【０２０６】（実施例３）図２３及び図２４は、本発明
による本実施例３の連続話速変換手段を付加した話速変
換装置の全体動作の処理手順を示すフローチャートであ
る。(Embodiment 3) FIGS. 23 and 24 are flowcharts showing the processing procedure of the entire operation of the speech speed conversion apparatus to which the continuous speech speed conversion means of the third embodiment of the present invention is added.

【０２０７】本実施例３の連続話速変換手段を付加した
話速変換装置における連続話速変換基本的には、スロー
スイッチ（ゆっくり押ボタン）１０４を押し続けて、ゆ
っくり再生を続けて行う動作のことである。ただし、一
定の波形伸長率で波形伸長を続けると、時間遅れがどん
どん蓄積してしまい、最後には、実時間からの遅れ量が
リングバッファ２４の容量を越えてしまい、それ以上ゆ
っくり聞き続けることが不可能になる。Continuous speech speed conversion in the speech speed conversion apparatus to which the continuous speech speed conversion means of the third embodiment is added is basically the operation of holding down the slow switch (slow push button) 104 so that slow playback continues. However, if waveform expansion continues at a constant expansion rate, the time delay keeps accumulating until the delay from real time finally exceeds the capacity of the ring buffer 24, and it becomes impossible to continue listening slowly.
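
The accumulation of delay can be put into numbers with a short calculation. The sketch assumes that the expansion rate is defined as output duration divided by input duration, which the text does not state, and the function name is hypothetical. Under that assumption, each second of input adds (rate - 1) seconds of delay.

```python
def seconds_until_overflow(buffer_seconds: float, expansion_rate: float) -> float:
    """Input time after which the delay reaches the ring buffer capacity."""
    if expansion_rate <= 1.0:
        return float("inf")   # no expansion, so the delay never grows
    return buffer_seconds / (expansion_rate - 1.0)
```

For example, a buffer holding 30 seconds of speech and a rate of 1.5 would be exhausted after 60 seconds of continuous slow playback.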

【０２０８】そこで、ゆっくり再生を行う際に、波形を
伸長する期間と、逆に波形を短縮する期間を、取り混ぜ
て、実時間から遅れが、どんどん増えないようにするの
が、連続話速変換手段である。The continuous speech speed conversion means therefore mixes periods in which the waveform is expanded with periods in which it is shortened during slow playback, so that the delay from real time does not keep growing.

【０２０９】連続話速変換モードへの切換えは、いくつ
かの方法が考えられるが、単に、スロースイッチを長め
に押しつづけている場合と、連続話速変換モードに入る
場合とで明確な区別をした方がわかりやすいので、例え
ば、スロースイッチをダブルクリック（短い時間間隔で
の２度押し）したり、スロースイッチを押しながら横に
ずらすと、ロックするスイッチ部品を用いたりしてこの
切換えを実現する。Several methods of switching to the continuous speech speed conversion mode are conceivable. Because it is easier to understand if simply holding the slow switch down for a long time is clearly distinguished from entering the continuous speech speed conversion mode, the switching is realized, for example, by double-clicking the slow switch (pressing it twice at a short interval) or by using a switch component that locks when the slow switch is pressed and slid sideways.

【０２１０】本実施例３の図２３及び図２４に示すフロ
ーチャート中の各処理は、前記の図１３及び図１４で説
明したメイン処理での処理手順と全く同じである。Each processing in the flowcharts shown in FIGS. 23 and 24 of the third embodiment is exactly the same as the processing procedure in the main processing described in FIGS. 13 and 14 above.

【０２１１】本実施例３の連続話速変換手段は、図２３
及び図２４におけるステップＳ２３１において、連続話
速変換処理か否かをチェックする（Ｓ２３１）。連続話
速変換処理であれば（Ｓ２３１のＹesの場合)、１フレ
ーム分の波形伸長・短縮処理する（Ｓ２３２)。次に、
リセットスイッチ１０６が押されている（ＯＮしてい
る）か否かを判定し（Ｓ２３３）、リセットスイッチ１
０６が押されていない（ＯＦＦしている）場合、１フレ
ーム数カウントＵＰし（Ｓ２３４）、伸長期間か否かを
判定し（Ｓ２３５）、伸長期間であれば（Ｓ２３５のＹ
esの場合）、ステップＳ２３２に戻る。伸長期間でなれ
ば（Ｓ２３５のＮoの場合）、短縮処理にパラメータを
設定する（Ｓ２３６）。次に、遅れ量が０であるをチェ
ックし（Ｓ２３７）、遅れ量が０である場合（Ｓ２３７
のＹesの場合）、ステップＳ２３２に戻る。遅れ量が０
でない場合（Ｓ２３７のＮoの場合）、伸長処理にパラ
メータを設定し（Ｓ２３８）、スレーム数カウンタをリ
セットし（Ｓ２３９）、ステップＳ２３２に戻り、連続
話速変換処理動作が繰り返される。前記ステップＳ２３
１において、連続話速変換処理でない場合（Ｓ２３１の
Ｎoの場合)、前述したメイン処理ルーチン（スルーモー
ド）に移る。The continuous speech speed conversion means of the third embodiment first checks, in step S231 of FIGS. 23 and 24, whether continuous speech speed conversion processing is selected (S231). If it is (Yes in S231), the waveform expansion/shortening processing for one frame is performed (S232). Next, it is determined whether the reset switch 106 is pressed (ON) (S233). If the reset switch 106 is not pressed (OFF), the frame counter is incremented (S234) and it is determined whether the expansion period is in progress (S235). If it is (Yes in S235), the process returns to step S232. If it is not (No in S235), parameters are set for the shortening processing (S236). Next, it is checked whether the delay amount is 0 (S237). If the delay amount is 0 (Yes in S237), the process returns to step S232. If the delay amount is not 0 (No in S237), parameters are set for the expansion processing (S238), the frame counter is reset (S239), and the process returns to step S232, so that the continuous speech speed conversion operation is repeated. If continuous speech speed conversion processing is not selected in step S231 (No in S231), the process moves to the main processing routine (through mode) described above.

【０２１２】すなわち、本実施例３の連続話速変換手段
は、あらかじめ決められた時間間隔で、ゆっくり再生と
追い付き再生を交互に繰り返す方法である。この方法に
よれば、必ず一定時間毎に実時間に追い付くことが可能
となる。そして、波形伸長と波形短縮を切り替えの管理
は、フレーム数のカウントで行う。例えば、約５秒分の
フレーム数を伸長処理したら、その後短縮処理を繰り返
し行い、遅れ量が０になったら、フレームカウントを０
に戻して、再び伸長処理を繰り返す。That is, the continuous speech speed conversion means of the third embodiment alternates slow playback and catch-up playback at predetermined time intervals. With this method, playback always catches up with real time at regular intervals. Switching between waveform expansion and waveform shortening is managed by counting frames. For example, after the frames for about 5 seconds have been expanded, shortening is performed repeatedly; when the delay amount reaches 0, the frame count is reset to 0 and expansion is repeated again.
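
The alternation summarized in this paragraph can be sketched as a small scheduler: expand for a fixed number of frames, then shorten until the delay is back to 0, then start over. The class name, the method name, and the way the delay is passed in are assumptions for illustration.

```python
class AlternatingScheduler:
    """Chooses expansion or shortening for the next frame by counting frames."""

    def __init__(self, expand_frames: int) -> None:
        self.expand_frames = expand_frames   # length of the expansion period
        self.count = 0                       # frame counter

    def next_mode(self, delay: float) -> str:
        """Call once after each processed frame with the current delay."""
        self.count += 1
        if self.count < self.expand_frames:
            return "expand"                  # still inside the expansion period
        if delay > 0:
            return "shorten"                 # keep catching up with real time
        self.count = 0                       # caught up: restart the cycle
        return "expand"
```

The frame counter only decides when expansion ends; the return to expansion is driven by the delay reaching 0, which is what lets playback catch up with real time in every cycle.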

【０２１３】また、連続話速変換モードから抜け出すに
は、リセットスイッチ１０６を押し下げ、スルーモード
に戻すことで達成する。To get out of the continuous speech speed conversion mode, the reset switch 106 is pushed down to return to the through mode.

【０２１４】（実施例４）図２５及び図２６は、本発明
による前記実施例３と異なる本実施例４の連続話速変換
手段を付加した話速変換装置の全体動作の処理手順を示
すフローチャートである。(Embodiment 4) FIGS. 25 and 26 are flowcharts showing the processing procedure of the overall operation of a speech speed conversion apparatus to which the continuous speech speed conversion means of the fourth embodiment of the present invention, which differs from that of the third embodiment, is added.

【０２１５】本実施例４の連続話速変換手段を付加した
話速変換装置における連続話速変換は、パワーの大きい
フレームが波形伸長を行い、パワーの小さいフレームは
波形短縮が行う動作である。In the continuous speech speed conversion of the speech speed conversion apparatus to which the continuous speech speed conversion means of the fourth embodiment is added, frames with high power are subjected to waveform expansion and frames with low power are subjected to waveform shortening.

【０２１６】本実施例４の連続話速変換手段は、図２５
及び図２６における、ステップＳ２５１において、連続
話速変換処理か否かをチェックする。連続話速変換処理
であれば（Ｓ２５１のＹesの場合)、リセットスイッチ
１０６が押されている（ＯＮしている）か否かを判定し
（Ｓ２５２）、リセットスイッチ１０６が押されていな
い（ＯＦＦしている）場合、１フレーム分のパワーを算
出する（Ｓ２５３）。次に、算出された１フレーム分の
パワーがしきい値Ｔｈよりも大きいか否かをチェックし
（Ｓ２５４）、算出された１フレーム分のパワーがしき
い値Ｔｈよりも小さい場合（Ｓ２５１のＮoの場合）に
は、短縮処理にパラメータを設定し（Ｓ２５６）、ステ
ップＳ２５７に移る。算出された１フレーム分のパワー
がしきい値Ｔｈよりも大さい場合（Ｓ２５４のＹesの場
合)には、伸長処理にパラメータを設定し（Ｓ２５
５）、１フレーム分の波形伸長・短縮処理を行い（Ｓ２
５７）、ステップＳ２５２に戻り、連続話速変換処理動
作が繰り返される。前記ステップＳ２５１において、連
続話速変換処理でない場合（Ｓ２５１のＮoの場合)、前
述したメイン処理ルーチン（スルーモード）に移る。The continuous speech speed conversion means of the fourth embodiment first checks, in step S251 of FIGS. 25 and 26, whether continuous speech speed conversion processing is selected. If it is (Yes in S251), it is determined whether the reset switch 106 is pressed (ON) (S252). If the reset switch 106 is not pressed (OFF), the power of one frame is calculated (S253). Next, it is checked whether the calculated power of the frame is larger than the threshold value Th (S254). If it is smaller than the threshold value Th (No in S254), parameters are set for the shortening processing (S256), and the process proceeds to step S257. If it is larger than the threshold value Th (Yes in S254), parameters are set for the expansion processing (S255). The waveform expansion/shortening processing for one frame is then performed (S257), and the process returns to step S252, so that the continuous speech speed conversion operation is repeated. If continuous speech speed conversion processing is not selected in step S251 (No in S251), the process moves to the main processing routine (through mode) described above.

【０２１７】すなわち、連続話速変換モードに入ると、
１フレーム分のパワーが算出され、しきい値Ｔｈとの比
較により、伸長あるいは短縮がフレーム毎に施される。
連続話速変換モードから抜けだすには、リセットスイッ
チ１０６を押し下げる。That is, when the continuous speech speed conversion mode is entered, the power of each frame is calculated, and expansion or shortening is applied frame by frame according to the comparison with the threshold value Th. To leave the continuous speech speed conversion mode, the reset switch 106 is pressed.
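
The per-frame choice made in this mode can be sketched in a few lines. Mean squared amplitude is again assumed as the power measure, and the function name `choose_modes` is hypothetical.

```python
from typing import List, Sequence


def choose_modes(frames: Sequence[Sequence[float]], threshold: float) -> List[str]:
    """Pick expansion for loud frames and shortening for quiet ones."""
    modes = []
    for frame in frames:
        power = sum(x * x for x in frame) / len(frame)               # S253
        modes.append("expand" if power > threshold else "shorten")   # S254 to S256
    return modes
```

Because the choice depends only on the signal, the numbers of expanded and shortened frames are not balanced by construction.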

【０２１８】本実施例４によれば、音声のパワーに応じ
てゆっくりになったり早口になったりする。According to the fourth embodiment, the voice becomes slow or fast depending on the power of the voice.

【０２１９】一般に、通常の会話では、相手に聞いても
らいたい重要な部分は、声が大きくなり、あまり重要で
ない部分は声が小さくなる傾向があることから、本実施
例４による話速制御の方が、より自然に近い出力音声が
得られるという特徴がある。In ordinary conversation, speakers generally tend to raise their voice for the important parts that they want the listener to hear and to lower it for less important parts. The speech speed control according to the fourth embodiment therefore has the characteristic of producing output speech that sounds more natural.

【０２２０】ただし、パワーの大きい部分と小さい部分
の出現確率は必ずしも等しくないので、実施例３の場合
のように、必ず一定時間間隔で実時間に追い付く保証は
ない。However, since the appearance probabilities of the high power portion and the low power portion are not necessarily equal, there is no guarantee that they will always catch up with the real time at a constant time interval as in the case of the third embodiment.

【０２２１】また、連続話速変換モードに入るための、
ユーザーからの指示方法としては、スロースイッチ（ゆ
っくり押ボタン）１０４を押した後、横にスライドして
ロックする方法や、スロースイッチ（ゆっくり押ボタ
ン）１０４をダブルクリック（短い時間間隔で２度続け
て押し下げる）などが、考えられる。これらの方法を用
いれば、スロースイッチ（ゆっくり押ボタン）１０４を
押して、「ゆっくり再生をさせよう」とする意図と、そ
の動作を「連続させよう」という意図と、同じ押ボタン
の押し方の違いで表現できるため、別に連続話速変換用
の押ボタンを設けるのに比較して、より直感的でわかり
やすい操作方式が提供できるようになる。Conceivable ways for the user to enter the continuous speech speed conversion mode include pressing the slow switch (slow push button) 104 and then sliding it sideways to lock it, and double-clicking the slow switch (slow push button) 104 (pressing it twice in quick succession). With these methods, the intention to "play back slowly" by pressing the slow switch (slow push button) 104 and the intention to "keep doing so" can both be expressed by different ways of pressing the same push button, which gives a more intuitive and easier-to-understand operation than providing a separate push button for continuous speech speed conversion.

【０２２２】これまでの実施例２，３，４では、波形伸
長処理により「ゆっくり」再生する場合の波形の「伸長
率」は、装置上に設けた「話速設定スイッチ」の設定に
よって決められ、波形短縮処理により「早聞き」再生す
る際の波形の「短縮率」は、あらかじめ（プログラム中
で）決められた「デフィルト値」を用いるものとして、
説明してきた。In the second, third, and fourth embodiments described so far, the "expansion rate" of the waveform for "slow" playback by the waveform expansion processing is determined by the setting of the "speech speed setting switch" provided on the apparatus, and the "shortening rate" of the waveform for "fast-listening" playback by the waveform shortening processing is a "default value" determined in advance (in the program).

【０２２３】しかし、図２７に示した「アクセル型スイ
ッチ」による波形伸長／短縮率の変更が行えるようにす
ると、本装置が提供する「音声の時間軸上を自由に行き
来できる」という機能を、使用者がより直感的に使うこ
とが可能となる。However, if the waveform expansion/shortening rate can be changed with the "accelerator type switch" shown in FIG. 27, the user can make more intuitive use of the function this apparatus provides of "moving freely back and forth along the time axis of the speech".

【０２２４】アクセル型スイッチを中心に設定している
時は、前述の実施例２，３，４のスルーモードが実行さ
れる。スライドスイッチを手前側に引くと、波形伸長処
理となり実時間から遅れながら「ゆっくり再生」が実行
される。そして、スライドスイッチを向う側に押すと、
逆に波形短縮処理となり（実時間からの遅れが０になる
まで）早聞き再生が実行される。When the accelerator type switch is set at the center, the through mode of the second, third, and fourth embodiments is executed. When the slide switch is pulled toward the user, waveform expansion processing is performed and "slow playback" is executed while falling behind real time. When the slide switch is pushed away from the user, waveform shortening processing is performed instead, and fast-listening playback is executed (until the delay from real time becomes 0).

【０２２５】この際、制御部は、波形伸長及び短縮率
を、スライドスイッチの中心からの距離に応じて変化さ
せる。しかし、前述の実施例の図２０から図２２の説明
でもわかるように、伸長率／短縮率は、整数の比で表す
ことのできる、いくつかの値しか設定できないので、実
際には、スライドスイッチの中心からの距離に応じて、
数段階の値が選択できるように設定すればよい。At this time, the control unit changes the waveform expansion and shortening rates according to the distance of the slide switch from the center. However, as can be seen from the description of FIGS. 20 to 22 in the embodiment above, the expansion/shortening rate can take only a few values that can be expressed as ratios of integers. In practice, therefore, it is sufficient to make a few levels selectable according to the distance of the slide switch from the center.
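
Mapping the slider position onto a few discrete rates could look like the sketch below. The normalized position range, the dead zone around the center, and the specific integer ratios are all hypothetical; the text only says that a few levels expressed as ratios of integers are selectable.

```python
from fractions import Fraction
from typing import Tuple

# Hypothetical levels, ordered from nearest the center to the end of travel.
EXPAND_LEVELS = (Fraction(3, 2), Fraction(2, 1))
SHORTEN_LEVELS = (Fraction(2, 3), Fraction(1, 2))


def rate_for_position(pos: float, dead_zone: float = 0.1) -> Tuple[str, Fraction]:
    """Map a slider position in [-1, 1] to a mode and a rate.

    Negative positions mean pulled toward the user (expansion),
    positive positions mean pushed away (shortening).
    """
    if not -1.0 <= pos <= 1.0:
        raise ValueError("position must be within [-1, 1]")
    distance = abs(pos)
    if distance <= dead_zone:
        return ("through", Fraction(1))
    levels = EXPAND_LEVELS if pos < 0 else SHORTEN_LEVELS
    # Spread the travel outside the dead zone evenly over the available levels.
    index = min(int((distance - dead_zone) / (1.0 - dead_zone) * len(levels)),
                len(levels) - 1)
    return ("expand" if pos < 0 else "shorten", levels[index])
```

Using `Fraction` keeps each rate as an exact ratio of integers, which matches how the rates are described.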

【０２２６】また、このアクセル型スイッチから使用者
が指を離すと、自動的に中心にレバーが戻るように力が
かかるようにすると、使用者はスライドスイッチを中心
以外の途中の位置に維持することが、しやすくなるの
で、より使い勝手のよい操作方法を実現できる。なお、
このレバーが中心に戻るように力を生じさせるために
は、スイッチデバイスの内部に、２つのスプリングを設
け、両側から均等の力でレバーを引っ張る等の機械的な
手段を持たせることにより、実現できる。If a force is applied so that the lever automatically returns to the center when the user takes the finger off the accelerator type switch, it becomes easier for the user to hold the slide switch at an intermediate position other than the center, so a more convenient operating method can be realized. The force that returns the lever to the center can be produced by mechanical means, for example by providing two springs inside the switch device that pull the lever from both sides with equal force.

【０２２７】（実施例５）図２８は、本発明による実施
例５のＡＶコントロール手段を付加した話速変換装置の
機能構成を示すブロック図、図２９は、本実施例５のＡ
Ｖコントロール手段の動作を説明するための図、図３０
及び図３１は、本実施例３におけるＡＶコントロール手
段を付加した話速変換装置のメイン処理の動作手順を示
すフローチャートである。(Embodiment 5) FIG. 28 is a block diagram showing the functional configuration of a speech speed conversion apparatus to which the AV control means of the fifth embodiment of the present invention is added, FIG. 29 is a diagram for explaining the operation of the AV control means of the fifth embodiment, and FIGS. 30 and 31 are flowcharts showing the operating procedure of the main processing of the speech speed conversion apparatus to which the AV control means of the fifth embodiment is added.

【０２２８】本実施例５のＡＶコントロール手段を付加
した話速変換装置は、図２８に示すように、前記図１１
に示す実施例１の話速変換装置の機能構成に、ＡＶコン
トロール部２８を追加し、制御部２３Ｅに接続した機能
構成にしたものである。As shown in FIG. 28, the speech speed conversion apparatus to which the AV control means of the fifth embodiment is added has a functional configuration in which an AV control unit 28 is added to the functional configuration of the speech speed conversion apparatus of the first embodiment shown in FIG. 11 and is connected to the control unit 23E.

【０２２９】前記制御部２８は、ＡＶコントロール信号
出力を行う条件になったか否かを判定し、ＡＶコントロ
ール部２８を動作させ、ＡＶコントロール部２８はＡＶ
コントロール信号の出力開始／停止を行う。The control unit 23E determines whether the condition for outputting the AV control signal has been met and operates the AV control unit 28, and the AV control unit 28 starts or stops output of the AV control signal.

【０２３０】ＡＶコントロール手段とは、図２９に示す
ように、スロー（ゆっくり）再生またはリピート再生に
より、実時間からの遅れ量が、一定の量（図２９では３
０秒となっている）を越えた場合に、ＡＶコントロール
信号を出力し、さらに、追いかけ再生を経て遅れ量が０
になったときに、同信号の出力を中止するソフトウエア
である。As shown in FIG. 29, the AV control means is software that outputs an AV control signal when the delay from real time caused by slow playback or repeat playback exceeds a fixed amount (30 seconds in FIG. 29), and stops outputting the signal when the delay returns to 0 after chase playback.

【０２３１】ＡＶコントロール信号は、本装置の外に取
り出され、テープレコーダやビデオ等の録音再生装置の
再生動作を一時停止させるために用いられる。この手段
により、本装置のリングバッファ２４の容量を越えるよ
うな長い時間の連続する入力声を、連続してゆっくり聞
き続けることが可能となる。The AV control signal is taken out of the apparatus and used for temporarily stopping the reproducing operation of the recording / reproducing apparatus such as a tape recorder or a video recorder. By this means, it becomes possible to continuously and slowly listen to a continuous input voice for a long time that exceeds the capacity of the ring buffer 24 of the present apparatus.

【０２３２】図３０及び図３１において、点線で囲んだ
部分が、前記図１２及び図１３のフローチャートに付加
したＡＶコントロール手段の動作手順を示すステップで
あり、ＡＶコントロール信号出力を行う条件になったか
否かの判定する（Ｓ３０１）。ＡＶコントロール信号出
力の判定は、スロー（ゆっくり）再生またはリピート再
生１フレーム分の波長伸長／短縮処理を繰り返し行うル
ープの中で、実時間からの遅れ量が３０秒以上あるか否
かを判定し（Ｓ３０１）、３０以上実時間から遅れがあ
る場合には、ＡＶコントロール信号出力を開始する（Ｓ
３０２）ことで実現される。In FIGS. 30 and 31, the portion enclosed by the dotted line shows the steps of the operating procedure of the AV control means added to the flowcharts of FIGS. 12 and 13, in which it is determined whether the condition for outputting the AV control signal has been met (S301). Within the loop that repeatedly performs the waveform expansion/shortening processing for one frame of slow playback or repeat playback, it is determined whether the delay from real time is 30 seconds or more (S301); if the delay from real time is 30 seconds or more, output of the AV control signal is started (S302).

【０２３３】一方、ＡＶコントロール信号を停止させる
処理は、追いかけ再生処理のループを抜け出す判定であ
る。「遅れ量＝０」判定の直後に行われる（Ｓ３０
３）。The processing that stops the AV control signal, on the other hand, is performed immediately after the "delay amount = 0" determination, which is the determination for leaving the chase playback loop (S303).
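
The start and stop conditions of the AV control signal form a simple hysteresis, sketched below. The class and method names are hypothetical, and the sketch only tracks whether the signal is being output; how the signal pauses the external recorder is outside its scope.

```python
class AvControl:
    """Tracks whether the AV control (pause) signal is currently output."""

    def __init__(self, start_threshold_s: float = 30.0) -> None:
        self.start_threshold_s = start_threshold_s
        self.active = False

    def update(self, delay_s: float) -> bool:
        """Update the signal state from the current delay and return it."""
        if not self.active and delay_s >= self.start_threshold_s:
            self.active = True     # S301, S302: delay reached 30 seconds
        elif self.active and delay_s <= 0:
            self.active = False    # S303: chase playback has caught up
        return self.active
```

Once started, the signal stays on while the delay shrinks during chase playback and is released only when the delay is back to 0.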

【０２３４】（実施例６）図３２は本発明による実施例
６の話速変換装置のマイクロホンの配置を説明するため
の図であり、１０１は話速変換装置本体、３２１はマイ
クロホン、３２２はマイクロホン３２１を支持するため
の伸縮自在な支柱、３２３はマイクロホン３２１を支持
するためのフレキシブルな支柱、３２４はマイクロホン
３２１と話速変換装置本体１０１とを有線で電気的に接
続するための電気コードである。(Embodiment 6) FIG. 32 is a diagram for explaining the arrangement of the microphone of a speech speed conversion apparatus according to a sixth embodiment of the present invention, in which 101 is the speech speed conversion apparatus main body, 321 is a microphone, 322 is a telescopic support for supporting the microphone 321, 323 is a flexible support for supporting the microphone 321, and 324 is an electric cord for electrically connecting the microphone 321 and the speech speed conversion apparatus main body 101 by wire.

【０２３５】図３３は本実施例６の変形例を示す図であ
り、１０１は話速変換装置本体、１０４はスロースイ
チ、１０５はリピートスイッチ、１０６はリセットスイ
ッチ、３２１はマイクロホン、３２４はマイクロホン３
２１と話速変換装置本体１０１とを電気的に接続するた
めの電気コード、３２５はイヤホン、３００は接続部材
である。FIG. 33 is a diagram showing a modification of the sixth embodiment, in which 101 is the speech speed conversion apparatus main body, 104 is a slow switch, 105 is a repeat switch, 106 is a reset switch, 321 is a microphone, 324 is an electric cord for electrically connecting the microphone 321 and the speech speed conversion apparatus main body 101, 325 is an earphone, and 300 is a connecting member.

【０２３６】本実施例６の話速変換装置のマイクロホン
の配置は、図３２（ａ）に示すように、伸縮自在な支柱
３２２によってマイクロホン３２１を支持している。こ
のようにマイクロホン３２１支持することにより、話速
変換装置本体１０１からマイクロホン３２１が離される
ので、本体を胸ポケットに入れて使用する際の布擦れ音
を防止することができる。In the microphone arrangement of the speech speed conversion apparatus of the sixth embodiment, the microphone 321 is supported by the telescopic support 322, as shown in FIG. 32(a). Supporting the microphone 321 in this way keeps it away from the speech speed conversion apparatus main body 101, which prevents the sound of cloth rubbing when the main body is used in a chest pocket.

【０２３７】また、図３２（ｂ）に示すように、フレキ
シブルな支柱３２３によってマイクロホン３２１を支持
している。このようにマイクロホン３２１支持すること
により、話速変換装置本体１０１からマイクロホン３２
１が離され、かつ、すきな方向にねじ曲げられるので、
本体を胸ポケットに入れて使用する際の布擦れ音を防止
することができる。Alternatively, as shown in FIG. 32(b), the microphone 321 is supported by the flexible support 323. Supporting the microphone 321 in this way keeps it away from the speech speed conversion apparatus main body 101 and allows it to be bent in any desired direction, which prevents the sound of cloth rubbing when the main body is used in a chest pocket.

【０２３８】また、図３２（ｂ）に示すように、マイク
ロホン３２１と話速変換装置本体１０１とを有線（もし
くは無線）で電気的に接続している。このようにマイク
ロホン３２１と話速変換装置本体１０１とを有線（もし
くは無線）で電気的に接続し、マイクロホン３２１を話
速変換装置本体１０１から独立させ、話し手の近くにマ
イクロホン３２１を配置するので、Ｓ／Ｎ比を向上させ
ることができる。Further, as shown in FIG. 32 (b), the microphone 321 and the speech speed conversion device main body 101 are electrically connected by wire (or wirelessly). In this way, the microphone 321 and the speech speed conversion device main body 101 are electrically connected by wire (or wirelessly), the microphone 321 is separated from the speech speed conversion device main body 101, and the microphone 321 is arranged near the speaker. The S / N ratio can be improved.

【０２３９】また、図３３に示すように、話速変換装置
本体１０１とマイクロホン３２１，イヤホン３２５とを
接続部材３００を介在させて電気コードで電気的に接続
する。そして、前記接続部材３００の上にスロースイッ
チ１０４、リピートスイッチ１０５、リセットスイッチ
１０６等の操作スイッチを設ける。このようにすること
により、本体を胸ポケットに入れて使用する際の布擦れ
音を防止することができ、かつ、Ｓ／Ｎ比を向上させる
ことができ、さらに、使い勝手をよくすることができ
る。As shown in FIG. 33, the speech speed conversion apparatus main body 101, the microphone 321, and the earphone 325 are electrically connected by an electric cord through the connecting member 300. Operation switches such as the slow switch 104, the repeat switch 105, and the reset switch 106 are provided on the connecting member 300. This prevents the sound of cloth rubbing when the main body is used in a chest pocket, improves the S/N ratio, and also improves usability.

【０２４０】（実施例７）図３４は、本発明による実施
例７の話速変換装置の遅れ時間表示手段を説明するため
の図であり、３４１は表示部、３４２は表示画面であ
る。(Embodiment 7) FIG. 34 is a diagram for explaining a delay time display means of a speech speed conversion apparatus according to Embodiment 7 of the present invention, in which 341 is a display unit and 342 is a display screen.

【０２４１】本実施例７における遅れ時間表示手段は、
図３４に示すように、前記スロー再生及びリピート再生
時に話し手の音声が実際の話速よりどの位遅れているか
を表示するものである。例えば、図３４において、人の
画像１個で１０秒遅れていることにして、現在から遅れ
ている時間を人の表示画像の数で表示する。このように
することにより、現在からの時間遅れ量を目で見てわか
るので、話し手、聞く手ともに話速変換の調節を容易に
することができるので、使い勝手良く使用することがで
きる。As shown in FIG. 34, the delay time display means of the seventh embodiment displays how far the speaker's voice lags behind the actual speech during slow playback and repeat playback. For example, in FIG. 34, one person icon represents a delay of 10 seconds, and the delay from the present is displayed as a number of person icons. Because the time delay from the present can thus be seen at a glance, both the speaker and the listener can easily adjust the speech speed conversion, which makes the apparatus convenient to use.

[0242] The visual display of the time delay is realized, for example, by installing a liquid crystal display at the center of the front of the speech speed conversion device main body shown in FIG. 6 and displaying on it a screen such as that shown in FIG. 34. This display unit is controlled by a "liquid crystal display driver" (not shown) connected to the control unit 23E of FIG. 11.

[0243] In the main processing shown in FIGS. 13 and 14, the time delay to be displayed is always managed by a delay amount counter, so it suffices to convert the value of this counter into units of 10 seconds and display the corresponding number of person icons on the display. This display operation is performed by the control unit 23E of FIG. 11 through the display driver, and it is sufficient to rewrite the display once each time the processing of one frame ends. For example, the display processing is performed between S137 and S138 in FIG. 14.
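The conversion from the delay amount counter to a number of person icons can be sketched as follows. This is only an illustration: the text does not state the unit of the counter, so the sketch assumes it counts frames, and the frame duration (`frame_ms`), the rounding direction, and the icon limit (`max_icons`) are hypothetical parameters, not values from the patent.

```python
def delay_icons(delay_frames, frame_ms=20, seconds_per_icon=10, max_icons=6):
    """Return how many person icons to draw for the current delay.

    delay_frames     -- value of the delay amount counter (assumed to count frames)
    frame_ms         -- assumed duration of one frame in milliseconds
    seconds_per_icon -- one icon stands for this many seconds (10 s in the text)
    max_icons        -- assumed number of icons that fit on the display
    """
    delay_ms = delay_frames * frame_ms
    # Round down to whole 10-second units (an assumption; the text only says
    # the counter is converted into 10-second units).
    icons = delay_ms // (seconds_per_icon * 1000)
    return min(icons, max_icons)


if __name__ == "__main__":
    # Redraw once per processed frame, as the text suggests.
    for frames in (0, 499, 500, 1250):
        print(frames, "frames ->", delay_icons(frames), "icon(s)")
```

Integer arithmetic in milliseconds is used so that a delay of exactly 10 seconds maps to exactly one icon.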

[0244] (Embodiment 8) FIG. 35 is a diagram for explaining the power supply arrangement of the speech speed conversion device according to Embodiment 8 of the present invention, in which 1000 is the portion of the apparatus relating to the speech speed conversion device, 1 is a DSP, 5 is an A/D converter, 6 is a D/A converter, 9 is an analog amplifier, 10 is an analog amplifier, 1001 is a power supply, 1002 is a power supply line, and 1003 is a changeover switch.

[0245] In the speech speed conversion device of Embodiment 8, as shown in the state transition diagram of FIG. 15, a standby mode is provided in addition to the through mode, and the device automatically enters the standby mode when the through mode has continued for a fixed time. That is, when either the slow switch or the repeat switch is pressed (turned ON), the clock frequency is raised and the respective processing is performed.

[0246] In the through mode, the DSP 1 runs on a fast clock even though no processing such as speech speed conversion is being performed, so power is wasted. In the standby mode, therefore, the operating clock of the DSP 1 is lowered and only data input and output are performed, which reduces power consumption. Storage to memory alone continues to be carried out. The voice memory function is thereby realized.

[0247] Further, as shown in FIG. 35, in the analog through mode the changeover switch 1003 is set to the contact that disconnects the power supply line 1002 and, at the same time, to the contact that directly connects the analog amplifier 10 to the analog amplifier 9, so that no power is supplied to the DSP 1, the A/D converter 5, the D/A converter 6, or the peripheral digital circuits. Nothing is stored to memory in this mode. In other words, the input and output analog stages are connected directly and the device operates as a simple analog amplifier. As shown in FIG. 35, the changeover switch is a three-position switch with ON, OFF, and an intermediate position between ON and OFF, and an analog through mode is provided.

[0248] As is clear from the above description, according to Embodiment 8 the switch is a three-position switch with ON, OFF, and an intermediate position between ON and OFF, and an analog through mode is provided, so power consumption can be reduced and the range over which the power supply can be used can be extended.
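The power modes of Embodiment 8 can be pictured as a small state machine. The sketch below is a simplified illustration under stated assumptions, not the transition logic of FIG. 15: the names `Mode`, `step`, and `STANDBY_TIMEOUT` are hypothetical, the timeout is counted in arbitrary control steps, and the return from slow/repeat processing to the through mode is reduced to a single step (chasing is ignored).

```python
from enum import Enum


class Mode(Enum):
    THROUGH = "through"                # DSP on fast clock, speech passed unchanged
    ACTIVE = "active"                  # slow or repeat processing running
    STANDBY = "standby"                # DSP clock lowered; I/O and memory storage only
    ANALOG_THROUGH = "analog_through"  # digital section unpowered, amplifiers linked


# DSP clock in each mode, following the description in the text.
DSP_CLOCK = {
    Mode.THROUGH: "high",
    Mode.ACTIVE: "high",
    Mode.STANDBY: "low",
    Mode.ANALOG_THROUGH: "off",
}

STANDBY_TIMEOUT = 3  # hypothetical: idle steps in through mode before standby


def step(mode, idle, switch_pressed, analog_selected):
    """Return (next_mode, next_idle) for one control step."""
    if analog_selected:            # changeover switch in the analog-through position
        return Mode.ANALOG_THROUGH, 0
    if switch_pressed:             # slow or repeat switch is ON: raise the clock
        return Mode.ACTIVE, 0
    if mode in (Mode.ACTIVE, Mode.ANALOG_THROUGH):
        return Mode.THROUGH, 0     # simplification: fall back to through mode
    if mode is Mode.THROUGH:
        idle += 1
        if idle >= STANDBY_TIMEOUT:
            return Mode.STANDBY, idle
        return Mode.THROUGH, idle
    return Mode.STANDBY, idle      # stay in standby until a switch is pressed


if __name__ == "__main__":
    mode, idle = Mode.THROUGH, 0
    for pressed in (False, False, False, True, False):
        mode, idle = step(mode, idle, pressed, analog_selected=False)
        print(mode.value, DSP_CLOCK[mode])
```

Keeping the clock setting in a lookup table separate from the transition function makes it easy to see which modes save power: only the standby and analog through entries differ from "high".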

[0249] (Embodiment 9) FIG. 36 is a diagram for explaining Embodiment 9, in which the speech speed conversion means according to the present invention is applied to a telephone; 2000 is the speech speed conversion means according to the present invention, 3000 is the telephone main body, 3001 is the handset, and 3002 is the telephone line.

[0250] As shown in FIG. 36, the telephone of Embodiment 9 has the speech speed conversion means 2000 according to the present invention inserted between the handset 3001 and the telephone main body 3000. The speech speed conversion means 2000 is shaped, for example, like a stand on which the telephone main body 3000 is placed.

[0251] When the handset 3001 is a cordless handset or a cordless extension unit, the speech speed conversion means 2000 is inserted between the handset 3001 and the telephone main body 3000 over the wireless link.

[0252] The speech speed conversion means according to the present invention can also be installed in a telephone exchange and operated in response to a request from the user.

[0253] With this configuration, the voice on the telephone can be heard slowly. In addition, when telephoning an elderly person, for example, the through (unconverted) voice is fed back to the speaker while the listener hears the speech slowly, so the speaker can talk normally and does not find it awkward to speak.

[0254] If the circuit is digital, no A/D means is needed inside the speech speed conversion means.

[0255] (Embodiment 10) FIG. 37 is a diagram for explaining Embodiment 10, in which the speech speed conversion means according to the present invention is applied to a public address (in-house broadcast) system; 2000 is the speech speed conversion means, 321 is a microphone, 325 is an earphone, 4003 is an amplifier, and 4004 is a speaker.

[0256] As shown in FIG. 37, the apparatus of Embodiment 10 has the speech speed conversion means 2000 according to the present invention inserted between the microphone 321 and earphone 325 on one side and the amplifier 4003 of the speaker 4004 on the other.

[0257] With this configuration, the listener can listen at a suitable speech speed even if the speaker does not control the speech speed conversion operation. For example, even if the speaker talks on at a very high speed, the listener can still listen at a suitable speech speed.

[0258] It is also possible to arrange things so that, even when the speaker talks slowly, the listener hears the output of the speaker 4004 at a suitable speech speed.

[0259] As can be seen from the above description, the present invention can be applied not only to telephones, telephone exchanges, and public address systems but also to other technical fields that require speech speed conversion, such as hearing aids, language learning, overseas travel, and music.

[0260] For example, in language learning and overseas travel goods, it can be applied in the following ways.

[0261] (1) Listening slowly and continuously to recorded speech.

[0262] (2) Changing the expansion rate as the learner's level improves.

[0263] (3) Listening at normal speed, then listening again slowly to the parts that were not understood.

[0264] (4) After listening slowly, listening once more at the original speed.

[0265] (5) Imitating the pronunciation after a slow repeat.

[0266] (6) Comparing the imitated pronunciation with the original speech.

[0267] (7) Several people listening to one source at the same time, each at a preferred speech speed.

[0268] In combination with digital audio equipment such as tape recorders, CD players, and MD players, the speech speed conversion device needs no A/D converter if the equipment has a digital output.

[0269] For music, the invention can be applied with the following changes.
・The decision based on the power of the frame to be expanded is not made (because it would disturb the tempo).
・The pitch extraction range is made wider than for speech.
・Waveform expansion is performed with a fixed-length pitch. For speech, by contrast, the pitch is detected and processing uses the detected pitch.
・The conversion operation can be triggered by a foot switch, so that it can be controlled while playing an instrument.

Although the present invention has been described specifically on the basis of the embodiments, it is of course not limited to those embodiments and can be modified in various ways without departing from its gist.

【０２７０】[0270]

[Effects of the Invention] The effects obtained by representative ones of the inventions disclosed in this application are briefly described below.

[0271] (1) The speech speed conversion device can be used not only for speech delivered one-way to the listener, such as radio speech, but also in situations such as conversation, so the listener can select the speech to which speech speed conversion is applied without the listener's own utterances being disturbed.

[0272] In hearing aids, foreign language learning devices, telephones, and the like, speech can be heard at a slow speech speed without changing the characteristics of the speaker's voice.

[0273] (2) Effective use of the storage device (memory), a repeat function for the original speech, a voice memory function, a speech speed conversion function for repeated speech, a fast-listening playback function, and the like can be provided.

[0274] (3) Because means for changing to the speech speed selected with the speech speed selection switch is provided, the listener can select the speech speed of the speech being heard.

[0275] (4) Because means for repeating the reproduced speech while the repeat switch is ON is provided, speech speed conversion can be applied to the repeated speech.

[0276] (5) Because chasing means for catching up to the desired point in the information stored in the speech speed conversion device is provided, the range of application of the device can be expanded, operation time shortened, and usability improved.

[0277] (6) Because at least one of the speech speed conversion processing switch, the speech speed selection switch, the repeat switch, and the reset switch is provided at an easily operated peripheral portion of one side face of the speech speed conversion device, the range of application of the device can be expanded, operation time shortened, and usability improved.

[0278] (7) The efficiency of the speech speed conversion processing can be improved.

[0279] (8) The choice among waveform expansion, shortening, and silent-section deletion in the speech speed conversion processing is made by comparing the frame power with a threshold value, and the threshold value is changed according to the level of the input speech, so speech speed conversion suited to the conditions of the usage environment can be performed.

[0280] (9) Because the microphone does not pick up the click of the switches, the reproduced speech can be heard accurately.

[0281] (10) Because the switches have surface shapes with different tactile feel, so that each switch can be identified without looking, operability can be improved.

[0282] (11) Because means for preventing cloth-rubbing noise at the microphone is provided, the intrusion of noise can be reduced.

[0283] (12) Because display means on which the time delay from the present can be seen is provided at a predetermined position on the speech speed conversion device, operation time can be shortened and usability improved.

[0284] (13) Because a ring buffer is used as the storage means and means is provided for managing the delay time with a counter that represents the time delay on the ring buffer, the complicated pointer address calculations needed for repeat processing, chasing processing, and the like can be performed easily.

[0285] (14) Because a standby mode and an analog through mode are provided in addition to the through mode, power consumption can be reduced.

[0286] (15) Because the power switch has three positions, ON, OFF, and an intermediate position between ON and OFF, and an analog through mode is provided, power consumption can be reduced and the range over which the power supply can be used can be extended.

[0287] (16) Because the speech speed conversion means is provided between the handset of the telephone and the telephone main body, the listener can select the speech to which speech speed conversion is applied without the listener's own utterances being disturbed.

[0288] (17) With the telephone, speech can be heard at a slow speech speed without changing the characteristics of the speaker's voice.

[0289] (18) Because the speech speed conversion means is provided in the telephone exchange, the listener can select the speech to which speech speed conversion is applied without the listener's own utterances being disturbed.
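Effect (13) above, managing the delay with a single counter on a ring buffer, can be illustrated with a minimal sketch. The class and method names (`DelayRing`, `push`, `read`, `rewind`, `catch_up`) are hypothetical, the buffer holds whole frames as opaque objects, and the clamping rule in `rewind` is an assumption modelled on the idea of not going back past the end of the ring buffer.

```python
class DelayRing:
    """Ring buffer of frames whose playback position is 'newest minus delay'."""

    def __init__(self, size):
        self.buf = [None] * size
        self.write = 0    # index where the next incoming frame is stored
        self.stored = 0   # number of valid frames currently held
        self.delay = 0    # how many frames playback lags behind the newest frame

    def push(self, frame):
        """Store an incoming frame, overwriting the oldest one when full."""
        self.buf[self.write] = frame
        self.write = (self.write + 1) % len(self.buf)
        self.stored = min(self.stored + 1, len(self.buf))

    def read(self):
        """Return the frame that lies `delay` frames behind the newest frame."""
        index = (self.write - 1 - self.delay) % len(self.buf)
        return self.buf[index]

    def rewind(self, frames):
        """Repeat: go back by `frames`, but never past the oldest stored frame."""
        self.delay = max(0, min(self.delay + frames, self.stored - 1))

    def catch_up(self):
        """Chase by skipping straight back to the present."""
        self.delay = 0


if __name__ == "__main__":
    ring = DelayRing(4)
    for n in range(6):
        ring.push(n)
    ring.rewind(2)
    print(ring.read())
```

Because the read position is always derived from the write index and the delay counter, repeat and chase operations only change one integer and never manipulate a separate read pointer.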

[Brief description of drawings]

FIG. 1 is a block diagram showing the schematic configuration of the internal circuit of Embodiment 1 according to the present invention.

FIG. 2 is a diagram for explaining the speech speed conversion processing executed in the DSP of Embodiment 1.

FIG. 3 is a diagram for explaining the concept of the threshold processing of Embodiment 1.

FIG. 4 is a diagram showing how the speech speed conversion device of Embodiment 1 is used.

FIG. 5 is a flowchart showing the control procedure of the speech speed conversion device of Embodiment 1.

FIG. 6 is a front plan view of the speech speed conversion device of Embodiment 2 according to the present invention.

FIG. 7 is a rear plan view of the speech speed conversion device of Embodiment 2.

FIG. 8 is a top plan view of the speech speed conversion device of Embodiment 2.

FIG. 9 is a left side plan view of the speech speed conversion device of Embodiment 2.

FIG. 10 is a right side plan view of the speech speed conversion device of Embodiment 2.

FIG. 11 is a block diagram showing the functional configuration of the speech speed conversion device of Embodiment 2.

FIG. 12 is a schematic diagram for explaining the compression processing of the voice compression processing unit of Embodiment 2.

FIG. 13 is a flowchart showing the procedure of the main processing in Embodiment 2.

FIG. 14 is a continuation of the flowchart of FIG. 13.

FIG. 15 is a state transition diagram schematically showing the transitions between the modes in Embodiment 2.

FIG. 16 is a flowchart showing the processing procedure of the read/write pointer return routine in Embodiment 2.

FIG. 17 is a flowchart showing the waveform lengthening/shortening processing procedure for one frame in Embodiment 2.

FIG. 18 is a continuation of the flowchart of FIG. 17.

FIG. 19 is a flowchart showing the parameter setting processing procedure in Embodiment 2.

FIG. 20 is a diagram for explaining the data compression processing in Embodiment 2.

FIG. 21 is a diagram for explaining the data compression processing in Embodiment 2.

FIG. 22 is a diagram for explaining the data compression processing in Embodiment 2.

FIG. 23 is a flowchart showing the processing procedure of the overall operation of the speech speed conversion device to which the continuous speech speed conversion means of Embodiment 3 according to the present invention is added.

FIG. 24 is a continuation of the flowchart of FIG. 23.

FIG. 25 is a flowchart showing the processing procedure of the overall operation of the speech speed conversion device to which the continuous speech speed conversion means of Embodiment 4 according to the present invention, which differs from Embodiment 3, is added.

FIG. 26 is a continuation of the flowchart of FIG. 25.

FIG. 27 is a schematic diagram for explaining the accelerator-type switch used in the continuous speech speed conversion means of Embodiment 4.

FIG. 28 is a block diagram showing the functional configuration of the speech speed conversion device to which the AV control means of Embodiment 5 according to the present invention is added.

FIG. 29 is a diagram for explaining the operation of the AV control means of Embodiment 5.

FIG. 30 is a flowchart showing the main processing procedure of the speech speed conversion device to which the AV control means of Embodiment 5 is added.

FIG. 31 is a continuation of the flowchart of FIG. 30.

FIG. 32 is a diagram for explaining the arrangement of the microphone of the speech speed conversion device of Embodiment 6 according to the present invention.

FIG. 33 is a diagram showing the configuration of a modification of Embodiment 6.

FIG. 34 is a diagram for explaining the delay time display means of the speech speed conversion device of Embodiment 7 according to the present invention.

FIG. 35 is a diagram for explaining the power supply arrangement of the speech speed conversion device of Embodiment 8 according to the present invention.

FIG. 36 is a diagram for explaining Embodiment 9, in which the speech speed conversion means according to the present invention is applied to a telephone.

FIG. 37 is a diagram for explaining Embodiment 10, in which the speech speed conversion means according to the present invention is applied to a public address system.

[Explanation of symbols]

1 ... DSP (digital signal processor), 11 ... software that performs speech speed conversion processing, 12 ... serial port, 13 ... external interrupt flag terminal, 14 ... flag register, 2 ... voice memory, 3 ... selector switch, 4 ... PTL switch, 5 ... A/D converter, 6 ... D/A converter, 7 ... low-pass filter, 8 ... low-pass filter, 9 ... analog amplifier, 10 ... analog amplifier, 321 ... microphone, 325 ... binaural headphones (earphones), 101 ... main body of the speech speed conversion device, 102 ... back cover, 103 ... finger recess, 104 ... slow switch (slow push button), 105 ... repeat switch (repeat push button), 106 ... reset switch (reset push button), 108 ... volume control, 109 ... power switch, 110 ... earphone terminal, 111 ... external input terminal, 112 ... AV control terminal, 113 ... speech speed changeover switch (speech speed setting switch), 21 ... voice input unit, 22 ... input buffer, 23 ... central processing unit (CPU), 24 ... ring buffer memory, 25 ... function selection unit, 26 ... output buffer, 27 ... voice output unit, 28 ... AV controller.

Continuation of the front page
(72) Inventor: Yasunori Kawauchi, c/o AV Equipment Division, Hitachi, Ltd., 1410 Inada, Katsuta-shi, Ibaraki
(72) Inventor: Nobuo Hataoka, c/o Central Research Laboratory, Hitachi, Ltd., 1-280 Higashi-Koigakubo, Kokubunji-shi, Tokyo
(72) Inventor: Juichi Morikawa, c/o Prototype Development Center, Hitachi, Ltd., 292 Yoshida-cho, Totsuka-ku, Yokohama-shi, Kanagawa

Claims

[Claims]

1. A speech speed conversion method in which speech is input and only the speed of the speech is changed without changing the pitch of the input speech, characterized in that speech speed conversion processing of the input speech is performed only during a time designated when the listener needs speech speed conversion, and speech speed conversion is not performed at other times.

2. A speech speed conversion device having means for inputting speech, speech speed conversion processing means for changing the speed of the input speech, and means for outputting the output of the speech speed conversion processing means to a listener's ear, characterized in that the device is provided with a speech speed conversion processing switch, and with means for changing the speech speed of the input speech and outputting it only while the speech speed conversion processing switch is ON, and for outputting the input speech without changing its speech speed while the speech speed conversion processing switch is OFF.

3. A speech speed conversion method for encoding and accumulating original speech, reading out the accumulated encoded speech, and changing only the speed of the speech without changing the pitch of the original speech, A speech speed conversion method characterized in that the speech speed conversion processing of the input voice is performed only during a designated time when the speech speed conversion is required, and the speech speed conversion is not performed during the other time.

4. A speech speed conversion device having means for inputting original speech, storage means for encoding and accumulating the input speech, speech speed conversion processing means for reading out the accumulated encoded speech and changing the speed of the input speech, and means for outputting the output of the speech speed conversion processing means to a listener's ear as speech, characterized in that the device is provided with a speech speed conversion processing switch, and with means for changing the speech speed of the input speech and outputting it only while the speech speed conversion processing switch is ON, and for outputting the input speech without changing its speech speed while the speech speed conversion processing switch is OFF.

5. The voice speed conversion apparatus according to claim 4, wherein the storage unit has a unit for storing in frame units.

6. The speech speed conversion device according to claim 5, characterized in that the decision on waveform expansion/contraction processing in the speech speed conversion processing is made by comparing the frame power with a threshold value, and the threshold value is variable.

7. The speech speed conversion device according to any one of claims 2 and 4 to 6, characterized in that the device is provided with a speech speed selection switch for selecting a speech speed, and with means for changing to the speech speed selected by the speech speed selection switch.

8. The voice speed control apparatus according to claim 7, wherein said voice speed control apparatus is provided with means (AV control) for controlling an audio / video device.

9. The speech speed conversion device is provided with a repeat switch, and means for repeating the reproduced voice while the repeat switch is on (ON) is provided. 9. The speech speed conversion device according to any one of items 8 to 8.

10. The speech speed conversion device according to claim 9, characterized in that the repeat means comprises at least one of: means for going back several seconds each time the switch is pressed; means for occasionally generating an intermittent sound while going back; means for preventing any further going back once the end of the ring buffer is reached; and means for selecting the speech speed at the time of repeat.

11. The speech speed conversion device according to claim 10, characterized in that the means for selecting the speech speed at the time of repeat includes at least two of: a default repeat speed, a slow repeat, a fast-listening repeat, and a repeat whose speed gradually increases.

12. The speech speed conversion device according to claim 2, further comprising chasing means that, when a delay from real time has arisen through speech speed conversion or a repeat operation, adjusts the amount of delay while reproducing the stored information.

13. The speech speed conversion device according to claim 12, characterized in that the chasing means comprises at least one of: means for starting chasing when the slow reproduction mode ends; means for starting chasing when, after a repeat, reproduction reaches the point at which the repeat was started; means for selecting the speech speed at the time of chasing; means for automatically shifting, on catching up, to a through mode in which the input speech is output as it is; and means for generating a notification sound (message) on catching up.

14. The speech speed conversion device according to claim 13, characterized in that the means for selecting the speech speed at the time of chasing includes at least one of: means for skipping to the present at once; means for catching up with the present by fast listening; and means for continuing in parallel with the delay maintained.

15. At least one of the voice speed conversion processing switch, the voice speed selection switch, the repeat switch, and the reset switch is provided on one easily accessible peripheral portion on one side surface of the voice speed conversion device. The speech speed conversion device according to any one of claims 2, 4 to 14, characterized in that.

16. The reset switch has means for stopping the operation when the switch is turned on during the repeat operation or the chase operation, skipping the operation, and then shifting to the through mode. The speech speed conversion device described in.

17. The speech speed conversion device according to any one of claims 2 and 4 to 16, characterized in that the speech speed conversion processing means is provided as software executed by a digital signal processor having a terminal for inputting an interrupt request signal from outside, and the control by the speech speed conversion processing switch of switching the speech speed conversion processing, or of switching the conversion speed, is given to the digital signal processor through the terminal for inputting the interrupt request signal.

18. The voice speed conversion apparatus according to claim 2, further comprising means for listening to an output voice of the voice speed conversion apparatus through binaural headphones.

19. A speech speed conversion device comprising: a microphone that converts an acoustic signal into an electric signal; an analog amplifier that amplifies the microphone output; a low-pass filter that removes high-frequency components of the output of the analog amplifier; an A/D converter that converts the analog output signal of the low-pass filter into a digital signal; a digital signal processor that executes processing for changing the speed of speech by digital signal processing; storage means that stores input speech data and data resulting from signal processing; means for controlling the speed-changing processing performed by the digital signal processor; means for changing processing parameters; a D/A converter that converts digital speech data into an analog value; a second low-pass filter that removes high-frequency components of the output of the D/A converter; a second analog amplifier that amplifies the output of the second low-pass filter; and headphones that convert the output of the second analog amplifier into an acoustic signal and apply it to both ears.

20. A speech speed conversion device comprising: a microphone for converting an acoustic signal into an electric signal; an analog amplifier for amplifying the microphone output; a low-pass filter for removing high-frequency components of the output of the analog amplifier; an A/D converter for converting the analog output of the low-pass filter into a digital signal; storage means for storing input voice data and data of signal processing results; a digital signal processor that reads the stored information and executes processing for changing the speed of voice by digital signal processing; means for controlling the processing for changing the speed of voice performed by the digital signal processor; means for changing processing parameters; a D/A converter for converting digital voice data into an analog value; a second low-pass filter that removes high-frequency components of the output of the D/A converter; a second analog amplifier that amplifies the output of the second low-pass filter; and headphones that convert the output of the second analog amplifier into an acoustic signal and apply it to both ears.

21. The speech speed conversion device according to claim 19, wherein the speech speed conversion processing means performs pipeline processing in frame units using a plurality of input frame buffers and, for each frame of data, repeats the following series of procedures over the entire frame: first, pitch extraction processing is performed on the beginning of the frame to detect the pitch of that portion; the data of the detected one pitch length is transferred to the output buffer; data of two pitch lengths is multiplied by a window function that changes from 0 to 1 and by a window function that changes from 1 to 0; the results of applying the respective window functions are added to create a composite waveform with a time length of 2 pitches, which is inserted after the one pitch of data transferred earlier; pitch detection processing is performed again starting from a position separated by 2 pitches from the position on the data where the pitch extraction processing was performed, to detect the pitch at that position; and data for n pitches (n is an integer) is transferred to the output buffer, taking the pitch length obtained by the last pitch detection as a unit.

22. The speech speed conversion device according to claim 21, wherein the speech speed conversion processing means calculates the average power of the data in an input frame, executes the speech speed conversion processing only when the average power is larger than a preset threshold value, and, when the average power is smaller, transfers the data included in the frame to the output buffer as it is.

23. The speech speed conversion device according to claim 22, wherein a second threshold value is provided in the threshold processing for the average power of data in the input frame, and, when frames having an average power smaller than the second threshold value continue for longer than a preset time threshold value, transfer to the output buffer of the data of those frames that continue beyond the time threshold value is prohibited.

24. The speech speed conversion device according to any one of claims 2 and 4 to 18, wherein each of the switches is a soft-touch switch so that the microphone does not pick up the click sound of the switch.

25. The speech speed conversion device according to claim 24, wherein each of the switches has a surface form with a different tactile sensation so that the switch can be identified without looking.

26. The speech speed conversion device according to any one of claims 2 and 4 to 25, further comprising cloth-rubbing noise preventing means for changing the distance between the microphone and the device main body so that the microphone and clothing do not come into direct contact when the device main body is used in a chest pocket.

27. The speech speed conversion device according to any one of claims 2 and 4 to 26, wherein display means for visually indicating the amount of time delay from the present is provided at a predetermined position on the speech speed conversion device.

28. A ring buffer is used as the storage means, and means for managing the delay time is provided by a counter indicating a time delay on the ring buffer. The speech speed conversion device described in the item.

29. The speech speed conversion device according to any one of claims 19 to 28, wherein, in addition to the through mode, a standby mode is provided in which the processor clock is slowed and the same processing as in the through mode is performed.

30. The speech speed conversion device, wherein the power switch has three positions, ON, OFF, and an intermediate position between ON and OFF, and power supply means is provided such that, when the switch is set to the intermediate position, the device operates in an analog through mode in which the analog input and output systems of the audio signal processing system are short-circuited to each other and power supply to the digital processing system between the analog input and output systems is stopped.

31. A telephone set, comprising the speech speed converting means according to claim 2, provided between the handset of the telephone set and the main body of the telephone set.

32. A telephone exchange, wherein the speech speed converting means according to any one of claims 2, 4 to 30 is provided in the telephone exchange.
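
The pitch-synchronous procedure of claim 21 can be illustrated with a minimal Python sketch. It is one possible reading of the claim, not the patented implementation. The function names, the autocorrelation pitch detector, the lag limits (`min_lag`, `max_lag`, chosen here with 8 kHz sampling in mind), and the choice of which samples feed each window are all assumptions made for illustration.

```python
def detect_pitch(x, start, min_lag, max_lag):
    """Return the lag in [min_lag, max_lag] with the largest autocorrelation
    at position `start`. (Assumed detector; the claim does not name one.)"""
    def score(lag):
        return sum(x[start + i] * x[start + lag + i] for i in range(max_lag))
    return max(range(min_lag, max_lag + 1), key=score)


def stretch_frame(x, n=1, min_lag=32, max_lag=200):
    """Slow down one frame of samples `x` (a list of floats) without changing pitch."""
    out, pos = [], 0
    # Loop while enough samples remain for detection and two-pitch windowing.
    while pos + 3 * max_lag <= len(x):
        t = detect_pitch(x, pos, min_lag, max_lag)
        # Step 1: transfer one pitch length of data to the output.
        out.extend(x[pos:pos + t])
        # Step 2: build a composite of length 2 pitches from two windowed
        # segments. Assumption: the 1 -> 0 window is applied to the data that
        # follows the copied pitch, the 0 -> 1 window to the data from `pos`.
        for i in range(2 * t):
            rise = i / (2 * t)
            out.append((1.0 - rise) * x[pos + t + i] + rise * x[pos + i])
        # Step 3: resume 2 pitches after the first detection position.
        pos += 2 * t
        if pos + 2 * max_lag > len(x):
            break
        # Step 4: re-detect the pitch and transfer n pitches unchanged.
        t = detect_pitch(x, pos, min_lag, max_lag)
        end = min(pos + n * t, len(x))
        out.extend(x[pos:end])
        pos = end
    out.extend(x[pos:])  # remaining tail of the frame is passed through
    return out
```

Under this reading, and with a constant pitch, each cycle with n = 1 consumes three pitch periods of input and emits four, so the output is about 1.33 times as long as the input; a larger n gives a milder slowdown.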
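
The average-power gating of claims 22 and 23 can be sketched in the same way. This is an illustrative sketch only: the names `gate_frames`, `speech_threshold`, `silence_threshold`, and `max_silent_frames` are hypothetical, the time threshold is expressed as a frame count, and the second threshold is assumed to be no larger than the first.

```python
def average_power(frame):
    """Mean squared amplitude of a non-empty frame of samples."""
    return sum(s * s for s in frame) / len(frame)


def gate_frames(frames, convert, speech_threshold, silence_threshold,
                max_silent_frames):
    """Apply `convert` (the speed conversion) only to frames above
    `speech_threshold`, pass quieter frames through unchanged, and drop
    frames of a silent run once it exceeds `max_silent_frames`."""
    out = []
    silent_run = 0  # consecutive frames below the second (silence) threshold
    for frame in frames:
        power = average_power(frame)
        silent_run = silent_run + 1 if power < silence_threshold else 0
        if power > speech_threshold:
            out.extend(convert(frame))   # claim 22: convert voiced frames
        elif silent_run > max_silent_frames:
            continue                     # claim 23: transfer prohibited
        else:
            out.extend(frame)            # claim 22: transfer as it is
    return out
```

Dropping long silences in this way recovers part of the delay that accumulates while speech is being slowed down.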
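
Claim 28 manages the delay with a counter on a ring buffer. The following sketch shows one way such a counter could work. The class name `DelayRingBuffer` and its methods are hypothetical, and the policy of overwriting the oldest unread sample when the buffer is full is an assumption, not something the claim states.

```python
class DelayRingBuffer:
    """Ring buffer whose `delay` counter holds the number of samples that
    have been written but not yet read, i.e. the current time delay."""

    def __init__(self, capacity):
        self.buf = [0.0] * capacity
        self.write_pos = 0
        self.delay = 0

    def write(self, sample):
        if self.delay == len(self.buf):
            self.delay -= 1  # buffer full: the oldest unread sample is lost
        self.buf[self.write_pos] = sample
        self.write_pos = (self.write_pos + 1) % len(self.buf)
        self.delay += 1

    def read(self):
        """Return the oldest unread sample, or None when there is no delay."""
        if self.delay == 0:
            return None
        read_pos = (self.write_pos - self.delay) % len(self.buf)
        self.delay -= 1
        return self.buf[read_pos]

    def skip_to_present(self):
        """Discard all unread samples so that output catches up with input."""
        self.delay = 0

    def delay_seconds(self, sample_rate):
        return self.delay / sample_rate
```

Because the read position is derived from the write position and the counter, the counter alone is enough to report the delay or to reset it to zero.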