JPH08292796A

JPH08292796A - Reproducing device

Info

Publication number: JPH08292796A
Application number: JP7095490A
Authority: JP
Inventors: Koji Tanaka; 浩司田中; Masayuki Iida; 正幸飯田; Masanori Miyatake; 正典宮武
Original assignee: Sanyo Electric Co Ltd
Current assignee: Sanyo Electric Co Ltd
Priority date: 1995-04-20
Filing date: 1995-04-20
Publication date: 1996-11-05

Abstract

PURPOSE: To shorten the reproduction time of video and audio by automatically increasing their reproduction speed during audio sections that are not very important for understanding. CONSTITUTION: In the time-shortened reproduction mode, a control signal generation unit 26 normally sends a motor control unit a speed control signal corresponding to the reproduction speed magnification set at the start of reproduction, so that a capstan motor is driven at a speed corresponding to that magnification. When, in the time-shortened reproduction mode, the duration of a silent section calculated by a silence duration calculation unit 25 becomes equal to or greater than a predetermined value, the control signal generation unit 26 sends the motor control unit a speed control signal corresponding to a predetermined reproduction speed magnification larger than the one set at the start of reproduction. The capstan motor is then driven at a speed corresponding to this larger magnification; that is, the capstan motor speed is increased.

Description

Detailed Description of the Invention

[0001]

[Industrial Field of Application] The present invention relates to a reproducing device that reproduces video and audio.

[0002]

[Description of the Related Art] In a video tape recorder (VTR), video and audio are reproduced at the reproduction speed set at the start of reproduction. When reproduction in a shorter time is desired, the video and audio can be reproduced at, for example, double speed.

[0003] However, when reproduction is performed at a high speed such as double speed, the reproduced audio becomes difficult to understand. A technique has therefore already been developed that expands the audio reproduced during double-speed VTR reproduction along the time axis, so that the audio is output at standard speed. With this method, however, half of the audio is discarded.

[0004]

[Problems to Be Solved by the Invention] An object of the present invention is to provide a reproducing device that can automatically increase the reproduction speed of video and audio during audio sections that are not very important for understanding, and can thereby shorten the reproduction time of the video and audio.

[0005] A further object of the present invention is to provide a reproducing device that can automatically adjust the reproduction speed of video and audio according to the utterance speed of the reproduced audio, and can thereby produce an audio output that is easy to understand.

[0006]

[Means for Solving the Problems] A first reproducing device according to the present invention comprises: means for reproducing video and audio from a video source and a sound source, respectively; means for reproducing and outputting the video and audio at the reproduction speed set at the start of reproduction when the reproduced audio belongs to a voice section or to a silent section whose duration is less than a predetermined value; and means for reproducing and outputting the video and audio at a reproduction speed faster than the speed set at the start of reproduction when the reproduced audio belongs to a silent section whose duration is equal to or greater than the predetermined value.

[0007] A second reproducing device according to the present invention comprises: means for reproducing video and audio from a video source and a sound source, respectively; discrimination means for determining whether the reproduced audio belongs to a voice section or a silent section; means for reproducing and outputting the video and audio at the reproduction speed set at the start of reproduction when the reproduced audio belongs to a voice section or to a silent section whose duration is less than a predetermined value; and means for reproducing and outputting the video and audio at a reproduction speed faster than the speed set at the start of reproduction when the reproduced audio belongs to a silent section whose duration is equal to or greater than the predetermined value.

[0008] The first or second reproducing device may be provided with means for deleting the reproduced audio when it belongs to a silent section whose duration is equal to or greater than the predetermined value. It may also be provided with means for restoring the reproduced audio to its original pitch before output when the reproduced audio belongs to a silent section whose duration is equal to or greater than the predetermined value.

[0009] A third reproducing device according to the present invention comprises: means for reproducing video and audio from a video source and a sound source, respectively; means for detecting the utterance speed of the reproduced audio; and means for increasing the reproduction speed of the video and audio as the detected utterance speed becomes slower.

[0010] The third reproducing device may further be provided with: discrimination means for determining whether the reproduced audio belongs to a voice section or a silent section; means for applying time-axis compression/expansion processing, according to the current reproduction speed, to the reproduced audio when it belongs to a voice section or to a silent section whose duration is less than a predetermined value; and means for deleting the reproduced audio when it belongs to a silent section whose duration is equal to or greater than the predetermined value.

[0011]

[Operation] In the first reproducing device according to the present invention, when the reproduced audio belongs to a voice section or to a silent section whose duration is less than the predetermined value, the video and audio are reproduced and output at the reproduction speed set at the start of reproduction. When the reproduced audio belongs to a silent section whose duration is equal to or greater than the predetermined value, the video and audio are reproduced and output at a reproduction speed faster than the speed set at the start of reproduction.

[0012] In the second reproducing device according to the present invention, it is determined whether the reproduced audio belongs to a voice section or a silent section. When the reproduced audio belongs to a voice section or to a silent section whose duration is less than the predetermined value, the video and audio are reproduced and output at the reproduction speed set at the start of reproduction. When the reproduced audio belongs to a silent section whose duration is equal to or greater than the predetermined value, the video and audio are reproduced and output at a reproduction speed faster than the speed set at the start of reproduction.

[0013] In the third reproducing device according to the present invention, the utterance speed of the reproduced audio is detected. The slower the detected utterance speed, the faster the reproduction speed of the video and audio is made.

[0014]

[Embodiments] Embodiments in which the present invention is applied to a video tape recorder are described below with reference to the drawings.

[0015] (1) Description of the first embodiment

[0016] FIG. 1 shows the schematic configuration of a video tape recorder (VTR). In addition to the general operation modes provided in a conventional VTR, this VTR has a time-shortened reproduction mode that can shorten the reproduction time.

[0017] Two rotary heads 2 and 3 reproduce the video track of a video tape 1. The video reproduced alternately by the rotary heads 2 and 3 is supplied through a head switching circuit 4 to a video reproduction circuit 5, which converts it into a video signal.

[0018] An audio head 6 reproduces the audio track of the video tape 1. The audio signal read by the audio head 6 is sent to a selector 7. Normally, the selector 7 supplies the audio signal read by the audio head 6 to an audio reproduction circuit 8, which reproduces and outputs the input audio signal.

[0019] When the time-shortened reproduction mode is set, the selector 7 sends the audio signal read by the audio head 6 to a voice analysis/processing unit 9. The voice analysis/processing unit 9 analyzes the input audio signal and processes it according to the analysis result. It also outputs a speed command signal based on the analysis result.

[0020] A motor control unit 10 controls a capstan motor 11. Normally, the motor control unit 10 controls the capstan motor 11 based on settings such as the reproduction speed selected on an operation unit (not shown). In the time-shortened reproduction mode, the motor control unit 10 controls the capstan motor 11 based on the speed command signal from the voice analysis/processing unit 9.

[0021] The operation when the time-shortened reproduction mode is set is described below.

[0022] FIG. 2 shows the configuration of the voice analysis/processing unit 9.

[0023] The voice analysis/processing unit 9 includes an audio signal input unit 21, an audio processing unit 22, an audio signal output unit 23, a section discrimination unit 24, a silence duration calculation unit 25, and a control signal generation unit 26.

[0024] The audio signal input unit 21 includes, for example, an amplification unit, an A/D conversion unit, and a frame memory. A signal input to the audio signal input unit 21 is amplified, converted into a digital signal, and stored in the frame memory. The output of the audio signal input unit 21 is sent to the section discrimination unit 24 and the audio processing unit 22.

[0025] This embodiment describes the case where an analog signal is input to the voice analysis/processing unit 9, but a digital signal read from an IC memory or the like may be input instead. In that case, the audio signal input unit 21 does not need an A/D conversion unit.

[0026] The section discrimination unit 24 determines whether the input signal belongs to a voice section or a silent section. For example, it determines whether the one frame of audio data stored in the frame memory of the audio signal input unit 21 is a silent section or a voice section.

[0027] Whether a frame is a silent section or a voice section is determined, for example, by whether the average power of the one frame of audio data stored in the frame memory of the audio signal input unit 21 is equal to or greater than a given threshold. That is, if the average power is equal to or greater than the threshold, the frame is judged to be a voice section; if the average power is less than the threshold, the frame is judged to be a silent section.

[0028] More specifically, the average power value P of the one frame of audio data read from the frame memory of the audio signal input unit 21 is calculated. Letting the amplitudes of the sampled audio data within one frame be i_0, i_1, ..., i_{N-1} (where N is the number of audio data samples in one frame), the average power value P is calculated by the following Equation 1.

[0029]

[Equation 1]

[0030] The calculated average power value P is compared with a threshold Th. When the average power value P is equal to or greater than the threshold (P ≥ Th), the section discrimination unit 24 outputs a signal indicating that the current frame is a voice section; when P is less than the threshold (P < Th), it outputs a signal indicating that the current frame is a silent section. The discrimination result is sent to the silence duration calculation unit 25 and the control signal generation unit 26.
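The frame-level decision can be sketched in a few lines. This is a minimal illustration, not the patent's circuit: Equation 1 is not reproduced in this text, so the mean of squared amplitudes used in `average_power` is an assumed form, and the function names and the threshold value are hypothetical.

```python
def average_power(frame):
    """Average power of one frame.

    Assumption: mean of squared sample amplitudes. The source's Equation 1
    is not reproduced in this text, so this form is a guess.
    """
    return sum(s * s for s in frame) / len(frame)


def classify_frame(frame, threshold):
    """Return "voice" when P >= Th, otherwise "silent"."""
    return "voice" if average_power(frame) >= threshold else "silent"


# Example with an illustrative threshold of 1.0
print(classify_frame([0, 0, 0, 0], 1.0))    # silent
print(classify_frame([2, -2, 2, -2], 1.0))  # voice
```

Note that a frame whose average power equals the threshold is treated as a voice section, matching the P ≥ Th condition in the text.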

[0031] The silence duration calculation unit 25 calculates the duration (the number of consecutive frames) of the silent section identified by the section discrimination unit 24. The calculated duration of the silent section is sent to the control signal generation unit 26.
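As a rough sketch of this counting step, assuming the per-frame labels are the strings "voice" and "silent" (an illustrative encoding, not something the source specifies), the duration can be tracked as a running count that resets whenever a voice frame arrives. The function name `silence_durations` is hypothetical.

```python
def silence_durations(labels):
    """For each frame, return the number of consecutive silent frames
    ending at that frame (0 for a voice frame)."""
    run = 0
    durations = []
    for label in labels:
        # Extend the run on a silent frame, reset it on a voice frame.
        run = run + 1 if label == "silent" else 0
        durations.append(run)
    return durations


print(silence_durations(["voice", "silent", "silent", "voice", "silent"]))
# [0, 1, 2, 0, 1]
```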

[0032] In the time-shortened reproduction mode, the control signal generation unit 26 normally sends the motor control unit 10 a speed control signal corresponding to the reproduction speed magnification set at the start of reproduction. Accordingly, in the time-shortened reproduction mode the capstan motor 11 is normally driven at a speed corresponding to the reproduction speed magnification set at the start of reproduction.

[0033] In the time-shortened reproduction mode, when the duration of the silent section calculated by the silence duration calculation unit 25 becomes equal to or greater than a predetermined value, the control signal generation unit 26 sends the motor control unit 10 a speed control signal corresponding to a predetermined reproduction speed magnification larger than the one set at the start of reproduction. Accordingly, when the duration of the silent section becomes equal to or greater than the predetermined value, the capstan motor 11 is driven at a speed corresponding to this larger reproduction speed magnification. In other words, the capstan motor 11 is sped up.

[0034] When the section discrimination unit 24 detects a voice section after the duration of the silent section has reached the predetermined value, the control signal generation unit 26 sends the motor control unit 10 a speed control signal corresponding to the reproduction speed magnification set at the start of reproduction. The rotation speed of the capstan motor 11 therefore returns to the speed corresponding to the magnification set at the start of reproduction.
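The speed-selection behaviour described for the control signal generation unit 26 amounts to a small state machine. The following is a minimal sketch under stated assumptions: frames are labelled "voice" or "silent", the duration threshold is expressed in frames, and the names `speed_commands`, `base_speed`, `fast_speed`, and `min_silent_frames` are all hypothetical.

```python
def speed_commands(labels, base_speed, fast_speed, min_silent_frames):
    """Return one speed magnification per frame.

    The fast magnification is commanded only while the current run of
    silent frames is at least `min_silent_frames` long; a voice frame
    resets the run and restores the base magnification.
    """
    run = 0
    commands = []
    for label in labels:
        run = run + 1 if label == "silent" else 0
        commands.append(fast_speed if run >= min_silent_frames else base_speed)
    return commands


labels = ["voice", "silent", "silent", "silent", "voice"]
print(speed_commands(labels, 1.0, 2.0, 2))
# [1.0, 1.0, 2.0, 2.0, 1.0]
```

In this sketch the first silent frame still runs at the base speed, because the run has not yet reached the threshold; the speed-up begins only once the silence has lasted long enough.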

[0035] When the duration of the silent section is equal to or greater than the predetermined value, the control signal generation unit 26 sends an audio processing control signal to the audio processing unit 22. As shown in FIG. 3, the audio processing unit 22 includes a selector 31 and an audio deletion unit 32. While the audio processing control signal from the control signal generation unit 26 is being input, that is, while the duration of the silent section is equal to or greater than the predetermined value, the selector 31 sends the input audio signal to the audio deletion unit 32. Consequently, the input audio signal of a silent section whose duration is equal to or greater than the predetermined value is deleted by the audio deletion unit 32.

[0036] The audio signal of a voice section and the audio signal of a silent section whose duration is less than the predetermined value are sent through the selector 31 to the audio signal output unit 23. In other words, in the time-shortened reproduction mode of this embodiment, only the audio signal of voice sections and of silent sections whose duration is less than the predetermined value is output through the audio signal output unit 23.
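The routing performed by the selector 31 and the audio deletion unit 32 can be illustrated as a filter over frames. This is a sketch only: the frame labels, the frame-count threshold, and the name `gate_frames` are assumptions made for illustration, and frames are represented by arbitrary placeholder values.

```python
def gate_frames(frames, labels, min_silent_frames):
    """Pass frames through unless they fall in a silent run that has
    already lasted `min_silent_frames` frames or more."""
    run = 0
    passed = []
    for frame, label in zip(frames, labels):
        run = run + 1 if label == "silent" else 0
        if run < min_silent_frames:
            passed.append(frame)  # routed to the output unit
        # otherwise the frame is routed to the deletion unit and dropped
    return passed


frames = ["a", "b", "c", "d", "e"]
labels = ["voice", "silent", "silent", "silent", "voice"]
print(gate_frames(frames, labels, 2))
# ['a', 'b', 'e']
```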

[0037] The audio signal output unit 23 includes a D/A conversion unit. The digital signal sent from the audio processing unit 22 to the audio signal output unit 23 is converted into an analog signal and output. While the audio signal of a voice section or of a silent section whose duration is less than the predetermined value is being input, the reproduction speed is the speed set at the start of reproduction, so these audio signals are output at a speed corresponding to that setting.

[0038] This embodiment describes the case where the voice analysis/processing unit 9 outputs the audio signal as an analog signal, but the audio signal may instead be output as a digital signal. In that case, the audio signal output unit 23 does not need a D/A conversion unit.

[0039] In the first embodiment described above, when the time-shortened reproduction mode is set, the input audio of a silent section whose duration is equal to or greater than the predetermined value is deleted, and the reproduction speed is increased while such audio is being input. The reproduction time can therefore be shortened.

[0040] FIG. 4 shows a modification of the audio processing unit of FIG. 2.

[0041] This audio processing unit 122 includes a selector 33 and a thinning processing unit 34. While the audio processing control signal from the control signal generation unit 26 is being input, that is, while the duration of the silent section is equal to or greater than the predetermined value, the selector 33 sends the input audio signal to the thinning processing unit 34. The thinning processing unit 34 thins out the input audio signal at a compression ratio of 1/n, where n is the currently set reproduction speed magnification. The output audio signal of the thinning processing unit 34 is sent to the audio signal output unit 23.

[0042] When the input and output signals of the voice analysis/processing unit 9 are both analog signals, the sampling frequency of the D/A conversion unit in the audio signal output unit 23 is set to the standard sampling frequency f_SO, and the sampling frequency of the A/D conversion unit in the audio signal input unit 21 is set to n·f_SO, where n is the currently set reproduction speed magnification.

[0043] When audio of a silent section whose duration is equal to or greater than the predetermined value is being input and the reproduction speed magnification n is set to, for example, 2, the sampling frequency of the A/D conversion unit in the audio signal input unit 21 becomes 2f_SO. The thinning processing unit 34 thins every two pitch periods of the input audio signal down to one. Consequently, the audio output from the audio signal output unit 23 runs at twice the standard audio speed, but its pitch is that of standard-speed reproduction.
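A short sketch can make the bookkeeping concrete. Sampling at n·f_SO while the tape runs n times faster yields the same number of samples per unit of recorded content as standard playback, so keeping one pitch period out of every n brings the output back to real time at f_SO without shifting the pitch. The code below assumes a fixed pitch period measured in samples and keeps the first period of each group; the source does not say how pitch periods are located or which one is kept, and `thin_pitch_periods`, `period`, and the 8 kHz figure are hypothetical.

```python
def thin_pitch_periods(samples, period, n):
    """Keep the first block of `period` samples out of every n blocks.

    Assumption: the pitch period is constant and known in samples.
    """
    kept = []
    for start in range(0, len(samples), period * n):
        kept.extend(samples[start:start + period])
    return kept


print(thin_pitch_periods(list(range(8)), 2, 2))
# [0, 1, 4, 5]

# Timing check with illustrative numbers: f_SO = 8000 Hz, n = 2.
# One second of input sampled at 2 * f_SO gives 16000 samples; after
# thinning, 8000 samples remain, which take one second to play at f_SO.
f_so, n = 8000, 2
one_second_in = list(range(n * f_so))
print(len(thin_pitch_periods(one_second_in, 80, n)))
# 8000
```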

[0044] When the input and output signals of the voice analysis/processing unit 9 are both digital signals, the input rate of the data entering the audio signal input unit 21 is set to n times the output rate of the data leaving the audio signal output unit 23, where n is the currently set reproduction speed magnification.

[0045] The input audio signal of a voice section and of a silent section whose duration is less than the predetermined value is sent through the selector 33 to the audio signal output unit 23. While such audio signals are being input, the reproduction speed is the speed set at the start of reproduction, so they are output at a speed corresponding to that setting.

[0046] In other words, in the time-shortened reproduction mode the reproduction speed is increased while audio of a silent section whose duration is equal to or greater than the predetermined value is being input, so the reproduction time can be shortened. The audio signal of such a silent section is thinned out at a compression ratio of 1/n, where n is the reproduction speed magnification at that time, before being output. As a result, it is output with its pitch restored to that of standard-speed reproduction.

[0047] (2) Description of the second embodiment

[0048] FIG. 5 shows the schematic configuration of a video tape recorder (VTR). In addition to the general operation modes provided in a conventional VTR, this VTR has a variable-speed reproduction mode that controls the reproduction speed according to the utterance speed in order to produce output audio that is easy to understand.

[0049] Two rotary heads 2 and 3 reproduce the video track of a video tape 1. The video reproduced alternately by the rotary heads 2 and 3 is supplied through a head switching circuit 4 to a video reproduction circuit 5, which converts it into a video signal.

[0050] An audio head 6 reproduces the audio track of the video tape 1. The audio signal read by the audio head 6 is sent to a selector 7. Normally, the selector 7 supplies the audio signal read by the audio head 6 to an audio reproduction circuit 8, which reproduces and outputs the input audio signal.

[0051] When the variable-speed reproduction mode is set, the selector 7 sends the audio signal read by the audio head 6 to a voice analysis/processing unit 12. The voice analysis/processing unit 12 analyzes the input audio signal and processes it according to the analysis result.

[0052] An utterance speed detection unit 13 detects the utterance speed based, for example, on the output of the voice analysis/processing unit 12, and outputs a speed command signal based on the detected utterance speed.

[0053] A motor control unit 10 controls a capstan motor 11. Normally, the motor control unit 10 controls the capstan motor 11 based on settings such as the reproduction speed selected on an operation unit (not shown). In the variable-speed reproduction mode, the motor control unit 10 controls the capstan motor 11 based on the speed command signal from the utterance speed detection unit 13.

[0054] The operation when the variable-speed reproduction mode is set is described below.

[0055] FIG. 6 shows the configuration of the voice analysis/processing unit 12.

[0056] The voice analysis/processing unit 12 includes an audio signal input unit 41, a section discrimination unit 42, a signal processing unit 43, an audio memory 44, an unread data storage amount calculation unit 45, and an audio signal output unit 46. The signal processing unit 43 includes a time-axis compression/expansion unit 51, a deletion unit 52, and other components.

[0057] The audio signal input unit 41 includes, for example, an amplification unit, an A/D conversion unit, and a frame memory. A signal input to the audio signal input unit 41 is amplified, converted into a digital signal, and stored in the frame memory. The output of the audio signal input unit 41 is sent to the section discrimination unit 42 and the signal processing unit 43. This embodiment describes the case where an analog signal is input to the voice analysis/processing unit 12, but a digital signal read from an IC memory or the like may be input instead. In that case, the audio signal input unit 41 does not need an A/D conversion unit.

[0058] As in the section discrimination unit 24 of FIG. 2, the section discrimination unit 42 determines whether the input signal belongs to a voice section or a silent section. The determination result of the section discrimination unit 42 is sent to the signal processing unit 43.

[0059] The signal processing unit 43 processes the input signal sent from the audio signal input unit 41 according to the determination result of the section discrimination unit 42. Specifically, the input signal of a silent section whose duration is equal to or longer than a predetermined value is deleted by the deletion unit 52. The input signal of a voice section, and of a silent section whose duration is less than the predetermined value, is subjected to time-axis compression/expansion processing by the time-axis compression/expansion unit 51 at a compression rate of 1/n or more, where n is the current reproduction speed magnification.
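The per-section handling described in this paragraph can be sketched as follows. This is a minimal illustration, not the circuit of the embodiment: the function name `process_sections`, the tuple representation of a section, and the durations in seconds are assumptions introduced here. The compression rate is taken as output length divided by input length and is only checked against the 1/n lower bound stated in the text.

```python
def process_sections(sections, min_silence, n, compression_rate):
    """Return the output duration of each section.

    sections: list of (kind, duration), kind being "voice" or "silence"
              (hypothetical representation, durations in seconds).
    min_silence: the "predetermined value" for silent-section duration.
    n: current reproduction speed magnification.
    compression_rate: output length / input length; must be 1/n or more.
    """
    if compression_rate < 1.0 / n:
        raise ValueError("compression rate must be 1/n or more")
    output = []
    for kind, duration in sections:
        if kind == "silence" and duration >= min_silence:
            # Long silent section: removed entirely (role of deletion unit 52).
            output.append((kind, 0.0))
        else:
            # Voice section or short silence: time-axis compressed
            # (role of time-axis compression/expansion unit 51).
            output.append((kind, duration * compression_rate))
    return output
```

With a compression rate above 1/n, the compressed voice sections take longer to play than the tape takes to deliver them, and the deleted long silences are what allow the output to catch up.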

[0060] Time-axis compression/expansion methods that can be used in the time-axis compression/expansion unit 51 include, for example, the overlap-add method with pointer interval control (Pointer Interval Control Overlap and Add: PICOLA) and the TDHS (Time Domain Harmonic Scaling) method.

[0061] A method of compressing an input signal (the audio data input to the time-axis compression/expansion unit 51) at a compression rate of 2/3 using PICOLA is briefly described with reference to FIG. 7. First, the pitch period is extracted from the input signal. Let the extracted pitch period be Tp. Waveform A is given a weight that falls linearly from 1 to 0 (weighting function K1) to create waveform A'. Waveform B is given a weight that rises from 0 to 1 (weighting function K2) to create waveform B'.

[0062] Waveforms A' and B' are then added together to create a waveform A'*B' of length Tp. The weights are applied to maintain continuity at the connection points before and after waveform A'*B'. Next, the pointer is moved by 3Tp, a length determined on the basis of the compression rate, and the same operation is performed. As a result, two waveforms, A'*B' and C, are obtained from the three waveforms A, B, and C. In this way, a signal of three pitch periods is compressed into a signal of two pitch periods.
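The 2/3 compression step described above can be sketched in a few lines. This is an illustrative sketch under stated assumptions, not the embodiment's implementation: the pitch period is passed in as a fixed sample count `tp` instead of being extracted from the signal, the function name is hypothetical, and any trailing samples shorter than 3Tp are passed through unchanged.

```python
def picola_compress_two_thirds(samples, tp):
    """Compress `samples` to about 2/3 of its length by overlap-add.

    samples: list of float sample values.
    tp: pitch period in samples (assumed known; pitch extraction is omitted).
    """
    if tp <= 0:
        raise ValueError("tp must be positive")
    out = []
    pos = 0  # the pointer
    while pos + 3 * tp <= len(samples):
        a = samples[pos:pos + tp]            # waveform A
        b = samples[pos + tp:pos + 2 * tp]   # waveform B
        c = samples[pos + 2 * tp:pos + 3 * tp]  # waveform C
        for i in range(tp):
            k2 = i / tp      # weighting function K2: rises from 0 toward 1
            k1 = 1.0 - k2    # weighting function K1: falls from 1 toward 0
            out.append(k1 * a[i] + k2 * b[i])  # A' + B' -> A'*B'
        out.extend(c)        # C is copied as is
        pos += 3 * tp        # move the pointer by 3Tp
    out.extend(samples[pos:])  # assumption: leftover tail is kept unchanged
    return out
```

For a signal that is exactly periodic with period `tp`, waveforms A and B are identical, so the weighted sum reproduces one period and every three input periods become two output periods.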

[0063] The output of the signal processing unit 43 is temporarily stored in the audio memory 44 and then sent to the audio signal output unit 46 and output. The audio signal output unit 46 includes a D/A conversion unit. The digital signal sent from the audio memory 44 to the audio signal output unit 46 is converted into an analog signal and output from the audio signal output unit 46. This embodiment shows the case where the audio signal is output from the voice analysis/processing unit 12 as an analog signal, but it may be output as a digital signal instead. In that case, the audio signal output unit 46 does not need a D/A conversion unit.

[0064] When the input and output signals of the voice analysis/processing unit 12 are both analog signals, the sampling frequency of the D/A conversion unit in the audio signal output unit 46 is set to the standard sampling frequency f_SO, and the sampling frequency of the A/D conversion unit in the audio signal input unit 41 is set to n·f_SO, where n is the current reproduction speed magnification. Therefore, even during high-speed reproduction, the pitch of the output audio is the original pitch.
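The reason the pitch is preserved can be checked with a short calculation. The sketch below is only an arithmetic illustration: the function name and the example values (a 440 Hz tone, f_SO of 8000 Hz) are assumptions, and the time-axis compression between input and output is ignored because it is intended to leave the pitch unchanged.

```python
def output_tone_frequency(recorded_hz, n, f_so):
    """Frequency heard at the output for a tone recorded at `recorded_hz`."""
    tape_hz = n * recorded_hz   # tape runs n times faster, so the tone is n times higher
    adc_rate = n * f_so         # A/D conversion unit samples at n * f_SO
    cycles_per_sample = tape_hz / adc_rate  # equals recorded_hz / f_so
    dac_rate = f_so             # D/A conversion unit runs at f_SO
    return cycles_per_sample * dac_rate
```

The factor n introduced by the faster tape cancels against the factor n in the input sampling frequency, so `output_tone_frequency(440.0, 2, 8000.0)` evaluates to approximately 440.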

[0065] When the input and output signals of the voice analysis/processing unit 12 are both digital signals, the input rate of the data input to the audio signal input unit 41 is set to n times the output rate of the data output from the audio signal output unit 46, where n is the current reproduction speed magnification. Therefore, even during high-speed reproduction, the pitch of the output audio is the original pitch.

[0066] The unread data storage amount calculation unit 45 calculates the amount of audio data that has been written into the audio memory 44 but not yet read out (the unread data storage amount).
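One simple way to obtain such a quantity is to keep running totals of written and read data and take their difference. The class below is a hypothetical sketch of that bookkeeping only; its name, its methods, and the use of sample counts as the unit are assumptions, and it does not model the storage itself.

```python
class AudioMemory:
    """Tracks how much data has been written to and read from a buffer."""

    def __init__(self):
        self.written_total = 0
        self.read_total = 0

    def write(self, count):
        """Record that `count` samples were written."""
        self.written_total += count

    def read_out(self, count):
        """Record a read of up to `count` samples; return how many were read."""
        count = min(count, self.unread())
        self.read_total += count
        return count

    def unread(self):
        """Unread data storage amount: written but not yet read."""
        return self.written_total - self.read_total
```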

[0067] The output of the voice analysis/processing unit 12 is also sent to the utterance speed detection unit 13. The utterance speed detection unit 13 detects the utterance speed of the audio being reproduced on the basis of the output signal of the voice analysis/processing unit 12.

[0068] When the utterance speed of the audio being reproduced is slow, the reproduction speed of the video and audio is increased. That is, the utterance speed detection unit 13 supplies a speed command signal to the motor control unit 10 so that the rotation speed of the capstan motor 11 increases.

[0069] Conversely, when the utterance speed of the audio being reproduced is fast, the reproduction speed of the video and audio is decreased. That is, the utterance speed detection unit 13 supplies a speed command signal to the motor control unit 10 so that the rotation speed of the capstan motor 11 decreases.

[0070] For example, when the reproduction speed is set to double speed at the start of reproduction in the variable-speed reproduction mode, the reproduction speed magnification is normally 2, and the input audio of voice sections and of silent sections whose duration is less than the predetermined value is compressed by the time-axis compression/expansion unit 51 at a compression rate of, for example, 2/3 and then output. The input audio of silent sections whose duration is equal to or longer than the predetermined value is deleted by the deletion unit 52.

[0071] The reproduction speed is then controlled within the range of reproduction speed magnifications 1 to 2 according to the utterance speed detected by the utterance speed detection unit 13. The faster the detected utterance speed, the closer the reproduction speed magnification is brought to 1; the slower the detected utterance speed, the closer the reproduction speed magnification is brought to 2.
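A mapping with this behaviour can be sketched as below. The text only states the direction of the relationship and the range of 1 to 2, so the linear interpolation, the two threshold parameters, and the function name are assumptions made for illustration; the units of the utterance speed are left open.

```python
def reproduction_speed(utterance_speed, slow_speed, fast_speed, max_magnification=2.0):
    """Map a detected utterance speed to a reproduction speed magnification.

    slow_speed, fast_speed: hypothetical thresholds (slow_speed < fast_speed).
    At or below slow_speed the magnification is max_magnification;
    at or above fast_speed it is 1; in between it falls linearly (assumed).
    """
    if utterance_speed <= slow_speed:
        return max_magnification
    if utterance_speed >= fast_speed:
        return 1.0
    t = (utterance_speed - slow_speed) / (fast_speed - slow_speed)
    return max_magnification - t * (max_magnification - 1.0)
```

An actual device might instead switch between a few discrete magnifications, each paired with its own compression rate.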

[0072] An example of the relationship in this case between the reproduction speed and the compression rate used in the time-axis compression/expansion unit 51 is shown in the following table.

[0073]

[Table 1]

[0074] The following methods can be used to detect the utterance speed.

[0075] (a) The utterance speed is detected on the basis of the unread data storage amount in the audio memory 44 calculated by the unread data storage amount calculation unit 45. The larger the unread data storage amount, the faster the utterance speed is judged to be.

[0076] (b) The utterance speed is detected on the basis of the change over time in the unread data storage amount of the audio memory 44. The larger the change over time, the faster the utterance speed is judged to be.

[0077] (c) The utterance speed is detected on the basis of the ratio between voice sections and silent sections per unit time. The longer the total duration of the voice sections per unit time, the faster the utterance speed is judged to be.

[0078] (d) The utterance speed is detected on the basis of the number of vowels per unit time. The larger the number of vowels per unit time, the faster the utterance speed is judged to be.

[0079] (e) The utterance speed is detected on the basis of the amount of change in frequency components per unit time. The larger the amount of change per unit time, the faster the utterance speed is judged to be.
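Two of the measures listed above, (b) and (c), reduce to simple arithmetic and can be sketched as follows. The function names, the regular sampling of the unread amount, and the tuple representation of sections are assumptions for illustration; how the resulting values are thresholded into a speed judgment is not specified here.

```python
def unread_change_per_second(unread_amounts, interval):
    """Measure for method (b): average change in unread amount per second.

    unread_amounts: unread data storage amounts sampled every `interval` seconds.
    """
    if len(unread_amounts) < 2:
        raise ValueError("need at least two samples")
    elapsed = interval * (len(unread_amounts) - 1)
    return (unread_amounts[-1] - unread_amounts[0]) / elapsed


def voiced_time(sections):
    """Measure for method (c): total voice-section time within one unit-time window.

    sections: list of (kind, duration), kind being "voice" or "silence".
    """
    return sum(duration for kind, duration in sections if kind == "voice")
```

In both cases a larger value corresponds to a faster judged utterance speed.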

[0080] In the above embodiment, in the variable-speed reproduction mode, the utterance speed of the voice message being reproduced is detected on the basis of the output of the voice analysis/processing unit 12. However, when utterance speed detection method (c), (d), or (e) is used, the utterance speed of the voice message can also be detected on the basis of the input signal of the voice analysis/processing unit 12. That is, when method (c), (d), or (e) is used, the utterance speed detection unit 13 may be provided in the stage preceding the voice analysis/processing unit 12, as shown in FIG. 8.

[0081] In the second embodiment, when the variable-speed reproduction mode is set, the faster the utterance speed of the audio output from the voice analysis/processing unit 12, or of the audio output from the audio head 6, the slower the reproduction speed of the video and audio is made, so output audio that is easy to understand can be obtained.

[0082]

[Effects of the Invention] According to the present invention, the reproduction speed of video and audio can be automatically increased for audio in sections that are not very important for understanding. The reproduction time of the video and audio can therefore be shortened.

[0083] According to the present invention, the reproduction speed of video and audio can be automatically adjusted according to the utterance speed of the reproduced audio, and an audio output that is easy to hear can be obtained.

[Brief description of drawings]

FIG. 1 is a configuration diagram showing the schematic configuration of a video tape recorder according to a first embodiment of the present invention.

FIG. 2 is a block diagram showing the configuration of the voice analysis/processing unit in FIG. 1.

FIG. 3 is a block diagram showing the configuration of the voice processing unit in FIG. 2.

FIG. 4 is a block diagram showing a modified example of the voice processing unit.

FIG. 5 is a configuration diagram showing the schematic configuration of a video tape recorder according to a second embodiment of the present invention.

FIG. 6 is a block diagram showing the configuration of the voice analysis/processing unit in FIG. 5.

FIG. 7 is a schematic diagram for explaining the time-axis compression/expansion method using PICOLA.

FIG. 8 is a configuration diagram showing the schematic configuration of a video tape recorder that is a modified example of the second embodiment of the present invention.

[Explanation of symbols]

2, 3: rotary head
5: video reproduction circuit
6: audio head
9, 12: voice analysis/processing unit
10: motor control unit
11: capstan motor
13: utterance speed detection unit
21, 41: audio signal input unit
23, 46: audio signal output unit
24, 42: section discrimination unit
22, 122: voice processing unit
32: deletion unit
34: decimation processing unit
43: signal processing unit
44: audio memory
51: time-axis compression/expansion unit
52: deletion unit

Claims

[Claims]

1. A reproducing device comprising: means for reproducing video and audio from a video source and an audio source, respectively; means for reproducing and outputting the video and audio at the reproduction speed set at the start of reproduction when the reproduced audio is audio in a voice section or audio in a silent section whose duration is less than a predetermined value; and means for reproducing and outputting the video and audio at a reproduction speed faster than the reproduction speed set at the start of reproduction when the reproduced audio is audio in a silent section whose duration is equal to or longer than the predetermined value.

2. A reproducing device comprising: means for reproducing video and audio from a video source and an audio source, respectively; discrimination means for discriminating whether the reproduced audio is audio in a voice section or audio in a silent section; means for reproducing and outputting the video and audio at the reproduction speed set at the start of reproduction when the reproduced audio is audio in a voice section or audio in a silent section whose duration is less than a predetermined value; and means for reproducing and outputting the video and audio at a reproduction speed faster than the reproduction speed set at the start of reproduction when the reproduced audio is audio in a silent section whose duration is equal to or longer than the predetermined value.

3. The reproducing device according to claim 1, further comprising means for deleting the reproduced audio when the reproduced audio is audio in a silent section whose duration is equal to or longer than the predetermined value.

4. The reproducing device according to claim 1, further comprising means for restoring the pitch of the reproduced audio and outputting the reproduced audio when the reproduced audio is audio in a silent section whose duration is equal to or longer than the predetermined value.

5. A reproducing device comprising: means for reproducing video and audio from a video source and an audio source, respectively; means for detecting the utterance speed of the reproduced audio; and means for making the reproduction speed of the video and audio faster as the detected utterance speed becomes slower.

6. The reproducing device according to claim 5, further comprising: discrimination means for discriminating whether the reproduced audio is audio in a voice section or audio in a silent section; means for performing time-axis compression/expansion processing on the reproduced audio according to the current reproduction speed when the reproduced audio is audio in a voice section or audio in a silent section whose duration is less than a predetermined value; and means for deleting the reproduced audio when the reproduced audio is audio in a silent section whose duration is equal to or longer than the predetermined value.