JPH07129198A

JPH07129198A - Sound recording device and reproducing device

Info

Publication number: JPH07129198A
Application number: JP5274599A
Authority: JP
Inventors: Hideo Okano; 秀生岡野; Hideyuki Takahashi; 秀享高橋
Original assignee: Olympus Optical Co Ltd
Current assignee: Olympus Corp
Priority date: 1993-11-02
Filing date: 1993-11-02
Publication date: 1995-05-19

Abstract

PURPOSE:To prevent the voice from being hard to hear even in reproducing at a high speed as over twice. CONSTITUTION:A main control circuit 8 controls an address control circuit 9 at the time of reproducing, and the data of one block is read from a memory part 10, and further the data of another block is read when a quick listening process is conducted. If these two pieces of block data contain a voiceless sound flag, data transfer process to a DSP part 5 is conducted, and if no voiceless sound flag, a direction to perform a temporal axis compression in TDHS system is given to the DSP part 5, and data transfer to the DSP part 5 is conducted. Thereby the temporal axis compression is made by TDHS system or through partial thinning in the case of voice sound or not of sound, while such as operation is performed in the case of voiceless sound division that the temporal axis compression is not made or that the ratio of temporal axis compression is suppressed.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、マイクロホンより入力
したアナログ信号をディジタル信号に変換してメモリ等
の記録媒体に記録し、その記録した信号をアナログ信号
に変換してスピーカで再生できるような音声記録装置及
び音声再生装置に係り、特に、高速早聞きができる高速
再生機能を持った音声記録再生装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention can convert an analog signal input from a microphone into a digital signal, record it on a recording medium such as a memory, convert the recorded signal into an analog signal and reproduce it by a speaker. The present invention relates to an audio recording device and an audio reproducing device, and more particularly, to an audio recording and reproducing device having a high-speed reproduction function that enables high-speed and fast listening.

【０００２】[0002]

【従来の技術】一般に、テープレコーダの用途の一つと
して、口述記録がある。例えば、予め送るべき手紙の内
容をテープレコーダにてテープに音声で記録しておき、
秘書などにこの録音したテープを渡し、この秘書など
は、そのテープを再生して音声を聞きながら手紙をタイ
プ作成する、あるいは、会議の内容を録音しておき、後
でこの録音を聞きながら議事録を作成する、等、様々な
利用法がある。2. Description of the Related Art Generally, dictation recording is one of the applications of tape recorders. For example, record the contents of the letter to be sent in advance on the tape with a tape recorder,
Give this recorded tape to a secretary, etc., and play this tape and type a letter while listening to the voice, or record the contents of the meeting and listen to this recording later and discuss There are various uses such as making a record.

【０００３】ところが、タイプ速度は、タイピストの技
量により大きな個人差があり、従って、タイプを行うた
めテープに録音された音の再生を行う時はタイプ速度に
合わせてテープの回転速度を変化させる。このとき、再
生速度をゆっくりしたものでは音声の高さ（トーン）が
下がり、音声の明瞭度が低下する。逆に、再生速度を早
くすると、音声の高さが上がりこれも明瞭度が低下す
る。However, the type speed has a great individual difference depending on the skill of the typist. Therefore, when the sound recorded on the tape is reproduced to perform the type, the rotation speed of the tape is changed according to the type speed. At this time, if the reproduction speed is slowed down, the pitch (tone) of the voice is lowered and the clarity of the voice is lowered. On the contrary, if the reproduction speed is increased, the pitch of the voice is increased and the clarity is also lowered.

【０００４】従来、この問題を解決するために、音声信
号を間引くVariable Speech Control と呼ばれる方式
や、例えば特開平１−２３２４００号公報や特開平１−
２３３８３５号公報に開示されているように、音声の入
力信号のピッチ周期を抽出し、そのピッチ周期に応じて
ピッチ２周期分の音声データに重み窓関数をかけて時間
軸圧縮を行うTime Domain Harmonic Scale (TDHS) 方式
などが利用されてきた。Conventionally, in order to solve this problem, a method called Variable Speech Control for thinning out an audio signal, for example, JP-A-1-232400 or JP-A-1-232400.
As disclosed in Japanese Patent No. 233835, Time Domain Harmonic for extracting a pitch period of a voice input signal and applying a weighting window function to voice data of two pitch periods according to the pitch period to perform time axis compression. Scale (TDHS) method has been used.

【０００５】また、近年、装置の小型化の要求から、例
えば、特開昭６３−２５９７００号公報に開示されてい
るように、記録媒体として、磁気テープの代わりに半導
体メモリを使用する装置が開発されてきている。Further, in recent years, in response to the demand for miniaturization of the apparatus, for example, as disclosed in Japanese Patent Laid-Open No. 63-259700, an apparatus using a semiconductor memory instead of a magnetic tape as a recording medium has been developed. Has been done.

【０００６】前述したようなＴＤＨＳ方式の時間軸圧縮
処理を図８に示す。この例は、周期性のある２ピッチ分
の音声信号を１ピッチに圧縮した場合を示している。ま
ず、２ピッチ分の音声信号Ｓinを取り込み、前部の周期
Ｐ１の音声信号には重み窓関数Ｗ(m) をかけ、後部の周
期Ｐ２には、前部の周期Ｐ１とは反対の重み窓関数１−
Ｗ(m) をかけ、それぞれ加算して１つのＳout として時
間軸を圧縮している。FIG. 8 shows a time axis compression process of the TDHS system as described above. This example shows a case where a sound signal for two pitches having periodicity is compressed to one pitch. First, the audio signal Sin for two pitches is taken in, the weighted window function W (m) is applied to the audio signal of the front period P1, and the weight window opposite to the front period P1 is applied to the rear period P2. Function 1-
The time axis is compressed by multiplying W (m) and adding each to make one Sout.

【０００７】[0007]

【発明が解決しようとする課題】しかし、このようなＴ
ＤＨＳ方式に於いては、有声音部、無音部、無声音部を
一様に時間軸圧縮すると、各部の音声信号が平均化さ
れ、２倍速程度以上になると、聞き取りにくくなる。特
に、無声音部を含む区間は、摩擦部や爆発部などについ
て時間軸圧縮を行うと聞き取りにくくなるという問題点
があった。However, such T
In the DHS system, if the voiced sound portion, the unvoiced sound portion, and the unvoiced sound portion are uniformly time-axis-compressed, the sound signals of the respective portions are averaged, and it becomes difficult to hear when the speed is about double speed or more. In particular, in the section including the unvoiced sound portion, there is a problem that it becomes difficult to hear when the time axis compression is performed on the frictional portion or the explosive portion.

【０００８】本発明は、上記の点に鑑みてなされたもの
で、例えば２倍速程度以上の高速再生時にも音声が聞き
取りにくくなることのない音声記録装置及び音声再生装
置を提供することを目的とする。The present invention has been made in view of the above points, and an object of the present invention is to provide an audio recording apparatus and an audio reproducing apparatus in which the audio is not difficult to hear even at the time of high speed reproduction of about double speed or more. To do.

【０００９】[0009]

【課題を解決するための手段】上記の目的を達成するた
めに、本発明による音声再生装置は、ディジタル音声信
号を再生する音声再生手段と、上記音声再生手段により
逐次再生される音声信号が、有声音に係るものか、無声
音に係るものか、無音に係るものかを判別する判別手段
と、音声信号を通常よりも高速で再生するとき、上記判
別手段により有声音及び無音に係るものであると判別さ
れた音声信号については時間軸を圧縮し、無声音に係る
ものであると判別された音声信号については時間軸を圧
縮しないか上記有声音及び無音に係る音声信号の時間軸
の圧縮率に比べて低い圧縮率で時間軸圧縮を行う時間軸
圧縮手段とを備えている。In order to achieve the above object, an audio reproducing apparatus according to the present invention comprises an audio reproducing means for reproducing a digital audio signal, and an audio signal successively reproduced by the audio reproducing means. Discrimination means for discriminating between voiced sound, unvoiced sound, and silence, and when the voice signal is reproduced at a higher speed than usual, the discrimination means relates to voiced sound and silence. The time axis is compressed for the voice signal determined to be, and the time axis is not compressed for the voice signal determined to be related to unvoiced sound. And a time axis compression means for performing time axis compression at a lower compression rate.

【００１０】また、本発明による音声記録装置は、所定
の記録媒体に音声信号をディジタル化して記録する音声
記録手段と、上記音声記録手段によって逐次記録される
音声信号が、有声音に係るものか、無声音に係るもの
か、無音に係るものかを検出する検出手段と、上記検出
手段により検出された情報を、その音声信号と関連付け
て、上記記録媒体に記録する関連情報記録手段とを備え
ている。Further, in the voice recording apparatus according to the present invention, whether the voice recording means for digitizing and recording the voice signal on a predetermined recording medium and the voice signal successively recorded by the voice recording means relate to voiced sound. A detection means for detecting whether it is unvoiced or unvoiced, and related information recording means for recording the information detected by the detection means in the recording medium in association with the audio signal. There is.

【００１１】さらに、本発明による別の音声再生装置
は、ディジタル音声信号を再生する音声再生手段と、上
記音声再生手段により逐次再生される音声信号が、有声
音に係るものか、無声音に係るものか、無音に係るもの
かを判別する判別手段と、音声信号を通常よりも高速で
再生するとき、上記判別手段により有声音に係るもので
あると判別された音声信号については時間軸を圧縮し、
無声音に係るものであると判別された音声信号について
は所定の条件に従って時間軸の圧縮を行う条件付時間軸
圧縮手段とを備えている。Further, in another audio reproducing apparatus according to the present invention, the audio reproducing means for reproducing a digital audio signal and the audio signals successively reproduced by the audio reproducing means are related to voiced sound or unvoiced sound. Whether or not the voice signal is reproduced at a speed higher than normal when the voice signal is reproduced at a higher speed than usual, the time axis of the voice signal determined to be related to the voiced sound is compressed by the above-mentioned discrimination means. ,
The audio signal determined to be unvoiced is provided with conditional time axis compression means for performing time axis compression according to a predetermined condition.

【００１２】そして、本発明による更に別の音声再生装
置は、ディジタル音声信号を再生する音声再生手段と、
上記音声再生手段により逐次再生される音声信号が、有
声音に係るものか、無声音に係るものか、無音に係るも
のかを判別する判別手段と、音声信号を通常よりも高速
で再生するとき、上記判別手段により有声音に係るもの
であると判別された音声信号については時間軸を圧縮
し、無音に係るものであると判別された音声信号につい
てはデータの間引き処理を行うデータ処理手段とを備え
ている。Further, another audio reproducing apparatus according to the present invention comprises an audio reproducing means for reproducing a digital audio signal,
When the audio signal sequentially reproduced by the audio reproducing means is related to voiced sound, unvoiced sound, or unvoiced sound, and a sound signal is reproduced at a higher speed than usual, A data processing unit that compresses the time axis for a voice signal that is determined to be related to voiced sound by the determination unit and performs a data thinning process for a voice signal that is determined to be related to silence. I have it.

【００１３】[0013]

【作用】本発明による音声再生装置では、判別手段が、
音声再生手段により逐次再生されるディジタル音声信号
が、有声音に係るものか、無声音に係るものか、無音に
係るものかを判別する。そして、音声信号を通常よりも
高速で再生するときには、時間軸圧縮手段が、この判別
手段により有声音及び無音に係るものであると判別され
た音声信号については時間軸を圧縮し、無声音に係るも
のであると判別された音声信号については時間軸を圧縮
しないか、上記有声音及び無音に係る音声信号の時間軸
の圧縮率に比べて低い圧縮率で時間軸圧縮を行う。In the voice reproducing apparatus according to the present invention, the discriminating means is
It is determined whether the digital audio signal sequentially reproduced by the audio reproducing means is related to voiced sound, unvoiced sound, or unvoiced sound. When the audio signal is reproduced at a higher speed than usual, the time axis compression unit compresses the time axis of the audio signal that is discriminated by the discriminating unit as to the voiced sound and the silent voice, and The time axis is not compressed for the audio signal that is determined to be the one, or the time axis compression is performed at a compression rate lower than the time axis compression rate of the voice signal related to voiced sound and silence.

【００１４】また、本発明による音声記録装置では、所
定の記録媒体に音声信号をディジタル化して記録する音
声記録手段が音声信号を逐次記録するとき、検出手段
は、その音声信号が、有声音に係るものか、無声音に係
るものか、無音に係るものかを検出する。そして、関連
情報記録手段は、この検出手段により検出された情報
を、その音声信号と関連付けて、上記記録媒体に記録す
る。Further, in the audio recording apparatus according to the present invention, when the audio recording means for digitizing and recording the audio signal on the predetermined recording medium successively records the audio signal, the detecting means makes the audio signal into a voiced sound. It is detected whether it is related, unvoiced sound, or silent sound. Then, the related information recording means records the information detected by the detecting means in the recording medium in association with the audio signal.

【００１５】さらに、本発明による別の音声再生装置で
は、判別手段が、音声再生手段により逐次再生されるデ
ィジタル音声信号が、有声音に係るものか、無声音に係
るものか、無音に係るものかを判別する。そして、音声
信号を通常よりも高速で再生するときには、条件付時間
軸圧縮手段が、この判別手段により有声音に係るもので
あると判別された音声信号については時間軸を圧縮し、
無声音に係るものであると判別された音声信号について
は所定の条件に従って時間軸の圧縮を行う。Further, in another audio reproducing apparatus according to the present invention, the discriminating means determines whether the digital audio signals successively reproduced by the audio reproducing means are voiced sound, unvoiced sound, or unvoiced sound. To determine. When the audio signal is reproduced at a higher speed than usual, the conditional time axis compression means compresses the time axis of the audio signal determined to be related to voiced sound by the determination means,
A voice signal determined to be unvoiced is compressed on the time axis according to a predetermined condition.

【００１６】そして、本発明による更に別の音声再生装
置では、判別手段が、音声再生手段により逐次再生され
るディジタル音声信号が、有声音に係るものか、無声音
に係るものか、無音に係るものかを判別する。そして、
音声信号を通常よりも高速で再生するときには、データ
処理手段が、この判別手段により有声音に係るものであ
ると判別された音声信号については時間軸を圧縮し、無
音に係るものであると判別された音声信号についてはデ
ータの間引き処理を行う。In still another audio reproducing apparatus according to the present invention, the discriminating means determines whether the digital audio signals sequentially reproduced by the audio reproducing means are voiced sound, unvoiced sound, or unvoiced sound. To determine if. And
When the audio signal is reproduced at a higher speed than usual, the data processing unit determines that the audio signal, which is determined to be related to voiced sound by the determining unit, is compressed in the time axis and is related to silence. Data thinning processing is performed on the generated audio signal.

【００１７】[0017]

【実施例】以下、図面を参照して、本発明の一実施例を
説明する。図１は、本発明の一実施例としての音声記録
再生装置のブロック構成図である。この音声録音再生装
置では、マイクロホン１より得られるアナログ信号を、
増幅器（ＡＭＰ）２により増幅し、低域通過フィルタ
（ＬＰＦ）３を通した後、アナログ／ディジタル（Ａ／
Ｄ）変換器４によってディジタル信号に変換して、判別
手段，時間軸圧縮手段，検出手段，条件付時間軸圧縮手
段，及びデータ処理手段の構成要素であるディジタル信
号処理（ＤＳＰ）部５に入力する。このＤＳＰ部５は、
録音動作時に音声を圧縮し、また再生動作時に音声を伸
長する。該ＤＳＰ部５の動作は制御回路６により制御さ
れ、圧縮した音声をデータ入出力（Ｉ／Ｏ）バッファ７
を介して主制御回路８に送る。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS An embodiment of the present invention will be described below with reference to the drawings. FIG. 1 is a block diagram of an audio recording / reproducing apparatus as an embodiment of the present invention. In this voice recording / reproducing apparatus, the analog signal obtained from the microphone 1 is
The signal is amplified by an amplifier (AMP) 2 and passed through a low pass filter (LPF) 3 and then analog / digital (A /
D) Converted into a digital signal by the converter 4, and input to the digital signal processing (DSP) unit 5 which is a constituent element of the discriminating means, the time base compression means, the detection means, the conditional time base compression means, and the data processing means. To do. This DSP unit 5
Audio is compressed during recording operation and expanded during playback operation. The operation of the DSP unit 5 is controlled by the control circuit 6, and the compressed audio is input / output (I / O) buffer 7
To the main control circuit 8 via.

【００１８】音声再生手段，音声記録手段，及び関連情
報記録手段の構成要素である主制御回路８は、複数の操
作ボタン及びスイッチの操作に応じて、上記ＤＳＰ部５
と、アドレス制御回路９及び当該録音再生装置に着脱自
在な記録媒体としての半導体メモリ部１０の動作を制御
する。即ち、アドレス制御回路９に適当なアドレス信号
を与え、データＩ／Ｏバッファ７から供給された音声デ
ータをメモリ部１０に記録、あるいは、メモリ部１０に
記録されているデータを読出して上記データＩ／Ｏバッ
ファ７を介してＤＳＰ部５に供給する。The main control circuit 8, which is a constituent element of the audio reproducing means, the audio recording means, and the related information recording means, responds to the operation of a plurality of operation buttons and switches by the DSP section 5 described above.
Then, it controls the operation of the address control circuit 9 and the semiconductor memory unit 10 as a recording medium detachably attached to the recording / reproducing apparatus. That is, an appropriate address signal is given to the address control circuit 9, the voice data supplied from the data I / O buffer 7 is recorded in the memory section 10, or the data recorded in the memory section 10 is read to read the data I It is supplied to the DSP unit 5 via the / O buffer 7.

【００１９】なお、ここで音声情報の記録位置を示す情
報であるアドレスは、着脱自在な半導体メモリ部１０に
記憶させても良く、記録再生装置側に設けられているア
ドレス制御回路に付随する不図示半導体メモリ（内部記
憶部）に記憶させるようにしても良い。The address, which is the information indicating the recording position of the audio information, may be stored in the removable semiconductor memory unit 10, and is not associated with the address control circuit provided on the recording / reproducing apparatus side. It may be stored in the illustrated semiconductor memory (internal storage unit).

【００２０】上記ＤＳＰ部５で伸長された読み出しデー
タは、ディジタル／アナログ（Ｄ／Ａ）変換器１１によ
りアナログ信号に変換され、増幅器（ＡＭＰ）１２で増
幅された後、スピーカ１３に出力される。The read data expanded by the DSP unit 5 is converted into an analog signal by the digital / analog (D / A) converter 11, amplified by the amplifier (AMP) 12, and then output to the speaker 13. .

【００２１】また、上記主制御回路８は、駆動回路１４
を制御して表示器１５に、動作モードなどの各種情報を
表示させる。上記メモリ部１０は、本実施例では、図２
に示すような記録構成を有している。即ち、メモリ空間
は、インデックス部１０Ａと音声データ部１０Ｂとに大
きく二分されている。インデックス部１０Ａは、音声デ
ータ部１０Ｂに記録される複数の音声メッセージファイ
ル１０Ｂ１，１０Ｂ２，１０Ｂ３，…それぞれについ
て、操作開始位置情報１０Ａ１と操作終了位置情報１０
Ａ２、その他符号モードや操作条件等が記録される。ま
た、現在の音声データ部１０Ｂに対する動作位置を示す
動作位置情報１０Ａ３が記録される。The main control circuit 8 includes a drive circuit 14
Is controlled to display various information such as the operation mode on the display unit 15. The memory unit 10 in FIG.
It has a recording structure as shown in. That is, the memory space is roughly divided into the index section 10A and the audio data section 10B. The index unit 10A includes the operation start position information 10A1 and the operation end position information 10 for each of the plurality of voice message files 10B1, 10B2, 10B3, ... Recorded in the voice data unit 10B.
A2, other code modes, operating conditions, etc. are recorded. In addition, operation position information 10A3 indicating the current operation position with respect to the audio data portion 10B is recorded.

【００２２】なお、上記主制御回路８に接続されるボタ
ンとしては、録音ボタンＲＥＣ，再生ボタンＰＬ，停止
ボタンＳＴ，早送りボタンＦＦ，戻しボタンＲＥＷ，Ｉ
マークボタンＩ，ＥマークボタンＥ，音声起動（ボイス
アクティブディテクタ）ボタンＶＡＤがあり、スイッチ
としては電池ＢＡＴとの間の主電源スイッチ１６があ
る。ここで、ＩマークやＥマークとは、次のようなもの
である。即ち、記録媒体には複数の文章が記録されるこ
とから、この種の音声記録再生装置では、文章録音者に
より録音時に、ＩマークボタンＩを操作することによ
り、記録媒体に記録された複数文章間の優先関係を示す
インストラクション（Ｉ）マークというタイピストや秘
書向けの指示用インデックスマークを記録することがで
きるようになっており、文章録音者は、このＩマークを
使って、音声によって具体的に優先関係を指示するとい
うことが可能になっている。また、複数文書間の区切り
を示すために、ＥマークボタンＥの操作により、エンド
（Ｅ）マークというインデックスマークを記録すること
ができるようになっている。The buttons connected to the main control circuit 8 are a record button REC, a play button PL, a stop button ST, a fast forward button FF, and a return button REW, I.
There are mark buttons I and E, a mark button E, a voice activation (voice active detector) button VAD, and a main power switch 16 between the battery BAT and the switch. Here, the I mark and the E mark are as follows. That is, since a plurality of sentences are recorded on the recording medium, in this type of audio recording / reproducing apparatus, a plurality of sentences recorded on the recording medium can be operated by operating the I mark button I at the time of recording by the sentence recorder. It is possible to record an instruction (I) mark, which indicates the priority relationship between the typists and an index mark for instruction for a secretary, and the sentence recorder can use the I mark to concretely record by voice. It is possible to indicate a priority relationship. Further, an index mark called an end (E) mark can be recorded by operating the E mark button E in order to indicate a division between a plurality of documents.

【００２３】次に、このような構成の音声録音再生装置
の動作を詳細に説明する。電池ＢＡＴがセットされて電
源が主制御回路８に供給されると、主制御回路８はそれ
を電圧検出により検出して、図３のフローチャートに示
すような動作を開始する。Next, the operation of the voice recording / reproducing apparatus having such a configuration will be described in detail. When the battery BAT is set and power is supplied to the main control circuit 8, the main control circuit 8 detects it by voltage detection and starts the operation as shown in the flowchart of FIG.

【００２４】即ち、まず、主制御回路８の外部条件や内
部の記憶部の初期設定を行う（ステップＳ１）。ただし
この時点では、当該録音再生装置の全体への電力供給を
指示するための主電源スイッチ１６はＯＦＦ状態にあ
る。初期設定を完了した後、主制御回路８は、主電源ス
イッチ１６がＯＮされたかどうか検出をする（ステップ
Ｓ２）。That is, first, the external conditions of the main control circuit 8 and the internal storage unit are initialized (step S1). However, at this point, the main power switch 16 for instructing the power supply to the entire recording / playback apparatus is in the OFF state. After completing the initial setting, the main control circuit 8 detects whether or not the main power switch 16 is turned on (step S2).

【００２５】検出の結果、主電源スイッチ１６がＯＮ状
態にあることを検出したならば、当該記録再生装置全体
に電力を供給するための電池ＢＡＴと各回路との間に設
けられた不図示スイッチをＯＮにし、その後、記録媒体
（メモリ部）１０より、インデックス部１０Ａの情報を
読み込む（ステップ３）。即ち、操作開始位置情報１０
Ａ１、操作終了位置情報１０Ａ２、その他符号モードや
操作条件等を読み込む。When it is detected that the main power switch 16 is in the ON state as a result of the detection, a switch (not shown) provided between the battery BAT and each circuit for supplying electric power to the entire recording / reproducing apparatus. Is turned on, and then the information of the index section 10A is read from the recording medium (memory section) 10 (step 3). That is, the operation start position information 10
A1, operation end position information 10A2, other code modes, operation conditions, etc. are read.

【００２６】この時、メモリ部１０から読み込んだデー
タによって、メモリ部１０が既にインデックスを正常に
記録したものかどうか、即ちメモリ部１０のフォーマッ
トが正常かどうかを判断する（ステップＳ４）。At this time, it is determined from the data read from the memory unit 10 whether the memory unit 10 has already recorded the index normally, that is, whether the format of the memory unit 10 is normal (step S4).

【００２７】ここで、メモリ部１０としてフォーマット
されていないものを入れていた時には、正常ではないと
判断され、その場合には、メモリ部１０のインデックス
部１０Ａに利用条件を示す情報を入力し且つ音声データ
部１０Ｂに“０”を入力する処理であるメモリフォーマ
ット（初期化）を行うかどうかを確認する（ステップＳ
５）。即ち、駆動回路１４を制御して、メモリフォーマ
ットを行うか否かの確認表示を表示器１５に行わせる。Here, when an unformatted memory unit 10 is inserted, it is determined that the memory unit 10 is not normal, and in that case, information indicating the usage condition is input to the index unit 10A of the memory unit 10. It is confirmed whether or not the memory format (initialization), which is the process of inputting "0" to the voice data section 10B, is performed (step S).
5). That is, the drive circuit 14 is controlled to cause the display 15 to display a confirmation display as to whether or not to perform the memory format.

【００２８】ここで、メモリフォーマット処理を確認指
示するボタン（録音ボタンＲＥＣ兼用）が押されたなら
ば、メモリ部１０のフォーマット（初期化）を行い（ス
テップＳ６）、このフォーマット完了後、駆動回路１４
を制御して表示器１５にて初期設定完了表示を行う（ス
テップＳ７）。If the button for confirming the memory format process (also used as the record button REC) is pressed, the memory section 10 is formatted (initialized) (step S6), and after this format is completed, the drive circuit is completed. 14
Is controlled to display the completion of initial setting on the display 15 (step S7).

【００２９】また、メモリフォーマットをしないことを
確認指示するボタン（停止ボタンＳＴ兼用）が押された
ときには、駆動回路１４を制御して表示器１５にてメモ
リ部１０が正常でないことを表示すると共に、メモリ部
１０を取り替えるべきである旨の指示表示を行い、当該
録音再生装置全体に電力を供給するための電池ＢＡＴと
各回路との間に設けられ不図示スイッチをＯＦＦする
（ステップＳ８）。その後、メモリ部１０交換のため
に、主電源スイッチ１６がＯＦＦされるのを待ち（ステ
ップＳ９）、それがＯＦＦされたことを検出すると、上
記ステップＳ２に戻る。When a button (also used as the stop button ST) for confirming that the memory is not formatted is pressed, the drive circuit 14 is controlled to display on the display 15 that the memory section 10 is not normal. An instruction display indicating that the memory unit 10 should be replaced is displayed, and a switch (not shown) provided between the battery BAT for supplying power to the entire recording / reproducing apparatus and each circuit is turned off (step S8). Thereafter, in order to replace the memory unit 10, the main power switch 16 is waited for being turned off (step S9), and when it is detected that it is turned off, the process returns to step S2.

【００３０】一方、メモリ部１０が正常に初期設定が完
了されたものは、初期設定完了表示後、インデックス部
１０から読出した情報（動作位置情報１０Ａ３）より現
在の動作位置を検出し、駆動回路１４を制御して表示器
１５にてその検出した位置の表示を行う（ステップＳ１
０）。その後、当該装置の操作ボタンのどれが押された
かどうかを検出しながら各回路を待ち状態にさせる（ス
テップＳ１１）。On the other hand, in the case where the initial setting of the memory unit 10 is normally completed, the current operating position is detected from the information (operating position information 10A3) read from the index unit 10 after the initial setting completion display, and the drive circuit 14 is controlled to display the detected position on the display 15 (step S1).
0). After that, each circuit is placed in a waiting state while detecting which of the operation buttons of the device has been pressed (step S11).

【００３１】そして、いずれかの操作ボタンが押された
ことを検出すると、まず、操作されたのが録音ボタンＲ
ＥＣかどうか検出し（ステップＳ１２）、もし録音ボタ
ンＲＥＣが押されれば、ＤＳＰ部５を制御してＡ／Ｄ変
換器４から入力される音声情報を圧縮し、アドレス制御
回路９を制御してメモリ部１０の音声データ部１０Ｂに
記録を行う録音処理に入る（ステップＳ１３）。When it is detected that any one of the operation buttons has been pressed, first, the operated button is the record button R.
If it is EC (step S12), and if the record button REC is pressed, the DSP unit 5 is controlled to compress the voice information input from the A / D converter 4, and the address control circuit 9 is controlled. Then, a recording process for recording in the voice data section 10B of the memory section 10 is started (step S13).

【００３２】操作されたのが録音ボタンＲＥＣでない時
には、次に、再生ボタンＰＬの検出を行う（ステップＳ
１４）。ここでもし再生ボタンＰＬが押されていれば、
アドレス制御回路９を制御してメモリ部１０の音声デー
タ部１０Ｂから記録されているデータを読み出し、ＤＳ
Ｐ部５に送って伸長処理を行い、Ｄ／Ａ変換器１１に音
声情報を送る再生処理に入る（ステップＳ１５）。When the operated button is not the record button REC, the play button PL is detected (step S).
14). If the play button PL is pressed here,
The address control circuit 9 is controlled to read the recorded data from the voice data section 10B of the memory section 10,
The data is sent to the P section 5 for decompression processing, and the reproduction processing for sending audio information to the D / A converter 11 is started (step S15).

【００３３】また、再生ボタンＰＬが押されていない時
は、早送りボタンＦＦが押されているかどうか、ボタン
の状態を検出する（ステップＳ１６）。もし早送りボタ
ンＦＦが押されていれば、動作位置を順次適当な速度
（例えば、再生時の２０倍）で早送りを行う早送り処理
に入る（ステップＳ１７）。When the play button PL is not pressed, whether or not the fast-forward button FF is pressed is detected (step S16). If the fast-forward button FF has been pressed, the fast-forward process of sequentially fast-moving the operating position at an appropriate speed (for example, 20 times that at the time of reproduction) is started (step S17).

【００３４】早送りボタンＦＦが押されていなければ、
戻しボタンＲＥＷが押されているか釦の状態検出をする
（ステップＳ１８）。もし戻しボタンＲＥＷが押されて
いれば、上記早送りの場合とは逆の方向に同様の速度で
動作位置の移動を行う戻し処理に入る（ステップＳ１
９）。If the fast-forward button FF is not pressed,
Whether the return button REW is pressed or not is detected (step S18). If the return button REW is pressed, a return process for moving the operation position at the same speed in the opposite direction to the case of the fast-forwarding is started (step S1).
9).

【００３５】上記ステップＳ１３，Ｓ１５，Ｓ１７，Ｓ
１９の各処理は、停止ボタンＳＴが押されると、各処理
から抜けて上記ステップＳ１１に戻る。また、操作され
たのが録音，再生，早送り，戻し等のボタンでなけれ
ば、電源ＯＦＦ又は各種の設定ボタンの状態の検出を行
う（ステップＳ２０）。主電源スイッチ１６が電源ＯＦ
Ｆ操作された時には、アドレス制御回路９を制御して、
メモリ部１０のインデックス部１０Ａ内の情報を消去
し、主制御部８内部の不図示記憶部に記憶してあるイン
デックス情報を、メモリ部１０のインデックス部１０Ａ
に記憶する（ステップＳ２１）。このインデックス転送
処理が完了すると、当該装置全体、つまり各回路に給電
のための不図示電源スイッチをＯＦＦにする（ステップ
Ｓ２２）。そして、上記ステップＳ２に戻る。Steps S13, S15, S17, S
When the stop button ST is pressed, each process of 19 exits each process and returns to step S11. If the operated button is not a button for recording, reproducing, fast-forwarding, returning, etc., the power is turned off or the state of various setting buttons is detected (step S20). Main power switch 16 is power supply OF
When the F operation is performed, the address control circuit 9 is controlled to
The information in the index unit 10A of the memory unit 10 is erased, and the index information stored in the unillustrated storage unit inside the main control unit 8 is replaced with the index unit 10A of the memory unit 10.
(Step S21). When this index transfer process is completed, the power switch (not shown) for supplying power to the entire apparatus, that is, each circuit is turned off (step S22). Then, the process returns to step S2.

【００３６】また、上記ステップＳ２０において、主電
源スイッチ１６がＯＦＦでないと判断された時には、設
定ボタンの状態を検出し、その状態を内部の記憶部に記
憶した後、上記ステップＳ１１に戻る。なおここで、設
定ボタンは、実際に当該装置に設けられたボタンではな
く、録音ボタンＲＥＣ，再生ボタンＰＬ，停止ボタンＳ
Ｔ，早送りボタンＦＦ，戻しボタンＲＥＷ，Ｉマークボ
タンＩ，ＥマークボタンＥ，音声起動ボタンＶＡＤの内
の幾つかの同時押しにより代用されるボタンである。When it is determined in step S20 that the main power switch 16 is not OFF, the state of the setting button is detected, the state is stored in the internal storage section, and the process returns to step S11. Here, the setting button is not the button actually provided in the device, but the record button REC, the play button PL, and the stop button S.
It is a button that is substituted by simultaneous pressing of some of T, fast-forward button FF, return button REW, I-mark button I, E-mark button E, and voice activation button VAD.

【００３７】次に、上記ステップＳ１３での録音処理に
ついて、図４のフローチャートを参照して、さらに詳細
に説明する。録音ボタンＲＥＣが押されたことを検出す
るとこの録音処理に処理が移り、まず、その時の音声録
音条件（例えば、音声起動、又は無音圧縮やバリアブル
レートタイプ利用等）を検出する（ステップＳ３１）。
この検出された条件により、音声録音の条件モードをＤ
ＳＰ部５へ送る（ステップＳ３２）。そして、内部記憶
部に記憶しているインデックス情報（動作位置情報）よ
り、メモリ部１０の音声データ部１０Ｂにおける録音ス
タート位置を求め、そのスタート位置を示す情報をイン
デックス部１０Ａに操作開始位置情報１０Ａ１として書
き込む（ステップＳ３３）。ここで、ＤＳＰ部５より録
音データ転送を行い（ステップＳ３４）、ＤＳＰ部５
は、音声信号を符号化し、（有声音、無声音と無音の判
定符号を含む）符号化データを出力する。Next, the recording process in step S13 will be described in more detail with reference to the flowchart of FIG. When it is detected that the record button REC is pressed, the process shifts to this recording process, and first, a voice recording condition at that time (for example, voice activation, silence compression, use of variable rate type, etc.) is detected (step S31).
Depending on the detected condition, the voice recording condition mode is set to D.
Send to the SP unit 5 (step S32). Then, the recording start position in the voice data section 10B of the memory section 10 is obtained from the index information (motion position information) stored in the internal storage section, and information indicating the start position is stored in the index section 10A as the operation start position information 10A1. (Step S33). Here, the recording data is transferred from the DSP unit 5 (step S34), and the DSP unit 5
Encodes a voice signal and outputs encoded data (including voiced sound, unvoiced sound, and unvoiced determination code).

【００３８】主制御回路８は、この符号化データで無音
圧縮モードの処理を行うかを判定する（ステップＳ３
５）。例えば、無音圧縮モードが設定され、ＤＳＰ部５
から送られてきたデータの内に、無音を指すデータが含
まれていると、ステップＳ３９へジャンプする。また、
データの内に有声音又は無声音のデータが含まれてい
て、つまり無音でなければ、表示器１５に設けられた不
図示ＬＥＤを点灯し、音声が入力されたことを表示す
る。The main control circuit 8 determines whether or not the encoded data is to be processed in the silent compression mode (step S3).
5). For example, the silent compression mode is set, and the DSP unit 5
If the data sent from the device includes data indicating silence, the process jumps to step S39. Also,
If the data includes voiced or unvoiced sound data, that is, if there is no sound, an LED (not shown) provided in the display unit 15 is turned on to display that voice is input.

【００３９】そして、次に、データ転送された圧縮デー
タを書き込むべきアドレスを、内部記憶部に記憶してい
る動作位置情報より算出し、アドレス制御回路９へ出力
する（ステップＳ３６）。これと同時に、ＤＳＰ部５よ
りデータ転送された圧縮データがメモリ部１０に送られ
（ステップＳ３７）、上記アドレス制御回路９の制御に
より音声データ部１０Ｂに記録される。次に、内部記憶
部に記憶している動作位置情報を更新し、その更新した
値に、インデックス部１０Ａの操作終了位置情報１０Ａ
２及び動作位置情報１０Ａ３を更新する（ステップＳ３
８）。Then, the address at which the data-transferred compressed data should be written is calculated from the operating position information stored in the internal storage unit and output to the address control circuit 9 (step S36). At the same time, the compressed data transferred from the DSP section 5 is sent to the memory section 10 (step S37) and recorded in the audio data section 10B under the control of the address control circuit 9. Next, the operating position information stored in the internal storage unit is updated, and the updated value is set to the updated end position information 10A of the index unit 10A.
2 and the operating position information 10A3 are updated (step S3
8).

【００４０】そして、停止ボタンＳＴが押されているか
検出し（ステップＳ３９）、押されていなければ、上記
ステップＳ３４へジャンプする。また、停止ボタンＳＴ
が押されていれば、終了位置を確定して、この録音処理
から抜け出る。Then, it is detected whether the stop button ST is pressed (step S39), and if not pressed, the process jumps to step S34. Also, the stop button ST
If is pressed, the end position is determined and the recording process is exited.

【００４１】次に、このような録音処理時に於ける有声
音，無声音，無音の検出処理について説明する。録音処
理の時、ＤＳＰ部５の内部では、音声データを符号化す
るために、ＣＥＬＰ（Code Excited LPC）符号化（分析
合成形符号化）方式を利用する。このＣＥＬＰ方式は、
ＬＰＣ（Linear Prediction Coefficients）合成フィル
タの音源信号を、種々の波形パターンから成るコードブ
ックを用いて極めて高率的にベクトル量子化をする方式
である。この方式により抽出された予測された波形パタ
ーンと所定区間内の音声信号との差を残差信号として、
この残差信号と所定区間内音声信号の相互相関を取り、
これを音声信号の自己相関で割った場合に、０．８１以
下の時は有声音で、０．８１を越える時には無声音か無
音とする。無音と無声音は、音声信号の自己相関のレベ
ルによって判定を行う。つまり、残差信号は、本来、乱
数発生する信号（ホワイト雑音）に近くなり、この残差
信号と相関があるとすればホワイト雑音に近いことを意
味するため、これによって有声音か無音又は無声音かを
判定することができる。自己相関は、音声のエネルギー
波形を表すことができ、音声波形とエネルギー波形の関
係は図５の（Ａ）に示すようになる。この内、音声エネ
ルギーで上記のような方法によりノイズレベルと判定区
別を行った時、人間の言語パターン（音声パターン）に
は、ノイズに近い無声音が含まれる場合がある。従っ
て、この無声音部を除外しないように、有声音の前後を
無声音区間（ｔ1 ，ｔ2 ，ｔ3 ）とする。Next, a voiced sound, an unvoiced sound, and a silent sound detection process in such a recording process will be described. During the recording process, the DSP unit 5 uses a CELP (Code Excited LPC) coding (analysis-synthesis type coding) system to code audio data. This CELP method
This is a method for extremely highly efficiently vector quantizing a sound source signal of an LPC (Linear Prediction Coefficients) synthesis filter using a codebook composed of various waveform patterns. As the residual signal, the difference between the predicted waveform pattern extracted by this method and the audio signal in the predetermined section,
The cross-correlation between this residual signal and the voice signal within the predetermined section is taken,
When this is divided by the autocorrelation of the audio signal, it is voiced when it is 0.81 or less, and it is unvoiced when it exceeds 0.81. Silence and unvoiced sound are determined by the level of autocorrelation of the voice signal. In other words, the residual signal is originally close to a signal (white noise) generated by random numbers, and if it has a correlation with this residual signal, it means that it is close to white noise. Can be determined. The autocorrelation can represent the energy waveform of the voice, and the relationship between the voice waveform and the energy waveform is as shown in FIG. Among them, when the noise energy and the noise level are discriminated by the above-mentioned method by the voice energy, the human language pattern (voice pattern) may include unvoiced sound close to noise. Therefore, in order not to exclude this unvoiced sound portion, unvoiced sound sections (t1, t2, t3) are set before and after the voiced sound.

【００４２】ＤＳＰ部５は、図６に示すように、このよ
うな有声音，無声音，無音の判定を、符号化処理と同時
に行う。即ち、まず、音声データをＡ／Ｄ変換回路４よ
り入力し、フレーム（２０ｍｓ間にサンプルされたデー
タを１フレームとする）処理を行う（ステップＳ４
１）。次に、このサンプルされたデータにプリエンファ
シスや、ハミング窓掛け処理を行う（ステップＳ４
２）。そして、前述したような分析合成形符号化処理を
行う（ステップＳ４３）。この処理により、（現）フレ
ームの音声のエネルギー（自己相関）や、残差波形との
相互相関が求められる（ステップＳ４４）。この時、前
述の方法により、音声エネルギーより有声音が判定され
（ステップＳ４５）、もし有声音と判定されると、その
前フレーム又は所定の複数の前フレームまでさかのぼり
無音になっているかを判定する（ステップＳ４６）。も
し、その前フレーム又は所定複数前フレームが無音であ
るならば、前フレーム又は所定複数前フレームから現フ
レームまでを無声音フレームとする（ステップＳ４
７）。そして、現フレームに有声音であることを示す符
号（有声音フラグ）を付加した後（ステップＳ４８）、
この処理から抜け出る。As shown in FIG. 6, the DSP unit 5 determines such voiced sound, unvoiced sound, and silent sound simultaneously with the encoding process. That is, first, audio data is input from the A / D conversion circuit 4 and a frame (data sampled during 20 ms is defined as one frame) processing is performed (step S4).
1). Next, pre-emphasis and a Hamming windowing process are performed on the sampled data (step S4).
2). Then, the analysis-synthesis type encoding process as described above is performed (step S43). By this processing, the energy (autocorrelation) of the voice of the (current) frame and the cross-correlation with the residual waveform are obtained (step S44). At this time, the voiced sound is determined from the voice energy by the above-described method (step S45). If it is determined that the voiced sound is present, it is determined whether or not there is silence going back to the preceding frame or a plurality of predetermined preceding frames. (Step S46). If the previous frame or the predetermined plurality of previous frames is silent, the previous frame or the predetermined plurality of previous frames to the current frame are unvoiced frames (step S4).
7). After adding a code (voiced sound flag) indicating voiced sound to the current frame (step S48),
Get out of this process.

【００４３】また、上記ステップＳ４６に於いて、前フ
レーム又は所定複数前フレームが無音でない場合は、上
記ステップＳ４８へジャンプする。一方、上記ステップ
Ｓ４５に於いて、現フレームが有声音でないと判断され
た場合、前フレームが無音か否かを判断する（ステップ
Ｓ４９）。ここで、前フレームが無音でないと判断され
ると、前フレームが有声音かどうかを判断する（ステッ
プＳ５０）。前フレームが有声音であると判断された場
合には、フレーム数をカウントする内部カウンタｎに
“５”を設定する（ステップＳ５１）。そして、無声音
であることを示す符号（無声音フラグ）を音声符号化デ
ータに追加し（ステップＳ５２）、この処理から抜け出
る。If it is determined in step S46 that the previous frame or the predetermined plurality of previous frames are not silent, the process jumps to step S48. On the other hand, when it is determined in step S45 that the current frame is not voiced sound, it is determined whether or not the previous frame is silent (step S49). If it is determined that the previous frame is not silent, it is determined whether the previous frame is voiced sound (step S50). When it is determined that the previous frame is a voiced sound, "5" is set to the internal counter n that counts the number of frames (step S51). Then, a code indicating unvoiced sound (unvoiced sound flag) is added to the voice coded data (step S52), and this processing is exited.

【００４４】また、上記ステップＳ６０に於いて、前フ
レームが有声音でないと判定されると、上記フレーム数
カウンタｎの値から“１”マイナスして、カウントダウ
ンする（ステップＳ５３）。そして、このカウンタｎの
値が零よりも小さいかどうか判断して（ステップＳ５
４）、小さければ無音を示す符号（無音フラグ）を音声
符号化データに追加した後（ステップＳ５５）、この処
理から抜け出る。If it is determined in step S60 that the preceding frame is not a voiced sound, the value of the frame number counter n is decremented by "1" to count down (step S53). Then, it is judged whether or not the value of the counter n is smaller than zero (step S5).
4) If it is smaller, a code indicating silence (silence flag) is added to the voice coded data (step S55), and then the process is exited.

【００４５】一方、上記ステップＳ４９に於いて、前フ
レームが無音であると判定された場合には、現フレーム
は無音であることを示す符号を音声符号化データに追加
して（ステップＳ５６）、この処理から抜け出る。On the other hand, if it is determined in step S49 that the previous frame is silent, a code indicating that the current frame is silent is added to the voice coded data (step S56). Get out of this process.

【００４６】以上のようにして、有声音と無声音が区別
され、それを示す符号が音声符号化データに付加され
る。次に、図７のフローチャートを参照して、上記ステ
ップＳ１５に於ける再生処理を詳細に説明する。As described above, the voiced sound and the unvoiced sound are distinguished from each other, and the code indicating them is added to the voice coded data. Next, the reproduction process in step S15 will be described in detail with reference to the flowchart in FIG.

【００４７】再生ボタンＰＬが押されていることを検出
するとこの再生処理に処理が移り、主制御回路８は、ま
ず、その時の音声再生の条件（無音圧縮、スピード再
生、ノイズ除去等）を検出すると共に、読み出しブロッ
ク数を計数するための内部カウンタをリセットする（ス
テップＳ６１）。この検出された条件により、音声再生
の条件モードをＤＳＰ部５へ送る（ステップＳ６２）。
そして、メモリ部１０の音声データ部１０Ｂの読み出し
位置を、インデックス情報部１０Ａの動作位置情報より
得て、駆動回路１４を制御してその位置を表示部１５に
表示する（ステップＳ６３）。そして、メモリ部１０の
音声データ部１０Ｂから音声メッセージファイル読み込
みを行うため、内部記憶部に記憶している動作開始位置
情報より算出したアドレスをアドレス制御回路９に出力
する（ステップＳ６４）。これにより、メモリ部１０の
音声データ部１０Ｂより１ブロックのデータ（例えば、
音声を２０ｍｓのブロックに分けたデータ）が主制御回
路８に読み込まれる（ステップＳ６５）。When it is detected that the reproduction button PL has been pressed, the processing shifts to this reproduction processing, and the main control circuit 8 first detects the condition of voice reproduction at that time (silent compression, speed reproduction, noise removal, etc.). At the same time, the internal counter for counting the number of read blocks is reset (step S61). According to the detected condition, the condition mode of voice reproduction is sent to the DSP unit 5 (step S62).
Then, the read position of the audio data section 10B of the memory section 10 is obtained from the operation position information of the index information section 10A, and the drive circuit 14 is controlled to display the position on the display section 15 (step S63). Then, in order to read the voice message file from the voice data section 10B of the memory section 10, the address calculated from the operation start position information stored in the internal storage section is output to the address control circuit 9 (step S64). As a result, one block of data (for example, from the audio data unit 10B of the memory unit 10)
Data obtained by dividing the voice into blocks of 20 ms) is read by the main control circuit 8 (step S65).

【００４８】ここで、早聞き処理を行うかどうか、音声
起動ボタンＶＡＤの状態により設定されるモードを検出
して判断を行う（ステップＳ６６）。早聞きを行う場合
には、さらにもう１ブロック分のデータをメモリ部１０
から主制御回路８に読み込む（ステップＳ６７）。そし
て、この２つのブロックデータの中に無声音フラグが有
るか判断し（ステップＳ６８）、もし無声音フラグが有
ればＤＳＰ部５へデータ転送処理を行う（ステップＳ６
９）。また、無声音フラグが無ければ、時間軸圧縮を行
う命令をＤＳＰ部５へ出力して（ステップＳ７０）、Ｄ
ＳＰ部５へデータ転送を行う（ステップＳ６９）。この
時の時間軸圧縮は、前述したようなＴＤＨＳ方式を利用
する。Here, it is determined whether or not the fast-listening process is performed by detecting the mode set by the state of the voice activation button VAD (step S66). When performing fast listening, another block of data is added to the memory unit 10.
Is read from the main control circuit 8 (step S67). Then, it is determined whether or not there is an unvoiced sound flag in these two block data (step S68), and if there is an unvoiced sound flag, data transfer processing is performed to the DSP unit 5 (step S6).
9). If there is no unvoiced sound flag, an instruction to perform time axis compression is output to the DSP unit 5 (step S70), and D
Data is transferred to the SP unit 5 (step S69). The time base compression at this time uses the TDHS method as described above.

【００４９】そして、主制御回路８は、内部記憶部に記
憶している再生位置（動作位置）情報を更新し、またイ
ンデックス部１０Ａの動作位置情報１０Ａ３を更新する
（ステップＳ７１）。その後、停止ボタンＳＴが押され
ているか状態を検出する（ステップＳ７２）。もし押さ
れていればこの再生処理を抜け出すが、押されていなけ
れば上記ステップＳ６４へ戻って、再生処理を続ける。Then, the main control circuit 8 updates the reproduction position (operating position) information stored in the internal storage unit and also updates the operating position information 10A3 of the index unit 10A (step S71). Then, it is detected whether or not the stop button ST is pressed (step S72). If it is pressed, the reproduction processing is exited, but if it is not pressed, the processing returns to step S64 and the reproduction processing is continued.

【００５０】以上のような早聞き再生処理を行えば、無
声音の波形部つまり子音の部分は加算平均処理（ＴＤＨ
Ｓ処理）を行われず、早聞き処理しても聞き取り易くす
ることができる。When the above-described fast-listening reproduction processing is performed, the unvoiced waveform portion, that is, the consonant portion, is added and averaged (TDH).
Even if the fast listening process is performed without performing (S processing), it is possible to make it easier to hear.

【００５１】なお、上記ステップＳ６６に於いて、早聞
き処理を行うかどうかの判定は、図５の（Ｂ）に示す表
のように、幾つかのモードを再生起動する前に音声起動
ボタンＶＡＤにより選択することによって優先条件を付
けて判定される。In step S66, whether or not the fast-listening process is to be performed is determined by the voice activation button VAD before starting the reproduction of some modes as shown in the table of FIG. 5B. By making a selection, a priority condition is added to make a determination.

【００５２】以上のように、本実施例では、所定時間
（例えば、２０ｍｓ）内の音声信号とディジタル処理に
よって導かれる予測信号の残差との相互相関を算出し
て、その算出した値と音声信号の自己相関値との比を取
り、有声音，無声音，無音を判断する。これにより、有
声音又は無音であればＴＤＨＳ方式や一部間引きを行う
ことで時間軸圧縮を行い、無声音区間であれば、時間軸
圧縮を行わない又は時間軸圧縮の比率を下げる操作を行
う。一般的に、無声音部は有声音より聞き取りにくいた
め、上記の方法により聞き取り易くする。また、ある時
間内、例えば４秒程度内で、無音部，有声音部，無声音
部の順に優先順位を付けて時間軸圧縮を行う。As described above, in the present embodiment, the cross-correlation between the voice signal within a predetermined time (for example, 20 ms) and the residual of the prediction signal derived by digital processing is calculated, and the calculated value and the voice are calculated. Voiced sound, unvoiced sound, and silence are determined by taking the ratio with the autocorrelation value of the signal. As a result, the time axis compression is performed by performing the TDHS method or partial thinning in the case of voiced sound or silence, and in the unvoiced section, time axis compression is not performed or the time axis compression ratio is reduced. In general, unvoiced parts are harder to hear than voiced sounds, so the above method makes them easier to hear. Further, within a certain time, for example, within about 4 seconds, the time axis compression is performed by prioritizing the silent part, the voiced sound part, and the unvoiced sound part in this order.

【００５３】なお、上記実施例では、有声音，無音，無
声音の区別を音声情報の記録時にしておくものとした
が、再生時にメモリ部１０から音声情報を読み出す際に
演算をして検出することもできる。従って、記録時，再
生時のどちらで区別をするようにしても良い。In the above embodiment, the voiced sound, the unvoiced sound, and the unvoiced sound are discriminated from each other at the time of recording the voice information, but the voice information is read out from the memory unit 10 at the time of reproduction to be detected by calculation. You can also Therefore, it is possible to distinguish between recording and reproducing.

【００５４】[0054]

【発明の効果】以上詳述したように、本発明の音声再生
装置によれば、逐次再生されるディジタル音声信号が、
有声音に係るものか、無声音に係るものか、無音に係る
ものかを判別し、音声信号を通常よりも高速で再生する
ときには、有声音及び無音に係るものであると判別され
た音声信号については時間軸を圧縮し、一般に有声音よ
りは聞き取りにくいものである無声音に係るものである
と判別された音声信号については、時間軸を圧縮しない
か、上記有声音及び無音に係る音声信号の時間軸の圧縮
率に比べて低い圧縮率で時間軸圧縮を行うようにしてい
るので、２倍速程度以上の高速再生を行っても、無声音
部が聞き取りにくくなることはなく、結果として、再生
音声全体が聞き取りにくなるようなことを防止できる。As described in detail above, according to the audio reproducing apparatus of the present invention, the digital audio signals that are successively reproduced are
When determining whether a voiced sound, unvoiced sound, or unvoiced sound is played back, and when the audio signal is played back faster than normal Does not compress the time axis for voice signals that are determined to be related to unvoiced sound that is generally harder to hear than voiced sound, or does not compress the time axis. Since the time axis compression is performed at a compression rate lower than the axis compression rate, the unvoiced part does not become difficult to hear even when performing high-speed reproduction of about double speed or more, and as a result, the entire reproduced sound is reproduced. It is possible to prevent people from hearing.

【００５５】また、本発明による音声記録装置では、所
定の記録媒体に音声信号をディジタル化して逐次記録す
るとき、その音声信号が、有声音に係るものか、無声音
に係るものか、無音に係るものかを検出し、その検出さ
れた情報をその音声信号と関連付けて記録媒体に記録す
るようにしているので、上記音声再生装置等に於ける再
生時には、音声信号と関連付けて記録された情報より当
該音声信号が有声音に係るものか、無声音に係るもの
か、無音に係るものかを容易に判別することができるよ
うになる。Further, in the voice recording apparatus according to the present invention, when the voice signal is digitized and sequentially recorded on a predetermined recording medium, the voice signal is related to voiced sound, unvoiced sound, and silent sound. Since it is detected that the information is detected and the detected information is recorded in the recording medium in association with the audio signal, the information recorded in association with the audio signal is reproduced from the information recorded at the time of reproduction in the audio reproduction device or the like. It becomes possible to easily determine whether the voice signal is related to voiced sound, unvoiced sound, or unvoiced sound.

【００５６】さらに、本発明による別の音声再生装置で
は、逐次再生されるディジタル音声信号が、有声音に係
るものか、無声音に係るものか、無音に係るものかを判
別し、音声信号を通常よりも高速で再生するときには、
有声音に係るものであると判別された音声信号について
は時間軸を圧縮し、一般に有声音よりは聞き取りにくい
ものである無声音に係るものであると判別された音声信
号については、所定の条件に従って時間軸の圧縮を行う
ようにしていのるので、有声音、無声音、無音のつなぎ
の不自然さがなくなり、高速再生時の音声の聞き取りや
すさが向上する。Further, in another audio reproducing apparatus according to the present invention, it is determined whether the digital audio signals which are successively reproduced are related to voiced sound, unvoiced sound or unvoiced sound, and the audio signal is normally reproduced. When playing faster than
For voice signals that are determined to be related to voiced sound, the time axis is compressed.For voice signals that are determined to be related to unvoiced sound, which is generally harder to hear than voiced sound, Since the time axis is compressed, unnaturalness of voiced sound, unvoiced sound, and connection of silent sounds is eliminated, and the audibility of the sound during high-speed reproduction is improved.

【００５７】そして、本発明による更に別の音声再生装
置では、逐次再生されるディジタル音声信号が、有声音
に係るものか、無声音に係るものか、無音に係るものか
を判別し、音声信号を通常よりも高速で再生するときに
は、有声音に係るものであると判別された音声信号につ
いては時間軸を圧縮し、無音に係るものであると判別さ
れた音声信号についてはデータの間引き処理を行うよう
にしているので、音声の明瞭性を保った状態で、即ち、
聞き取りやすい状態を保ちつつ時間を短縮して情報を聞
き取ることができる。In still another audio reproducing apparatus according to the present invention, it is determined whether the sequentially reproduced digital audio signal relates to voiced sound, unvoiced sound, or unvoiced sound, and outputs the audio signal. When playing at a higher speed than normal, the time axis is compressed for voice signals that are determined to be related to voiced sound, and data thinning processing is performed for voice signals that are determined to be related to silence. Therefore, while maintaining the intelligibility of the voice, that is,
Information can be heard in a short time while maintaining an easy-to-understand condition.

[Brief description of drawings]

【図１】本発明の一実施例としての音声記録再生装置の
ブロック構成図である。FIG. 1 is a block configuration diagram of an audio recording / reproducing apparatus as an embodiment of the present invention.

【図２】メモリ部の記録構成を示す図である。FIG. 2 is a diagram showing a recording configuration of a memory unit.

【図３】主制御回路の動作フローチャートである。FIG. 3 is an operation flowchart of a main control circuit.

【図４】図３のフローチャート中の録音処理の詳細を説
明するための動作フローチャートである。4 is an operation flowchart for explaining details of a recording process in the flowchart of FIG.

【図５】（Ａ）は音声波形とエネルギー波形の関係を示
す図であり、（Ｂ）は各モードに於ける無音と無声音と
有声音の圧縮を行う優先条件を示す表である。5A is a diagram showing a relationship between a voice waveform and an energy waveform, and FIG. 5B is a table showing priority conditions for compressing silence, unvoiced sound, and voiced sound in each mode.

【図６】ＤＳＰ内の符号化と有声音，無声音の判定処理
のフローチャートである。FIG. 6 is a flowchart of encoding processing in a DSP and determination processing of voiced sound and unvoiced sound.

【図７】図３のフローチャート中の再生処理の詳細を説
明するための動作フローチャートである。7 is an operation flowchart for explaining details of a reproduction process in the flowchart of FIG.

【図８】ＴＤＨＳ方式の時間軸圧縮処理を説明するため
の図である。FIG. 8 is a diagram for explaining a time axis compression process of the TDHS method.

[Explanation of symbols]

１…マイクロホン、２，１２…増幅器（ＡＭＰ）、３…
低域通過フィルタ（ＬＰＦ）、４…アナログ／ディジタ
ル（Ａ／Ｄ）変換器、５…ディジタル信号処理（ＤＳ
Ｐ）部、６…制御回路、７…データ入出力（Ｉ／Ｏ）バ
ッファ、８…主制御回路、９…アドレス制御回路、１０
…記録媒体（半導体メモリ部）、１０Ａ…インデックス
部、１０Ａ１…操作開始位置情報、１０Ａ２…操作終了
位置情報、１０Ａ３…動作位置情報、１０Ｂ…音声デー
タ部、１０Ｂ１，１０Ｂ２，１０Ｂ３…音声メッセージ
ファイル、１１…ディジタル／アナログ（Ｄ／Ａ）変換
器、１３…スピーカ、１４…駆動回路、１５…表示器、
１６…主電源スイッチ、ＲＥＣ…録音ボタン、ＰＬ…再
生ボタン、ＳＴ…停止ボタン、ＦＦ…早送りボタン、Ｒ
ＥＷ…戻しボタン、Ｉ…Ｉマークボタン、Ｅ…Ｅマーク
ボタン、ＶＡＤ…音声起動（ボイスアクティブディテク
タ）ボタン。1 ... Microphone, 2, 12 ... Amplifier (AMP), 3 ...
Low-pass filter (LPF), 4 ... Analog / digital (A / D) converter, 5 ... Digital signal processing (DS)
P) section, 6 ... Control circuit, 7 ... Data input / output (I / O) buffer, 8 ... Main control circuit, 9 ... Address control circuit, 10
... recording medium (semiconductor memory section), 10A ... index section, 10A1 ... operation start position information, 10A2 ... operation end position information, 10A3 ... operation position information, 10B ... voice data section, 10B1, 10B2, 10B3 ... voice message file, 11 ... Digital / analog (D / A) converter, 13 ... Speaker, 14 ... Driving circuit, 15 ... Display,
16 ... Main power switch, REC ... Record button, PL ... Play button, ST ... Stop button, FF ... Fast forward button, R
EW ... Return button, I ... I mark button, E ... E mark button, VAD ... Voice activation (voice active detector) button.

【手続補正書】[Procedure amendment]

【提出日】平成６年５月１３日[Submission date] May 13, 1994

【手続補正１】[Procedure Amendment 1]

【補正対象書類名】明細書[Document name to be amended] Statement

【補正対象項目名】図５[Name of item to be corrected] Figure 5

【補正方法】変更[Correction method] Change

【補正内容】[Correction content]

【図５】（Ａ）は音声波形とエネルギー波形の関係を示
す図であり、（Ｂ）は各モードに於ける無音と無声音と
有声音の圧縮を行う優先条件を示す図表である。5A is a diagram showing a relationship between a voice waveform and an energy waveform, and FIG. 5B is a chart showing a priority condition for performing compression of silence, unvoiced sound, and voiced sound in each mode.

Claims

[Claims]

1. An audio reproduction unit for reproducing a digital audio signal, and determines whether the audio signals successively reproduced by the audio reproduction unit are related to voiced sound, unvoiced sound, or unvoiced sound. When the audio signal is reproduced at a higher speed than usual, the audio signal, which is determined to be related to voiced sound and unvoiced sound, is related to unvoiced sound by compressing the time axis. A time axis compression unit that does not compress the time axis of the determined audio signal or performs time axis compression at a compression rate lower than the time axis compression rate of the voiced and unvoiced audio signals. An audio reproducing device characterized by.

2. An audio recording means for digitizing and recording an audio signal on a predetermined recording medium, and an audio signal successively recorded by said audio recording means,
Detecting means for detecting whether it is related to voiced sound, unvoiced sound, or unvoiced sound, and the information detected by the detecting means, related information to be recorded in the recording medium in association with the audio signal. A voice recording device comprising: a recording unit.

3. A sound reproducing means for reproducing a digital sound signal, and determining whether the sound signals successively reproduced by the sound reproducing means are related to voiced sound, unvoiced sound or unvoiced sound. When the audio signal is reproduced at a higher speed than usual, the audio signal which is determined to be related to voiced sound by the determining means compresses the time axis and is determined to be related to unvoiced sound. And a conditional time axis compression means for performing time axis compression on the audio signal according to a predetermined condition.

4. A voice reproducing means for reproducing a digital voice signal, and determining whether the voice signals successively reproduced by the voice reproducing means relate to voiced sound, unvoiced sound or unvoiced sound. When the audio signal is reproduced at a higher speed than usual, the audio signal, which is determined to be related to voiced sound, is compressed by the determining means and is determined to be related to silence. And a data processing unit that performs data thinning processing on the audio signal.