JPH0816199A

JPH0816199A - Sound recording device

Info

Publication number: JPH0816199A
Application number: JP6144663A
Authority: JP
Inventors: Tadashi Asai; 忠浅井; Teruo Hoshi; 照雄法師
Original assignee: Sanyo Electric Co Ltd
Current assignee: Sanyo Electric Co Ltd
Priority date: 1994-06-27
Filing date: 1994-06-27
Publication date: 1996-01-19

Abstract

PURPOSE:To conduct a high quality sound recording of digital voice signals for a long time. CONSTITUTION:Input voice signals are converted into gidital data in an A/D converter 12, coded by a coder 14 and stored in a voice memory 30. Then, the inputted signals are fed to a characteristic discrimination section 24 and the distinction of the sex of the speaker is made by checking the frequency characteristics of the data, for example, by discriminating the level of the voice signal of lower frequency. The discrimination result is supplied to a control section 26, which sets the sampling frequency of the converter 12 lower than that of a female when the speaker is discriminated to be a male. Since a male voice contains much lower frequency components, a lower sampling frequency does not adversely affect the tone quality. By conducting the above control, an efficient data compression is accomplished and a long time recording is performed while keeping the tone quality high.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、音声信号をデジタルデ
ータとして録音する録音装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a recording device for recording a voice signal as digital data.

【０００２】[0002]

【従来の技術】従来より、デジタルデータの記録媒体と
して半導体メモリが利用されており、留守番電話機の録
音など音声信号の録音にも利用されている。このような
録音は、例えば音声信号をＡ／Ｄ変換器でデジタル化し
た後、ＤＳＰ（デジタル・シグナル・プロセッシング）
処理により高効率符号化して大容量ＬＳＩメモリに記録
することによって行う。また、再生は、メモリから読み
出した符号化データをＤＳＰ処理して復号した後、Ｄ／
Ａ変換器で音声信号に戻す。2. Description of the Related Art Conventionally, a semiconductor memory has been used as a recording medium for digital data, and it has also been used for recording voice signals such as recording on an answering machine. Such recording is performed by, for example, digitizing an audio signal by an A / D converter and then DSP (digital signal processing).
It is performed by high-efficiency encoding by processing and recording in a large-capacity LSI memory. For reproduction, the coded data read from the memory is DSP processed and decoded, and then D /
Return to audio signal with A converter.

【０００３】このようなシステムの場合、Ａ／Ｄ変換の
際のサンプリング周波数は、通常８ＫＨｚに固定されて
いるが、限られたメモリ容量でより長時間録音を可能に
するために、６ｋＨｚ、４ｋＨｚ、３ｋＨｚも選択可能
になっている。また、符号化データのビット数（量子化
ビット数）も１〜４ビットのいずれかに固定されている
ものが多いが、２ビット、３ビット、４ビットと量子化
ビットを選択可能なものも知られている。In such a system, the sampling frequency for A / D conversion is usually fixed at 8 KHz, but in order to enable recording for a longer time with a limited memory capacity, 6 kHz, 4 kHz. 3 kHz is also selectable. Also, the number of bits (quantization bit number) of encoded data is often fixed to any one of 1 to 4 bits, but some bits can select 2 bits, 3 bits, 4 bits and quantization bits. Are known.

【０００４】勿論、良好な音質を得るためには、サンプ
リング周波数Ｆｓは８ｋＨｚ、量子化ビット数は４ビッ
ト（このときのビットレートは３２ｋｂｐｓ）が必要で
ある。しかしながら、録音時間はビットレートに反比例
するため、音質とのかねあいから２４ｋｂｐｓ程度に設
定される場合が多い。Of course, in order to obtain good sound quality, the sampling frequency Fs needs to be 8 kHz and the number of quantization bits must be 4 bits (the bit rate at this time is 32 kbps). However, since the recording time is inversely proportional to the bit rate, it is often set to about 24 kbps in consideration of the sound quality.

【０００５】[0005]

【発明が解決しようとする課題】上述のように、従来の
録音装置においては、長時間の録音を行いたい場合に音
質を犠牲にしていた。しかし、音質を犠牲にすることが
好ましいわけではなく、十分な音質を維持しつつ、長時
間の録音を可能にすることが望まれている。As described above, in the conventional recording apparatus, the sound quality is sacrificed when it is desired to record for a long time. However, it is not preferable to sacrifice sound quality, and it is desired to enable long-term recording while maintaining sufficient sound quality.

【０００６】本発明は、上記課題に鑑みなされたもので
あり、音質を維持しつつ長時間録音を可能とする録音装
置を提供することを目的とする。The present invention has been made in view of the above problems, and an object of the present invention is to provide a recording apparatus capable of recording for a long time while maintaining sound quality.

【０００７】[0007]

【課題を解決するための手段】本発明は、音声信号をＡ
／Ｄ変換器によりデジタルデータに変換して記録する録
音装置において、入力されてくる音声信号の周波数特性
を判定する周波数特性判定手段と、判定された周波数特
性に応じて上記Ａ／Ｄ変換器におけるサンプリング周波
数を変更するサンプリング周波数変更手段と、を有する
ことを特徴とする。SUMMARY OF THE INVENTION The present invention provides an audio signal A
In a recording device for converting digital data by an A / D converter and recording the digital data, a frequency characteristic judging means for judging a frequency characteristic of an input audio signal, and an A / D converter according to the judged frequency characteristic. And a sampling frequency changing means for changing the sampling frequency.

【０００８】また、本発明は、音声信号をＡ／Ｄ変換器
によりデジタルデータに変換した後、符号化器で所定ビ
ットの符号データに変換して記録する録音装置におい
て、入力されてくる音声信号の周波数特性を判定する周
波数特性判定手段と、判定された周波数特性に応じて上
記Ａ／Ｄ変換器におけるサンプリング周波数を変更する
サンプリング周波数変更手段と、判定された周波数特性
に応じて上記符号化で得る符号データのビット数を変更
するビット数変更手段と、を有することを特徴とする。Further, according to the present invention, an audio signal input to a recording device which converts an audio signal into digital data by an A / D converter and then converts the audio data into coded data of a predetermined bit by an encoder for recording. The frequency characteristic determining means for determining the frequency characteristic of, the sampling frequency changing means for changing the sampling frequency in the A / D converter according to the determined frequency characteristic, and the encoding according to the determined frequency characteristic. And a bit number changing means for changing the bit number of the obtained code data.

【０００９】また、本発明は、上記周波数特性判定手段
は、音声の母音部分の基本周波数に当たる音声信号波形
の繰り返し周波数を検出し、低周波音声または高周波音
声のいずれであるかを判定し、上記サンプリング周波数
変更手段は、低周波音声の場合にサンプリング周波数を
低くし、高周波音声の場合にサンプリング周波数を高く
することを特徴とする。Further, according to the present invention, the frequency characteristic determining means detects the repetition frequency of the voice signal waveform corresponding to the fundamental frequency of the vowel part of the voice, determines whether it is a low frequency voice or a high frequency voice, and The sampling frequency changing means is characterized by lowering the sampling frequency in the case of low frequency sound and increasing the sampling frequency in the case of high frequency sound.

【００１０】また、本発明は、上記周波数特性判定手段
は、所定周波数以下の成分が多いか否かを検出し、低周
波音声または高周波音声のいずれであるかを判定し、上
記サンプリング周波数変更手段は、低周波音声の場合に
サンプリング周波数を低くし、高周波音声の場合にサン
プリング周波数を高くすることを特徴とする。Further, according to the present invention, the frequency characteristic judging means detects whether or not there are many components below a predetermined frequency, judges whether it is a low frequency sound or a high frequency sound, and the sampling frequency changing means. Is characterized by lowering the sampling frequency in the case of low frequency speech and increasing the sampling frequency in the case of high frequency speech.

【００１１】[0011]

【作用】本発明は、男性と女性とで、音声の特性が異な
ることに注目することによってなされたものである。音
声は、ホルトマントと呼ばれるスペクトルの極大値を形
成するピーク位置によって母音の識別が行われており、
周波数の低い方から第１ホルトマント、第２ホルトマン
トと呼ばれれている。この第１および第２ホルトマント
は、男性の場合２８０Ｈｚ〜２３１０Ｈｚ、女性の場合
３４０Ｈｚ〜２８３０Ｈｚ程度にある。ここで、男女各
２５名の発声測定の結果を平均したホルトマントの位置
を表１に示す。The present invention has been made by paying attention to the fact that male and female have different voice characteristics. In speech, vowels are identified by the peak position that forms the maximum value of the spectrum called Holtmant,
The ones with lower frequencies are called the first and second holts. The first and second holtmants are at 280 Hz to 2310 Hz for men and about 340 Hz to 2830 Hz for women. Here, Table 1 shows the positions of the Holtmants, which are the averages of the results of the vocalization measurement for each of 25 men and women.

【００１２】[0012]

【表１】また、第３ホルトマントは、話者の識別に重要なファク
ターになっており、これによって個人の声の特徴が形成
される。従って、音声を録音する場合には、この第３ホ
ルトマントをカバーする必要がある。そして、この第３
ホルトマントは、男性で３０００Ｈｚまで、女性で４０
００Ｈｚまでの帯域にある。音声信号の音質を維持して
Ａ／Ｄ変換する場合、その周波数の２倍の周波数でサン
プリングする必要がある。従って、Ａ／Ｄ変換のサンプ
リング周波数として、女性の場合は８ｋＨｚが必要であ
り、男性の場合６ｋＨｚでも十分である。[Table 1] Also, the third holtmant is an important factor for speaker identification, which forms the characteristics of the individual's voice. Therefore, when recording voice, it is necessary to cover this third Holtmant. And this third
Holtmant is up to 3000Hz for men and 40 for women.
It is in the band up to 00 Hz. When performing A / D conversion while maintaining the sound quality of the audio signal, it is necessary to sample at a frequency twice that frequency. Therefore, as a sampling frequency for A / D conversion, 8 kHz is necessary for a female and 6 kHz is sufficient for a male.

【００１３】このように、男性と女性とでは、音質を維
持するために必要なサンプリング周波数が異なる。そこ
で、男性と女性とで、サンプリング周波数を変更すれ
ば、音質を十分なものに維持しながらデータの圧縮の効
率を上昇できる。As described above, the sampling frequency required for maintaining the sound quality differs between men and women. Therefore, by changing the sampling frequency between male and female, the efficiency of data compression can be increased while maintaining sufficient sound quality.

【００１４】本発明によれば、入力されてくる音声信号
の周波数特性を判定する。これによって、男性と女性の
識別を行う。そして、この識別結果に応じて、Ａ／Ｄ変
換の際のサンプリング周波数を変更することで、音質を
損なうことなく、効率良いデータの圧縮が行える。例え
ば、男性の場合には、サンプリング周波数を６ｋＨｚ、
女性の場合には、８ｋＨｚに設定することによって、第
３ホルトマントを失うことなく、データの圧縮効率を高
めることができる。According to the present invention, the frequency characteristic of the input voice signal is determined. This distinguishes between male and female. Then, by changing the sampling frequency at the time of A / D conversion according to the identification result, efficient data compression can be performed without degrading the sound quality. For example, for men, the sampling frequency is 6 kHz,
In the case of a woman, by setting the frequency to 8 kHz, the data compression efficiency can be improved without losing the third Holtmant.

【００１５】また、Ａ／Ｄ変換によって得られたデジタ
ルデータをそのまま記憶すると、データ量が膨大にな
る。そこで、本発明においては、符号化して、データ量
を削減してからデータを記憶する。そして、この符号化
した際に得る符号化データのビット数を周波数特性に応
じて変更する。例えば、男性の音声であれば４ビット、
女性の音声であれば３ビットにする。特に、この場合の
Ａ／Ｄ変換の際のサンプリング周波数を男性の場合６ｋ
Ｈｚ、女性の場合８ｋＨｚにすることによって、男女と
も２４ｋｂｐｓとなる。このようにして、ビットレート
を同一として、効果的なデータの圧縮ができる。Further, if the digital data obtained by the A / D conversion is stored as it is, the data amount becomes enormous. Therefore, in the present invention, the data is stored after being encoded to reduce the data amount. Then, the number of bits of the encoded data obtained at the time of encoding is changed according to the frequency characteristic. For example, 4 bits for male voice,
If it is a female voice, set it to 3 bits. In particular, the sampling frequency for A / D conversion in this case is 6k for men.
Hz and 8 kHz for a female, 24 kbps for both male and female. In this way, it is possible to effectively compress data with the same bit rate.

【００１６】また、音声の母音部分の波形は、声帯の形
状等によって決定される基本周波数で同一波形を繰り返
すものになっている。そして、この基本周波数は、男性
と女性とで異なっている。例えば、男女の音声の基本周
波数は、図４に示すような分布になっており、男性の場
合１２５Ｈｚ程度が中心、女性の場合２４０Ｈｚ程度が
中心の分布になっている。そこで、音声信号における基
本周波数により男性か女性かを判定できる。Further, the waveform of the vowel portion of the voice is such that the same waveform is repeated at the fundamental frequency determined by the shape of the vocal cord and the like. And this fundamental frequency is different between men and women. For example, the fundamental frequencies of the voices of men and women have a distribution as shown in FIG. 4, with the distribution centering around 125 Hz for men and 240 Hz for women. Therefore, it is possible to determine male or female based on the fundamental frequency of the voice signal.

【００１７】本発明では、この基本周波数を検出し、男
性の音声に対応する低周波の音声と女性の音声の対応す
る高周波の音声かを判定する。そして、この判定結果に
従ってＡ／Ｄ変換の際のサンプリング周波数を変更する
ため、音質を損なうことなくデータの圧縮率を上昇する
ことができる。According to the present invention, the fundamental frequency is detected to determine whether the low frequency voice corresponding to the male voice and the high frequency voice corresponding to the female voice. Since the sampling frequency at the time of A / D conversion is changed according to this determination result, the data compression rate can be increased without impairing the sound quality.

【００１８】また、男女の音声を周波数分析した場合、
男性の声に比較して、女性の声は１５０Ｈｚ以下の成分
が極めて少ないという特徴がある。When frequency analysis is performed on the voices of men and women,
Compared to male voices, female voices are characterized by having very few components below 150 Hz.

【００１９】本発明では、この所定周波数（例えば、１
５０Ｈｚ）以下の成分が多く存在するか否かで、男性の
音声に対応する低周波の音声と女性の音声の対応する高
周波の音声かを判定する。そこで、上述の場合と同様に
して、音質を損なうことなくデータの圧縮率を上昇する
ことができる。In the present invention, this predetermined frequency (for example, 1
It is determined by whether or not there are many components below 50 Hz), that is, a low frequency voice corresponding to a male voice and a high frequency voice corresponding to a female voice. Therefore, similarly to the case described above, the data compression rate can be increased without deteriorating the sound quality.

【００２０】[0020]

【実施例】以下、本発明の実施例について、図面に基づ
いて説明する。図１は、システムの全体構成を示すブロ
ック図であり、固体録音再生ＬＳＩ１０、音声メモリ３
０およびマイコン４０からなっている。マイクロフォン
等によって、電気信号に変換されたアナログの入力音声
信号は固体録音再生ＬＳＩ１０に入力される。固体録音
再生ＬＳＩ１０は、その内部に、Ａ／Ｄ変換器１２、符
号化器１４、書込みドライバ１６、読出しドライバ１
８、復号化器２０、Ｄ／Ａ変換器２２、特性判定部２４
および制御部２６を有している。Embodiments of the present invention will be described below with reference to the drawings. FIG. 1 is a block diagram showing the overall configuration of the system, including a solid-state recording / playback LSI 10 and an audio memory 3.
0 and the microcomputer 40. An analog input voice signal converted into an electric signal by a microphone or the like is input to the solid-state recording / reproducing LSI 10. The solid-state recording / reproducing LSI 10 includes therein an A / D converter 12, an encoder 14, a write driver 16, a read driver 1.
8, decoder 20, D / A converter 22, characteristic determination unit 24
And a control unit 26.

【００２１】入力音声信号は、Ａ／Ｄ変換器１２に入力
され、ここで所定ビット、例えば８ビットのデジタルデ
ータに変換される。なお、このＡ／Ｄ変換器１２はサン
プリング周波数が少なくとも６ｋＨｚ、８ｋＨｚの２段
階に切り替え可能になっている。Ａ／Ｄ変換器１２から
のデジタルデータは、符号化器１４で所定の手法で符号
化される。例えば、この符号化の手法として、ＭＳＢＣ
−ＡＢ等が好適である。このＭＳＢＣ−ＡＢは、適応的
なビット割り付けを行う変形サブバンド符号化を意味し
ており、音声データを２０ｍｓｅｃ程度のフレームに区
切り、このフレーム毎に４バンド位の帯域に分割し各々
の波形を符号化するするもので、この符号化の際の周波
数帯域毎の割り当てビット数が適応的に変更されるもの
である。The input audio signal is input to the A / D converter 12, where it is converted into digital data of a predetermined bit, for example, 8 bits. The A / D converter 12 can switch the sampling frequency between two levels of at least 6 kHz and 8 kHz. The digital data from the A / D converter 12 is encoded by the encoder 14 by a predetermined method. For example, as the encoding method, MSBC
-AB and the like are preferable. This MSBC-AB means a modified sub-band coding that adaptively allocates bits, divides audio data into frames of about 20 msec, divides each frame into bands of about 4 bands, and divides each waveform. Encoding is performed, and the number of allocated bits for each frequency band at the time of encoding is adaptively changed.

【００２２】そして、この符号化器１４における１つの
デジタル信号に対する符号化データのビット数（量子化
ビット数）が、切り替え可能になっている。量子化ビッ
ト数は、少なくとも、３ビットまたは４ビットに切り替
え可能になっている。The bit number (quantization bit number) of encoded data for one digital signal in the encoder 14 can be switched. The number of quantization bits can be switched to at least 3 bits or 4 bits.

【００２３】このようにして得られた符号化データは、
書込みドライバ１６を介し、音声メモリ３０に書き込ま
れる。これによって、入力音声信号が、デジタルデータ
として、半導体の音声メモリ３０に記録される。なお、
この音声メモリ３０は、例えば４ＭバイトのＤＲＡＭで
構成される。The coded data thus obtained is
It is written in the audio memory 30 via the write driver 16. As a result, the input voice signal is recorded as digital data in the semiconductor voice memory 30. In addition,
The voice memory 30 is composed of, for example, a 4-Mbyte DRAM.

【００２４】一方、音声メモリ３０から読出しドライバ
１８を介し読み出された符号化データは、復号化器２０
に供給され、ここで復号化され、デジタルの音声データ
に変換される。そして、Ｄ／Ａ変換器２２によりアナロ
グの出力音声信号に戻される。ここで、符号化器２０の
復号化は、符号化器１４の符号化に対応したものであ
り、またＤ／Ａ変換器２２はＡ／Ｄ変換器１２に対応し
たものであり、これらは記録系のサンプリング周波数や
量子化ビット数の変更に対応して処理内容を変更する。On the other hand, the encoded data read from the audio memory 30 via the read driver 18 is decoded by the decoder 20.
, Where it is decoded and converted into digital audio data. Then, the D / A converter 22 returns the analog output audio signal. Here, the decoding of the encoder 20 corresponds to the encoding of the encoder 14, the D / A converter 22 corresponds to the A / D converter 12, and these are recorded. The processing contents are changed in response to changes in the sampling frequency of the system and the number of quantization bits.

【００２５】そして、本実施例では、特性判定部２４に
おいて、入力音声信号の特性を判定し、男性の音声か女
性の音声かを判定する。男性の音声の場合、その音声の
特徴を示す第３ホルトマントまでをカバーするためのＡ
／Ｄ変換器１２のサンプリング周波数は６ｋＨｚでよ
い。そこで、特性判定部２４において、入力音声信号が
男性と判定した場合には、制御部２６がＡ／Ｄ変換器１
２におけるサンプリング周波数を６ｋＨｚ、符号化器１
４の量子化ビットを４ビットに設定すると共に、復号化
器２０、Ａ／Ｄ変換器２２をこれらに対応したものに設
定する。Then, in the present embodiment, the characteristic judging section 24 judges the characteristic of the input voice signal to judge whether it is a male voice or a female voice. In the case of a male voice, A for covering up to the third holtmant, which shows the features of the voice
The sampling frequency of the / D converter 12 may be 6 kHz. Therefore, when the characteristic determination unit 24 determines that the input audio signal is male, the control unit 26 causes the A / D converter 1 to operate.
Sampling frequency in 2 is 6 kHz, encoder 1
The quantization bits of 4 are set to 4 bits, and the decoder 20 and the A / D converter 22 are set to those corresponding to these.

【００２６】一方、特性判定部２４において、入力音声
信号が女性と判定した場合には、制御部２６がＡ／Ｄ変
換器１２におけるサンプリング周波数を８ｋＨｚ、符号
化器１４の量子化ビットを３ビットに設定すると共に、
復号化器２０、Ａ／Ｄ変換器２２をこれらに対応したも
のに設定する。On the other hand, when the characteristic judging section 24 judges that the input voice signal is female, the controlling section 26 sets the sampling frequency of the A / D converter 12 to 8 kHz and the quantization bit of the encoder 14 to 3 bits. Set to
The decoder 20 and the A / D converter 22 are set to correspond to these.

【００２７】このようにして、本実施例によれば、ビッ
トレートは２４ｋｂｐｓに固定したままで、サンプリン
グ周波数および量子化ビットを変更することによって、
音質を維持しつつ、音声メモリ３０に記憶するデータ量
を削減することができる。すなわち、入力音声信号の特
質に合わせてデータ圧縮の手法を変更し、効果的なデー
タ圧縮を行うことができる。In this way, according to this embodiment, by changing the sampling frequency and the quantized bit while keeping the bit rate fixed at 24 kbps,
It is possible to reduce the amount of data stored in the audio memory 30 while maintaining the sound quality. That is, the data compression method can be changed according to the characteristics of the input audio signal, and effective data compression can be performed.

【００２８】なお、マイコン４０は、外部から入力され
る操作信号などによって、固体録音再生ＬＳＩ１０の動
作を制御するものであり、録音の起動停止、再生の起動
停止、モードの設定などを制御する。また、外部からの
操作によって、サンプリング周波数を変更したり、量子
化ビットを変更するようにしてもよいし、また特性判定
による制御をオンオフできるようにしても良い。The microcomputer 40 controls the operation of the solid-state recording / reproducing LSI 10 in accordance with an operation signal input from the outside, and controls recording start / stop, reproduction start / stop, mode setting, and the like. The sampling frequency may be changed, the quantization bit may be changed, or the control based on the characteristic determination may be turned on / off by an external operation.

【００２９】次に、特性判定部２４の構成の一例につい
て、図２に基づいて説明する。この例では、入力音声信
号の基本周波数を測定し、男性女性の別を判定する。す
なわち、入力音声信号は、ローパスフィルタ５２に入力
され、ここで高周波成分がカットされた後、周波数解析
器５２に入力され周波数解析される。そして、判定器５
６が周波数解析の結果に応じて、入力音声信号が男性の
ものか女性のものかを判定する。Next, an example of the configuration of the characteristic determining section 24 will be described with reference to FIG. In this example, the fundamental frequency of the input voice signal is measured to determine whether it is male or female. That is, the input audio signal is input to the low pass filter 52, where high frequency components are cut off, and then input to the frequency analyzer 52 for frequency analysis. And the determiner 5
6 determines whether the input voice signal is male or female, according to the result of the frequency analysis.

【００３０】ここで、ローパスフィルタ５２では、例え
ば５００Ｈｚ以上の成分がカットされる。音声は、声帯
で決定される基本周波数の音に舌、顎等の形による高周
波成分が重畳されて、各種の音になる。しかし、通常の
会話の際の音声の基本周波数は一定である。このため、
音声信号から５００Ｈｚ以上の成分をカットすると、ほ
ぼ基本周波数の成分のみが残る。そこで、得られた信号
の周波数解析を行うことによって、基本周波数を検出す
ることができる。Here, in the low-pass filter 52, components of, for example, 500 Hz or higher are cut. A voice has various sounds by superposing a high-frequency component due to the shape of the tongue, jaw, etc. on the sound of the fundamental frequency determined by the vocal cords. However, the fundamental frequency of voice in a normal conversation is constant. For this reason,
If the component of 500 Hz or more is cut from the audio signal, only the component of the fundamental frequency remains. Therefore, the fundamental frequency can be detected by performing frequency analysis of the obtained signal.

【００３１】そして、判定器５６は、周波数解析器５４
の解析結果により周波数が１２５Ｈｚ近辺であった場合
には、男性と判定し、解析結果が２４５Ｈｚ近辺であっ
た場合には、女性と判定する。この判定は、カウンタの
カウント値を所定値と比較し、所定範囲に入っているか
を判定すればよい。The decision unit 56 is the frequency analyzer 54.
When the frequency is around 125 Hz according to the analysis result of 1., it is determined to be a male, and when the analysis result is around 245 Hz, it is determined to be a female. This determination may be made by comparing the count value of the counter with a predetermined value and determining whether it is within a predetermined range.

【００３２】このようにして、音声の基本周波数を検出
することによって、話者が男性であるか、女性であるか
を判定することができる。したがって、この情報を制御
部２６に供給することによって、制御部２６が音声信号
の特性に合わせたデータの圧縮を行うことができる。In this way, it is possible to determine whether the speaker is male or female by detecting the fundamental frequency of the voice. Therefore, by supplying this information to the control unit 26, the control unit 26 can compress the data in accordance with the characteristics of the audio signal.

【００３３】次に、図３に、特性判定部２４の他の構成
例を示す。この例では、音声信号の１５０Ｈｚ以上の成
分と、１５０Ｈｚ以下の成分の割合に応じて、話者が男
性であるか、女性であるかを判定する。すなわち、入力
音声信号は、ローパスフィルタ６２およびハイパスフィ
ルタ６４に入力される。ローパスフィルタ６２は、１５
０Ｈｚ以上の信号をカットするものであり、ハイパスフ
ィルタ６４は、１５０Ｈｚ以下の信号をカットするもの
である。Next, FIG. 3 shows another structural example of the characteristic judging section 24. In this example, it is determined whether the speaker is a male or a female according to the ratio of the component of 150 Hz or higher and the component of 150 Hz or lower of the audio signal. That is, the input audio signal is input to the low pass filter 62 and the high pass filter 64. The low pass filter 62 has 15
The signal of 0 Hz or higher is cut, and the high pass filter 64 cuts the signal of 150 Hz or lower.

【００３４】ローパスフィルタ６２およびハイパスフィ
ルタ６４の出力は、それぞれ別々のレベル積算器６６、
６８に入力される。これらレベル積算器６６、６８は、
入力されてくる信号の信号レベルを検波すると共に、こ
のレベル値を所定時間積算する。従って、レベル積算器
６６、６８には、１５０Ｈｚ以下の信号のレベルと、１
５０Ｈｚ以上の信号のレベルが得られる。そして、レベ
ル積算器６６、６８の積算結果の信号はコンパレータ７
０に入力され、ここで両者が比較される。The outputs of the low-pass filter 62 and the high-pass filter 64 are respectively level accumulators 66,
68 is input. These level accumulators 66 and 68 are
The signal level of the input signal is detected and this level value is integrated for a predetermined time. Therefore, the level accumulators 66 and 68 have a signal level of 150 Hz or less and
A signal level of 50 Hz or higher can be obtained. The signal of the integration result of the level integrators 66 and 68 is sent to the comparator 7
It is input to 0, and both are compared here.

【００３５】男性の場合１５０Ｈｚ以下の成分の信号レ
ベルが大きく、一方、女性の場合は１５０Ｈｚ以下の成
分の信号レベルは非常に小さい。そこで、コンパレータ
７０の比較結果において、レベル積算器６６の出力、す
なわち１５０Ｈｚ以下の信号のレベルの方が大きけれ
ば、入力音声信号は男性のものであると判断され、レベ
ル積算器６８の出力、すなわち１５０Ｈｚ以上の信号の
レベルの方が大きければ、入力音声信号は女性のもので
あると判断される。このようにして、この例の特性判定
部２４により、入力音声信号が男性のものであるか、女
性のものであるかを判定することができる。したがっ
て、この回路を利用して、音声信号の特性に合わせたデ
ータの圧縮を上述の場合と同様に行うことができる。In the case of a male, the signal level of the component below 150 Hz is large, while in the case of a female, the signal level of the component below 150 Hz is very small. Therefore, in the comparison result of the comparator 70, if the output of the level integrator 66, that is, the level of the signal of 150 Hz or less is larger, it is determined that the input voice signal is male, and the output of the level integrator 68, that is, If the level of the signal of 150 Hz or higher is higher, the input audio signal is judged to be female. In this way, the characteristic determining unit 24 of this example can determine whether the input audio signal is of a male type or a female type. Therefore, by using this circuit, data compression suitable for the characteristics of the audio signal can be performed in the same manner as in the above case.

【００３６】また、留守番電話機の用件録音のように、
通常の録音の場合、話者の性別は分からない。従って、
上述のような性別の判定は録音開始後の初期に行わなけ
ればならない。そこで、本実施例の装置では、図５に示
すように、録音開始当初の１０秒間位は、サンプリング
周波数８Ｈｚ、量子化ビット４ビットの、ビットレート
３２ｋｂｐｓで録音しながら性別の判定を行う。そし
て、性別の判定ができた場合に、男性ならサンプリング
周波数を６ｋＨｚに変更し、女性なら量子化ビットを３
ビットに変更し、ビットレート２４ｋｂｐｓでその後の
録音を最後まで行う。このようにすることによって、全
体の録音時間にもよるが、性別判定時間は録音時間に占
める割合が少ないので、１回の録音に要するメモリの容
量は、ほぼ２４ｋｂｐｓでの値に近くなり、十分な音質
を維持しつつ、限られたメモリ容量で、長時間の録音が
可能になる。Also, like the message recording of an answering machine,
In normal recording, the gender of the speaker is unknown. Therefore,
The determination of sex as described above must be performed early after the start of recording. Therefore, in the apparatus of the present embodiment, as shown in FIG. 5, for 10 seconds at the beginning of recording, the gender is determined while recording at a bit rate of 32 kbps with a sampling frequency of 8 Hz and 4 quantization bits. If the gender can be determined, the sampling frequency is changed to 6 kHz for men, and the quantization bit is set to 3 for women.
Change to bit and record at the bit rate of 24 kbps until the end. By doing this, although the sex determination time is a small percentage of the recording time, depending on the total recording time, the memory capacity required for one recording is close to the value at 24 kbps, which is sufficient. With a limited memory capacity, it is possible to record for a long time while maintaining excellent sound quality.

【００３７】なお、上述の実施例において、特性判定部
２４は、アナログの入力音声信号を受入れ、処理を行う
ように記載したが、Ａ／Ｄ変換器１２の出力であるデジ
タルデータを受入れ処理を行っても良い。この場合、回
路は、すべてデジタル回路で形成される。In the above-mentioned embodiment, the characteristic judging section 24 is described as receiving the analog input voice signal and performing the processing. However, the characteristic judging section 24 receives the digital data output from the A / D converter 12 and performs the processing. You can go. In this case, the circuit is formed entirely of digital circuits.

【００３８】[0038]

【発明の効果】以上説明したように、本発明のよれば、
入力されてくる音声信号の周波数特性を判定する。これ
によって、男性と女性の識別が行える。そこで、この識
別結果に応じて、Ａ／Ｄ変換の際のサンプリング周波数
を変更することで、音質を損なうことを抑制して、効率
良いデータの圧縮が行える。As described above, according to the present invention,
The frequency characteristic of the input audio signal is determined. This makes it possible to distinguish between men and women. Therefore, by changing the sampling frequency at the time of A / D conversion according to this identification result, it is possible to suppress the loss of sound quality and perform efficient data compression.

【００３９】また、本発明においては、符号化して、デ
ータ量を削減してからデータを記憶すると共に、この符
号化した際に得る符号化データのビット数を周波数特性
に応じて変更する。このように、量子化ビット数の制御
を合わせて行うことによりビットレートは常時同一とし
ながら、効果的なデータの圧縮ができる。Further, in the present invention, the data is stored after being encoded so as to reduce the amount of data, and the number of bits of the encoded data obtained by this encoding is changed according to the frequency characteristic. In this way, by controlling the number of quantization bits together, it is possible to effectively compress data while always keeping the same bit rate.

【００４０】また、本発明では、この基本周波数を検出
し、男性の音声に対応する低周波の音声と女性の音声の
対応する高周波の音声かを判定する。そして、この判定
結果に従ってＡ／Ｄ変換の際のサンプリング周波数を変
更するため、音質を損なうことなくデータの圧縮率を上
昇することができる。Further, in the present invention, this fundamental frequency is detected to determine whether the low frequency voice corresponding to the male voice and the high frequency voice corresponding to the female voice. Since the sampling frequency at the time of A / D conversion is changed according to this determination result, the data compression rate can be increased without impairing the sound quality.

【００４１】また、本発明では、この所定周波数以下の
成分が多く存在するか否かで、男性の音声に対応する低
周波の音声と女性の音声の対応する高周波の音声かを判
定する。そこで、上述の場合と同様にして、音質を損な
うことなくデータの圧縮率を上昇することができる。Further, in the present invention, it is determined whether there is a low frequency voice corresponding to a male voice and a high frequency voice corresponding to a female voice depending on whether or not there are many components below the predetermined frequency. Therefore, similarly to the case described above, the data compression rate can be increased without deteriorating the sound quality.

【００４２】そして、このような効率的なデータの圧縮
が行えるため、限られたメモリ容量で、音質を維持しつ
つ、長時間録音が可能になる。Since such efficient data compression can be performed, recording can be performed for a long time with a limited memory capacity while maintaining sound quality.

[Brief description of drawings]

【図１】実施例の全体構成を示すブロック図である。FIG. 1 is a block diagram showing an overall configuration of an embodiment.

【図２】特性判定部の構成例を示すブロック図である。FIG. 2 is a block diagram illustrating a configuration example of a characteristic determination unit.

【図３】特性判定部の他の構成例を示すブロック図であ
る。FIG. 3 is a block diagram illustrating another configuration example of a characteristic determination unit.

【図４】基本周波数の分布を示す図である。FIG. 4 is a diagram showing a distribution of fundamental frequencies.

【図５】録音の際の動作を示す説明図である。FIG. 5 is an explanatory diagram showing an operation at the time of recording.

[Explanation of symbols]

１０固体録音再生ＬＳＩ１２Ａ／Ｄ変換器１４符号化器２０復号化器２２Ｄ／Ａ変換器３０音声メモリ 10 Solid-state recording / playback LSI 12 A / D converter 14 Encoder 20 Decoder 22 D / A converter 30 Voice memory

Claims

[Claims]

1. A recording device for converting an audio signal into digital data by an A / D converter and recording the digital data, and a frequency characteristic judging means for judging a frequency characteristic of an input audio signal, and a frequency characteristic judging means for judging the frequency characteristic. And a sampling frequency changing means for changing the sampling frequency in the A / D converter.

2. In a recording device which converts an audio signal into digital data by an A / D converter and then converts it into coded data of a predetermined bit by an encoder for recording, a frequency characteristic of an input audio signal is measured. Frequency characteristic judging means for judging, sampling frequency changing means for changing a sampling frequency in the A / D converter according to the judged frequency characteristic, and code data obtained by the coding according to the judged frequency characteristic. A recording device comprising: a bit number changing means for changing the bit number.

3. The recording device according to claim 1, wherein the frequency characteristic determining means detects a repetition frequency of a voice signal waveform corresponding to a fundamental frequency of a vowel part of voice, and selects either low frequency voice or high frequency voice. The recording device is characterized in that the sampling frequency changing means lowers the sampling frequency in the case of low frequency sound and increases the sampling frequency in the case of high frequency sound.

4. The recording device according to claim 1, wherein the frequency characteristic determination means detects whether there are many components having a frequency equal to or lower than a predetermined frequency and determines whether the component is a low frequency voice or a high frequency voice. The recording device characterized in that the sampling frequency changing means lowers the sampling frequency in the case of low-frequency sound and increases the sampling frequency in the case of high-frequency sound.