JP2006146110A

JP2006146110A - Speech converting device

Info

Publication number: JP2006146110A
Application number: JP2004365023A
Authority: JP
Inventors: Kazuyuki Nakayama; 一之中山
Original assignee: MICRO COMMUNICATION KK
Current assignee: MICRO COMMUNICATION KK
Priority date: 2004-11-19
Filing date: 2004-11-19
Publication date: 2006-06-08

Abstract

<P>PROBLEM TO BE SOLVED: To provide an inexpensive speech converting device capable of playing a speech at high speed without changing an interval with the use of one-chip microcomputer while suppressing deterioration of the speech at the high speed playing. <P>SOLUTION: The device is composed of a microphone for inputting a speech, a jack for outputting a speech, an ADC for converting the inputted speech to digital speech data, a DAC for reconverting the digital speech data to a speech, a memory card for recording the digital speech data, a one-chip microcomputer for implementing a program, a program memory for storing an algorithm, a display unit for displaying an operation state, and an operation part by which a user selects an operation. The digital speech data are divided into blocks, and by laying the digital speech data by block and changing the sampling frequency at the time of playing, high speed and smooth playing of the speech can be achieved while suppressing deterioration of the speech. <P>COPYRIGHT: (C)2006,JPO&NCIPI

Description

本発明は音程を変えずに音声の高速再生を行う、音声変換装置に関する。 The present invention relates to an audio conversion device that performs high-speed audio reproduction without changing the pitch.

従来、音程を変えずに音声の高速再生を行うには、音程が変わらない範囲でデジタル音声データを間引いて実現しているものもある（例えば、特許文献１参照。）。また、高性能のＤＳＰを使用してフーリエ変換などを行うことで実現しているものもある。 Conventionally, in order to perform high-speed playback of sound without changing the pitch, there is also a method in which digital voice data is thinned out within a range where the pitch does not change (for example, see Patent Document 1). Some are realized by performing Fourier transform using a high-performance DSP.

前記従来の音程が変わらない範囲でデジタル音声データを間引く方法は、再生速度が上がるにつれて間引かれるデータの間隔が長くなり、再生した際の音声の劣化が激しくなってしまうという問題点を有していた。
特開２０００−８９８００ The conventional method of thinning out digital audio data in the range where the pitch does not change has a problem that the interval of the thinned data becomes longer as the reproduction speed increases, and the deterioration of sound during reproduction becomes severe. It was.
JP 2000-89800 A

また、前記従来のＤＳＰを使用してフーリエ変換などを行う方法は、高性能のＤＳＰや高速のメモリを必要とするため、装置が高価になってしまうという問題点を有していた。 In addition, the conventional method of performing Fourier transform using a DSP requires a high-performance DSP and a high-speed memory, so that the apparatus becomes expensive.

本発明は、このような従来の手法が有していた問題を解決しようとするものであり、高速再生時の音声の劣化を抑えながら、１チップマイコンを使用して音程を変えずに音声の高速再生を行うことことができる安価な音声変換装置を実現することを目的とする。 The present invention is intended to solve the problems of such a conventional method, and while suppressing deterioration of sound during high-speed playback, the sound can be reproduced without changing the pitch using a one-chip microcomputer. An object of the present invention is to realize an inexpensive audio conversion device that can perform high-speed reproduction.

本発明に係わる音声変換装置は、音声を入力するマイクと、音声を出力するジャックと、入力された音声をデジタル音声データに変換するＡＤＣと、デジタル音声データを音声に再変換するＤＡＣと、デジタル音声データを記録するメモリーカードと、プログラムを実行する１チップマイコンと、アルゴリズムを記憶したプログラムメモリと、動作状態を表示する表示部と、使用者が動作を選択する操作部で構成されている。 An audio conversion apparatus according to the present invention includes a microphone that inputs audio, a jack that outputs audio, an ADC that converts input audio into digital audio data, a DAC that reconverts digital audio data into audio, and a digital It consists of a memory card that records audio data, a one-chip microcomputer that executes a program, a program memory that stores an algorithm, a display unit that displays an operation state, and an operation unit that allows a user to select an operation.

以下、本発明の実施の形態を図１〜図３に基づいて説明する。 Hereinafter, embodiments of the present invention will be described with reference to FIGS.

図１は本発明の音声変換装置のブロック図である。マイクから入力された音声はＡＤＣによってアナログの音声信号からデジタル音声データに変換され、１チップマイコンに送られ、メモリーカードに記録される。 FIG. 1 is a block diagram of a speech conversion apparatus according to the present invention. The audio input from the microphone is converted from an analog audio signal to digital audio data by the ADC, sent to a one-chip microcomputer, and recorded on a memory card.

メモリーカードに記録されたデジタル音声データは、音声変換装置の使用者が操作部から指示を与えることで１チップマイコンによりメモリーカードから取り出され、プログラムメモリに書き込まれているアルゴリズムに従って速度変換の処理が行われてＤＡＣに送られ、ＤＡＣによってデジタル音声データからアナログの音声信号に変換されてジャックより出力される。なお、表示部はこれらの動作状態の変移を表示する。 The digital audio data recorded on the memory card is taken out from the memory card by a one-chip microcomputer when the user of the audio conversion device gives an instruction from the operation unit, and the speed conversion process is performed according to the algorithm written in the program memory. After being sent to the DAC, the DAC converts the digital audio data into an analog audio signal and outputs it from the jack. The display unit displays changes in these operating states.

次に、図２と図３により２倍速での再生を例にとり、本発明の音声の高速再生のアルゴリズムを説明する。 Next, with reference to FIGS. 2 and 3, taking the reproduction at double speed as an example, the algorithm for high-speed reproduction of sound according to the present invention will be described.

図２−１は連続したデジタル音声データを、間引いても音程の変化が起こらない周期でブロック化した図であり、１ブロックには図２−２に示すように２５６ワードのデジタル音声データが含まれているものとする。 FIG. 2-1 is a diagram in which continuous digital audio data is blocked in a cycle in which the pitch does not change even if it is thinned out. One block includes 256 words of digital audio data as shown in FIG. It shall be assumed.

この１ブロックに含まれている２５６ワードのデジタル音声データを、図２−３に示すように１ワードを２回重ねて並べ替える。このときオーバーフローした１２９〜２５６番目のデジタル音声データは切り捨てる。 The 256-word digital audio data included in one block is rearranged by overlapping one word twice as shown in FIG. At this time, the overflowed 129th to 256th digital audio data are discarded.

図２−３の１ワードを２回重ねて並べ替えられたデジタル音声データを、録音時と同じサンプリング周波数で再生すると、再生速度は１倍速のままで１オクターブ音程の下がった音声が再生される。ここで再生時のサンプリング周波数を２倍に上げると音程が元の高さに戻り、再生速度が２倍になって、音程を変えずに２倍速での再生を実現することができる。 When the digital audio data that has been rearranged by overlapping one word in Fig. 2-3 is played back at the same sampling frequency as when recording, the playback speed remains at 1x speed, and the audio is lowered by one octave. . Here, when the sampling frequency at the time of reproduction is doubled, the pitch returns to the original pitch, the reproduction speed is doubled, and reproduction at double speed can be realized without changing the pitch.

図３は従来の音程が変わらない範囲でデジタル音声データを間引く方法と本発明のアルゴリズムを比較した図である。 FIG. 3 is a diagram comparing a conventional method of thinning out digital audio data within a range where the pitch does not change and the algorithm of the present invention.

従来の音程が変わらない範囲でデジタル音声データを間引く方法は、図３−１におけるＴの周期で斜線の箇所の、偶数のデジタル音声データのブロックが間引かれる。 In the conventional method of thinning out digital audio data within a range in which the pitch does not change, even-numbered blocks of digital audio data are thinned out at hatched portions in the period T in FIG. 3-1.

図３−２は、図２−３で示した１ワードのデジタル音声データを２回重ねて並べ替えたデーターブロックであり、各データーブロックは１２９〜２５６番目のデジタル音声データが切り捨てられている。この切り捨てられた音声データーの周期はＴ／２になる。 FIG. 3B is a data block in which the 1-word digital audio data shown in FIG. 2C is overlapped and rearranged twice. In each data block, the 129th to 256th digital audio data are truncated. The period of the truncated audio data is T / 2.

図３−３は、図３−２のデジタル音声データのブロックを、サンプリング周波数を２倍にして２倍速で再生したときの図であり、再生速度が２倍になるため切り取られたデジタル音声データの周期はＴ／４となり、従来の音程が変わらない範囲でデジタル音声データを間引く方法と比べて１／４の周期でデジタル音声データを間引くことができ、再生時の音声の劣化を抑えて滑らかな音声の高速再生が可能となる。 FIG. 3C is a diagram when the block of the digital audio data in FIG. 3-2 is reproduced at double speed with the sampling frequency doubled, and the digital audio data cut out because the reproduction speed is doubled. The period is T / 4, and digital audio data can be thinned out with a period of 1/4 compared with the conventional method of thinning out digital audio data within the range where the pitch does not change, and smoothness is achieved by suppressing sound deterioration during playback. Sound can be played at high speed.

以上、２倍速での再生を例にとって説明したが、デジタル音声データの重ね合わせ方と再生時のサンプリング周波数の組み合わせで、様々な速度での音声の高速再生を行うことが可能となる。 As described above, the reproduction at the double speed has been described as an example. However, it is possible to perform high-speed audio reproduction at various speeds by combining digital audio data superimposition and the sampling frequency at the time of reproduction.

The invention's effect

上述したように本発明の音声変換装置は、従来の音程が変わらない範囲でデジタル音声データを間引く方法と比べて１／４の周期でデジタル音声データを間引くことが可能なため、再生される音声の劣化を抑えて滑らかな音声の高速再生が可能となる。また、アルゴリズムがシンプルであるため高性能のＤＳＰや高速のメモリを使用しなくても、安価な１チップマイコンを使用して音声変換装置を実現することが可能となる。 As described above, the audio conversion apparatus of the present invention can thin out digital audio data at a quarter cycle compared to the conventional method of thinning out digital audio data within a range where the pitch does not change. Smooth speech can be played at high speed while suppressing deterioration of the sound. In addition, since the algorithm is simple, it is possible to realize an audio conversion device using an inexpensive one-chip microcomputer without using a high-performance DSP or a high-speed memory.

本発明の実施形態を示す音声変換装置のブロック図。1 is a block diagram of an audio conversion device showing an embodiment of the present invention. 本発明の実施形態におけるアルゴリズムの、デジタル音声データの重ね合わせを説明するための図。The figure for demonstrating the superimposition of the digital audio | voice data of the algorithm in embodiment of this invention. 本発明の実施形態におけるアルゴリズムの、２倍速での再生を説明するための図。The figure for demonstrating reproduction | regeneration in the double speed of the algorithm in embodiment of this invention.

Claims

Digital audio data sampled based on the constant velocity is divided into blocks, and the digital audio data is overlaid on a block basis, and playback is performed with varying sampling rates, thereby suppressing deterioration of the audio during playback and reducing the original sound A method for high-speed sound reproduction, characterized in that sound is reproduced at high speed without changing the frequency.

2. A voice conversion apparatus according to claim 1, wherein said voice is reproduced at high speed using an inexpensive one-chip microcomputer.