JPS59149438A

JPS59149438A - Method of compressing and elongating digitized voice signal

Info

Publication number: JPS59149438A
Application number: JP58212288A
Authority: JP
Inventors: ロ−レンス・エドモンド・バ−ゲロン; ダニエル・フランシス・ダリ−; エレイン・チポウラス・グロツソ
Original assignee: Wang Laboratories Inc
Current assignee: Wang Laboratories Inc
Priority date: 1983-02-14
Filing date: 1983-11-11
Publication date: 1984-08-27
Also published as: CA1218462A; DE3484447D1; EP0118771B1; ATE62778T1; EP0118771A3; EP0118771A2; US4631746A; AU1902283A; AU559775B2

Abstract

An apparatus and method are disclosed in which a voice signal is digitized as it occurs, and the digitized signal is compressed and stored for reconversion to a voice signal at a later time. Binary words produced by digitizing the signal in an analog-to-digital converter represent the voice nonlinearly; they are first linearized by a first processor and then stored in a first memory until a block of binary words has been assembled. The block is then transferred to a second buffer memory, and a second processor compresses it using time domain harmonic scaling (TDHS) processing, which extracts the pitch of the original voice and compresses the voice frequency spectrum to effect a 2:1 reduction in the number of stored binary words representing the digitized voice signal. This reduced amount of data is then upsampled by a factor of four, using linear interpolation, to derive the pseudo higher sample rate required for further processing by continuously variable slope delta (CVSD) modulation, which yields one output bit for every binary word processed. The resulting bits are assembled into eight-bit words and stored in a third memory. Another binary word representing the pitch is added, and the result is a 4:1 compression of the original digitized data. The compressed digitized data is stored in a bulk memory. Thereafter, further digitized samples of the voice signal undergo the same processing. Compressed digitized voice signals may be reconverted to a voice signal for playback by processing them generally in the reverse order. Playback speed may also be controlled without changing the pitch of the voice. To double the playback speed, TDHS processing is omitted during playback, leaving the frequency spectrum compressed and a 2:1 reduction in the binary words representing the digitized voice. To slow the playback speed, the digitized voice signals undergo modified TDHS processing.

Description

DETAILED DESCRIPTION OF THE INVENTION

The present invention relates generally to digitization techniques in which a continuously varying signal is digitized and later reconverted to a continuously varying signal. More particularly, the invention relates to techniques for compressing a digitized analog signal and later expanding the compressed digitized analog signal for reconversion to an analog signal.

The use of digital transmission technology is becoming widespread in the telecommunications field: analog signals, especially voice signals, are digitized, transmitted, and then reconverted at the receiving end of a telecommunications link so as to represent with sufficient accuracy the original voice signal to be delivered. Digitizing voice in this way allows many voiceband channels to be multiplexed together, and the switching of these multiplexed voiceband channels is done economically using digital techniques. To date, such voice digitization has been applied mainly to long-distance transmission, where transmission costs dominate and significant savings are obtained by transmitting digitized voice signals digitally. Because the size and cost of microprocessors and memory continue to decrease, it is economically advantageous to extend such digitization techniques for analog signal processing to the terminal equipment at both ends of a telecommunications link. Within 10 to 15 years, digital transmission technology and equipment will probably dominate the telecommunications field completely, from subscriber terminal equipment through switching equipment to long-distance transmission equipment.

To further extend the benefits of digital technology in telecommunications, considerable effort is being made to reduce the number of bits in the digital signal representing speech without impairing reproduction of the original voice signal from which the digital signal is derived. This is done by processing the continuously varying voice signal, and the digital signal derived from it, so that a bit rate reduction is achieved. Various techniques have been developed for reducing the bit rate of digitized voice signals; these are generally called waveform coding techniques. They include adaptive differential pulse code modulation, sub-band coding, and transform coding, each of which can typically reduce the bit rate of a digitized voice signal by a factor of two or more. Another technique, used in the Bell System, is Time Assignment Speech Interpolation (TASI), which detects the silent periods in a conversation and does not transmit them. A further technique is vocoding, in which speech is analyzed to extract its essential parameters and is later synthesized from those parameters. At present, vocoding does not reproduce the natural characteristics of the speaker, and its use is limited to applications in which a very low bit rate is required, such as secure voice transmission.

Speech processing and digitization techniques for reducing the number of binary bits in a digitized voice transmission signal are also being applied in a relatively new field, in which digitized voice is stored and later read out and reconstructed into a voice signal. This is the store-and-forward approach: as with electronic mail, a voice message is digitized, stored, and later delivered to the recipient by telephone. Store-and-forward systems have been developed by many companies, including Wang, IBM, the Bell System, and VMX.

Store-and-forward systems require large amounts of memory; at present, relatively expensive disk memory is used.

When no bit rate reduction technique is applied to the digitized voice signal, speech is normally digitized at 64 kilobits per second, and about one megabyte of memory is needed for every two minutes of digitized speech. As is readily appreciated, an expandable store-and-forward system requires a very large amount of memory, so techniques are needed that reduce the memory required to store a given amount of digitized voice. One approach is to store only active speech, but this provides only a small amount of signal compression. The adaptive differential pulse code modulation, sub-band coding, and transform coding techniques mentioned above can compress the digitized signal by a factor of two, which is a considerable saving. Nevertheless, to minimize the memory needed to store such signals, the art requires still greater compression of digitized voice signals.
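The storage figure quoted above can be checked with a few lines of arithmetic. The sketch below assumes only the 64 kilobit per second rate stated in the text; the function name `bytes_needed` and the `compression_ratio` parameter are illustrative and do not come from the patent.

```python
# Storage needed for digitized speech at the uncompressed rate quoted in the text.
BIT_RATE = 64_000  # bits per second, as stated for uncompressed digitized speech


def bytes_needed(seconds, compression_ratio=1):
    """Bytes required to store `seconds` of speech at the given compression ratio."""
    return BIT_RATE * seconds // (8 * compression_ratio)


uncompressed = bytes_needed(120)       # two minutes, no bit rate reduction
halved = bytes_needed(120, 2)          # with a factor-of-two waveform coder
print(uncompressed, halved)            # 960000 480000
```

Two minutes of speech therefore occupies 960,000 bytes, which is the "about one megabyte" of the text, and a factor-of-two coder still leaves roughly half a megabyte per two minutes.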

OBJECTS AND FEATURES OF THE INVENTION

The above need is met by the present invention, which uses conventional bit rate compression techniques to achieve a 4:1 reduction in the binary signal representing a digitized voice signal.

In the preferred embodiment of the invention, the time domain harmonic scaling (TDHS) technique is used; applied alone, as in the prior art, it yields a 2:1 reduction in the binary data representing a digitized voice signal. The invention also uses the continuously variable slope delta (CVSD) modulation technique, which is normally used alone to obtain a bit rate reduction.

TDHS processing consists of time domain harmonic compression (TDHC) processing for data compression and time domain harmonic expansion (TDHE) processing for data expansion. Also used is the known technique of converting the 8-bit output of a nonlinear analog-to-digital converter into a 12-bit number, thereby linearizing the converter output. In addition, linear interpolation upsampling is used to raise the data sampling rate to the rate that continuously variable slope delta modulation needs in order to operate.

First, a continuously varying voice, audio, or other analog signal is sampled at a rate of 8,000 times per second, and each sample is linearly converted to a binary number. This is done by applying the voice signal to a conventional analog-to-digital converter, which provides an 8-bit binary output in U-255 LAW PCM format for each sample.

Because the analog-to-digital converter is inherently nonlinear, the digitized voice signal it outputs is further processed by a first microprocessor to remove the effect of the nonlinear conversion. In this linearization each 8-bit binary number is converted to a 12-bit binary number, and a first number of these 12-bit binary numbers, representing digitized samples of the audio signal, is accumulated in a first buffer memory. This is done in real time. When the first buffer memory has accumulated a predetermined number of binary numbers representing a given segment of speech, those binary words are transferred to a second buffer memory associated with a second, high-speed processor. The second processor performs the pitch detection of the TDHC technique, using autocorrelation to extract the pitch of the voice signal being input to the apparatus. TDHC then performs the bit rate reduction using a triangular weighting function, so that the sampled voice signal is represented by half of the first number of binary numbers. This is a 2:1 bit rate reduction, giving 4,000 samples per second.

This signal processing also includes CVSD processing, which normally requires an input sampling rate of at least 32,000 samples per second to function properly. The output of TDHC processing does not provide a sample rate high enough for proper operation of CVSD processing. In the invention, therefore, the output of TDHC processing is upsampled by a factor of four, effectively giving an output of 16,000 samples per second. This is done with well-known interpolation techniques that generate binary numbers between the binary numbers output by TDHC processing. Upsampling yields 16,000 binary samples per second, but because TDHC processing has compressed the audio spectrum by a factor of 2:1, CVSD processing operates as if it were receiving a digitized audio signal at a rate of 32,000 samples per second. CVSD provides one output bit for each input binary word. The individual bits output by CVSD processing are assembled into binary words by a serial/parallel converter. When one complete binary word has been assembled in the serial/parallel converter, the word is transferred from the converter to bulk memory. In this way each of the 12-bit binary numbers accumulated in the first buffer memory is converted to one binary bit contained in one of the binary numbers stored in bulk memory. Along with the binary numbers assembled by the converter, the pitch period value determined during TDHC processing is also stored in bulk memory in binary form. Because operation according to the invention achieves a 4:1 bit rate reduction of the original digitized voice signal, less memory is needed to store the digitized, compressed voice signal than in the prior art.
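The upsampling step and the resulting rate budget can be sketched as follows. This is a minimal illustration under stated assumptions, not the patent's implementation: the function name `upsample_linear`, the integer rounding, and the handling of the final sample are all choices made here for the example. The rates are the ones given in the text.

```python
def upsample_linear(samples, factor=4):
    """Insert factor-1 linearly interpolated values between neighbouring samples.

    Assumption: integer samples, interpolated values rounded down.
    """
    if not samples:
        return []
    out = []
    for a, b in zip(samples, samples[1:]):
        for k in range(factor):
            out.append(a + (b - a) * k // factor)
    out.append(samples[-1])  # the last sample has no right-hand neighbour
    return out


# Rate budget, using the figures from the text.
PCM_BIT_RATE = 8_000 * 8            # 8,000 samples/s, 8 bits each
TDHC_RATE = 8_000 // 2              # 2:1 reduction by TDHC
CVSD_BIT_RATE = TDHC_RATE * 4 * 1   # x4 upsampling, one CVSD bit per sample

print(upsample_linear([0, 8, 4]))      # [0, 2, 4, 6, 8, 7, 6, 5, 4]
print(PCM_BIT_RATE // CVSD_BIT_RATE)   # 4
```

The last line reproduces the 4:1 figure: 64,000 bits per second in, 16,000 bits per second out. The pitch word stored with each block adds a small overhead that this arithmetic ignores.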

To expand a voice signal that has been digitized and compressed according to the invention, the contents of bulk memory are read out one word at a time under control of the first microprocessor and input to the first buffer memory through a DMA interface. The data is then moved from the first buffer memory to the serial/parallel converter, which now performs parallel-to-serial conversion. As is well known, CVSD processing at this stage operates in its inverse mode, receiving each bit of the binary words output by the parallel-to-serial conversion and reconverting the bits to binary numbers. The binary numbers output by inverse CVSD processing are downsampled, which removes the interleaved binary numbers that were generated by interpolation during upsampling. The binary numbers remaining after 4:1 downsampling undergo time domain harmonic expansion (TDHE) processing.

TDHE processing uses the digitized pitch value read from bulk memory, together with the downsampled binary signal, to regenerate the linearized first number of binary numbers representing the voice signal originally processed by TDHC. The binary words generated by TDHE processing are delinearized to U-255 LAW PCM in the second processor before being applied to the converter, which now operates in digital-to-analog mode. The converter output accurately represents the voice signal that was originally digitized, compressed, and stored according to the teachings of the invention.

Another feature of the invention is that it provides speed control when the digitized, compressed voice signal is reconverted to a voice signal. To double the playback speed, the binary words obtained from downsampling do not undergo TDHE processing, which would double the number of binary words; instead they are delinearized and sent directly to the digital-to-analog converter, where they are reconverted to a voice signal. To slow the playback speed, TDHE processing slightly different from that used at normal speed is performed.

DESCRIPTION OF THE PREFERRED EMBODIMENT

The preferred embodiment of the invention, shown in block form in FIG. 1, generally comprises a programmable signal processor 9 that works with a host processor 7. Host processor 7 controls the storage or transfer of the digitized, compressed voice output by processor 9. Host processor 7 may be of ordinary processor configuration; here it includes control input 22, host microprocessor 23, program memory 24, output channel 25, input channel 26, and storage memory 34, which are interconnected by, and operate over, 8-bit bus 21. Control input 22 may be any type of control input for the system, such as keys, switches, or a keyboard. These inputs are used to activate processor 9 and to select the playback speed when the digital signal is reconverted to a voice signal. Program memory 24 holds the program that controls the operation of host microprocessor 23.

Output channel 25 is the path over which the digitized, compressed voice signal is output to an external memory device or data communication channel (neither shown). Similarly, input channel 26 receives digitized, compressed voice signals from an external memory or data communication channel (neither shown), to be stored in memory 34 or reconverted to a voice signal by processor 9.

Memory 34 is a large-capacity bulk memory used to store the digitized, compressed voice signals generated by processor 9 or received from an external source via input channel 26. Host processor 7 communicates with processor 9 through status circuit 36 and eight wires 28, which couple bus 21 to bus 19 of processor 9. DMA interface circuit 30 and serial/parallel converter 31 operate together and are controlled to transfer digitized, compressed voice signals between host processor 7 and processor 9. The operation of these circuits is described in detail later. Processor 9 consists of two basic sections. The first section includes converter 11, which functions as an analog-to-digital converter or a digital-to-analog converter, buffer memory 12, and input/output processor 13. Input/output processor 13 is an Intel 8089 I/O controller; it controls circuits 11 and 12 according to a program stored in program memory 18 to perform the basic digitization of the voice signal applied to the voice digitizer from analog input 10. Converter 11 is a National 3054 codec/filter combination chip. Analog input 10 may be a microphone, any other voice source, or another continuously varying signal. Likewise, when processor 9 is in playback mode, input/output processor 13 controls circuits 11 and 12 to reconvert the digital signal to a voice signal, which is applied to analog output circuit 33. Output circuit 33 may be a loudspeaker, a tape recorder, or another device.

The other section of processor 9 is microprocessor 14, which may be a Texas Instruments TMS-320 processor.

Internal data memory 15 in processor 14 serves as a buffer memory and is controlled by microprocessor 16 via bus 21. Multiplier circuit 17 is a 16 x 16 parallel multiplier with a 32-bit product; it performs the basic high-speed number crunching needed for real-time compression of the binary numbers. Circuits 15, 16, and 17 within TMS-320 processor 14 communicate via bus 20. The programs that control microprocessor 16 and I/O processor 13 are both stored in program memory 18. Processor 14, program memory 18, and the basic digitization circuits 11, 12, and 13 are connected to, and communicate over, 16-bit bus 19.

In basic operation, the voice signal from analog input circuit 10 is sampled by circuits 11, 12, and 13 in a well-known manner and converted to a number of digital words. The operation of circuits 11, 12, and 13 is described in detail later. These digitized voice signals are sent over bus 19 to processor 14. Processor 14 processes the digitized voice signal to achieve 4:1 data compression according to the teachings of the invention. The digitized and compressed voice signal is output from processor 14 one bit at a time and assembled into 8-bit binary words by serial-to-parallel converter 31.
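The role of serial-to-parallel converter 31 can be mimicked in software. The sketch below is only an illustration: the text does not say which bit of the word is filled first, so the most-significant-bit-first ordering, the function names, and the decision to drop a trailing partial word are all assumptions.

```python
def pack_bits(bits):
    """Assemble a stream of 0/1 values into 8-bit words (assumed MSB first).

    A trailing group of fewer than 8 bits is dropped in this sketch.
    """
    words = bytearray()
    current, count = 0, 0
    for bit in bits:
        current = (current << 1) | (bit & 1)
        count += 1
        if count == 8:            # the "converter" is full: emit one word
            words.append(current)
            current, count = 0, 0
    return bytes(words)


def unpack_bits(words):
    """Playback direction: 8-bit words back to a serial bit stream."""
    return [(w >> shift) & 1 for w in words for shift in range(7, -1, -1)]


bits = [1, 0, 1, 0, 0, 0, 0, 1] + [1] * 8
packed = pack_bits(bits)
print(packed.hex())                  # a1ff
print(unpack_bits(packed) == bits)   # True
```

`unpack_bits` corresponds to the same converter running as a parallel-to-serial converter during playback.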

When serial-to-parallel converter 31 is full, its contents are shifted out in parallel format and sent through DMA interface circuit 30 and host processor bus 27 to bulk memory 34, or via output 25 to an external memory or data communication link. In playback mode, the digitized, compressed voice signal is obtained from memory 34, or received from an external source via input 26, temporarily stored in buffer memory 12, and then sent to converter 31. Converter 31 now operates as a parallel-to-serial converter and supplies the data one bit at a time over bus 19 to processor 14. In playback mode processor 14 processes the incoming bits to decompress the data, then delinearizes the data to U-255 LAW PCM form, and finally sends the delinearized data directly to circuit 11. Circuit 11 now operates as a digital-to-analog converter, converting the binary words to a voice signal that is applied to analog output 33. All of the circuits outlined above operate in real time, digitizing and compressing the voice signal as it is input to the programmable signal processor via analog input 10. As mentioned earlier, in playback mode the operator can issue commands via control input 22 that cause microprocessor 14 to speed up or slow down the voice signal at analog output 33 while keeping the same voice pitch.

The basic operation of host processor 7 and processor 9 is as described above; the circuit of FIG. 1 is now described in further detail.

The voice signal input to processor 9 through analog input 10 is applied to the National converter mentioned above, which functions as an analog-to-digital converter or a digital-to-analog converter. As will be apparent to those skilled in the art, two separate converters may be used for these functions. In the preferred embodiment of the invention, converter 11 samples the voice signal at a rate of 8,000 times per second and digitizes the resulting data in 8-bit form. The output of converter 11 is in the known U-255 LAW PCM format. As is also known, this output has an inherent inaccuracy caused by the nonlinear nature of the sampling and digitizing process. A more expensive 12-bit digitizer could be used to correct this inaccuracy, but it would also take more space. In the invention, to save both space and cost, input/output processor 13 is used with digitizer 11 to process each 8-bit word output by digitizer 11 in a known manner, generating a 12-bit binary word that includes a correction for the nonlinearity of digitizer 11. This conversion from 8-bit words to 12-bit words is done in a known manner using a lookup table.
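A lookup table of this kind can be sketched as below. The patent does not give its table, so two things here are assumptions: the expansion follows the usual mu-law (G.711 style) segment formula, and the resulting 14-bit-range value is scaled down by a factor of four to fit a signed 12-bit word. The names `ulaw_to_linear14`, `to_12bit`, `LINEARIZE`, and `linearize` are hypothetical.

```python
def ulaw_to_linear14(code):
    """Expand one 8-bit mu-law code to a signed linear value (range +/-8031)."""
    u = ~code & 0xFF                  # mu-law bytes are stored bit-inverted
    exponent = (u >> 4) & 0x07        # 3-bit segment number
    mantissa = u & 0x0F               # 4-bit step within the segment
    magnitude = (((mantissa << 1) + 33) << exponent) - 33
    return -magnitude if u & 0x80 else magnitude


def to_12bit(value14):
    """Assumed scaling: divide the magnitude by four to fit a signed 12-bit word."""
    magnitude = abs(value14) >> 2
    return -magnitude if value14 < 0 else magnitude


# Built once; afterwards each incoming sample costs a single table access.
LINEARIZE = [to_12bit(ulaw_to_linear14(code)) for code in range(256)]


def linearize(block):
    """Convert a block of 8-bit companded samples to 12-bit linear words."""
    return [LINEARIZE[code] for code in block]


print(linearize([0xFF, 0x80, 0x00]))   # [0, 2007, -2007]
```

With only 256 possible input codes, a precomputed table is a natural fit for an I/O controller that must keep up with 8,000 samples per second.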

Each binary word output by digitizer 11 is corrected for nonlinearity by processor 13, and about 200 of the resulting 12-bit words are accumulated in one part of buffer memory 12. Memory 12 can hold 2,000 12-bit words in all; the actual block size varies with the pitch range of interest. When first buffer memory 12 has stored about 200 words, processor 13 transfers them over buses 19 and 20 and loads them into second buffer data memory 15 in TMS-320 processor 14. The next 200 binary words are stored in a different part of buffer memory 12, which avoids interference with the preceding 200 binary words while they are being transferred to memory 15.
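The buffering scheme described above, in which one region fills while the previous block is being transferred, is commonly called double (ping-pong) buffering. The sketch below illustrates the idea only; the class name, the two-region layout, and the fixed block size of 200 are assumptions, whereas the text describes a 2,000-word memory and a block size that varies with pitch range.

```python
class PingPongBuffer:
    """Two regions: one fills with new words while the other is handed off."""

    def __init__(self, block_size=200):
        self.block_size = block_size
        self.regions = [[], []]
        self.active = 0                     # index of the region being filled

    def push(self, word):
        """Store one word; return a full block when the region fills, else None."""
        region = self.regions[self.active]
        region.append(word)
        if len(region) < self.block_size:
            return None
        self.active ^= 1                    # later words go to the other region
        self.regions[self.active] = []      # its previous block was already handed off
        return region


buf = PingPongBuffer()
blocks = [b for b in (buf.push(w) for w in range(450)) if b is not None]
print(len(blocks), len(buf.regions[buf.active]))   # 2 50
```

Because a completed block is never written to again, the consumer can read it while new samples continue to arrive.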

The 200 binary words obtained by digitizing the input voice signal and transferred to data memory 15 are first processed by time domain harmonic compression (TDHC). This known technique is not described in detail here; it is fully described in the following references, which are incorporated by reference.

1. "Time Domain Algorithms for Harmonic Bandwidth Reduction and Time Scaling of Speech Signals", by D. Malah, IEEE Transactions on Acoustics, Speech, and Signal Processing, Volume ASSP-27, No. 2, pp. 121-133, April 1979.

2. "Combined Time Domain Harmonic Compression and CVSD for 7.2 kbit/s Transmission of Speech Signals", by D. Malah, Proceedings of IEEE ICASSP, Denver, Colorado, 1980, pp. 504-507, April 1980.

3. "New Approach to Speech Digitization Combining Time Domain Harmonic Scaling and Adaptive Residual Coding", by J. L. Melsa and A. K. Pande, Final Report on DCA 100-80-C-0050, August 1981.

4. "On the Use of Autocorrelation Analysis for Pitch Detection", by L. R. Rabiner, IEEE Transactions on Acoustics, Speech, and Signal Processing, Volume ASSP-25, No. 1, pp. 24-33, February 1977.

This time domain harmonic compression (TDHC) examines the digitized voice signal and locates the peaks in the autocorrelation of the waveform. The pitch period of the voice is derived from this information, and two pitch periods of digitized information are then averaged into one using a triangular weighting function, producing a compressed data set that provides a 2:1 data reduction. Conventionally, this TDHC processing was the only signal processing applied to a digitized voice signal to compress the data. In accordance with the teachings of the present invention, however, the already compressed data is processed further to achieve still greater compression.
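The two TDHC steps described above (autocorrelation pitch search, then blending two pitch periods into one) can be sketched as follows. This is a minimal illustration under stated assumptions, not the algorithm of the cited references: the function names, the lag search range, and the simple linear cross-fade standing in for the triangular weighting function are all hypothetical, and any samples left over after the last complete pair of pitch periods are simply dropped.

```python
import math


def synthetic_voiced_block(period, length):
    """Hypothetical test signal: a sine wave with a known pitch period."""
    return [math.sin(2.0 * math.pi * i / period) for i in range(length)]


def autocorrelation_pitch(samples, min_lag, max_lag):
    """Return the lag in [min_lag, max_lag] with the largest autocorrelation."""
    best_lag, best_score = min_lag, float("-inf")
    for lag in range(min_lag, max_lag + 1):
        score = sum(samples[i] * samples[i + lag]
                    for i in range(len(samples) - lag))
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


def compress_two_periods(samples, start, pitch):
    """Blend two adjacent pitch periods into one with a linear cross-fade."""
    out = []
    for n in range(pitch):
        w = 1.0 - n / pitch  # weight on the first period falls linearly from 1
        out.append(w * samples[start + n]
                   + (1.0 - w) * samples[start + n + pitch])
    return out


def tdhc_compress(samples, pitch):
    """Replace every complete pair of pitch periods with one blended period."""
    out = []
    for start in range(0, len(samples) - 2 * pitch + 1, 2 * pitch):
        out.extend(compress_two_periods(samples, start, pitch))
    return out
```

For a perfectly periodic block the blended period equals the original period, so a 200-sample block with a 20-sample pitch period reduces to 100 samples with the same waveform shape.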

The next major signal processing applied to the data, which has already been compressed 2:1, is performed using the known continuously variable slope delta modulation (CVSD) processing technique.

Normally, for this technique to function properly, the input sample rate must be at least 32,000 samples/second. The sampling rate after TDHC processing is 4,000 samples/second, which is insufficient for proper operation of the CVSD processing. To circumvent this problem, the partially compressed data obtained from the TDHC processing is upsampled by a factor of four using a known interpolation technique, which brings the total sample rate to 16,000 samples/second. This would normally still be insufficient for proper CVSD processing. However, because the preceding TDHC processing has compressed the frequency spectrum of the digitized voice by a factor of two, the CVSD processing, although it receives a sample rate of 16,000 samples/second, gives the same result as if it were operating at a sample rate of 32,000 samples/second. Since CVSD processing is known in the art it is not described in detail here, but a detailed description can be found in the following documents, which are cited by reference.

1. "Continuous Delta Modulation", by J. A. Greefkes, Philips Research Reports, No. 23, pp. 233-246, 1968.

2. "Delta Modulation Systems", by R. Steele, Halsted Press, London, 1975.

3. Specification manual for the "Continuously Variable Slope Delta Modulator/Demodulator" MC3418, Motorola Semiconductors, Phoenix, Arizona.

The 1:4 upsampling and 4:1 downsampling described below are not detailed in this specification, but they are described in detail in the following document.

"A General Program to Perform Sampling Rate Conversion of Data by Rational Ratios", by R. E. Crochiere of Bell Laboratories, Murray Hill, N.J., in Programs for Digital Signal Processing, IEEE Press, 1979, pages 8.2-1 to 8.2-7.
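As a rough illustration of the 1:4 upsampling and 4:1 downsampling, the sketch below uses plain linear interpolation, which the abstract names as the interpolation method. It is not the general rational-ratio program of the cited reference; the function names and the choice to hold the last value at the end of a block are assumptions made for the example.

```python
def upsample_linear(samples, factor=4):
    """Insert factor-1 linearly interpolated values after each input sample."""
    out = []
    for i, cur in enumerate(samples):
        # At the end of the block there is no next sample, so hold the last value.
        nxt = samples[i + 1] if i + 1 < len(samples) else cur
        for k in range(factor):
            out.append(cur + (nxt - cur) * k / factor)
    return out


def downsample(samples, factor=4):
    """Keep every factor-th sample, discarding the interpolated ones."""
    return samples[::factor]
```

Applied to the figures in the text, a stream at 4,000 samples/second becomes 16,000 samples/second after `upsample_linear`, and `downsample` recovers the original sample positions exactly because the interpolated values sit strictly between them.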

CVSD signal processing produces one binary bit of output for each binary word input. For the 200 binary words stored in data memory 15 for a short speech segment, a total of 200 binary bits are generated by the CVSD processing. Each of these binary bits is output via buses 20 and 19 and stored temporarily in serial-to-parallel converter 31, which assembles the bits into 8-bit binary words. Thus the 200 bits output by the CVSD processing are assembled by converter 31 into 25 eight-bit binary words. Host microprocessor 23 is informed of the processing status of the 200 binary words stored in data memory 15 of processor 14. Host microprocessor 23 causes each 8-bit binary word assembled in serial-to-parallel converter 31 to be transferred from there via DMA interface circuit 30 and either stored in bulk memory 34 or sent via output 25 to an external memory or a telecommunications link.
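The one-bit-per-word behaviour of CVSD and the serial-to-parallel packing can be sketched as below. This is a simplified software model, not the MC3418 circuit or the processor 14 firmware: the step-size limits, the growth and decay factors, and the three-bit run rule are illustrative assumptions, and the function names are hypothetical.

```python
def cvsd_encode(samples, min_step=1.0, max_step=64.0,
                growth=1.5, decay=0.9, run=3):
    """Emit one bit per input sample: 1 if the signal is at or above the
    running estimate, else 0. The step grows after `run` identical bits
    (slope overload) and decays otherwise."""
    estimate = 0.0
    step = min_step
    recent = []
    bits = []
    for x in samples:
        bit = 1 if x >= estimate else 0
        bits.append(bit)
        recent = (recent + [bit])[-run:]
        if len(recent) == run and len(set(recent)) == 1:
            step = min(step * growth, max_step)
        else:
            step = max(step * decay, min_step)
        estimate += step if bit else -step
    return bits


def pack_bits(bits):
    """Assemble bits into 8-bit words, first bit in the most significant place."""
    if len(bits) % 8:
        raise ValueError("bit count must be a multiple of 8")
    out = bytearray()
    for i in range(0, len(bits), 8):
        byte = 0
        for b in bits[i:i + 8]:
            byte = (byte << 1) | b
        out.append(byte)
    return bytes(out)
```

With 200 input words the encoder yields 200 bits, which `pack_bits` assembles into 25 eight-bit words, matching the counts given in the text.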

In addition, the pitch period determined by the TDHC processing, in the form of a binary word, is added to the 25 associated compressed binary words stored in bulk memory 34.
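One way to picture the stored unit is a record that carries the pitch word alongside the 25 compressed words. The layout below (a single pitch byte placed first) is purely an assumption for illustration; the text does not say where the pitch word sits or how wide it is.

```python
def make_record(pitch_period, cvsd_words):
    """Hypothetical layout: one pitch byte followed by the compressed words."""
    if not 0 <= pitch_period <= 255:
        raise ValueError("pitch period must fit in one byte in this sketch")
    return bytes([pitch_period]) + bytes(cvsd_words)


def split_record(record):
    """Return (pitch_period, compressed_words) from a record."""
    return record[0], record[1:]
```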

When the voice samples in the form of 200 binary words stored in data memory 15 have been processed as described above, processor 14 is ready to receive another 200 binary words representing digitized voice for processing. By this time another 200 voice samples in binary word form have been assembled in buffer memory 12, and these binary words are transferred via buses 19 and 20 to data memory 15. This new information in memory 15 is then processed in the manner described above. Thus voice digitizer 9 operates in real time, digitizing and compressing the voice signal input via analog input 10.
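The block-at-a-time flow can be sketched as a generator that hands out consecutive 200-word blocks. The function name is hypothetical, and dropping an incomplete trailing block is an assumption of this sketch; in the apparatus the buffer would simply wait until the block is full.

```python
def blocks(samples, size=200):
    """Yield consecutive full blocks of `size` samples."""
    for start in range(0, len(samples) - size + 1, size):
        yield samples[start:start + size]
```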

FIG. 2 shows the functional steps performed by the hardware of FIG. 1 described above. The voice signal is first sampled and digitized to the μ-255 law PCM standard (block 35). The digitized signal is then linearized (block 36). Next, the digitized and linearized voice signal undergoes signal processing that derives the pitch period of the voice signal and compresses the digitized samples by a factor of 2:1 (block 37). The partially compressed digital signal output by the TDHC processing is then upsampled by a factor of four (block 38). Thereafter the upsampled digital signal undergoes CVSD signal processing (block 39). The binary bits output by the CVSD processing are assembled into binary words, the number of which is compressed by a factor of 4:1 relative to the original digital signal generated in block 35. These compressed signals are combined with the pitch period value from path 40 and stored in memory 34.
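The linearization of block 36 undoes the logarithmic companding of the μ-255 law. The sketch below uses the continuous μ-law formula on values normalized to the range -1 to 1; it is an illustration of the law itself, and it assumes nothing about the table or word widths the apparatus actually uses.

```python
import math

MU = 255.0  # companding constant of the mu-255 law


def mu_compress(x):
    """Map a linear value in [-1, 1] to its companded value in [-1, 1]."""
    return math.copysign(math.log1p(MU * abs(x)) / math.log1p(MU), x)


def mu_expand(y):
    """Inverse of mu_compress: recover the linear value (linearization)."""
    return math.copysign(((1.0 + MU) ** abs(y) - 1.0) / MU, y)
```

Small amplitudes are stretched by the compressor (0.01 maps to roughly 0.23), which is the nonlinearity that has to be removed before averaging or interpolating samples.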

FIG. 3 shows the functional steps by which a voice signal that has been digitized and compressed in accordance with the teachings of the present invention is read from memory 34 and reconverted into a voice signal. The digitized, compressed data read from memory 34 includes a binary number representing the pitch period, which is used in the time domain harmonic expansion (TDHE) processing (block 43). The digitized, compressed voice first undergoes inverse CVSD processing to remove the effects of the compression produced by that form of signal processing. This inverse processing is known and is described in the references cited above. The expanded digital signal output from block 41 is downsampled by a factor of four to remove the binary numbers added to the digital information by the interpolation (block 38) of FIG. 2. The downsampled digital voice signal and the pitch period value from path 44 undergo TDHE processing (block 43) to derive the original binary signal produced by digitizing the voice signal. When the operator of the apparatus of the invention decides to change the playback speed, it is sufficient to give the apparatus an input to that effect. If double playback speed is desired, the downsampled digital signal output from block 42 bypasses block 43, does not undergo TDHE processing, and instead proceeds directly to the digital-to-analog conversion step.
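Inverse CVSD processing rebuilds a staircase estimate of the signal from the bit stream by running the same step-size rule as the encoder. The sketch below is a simplified model; the function name, the step limits, the growth and decay factors, and the three-bit run rule are illustrative assumptions and would have to match whatever the encoder uses.

```python
def cvsd_decode(bits, min_step=1.0, max_step=64.0,
                growth=1.5, decay=0.9, run=3):
    """Rebuild one sample per bit by stepping an estimate up (1) or down (0).
    The step grows after `run` identical bits and decays otherwise."""
    estimate = 0.0
    step = min_step
    recent = []
    out = []
    for bit in bits:
        recent = (recent + [bit])[-run:]
        if len(recent) == run and len(set(recent)) == 1:
            step = min(step * growth, max_step)
        else:
            step = max(step * decay, min_step)
        estimate += step if bit else -step
        out.append(estimate)
    return out
```

Because the decoder sees exactly the bits the encoder produced, its estimate tracks the encoder's internal estimate step for step, provided both use the same parameters.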

If slower playback is desired, the downsampled digital signal undergoes modified processing that expands the number of binary words. Next, processor 14 is used to remove the effects of linearization. As a result, the signal output from block 45 is in μ-255 law PCM format and is converted from digital form into voice (block 46). The playback operation has been outlined above with reference to FIG. 3; it is now described in more detail with reference to FIG. 1.
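The three playback choices differ only in how many output words each compressed pitch period yields. The sketch below makes that visible with a crude stand-in for TDHE that simply repeats each pitch period; real TDHE blends adjacent periods rather than repeating them. The function names, the mode strings, and the 4x expansion used for slow playback are assumptions, since the text gives no ratio for the slow case.

```python
def expand_periods(samples, pitch, repeats):
    """Crude stand-in for TDHE: repeat each complete pitch period."""
    out = []
    for start in range(0, len(samples) - pitch + 1, pitch):
        out.extend(samples[start:start + pitch] * repeats)
    return out


def playback_samples(samples, pitch, speed):
    """Choose the processing path for the requested playback speed."""
    if speed == "double":
        return list(samples)                       # bypass expansion entirely
    if speed == "normal":
        return expand_periods(samples, pitch, 2)   # undo the 2:1 compression
    if speed == "slow":
        return expand_periods(samples, pitch, 4)   # hypothetical ratio
    raise ValueError("unknown speed: " + speed)
```

Since the output is always converted at the same sample rate, half as many words play in half the time, and the pitch is unchanged because the pitch period itself is never stretched.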

In playback mode, the voice signal digitized and compressed in accordance with the teachings of the present invention is read from memory 34 in host processor 7, or is obtained from an external memory or a telecommunications channel (neither shown) via input 26. Each of the binary words making up these digitized, compressed signals is sent via host processor 7, DMA interface 30 and bus 19 to first buffer memory 12, then from there to processor 14 for expansion and inverse linearization, and thereafter to serial/parallel converter 31. At this time converter 31 functions as a parallel-to-serial converter. In this way the individual bits of each compressed data word are taken out one at a time and supplied to processor 14. Inverse CVSD processing is performed in a known manner, and the resulting expanded binary numbers are downsampled so that three of every four binary words, namely the binary words generated by interpolation during upsampling, are removed. The binary words output from the downsampling step either bypass the TDHE processing step or undergo it, so that the playback speed is doubled, kept normal, or slowed.
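The parallel-to-serial role of converter 31 in playback amounts to taking each stored 8-bit word apart again. A minimal sketch, assuming the most significant bit comes out first (the text does not state the bit order) and using a hypothetical function name:

```python
def unpack_bits(data):
    """Split each 8-bit word into its bits, most significant bit first."""
    return [(byte >> shift) & 1 for byte in data for shift in range(7, -1, -1)]
```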

When the digitized voice signal has been decompressed through the steps described above, the expanded binary numbers undergo inverse linearization to μ-255 law PCM and are then sent directly to D/A converter 11, where they are reconverted from binary words into the original voice signal and delivered to analog output 33.

For the method of converting from 12 bits to 8 bits when reconverting the compressed digital voice signal into voice, see the following document: "A New Digital Technique for Implementation of Any Continuous PCM Companding Law", by M. Villeret, P. A. Deschenes and H. Stephenne of the Electrical Engineering Department of the University of Sherbrooke, Sherbrooke, Quebec, Canada; 1973 IEEE International Conference on Communications, Conference Record, Volume 1, pages 11-12 to 11-17.

Although a preferred embodiment of the present invention has been described above, it will be apparent to those skilled in the art that many modifications are possible without departing from the spirit of the invention. For example, if the analog-to-digital conversion sampling is performed at 32,000 samples/second, the 1:4 upsampling before CVSD signal processing is not required. Other types of signal processing may also be used. Instead of TDHS processing, another process may be used that reduces the number of binary words representing the voice signal while deriving the pitch of the voice signal and compressing the spectrum. Similarly, instead of CVSD processing, a process may be used that encodes the binary words representing sample points as binary bits representing the slope of the voice signal.

Other waveform coding techniques may be used to encode the binary words. TDHS and/or CVSD processing, or the processes substituted for them, may be performed more than once in compression and expansion.

For playback speed control, the binary words may be input to an appropriate processing step so as to affect the degree of binary word expansion. Those skilled in the art may also compress the speech information further by detecting quiet or silent periods and not digitizing them. The invention may be used with modems and hence in other signal processing applications.
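The silence-skipping variation could be approached by measuring the energy of each block and discarding blocks that fall below a threshold. The sketch below is only one possible reading: the mean-square energy measure, the threshold value, and the function names are assumptions, and a practical system would also have to record the length of each skipped gap so that timing can be restored on playback.

```python
def is_silent(block, threshold):
    """Treat a block as silent when its mean-square energy is below threshold."""
    energy = sum(s * s for s in block) / len(block)
    return energy < threshold


def drop_silent_blocks(block_list, threshold):
    """Keep only the blocks that carry speech energy."""
    return [b for b in block_list if not is_silent(b, threshold)]
```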

[Brief Description of the Drawings]

FIG. 1 is a detailed block diagram of the hardware elements implementing the present invention; FIG. 2 is a block diagram showing the signal processing steps for digitizing and compressing a voice signal in accordance with the invention; and FIG. 3 is a block diagram showing the signal processing steps for reconverting a digitized, compressed voice signal back into a voice signal.

7: host processor
9: programmable signal processor
10: analog input
11: analog/digital (digital/analog) converter
12: buffer memory
13: input/output processor
15: internal data memory
16: microprocessor
17: multiplier circuit
18: program memory
23: host microprocessor
24: program memory
25: output channel
26: input channel
33: analog output

Patent applicant: Wang Laboratories, Incorporated (and four others)

Lowell, Massachusetts, United States of America, Park Avenue West, Bellydia Townhouse (no street number)

Inventor: Elaine Chipouras Grosso, 7 Lampson Lane, Billerica, Massachusetts, United States of America

Claims

[Claims]

(1) A method for compressing a first number of binary words representing a continuously varying signal over a given time period, wherein the continuously varying signal is sampled at a predetermined rate per unit time and digitized into said first number of binary words, said continuously varying signal having a pitch value, the method comprising: (a) extracting the pitch value of the continuously varying signal and representing the pitch value as one binary word; (b) using a first signal processing technique to reduce the first number of binary words to a second number of binary words representing the continuously varying signal; and (c) encoding each of said second number of binary words as one binary bit representing a slope of said continuously varying signal, using a second signal processing technique that processes binary words representing sample points, thereby reducing the second number of binary words to a plurality of binary bits.

(2) The method further comprising the step of assembling the bits provided by the second signal processing technique into multi-bit binary words, wherein the multi-bit binary words and the binary word representing the pitch value represent the continuously varying signal and are compressed substantially on the order of 4:1 relative to said first number of binary words representing said continuously varying signal.

(3) The method of claim 2, further comprising the step of increasing the second number of binary words when the second number of binary words obtained from the first signal processing technique is insufficient for proper processing in step (c).

(4) A method of digitizing a continuously varying signal and reducing the number of resulting binary words, comprising: (a) digitizing the continuously varying signal into a first number of binary words representing said continuously varying signal over a given period of time; (b) extracting the pitch value of the continuously varying signal from the first number of binary words and representing it as one binary word; (c) reducing the first number of binary words to a second number of binary words using a first signal processing technique; and (d) encoding each of the second number of binary words, using a second signal processing technique that processes binary words representing sample points of the continuously varying signal, so as to reduce them to smaller binary representations.

(5) The method of claim 4, wherein the digitizing step comprises sampling the continuously varying signal at multiple points and generating one binary number for each point, and modifying each said binary number so as to correct the nonlinearity inherent in the sampling, thereby generating said first number of binary words.

(6) The method further comprising assembling the smaller binary representations provided by the second signal processing technique into larger binary words, wherein the larger binary words and the binary word representing the pitch value represent the continuously varying signal and are compressed substantially on the order of 4:1 compared to said first number of binary words representing said continuously varying signal.

(7) The method of claim 5, further comprising, where the second number of binary words obtained from step (c) is insufficient for proper processing in step (d), increasing the second number of binary words by interpolating between pairs of the second number of binary words.

(8) The method of claim 6, further comprising the step of assembling the first number of binary words obtained by digitizing the continuously varying signal into a block of binary words, wherein steps (b) and (c) are performed on said block of binary words.

(9) A method for providing control of playback speed in a system in which a continuously varying signal is digitized into a first number of binary words; the pitch of the continuously varying signal is derived using the first number of binary words and represented as one binary word; a signal processing technique is used to reduce the first number of binary words to a second number of binary words representing the continuously varying signal; and the continuously varying signal is reproduced by processing said second number of binary words and said pitch binary word in an inverse manner to derive said first number of binary words, which are then digital-to-analog converted and reconverted into said continuously varying signal; the method comprising expanding the second number of binary words into a plurality of binary words different in number from the first number of binary words, so that when the plurality of binary words is converted into the continuously varying signal, the playback speed of the continuously varying signal differs from the speed at which the original continuously varying signal was digitized.

(10) The method of claim 9, further comprising reducing the predetermined number of binary words to a reduced number of binary words by removing periodic ones therein, the reduced number of binary words being processed in an inverse manner by the signal processing technique.

(11) A method for providing control of playback speed in a system in which a continuously varying signal is digitized into a first number of binary words; the first number of binary words is used to derive the pitch of the continuously varying signal, which is represented as one binary word; a signal processing technique is used to reduce the first number of binary words to a second number of binary words representing the continuously varying signal; and the continuously varying signal is reproduced by processing the second number of binary words and the pitch binary word in an inverse manner using the signal processing technique to derive the first number of binary words, which are then digital-to-analog converted and reconverted into said continuously varying signal; the method, according to claim 1, comprising the step of applying said digital-to-analog conversion directly to said second number of binary words, so that the playback speed is increased by a factor of two when the second number of binary words is half the first number of binary words.

(12) The method of claim 11, wherein, when the second number of binary words is converted into the first number of binary words by inverse processing using the signal processing technique, the continuously varying signal is played back at the same speed as when it was first digitized.

(13) The method of claim 12, further comprising reducing the second number of binary words to a reduced number of binary words by removing periodic ones therein, the reduced number of binary words being processed in an inverse manner by the signal processing technique.