JPS63253994A

JPS63253994A - Sound wave information recording/reproducing system

Info

Publication number: JPS63253994A
Application number: JP62088315A
Authority: JP
Inventors: 浩志村; 洋一森
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 1987-04-10
Filing date: 1987-04-10
Publication date: 1988-10-20

Abstract

(57) [Abstract] This publication contains application data filed before electronic filing, so no abstract data is recorded.

Description

DETAILED DESCRIPTION OF THE INVENTION

Technical Field

The present invention relates to a sound wave information recording and reproducing apparatus and, more specifically, to the compression of sound wave information in a recording and reproducing apparatus that records sound wave information on a sheet and reproduces it from that sheet.

Prior Art

Records and magnetic tape are well-known conventional media for recording sound wave information. However, both require dedicated equipment (such as a tape recorder) for recording and playback.

More recently, cards and sheets have been coated with a magnetic thin film: magnetic information is written to the film, while characters, pictures, photographs, symbols, and the like are printed on the card or sheet. A single card or sheet thus carries both magnetic information and visible information.

The magnetic information can also be audio information. Recording both visible and magnetic information, however, requires two devices: a printing device and a magnetic recording device.

These two devices operate on entirely different recording principles. Although they might share a card or sheet transport mechanism, each needs its own write head and head driver.

With the recent development of office automation (OA) equipment, document creation and editing (character information processing) are now widely performed on word processors. The input, correction, and creation of illustrations and halftone images (illustration information processing and halftone information processing) are also easily performed with a scanner, an input board or input tablet, and, where necessary, an additional input means such as a mouse. Advanced editing that combines these kinds of information processing on a single screen has also been put into practical use. Audio information, however, is still recorded on magnetic media and requires a dictation recorder (tape recorder). Consequently, recording visible information and audio (sound wave) information on a single card or sheet requires that a magnetic thin film be coated on or attached to the card or sheet, and also requires two recording means: a printing machine (including a copier) or printer as the visible recording means, and a magnetic recording device.

In view of these circumstances, the applicant previously proposed an audio information recording and reproducing apparatus that makes it possible to record audio information visibly and reproducibly on a card or sheet, and to record, with a single recording means, reproducible audio information in addition to visible information such as character information, illustration information, and halftone information.

FIG. 10 is an external view showing an example of the audio information recording and reproducing apparatus previously proposed by the applicant, and FIG. 11 is a system diagram explaining its operation. In FIGS. 10 and 11, a microphone 1211, a speaker 1221, a keyboard 1172, an image reading scanner 1171, and a laser printer 1173 are connected to a word processor main body 100 equipped with a CRT display 1161 and floppy disk devices 118 and 119. The combination of the word processor main body 100, the keyboard 1172, and the laser printer 1173 is a so-called Japanese word processor that also handles English text: the operator composes text on the keyboard 1172 and either records it on a floppy disk with the disk device 119 or prints it out on the laser printer 1173.

The combination of the word processor main body 100, the keyboard 1172, and the scanner 1171 is a so-called image processing apparatus: an image read by the scanner is edited as necessary and then recorded on a floppy disk or printed out.

The word processor main body 100 is connected to a local network. The word processor main body 100, the keyboard 1172, the scanner 1171, and the laser printer 1173 together constitute a so-called facsimile apparatus, which prints out images received from other stations (facsimile apparatuses) and transmits to other stations the image data of an original set in the scanner, data recorded on a floppy disk, or data stored in the internal memory of the word processor main body 100.

The combination of the scanner 1171 and the laser printer 1173 is configured as a so-called digital copier and operates in three modes: an image reading mode, in which only the scanner operates and supplies a (digital) video signal to the host (the word processor main body 100); a print mode, in which a (digital) video signal from the host is printed out; and a copy mode, in which an original image is read by the scanner and printed out.

In addition to the conventional functions described above, the apparatus can store, reproduce, and print out sound spoken into the microphone 1211. The audio data is processed either alone or in combination with image information. These functions are summarized as follows.

a. Document creation and editing function: creates and edits various documents. In addition to the functions of a standard Japanese/Western-language word processor, it provides the use of various fonts; image input and editing; creation and processing of forms, graphs, and numerical tables; composite editing of these; page layout; and document formatting.

b. Audio recording and editing function: converts sound captured by the microphone into compressed data, and processes this compressed data alone or in combination with the document data of item a.

c. Audio printout function: prints out the compressed data of item b.

d. Audio reproduction function: reproduces sound from compressed data held in RAM or on a floppy disk, and from compressed data read by the scanner. It can also reproduce sound captured by the microphone through the speaker at the same time.

e. Commercial program function: runs commercially available programs.

f. Terminal function: searches files on, and uses programs of, devices connected via a communication station.

g. Print function: prints created documents, compressed audio data, and the like.

h. Copy function: performs ordinary copying.

i. Storage function: records created documents, compressed audio data, and the like on a floppy disk.

j. Search function: searches files on a file station (not shown) connected to the local network.

k. Transmission function: sends and receives documents, compressed audio data, and the like to and from other stations (not shown) over the local network, and to and from external devices via a communication station (not shown).

As shown in FIG. 11, the word processor main body 100 is connected through its communication control unit (CCU) 120 and a transceiver TR to a local network cable 1201.

This cable 1201 is, for example, the local network cable disclosed in Japanese Patent Application No. Sho 57-230828, to which various stations are connected. The word processor main body 100 is one workstation on this network.

The word processor main body 100, as a workstation, accepts mixed information consisting of audio (compressed data), documents (text and graphics), and images (pixels), or any part of such information. Here, compressed data is data obtained by A/D-converting the analog audio signal into 8-bit digital data in the digital processing circuit 121 and then compressing it into 4-bit data by predictive coding in CPU2 (112). Text is a set of coded characters. Graphics are coded figure information, for example a set of commands for drawing circles, arcs, and the like. An image (pixels) is bit-string information in which the image is divided into picture elements (dot pixels) and the black/white, light/dark, or color information of each element is represented by bits "1" and "0"; it is input from the scanner 1171 of FIG. 10. Characters and figures can also be input from the scanner 1171, but at the time of input they are treated as image information.

The word processor main body 100 also has functions for creating and editing documents. In addition to the functions of a standard Japanese/Western-language word processor, it provides the use of various fonts; image editing; creation of forms, graphs, and illustrations; creation and processing of numerical tables; composite editing of these; page layout; and document formatting. In these processes, compressed audio data is handled in the same way as the other data. The main body also has a program creation function, so that programs can be written in standard languages.

It also has a terminal function: it can search files on, and use programs of, a host computer connected via a communication station.

It further has a storage function: documents (including compressed audio data; the same applies below) and files created on the main body 100, as well as documents and files transferred from a file station, a host computer, a facsimile apparatus, other workstations, and so on, can be stored on a floppy disk. Image information can, if so specified, be data-compressed before storage. The main body 100 also has a search function: it can search files on the file station and the floppy files of the main body 100, which is its own workstation.

The main body 100, as a workstation, further has a transmission function: using the resources of the network, it sends and receives documents and messages to and from other stations, such as an image synthesizing apparatus or a facsimile apparatus.

Image information can, if so specified, be data-compressed before transfer.

Data can be input to the main body 100 from the microphone 1211, the keyboard 1172, or the scanner 1171; from outside as a facsimile signal over a communication line; or from a host computer.

When a completed document is to be transferred to another workstation, sent via facsimile, or duplicated on the laser printer 1173, it is input from the scanner 1171 as pixel (image) information. In this case the main body 100 handles all of the information as pixels (dot bits; bits in the case of compressed audio data), data-compresses it, and sends it onto the network. If the destination is a workstation, the transmission is treated as electronic mail: if that workstation is busy, a notice that mail has arrived is displayed in part of the screen of the display device 1161 to inform the operator, who can call the document up on the screen at a convenient time. Because the information consists of pixels, the data-compressed format is decompressed and converted in line density to pixels for screen display before being sent to the display 1161. The original compressed information remains in memory unchanged, and the operator decides whether to save or erase it.

The scanner is used, first, to read various kinds of images and, second, to input large volumes of documents. There are three cases: the image or document consists entirely of compressed audio data (typically, reproduction of sound from a recording sheet); part of it is compressed audio data; or it contains no compressed audio data at all.

When a document mixing audio, text, graphs (illustrations), and pictures is created on the word processor main body 100, the audio is input from the microphone 1211, the text from the keyboard 1172, the illustrations from the keyboard 1172 (its graph-processing input), and the pictures from the scanner 1171, and these are then combined. By running an image processing program on the main body 100, the elements can be freely edited, for example moved, enlarged, or reduced.

To assist this work, the display 1161 supports multi-window processing. The screen can show either a fully divided (tiled) layout or a layout resembling several sheets of paper stacked with an offset. In the former each image is reduced; in the latter the image size is unchanged but part of each image is hidden. The system status, the commands currently available, and the like are also shown in a system area on the display.

When a compound document created in this way is stored in memory, it is stored as codes (character information, illustration information) and compressed pixels (halftone images, pictures, audio). When the compound document is sent to another workstation in the same way as the image information described above, the receiving workstation takes the codes and the compressed pixel information and, in order to display them on the display 1161, expands a bit stream onto bitmap memory by passing the codes (characters) through a character generator and decompressing the pixels (pictures, audio). When this bit stream is DMA-transferred to the video RAM 116 of the display 1161, exactly the same information as on the sending workstation is displayed on the display 1161 of the receiving workstation. The file station likewise expands the bit stream onto bitmap memory, and thereafter prints it or stores it in a file by the same procedure as for the pixel information described above.

FIG. 12 is a circuit diagram showing details of the digital processing circuit 121 and the analog processing circuit 122 shown in FIG. 11. The digital processing circuit 121, the interface that A/D-converts the audio signal, consists of an amplifier that amplifies the audio signal from the microphone 1211, an offset adjuster that adjusts the audio signal level, a low-pass filter, a sample-and-hold circuit, and an A/D converter. It converts the audio signal into 8-bit data and supplies it to the microprocessor 112 via the bus 1121.

The microprocessor 112 compresses the 8-bit data to 4 bits by predictive coding, handles the compressed data in the same way as pixel information, and writes it to memory.
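The description does not state which predictor or quantizer is used, so the following is only a minimal sketch of one common form of predictive coding (first-order DPCM with a fixed step size). The names `STEP`, `encode`, and `decode`, the step value, and the mid-scale starting prediction are all assumptions made for illustration.

```python
STEP = 4  # hypothetical quantizer step size; not given in the source


def _clamp(value, low, high):
    return max(low, min(high, value))


def encode(samples):
    """Compress 8-bit samples (0..255) into 4-bit codes (0..15)."""
    codes = []
    predicted = 128  # assumed mid-scale starting prediction
    for sample in samples:
        # Quantize the prediction error to a signed 4-bit range (-8..7).
        diff = _clamp(round((sample - predicted) / STEP), -8, 7)
        codes.append(diff + 8)  # store as an unsigned 4-bit value
        # Follow the decoder's reconstruction so both sides stay in step.
        predicted = _clamp(predicted + diff * STEP, 0, 255)
    return codes


def decode(codes):
    """Expand 4-bit codes back into approximate 8-bit samples."""
    samples = []
    predicted = 128
    for code in codes:
        predicted = _clamp(predicted + (code - 8) * STEP, 0, 255)
        samples.append(predicted)
    return samples
```

With a fixed step, slowly varying input is reconstructed to within half a step, while abrupt jumps are tracked over several samples; an adaptive step size would trade that off differently.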

For audio reproduction, the microprocessor 112 decodes the audio pixel information (compressed data) to obtain 8-bit data and supplies it via the bus 1121 to the analog processing circuit 122, the interface that reproduces the (analog) audio signal. The analog processing circuit 122 consists of a D/A converter that converts the 8-bit data into an analog signal, a buffer amplifier, a low-pass filter, and an output amplifier, and drives the speaker 1221 at a level corresponding to the 8-bit data.

As described above, the word processor main body 100 has a wide range of functions and handles very large volumes of image and audio information, so it uses a plurality of processors (CPUs) operating in parallel. As shown in FIG. 11, the main processor 111 and the sub-processor 112 are connected by a multibus 111o; communication between them takes place through the main memory 113, and coordination between them is effected by interrupt signals or status signals.

Image data is transferred from the processor 111 to the processor 112 through the memory 113.

Local buses 1111 and 1110 are connected to the processors 111 and 112, respectively. Connected to these local buses are: memories 114 and 115; the keyboard 1172, scanner 1171, and printer 1173 via a parallel I/O; a coding compressor/decoder 1151; image processing units 1152 and 1153; external memories or FDDs (floppy disk drive units) 1181 and 1191 via controllers 118 and 119; the CRT display device 1161 via a controller 116; and the communication control unit 120.

The communication control unit 120 connects the word processor main body 100 to the communication line. While connected to the line, it connects the processor 111 to the line when a call arrives from the line and when the processor 111 calls the line (another station).

The processor 111 (CPU1) functions as the main processor of the word processor and handles all tasks except those for the display. The OS (operating system) of the word processor 100 therefore runs on this processor 111.

When idle, it can run a diagnostic program. It also incorporates parallel I/O, serial I/O ports, a timer, and an interrupt control circuit.

The processor 112 (CPU2) is, as noted above, subordinate to the processor 111 (CPU1) and is used for CRT display image processing and for audio data compression and reproduction. Using the A/D-converted data sent from the digital processing circuit 121 (handled in the same way as pixels) and the character codes, pixels, vectors, and the like sent from the processor 111 via the memory 113, it composes the final bitmap of the mixed text-and-picture document in RAM (320 KB). It then transfers the completed bitmap to the VRAM (192 KB) in the CRT controller 116.

During audio reproduction, the compressed audio data is extracted from the bitmap in RAM and decoded one word (4 bits) at a time into 8-bit data, which is supplied to the analog processing circuit 122.
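How the 4-bit words are laid out as dots in the bitmap is not described, so the sketch below simply assumes that each word occupies four consecutive dots, most significant bit first. `codes_to_dots` and `dots_to_codes` are hypothetical helper names.

```python
def codes_to_dots(codes):
    """Expand 4-bit words into a flat list of dots (1 = black, 0 = white)."""
    dots = []
    for code in codes:
        for shift in (3, 2, 1, 0):  # assumed order: most significant bit first
            dots.append((code >> shift) & 1)
    return dots


def dots_to_codes(dots):
    """Regroup dots into 4-bit words, ignoring an incomplete trailing group."""
    codes = []
    usable = len(dots) - len(dots) % 4
    for i in range(0, usable, 4):
        b3, b2, b1, b0 = dots[i:i + 4]
        codes.append((b3 << 3) | (b2 << 2) | (b1 << 1) | b0)
    return codes
```

Each recovered word would then be passed to the predictive decoder to rebuild one 8-bit sample.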

The CRT controller 116 generates the horizontal and vertical synchronization signals and the video signal for a high-resolution CRT display.

It incorporates VRAM as display memory, to which data is transferred from the RAM in the processor 112. The controller 116 can drive either a landscape CRT for Japanese text or a portrait CRT for English text (resolution 945 × 1260 dots), both monochrome raster-scan types.

The memory 113 is used to transfer image data (character codes, pixels, vectors) from the processor 111 to the processor 112. A character generator resides in part of the memory area. The memory address space is 1024 KB.

The memory 114 is the main memory of the word processor 100 and has a memory address space of 1.5 MB.

It also has a dual-port function, that is, an interface to the local bus 1111 and an interface to the parallel I/O module 117. Scanner data is thereby transferred directly from the parallel I/O module, as are the codes from the keyboard and the mouse (cursor position indication).

The parallel I/O 117 provides 12 ports (96 bits) as a parallel I/O interface and transfers scanner data (data from the keyboard, mouse, and scanner) directly to the memory 114 without passing through the local bus 1111.

The keyboard 1172 has three sets of character keys (for kana-kanji conversion, for tablet kanji input, and for alphabetic characters) and 16 function keys.

The scanner 1171 has a maximum reading size of A3 and a resolution of 12 dots/mm (300 DPI), and reads sheet-type originals.

The FDCs 118 and 119 control the FDDs (floppy disk drive units) 1181 and 1191.

Double-sided, double-density floppy disks (1 MB per drive) are used. They store programs, local files, the kana-kanji conversion dictionary, and the character generator, and also serve as log-out memory.

Once the floppy data has been read into memory, a separate floppy disk for recording documents can be loaded into one of the drives, and created documents are recorded to it and recorded documents read from it.

The image processing unit (IPU1) 1151 provides binary DCR (data compression and reproduction).

The image processing unit (IPU2) 1152 performs density conversion and enlargement/reduction. The supported density conversions are 12 → 4 dots/mm, 12 → 6 dots/mm, 12 → 8 dots/mm, and 8 → 12 dots/mm, and the enlargement/reduction ratio can be set from 0.5× to 2× in steps of 0.1.
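The resampling method used by IPU2 is not described. As a rough illustration of what a density conversion involves, the sketch below resamples one scan line by nearest-neighbour selection, which is an assumption; `convert_density` is a hypothetical name, and real hardware may combine neighbouring dots instead.

```python
def convert_density(row, src_dpmm, dst_dpmm):
    """Resample one scan line from src_dpmm to dst_dpmm dots/mm."""
    out_len = len(row) * dst_dpmm // src_dpmm
    # Each output dot copies the source dot at the same physical position.
    return [row[i * src_dpmm // dst_dpmm] for i in range(out_len)]
```

For example, a 12 → 4 dots/mm conversion keeps every third dot, while 8 → 12 dots/mm repeats some dots so that 8 source dots become 12.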

The image processing unit (IPU3) 1153 provides image rotation, in steps of +90°.
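Rotation in 90° steps can be sketched on a bitmap held as a list of rows. The source does not say which direction counts as positive, so clockwise is assumed here; `rotate_90` and `rotate` are hypothetical names.

```python
def rotate_90(bitmap):
    """Rotate a bitmap (list of equal-length rows) 90 degrees clockwise."""
    return [list(column) for column in zip(*bitmap[::-1])]


def rotate(bitmap, steps):
    """Apply the 90-degree rotation the given number of times."""
    for _ in range(steps % 4):
        bitmap = rotate_90(bitmap)
    return bitmap
```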

The communication control unit 120 controls the exchange of data transmitted over the local network and performs protocol control covering at least the layers up to the data link level. Data link layer control comprises data encapsulation and decapsulation (framing, addressing, error detection) and link management (channel allocation (collision avoidance) and collision handling). Physical layer control comprises data encoding (preamble generation and removal for synchronization, bit encoding and decoding) and channel access (bit transmission and reception, carrier sensing, collision detection).
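The frame format of the local network is not given in this description. Purely to illustrate the data link steps named above (framing, addressing, error detection), the sketch below assembles and disassembles a made-up frame with one-byte addresses and a CRC-32 trailer; the layout and the names `build_frame` and `parse_frame` are assumptions, not the network's actual format.

```python
import struct
import zlib


def build_frame(dest, src, payload):
    """Assemble a frame: 1-byte destination, 1-byte source, payload, CRC-32."""
    body = struct.pack("BB", dest, src) + payload
    return body + struct.pack(">I", zlib.crc32(body))


def parse_frame(frame):
    """Disassemble a frame; raise ValueError if it is damaged."""
    if len(frame) < 6:
        raise ValueError("frame too short")
    body = frame[:-4]
    (crc,) = struct.unpack(">I", frame[-4:])
    if zlib.crc32(body) != crc:
        raise ValueError("checksum mismatch")
    return body[0], body[1], body[2:]
```

A receiver would compare the destination address with its own station address and discard frames that fail the checksum.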

The transceiver TR is connected directly to the communication medium (coaxial cable) of the local network, and the communication control unit 120 is connected to the transceiver TR by a transceiver cable.

Thus, with the above apparatus, audio data is visibly recorded on paper or an equivalent sheet in the same way as ordinary characters or images. The recorded image of the audio data is visible, but a person cannot normally recognize its meaning. However, when a widely used scanner, such as one used in a facsimile apparatus, reads the image on the recording sheet to obtain the audio data, the data is converted back into an audio signal by the inverse of the digital conversion logic, and the speaker is driven with that signal, the recorded sound is reproduced. The apparatus uses digital conversion means that A/D-converts the level of the analog audio signal into digital data with a relatively large number of bits, for example 8 bits or more, and then predictively encodes that data to compress it into digital data with a relatively small number of bits, for example 4 bits or less. Digital processing of the audio signal can therefore be carried out with relatively simple hardware and relatively simple program processing.

Moreover, because the number of recorded dots (bits) is reduced, audio of relatively long duration can be recorded on a single sheet of paper or a similar sheet or card (hereinafter, paper and similar sheets or cards are collectively called a sheet). For example, word processors are often used for shorthand or dictation; when keying on the word processor falls behind the dictation, the operator can instruct audio recording from the keyboard to switch from text input to audio recording, or text input and audio recording can proceed in parallel.

In some cases it is also desirable to record the live voice of the author (or the person dictating) on the text printed by a word processor, in place of or together with a signature. In such cases, if the CPU and related parts of the word processor are shared as the printing means and the hardware of the digital conversion means is connected to them, so that an audio-signal-to-digital-signal converter is built into the word processor, then text and audio data can be recorded together on a single sheet, or on separate sheets. The recording apparatus is essentially a word processor and needs no audio recording device such as a tape recorder. Because sheets of the same material serve for both text and audio data, no special audio recording medium is needed either.

Furthermore, in CAD (computer-aided design) it is desirable to add the designer's spoken explanation, identifying voice, or the like to recorded drawings or illustrations. In facsimile as well, rather than speaking by telephone separately from sending the document, it is desirable that the sender's voice be reproduced automatically from the data recorded on the document, either at the time the document is received or by reproducing the audio signal from the received paper. It is likewise desirable to record explanatory speech, background music, natural sounds, recitation, and so on in combination with graphs, illustrations, photographs, or pictures that have been entered or read with a keyboard, input tablet, mouse, and/or image scanner. The above apparatus can be used for any of these purposes.

However, because the conventional apparatus described above encodes only the waveform, a high compression ratio could not be obtained.

The present invention was made in view of the circumstances described above. Its particular object is to obtain a higher compression ratio in conventional waveform-coding compression methods (such as DPCM) by separating out silent parts and, further, separating out similar waveforms.

To achieve the above object, the present invention is characterized by any of the following. In silence/sound discrimination, a silent part is determined when inputs whose absolute amplitude is smaller than a predetermined value continue for a predetermined number or more, and a sound part is determined when inputs whose absolute amplitude is larger than a predetermined value continue for a predetermined number or more. In detecting the pitch of an audio waveform, maxima and minima with large absolute amplitude are detected alternately, and a pitch is detected when the time between successive maxima (or minima) repeats, with a deviation within a predetermined range, a predetermined number of times or more in succession. In detecting waveform-similar portions after the pitch has been detected, the waveform in the pitch of interest is regarded as similar to the waveform of the preceding pitch when the number of extrema within one pitch equals the number of extrema within the preceding pitch. The invention is described below on the basis of an embodiment.

FIG. 1 is a block diagram for explaining one embodiment of the present invention. In the figure, 1 is a silence/sound determination section, 2 is a silent-time coding section, 3 is a smoothing section, 4 is a pitch detection section, 5 is a waveform-similar-portion detection section, 6 is an amplitude and pitch coding section (in the experiment described later, however, the same waveform as the preceding one is used, and when five or more similar portions occur in succession, waveform coding is performed once every five), and 7 is a waveform coding section (no compression is applied here, however).

FIGS. 2 to 5 are flowcharts for explaining the operation of the blocks shown in FIG. 1. FIG. 2 is the flowchart of the silence/sound determination section 1, which separates silent parts from sound parts. FIGS. 3 and 4 are flowcharts of the pitch detection section 4: FIG. 3 covers extremum detection and pitch candidate detection, and FIG. 4 covers pitch detection from the pitch candidates. FIG. 5 is the flowchart of the waveform-similar-portion detection section 5, which detects similar portions from similar-portion candidates and extrema.

Each block of FIG. 1 is described below.

It is generally well known that audio data always contains silent parts and that similar waveforms repeat in vowel parts. Based on this recognition, the present invention separates out silent parts and, further, similar waveforms in order to obtain a still higher compression ratio in conventional waveform-coding compression methods.

Silence/sound determination

FIG. 2 is the flowchart of the silence/sound determination section 1. In the figure:

FLG: flag indicating silence (1) or sound (2).
AMP: amplitude.
TH-AMP-1: amplitude threshold for the transition from sound to silence.
TH-AMP-2: amplitude threshold for the transition from silence to sound.
CNT-1: in the sound state, the number of consecutive amplitudes smaller than TH-AMP-1.
CNT-2: in the silent state, the number of consecutive amplitudes larger than TH-AMP-2.
TH-CNT-1: threshold on CNT-1 for moving from the sound state to the silent state.
TH-CNT-2: threshold on CNT-2 for moving from the silent state to the sound state.

A silent part is determined when amplitudes whose absolute value is smaller than a predetermined value (TH-AMP-1) are input a predetermined number of times (TH-CNT-1) in succession. A sound part is determined when amplitudes whose absolute value is larger than a predetermined value (TH-AMP-2) are input a predetermined number of times (TH-CNT-2) in succession.
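The two-threshold rule above behaves like a small state machine with hysteresis. The following is a minimal sketch under stated assumptions: the flowchart of FIG. 2 is not reproduced in the text, so the initial state (silence), the single shared counter, and the choice to switch the flag at the sample where the count is reached are all illustrative. The parameter names `th_amp_1`, `th_amp_2`, `th_cnt_1` and `th_cnt_2` mirror TH-AMP-1, TH-AMP-2, TH-CNT-1 and TH-CNT-2.

```python
def classify_silence(amp, th_amp_1, th_amp_2, th_cnt_1, th_cnt_2):
    """Return one flag per sample: 1 = silence, 2 = sound."""
    SILENCE, SOUND = 1, 2
    flg = SILENCE  # assumed initial state
    cnt = 0        # plays the role of CNT-1 or CNT-2, depending on state
    flags = []
    for a in amp:
        if flg == SOUND:
            # Count consecutive small amplitudes while in the sound state.
            cnt = cnt + 1 if abs(a) < th_amp_1 else 0
            if cnt >= th_cnt_1:
                flg, cnt = SILENCE, 0
        else:
            # Count consecutive large amplitudes while in the silent state.
            cnt = cnt + 1 if abs(a) > th_amp_2 else 0
            if cnt >= th_cnt_2:
                flg, cnt = SOUND, 0
        flags.append(flg)
    return flags
```

Requiring a run of samples rather than a single one keeps isolated spikes or zero crossings from toggling the state. Claim (2) additionally back-dates the start of a sound part by a predetermined time; this sketch does not do that.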

Smoothing

As preprocessing for pitch detection and waveform-similar-portion detection, the signal is smoothed to remove high-frequency components that these steps do not need. Specifically, smoothing is performed by, for example,

$$\mathrm{LPF}(I) = \frac{\mathrm{AMP}(I-1) + 2\,\mathrm{AMP}(I) + \mathrm{AMP}(I+1)}{4}$$

where LPF is the smoothed result, AMP is the input amplitude, and I is the time index.
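The smoothing formula is a three-point weighted average with weights 1, 2, 1. A minimal sketch follows; the source does not say how the first and last samples are handled, so copying them unchanged is an assumption, as is the function name `smooth`.

```python
def smooth(amp):
    """Apply LPF(I) = (AMP(I-1) + 2*AMP(I) + AMP(I+1)) / 4.

    The first and last samples have no neighbour on one side and are
    copied unchanged (an assumption; the source does not specify this).
    """
    if len(amp) < 3:
        return list(amp)
    lpf = [amp[0]]
    for i in range(1, len(amp) - 1):
        lpf.append((amp[i - 1] + 2 * amp[i] + amp[i + 1]) / 4)
    lpf.append(amp[-1])
    return lpf
```

A sample-to-sample alternation such as 0, 4, 0, 4, 0 is flattened to 2 in the interior, while a constant input passes through unchanged.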

Pitch detection

FIG. 3 (FIGS. 3(a) and 3(b)) is the flowchart of extremum detection and pitch candidate detection, and FIG. 4 is the flowchart of pitch detection from the pitch candidates.

In FIG. 3:

AMP: amplitude.
D1: slope from the preceding point to the point of interest.
D2: slope from the point of interest to the following point.
MIN-PITCH: minimum time possible for a pitch.

In FIG. 4:

P: time (index) of a point taken as a pitch candidate.
MAX-ZURE: threshold for deciding whether the time between the pitch candidates of interest and the time between the preceding pitch candidates count as the same pitch.
CNTP: number of consecutive pitch candidates regarded as having the same pitch.
TH-CNT-PITCH: threshold on CNTP for deciding a pitch.

Maxima and minima with large absolute amplitude are detected alternately, and a pitch is detected when the interval between successive maxima (or minima) repeats, with a deviation within a predetermined value (MAX-ZURE), a predetermined number of times (TH-CNT-PITCH) or more in succession.
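The two stages of pitch detection can be sketched as follows. This is a simplified illustration, not the flowcharts of FIGS. 3 and 4, which are not reproduced in the text. It uses only large maxima, whereas the source alternates between large maxima and minima, and it compares against a fixed threshold `th_amp`, a hypothetical parameter (claim (5) describes a threshold derived from the preceding large extremum). The function names and the return format of `detect_pitch` are also assumptions.

```python
def pitch_candidates(lpf, th_amp, min_pitch):
    """Indices of large local maxima at least min_pitch samples apart."""
    p = []
    for i in range(1, len(lpf) - 1):
        d1 = lpf[i] - lpf[i - 1]  # slope into the point of interest
        d2 = lpf[i + 1] - lpf[i]  # slope out of the point of interest
        is_max = d1 > 0 and d2 <= 0
        if is_max and lpf[i] > th_amp and (not p or i - p[-1] >= min_pitch):
            p.append(i)
    return p


def detect_pitch(p, max_zure, th_cnt_pitch):
    """Return (first, last) candidate times of each run of steady intervals.

    Two successive intervals count as the same pitch when they differ by
    at most max_zure; a run is accepted once that has happened
    th_cnt_pitch times in a row.
    """
    runs = []
    cntp = 0
    start = None
    for k in range(2, len(p)):
        cur = p[k] - p[k - 1]
        prev = p[k - 1] - p[k - 2]
        if abs(cur - prev) <= max_zure:
            if cntp == 0:
                start = k - 2
            cntp += 1
        else:
            if cntp >= th_cnt_pitch:
                runs.append((p[start], p[k - 1]))
            cntp = 0
    if cntp >= th_cnt_pitch:
        runs.append((p[start], p[-1]))
    return runs
```

For example, candidates at times 10, 20, 30, 41, 80, 90 with `max_zure=2` and `th_cnt_pitch=2` give one pitched run from 10 to 41; the jump to 80 breaks the run.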

Waveform-similar-portion detection

FIG. 5 is the flowchart for detecting similar portions from similar-portion candidates and extrema. In the figure:

S: time (index) of a point taken as a similar-portion candidate.
D: number of extrema.
SAVE-D: the preceding number of extrema.

When the number of extrema within a pitch equals the number of extrema within the preceding pitch, the waveform in the pitch of interest is regarded as similar to the waveform of the preceding pitch.
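The similarity test compares only the count of extrema in consecutive pitch periods. A minimal sketch, assuming that the pitch boundaries are given as a list of sample indices `s` and that an extremum is any point where the slope changes sign; the function names are hypothetical, and `d` and `save_d` stand for D and SAVE-D.

```python
def count_extrema(lpf, start, end):
    """Number of local maxima and minima at indices start..end-1."""
    d = 0
    for i in range(max(start, 1), min(end, len(lpf) - 1)):
        d1 = lpf[i] - lpf[i - 1]
        d2 = lpf[i + 1] - lpf[i]
        if d1 * d2 < 0:  # slope changes sign at i
            d += 1
    return d


def similar_flags(lpf, s):
    """For each period [s[k], s[k+1]), True if it matches the previous one."""
    flags = []
    save_d = None  # extremum count of the preceding period
    for k in range(len(s) - 1):
        d = count_extrema(lpf, s[k], s[k + 1])
        flags.append(save_d is not None and d == save_d)
        save_d = d
    return flags
```

The test is cheap because it needs no sample-by-sample comparison, but it accepts any two periods with the same number of extrema, whatever their amplitudes.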

FIGS. 6 to 9 show the state of the waveform as it is processed in the manner described above. FIG. 6 shows the original waveform. FIG. 7 shows the detected pitch; the intervals between the large positive swings are the pitch. FIG. 8 shows the detected similar portions; the intervals between the large positive swings are the waveform-similar portions. FIG. 9 shows the result of replacing similar portions with the similar waveform, up to a limit of five in succession. The labels a to j on the waveforms correspond to one another across all the figures.
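The replacement shown in FIG. 9 can be sketched as a run-length scheme over pitch periods. The source says only that similar portions are replaced by the preceding waveform and that at most five are replaced in succession before the waveform is coded again; the token format, the function names, and the reading of that limit as "at most `max_run` repeats after each coded waveform" are assumptions.

```python
def encode_periods(periods, similar, max_run=5):
    """Turn pitch periods into ('wave', samples) and ('repeat', n) tokens."""
    tokens = []
    run = 0  # repeats emitted since the last coded waveform
    for samples, is_similar in zip(periods, similar):
        if is_similar and tokens and run < max_run:
            run += 1
            if tokens[-1][0] == "repeat":
                tokens[-1] = ("repeat", run)
            else:
                tokens.append(("repeat", run))
        else:
            tokens.append(("wave", list(samples)))
            run = 0
    return tokens


def decode_periods(tokens):
    """Expand tokens back into a list of pitch periods."""
    out = []
    last = []
    for kind, value in tokens:
        if kind == "wave":
            last = value
            out.append(list(value))
        else:
            out.extend(list(last) for _ in range(value))
    return out
```

Decoding reproduces the input exactly only when the similar periods are identical to the coded one; in general the replacement is lossy, which is what FIG. 9 illustrates against the original of FIG. 6.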

As is clear from the above description, the present invention makes it possible to obtain a high compression ratio in audio data compression.

[Brief explanation of drawings]

FIG. 1 is a block diagram for explaining one embodiment of the present invention. FIG. 2 is a flowchart for explaining the operation of the silence/sound determination section. FIG. 3 is a flowchart for explaining extremum detection and pitch candidate detection. FIG. 4 is a flowchart for explaining pitch detection from the pitch candidates. FIG. 5 is a flowchart for explaining the detection of similar portions from similar-portion candidates and extrema. FIGS. 6 to 9 are diagrams for explaining the state of the waveform during processing. FIG. 10 is an external view showing an example of the audio information recording and reproducing apparatus previously proposed by the present applicant. FIG. 11 is a system diagram for explaining the operation of the apparatus of FIG. 10. FIG. 12 is a detailed circuit diagram of the digital processing circuit and analog processing circuit shown in FIG. 11.

1: silence/sound determination section, 2: silent-time coding section, 3: smoothing section, 4: pitch detection section, 5: waveform-similar-portion detection section, 6: amplitude and pitch coding section, 7: waveform coding section.

[Drawings: FIG. 1, FIG. 3(a), FIG. 3(b), FIG. 4, FIG. 5]

Procedural Amendment: Patent Application No. 88315 of 1987 (Showa 62)
2. Title of the invention: Sound wave information recording and reproducing system
3. Person making the amendment: the patent applicant. Address: 1-3-6 Nakamagome, Ota-ku, Tokyo. Name: (674) Ricoh Co., Ltd. Representative: Hiroshi Hamada
4. Agent: Address: Chatelet Inn Yokohama No. 807, 1-2-7 Furocho, Naka-ku, Yokohama 231
6. Subject of the amendment: the drawings
7. Contents of the amendment

Claims

[Claims]

(1) A sound wave information recording and reproducing method characterized in that, in silence/sound discrimination, a silent part is determined when inputs whose absolute amplitude is smaller than a predetermined value continue for a predetermined number or more, and a sound part is determined when inputs whose absolute amplitude is larger than a predetermined value continue for a predetermined number or more.

(2) The sound wave information recording and reproducing method according to claim (1), characterized in that the actual sound part is taken to start a predetermined time before the point at which a sound part is determined.

(3) The sound wave information recording and reproducing method according to claim (1), wherein only the time of the silent portion is encoded.

(4) A sound wave information recording and reproducing method characterized in that, in detecting the pitch of an audio waveform, maxima and minima with large absolute amplitude are detected alternately, and a pitch is detected when the time between successive maxima (or minima) repeats, with a deviation within a predetermined value, a predetermined number of times or more in succession.

(5) The sound wave information recording and reproducing method according to claim (4), characterized in that the value against which the absolute amplitude is compared is a weighted value of the amplitude of the immediately preceding maximum (or minimum) judged to be large.

(6) The sound wave information recording and reproducing method according to claim (4) or (5), characterized in that the input signal is smoothed as preprocessing.

(7) A sound wave information recording and reproducing method characterized in that, in detecting waveform-similar portions after the pitch of an audio waveform has been detected, the waveform in the pitch of interest is regarded as similar to the waveform of the preceding pitch when the number of extrema within one pitch equals the number of extrema within the preceding pitch.

(8) The sound wave information recording and reproducing method according to claim (7), characterized in that, for waveform-similar portions, only the number of similar waveforms is encoded, and for other portions the waveform is encoded.

(9) The sound wave information recording and reproducing method according to claim (7) or (8), characterized in that the input signal is smoothed as preprocessing.