JPH08502609A

JPH08502609A - Non-numeric coprocessor

Info

Publication number: JPH08502609A
Application number: JP6509863A
Authority: JP
Inventors: ハラース，アーネ
Original assignee: ハラース，アーネ
Priority date: 1992-10-16
Filing date: 1992-10-16
Publication date: 1996-03-19
Also published as: WO1994009443A1; CA2146352A1; KR950704751A; NO951401L; EP0664910A1; NO951401D0

Abstract

(57)【要約】ファジー情報の検索とパターン認識用の非数値コプロセッサは、情報処理手段を有し、ホストコンピュータ及びデータ源に接続可能である。データ源からのデータストリームを検査するために配設された多数の同時動作可能なウインドウモジュール（Ｗ０、Ｗ１、．．．）内には、複数の内部処理素子が編成されている。処理素子は、データストリームのバイトと所定の上限及び下限とを比較してバイトが前記境界内にあるかどうかを決定し、境界内にある場合は、ヒット信号を生成する。各ウインドウモジュールは、その異なる処理素子からのヒット信号を相関させるためのウインドウ突合せ論理回路を有し、所定の突合せの発生によりウインドウ突合せ信号を生成する。このようにコプロセッサを構成することで、別個のデータストリームを、アプリケーションの必要に応じて、個々のウインドウモジュールに或いはウインドウグループ又はスーパウインドウとして選択可能に構成された連鎖ウインドウモジュールに転送するために、データルーティング手段（１２）により利用可能なパラレル処理能力が得られる。 (57) [Summary] A non-numeric coprocessor for fuzzy information retrieval and pattern recognition has information processing means and can be connected to a host computer and a data source. A plurality of internal processing elements are organized within a number of simultaneously operable window modules (W0, W1, ...) arranged to examine the data stream from the data source. Each processing element compares a byte of the data stream with predetermined upper and lower limits to determine whether the byte lies within those bounds and, if so, generates a hit signal. Each window module has window matching logic that correlates the hit signals from its different processing elements and produces a window match signal when a predetermined match occurs. Configuring the coprocessor in this way provides parallel processing capacity that the data routing means (12) can use to transfer separate data streams, according to the needs of the application, to individual window modules or to chained window modules selectably configured as window groups or super windows.

Description

【発明の詳細な説明】非数値コプロセッサ技術分野本発明は、電子計算機を用いたファジー情報検索及びパターン認識用の非数値コプロセッサに関する。背景技術従来のコンピュータシステムは、複雑なプログラミング及びデータ構成技術を用いて、情報の記述、蓄積、認識及び検索に利用可能である。しかしながら、かかる公知の方法を利用することにより、特に複雑な情報項目の検索や複雑なパターン認識等のタスクの際に、システム性能が著しく低下することも多い。情報検索、テキスト及びデータベース探索、及びパターン照合を通じて、殆どのコンピュータユーザは、非数値計算への従来のアプローチの非効率から生じる周知の問題に直面してきた。例えば、「燐濃度が１５μｇ／ｌ以上測定された現場で、１９８０年−８５年の間にカドミウムが測定された日付と位置を全て見つけなさい」というタイプの照会に応答するために膨大な量の測定値を読むことや、これよりずっと簡単な照会でも、従来のシステムでは性能上過酷な問題となることがある。多量の情報を検索しそこから複雑な項目を認識する人間の行動に鼓舞されて、本発明は、従来の方法に基づいたシステムと今日の性能上の要求との間の乖離を解消しようとする。従って、本発明の目的は、単純な非数値計算プログラミングを提供すると同時に、従来の非数値計算システムより実質的に高速で膨大なデータボリュームを走査検索し且つより低速で特定の完全な文章を探索したり或いは多くの語の断片から成る複雑な組合せを探索し得る機能を提供することである。更に、本発明は、従来の技術を採用するときにしばしば必要とされてきた、拡張セグメンテーション、ベクトル化、及び複雑な情報項目を探索するためのデータの複写格納に対する必要性を除去することを企図する。本発明の開示において、「バイト」は、１単位として処理される一連の隣接したビットとして解釈されるべきであり、ビット数は必ずしも８ではない。発明の開示本発明によれば、従来技術が直面した問題は、情報処理手段を有してホストコンピュータとデータ源とに接続可能なファジー情報検索とパターン認識用の非数値コプロセッサであって、情報処理手段が、前記データ源からのデータストリームを検査するために配設した所定数の同時動作可能なウインドウモジュール内に編成された複数の内部処理素子を備え、各処理素子が、データストリーム内の１バイト例えば８ビットバイトを、前記処理素子に割り当てられた所定の個別にプログラム可能な上限値及び下限値と比較して、当該処理素子内に存するバイトの値が前記境界値内にあるかどうかを決定し、境界値内にあるならば各ウインドウモジュール内に設けたウインドウ突合せ論理回路に送出されるヒット信号を生成して、当該ウインドウモジュール内の異なる処理素子から受信したヒット信号と相関させ、更に、前記ウインドウモジュール内の所定の突合せの発生により、ウインドウ突合せ信号を生成するコプロセッサにより解決される。このようにコプロセッサを構成することにより、情報検索とパターン認識用の従来のシステムの性能を遙かに凌ぐ性能をコプロセッサシステムに付与するために利用可能な、強力な並列処理能力が得られる。本発明に係るコプロセッサは、好ましくは、更に、アプリケーションの必要性に対応する構成データに応じて、前記データ源からの別個のデータストリームを、前記同時動作可能なウインドウモジュールに、個別的に、或いは前記ウインドウモジュールが個々のスーパウインドウ又はスーパウインドウグループ又は全てのウインドウモジュールを含む単一のスーパウインドウ等の異なる選定可能なウインドウ構成に連鎖される態様で、転送するデータルーティング手段を備えている。ウインドウをより長いウインドウに連鎖することで、より複雑な検索条件にも対応でき、また、異なるウインドウへの入力データのルーティングは、現在実行しているアプリケーションの、ウィンドウ長さ及びデータストリームに対する必要に応じて、幾通りかの方法で行うことができる。実際に、利用可能なウインドウの数の制限内で、それぞれ１バイトから成る任意の数のストリームを処理することができる。例えば、それぞれ好ましくは８ビット長の多数の個々のデータ入力は、異なるウインドウに転送されて並列処理を行うことができ、同一のデータストリームを異なるウインドウにより処理すべきときにデータを複写蓄積する必要が無くなる。かくして、本発明にかかるコプロセッサは、フレキシブルで構成可能なデータルーティング機能を有すると共に、６４ビット長以下のアプリケーションでのプロセッサの使用もサポートする。好ましくは、データルーティング手段は、異なるレベルで編成されたマルチプレクサのネットワークから成り、各マルチプレクサは、それぞれ好まし
くは８ビット長の二つのデータ入力のうち、一方を選択してその出力に転送可能である。特に、コプロセッサの好ましい実施例において、マルチプレクサのレベルは、それぞれ、フォールディング、パラレル、及びシリアルのマルチプレクサレベルから成る。本コプロセッサは、更に、コプロセッサにロード可能な前記ウインドウ構成の内部記憶用のスタティックランダムアクセスメモリ（ＲＡＭ）を備えている。これにより、探索動作毎に構成データをコプロセッサに転送する必要はなくなり（構成を変更した場合のみ転送要）、ダウンロード可能な構成データの個別化セットを含むソフトウェアの開発には有利になる。コプロセッサの一実施例において、各処理素子は、検査すべきバイトの一時記憶用のラッチセルと、当該処理素子用の前記上限値及び下限値を備えた二つの比較器セルとを備え、比較器セルは、ヒット信号を生成するように構成されている。更に、コプロセッサ内に、ウインドウ突合せ信号を受信してプログラム可能な中央ヒットマスクと比較するように構成された結果制御論理回路を設け、中央ヒットマスクが、ウインドウヒットの論理組合せを定義させると共に、検出された全ての発生のアドレスを報告させる（ヒットアドレスモード）か、或いは検査されたデータボリューム内の突合せの総数を報告させる（ヒットカウントモード）、ことは好ましい。当業者にとって、本コプロセッサをプログラムし制御する際の簡便性、及び本コプロセッサの更なる利点と特徴は、以下の説明から明らかになろう。図面の簡単な説明以下、添付図面を参照しつつ、本発明に係るコプロセッサの好ましい実施例の一例に基づき、本発明を詳細に説明する。図１は、本発明に係るコプロセッサの典型的なアプリケーションを示す。図２は、ホストコンピュータ及びデータ源と接続された、本発明に係るコプロセッサのブロック図である。図３は、本発明に係るコプロセッサ内の単一のウインドウを示す図である。図４は、本発明に係るコプロセッサ内のウインドウの単一の処理素子を示す図である。図５は、本発明に係るコプロセッサ内のデータルーティングネットワークを示す。図６乃至図１５は、本発明に係るコプロセッサ内のデータルーティングネットワークの種々の好ましい構成を示す。図１６乃至図１７は、図７乃至図１６のそれぞれで示した二つの異なる構成に係るコプロセッサ内のルーティングネットワークを通るデータの流れを詳細に示す。図１８は、典型的なホスト／コプロセッサ構成を示す。図１９は、コプロセッサ内のアドレスマップ編成を示す。図２０は、所与のアプリケーション例のためのウインドウの一部を示す。好適な実施例の説明図１に示したように、本発明に係るコプロセッサチップ１は、一般には、双方向データ転送リンクを介してホストコンピュータ２に、また、一方向データ転送リンクを介してデータ源３に接続されている。コプロセッサチップ１の好ましい実施例は、図２に示したように、一連の８個のウインドウモジュールＷ０−Ｗ８を備えている。これらのデータモジュールＷ０−Ｗ８は、ホストインタフェースモジュール１４に接続された８ビットデータバスを介して相互に連結されるデータルータモジュール１２と結果制御論理回路１３との間に、論理的に位置している。データ源インタフェースモジュール１５は、データ源３からデータルータモジュール１２への一方向６４ビットデータ転送を行う。次に図３を参照すると、８個のデータウインドウモジュールＷ０−Ｗ８は、それぞれ、ウインドウ突合せ論理回路１６と、３２個の処理素子ＰＥ０−ＰＥ３１に対応する３２バイトのシフトレジスタと、を有する。図４に示したように、各処理素子ＰＥは、ラッチセル１７と、個別にプログラム可能な上下限を突合せチェックするために連係した二つの比較器セル１８、１９と、に分割される。図５に詳細に示したように、データルータモジュール１２内のデータルーティングネットワークを介して、各ウインドウを、より長いウインドウに連鎖させることができ、より複雑なデータ検索を可能としている。データルータは、３レベルのマルチプレクサから成り、それぞれ二つの長さ８ビットの入力のうち一つを選択して出力する。データ源から長さ８ビットのデータストリームが供給される第一のレベルのマルチプレクサは、フォールディングマルチプレクサ（図５の上側の多重化行）即ちデータストリームを円形にフォールドするマルチプレクサから構成され、これにより、各ウインドウを、実際のデータストリームがコプロセッサチップに入力される位置から、独立にすることができる。フォールディングマルチプレクサは、データストリームを複写して異なるウインドウに同一のデータストリームを同時に読ませるパラレルマルチプレクサ（図５の中央の多重化行）に
、接続されている。最後に、シリアルマルチプレクサ（図５の下側の多重化行）が、各ウインドウをスーパウインドウに連鎖させるべきか否かを選択する。これら３個のマルチプレクサレベルにより、入力の組合せ及びウインドウの連鎖が可能となる。以下、図６乃至図１７を参照して、可能な構成の一部を説明する。入力データストリームに０乃至７の符号を付し、例えば、ストリーム０をＤ（７、０）、ストリーム１をＤ（１５、８）に対応させる。ａ）８本のデータストリームをそれぞれ、並列に配置した各ウインドウに供給する。これは、最も単純なルーティング戦略である。各ストリームは、対応するウインドウに送られる。この構成は図６に示されており、図１６には実際の構成がボールド体で詳細に示されている。ｂ）四つのデータストリームをそれぞれ、並列に配置した２個のウインドウに供給する。各データストリームは対にして、並列に配した２個のウインドウに送られる。ストリームは、一つおきのデータセット０、２、４、６又はデータセット１、３、５、７とすることができ、例えば、ストリーム０の次にストリーム１を配する。この構成は図７に示されている。ｃ）四つのデータストリームをそれぞれ、直列に配置した２個のウインドウに供給する。この構成は、ウインドウの各対が二倍の長さの一つのスーパウインドウを形成するように連鎖されている点を除き、ｂ）と同様である。この構成は図８に示されている。ｄ）二つのデータストリームをそれぞれ、並列に配置した４個のウインドウに供給する。ストリームは、４本ずつ結合されて、それぞれ並列に配した４個のウインドウから成る２つの組にそれぞれ送られる。ストリームは、交互に（０、４）、（１、５）、（２、６）又は（３、７）とすることができ、例えば、ストリーム０の次にストリーム１、２、３を、ストリーム４の次にストリーム５、６、７を配する。この構成は図９に示されている。ｅ）二つのデータストリームをそれぞれ、直列に配した２個のウインドウから成る２群の並列なウインドウに供給する。ｄ）の場合のように、ウインドウは連鎖されているが、並列処理は減少している。この構成は図１０に示されおり、図１７には入力ストリーム２及び６と実際の構成がボールド体で詳細に示されている。ｆ）二つのデータストリームをそれぞれ、直列に配した４個のウインドウに供給する。この構成は、個々のウインドウの長さの４倍の一つのスーパウインドウを形成するように各群のウインドウを連鎖させた他は、ｄ）の構成と同様である。この構成は図１１に示されている。ｇ）一つのデータストリームを、並列に配置した８個のウインドウに供給する。各データストリームは、一つのストリームを形成するように結合され、並列に配した８個のウインドウに送られる。８本のストリームのいずれも入力として使用可能であり、全データ記憶領域を一つの長い８ビットデータファイルとして扱う。この構成は図１２に示されている。ｈ）一つのデータストリームを、直列に配した２個のウインドウから成る４群の並列なウインドウに供給する。ｇ）に類似しているが、各ウインドウは、より複雑な探索用に連鎖されている。この構成は図１３に示されている。ｉ）一つのデータストリームを、直列に配した４個のウインドウから成る２群の並列なウインドウに供給する。ｇ）及びｈ）に類似しているが、より多くのウインドウを連鎖させて複雑さを増している。この構成は図１４に示されている。ｊ）一つのデータストリームを、直列に配した８個のウインドウに供給している。ｇ）に類似しているが、全てのウインドウを接続して単一のスーパウインドウを形成し、最も複雑な探索を可能にしている。この構成は図１５に示されている。別のフィルタ及びデータ経路構成は、チップにロードされた構成データにより決定される。全ての構成は、データのルーティングにより、秒あたり１０ギガシングルバイトの比較を行うことができる。しかしながら、同時データ経路の数をトレードオフすることにより、複雑な問合わせがなされ、この結果、チップ上に多数のアプリケーションをマップすることが容易となる。コプロセッサ１１は、一般には、図１８に示したように、ホストインタフェース１４を介してコンピュータにリンクされている。図示したホストコンピュータは、ディスクユニット２１及びそれに係るディスク制御装置２２、中央処理装置即ちＣＰＵ２３、システムメモリ２４及びシステムバス２５から成る。ホストインタフェース１４は、８ビット双方向ポート（ＨＤバス）に基づいており、読書きサイクルは、同期及び非同期で実行される。ホストインタフェース自体は、ＣＳ信号と組み合わせたＩＯＲ信号及びＩＯＷ信号のアサーション、及びＳＥＴＡＤＲライン上の特定の極性により、制御される（図２参
照）。構成データは、構成ＲＡＭ内に蓄積されて合計８２８バイトから成り、約１００マイクロ秒内での完全な再構成を可能としている。殆どのシステムでは、構成時間はホストコンピュータからの転送速度により決定される。パーソナルコンピュータの入出力チャネルを使用した場合、転送速度をＩＭＢ／秒と仮定すると、一般には１０００マイクロ秒かかる。構成データのマイナチェンジは、コプロセッサの内部アドレスレジスタを介してアドレス指定することにより行われ、これにより構成時間を一層短くすることができる。かくして、再構成と探索が極めて迅速に行われるので、異なる基準で等量のデータを探索することが可能となる。カウントモードでは、コプロセッサは、検出された突合せデータ項目数をチップ上に累積する。報告モードでは、コプロセッサは、ヒットを検出した時に割り込み信号を発する。この信号は、ホストコンピュータがＡＣＫ信号又はＩＯＷ信号を発するまで、送出され続ける。内部結果位置カウンタは、シャドーレジスタ（図示せず）内に収納される。構成データは、ヒットが生じた場合にチップがデータの受信を停止すべきか否かを決定する。データストリームを停止するようにチップが構成されている場合、ＤＷＴＤ信号（図２参照）は、ＡＣＫ信号が送信されるまで非活性状態となる。突合せにも拘らずデータストリームが停止されずにそのまま流れるようにプログラムされている場合、シャドウレジスタ内に収納されたカウンタは、ＡＣＫ信号が送信されるまでオーバライトされることはない。これは、所望のデータを含むテキスト部分にヒットが頻繁に生じる場合、テキスト探索に有利である。また、問題がテキストの正規の部分（章、項）のみに発生した場合、テキストの当該部分内でその発生の正確な検出を期することは適切でない。６４ビットの同期データインタフェース１５（図２参照）は、単純なハンドシェーク手順で制御される。コプロセッサがデータを受信し得る状態にあるとき、ＤＷＴＤ信号は、実際のデータが読まれる１クロックサイクル前に、送出される。これにより、インタフェースを設計する際に、一層適切なタイミングが確保できる。データ源は、データを準備完了すると、ＤＶＡＬＩＤ信号を発する。最後の立ち上がりクロック時にＤＷＴＤ信号が非活性であった場合、ＤＶＡＬＩＤ信号の送出により、データがコプロセッサに読まれることはない。従って、ＤＷＴＤ信号の送出と共に立ち上がりクロックエッジの検出後、最初の立ち上がりクロックの間にデータとＤＶＡＬＩＤ信号を検出するまで、データ源は、データ転送を完了したと見做すべきではない。対応するタイミング方式は、係る同期及び非同期の読書き機能に対して、実現することができる。１バイト以上に亘る数値として解すべきデータを入力ストリームが含む場合、最上位のバイトは、ウインドウ内に最初に達する必要がある。チップのプログラミングは、ホストインタフェース内の異なるアドレスに構成データを書き込むことにより、行われる。構成の異なる部分は、間接的にアドレス指定可能である。これにより、必要に応じて極めて短時間の間に、構成の一部のみを変更することができる。再構成の間データストリームを停止する必要はないが、構成が部分的に書き込まれる場合に生じる一時的な状態の故に、誤った突合せが行われることがある。また、レコードカウンタは対応するウインドウへの書き込みによりリセットされるので、調整不良のレコードの場合、問題が生じることがある。従って、データストリームを停止してから構成を変えることが望ましい。コプロセッサ内の内部構成アドレスは、１１ビットのアドレスＡＤＲ（１０、０）から成り、これは、内部アドレスレジスタ内で生成される。このレジスタの８個の上位ビットは、ホストインタフェースを介してロード可能である。３個の下位ビットは、上位ビットがロードされるとクリアされる。ロードは、ＨＤバス上のアドレスビットを設定し、ＣＳ信号、ＩＯＷ信号、及びＳＥＴＡＤＲ信号を同時に送信することにより、行われる。１１ビットアドレスの編成は、図１９に示されている。コプロセッサは、それぞれ自身のアドレスを有する１２個の内部モジュールを備えている。モジュールアドレスは、アドレス内の４個の上位ビットから成り、ＨＤバスから新しい値を書き込むことによってのみ変化する。モジュール基底アドレスは、表１に示されている。７個の下位ビットは、カウンタ内に保持され、ホストインタフェースからのアクセス毎にインクリメントされる。かくして、１モジュール内の同順バイトは、容易にアクセスすることができる。モジュールアドレスの自動インクリメントは、モジュール間のアドレスマップに孔があるので、サポートされていない。殆どのモジュールにおいて、オフセット
アドレスは、より詳細なアドレス指定に用いることができる。単一のバイトをアクセスするために、絶対アドレスは以下のように計算される。アドレス＝（モジュールアドレス^*１２８（１０））＋オフセットアドレス次に、ＨＤインタフェースを介してアドレスレジスタに値をロードし、その後アドレスレジスタを自動インクリメントすべくアクセスする。従前の構成についての知識を要しないので、読取りアクセスが最も簡単である。各ウインドウモジュールは、以下のものから成る。・下限３２バイト・上限３２バイト・フィールド区切り（セパレータ）マスク３２バイト（全バイトで、最下位ビットのみが重要である）・突合せ待ち時間値２バイト・レコード長値１バイトこれらのレジスタ用のオフセットアドレスは、以下の表２に示されている。実際の探索では、限界レジスタに適当な値をロードする。フィールド区切りマスクは、各フィールドの最下位バイトの位置で、１をとる。尚、フィールド区切りマスクでは、各バイトの最下位ビットのみが使用される。即ち、処理素子ＰＥを突合せする区切りマスクバイトには１が書き込まれ、他の場合はフィールドの最後のバイトを０に保持する。突合せ待ち時間は、ウインドウがヒットを実行するまでに必要なクロックサイクルの数である。突合せ待ち時間ゼロは、ヒットを生じたクロックサイクル中に、ウインドウから中央突合せ論理回路に、ヒットが報告されることを意味する。突合せ待ち時間が例えば４であるときは、更に４サイクル経過した後、即ち合計５サイクル内に突合せが報告されることを意味する。突合せ待ち時間レジスタに書き込む値は、６５５３５（１０）−前記待ち時間とする。即ち、例えば突合せ待ち時間４は、レジスタに値６５５３１（１０）を書き込むことにより指定される。データストリームのルーティングは、全て同量の時間を要するので、データストリームをチップに同時に入力した場合、データストリームは、各ウインドウ（又は連鎖の第一のウインドウ）の入力部に同時に発生する。ストリームを連鎖ウインドウに送ると、３２サイクルの累積遅延が導入されるが、このことは、突合せ待ち時間を計算する際に考慮すべきである。データ転送の無いクロックサイクル即ち非活性のＤＶＡＬＩＤ信号が、待ち時間サイクル数に関わることはない。従って、突合せ待ち時間は、物理的なクロックサイクルではなく、データ転送に対して測定される。レコード長は、レコード境界と合わないパターンを時折突合せすることにより、ヒットを抑制するために使用される。レコード長を１に設定することにより、全てのヒットが中央突合せ論理回路に報告される。レコード長が例えば６であるとき、６番目毎のデータ転送で行われる突合せのみを、ウインドウから報告させる。ウインドウをカウンタするバイトは、当該ウインドウに対応するモジュールアドレスを用いた全ての書き込み操作によりリセットされる。即ち、最初の実行可能な突合せは、ウインドウ構成を書き込み後６回目のデータ転送に対して行われる。レコード長値は、２５６（１０）−レコード長として、コプロセッサ内の対応するレジスタにプログラムされる。従って、１０（１０）のレコード長は、２４６（１０）として書き込まれる。２５６ビットのヒットマスクＲＡＭは、ホストインタフェースから分かるように、３２×８ビットとして編成される。この３２バイトは、内部アドレスレジスタ内の５個の下位ビットにより選定され、ビット５及び６を「ドントケア」として、ビット７乃至１０をモジュールアドレス１０００（２）として残す。バイトアドレス０への書き込みは、２５６ビットアドレス指定方式におけるビット０乃至７に影響を及ぼし、バイト内の最下位ビットはビット位置０に対応する。同様に、バイトアドレス４への書き込みは、２５６ビットアドレス指定方式のビット３２乃至３９に影響する。各ウインドウは、ＲＡＭを２５６×１ビットとしてアドレス指定する。８個のウインドウからの突合せ信号は、アドレスとして使用され、ウインドウ０からの突合せ信号は、最下位アドレスビットを表す。実ロケーションに１が蓄積されると、ヒットが検出される。モードレジスタと、結果カウンタと、ヒットパターンと、ヴァージョンレジスタとから成るモジュールは、内部オフセットアドレスを有しないが、逐次読取り方式のシフトレジスタチェーンとして編成されている。このモジュールへの書込みは全て、チェーン内で唯一の書込み可能レジスタであるモードレジスタ内で行われる。これは、コプロセッサがヒットにどのように作用するか、に影響する。この場合、３個の下位ビットのみを機能させて、他には常に０を書き込む。表３は、モードビットを説明する。これらのビットから、表４で全て説明する、８通りの動作の組合せが生じる。このモ
ジュールから読み取る場合、以下の順序で値を呈示する。１．結果カウンタ、バイト３（最上位バイト）２．結果カウンタ、バイト２３．結果カウンタ、バイト１４．結果カウンタ、バイト０（最下位バイト）５．ヒットパターン６．モード７．ヴァージョン番号モードレジスタを除く全てのレジスタは、読取り専用型である。最初の読取り開始後に、全ての値を読み取る必要はない。ＡＣＫ信号又はＩＯＷ信号をコプロセッサに送出することで、結果カウンタ及びヒットパターンの値を新しいヒットによりオーバライトすると共に、結果カウンタの最上位バイトにアクセスして新しい読取りを開始する（ＩＯＷ信号はＣＳ信号でクオリファイされなければならない）。これは、また、ＩＮＴ信号が送信されないときに有効である。結果カウンタは、モードレジスタへの最後の書込み後に生じたヒット数又はデータ転送数をカウントする。従って、このモジュールに任意の値を書込みことで、結果カウンタはクリアされる。カウンタは、長さ３２ビットであり、カウンタオーバフロー標識は呈しない。ヒットパターンは、最後のヒット時に８個のウインドウのそれぞれから報告された突合せパターンである。これは、ヒットマスクＲＡＭのプログラミングで幾つかの突合せパターンにヒットを生成させる場合に、使用可能である。実際にヒットを引き起こしたパターンは、後処理を容易にする、より多くの情報を提供するために読み出される。ヴァージョンレジスタは、現在のコプロセッサのヴァージョンを示す番号を含む。これは、後の改定で使用され、ソフトウェアを現在のハードウェアに適合可能にする。データルータ設定モジュールは、レベル深さ３で長さ１バイトのシフトレジスタとして編成される。これらは図５乃至図１７に基づいて先に説明したマルチプレクサを制御するバイトである。各バイトは、以下の順序で読み書きされる。１）シリアルマルチプレクサ２）パラレルマルチプレクサ３）フォールディングマルチプレクサ読取り動作は、破壊的である。即ち、全てのマルチプレクサの構成データは、読出し後に再書込みしなければならない。しかしながら、マルチプレクサ構成はシステム試験中に読み取られるだけであるので、これはあまり重要ではない。各バイトの最上位ビットは、図５の一番左のマルチプレクサに対応する。シリアルマルチプレクサでは、値をどのように組合せても、リーガルである。他の二つのマルチプレクサでは、マルチプレクサ制御バイトの一つに４個以上連続したビットを有する構成は、（伝播遅延のために）イリーガルである。これは、また、循環フィードバックにも当てはまる。即ち、Ｃ３（１６）のマルチプレクサ構成もイリーガルである。図６乃至図１７に基づいて先に説明した構成ａ）乃至ｊ）のそれぞれに対する構成バイトは、作動すべく保証されており、以下の表５に示されている。本発明に係るコプロセッサチップは、製造試験専用のデータ経路モジュールを含み、ウインドウ７からデータを出力させる読取り動作を行う。本発明に係る非数値コプロセッサを含むチップの好ましい実施例は、パーソナルコンピュータ又はワークステーション用の拡張カード上に搭載される高度並列超ＬＳＩチップである。好ましくは、チップは、ＣＭＯＳ（相補形ＭＯＳ）工程により製造されてＴＴＬ（トランジスタトランジスタ論理回路）及びＣＭＯＳコンパチブル入力を有し、＋５Ｖ電源上で動作する１００ピンＰＱＦＰパッケージである。２０ＭＨｚの動作周波数で、かかるコプロセッサチップは、１６０ＭＢ／秒の持続可能なデータ処理能力を有し、秒あたり１０ギガシングルバイトの比較を行う。プログラミング例以下の例は電話帳で人を探索するための設定を示す。図２０は、本例のためのウインドウの一部構成を示す。このウインドウは、アルファベットのＡ乃至Ｇで始まる姓を有する人及び１４２０００乃至１６００００の範囲の電話番号の突合せを報告する。フィールド区切りマスク内に適当なビットを設定することにより、電話番号は６バイトに亘る数値フィールドとして処理される。図示ウインドウでは、他の二つのフィールドも示されているが、フィールドＮｏ．１及びＮｏ．２に対して全ビットが１である場合、それらは１バイト以上から成る比較には含まれないので、差異が生じない。探索基準に含まれないバイトは、ＦＦ（１６）乃至００（１６）の最大範囲内の全データパターンを突合せする「ドントケア」状態に設定される。プログラムされた１６バイトのレコード長のために、構成内のレコード境界と合わないデータに対しては、突合せは行われない。また、１つのウインドウのみを使用する場合、突合せ待ち時間は不要である。上述したことが唯一の探索基準である場合、これを全ウインドウにコピーしてもよく、位置０を除く全位置が１になるよ
うにヒットマスクＲＡＭを設定する。これにより、ウインドウの少なくとも一つがヒットを生じると突合せ信号が発せられる。また、モード値も適切に設定する必要があり、例えば、Report＝１、St op＝１、Flank＝０とする。次に、コプロセッサは、各突合せ毎に割り込み信号を生成し、更にそれぞれに対して探索開始に係る突合せ位置を読み取る。機能説明図３に示すように、コプロセッサの好ましい実施例は、それぞれ３２バイトのシフトレジスタを含む８個のデータウインドウから成る。図４に示すように、各レジスタ素子は、上限と下限との突合せをチェックする二つの比較器と連係している。各境界は個別にプログラム可能であり、バイト範囲にある任意の連続インタバル内でデータを突合せることができる。二つの比較器は、各ウインドウに接続された突合せ論理回路に、突合せを報告する。１バイトより大きい項目の場合、異なるバイトの突合せを組み合わせる。これにより、２５６バイトまでのデータレコードを処理することができる。データフィールドは任意のインタバル試験に対して８バイトまで構成することができ、等価試験のときは２５６バイトまで可能である。各ウインドウは、個々のヒットをチップ上の中央ヒットマスクに報告する。２５６ビットのユーザプログラム可能ＲＡＭのアドレスとしては、８個のウインドウそれぞれの突合せが使用される。このＲＡＭは、ヒットとして検出されるべきウインドウヒットの任意の組合せに対して１を蓄積する。プログラム可能であるため、ユーザは、例えば１個のウインドウだけがヒットしたときでも報告されるべきか、８個のウインドウのうち４個のヒット時に報告されるべきか、或いは全てのウインドウがヒットしたときに報告されるべきか、を選定することができる。一般に、８個のウインドウヒットの任意の論理組合せは、ユーザ定義のヒットである。チップは、検出された全てのヒットの発生アドレス、或いはデータボリューム内の突合せの総数を報告する。報告モードに設定した場合、コプロセッサは、ヒット時に、被検出データの位置を含む内部カウンタを、シャドーレジスタ内に蓄積する。この蓄積された表示は、後にホストコンピュータにより読み取ってもよい。シャドーレジスタは、ＡＣＫ信号の送信によりホストコンピュータが突合せを確認するまで、オーバライトされない。各ウインドウは、プログラム可能な時間の間ヒットを記憶するように、設定可能である。これにより、正確な整合を検出できない場合に、文脈依存探索を行うことができる。これは、コプロセッサの原理に従う。即ち、一つの（厳格な）探索キーの代わりに、必要データに係る多くの弱い条件が使用される。この特徴は、複雑且つ／又は多量のデータ内の探索に特に重要である。アプリケーション例非構造化テキスト内の探索コプロセッサにより実現される検索速度の故に、従来の索引を作成し維持することは、時代遅れとなろう。被制御相対距離を以てワイルドカードを有するテキスト断片を組合せることにより、重要な情報を独自に識別することができる。探索では、同義語を同時に使用してもよい。以下は、異なる二つのタイプの簡単な問合わせである。Ｑ１：「シェイクスピアの作品において、'take it in what sence thou wilt 
'という文章は、何度、そして何処に現れるか？」Ｑ２：「'Gorbat'ではなく、'Jelts','Mitter','Kohl''Major','Bush'という５つの（部分）名のうち少なくとも３つが現れる新聞記事を見つけなさい。」Ｑ１タイプの文体研究は、従来のテキスト探索システムでは十分に行うことができない。また、索引がテキストよりも多くのスペースを要することも良く知られている。所与の例に係るコプロセッサは、Ｑ１タイプの照会に対して１６０ＭＢ／秒の持続可能なデータ速度を実現する。また、Ｑ２タイプの複雑な照会は、２０ＭＢ／秒の持続可能なデータ速度で処理される。パターン照合例えば指紋等の画像から、様々なタイプの特徴が抽出され、多くのかかる特徴の組合せにより対象を識別し得る。多数の応募者間の探索は、本コプロセッサの頑強でファジーな機構から大いに得るところがある。一般に、コプロセッサの能力は、ＤＮＡ研究等でなされるような部分照合を含む問題に良く適合する。画像アーカイブ種々のタイプの画像を蓄積し処理する必要は、多くの分野で技術的発展を促進している。効率的な画像検索システムは、例えば病院で、新聞社で、或いは不動産業者に、急速に不可欠なものになっている。画像は、相当量の誘導及び付加特性を備えた、多次元対象を表す。データベース探索殆どのデータベースシステムは、階層構造及び厳密に定義された識別子に依存している。選択性の低い属性を有するファジーな照会は、既存のシステムに厳しい性能上の問題を課すが、かかる照会は、作動中に全ての弱い制約を組み合わせる本コプロセッサには理想的である。例えば、環境測定及び化学薬品のデータベースに関する具体的な研究は、単純化及び性能の向上に対する潜在的需要が本来的に存する、ことを示している。信号処理潜在的アプリケーションとしては、非線形フィルタリング、レーダターゲット相関、及び異常信号検出等がある。データネットワーク潜在的アプリケーションとしては、例えばイリーガルなアドレス範囲を報告するための、或いは情報をスナップするための、寄生監視関数がある。本コプロセッサを備えたパーソナルコンピュータ及びワークステーションの幾つかは、データ源として、データの全量を周期的に広める中央データポンプから生じたネットワークを使用することにより、分散処理システムに固有の全ての問題を除去することができる。ディスク制御装置本発明に係るコプロセッサは、理想的なディスク制御装置構成要素である。これは、単にデータを要求項目のみに制限することにより、バスを介してホストコンピュータにデータを転送する必要性を大幅に減少させる。コプロセッサの機能は、従来の内容アドレス指定を超えてより進んだ「データ特性」アドレス指定を行っている。Description: TECHNICAL FIELD The present invention relates to a non-numerical coprocessor for fuzzy information retrieval and pattern recognition using an electronic computer. BACKGROUND ART Conventional computer systems can be used to describe, store, recognize and retrieve information using complex programming and data organization techniques. However, by using such a known method, the system performance often deteriorates remarkably, especially in tasks such as complicated information item search and complex pattern recognition. Through information retrieval, text and database searching, and pattern matching, most computer users have faced well-known problems resulting from the inefficiencies of conventional approaches to non-numerical computation. 
For example, answering a query of the type "find all dates and locations at which cadmium was measured between 1980 and 1985 at sites where phosphorus concentrations above 15 μg/l were measured" requires reading a huge number of measurements, and even much simpler queries can cause severe performance problems in conventional systems. Inspired by the way humans retrieve large amounts of information and recognize complex items in it, the present invention seeks to close the gap between systems based on conventional methods and today's performance requirements. Accordingly, it is an object of the present invention to provide simple non-numerical programming together with the ability to scan very large data volumes substantially faster than conventional non-numerical computing systems and, at a lower speed, to search for specific complete sentences or for complex combinations of many word fragments. The invention further seeks to eliminate the need for extended segmentation, vectorization, and duplicate storage of data for searching complex information items, which has often been required with conventional techniques. In the present disclosure, a "byte" should be construed as a series of contiguous bits treated as a unit; the number of bits is not necessarily eight.

DISCLOSURE OF THE INVENTION

According to the present invention, the problems faced by the prior art are solved by a non-numeric coprocessor for fuzzy information retrieval and pattern recognition that has information processing means and is connectable to a host computer and a data source. The information processing means comprises a plurality of internal processing elements organized within a predetermined number of simultaneously operable window modules arranged to examine a data stream from the data source. Each processing element compares one byte of the data stream, for example an 8-bit byte, with predetermined, individually programmable upper and lower limit values assigned to that processing element, to determine whether the value of the byte present in the processing element lies within those limits. If it does, the processing element generates a hit signal, which is sent to window matching logic provided in each window module. This logic correlates the hit signals received from the different processing elements of the window module and generates a window match signal when a predetermined match occurs in the window module. Configuring the coprocessor in this way provides powerful parallel processing capability, which can be used to give coprocessor systems a performance far exceeding that of conventional systems for information retrieval and pattern recognition. The coprocessor according to the invention preferably further comprises data routing means which, in response to configuration data corresponding to the needs of the application, transfers separate data streams from the data source to the simultaneously operable window modules, either individually or with the window modules chained into different selectable window configurations, such as individual super windows, groups of super windows, or a single super window containing all window modules. Chaining windows into longer windows accommodates more complex search conditions, and input data can be routed to the different windows in several ways, depending on the window length and data streams required by the application currently running. In fact, any number of streams, each one byte wide, can be processed within the limit set by the number of available windows.
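The range comparison performed by each processing element, and the way a window combines the resulting hit signals, can be illustrated with a minimal Python sketch. It is a simplification under stated assumptions, not the circuit itself: the limits are taken to be inclusive, every byte is treated as an independent one-byte field, and a window match is assumed to require a hit from every element. The names `ProcessingElement`, `window_match`, and `DONT_CARE` are hypothetical.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class ProcessingElement:
    """One element: a latched byte is compared against two programmable limits."""
    lower: int  # programmable lower limit (0..255 for an 8-bit byte)
    upper: int  # programmable upper limit

    def hit(self, byte: int) -> bool:
        # Assumption: both limits are inclusive.
        return self.lower <= byte <= self.upper


# An element whose range covers every byte value matches anything.
DONT_CARE = ProcessingElement(0x00, 0xFF)


def window_match(elements: Sequence[ProcessingElement], data: bytes) -> bool:
    """Simplified matching logic: the window matches when every element hits."""
    if len(data) != len(elements):
        raise ValueError("data must fill the window exactly")
    return all(pe.hit(b) for pe, b in zip(elements, data))


# A four-element window: first byte in 'A'..'G', remaining bytes unconstrained.
window = [ProcessingElement(ord("A"), ord("G"))] + [DONT_CARE] * 3

print(window_match(window, b"Cole"))  # True: 'C' lies in 'A'..'G'
print(window_match(window, b"Hall"))  # False: 'H' lies outside 'A'..'G'
```

In the hardware all elements compare simultaneously as the data shifts past them; the loop here only models the outcome for one alignment of the data in the window.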
For example, a number of individual data inputs, each preferably eight bits wide, can be routed to different windows for parallel processing, and there is no longer any need to store duplicate copies of the data when the same data stream is to be processed by different windows. The coprocessor according to the present invention thus has a flexible, configurable data routing function and also supports use in applications whose data width is 64 bits or less. Preferably, the data routing means comprises a network of multiplexers organized at different levels, each multiplexer being able to select one of two data inputs, each preferably eight bits wide, and pass it to its output. In particular, in the preferred embodiment of the coprocessor, the multiplexer levels are a folding level, a parallel level, and a serial level. The coprocessor further comprises a static random access memory (RAM) for internal storage of the window configurations that can be loaded into the coprocessor. This removes the need to transfer configuration data to the coprocessor for every search operation (a transfer is needed only when the configuration changes), which is advantageous for developing software that includes individualized sets of downloadable configuration data. In one embodiment of the coprocessor, each processing element comprises a latch cell for temporary storage of the byte to be examined and two comparator cells holding the upper and lower limits for that processing element, the comparator cells being configured to generate the hit signal. Preferably, the coprocessor also contains result control logic configured to receive the window match signals and compare them with a programmable central hit mask. The central hit mask defines the logical combinations of window hits that count as a hit, and the coprocessor reports either the addresses of all detected occurrences (hit address mode) or the total number of matches in the examined data volume (hit count mode).
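A programmable hit mask of this kind can be pictured as a lookup table indexed by the eight window match signals. The Python sketch below assumes, as the functional description later in this document states, that the mask holds 256 one-bit entries, that window 0 supplies the least significant address bit, and that a stored 1 means "report a hit". The function names are hypothetical.

```python
from typing import Callable, List, Sequence


def match_address(window_matches: Sequence[bool]) -> int:
    """Pack eight window match signals into an address; window 0 is the LSB."""
    if len(window_matches) != 8:
        raise ValueError("expected one match signal per window (8 in total)")
    return sum(1 << i for i, matched in enumerate(window_matches) if matched)


def build_hit_mask(predicate: Callable[[int], bool]) -> List[int]:
    """Return 256 one-bit entries, set wherever the predicate accepts the address."""
    return [1 if predicate(address) else 0 for address in range(256)]


def is_hit(mask: Sequence[int], window_matches: Sequence[bool]) -> bool:
    """A hit is detected when the addressed mask location holds a 1."""
    return mask[match_address(window_matches)] == 1


# Report when any window matches: every location except address 0 is set.
any_window = build_hit_mask(lambda address: address != 0)

# Report only when at least four of the eight windows match.
at_least_four = build_hit_mask(lambda address: bin(address).count("1") >= 4)

print(is_hit(any_window, [False] * 8))                    # False
print(is_hit(at_least_four, [True] * 4 + [False] * 4))    # True
print(is_hit(at_least_four, [True] * 3 + [False] * 5))    # False
```

Because the table is indexed by the full match pattern, any Boolean function of the eight window matches can be expressed simply by choosing which of the 256 locations hold a 1.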
The ease of programming and controlling the coprocessor, and further advantages and features of the coprocessor, will be apparent to those skilled in the art from the following description.

BRIEF DESCRIPTION OF THE DRAWINGS

The present invention will now be described in detail with reference to the accompanying drawings, on the basis of an example of a preferred embodiment of a coprocessor according to the present invention.
FIG. 1 shows a typical application of a coprocessor according to the invention.
FIG. 2 is a block diagram of a coprocessor according to the present invention connected to a host computer and a data source.
FIG. 3 shows a single window within a coprocessor according to the present invention.
FIG. 4 shows a single processing element of a window in a coprocessor according to the present invention.
FIG. 5 shows the data routing network within a coprocessor according to the present invention.
FIGS. 6 to 15 show various preferred configurations of the data routing network within the coprocessor according to the present invention.
FIGS. 16 and 17 show in detail the flow of data through the routing network in the coprocessor for the two different configurations shown in FIGS. 6 and 10, respectively.
FIG. 18 shows a typical host/coprocessor configuration.
FIG. 19 shows the address map organization within the coprocessor.
FIG. 20 shows a portion of the window for a given example application.

DESCRIPTION OF THE PREFERRED EMBODIMENTS

As shown in FIG. 1, a coprocessor chip 1 according to the present invention is generally connected to a host computer 2 via a bidirectional data transfer link and to a data source 3 via a unidirectional data transfer link. The preferred embodiment of the coprocessor chip 1 comprises a series of eight window modules W0-W7, as shown in FIG. 2.
These window modules W0-W7 are logically located between the data router module 12 and the result control logic circuit 13, which are interconnected via an 8-bit data bus connected to the host interface module 14. The data source interface module 15 performs one-way 64-bit data transfer from the data source 3 to the data router module 12. Referring now to FIG. 3, each of the eight window modules W0-W7 has a window matching logic circuit 16 and a 32-byte shift register corresponding to 32 processing elements PE0-PE31. As shown in FIG. 4, each processing element PE is divided into a latch cell 17 and two associated comparator cells 18, 19 that check for a match against individually programmable upper and lower limits. As shown in detail in FIG. 5, the windows can be chained into longer windows via the data routing network in the data router module 12, allowing more complex data retrieval. The data router consists of three levels of multiplexers, each multiplexer selecting one of two 8-bit-wide inputs and passing it to its output. The first level, which is supplied with the 8-bit-wide data streams from the data source, consists of folding multiplexers (the upper multiplexer row in FIG. 5), that is, multiplexers that fold the data streams around in a circle. Each window can thereby be made independent of the position at which the actual data stream enters the coprocessor chip. The folding multiplexers are connected to parallel multiplexers (the middle multiplexer row in FIG. 5), which duplicate a data stream so that different windows read the same data stream simultaneously. Finally, the serial multiplexers (the lower multiplexer row in FIG. 5) select whether or not each window is to be chained into a super window. These three multiplexer levels allow inputs to be combined and windows to be chained. Some of the possible configurations are described below with reference to FIGS. 6 to 17.
The input data streams are numbered 0 to 7; for example, stream 0 corresponds to D(7,0) and stream 1 to D(15,8).
a) Eight data streams, each supplied to one window, with the windows arranged in parallel. This is the simplest routing strategy: each stream is sent to its corresponding window. This configuration is shown in FIG. 6, and FIG. 16 shows the actual configuration in detail in bold.
b) Four data streams, each supplied to two windows arranged in parallel. The data streams are combined in pairs, and each resulting stream is sent to two windows arranged in parallel. The streams used can be either the set 0, 2, 4, 6 or the set 1, 3, 5, 7 (for example, stream 1 as the alternative to stream 0). This configuration is shown in FIG. 7.
c) Four data streams, each supplied to two windows arranged in series. This configuration is the same as b), except that each pair of windows is chained to form one super window of double length. This configuration is shown in FIG. 8.
d) Two data streams, each supplied to four windows arranged in parallel. The streams are combined in groups of four, and each resulting stream is sent to one of two sets of four windows arranged in parallel. The stream pair used can be (0, 4), (1, 5), (2, 6) or (3, 7); for example, streams 1, 2 and 3 are alternatives to stream 0, and streams 5, 6 and 7 are alternatives to stream 4. This configuration is shown in FIG. 9.
e) Two data streams, each supplied to two parallel groups of windows, each group consisting of two windows arranged in series. This is like d), except that the windows are chained, so there is less parallelism. This configuration is shown in FIG. 10, and FIG. 17 shows the actual configuration, with input streams 2 and 6, in detail in bold.
f) Two data streams, each supplied to four windows arranged in series. This configuration is the same as d), except that the windows of each group are chained to form one super window four times the length of an individual window. This configuration is shown in FIG. 11.
g) One data stream supplied to eight windows arranged in parallel. All data streams are combined into a single stream, which is sent to eight windows arranged in parallel. Any of the eight streams can be used as input, so the entire data storage area is treated as one long 8-bit-wide data file. This configuration is shown in FIG. 12.
h) One data stream supplied to four parallel groups of windows, each group consisting of two windows arranged in series. This is similar to g), but the windows are chained for more complex searches. This configuration is shown in FIG. 13.
i) One data stream supplied to two parallel groups of windows, each group consisting of four windows arranged in series. This is similar to g) and h), but more windows are chained to allow greater complexity. This configuration is shown in FIG. 14.
j) One data stream supplied to eight windows arranged in series. This is similar to g), but all windows are connected to form a single super window, allowing the most complex searches. This configuration is shown in FIG. 15.
Alternative filter and data path configurations are selected by the configuration data loaded into the chip. In every configuration, the data routing allows 10 giga (roughly ten billion) single-byte comparisons per second. The number of simultaneous data paths can, however, be traded off against query complexity, which makes it easy to map a wide range of applications onto the chip. The coprocessor 11 is typically linked to a computer via the host interface 14, as shown in FIG. 18. The illustrated host computer comprises a disk unit 21 with its associated disk controller 22, a central processing unit (CPU) 23, a system memory 24, and a system bus 25. The host interface 14 is based on an 8-bit bidirectional port (the HD bus), and read/write cycles can be executed both synchronously and asynchronously. The host interface itself is controlled by asserting the IOR and IOW signals in combination with the CS signal, and by a specific polarity on the SETADR line (see FIG. 2).
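The trade-off between parallel data paths and window length in configurations a) to j), and the comparison rate quoted above, can be checked with a short Python calculation. This is a sketch under stated assumptions: the 20 MHz clock is the operating frequency given for the preferred chip later in the description, each stream is assumed to deliver one byte per clock, and the tuple layout of `CONFIGS` is purely illustrative.

```python
WINDOWS = 8
PES_PER_WINDOW = 32          # one processing element per shift-register byte
COMPARATORS_PER_PE = 2       # upper-limit and lower-limit comparators
CLOCK_HZ = 20_000_000        # operating frequency stated for the preferred chip

# (label, data streams, parallel windows per stream, windows chained in series)
CONFIGS = [
    ("a", 8, 1, 1), ("b", 4, 2, 1), ("c", 4, 1, 2), ("d", 2, 4, 1),
    ("e", 2, 2, 2), ("f", 2, 1, 4), ("g", 1, 8, 1), ("h", 1, 4, 2),
    ("i", 1, 2, 4), ("j", 1, 1, 8),
]


def comparisons_per_second() -> int:
    """Every comparator in every window works on each clock, in any configuration."""
    return WINDOWS * PES_PER_WINDOW * COMPARATORS_PER_PE * CLOCK_HZ


def describe(label: str, streams: int, parallel: int, serial: int) -> str:
    # Every configuration uses all eight windows.
    assert streams * parallel * serial == WINDOWS
    super_window_bytes = PES_PER_WINDOW * serial
    # Assumption: one byte per stream per clock cycle.
    input_mb_per_s = streams * CLOCK_HZ // 1_000_000
    return (f"{label}) {streams} stream(s) x {parallel} parallel x {serial} serial: "
            f"{super_window_bytes}-byte window, {input_mb_per_s} MB/s input")


for config in CONFIGS:
    print(describe(*config))
print(f"{comparisons_per_second():,} single-byte comparisons per second")
```

The product 8 × 32 × 2 × 20 MHz gives 10.24 × 10^9 comparisons per second regardless of configuration, while the input rate ranges from 160 MB/s with eight streams to 20 MB/s with one.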
The configuration data is stored in the configuration RAM and totals 828 bytes, which allows a complete reconfiguration within about 100 microseconds. In most systems, the configuration time is determined by the transfer rate from the host computer. When the input/output channel of a personal computer is used, and a transfer rate of 1 MB/s is assumed, reconfiguration generally takes about 1000 microseconds. Minor changes to the configuration data are made by addressing the relevant part through the coprocessor's internal address register, which shortens the configuration time further. Reconfiguration and searching are thus so fast that the same volume of data can be searched under different criteria. In count mode, the coprocessor accumulates on the chip the number of matching data items found. In report mode, the coprocessor issues an interrupt signal when it detects a hit. This signal remains asserted until the host computer issues an ACK signal or an IOW signal. The internal result position counter is held in a shadow register (not shown). The configuration data determines whether the chip should stop receiving data when a hit occurs. If the chip is configured to stop the data stream, the DWTD signal (see FIG. 2) remains inactive until the ACK signal is sent. If the chip is programmed to let the data stream continue despite a match, the counter value held in the shadow register is not overwritten until the ACK signal is sent. This is advantageous for text searching when hits occur frequently in the part of the text that contains the desired data. Likewise, when only the regular part of the text (chapter, section) in which a hit occurs is of interest, exact detection of the occurrence within that part is not needed. The 64-bit synchronous data interface 15 (see FIG. 2) is controlled by a simple handshake procedure.
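The 828-byte total and the reconfiguration times can be reproduced with a small calculation. The breakdown below is only one plausible accounting, not one given in the text: it assumes the configuration consists of the per-window registers listed later in the description (lower limits, upper limits, delimiter mask, match latency, record length), a 32-byte hit mask, a one-byte mode register, and three data router bytes. The function names and the 1 MB/s host rate are assumptions.

```python
WINDOWS = 8

# Per-window configuration registers, in bytes.
WINDOW_REGISTERS = {
    "lower limits": 32,
    "upper limits": 32,
    "field delimiter mask": 32,
    "match latency": 2,
    "record length": 1,
}

# Assumed shared configuration, in bytes.
SHARED_REGISTERS = {
    "hit mask RAM": 32,   # 256 bits
    "mode register": 1,   # assumption: a single byte
    "data router": 3,     # serial, parallel and folding multiplexer bytes
}


def total_config_bytes() -> int:
    """Sum the assumed configuration registers over all windows and shared modules."""
    per_window = sum(WINDOW_REGISTERS.values())
    return WINDOWS * per_window + sum(SHARED_REGISTERS.values())


def transfer_time_us(num_bytes: int, bytes_per_second: float) -> float:
    """Time to move num_bytes over a link of the given rate, in microseconds."""
    return num_bytes / bytes_per_second * 1_000_000


total = total_config_bytes()
print(total)                                      # 828
print(round(transfer_time_us(total, 1_000_000)))  # 828 (microseconds at 1 MB/s)
```

At 1 MB/s the transfer takes roughly 830 microseconds, in line with the figure of about 1000 microseconds quoted for a personal computer I/O channel; completing it in about 100 microseconds would need a host link of roughly 8 MB/s.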
When the coprocessor is ready to receive data, it asserts the DWTD signal one clock cycle before the data is actually read. This allows more comfortable timing margins when designing the interface. When the data source has data ready, it asserts the DVALID signal. If the DWTD signal was inactive at the last rising clock edge, the coprocessor does not read the data even though DVALID is asserted. The data source should therefore not regard a transfer as complete unless DWTD was asserted at a rising clock edge and the data and the DVALID signal are present at the next rising clock edge. Corresponding timing schemes can be implemented for both synchronous and asynchronous read/write functions. If the input stream contains data that is to be interpreted as a numeric value spanning more than one byte, the most significant byte must enter the window first.
Programming of the chip is done by writing configuration data to different addresses via the host interface. The different parts of the configuration are indirectly addressable, so that a part of the configuration can be changed in a very short time if necessary. It is not necessary to stop the data stream during reconfiguration, but false matches may occur owing to the transient state that exists while the configuration is only partially written. In addition, since the record counter is reset by writing to the corresponding window, problems may arise in the form of misaligned records. It is therefore advisable to stop the data stream before changing the configuration.
The internal configuration address in the coprocessor is the 11-bit address ADR(10,0), which is held in the internal address register. The eight high-order bits of this register can be loaded via the host interface. The three low-order bits are cleared when the upper bits are loaded.
Loading is done by placing the address bits on the HD bus and asserting the CS, IOW and SETADR signals simultaneously. The organization of the 11-bit address is shown in FIG. The coprocessor comprises twelve internal modules, each with its own address. The module address consists of the four upper bits of the address and is changed only by writing a new value from the HD bus. The module base addresses are shown in Table 1. The 7 low-order bits are held in a counter that is incremented on each access from the host interface, so that consecutive bytes within a module can be accessed easily. Auto-incrementing of the module address is not supported, because there are holes in the address map between modules. In most modules, offset addresses are used for more detailed addressing. To access a single byte, the absolute address is calculated as:
Address = (module address × 128) + offset address
The address register is then loaded with a value via the HD interface and accessed repeatedly so that it auto-increments to the required address. Read accesses are the simplest way to do this, because they require no knowledge of the previous configuration.
Each window module contains:
・ Lower limits: 32 bytes
・ Upper limits: 32 bytes
・ Field delimiter (separator) mask: 32 bytes (only the least significant bit of each byte is significant)
・ Match latency value: 2 bytes
・ Record length value: 1 byte
The offset addresses of these registers are shown in Table 2 below. For an actual search, the limit registers are loaded with the appropriate values. The field delimiter mask holds a 1 at the position of the least significant byte of each field. Note that the field delimiter mask uses only the least significant bit of each byte. That is, a 1 is written to the delimiter mask byte of the processing element PE that holds the least significant (last) byte of a field, and a 0 is written otherwise. The match latency is the number of clock cycles for which the window holds a hit.
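The absolute address calculation given above can be sketched in Python as follows. This is a minimal model under stated assumptions, not the host driver: the function names are hypothetical, module address 8 and offset 4 are illustrative values only, and the split into a load byte plus a number of auto-increment accesses is inferred from the statement that only the eight high-order bits of the register are loaded while the three low-order bits are cleared.

```python
MODULE_SIZE = 128  # 7 low-order address bits per module
LOW_BITS = 3       # low-order bits cleared when the register is loaded


def absolute_address(module_address: int, offset: int) -> int:
    """Return the 11-bit address: (module address * 128) + offset."""
    if not 0 <= module_address < 16:
        raise ValueError("module address must fit in 4 bits")
    if not 0 <= offset < MODULE_SIZE:
        raise ValueError("offset must fit in 7 bits")
    return module_address * MODULE_SIZE + offset


def register_load_plan(address: int) -> tuple[int, int]:
    """Split an address into (byte to load, accesses needed before the target).

    Assumption: the loaded byte supplies ADR bits 10..3, and every host
    access afterwards increments the low-order counter by one.
    """
    if not 0 <= address < 2048:
        raise ValueError("address must fit in 11 bits")
    return address >> LOW_BITS, address & ((1 << LOW_BITS) - 1)


address = absolute_address(8, 4)  # illustrative module address and offset
load_byte, extra_accesses = register_load_plan(address)
print(address, load_byte, extra_accesses)
```

Under these assumptions, an address whose three low-order bits are non-zero is reached by loading the nearest lower multiple of 8 and then performing the corresponding number of read accesses.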
Zero match latency means that the hit is reported from the window to the central match logic only during the clock cycle that caused the hit. A match latency of 4, for example, means that the match continues to be reported for 4 further cycles, that is, for a total of 5 cycles. The value written to the match latency register is 65535(10) minus the latency. For example, a match latency of 4 is specified by writing the value 65531(10) to the register. Routing takes the same amount of time for all data streams, so if the data streams enter the chip at the same time, they arrive simultaneously at the input of each window (or of the first window of a chain). Passing the stream through chained windows introduces a cumulative delay of 32 cycles, which should be taken into account when calculating the match latency. A clock cycle with no data transfer, that is, with the DVALID signal inactive, does not count towards the latency. The match latency is therefore measured in data transfers, not in physical clock cycles.
The record length is used to suppress hits caused by patterns that happen to match without being aligned to record boundaries. With a record length of 1, all hits are reported to the central match logic. With a record length of 6, for example, only matches that occur on every sixth data transfer are reported from the window. The byte counter of a window is reset by every write operation that uses the module address corresponding to that window. That is, the first match that can be reported occurs at the sixth data transfer after the window configuration has been written. The record length value is programmed into the corresponding register in the coprocessor as 256(10) minus the record length. A record length of 10(10) is therefore written as 246(10).
The 256-bit hit mask RAM is organized as 32 × 8 bits as seen from the host interface.
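The register encodings for match latency and record length described above can be illustrated with a short Python sketch. The function names are hypothetical, and the helper that lists reportable transfers assumes that reporting repeats at every multiple of the record length after a window write, which is how the sixth-transfer example is read here.

```python
def encode_match_latency(latency: int) -> int:
    """Value for the 2-byte match latency register: 65535 - latency."""
    if not 0 <= latency <= 65535:
        raise ValueError("latency must fit in 16 bits")
    return 65535 - latency


def encode_record_length(record_length: int) -> int:
    """Value for the 1-byte record length register: 256 - record length.

    A length of 256 would encode as 0; whether the hardware accepts that
    is not stated in the text, so it is rejected in this sketch.
    """
    if not 1 <= record_length <= 255:
        raise ValueError("record length must be between 1 and 255")
    return 256 - record_length


def reportable_transfers(record_length: int, transfers: int) -> list[int]:
    """1-based data transfer numbers, counted from the last window write,
    at which a match may be reported (assumed: every multiple of the length)."""
    return [n for n in range(1, transfers + 1) if n % record_length == 0]


print(encode_match_latency(4))      # the text gives 65531 for a latency of 4
print(encode_record_length(10))     # the text gives 246 for a length of 10
print(reportable_transfers(6, 13))
```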
The 32 bytes of the hit mask RAM are selected by the 5 low-order bits of the internal address register; bits 5 and 6 are "don't care" and bits 7 through 10 carry the module address 1000(2). Writing to byte address 0 affects bits 0 through 7 in the 256-bit addressing scheme, with the least significant bit of the byte corresponding to bit position 0. Similarly, writing to byte address 4 affects bits 32 through 39 of the 256-bit addressing scheme. The windows address the RAM as 256 × 1 bit. The match signals from the eight windows are used as the address, with the match signal from window 0 forming the least significant address bit. A hit is detected when a 1 is stored at the addressed location.
The module consisting of the mode register, result counter, hit pattern and version register has no internal offset addresses, but is organized as a shift register chain that is read serially. All writes to this module go to the mode register, the only writable register in the chain. The mode register determines how the coprocessor acts on hits. Only the 3 low-order bits are used; 0 should always be written to the other bits. Table 3 describes the mode bits. These bits give eight combinations of operations, all of which are described in Table 4. When this module is read, the values are presented in the following order:
1. Result counter, byte 3 (most significant byte)
2. Result counter, byte 2
3. Result counter, byte 1
4. Result counter, byte 0 (least significant byte)
5. Hit pattern
6. Mode
7. Version number
All registers except the mode register are read-only. Once a read sequence has started, not all values need to be read. Sending an ACK or IOW signal to the coprocessor allows the values of the result counter and hit pattern to be overwritten by a new hit, and causes the next read to start again at the most significant byte of the result counter (the IOW signal must be qualified by the CS signal). This also applies when the INT signal is not asserted.
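The read order of the serial register chain can be modelled on the host side as below. This is only a sketch: the type and function names are hypothetical, the seven bytes are assumed to have been read already in the stated order, and the mapping of hit pattern bit n to window n is an assumption borrowed from the hit mask addressing, since the text does not state the bit order of the hit pattern.

```python
from typing import NamedTuple


class ResultChain(NamedTuple):
    result_counter: int  # 32-bit counter, assembled most significant byte first
    hit_pattern: int     # match pattern of the eight windows at the last hit
    mode: int            # only the 3 low-order bits are used
    version: int


def parse_result_chain(raw: bytes) -> ResultChain:
    """Decode the 7 bytes read from the mode/result/hit-pattern/version module."""
    if len(raw) != 7:
        raise ValueError("expected exactly 7 bytes")
    counter = int.from_bytes(raw[0:4], "big")  # byte 3 arrives first
    return ResultChain(counter, raw[4], raw[5] & 0b111, raw[6])


def windows_in_pattern(hit_pattern: int) -> list[int]:
    """List the windows flagged in a hit pattern (assumed: bit n = window n)."""
    return [w for w in range(8) if (hit_pattern >> w) & 1]


sample = bytes([0x00, 0x01, 0x02, 0x03, 0x81, 0x03, 0x01])  # made-up values
chain = parse_result_chain(sample)
print(chain, windows_in_pattern(chain.hit_pattern))
```

Because a read sequence may be abandoned early, a host that only needs the counter could stop after the first four bytes; the sketch decodes the full chain for simplicity.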
The result counter counts the number of hits or data transfers that have occurred since the last write to the mode register. Writing any value to this module therefore clears the result counter. The counter is 32 bits long and has no overflow indicator. The hit pattern is the match pattern reported from the eight windows at the last hit. It is useful when the hit mask RAM has been programmed to generate hits for several match patterns: reading the pattern that actually caused the hit provides additional information that facilitates post-processing. The version register contains a number indicating the version of the coprocessor. It is intended for later revisions, so that software can adapt to the hardware actually present.
The data router configuration module is organized as a shift register three levels deep and one byte wide. These bytes control the multiplexers previously described with reference to FIGS. The bytes are read and written in the following order:
1) Serial multiplexers
2) Parallel multiplexers
3) Folding multiplexers
The read operation is destructive, so all multiplexer configuration data must be rewritten after being read. This is of little practical importance, since the multiplexer configuration is only read during system test. The most significant bit of each byte corresponds to the leftmost multiplexer in FIG. For the serial multiplexers, any combination of values is legal. For the other two multiplexer levels, configurations with 4 or more consecutive set bits in a control byte are illegal (because of propagation delay). This also applies cyclically, across the ends of the byte, so the multiplexer configuration C3(16) is likewise illegal. The configuration bytes for each of the configurations a) to j) described above with reference to FIGS. 6 to 17 are guaranteed to work and are shown in Table 5 below.
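The legality rule for the multiplexer control bytes can be checked in software before a configuration is written. The sketch below is a hedged illustration: it assumes that "consecutive bits" means consecutive bits set to 1, which is consistent with C3(16) being illegal only when the byte is treated as cyclic, and the function names are hypothetical.

```python
def has_cyclic_run_of_ones(byte: int, run: int = 4) -> bool:
    """True if the 8-bit value has `run` or more consecutive 1 bits,
    treating the byte as cyclic (bit 7 is adjacent to bit 0)."""
    bits = [(byte >> i) & 1 for i in range(8)]
    extended = bits + bits[: run - 1]  # append the start to cover wrap-around
    count = 0
    for bit in extended:
        count = count + 1 if bit else 0
        if count >= run:
            return True
    return False


def is_legal_router_config(serial: int, parallel: int, folding: int) -> bool:
    """Serial byte: any value is legal. Parallel and folding bytes: no cyclic
    run of 4 or more set bits (assumed reading of the rule)."""
    for value in (serial, parallel, folding):
        if not 0 <= value <= 0xFF:
            raise ValueError("control bytes must fit in 8 bits")
    return not (has_cyclic_run_of_ones(parallel) or has_cyclic_run_of_ones(folding))


print(has_cyclic_run_of_ones(0xC3))              # run wraps around the byte ends
print(is_legal_router_config(0xFF, 0x77, 0x00))  # serial byte is unrestricted
```

The bytes listed in Table 5 remain the reference for working configurations; a check like this only guards against hand-written values that violate the stated rule.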
The coprocessor chip according to the present invention includes a data path module dedicated to manufacturing test, in which a read operation outputs data from window 7.
A preferred embodiment of a chip containing a non-numeric coprocessor according to the present invention is a highly parallel VLSI chip mounted on an expansion card for a personal computer or workstation. Preferably, the chip is manufactured in a CMOS (complementary MOS) process and housed in a 100-pin PQFP package, has TTL (transistor-transistor logic) and CMOS compatible inputs, and operates from a +5 V power supply. At an operating frequency of 20 MHz, such a coprocessor chip has a sustainable data processing capacity of 160 MB/s and performs 10 giga single-byte comparisons per second.
Programming Example
The following example shows the settings for searching for a person in a telephone directory. FIG. 20 shows a partial configuration of the window for this example. The window reports persons whose surname starts with one of the letters A to G and whose telephone number lies in the range 142000 to 160000. The telephone number is treated as a 6-byte numeric field by setting the appropriate bits in the field delimiter mask. Two other fields are also shown in the illustrated window. For fields No. 1 and No. 2, setting all field delimiter mask bits to 1 makes no difference, because these fields are not involved in any comparison spanning more than one byte. Bytes not included in the search criteria are set to a "don't care" state that matches all data patterns, using the maximum range with an upper limit of FF(16) and a lower limit of 00(16). Owing to the programmed 16-byte record length, no matches are reported for data that is not aligned to the record boundaries of the configuration. Furthermore, since only one window is used, no match latency is needed. If the above is the only search criterion, it may be copied to all windows, and the hit mask RAM is then set so that all positions except position 0 contain a 1.
This causes a match signal to be issued when at least one of the windows has a hit. The mode value must also be set appropriately, for example Report = 1, Stop = 1 and Blank = 0. The coprocessor then generates an interrupt signal for each match, and the position of each match relative to the start of the search can be read.
Functional Description
As shown in FIG. 3, the preferred embodiment of the coprocessor consists of eight data windows, each containing a 32-byte shift register. As shown in FIG. 4, each register element is associated with two comparators that check the upper and lower bounds. Each bound is individually programmable, so that data can be matched against any contiguous interval within the byte range. The two comparators report the match to the match logic connected to each window. For items larger than 1 byte, the matches of the individual bytes are combined. This allows data records of up to 256 bytes to be processed. A data field can be up to 8 bytes long for an arbitrary interval test and up to 256 bytes long for an equality test. Each window reports its individual hits to the central hit mask on the chip. The match signals of the eight windows are used as the address into the 256-bit user-programmable RAM. This RAM stores a 1 for every combination of window hits that should be detected as a hit. Because the RAM is programmable, the user can choose, for example, whether a hit should be reported when only one window has a hit, when four of the eight windows have a hit, or only when all windows have a hit. In general, any logical combination of the 8 window hits can be defined by the user as a hit.
The chip reports either the addresses at which all detected hits occurred, or the total number of matches in the data volume. When set to report mode, the coprocessor copies an internal counter containing the position of the detected data into the shadow register on a hit. This stored value may be read later by the host computer.
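The way a logical combination of window hits maps onto the 256-bit hit mask RAM can be sketched in Python. The byte layout follows the description of the RAM (byte address 0 holds bit positions 0 to 7, least significant bit first, with window 0 as the least significant address bit); the function names are hypothetical, and "four of the eight windows" is read here as four or more.

```python
from typing import Callable, Sequence


def build_hit_mask(is_hit: Callable[[Sequence[bool]], bool]) -> bytes:
    """Build the 32 bytes of the hit mask RAM from a predicate over the
    eight window match signals (index 0 = window 0)."""
    ram = bytearray(32)
    for address in range(256):
        window_hits = [bool((address >> w) & 1) for w in range(8)]
        if is_hit(window_hits):
            ram[address // 8] |= 1 << (address % 8)
    return bytes(ram)


def lookup(ram: bytes, window_hits: Sequence[bool]) -> bool:
    """Model the RAM read: the window match signals form the bit address."""
    address = sum(1 << w for w, hit in enumerate(window_hits) if hit)
    return bool((ram[address // 8] >> (address % 8)) & 1)


any_window = build_hit_mask(lambda hits: any(hits))         # at least one window
four_or_more = build_hit_mask(lambda hits: sum(hits) >= 4)  # assumed reading
all_windows = build_hit_mask(lambda hits: all(hits))

print(any_window.hex())
print(lookup(four_or_more, [True] * 4 + [False] * 4))
```

The first mask reproduces the setting from the programming example, in which every position except position 0 contains a 1. Any other Boolean function of the eight window signals can be expressed by swapping the predicate.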
The shadow register is not overwritten until the host computer acknowledges the match by sending an ACK signal. Each window can be configured to hold hits for a programmable length of time. This allows a context-sensitive search to be performed when an exact match cannot be detected. This follows the principle of the coprocessor: instead of one (strict) search key, many weak conditions on the required data are used. This feature is especially important for searches in complex and/or large volumes of data.
Application Examples
The search speed that the coprocessor achieves on unstructured text may make the creation and maintenance of traditional indexes obsolete. By combining text fragments and wildcards with controlled relative distances, the relevant information can be uniquely identified. Synonyms may be used simultaneously in the search. Below are two different types of simple queries.
Q1: "In Shakespeare's works, how often and where does the sentence 'take it in what sence thou wilt' appear?"
Q2: "Find newspaper articles in which at least three of the five (partial) names 'Jelts', 'Mitter', 'Kohl', 'Major' and 'Bush' appear, but not 'Gorbat'."
Stylistic research of the Q1 type cannot be done well with conventional text search systems. It is also well known that indexes require more space than the text itself. The coprocessor according to the given example achieves a sustainable data rate of 160 MB/s for Q1-type queries. Complex queries of the Q2 type are processed at a sustainable data rate of 20 MB/s.
Pattern Matching
Various types of features can be extracted from images such as fingerprints, and many combinations of such features can identify an object. Searching among many candidates has much to gain from the robust and fuzzy mechanism of this coprocessor. In general, the capabilities of the coprocessor are well suited to problems involving partial matching, such as those encountered in DNA research.
Image Archiving
The need to store and process different types of images has driven technological development in many areas. Efficient image retrieval systems are rapidly becoming indispensable, for example in hospitals, newspaper companies and real estate agencies. An image represents a multidimensional object with a considerable number of derived and additional properties.
Database Search
Most database systems rely on hierarchical structures and well-defined identifiers. Fuzzy queries on attributes with low selectivity cause severe performance problems in existing systems, but such queries are ideal for this coprocessor, which combines all the weak constraints on the fly. For example, specific studies of environmental measurement and chemical databases have shown a considerable inherent potential for simplification and improved performance.
Signal Processing
Potential applications include nonlinear filtering, radar target correlation and the detection of abnormal signals.
Data Networks
Potential applications include parasitic monitoring functions, for example to report illegal address ranges or to capture information. A group of personal computers and workstations equipped with this coprocessor can eliminate all the problems inherent in distributed processing systems by using, as their data source, a network fed by a central data pump that periodically broadcasts the full data volume.
Disk Controller
The coprocessor according to the present invention is an ideal disk controller component. It greatly reduces the need to transfer data to the host computer via the bus, simply by restricting the data to the required items. The coprocessor thus provides a more advanced "data characteristic" addressing that goes beyond conventional content addressing.

Claims

1. A non-numeric coprocessor (1) for fuzzy information retrieval and pattern recognition, having information processing means and being connectable to a host computer (2) and a data source (3), characterized in that said information processing means comprises a plurality of internal processing elements (PE0, PE1, ...) organized in a predetermined number of simultaneously operable window modules (W0, W1, ...) arranged to examine the data stream from said data source (3), each processing element comparing one byte of the data stream, for example an 8-bit byte, with predetermined, individually programmable upper and lower limit values assigned to that processing element in order to determine whether the value of the byte present in the processing element lies within said limit values and, if so, generating a hit signal that is sent to window matching logic (16) provided in each window module (W0, W1, ...), where it is correlated with the hit signals received from the different processing elements (PE0, PE1, ...) in that window module, a window match signal being generated upon the occurrence of a predetermined match in the window module.
2. The coprocessor according to claim 1, further comprising data routing means (12) for transferring, in accordance with configuration data corresponding to the needs of the application, separate data streams from the data source (3) to the window modules (W0, W1, ...), either individually or in such a manner that the window modules are chained into different selectable window configurations, such as individual super windows, super window groups, or a single super window containing all window modules.
3. The coprocessor according to claim 2, characterized in that the data routing means (12) comprises a network of multiplexers organized at different levels, each multiplexer preferably being able to select one of two 8-bit data inputs and transfer it to its output.
4. The coprocessor according to claim 3, characterized in that the multiplexer levels are a folding, a parallel and a serial multiplexer level.
5. The coprocessor according to claim 2, further comprising a static random access memory (RAM) for internal storage of the window configuration loadable into the coprocessor.
6. The coprocessor according to claim 1, characterized in that each processing element (PE0, PE1, ...) comprises a latch cell (17) for temporarily storing the byte to be examined and two comparator cells (18, 19), the comparator cells being configured to generate the hit signal.
7. The coprocessor according to claim 1, further comprising result control logic (13) that includes a programmable central hit mask arranged to receive and compare the window match signals, the central hit mask supporting the definition of logical combinations of window matches, and that reports either the addresses of all detected occurrences (hit address mode) or the total number of matches in the examined data volume (count mode).
8. The coprocessor according to claim 1, characterized in that each window module (W0, W1, ...) is designed to store a record length value for the data records, a field separator mask that separates the fields present in a data record, and a match latency value that can be set individually for each window.
9. The coprocessor according to claim 1, characterized in that the number of window modules (W0, W1, ...) is 8, each being designed to handle 8-bit byte inputs, and each window module comprises a shift register of a length corresponding to the data stream supplied to the module and consists of 32 processing elements (PE0, PE1, ...).
10. The coprocessor according to any one of the preceding claims, further comprising host interface means (14), preferably an 8-bit interface with an interrupt function, designed to allow the coprocessor to be used with microprocessors, and data source interface means (15), preferably a 64-bit interface, allowing the coprocessor to be connected to a data source, RAM bank, disk array or network, and preferably programmable for 64-, 56-, 48-, 40-, 32-, 24-, 16- or 8-bit data transfers.